8. CommTroubleshooting 1
description
Transcript of 8. CommTroubleshooting 1
Slide 1© Max Zornada (2011)
Trouble Shooting and Problem SolvingTrouble Shooting and Problem Solving
Slide 2© Max Zornada (2011)
Murphy’s LawMurphy’s Law
If some thing can go wrong, it will go wrong!
During start-up there are lots of things that can go wrong, and they all do!
Hopefully not all the once.
Slide 5© Max Zornada (2011)
The Current SituationThe Current Situation An Ethylene Glycol Plant has been commissioned and was now An Ethylene Glycol Plant has been commissioned and was now
operating successfully and producing product.operating successfully and producing product. The plant had been started up and shutdown on several occasions as The plant had been started up and shutdown on several occasions as
part of its performance testing schedule.part of its performance testing schedule. The normal start up procedure consisted of placing a batch of glycerol The normal start up procedure consisted of placing a batch of glycerol
in the reactor. This was circulated through the heat exchanger where it in the reactor. This was circulated through the heat exchanger where it was heated.was heated.
When the glycerol temperature reached 115º C the addition of ethylene When the glycerol temperature reached 115º C the addition of ethylene oxide was commenced by starting pump J1.oxide was commenced by starting pump J1.
The reaction was exothermic, so once the batch reactor reached 115º C The reaction was exothermic, so once the batch reactor reached 115º C and the ethylene oxide feed commenced, the heat exchanger was used and the ethylene oxide feed commenced, the heat exchanger was used as a cooler to keep the temperature within the specified range and to as a cooler to keep the temperature within the specified range and to control the reaction dynamics.control the reaction dynamics.
Slide 6© Max Zornada (2011)
Safety InterlocksSafety Interlocks Addition of ethylene oxide could not commence Addition of ethylene oxide could not commence
unless the circulation pump (J2) was running;unless the circulation pump (J2) was running; The glycerol or batch temperature (read by the The glycerol or batch temperature (read by the
temperature indicator at TZA) was above 115 º C temperature indicator at TZA) was above 115 º C (otherwise the ethylene oxide would not react);(otherwise the ethylene oxide would not react);
The temperature (TZA) was below 125 º C The temperature (TZA) was below 125 º C (Otherwise the reaction would be too fast.(Otherwise the reaction would be too fast.
Slide 7© Max Zornada (2011)
The ProblemThe Problem The process had been started up and shutdown successfully on The process had been started up and shutdown successfully on
2 occasions as part of the commissioning process.2 occasions as part of the commissioning process. The operators on the commissioning team moved to start the The operators on the commissioning team moved to start the
plant up a third time as per the testing schedule. plant up a third time as per the testing schedule. As the ethylene oxide was added, the lead operator noted that As the ethylene oxide was added, the lead operator noted that
the pressure in the reactor (as measured by PR) was starting to the pressure in the reactor (as measured by PR) was starting to increase. This suggested that the ethylene oxide was not increase. This suggested that the ethylene oxide was not reacting.reacting.
QuestionQuestion What actions would you take next? What information would What actions would you take next? What information would
you want to collect/review before taking action?you want to collect/review before taking action?
Slide 8© Max Zornada (2011)
What the operator did nextWhat the operator did next
The operator decided that perhaps the temperature point was The operator decided that perhaps the temperature point was reading low, or perhaps a bit more heat was required to get the reading low, or perhaps a bit more heat was required to get the reaction rate up, so he adjusted the trip setting (TZA High) and reaction rate up, so he adjusted the trip setting (TZA High) and allowed the indicated temperature to rise to 200 º C.allowed the indicated temperature to rise to 200 º C.
Still the pressure did not fall.Still the pressure did not fall. Next he checked the valve at the bottom of the reactor and Next he checked the valve at the bottom of the reactor and
found it shut.found it shut.
QuestionQuestion What actions would you take next? What information would What actions would you take next? What information would
you want to collect/review before taking action?you want to collect/review before taking action?
Slide 9© Max Zornada (2011)
What happened!What happened!
The operator opened the valve! Three tonnes of unreacted ethylene oxide, together with
glycerol passed through the heater and catalyser and a violent uncontrolled reaction occurred. The reactor burst and the escaping gases exploded.
Question What measures would you take to ensure an incident like
this cannot happen again?
Slide 10© Max Zornada (2011)
Why did it happenWhy did it happen The explosion was due to an operator forgetting to The explosion was due to an operator forgetting to
open a valve;open a valve; Superficially, we could say Superficially, we could say human error!human error! However, it was one of those mistakes that even However, it was one of those mistakes that even
well trained, motivated and capable people make;well trained, motivated and capable people make; As is the case with almost all accidents and failures As is the case with almost all accidents and failures
attributed to human error, there is usually an attributed to human error, there is usually an underlying cause;underlying cause;
Opening the valve was probably an instinctive action.Opening the valve was probably an instinctive action.
Slide 11© Max Zornada (2011)
Further Analysis - 1Further Analysis - 1 Failure to assess situation and “jumping to conclusions”Failure to assess situation and “jumping to conclusions”
High pressure in reactor was an early warning sign;High pressure in reactor was an early warning sign; The operator “jumped to a conclusion” about what was causing this and The operator “jumped to a conclusion” about what was causing this and
therefore failed to look for addition data to diagnose the problem. Eg. Checking therefore failed to look for addition data to diagnose the problem. Eg. Checking other temperature points and flow reading.other temperature points and flow reading.
Explosion was due to the failure to directly measure the property we need to know;Explosion was due to the failure to directly measure the property we need to know; Temperature point was not measuring the temperature in the reactor, it was Temperature point was not measuring the temperature in the reactor, it was
measuring the temperature near the pump - these might be the same during measuring the temperature near the pump - these might be the same during normal operations but during start up they can be different (as this example normal operations but during start up they can be different (as this example shows).shows).
Slide 12© Max Zornada (2011)
Further Analysis - 2Further Analysis - 2
The temperature point near the pump got hot because The temperature point near the pump got hot because the pump was running with a closed suction valve;the pump was running with a closed suction valve;
Similarly, the tip initiator on J2 showed that its motor Similarly, the tip initiator on J2 showed that its motor was running, but it did not prove there was a flow;was running, but it did not prove there was a flow;
Operator should not have raised the trip setting;Operator should not have raised the trip setting; The operator thought he knew the answer - The operator thought he knew the answer -
unfortunately he was wrong!unfortunately he was wrong!
Slide 13© Max Zornada (2011)
What should we do?What should we do?
It is no use telling the operator to be more careful;It is no use telling the operator to be more careful; We need to recognise the possibility of a mistake We need to recognise the possibility of a mistake
like this one is inherent in the work situation;like this one is inherent in the work situation; How do we change the “work situation”How do we change the “work situation”
Change the design of the system;Change the design of the system; Change the operating procedures;Change the operating procedures; Change control system - hardware/software.Change control system - hardware/software.
Slide 14© Max Zornada (2011)
Some general principlesSome general principles
Measure the property we wish to know directly;Measure the property we wish to know directly; Measuring the temperature at the pump was not a measure Measuring the temperature at the pump was not a measure
of the temperature in the vessel;of the temperature in the vessel; Pump energised and running as an interlock, did not mean Pump energised and running as an interlock, did not mean
there was flow;there was flow; Operators should not be able to tamper with process standards Operators should not be able to tamper with process standards
- e.g. changing trip settings;- e.g. changing trip settings; A high pressure trip should have been installed on the reactor;A high pressure trip should have been installed on the reactor; Process should have been stopped once the close valves was Process should have been stopped once the close valves was
found and re-start from “scratch”.found and re-start from “scratch”.
Slide 15© Max Zornada (2011)
Trouble Shooting and Problem SolvingTrouble Shooting and Problem Solving
Slide 16© Max Zornada (2011)
What is Trouble ShootingWhat is Trouble Shooting
Application of Problem Solving methods to the Application of Problem Solving methods to the diagnosis and improvement after deviations occur diagnosis and improvement after deviations occur in a system.in a system.
We do this in our every-day lives all the timeWe do this in our every-day lives all the time
Slide 17© Max Zornada (2011)
Problem Solving ProcessProblem Solving ProcessIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Brainstorm possible causes
Implement
Check that it worked
Slide 18© Max Zornada (2011)
Identify the problemIdentify the problemIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Brainstorm possible causes
Implement
Check that it worked
Develop a problem statement:Develop a problem statement: What is the deviation - should vs What is the deviation - should vs
actual?actual? What is the symptom?What is the symptom?
Be specific in Be specific in what it iswhat it is and and what it is notwhat it is not;; What?What? Where?Where? When?When? How many?How many? How big?How big?
Slide 19© Max Zornada (2011)
Problems as deviationsProblems as deviations
What should be happening
What is happening
What is happening= What should be happeningNo Problem!
What is happening different to What should be happening= Problem!
Slide 20© Max Zornada (2011)
Three types of problemsThree types of problems
Ongoing an recentOngoing an recent
Has always been thereHas always been there
One-off eventOne-off event
Slide 21© Max Zornada (2011)
Brainstorm Possible CausesBrainstorm Possible CausesIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
What could be causing the What could be causing the problem or symptom?problem or symptom?
What is different?What is different? What has changed?What has changed? How could each change have How could each change have
caused the problem?caused the problem?
Slide 22© Max Zornada (2011)
Collect DataCollect DataIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
Collect data about the potential Collect data about the potential causes to see which can be causes to see which can be substantiated by facts and substantiated by facts and observations.observations.
Type text
Slide 23© Max Zornada (2011)
Identify most likely causesIdentify most likely causesIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes From an analysis of the data From an analysis of the data collected, what appears to be the collected, what appears to be the most likely cause?most likely cause?
What causes are not supported What causes are not supported by the data?by the data?
What data is not supported by a What data is not supported by a cause?cause?
Can you see a cause and effect Can you see a cause and effect relationship?relationship?
Is the cause actionable?Is the cause actionable?
Slide 24© Max Zornada (2011)
Brainstorm possible solutionsBrainstorm possible solutionsIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
What actions will remove or What actions will remove or address the cause?address the cause?
Slide 25© Max Zornada (2011)
Types of potential solutionsTypes of potential solutions
Eliminate cause;Eliminate cause; Neutralise the effect or impact of the cause;Neutralise the effect or impact of the cause; Remove the need for the thing that is giving us the Remove the need for the thing that is giving us the
problem.problem.
Slide 26© Max Zornada (2011)
Identify preferred solutionIdentify preferred solutionIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
Assess potential actions on Assess potential actions on basis of:basis of:
Ease of implementation;Ease of implementation; Cost?Cost? Degree to which they solve Degree to which they solve
the problem;the problem; Sustainability;Sustainability; Any other relevant criteria.Any other relevant criteria.
Select the preferred solution Select the preferred solution and implement;and implement;
Slide 27© Max Zornada (2011)
ImplementImplementIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
Select the preferred solution Select the preferred solution and implement;and implement;
Slide 28© Max Zornada (2011)
Check that it workedCheck that it workedIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
Check to see that it worked;Check to see that it worked; If not:If not:
Re-assess solutions;Re-assess solutions; Re-assess possible causes;Re-assess possible causes; Re-assess problem definition.Re-assess problem definition.
Slide 29© Max Zornada (2011)
Why solutions don’t workWhy solutions don’t work
We misunderstood the problem;We misunderstood the problem; We understood the problem but identified the We understood the problem but identified the
wrong cause;wrong cause; We understood the problems, identified the correct We understood the problems, identified the correct
cause, picked the wrong solution.cause, picked the wrong solution.
Slide 30© Max Zornada (2011)
Re-assessRe-assessIdentify the problem
Brain storm possible solutions
Identify most likely cause
Collect data
Identify preferred solution
Implement
Check that it worked
Brainstorm possible causes
Check to see that it worked;Check to see that it worked; If not:If not:
Re-assess solutions;Re-assess solutions; Re-assess possible causes;Re-assess possible causes; Re-assess problem definition.Re-assess problem definition.
Slide 31© Max Zornada (2011)
Major cause of problems requiring Major cause of problems requiring troubleshooting during commissioningtroubleshooting during commissioning
Instrumentation and Control
System
Something that has been forgotten
during preparation
Incorrect InstallationErrors with
Equipment
Slide 32© Max Zornada (2011)
Some examplesSome examplesInstrumentation and Control System Something that has been
forgotten during preparation
Incorrect InstallationErrors with Equipment Incorrectly specified equipment Incapable equipment Wrong equipment for the job Poor materials selection Underdesign – capacity/strength
Dirt and debris blocking flow lines, instrument lines and instruments
Valve left closed when should be open Blind left in place
Instrument(s) not working/broken Incorrect calibration Incorrect control logic Software bugs Not switched on
Equipment installed incorrectly; Equipment environment does not meet
specification – eg. Straight run lengths Valves, instruments etc installed back-
to-front
Slide 33© Max Zornada (2011)
Startup trouble shooting guidelines ...Startup trouble shooting guidelines ...
1.1. Compare data with those taken during safe-fluid and solvent Compare data with those taken during safe-fluid and solvent tests.tests.
2. Make small inexpensive changes in operating conditions and 2. Make small inexpensive changes in operating conditions and take more data.take more data.
3. Log more data, getting complete recordings of temperatures, 3. Log more data, getting complete recordings of temperatures, pressures, flows and conditions of utilities in the troubled pressures, flows and conditions of utilities in the troubled area.area.
4. Review design calculations.4. Review design calculations.
5.5. Inspect mechanical features of equipment.Inspect mechanical features of equipment.
6. Regularly calculate material balances, adding 6. Regularly calculate material balances, adding instrumentation and taking special samples.instrumentation and taking special samples.
Slide 34© Max Zornada (2011)
Startup trouble shooting guidelines ...Startup trouble shooting guidelines ...
7.7. Investigate the unsteady-state analysis and design of plant.Investigate the unsteady-state analysis and design of plant.
8.8. Make minor equipment changes and take more data.Make minor equipment changes and take more data.
9. Increase data recording.9. Increase data recording.
10. Provide additional automation to smoothen operations 10. Provide additional automation to smoothen operations and release operators from routine tasks.and release operators from routine tasks.
11. Furnish the operators with amenities that will make it 11. Furnish the operators with amenities that will make it easier for them to pay closer attention to operational easier for them to pay closer attention to operational details.details.
12. Automate or record, or both, the vent and purge streams.12. Automate or record, or both, the vent and purge streams.
Slide 35© Max Zornada (2011)
Trouble Shooting Instrumentation and Trouble Shooting Instrumentation and Control ProblemsControl Problems
Slide 36© Max Zornada (2011)
Case Study: Steam Boiler CommissioningCase Study: Steam Boiler Commissioning
As part of an expansion program of a Natural Gas Processing plant, As part of an expansion program of a Natural Gas Processing plant, and additional high pressure steam boiler is being added to the boiler and additional high pressure steam boiler is being added to the boiler house;house;
There are already 7 steam boilers in operation;There are already 7 steam boilers in operation; The new boiler is number 8 in the series and is of similar design to The new boiler is number 8 in the series and is of similar design to
the previous 7. A ninth boiler is currently under construction;the previous 7. A ninth boiler is currently under construction; The plant is located in a remote location, and the high pressure steam The plant is located in a remote location, and the high pressure steam
is used to generate power using steam turbine driven generators;is used to generate power using steam turbine driven generators; The low pressure steam from the steam turbine exhaust is used as The low pressure steam from the steam turbine exhaust is used as
process steam for heating in heat exchangers and in the regeneration process steam for heating in heat exchangers and in the regeneration towers that regenerate the gas sweetening solution.towers that regenerate the gas sweetening solution.
Slide 37© Max Zornada (2011)
Case Study: Steam Boiler CommissioningCase Study: Steam Boiler Commissioning
A general view of the boilerhouse area, showing some of the boilers.
A boiler unitNo. 8 Boiler
No. 9 BoilerUnder
construction
Slide 38© Max Zornada (2011)
Case Study: Steam Boiler CommissioningCase Study: Steam Boiler Commissioning
Main HP steam outlet to steam header
Burner
Exhaust Stack
Air inlet
Boiler
Front view from inside Boiler House Rear view from outside
Fuel Gas line (grey)
Air inlet fan built into air inlet stack base (not visible)
Air inlet to burner volute
Slide 39© Max Zornada (2011)
Case Study: Steam Boiler CommissioningCase Study: Steam Boiler Commissioning The boiler featured a local panel mounter display and The boiler featured a local panel mounter display and
controller that could be calibrated to display on a linear controller that could be calibrated to display on a linear or log linear scale. The linear calibration had been the or log linear scale. The linear calibration had been the norm.norm.
The boiler was rated at a normal operating capacity of The boiler was rated at a normal operating capacity of 41 tonnes of superheated steam per hour under 41 tonnes of superheated steam per hour under continuous operating conditions, with a maximum continuous operating conditions, with a maximum capacity of 45 tonnes per hour.capacity of 45 tonnes per hour.
During commissioning, the boiler was tested by During commissioning, the boiler was tested by ramping up to 45 tonnes to hour and holding and then ramping up to 45 tonnes to hour and holding and then backing off to 41 tonnes per hour.backing off to 41 tonnes per hour.
Slide 40© Max Zornada (2011)
Case Study: Steam Boiler CommissioningCase Study: Steam Boiler CommissioningProblemProblem
Air inlet stack
Fuel gas in
Fan
Air inlet to burner
Exhaust stack
CO2 Measurement of exhaust used to monitor combustion efficiency
Local panel mounted controller and display
Indicates 41 tonnes/hour
Main steam header
Main control room DCS displays
Boiler steam production show 41 tonnes/hour
Main steam header shows increase in steam of 37 tonnes/hour
Slide 41© Max Zornada (2011)
Additional informationAdditional information The controller allowed for the setting of the air-fuel ratio as part The controller allowed for the setting of the air-fuel ratio as part
of the calibration process – therefore, the proportion of gas-to-air of the calibration process – therefore, the proportion of gas-to-air going into the burner was known;going into the burner was known;
The CO2 levels in the exhaust indicated that had been set up The CO2 levels in the exhaust indicated that had been set up correctly;correctly;
There was no direct monitoring of gas and air in for individual There was no direct monitoring of gas and air in for individual boilers – only total fuel gas to boiler house;boilers – only total fuel gas to boiler house;
Fan curves existed that allowed for calculation of gas in, if air in Fan curves existed that allowed for calculation of gas in, if air in was known;was known;
There was no independent measurement of air in;There was no independent measurement of air in; Boiler curve existed showing the amount of steam produced for a Boiler curve existed showing the amount of steam produced for a
given amount of gas in at the correct gas/air ratio.given amount of gas in at the correct gas/air ratio.
Slide 42© Max Zornada (2011)
Group exerciseGroup exercise
What would you do next?What would you do next? What data would you collect and why?What data would you collect and why? What could be causing the problem?What could be causing the problem?
Slide 43© Max Zornada (2011)
Major causes of measurement errors and Major causes of measurement errors and failuresfailures
Sensing lines that are plugged or that contain liquid when they should Sensing lines that are plugged or that contain liquid when they should be dry;be dry;
Sensing elements with excessive coating, fouling or abrasion;Sensing elements with excessive coating, fouling or abrasion; Excessive bubbles or solids in process fluid;Excessive bubbles or solids in process fluid; Sensing elements with cracks, deformations or holes;Sensing elements with cracks, deformations or holes; Low Reynolds Number for process fluid;Low Reynolds Number for process fluid; Gaskets or O rings that leak;Gaskets or O rings that leak; Inappropriate materials of construction;Inappropriate materials of construction; Sensing, pneumatic or electronic components affected by process or Sensing, pneumatic or electronic components affected by process or
ambient temperature;ambient temperature; Moisture on the sensing element or signal connections;Moisture on the sensing element or signal connections; Electrical interference;Electrical interference;
Slide 44© Max Zornada (2011)
Major causes of measurement errors and Major causes of measurement errors and failuresfailures
Electrical interference;Electrical interference; High connection or wiring resistance;High connection or wiring resistance; Non-representative sensing point;Non-representative sensing point; Inadequate straight pipe runs for flow sensor;Inadequate straight pipe runs for flow sensor; Nozzle that are plugged or fouled;Nozzle that are plugged or fouled; Feedback linkages that shift or contain excessive play/untight Feedback linkages that shift or contain excessive play/untight
connections;connections; Incorrect calibrations.Incorrect calibrations.
Slide 45© Max Zornada (2011)
Causes of Control Valve Errors and Causes of Control Valve Errors and FailuresFailures
High trim-seating friction for sliding stem valves and high ball, plug or High trim-seating friction for sliding stem valves and high ball, plug or disc friction for rotary valves;disc friction for rotary valves;
High packing friction;High packing friction; Loose shaft connections on rotary valves;Loose shaft connections on rotary valves; Long shafts or linkages with loose connections on rotary valves;Long shafts or linkages with loose connections on rotary valves; Single stage positioners;Single stage positioners; Electrical interference;Electrical interference; High resistance in connections or wiring;High resistance in connections or wiring; Nozzles that are plugged or fouled;Nozzles that are plugged or fouled; Feedback linkages that shift or have untight connections;Feedback linkages that shift or have untight connections; Piston actuators with high sliding friction, wide or worm gear teeth, or Piston actuators with high sliding friction, wide or worm gear teeth, or
yoke slots with play.yoke slots with play.
Slide 46© Max Zornada (2011)
Causes of Control Valve Errors and Causes of Control Valve Errors and FailuresFailures
Flashing and cavitation;Flashing and cavitation; Variations in pressure, temperature or composition of process fluid;Variations in pressure, temperature or composition of process fluid; Low Reynolds Number for process fluid;Low Reynolds Number for process fluid; Leaking packing;Leaking packing; Pneumatic or electronic components affected by process or ambient Pneumatic or electronic components affected by process or ambient
temperature;temperature; Incorrect calibrations.Incorrect calibrations.