Risk Analysis Introduction and Overview · 2013-07-10 · 5 Terminology and Background •...

Post on 21-Apr-2020

5 views 0 download

Transcript of Risk Analysis Introduction and Overview · 2013-07-10 · 5 Terminology and Background •...

1

Risk Analysis Introduction and Overview

Thomas A. Mazzuchi Professor and Chairman

Department of Engineering Management and Systems Engineering

George Washington University

2

Terminology and Background

•  Risk - A measure of potential loss due to natural or

human activities - A combination of the probability or frequency of

the hazard and its consequence; e.g.,

•  Loss - Adverse consequences of such activities that

affect  Human life or health  Economics or property  The natural environment  Information , etc

3

Terminology and Background

•  Engineering Systems Losses Can Be -  Internal to the system; i.e,

 Damage to one of the system’s components - External to the system; i.e.,

 Damage to a component of the external environment in which the system must function; e.g.,  Humans  Organizations  Economic assets  Environmental assets

4

Terminology and Background

•  Risk Analysis -  Is the process of characterizing, managing, and

informing others about the existence, nature, magnitude, prevalence, contributing factors, and uncertainties that pertain to the potential losses - Other names for risk analysis

 Probabilistic Risk Analysis (PRA)  Quantitative Risk Analysis (QRA)  Probabilistic Safety Analysis (PSA)

5

Terminology and Background

•  Importance of Risk Analysis - While formal methods for risk analysis have

been shaped by modern demands, the concept of risk analysis is not new; it is even ancient - People are living longer, healthier, more

prosperous lives and have more to loose - Today people expect greater protection than

before from industry and government, and they react with litigation when they feel let down

6

Terminology and Background

•  Importance of Risk Analysis - Even as public concerns about risk exert

pressure on policy makers for regulations, engineering systems are increasing in complexity and autonomy  Simply making regulations without studying

their effects can be costly and suboptimal—even dangerous

- A proper risk analysis will adequately model the system, demonstrate the effect of mitigating measures, and communicate these to the public

7

Elements of Risk Analysis

Risk Assessment

Risk Management

Risk Communication

National Research Council (1994)

8

Elements of Risk Analysis

•  Risk Assessment - The process by which the probability or frequency of

loss by or to an engineering system is assessed, and the magnitude of the loss (consequences) estimated

•  Risk Management - The process by which the potential (probability or

frequency) for loss and/or the magnitude of loss is minimized and controlled

•  Risk Communication - The process by which information about the nature

and consequences of risk, as well as the risk assessment approach and the risk management options, are shared and discussed among decision makers and other stakeholders

9

Risk Assessment

•  Definition of Risk (Kaplan & Garrick, 1981) -  Risk addresses three basic questions:

  What can go wrong?   How likely is it to happen?   What are the losses (or consequences)?

-  A combination of hazard and likelihood -  A triple <Si,Pi,Ci>

  Si a specific scenario of a hazard   Pi probability of si (or frequency)   Ci consequence of si

10

Risk Assessment

•  Modifications - Si may occur with a given

probability Pi or frequency fi -  Its occurrence may be static

or dynamic over time - Pi and Ci may be uncertain

and have probability distributions - These distributions may be a

function of time or Si or a combination of the two - These quantities may be

jointly distributed

11

Quantitative Risk Assessment

•  Overview

12

Important Risk Journals

•  Health, Risk and Society •  Journal of Risk and Insurance •  Journal of Risk and Uncertainty •  Journal of Risk: Health, Safety and Environment •  Journal of Risk Research •  Journal of Safety Research •  Journal of System Safety •  Risk Analysis, An International Journal •  Risk, Decision, and Policy •  Risk Management and Insurance Review •  Risk Management: An International Journal •  Safety Science •  The Journal of Risk •  Reliability Engineering and System Safety •  International Journal of Reliability, Quality, and Safety Eng

13

Societies of Interest •  American Society of Mechanical Engineers •  Safety Engineering and Risk Analysis Division •  American Society of Safety Engineers •  American Statistical Association, Section on Risk Analysi •  IEEE Reliability Society •  International Association for Probabilistic Safety

Assessment and Management. •  Risk Assessment and Policy Association •  Risk Theory Society •  Society for Maintenance Reliability Professionals •  Society for Reliability Engineers •  Society for Risk Analysis •  System Safety Society •  The Safety and Reliability Society

• 

14

Qualitative Risk Assessment: Risk Matrices

15

Introduction

•  Risk Matrix – a table that has several categories of probability,

likelihood or frequency on its rows (or columns) and several categories of severity, impact, or consequence on its columns (or rows)

–  It associates a recommended level of risk, urgency, priority, or management action associated with each column-row pair (i,e, cell)

16

Introduction

Federal Highway Administration, 2006

Federal Aviation Administration, 2007

17

Introduction

Qualitative Risk Assessment •  NASA Risk Management Reporting

Qualitative Risk Assessment •  NASA Risk Management Reporting

20 Problems with Risk Matrices and Matrix Design Cox (2008)

•  If Risk = probability * consequence

Consequence

Probability

Probability

Consequence

Risk

p*c=constant

21 Subjective Interpretations and Input Bias Smith et al (2009)

Consequence Li

kelih

ood

0 PROBABILITY 1

1 Objective Subjective

Utility Va

lue

Objective Subjective

Utility

Value

SRP

22 Extension of Cox for Opt. 5x5 Matrix Design Hong and Mazzuchi (2013)

23 Uncertainty Distribution for Portfolios of Risks Mazzuchi and Scolese (2014)

p

c

Quantitative Risk Analysis Scenario Analysis

25

Fault Trees

•  The Basics of Fault Trees - A fault tree develops a deterministic description of

the occurrence of the top event, in terms of the occurrence or not of intermediate events  Top events represent system-level failure

- Describes intermediate events further until, at a finer level of detail, basic events are obtained  Basic events represent component-level failure

- By itself, a fault tree is only a visual model of how a system failure can occur

26

Fault Trees

1.  Identify undesirable TOP event

2.  Identify first contributors

3.  Link contributors to TOP event by logic gates

4.  Identify second level contributors

5.  Link second level contributors to TOP event by logic gate

27

Fault Tree Construction

•  Symbols Event Symbols Transfer Symbols

Basic Event

Undeveloped Event Transfer In

External Event Transfer Out

Intermediate Event

28

Fault Tree Construction

•  Symbols -  Gate Symbols

+

+ +

and gate: Output occurs if all input events occur or gate: Output occurs if any input event occurs exclusive or gate: Output occurs if exactly one input event occurs

priority and gate: Output occurs if all input events occur in a specific sequence inhibit gate: Output occurs if the single input occurs in the presence of an enabling condition not or gate: Output occurs if at least one input event does not occur not and gate: Output occurs if all input events do not occur

29

Fault Tree Example 2 Example with Success Event

Isolation Valve VAL Permanent Ignition

Source I2

Pressure Relief Valve PRV

Possible Ignition Source

I1

Leak

Gas flowing through pipe, there is a leak after the isolation valve this valve should close but then the pressure relief vale must open to relieve local pressure

30

Fault Tree Example 2 Example with Success Event

Explosion After Gas Leak Posterior to Isolation Valve

+

Explosion Prior to Isolation

Explosion Posterior to Isolation Valve

VAL Performs Correctly

PRV Fails

VAL Fails

I1 Present

31

C

C

Sensing & Control System

AC Power Source

T1

V1

V2

V3

V4

V5

P1

P2

Fault Tree Example 3 Large Example

Pumping System Example

32

No Water Delivered When Needed

+ No Water Delivered from V1

V3 Fails to Remain Open

V5 Fails to Remain Open

No Water

from P2

No Water Delivered from P1 Branch

+ No Water Delivered from V1

V4 Fails to Remain Open

V2 Fails to Remain Open

a +

P2 Fails to Function

AC Fails

a

+

AC Fails

S Fails

+

V1 Fails to Remain Open

T1 Ruptures

S Fails to Send Signal

P1 Fails to

Function

AC Fails

+

b

b

No Water Delivered from P2 Branch

No Water

from P1

S Fails to Send Signal

Fault Tree Example 3

33

No Water Delivered When Needed

+ No Water Delivered from V1

V3 Fails to Remain Open

V5 Fails to Remain Open

Pumping Branches Fail

V4 Fails to Remain Open

V2 Fails to Remain Open

+

P1 Fails to Function

AC Fails

S Fails

V1 Fails to Remain Open

T1 Ruptures

P2 Fails to

Function

+

Fault Tree Example 3

+

P2 Branch Fails

P1 Branch Fails

Alternative Construction

34

A

B

C

D

E

1

F

2

3

4

5

6

7

Fault Tree Example 4 Block Diagram Example

•  Circuit Block Diagram Example

35

No Current at Point F

+

Unit 7 Fails

No Current at D & E

No Current at Point E

+

No Current at Point C

No Current at Pnt A

+

Unit 2 Fails

Units 5 & 6 Fail

Unit 6 Fails

Unit 5 Fails

No Current at Point D

+

No Current at Point B

No Current at Pnt A

+

Unit 1 Fails

Units 3 & 4 Fail

Unit 4 Fails

Unit 3 Fails

Fault Tree Example 4

36

Event Tree Method

•  The Event Tree Method is the primary technique used in PRA to generate risk scenarios

•  This method can be used when … - … Successful operation of a system depends on the

approximately chronological and discrete operation of its units - … Previous event tree model scenarios of

successive events have led to exposure to hazards, and ultimately to undesirable consequences

37

Event Tree Method Example

Initiating Event A B C D E

Sequence Logic

System Results

Success ↑ Failure ↓

A B C E S

A B C E F

A B C D E S

A B C D E F

A B C D F

A B F Let A denote that subsystem A fails and A denote that it does not fail

Depends on sequence of events

Mutually Exclusive Events

38

Event Tree Method

•  Building an event tree - Build from left to right - Start the sequence at the initiating event - Place protective barriers as the successive (binary)

events - Calculate branching probabilities (called split

fractions) from fault trees - Calculate the probability of the end mutually

exclusive events as the multiple of the path split fractions

39

Event Tree Method Example 1

PUMP KLAXON

S

P K

A subgrade compartment containing important control equipment is protected from flooding using the above system. If the water rises it should close the float switch which operates a pump with separate power supply, A klaxon should also sound and alert operators to perform bailing.

B

40

Event Tree Method Example 1

Water Rises

I

Float Switch

S Pump

P Bailing

B System Results Klaxon

K System Logic

ISP S

ISPKB S

ISPKB F ISPK F

IS F

41

Event Tree Method Example 2

Attempted Illegal

Access by Hacker

I

Principal Firewall

F

Abnormal Signal Detected by

Operator O

System Results

Backup Firewall

Initiated by Operator

B

System Logic

IF S

IFOB S

IFOB F IFO F

42

Event Tree Method

Split fractions are calculated using fault trees

Quantifying Scenario Analysis

44

Quantifying Fault Trees and Event Trees

•  How Do You Quantify Fault Trees and Event Trees - A fault tree or an event tree by itself is only a visual

model of a system -  It can be a representation of Boolean logic, i.e. a

representation of the functioning (or not) of the system as a function of its components - Because the basic events are 0-1 (fail-no fail), we

can use Boolean algebra to reduce the system expression to the lowest terms -  In doing so we make the following assumptions

 All events are binary  The system is coherent

 I.e., failure of any component cannot improve the system

45

Boolean Reduction: Boolean Algebra

Notation Boolean Operator Set Theory t   X and Y X•Y X∩Y   X or Y X+Y = 1-(1-X)(1-Y) X∪Y   Not X X’ Xc

•  Important Laws   Commutative X•Y = Y•X X+Y = Y+X   Associative X•(Y•Z) = (X•Y)•Z X+(Y+Z)=(X+Y)+Z   Distributive X•(Y+Z) = X•Y+X•Z   Idempotent X•X = X X+X = X   Absorption X+X•Y = X   Complementation X+X’ = Ω (X’)’ = X   De Morgan’s (X•Y)’ = X’+Y’ (X+Y)’ = X’•Y’   Empty/Universal Set ∅’ = Ω

46

T = E1•E2 = (A+E3) • (C+E4) = A•C + A•E4 + C•E3 + E3•E4 = A•C + A•(A•B) + C•(B+C) +

+ (B+C)•(A•B) = A•C + A•A•B + C•B + C•C +

+ B•A•B + C•A•B = A•C + A•B + B•C + C + A•B +

+ A•B•C = A•C + A•B + B•C +C + A•B•C = A•C + A•B + C + A•B•C = A•B + C + A•B•C = A•B + C

Reducing a Fault Tree Using Boolean Algebra

This is the reduced tree and reduced Boolean expression for the tree called Min Cut Set Representation

47

Representing Systems in Terms of Their Components

Using the convention that

X•Y=X*Y and X+Y=1-(1-X)*(1-Y)

we may determine the state of the top event in terms of the component states.

From previous page T=A•B + C = 1-(1-A*B)*(1-C)

For example if A occurs and C occurs but B does not

T=1-(1-1*0)(1-1) = 1 (Then the top event occurs)

48

Representing Systems in Terms of Their Components

•  Truth tables - Generate all possible component states and the

probabilities associated with each. - For m components, each can either function or not

(i.e. 2 states for each component) thus there are 2m possible states taking in to account all components. - Evaluate the system using the Boolean formula for

each state

49

Representing Systems in Terms of Their Components

-  Generation of All Possible States

1st Col 2nd Col 3rd Col nth Col 0 1 0 1 0 1 0 1 : :

0 0 1 1 0 0 1 1 : :

0 0 0 0 1 1 1 1 : :

0 : :

0 1 : :

1 : :

20=1 20=1 21=2

21=2 22=4

22=4

2n-1

2n-1

…..

…..

…..

50

Representing Systems in Terms of Their Components

•  Truth tables

Note that if all elements of {A,B} occur or all elements of {C} occur then the top event occurs

These are called Cut Sets

T=A•B + C = 1-(1-A*B)*(1-C)

=1-(1-0*0)(1-0) =1-(1-1*0)(1-0) =1-(1-0*1)(1-0) =1-(1-1*1)(1-0) =1-(1-0*0)(1-1) =1-(1-1*0)(1-1) =1-(1-0*1)(1-1) =1-(1-1*1)(1-1)

51

Representing Systems in Terms of Their Components

•  Truth Tables in Excel T=A•B + C = 1-(1-A*B)*(1-C)

52

Some Important Definitions

•  Cut Set - A collection of basic events such that, if the events

occur together, the top event certainly occurs •  Min Cut Set

- A cut set such that, if any basic event is removed, the remaining set is no longer a cut set

•  Path Set - A collection of basic events that connect input and

output  A path set merely represents a path through the

graph •  Min Path Set

- A path set such that, if any basic event is removed, the remaining set is no longer a path set

53

Min Cut Set Representation for Fault Trees

•  What is it? - After Boolean reduction, the Boolean formula for any

fault tree will be in Min Cut Set Representation T = X11• X12• … • X1n1

+ X21• X22• … • X2n2+

….+ Xm1• Xm2• … • Xmnm

where {Xi1, Xi2, … , Xini} is the ith cut set and

Xij=1 if ith item failed and 0 otherwise, Letting Ci = Xi1• Xi2• … • Xini

where Ci is the ith cut set indicator

Ci =1 if all elements of the ith cut set fail Then T = C1+C2+….+ Cm

54

Min Cut Set Representation for Fault Trees

•  Converting Min Cut Set Representation to a Calculable Formula

T = C1+C2+….+ Cm Then we can write T = 1 – (1-C1)*(1-C2)*…*(1-Cm) And since Ci = Xi1• Xi2• … • Xini

We can write

T = 1 – (1-C1)*(1-C2)*…*(1-Cm) = 1 – (1- Πj=1,n1

X1j)*(1-Πj=1,n2X2j)…*(1-Πj=1,nm

Xmj)

55

Example

Consider the following Fault Tree

D+E B•C

(D+E)•B B•C+A

[(D+E)•B]•[B•C+A]

56

Example

T = [(D + E) • B] • [(B • C) + A] T = (B•D + B•E) • [(B•C) + A] T = (B•D•B•C) + (B•E•B•C) + (B•D•A) + (B•E•A) T = B•C•D + B•C•E + A•B•D + A•B•E

The minimal cut sets of the top event are thus C1 = {B, C, D} C2 = {B, C, E} C3 = {A, B, D} C4 = {A, B, E}

57

Example

Thus if A = 1 if component A fails and 0 otherwise and this is true for B,C,D,E as well we can write

T = 1-(1- B*C*D)*(1- B*C*E)*(1- A*B*D)*(1- A*B*E)

And if T=1 we have system failure and T=0 indicates system is functioning

58

Example

59

X1

Determining Boolean Representation for Series-Parallel Systems

X2

X3

X4

X5

X6

X7

X8

X1

X2*X3 X4

X5

X6

X7

X8

60

Determining Boolean Representation for Series-Parallel Systems

X1

X2*X3 X4

X5

X6

X7

X8

X1

X2*X3 X4

X5

X6*X7*X8

X1

1-(1-X2*X3)*(1-X4)

X5

X6*X7*X8

61

Determining Boolean Representation for Series-Parallel Systems

X1

1-(1-X2*X3)*(1-X4)

X5

X6*X7*X8

X1 [1-(1-X2*X3)*(1-X4)]X5 X6*X7*X8

1-(1-X1)*(1-[1-(1-X2*X3)*(1-X4)]X5)*(1-X6*X7*X8)

62

2

34

5

System Indicator = 1 – (1-X1)(1-(1-(1-X2X3)(1-X4))X5)(1-X6X7X8) =1-(1-X1)(1-X2X3X5-X4X5+X2X3X4X5)(1-X6X7X8) =1-(1-X1)(1-X2X3X5)(1-X4X5)(1-X6X7X8) since for binary variables (X5)2= X5 Which is called min cut representation (no Xi

n terms)

1

6

7

8

Determining Boolean Representation for Series-Parallel Systems

63

2

34

5

What is min cut set representation? 1-(1-X1)(1-X2X3X5)(1-X4X5)(1-X6X7X8) Note that for the sets of components {1}, {2,3,5}, {4,5}, {6,7,8} if all of the items in the sets fail, then the system fails – a cut set Also not that we can not reduce any set by even a single element and have it still be a cut set – a min cut set

1

6

7

8

Determining Boolean Representation for Series-Parallel Systems

64

2

34

5

What is a min path? Note that for the sets of components {1,5,6}, {1,5,7} {1,5,8}, {1,2,4,6}, {1,2,4,7}, {1,2,4,8}, {1,3,4,6}, {1,3,4,7}, {1,3,4,8}, if all of the items in the sets function, then the system functions (a path from beginning to end – a path set Also not that we can not reduce any set by even a single element and have it still be a path set – a min path set

1

6

7

8

Determining Boolean Representation for Series-Parallel Systems

65

Boolean Representation for General Systems

Z=1-(1-X1X2)(1-X1X3X5)(1-X4X5)(1-X2X3X4)

1

2

3

4

5

Non series-parallel structures

Use cut set representation

66

Boolean Representation for General Systems

As structures get more complex this becomes difficult and we may have to resort to a Fault Tree

Determine the min path and min cut sets

in out

A

B

C

D

E

F

G

H

67 Boolean Representation for General Systems

No Flow to Out

H No Flow to H

No Flow From F No Flow From G

GNo Flow to G

No Flow From A No Flow From D

+

+

F No Flow to F

+

No Flow From C No Flow From E

+

A No Flow From “in”

+

D No Flow to D

+

B No Flow From “in”

+

C No Flow From “in”

+

E No Flow to E

+

B No Flow From “in” We will discount

this in our analysis

68 Boolean Representation for General Systems

[A•(B+D)+F] •[C•(B+E)+G]+H

H [A•(B+D)+F] •[C•(B+E)+G]

A•(B+D)+F C•(B+E)+G

GC•(B+E)

A B+D

+

+

F A•(B+D)

+

C B+E

A +

D B

B

C +

E B

B

69

Boolean Representation for General Systems

Failure = [A●(B+D)+F]●[C●(B+E)+G]+H = [A●B + A●D + F] ● [C●B + C●E + G]+H = A●B●B●C+ A●B●C●E + A●B●G + A●D●B●C+

A●D●C●E + A●D●G + F●B●C+ F●C●E + F●G +H = A●B●C+ A●B●C●E + A●B●G + A●B●C●D+ A●C●D●E + A●D●G + B●C●F+ C●E●F + F●G +H = A●B●C + A●B●G + A●C●D●E + A●D●G + B●C●F

+ C●E●F + F●G + H

Cut Set: {A,B,C}, {A,B,G}, {A,C,D,E}, {A,D,G}, {B,C,F}, {C,E,F},{F,G},{H}

Using our indicator notation

T=1-(1-A*B*C)*(1-A*B*G)*(1-A*C*D*E)*(1-A*D*G) *(1-B*C*F)*(1-C*E*F)*(1-F*G)*(1-H)

70 Quantifying Event Trees (Using DeMorgan’s Laws)

Assume split fractions are calculated using fault trees A=b+c•d B=c+e C=b•d

A+

G1 b

c d

B+

e c

C

b d

I A B C ABC Scenario 1

ABC Scenario 2 AB Scenario 3 A Scenario 4

71

Quantifying Event Trees

Scenario 4 I • A = I • (b+c•d) Scenario 3 I • A • B = I • (b•c+b•d) • (c+e) = I • (b•c•e + b•c•d + b•d•e) Scenario 2 I • A • B • C = I • (b•c+b•d) • (c+e) • (b•d) = I • (b•c+b•d) • (c•e) • (b•d) ={ } Scenario 1 I • A • B • C = I • (b•c+b•d) • (c•e) • (b•d) = I • (b•c+b•d) • (c•e) • (b+d) = I • b•c•e

72

Calculating the Probability of the Top Event

•  Three Methods •  Converting Cut Set Formulation to Probability

Statements •  Using Truth Tables •  Using Binary Decision Diagrams

73

Calculating the Probability of the Top Event - Method 1

•  Additive Law for Events A1,…, An

P(A1∪…∪An) = ∑i=1,n P(Ai) – ∑i<j P(Ai∩Aj) + ∑i<j<k P(Ai∩ Aj∩Ak) +… + (–1)n+1* P(A1∩…∩An)

You know P(A1∪A2) = P(A1) + P(A2) - P(A1 ∩A2)

The above general formula is called the Inclusion-Exclusion Principle (as terms are added you overestimate then underestimate)

for example ∑i=1,n P(Ai) – ∑i<j P(Ai∩Aj) ≤ P(A1∪…∪An) ≤ ∑i=1,n P(Ai)

74

Calculating the Probability of the Top Event - Method 1

If a fault tree has minimal cut sets C1, C2, …, Cm, then T = C1 + C2 + … + Cm P(T=1) = P({C1 =1}∪ {C2 =1}∪ … ∪ {Cm=1}) and we can calculate P(T=1) = ∑P(Ci =1) - ∑P({Ci =1}∩ {Cj=1}) + … + + (–1)m+1 ∑P({C1 =1}∩{C2 =1}∩ … ∩ {Cm=1}) P(Ci=1)=P({Xi1=1}∩…∩{Xini

=1}) and we can calculate bounds

∑P1(Ci ) – ∑P1(Ci ∩ Cj) < P1(T) < ∑P1(Ci) where we use the notation henceforth P1(C)=P({C=1}) and P1(Ci ∩ Cj) = P({Ci =1}∩ {Cj=1})

75

Calculating the Probability of the Top Event – Method 1

•  Rare Event Approximation P1(T) ≈ ∑P1(Ci) (conservative) - Based on the notion that the simultaneous

occurrence of several rare events is negligible - Problematic when there is a large degree of

overlap in cut sets - An additional simplifying assumption is the

independence of components

P1(Ci) = P({Xi1=1}∩…∩{Xini=1})

= P1(Xi1) • P1(Xi2) • … • P1(Xini)

assuming independence

76 Calculating the Probability of the Top Event – Method 1

P1(A) = P1(B) = P1(C) = 0.1 and A,B,C mutually indep. P1(T) = P1(C∪A∩B) ≈ P1(C) + P1(A∩B)

≈ P1(C) + P1(A)*P1(B) = 0.110 (Bound) P1(T) = P1(C) + P1(A∩B) – P1(A∩B∩C) = P1(C) + P1(A)*P1(B) – P1(A)*P1(B)*P1(C) = 0.109 (Exact)

Example 1

C1= {A,B}

C2 = {C}

77

Calculating Probability of Top Event: Truth Tables – Method 2

  0 denotes that component does not fail

  1 denotes that component fails

  P(T) = 0.081 + 0.009 + 0.009 + 0.009 + 0.001 = 0.109

  Note: Independence of components is assumed

A B C P(A)P(B)P(C) System

0 0 0 (.9)(.9)(.9) = 0.729 0 1 0 0 (.1)(.9)(.9) = 0.081 0 0 1 0 (.9)(.1)(.9) = 0.081 0 0 0 1 (.9)(.9)(.1) = 0.081 1 1 1 0 (.1)(.1)(.9) = 0.009 1 1 0 1 (.9)(.1)(.1) = 0.009 1 0 1 1 (.9)(.1)(.1) = 0.009 1 1 1 1 (.1)(.1)(.1) = 0.001 1

78

Calculating Probability of Top Event: Binary Decision Diagrams – Method 3

A

B B

C C C C

0 1

0 1 0 1

0 1 0 1 0 1 0 10 1 0 1 0 1 1 1

 Tree represents all possible component states   Bottom of tree represents the truth table value for the tree path.  There are techniques to reduce the tree.

79

Calculating Probability of Top Event: Binary Decision Diagrams – Method 3

A

B B

C C C C

0 1

0 1 0 1

0 1 0 1 0 1 0 10 1 0 1 0 1 1 1

The tree is basically a physical representation of the truth table

80

Calculating Probability of Top Event: BDD’s – Method 3

A

B B

C C C C

.9 .1

.9 .1 .9 .1

.9 .1 0 1 0 1 0 1 1 1

  Calculate probability of top event by replacing the states with their probabilities, and folding back the tree  For example, 0.19 = 0.1 * 0.9 + 1 * 0.1

.9 .1 .9 .1 .9 .1 .1 .1 .1 1

.19 .10

.109 =.9*.10 + .1*.19

.19 =.9 *.1 + .1 * 1 0.10= .9*.1 + .1*.1

81

Putting it All Together Example

Consider the event tree and fault trees below:

Determine a Boolean equation representing each event tree scenario in terms of fault tree basic events (C1, C2, C3).

I B A

82

Putting it All Together Example

a) If the frequency of the initiating event I is 10-3 per year, and P1(C1) = 0.001, P1(C2) = 0.008, and

P1 (C3) = 0.005, calculate the risk (injuries per year).

83

Example

  Solution a)  The Boolean equations representing each of the

event tree scenarios in terms of the fault tree basic events (C1, C2, C3) are:

Scenario 1:

84

Example

Scenario 2:

Scenario 3:

85

Example Solution

86

Example: Solution

That is the rate of I

≈ 7.95x10-6+6.00x10-6

87

Example Solution

Advanced Probability Analysis

89

Probability of System Failure: Law of Total Probability

•  Notation We use the event Ci (S) to denote that component i (the system) fails and Ci’ (S’) that it does not.

We also use the indicator Xi=1 (Z=1) to indicate that component i (the system) fails and Xi=0 (Z=0) to indicate that component i (the system) does not fail

Thus P(Ci)=Pr(Xi=1) and P(S)=Pr(Z=1)

1

2 3

Z=1-(1-X1*X2)*(1-X3)

90

Probability of System Failure: Law of Total Probability

•  Use probability laws P(S) = P([C1∩C2 ]∪C3) = P(C1∩C2) + P(C3) – P(C1∩C2∩C3)

•  Or condition on component states

P(S|C1∩C2∩C3) * P(C1∩C2∩C3) + P(S|C1´∩C2∩C3) * P(C1´∩C2∩C3) + P(S|C1∩C2´∩C3) * P(C1∩C2´∩C3) + P(S|C1∩C2∩C3´) * P(C1∩C2∩C3´) +

P(S|C1´∩C2´∩C3) * P(C1´∩C2´∩C3) + … … + P(S|C1´∩C2´∩C3´) * P(C1´∩C2´∩C3´)

1

2 3

Z=1-(1-X1*X2)*(1-X3)

This side will be 0 or 1

This side will be the probability of a component state

91

Probability of System Failure: Law of Total Probability

1

2 3

Z=1-(1-X1*X2)*(1-X3)

Assuming Independence

Using SUMPRODUCT function

P(C1∩C2´∩C3)

92

Advanced Probability Laws: Conditional Probability

•  Conditional Probability - P(A|B) = P(A ∩ B) / P(B) , if P(B) > 0 - Conditional probability redefines the sample

space

AB

New Sample Space

Elements of A in the New Sample Space

93

•  Conditioning on component 2 failure: P(S|C2) = P(S∩C2) / P(C2) = P({[C1∩C2 ]∪C3}∩C2} / P(C2) = P([C1∩C2 ]∪[C3∩C2 ]) / P(C2) = {P(C1∩C2) + P(C3∩C2) - P(C1∩C2∩C3)} / P(C2)

•  If components are independent: = {P(C1)P(C2) + P(C3)P(C2) - P(C1)P(C2)P(C3)} / P(C2) = P(C1) + P(C3) - P(C1)P(C3)

Probability of System Failure: Conditional Probability

1

2 3

If component 2 fails what is the probability of System Failure –Measure Component Importance

94

Probability of System Failure: Conditional Probability

1

2 3

Note: Independence NOT Assumed

If component i fails what is the probability of System Failure

95

Probability of Component Failure: Conditional Probability

1

2 3

Note: Independence NOT Assumed

If system fails, what is the probability of component i failure – Maintenance Implications

96

Probability of Cut Set Causing Failure: Conditional Probability

Z=1-(1-X1X2)(1-X1X3X5)(1-X4X5)(1-X2X3X4)

1

2 3

4

5

Non series-parallel structures

Cut Sets: {1,2}, {1,3,5}, {4,5}, {2,3,4}

Cut Set Representation

97

Calculating Complex Structure Functions and Probability of Failure

=Pr(CS12∩Z}/Pr{Z} = Pr(CS12}/Pr{Z}

98

Importance Measures

•  Motivation − A key challenge in a PRA is to identify the elements

in the system that contribute most to the risk − Method to accomplish this is Importance Ranking − The many importance measures used for this

process can be categorized as either  Absolute

 Defines each risk element in terms of an absolute risk metric, such as the conditional frequency of a hazard exposure given the state of the element; or

 Relative  Compares risk contribution of each element to

that of another

99

Importance Measures

•  Formulation − Risk is usually composed of a collection of

scenarios that occur with a certain frequency or probability

− A series of cut sets can represent these scenarios − Wall, et al. (2001), represent total risk by a linear

function of any single risk element:

R = aP + b

100

Importance Measures

R = aP + b where

R: total System Risk a: total contribution from cut sets that involve a

particular element P: total risk contribution from a particular element b: total contribution from cut sets that do not

involve a particular element

− Wall, et al.’s, method is only useful for investigating one-at-a-time sensitivity to risk elements

101

Principles of Importance Measures

•  IB = a , RP=1 – RP=0 •  IFV = aP/(aP+b) , (Rbase – RP=0)/Rbase •  IC = aP/(aP+b) , (Rbase – RP=0)/Rbase •  II = aP ,Rbase – RP=0 •  IRRW = aP , Rbase – RP=0 (differential method) •  IRRW = (aP+b)/b , Rbase/RP=0 (fraction method) •  IRAW = a(1-P) , RP=1 – Rbase (differential method) •  IRAW = (a+b)/(aP+b) , RP=1/Rbase (fraction method) •  DIM1 , (R/Pi)/(Σj=1,nR/Pj) • DIM 2 , aiPi/Σj=1,naiPi

102

Safety Systems: k-out-of-n Systems

Consider a system where the system will function if k-out-of-n of its components function or will fail is n-k+1 or more components fail

Usually these are of identical components, each with probability of failure p, then the probability of system failure is

1

2

3

2-out-of-3 System Min Cut Sets {1,2}, {1,3}, {2,3} Prob of Failures

Why?

103

Modelling Dependent Failures

•  What is dependent failure? - Let Ci be the event that component i fails and let

P(Ci) denote its probability  If we have n components and their failures are

independent, then P(C1∩C2∩ … ∩Cn) = P(C1)P(C2) … P(Cn)  If their failures are not independent, then this is

not a simple multiplication, we use the Multiplicative Law

P(C1∩C2∩ … ∩Cn) = P(C1) • P(C2│C1) * P(C3│C1∩C2) *…* P(Cn│C1∩C2 ∩ … ∩Cn-1)  The probabilities of n joint dependent events on

the left side are usually greater than the corresponding independent probabilities

104

Modelling Dependent Failures

•  Example

105

Modelling Dependent Failures

•  What are Common Cause Failures - CCFs are considered to be the collection of all

sources of dependency, especially between components, that are not known or are difficult to model explicitly. - CCFs have been shown by many studies to

contribute significantly to the overall unreliability of complex systems; - CCFs have no unique or universal definitions. - A fairly general definition is given by Mosleh as: A

CCF is a subset of dependent events in which two or more component fault states exist at the same time, or in a short time interval, and are direct results of a shared cause.

106

Modelling Dependent Failures

•  Modelling CCFs: Two Components - As CCFs have no explicit definition, their

probabilities are modelled as possible joint combinations of failures of components - Consider a system with two redundant components

A, B; then  P(A fails) = P(AI) + P(CAB)  AI denotes A fails separately  BI denotes B fails separately  CAB denotes A & B fail together by common

cause

107

Modelling Dependent Failures

•  Modelling CCFs: Three Components - Consider a system with three redundant

components A, B and C  The total failure probability of A can be

expressed in terms of its independent failure AI and its dependent failures as follows:  CAB, CAC denote that (A,B) & (A,C) fail

together by common cause  CABC denotes that (A,B,C) fail together by

common cause - Component A fails if any of the events above occur

P(A fails) = P(AI) + P(CAB) + P(CAC) + P(CABC)

108

Modelling Dependent Failures

•  Modelling CCFs: Min Cut Representation - The equivalent Boolean representation of total

failure of component A is AT = AI+CAB+CAC+CABC -  If the success criterion for the system is “2 out of 3

components A, B and C succeed,” then failure of the system can be represented by the following cut sets: -  {AI,BI}, {AI,CI}, {BI,CI}, {CAB}, {CAC}, {CBC}, {CABC} - Thus the Boolean representation of system failure

will be S = (AI•BI) + (AI•CI) + (BI•CI) + CAB + CAC + CBC + CABC (why not include(AI•BI•CI?)

109

Modelling Dependent Failures

•  Modelling CCFs; Probability Representation

-  If independence is assumed, only the first four terms of the Boolean expression are used, i.e., P(CAB) = P(CAC) = P(CBC) = P(CABC) = 0;

 Otherwise, applying the Rare Event Approximation results

P(System Failure) = P(any 2 or 3 components fail)

QS ≈ P(AI)P(BI) + P(CAB) + P(AI)P(CI) + P(CAC) + P(BI)P(CI) + P(CBC) + P(CABC)

110

Modelling Dependent Failures

- Assume that components A, B, and C are similar, and define

Qi = Probability of i simultaneous component failures due to common cause

- and write QS = P(System Failure)

= P(any 2 or 3 components fail) = P(AI)P(BI) + P(CAB) + P(AI)P(CI) + P(CAC) + P(BI)P(CI) + P(CBC) + P(CABC)

= 3(Q1)2 + 3(Q2) + (Q3)

111

Modelling Dependent Failures

-  In general for a k out of n system to fail there must be n-k+1 or more failures

Example

112

Modelling Dependent Failures

- Generally, models for common cause failure derive expressions for Qk for a system of size m, 1 ≤ k ≤ m in terms of total probability of component failure (Qt)

113

Probability Models for Time Dependent Analysis

114

Previous Lecture: A Snap Shot in Time

► Use probability laws P(S) = P([C1∩C2 ]∪C3) = P(C1∩C2) + P(C3) – P(C1∩C2∩C3)

► Or condition on component states

P(S|C1∩C2∩C3) * P(C1∩C2∩C3) + P(S|C1´∩C2∩C3) * P(C1´∩C2∩C3) + P(S|C1∩C2´∩C3) * P(C1∩C2´∩C3) + P(S|C1∩C2∩C3´) * P(C1∩C2∩C3´) +

P(S|C1´∩C2´∩C3) * P(C1´∩C2´∩C3) + … … + P(S|C1´∩C2´∩C3´) * P(C1´∩C2´∩C3´)

1

2 3

Z=1-(1-X1*X2)*(1-X3)

This side will be 0 or 1

This side will be the probability of a component state

115 Random Variables: Time Dependent Behavior

► Random variables are important for describing system behavior as a function of time: TS is system life length, Ti life length of component i P(TS ≤ t) = P({T1 ≤ t} ∪ {T2 ≤ t}) (series system) P (TS ≤ t) = P({T1 ≤ t} ∩ {T2 ≤ t}) (parallel system)

Note: {Ti ≤ t} defines our previous notation, Ci , for a fixed value t but as t varies the probability is a function of time

When T takes values in [0, ∞), it is called a lifetime variable (used in reliability and risk analysis)

116 Important Functions for Random Variables

► Probability Distribution f(x) = Pr{X=x} for X discrete (called

pmf) f(x)dx ≈ Pr{x<X<x+dx} for X continuous

(called pdf) ► Cumulative Distribution Function:

F(x) = P(X ≤ x) = ∑ i≤x f(i) for X discrete = ∫ 0

x f(u)du for X continuous ► Reliability (Survival) Function

R(x) = P(X>x) =1– F(x) [F(x) or S(x) is often used in place of R(x)]

117 Important Functions for Random Variables

118 Important Functions for Random Variables

► Failure Rate Function (Continuous rv Only)  h(x) = Lim dx→0P(X ≤ x+dx|X>x}/dx h(x)dx ≈ P(x<X ≤ x+dx|X>x}

Denotes instantaneous probability of failure

► Cumulative Failure Rate Function (Continuous RV Only) H(x) = ∫ 0

x h(u)du (continuous only) Denotes cumulative wear or exposure

119

Classic Failure Rate Curve

Note: i. life lengths said to follow a bathtub failure rate with three phases: infant mortality, chance failure and wear out ii. if h(x) is nondecreasing, constant,

nonincreasing we say that X is IFR, CFR, or DFR for Increasing, Constant or Decreasing Failure Rate

Failure Rate

time

Infant Mortality

Chance Failure

Wear Out Failure

120

Classic Failure Rate Curve

Note: i. In practice we often only use one phase of the curve

ii. There are example phenomena from each phase (DFR –software, CFR-electronics,

IFR-mechanical devices)

Failure Rate

time

Infant Mortality

Chance Failure

Wear Out Failure

121 Parametric Families of Distributions

► When a distribution f(x) can be indexed by a set of parameters, say Θ, whose specification completely determines the distribution we say

that f(x|Θ) is a parametric family. ► Important Properties

 Failure Rate Behavior  Distribution of Minimums (for series systems) TS = Min{T1, …, Tn}  Distribution of Sums (for cold backup or switching

systems) TS = T1 +…+ Tn

122

Which Parametric Family to Use?

► Look at the data histogram

123 Use of Parametric Families: System Reliability as a Function

of Time ► Component Life Lengths

T1~Wei(2,10), T2~Wei(1,5) assume independence

► System Life TS P(TS ≤ t) = P({T1 ≤ t} ∪ {T2 ≤ t}) (series system) = P({T1 ≤ t}) + P({T2 ≤ t}) - P({T1 ≤ t})P({T2 ≤ t}) = (1 - e–(t/10)2) + (1 - e–(t/5))

- (1 - e–(t/10)2) (1 - e–(t/5)) P(TS ≤ t) = P({T1 ≤ t} ∩ {T2 ≤ t}) (parallel system) = (1 - e–(t/10)2) (1 - e–(t/5))

124

Making Risk Time Dependent

Some times you are lucky and the system lifelength distribution has a closed form

1

….

n

1 … n

1

n

….

Series Parallel Cold Standby (perfect switch)

TS=min{T1,…,Tn} TS=max{T1,…,Tn} TS=T1+…+Tn

If Ti~ Exp(λi) then Ts~ Exp(∑i=1,nλι)

No Distribution for Ti leads to a known form distribution for TS

If Ti~ gamma(νi,α) then Ts~ gamma(∑i=1,nνi ,α)

If Ti~ normal(µi,σi2)

then s~normal(∑i=1,nµi,∑i=1,nσi2)

125

Analyzing Serries Systems

1 … n

TS=min{T1,…,Tn}

System Failure = 1 - Pr{TS>t}

= 1- Pr{ min{T1,…,Tn}>t}

= 1 - Pr{T1>t, …., Tn>t}

=1 - ∏i=1,n Pr{Ti>t}

if components are independent

=1 - ∏i=1,n [1-Fi(t)]

126

Analyzing Parallel Systems

TS=max{T1,…,Tn}

System Failure = Pr{TS ≤ t}

= Pr{ max{T1,…,Tn} ≤ t}

= Pr{T1 ≤ t, …., Tn ≤ t}

= ∏i=1,n Pr{Ti ≤ t}

if components are independent

= ∏i=1,n Fi(t)

1

….

n

127

Making Risk Time Dependent

► Use probability laws (Cuts Set Rep) P(TS<t) = P([{T1<t} ∩{T2 <t}] ∪{T3<t}) = P({T1<t}∩{T2<t}) + P({T3<t}) – P({T1<t} ∩ {T2<t} ∩ {T3<t})

► Or condition on component states P({T1<t} ∩ {T2<t} ∩ {T3>t}) + P({T1>t} ∩ {T2>t} ∩ {T3<t}) +P({T1<t} ∩ {T2>t} ∩ {T3<t}) + P({T1>t} ∩ {T2<t} ∩ {T3<t}) + P({T1<t} ∩ {T2<t} ∩ {T3<t}) Assuming independent components such that the CDF of

component i is Fi(t) = Pr{Ti≤t} yields

1

2 3

Z=1-(1-X1*X2)*(1-X3)

128

Making Risk Time Dependent

► Use probability laws (Cuts Set Rep) P(TS<t) = F1(t)F2(t) + F3(t) - F1(t)F2(t)F3(t)

► Or condition on component states P(TS<t) = F1(t)F2(t)R3(t) + R1(t)R2(t)F3(t) + F1(t)R2(t)F3(t)

+ R1(t)F2(t)F3(t) + F1(t)F2(t)F3(t)

where Ri(t) = 1 – Fi(t) = Pr{Ti > t}

1

2 3

Z=1-(1-X1*X2)*(1-X3)

129

Statistical Inference for Time Dependent Model Parameters

130 Making Models Reflect Reality: Classical Estimation

► What is an estimator?  Given an unknown parameter θ and a random

sample X1, ..., Xn from (X|θ), what are some estimators Θ for θ?

 They are functions of the random sample  Θ(X) = (1/n) ∑i=1,n Xi), ,  Θ(X) = max(X1, ..., Xn )  Θ(X) = 3, ......

 An estimator is a random variable with a probability distribution and an estimate is a realization of that

random variable.   What is a good estimator?

 Look at its pdf

131

Classical Estimation

► What is a good estimator?

Unbiassedness E[Θ] = θ Minimum Variance VAR(Θ) as small as possible

(there is a Cremer-Rao Lower Bound)

Consistency Θn→ θ as n→∞

132

Classical Estimation

► Main Parametric Estimators: Given a random sample X1, ..., Xn from f(X|θ), with unknown parameter(s), θ  Method of Moments (ok properties but easy to use) Θ(X) is obtained as the solution to 1. E[X|θ] = (1/n) ∑i=1,n Xi, (θ has dimension one)

2. E[X|θ] = (1/n) ∑i=1,n Xi, VAR[X|θ] = S2 (θ has dimension two) more equations for higher dimensions

Example Exponential E[X] = 1/λ ⇒λ = 1/x Gamma E[X] = ν/α , VAR[X]= ν/α2

⇒ ν=x2/S2, α=x/S2

_ ^

_ _

^ ^

~

133

Classical Estimation

► Main Parametric Estimators: Method of Least Squares Θ(X) is obtained as the solution to Min ∑i=1,n {F(X(i)|θ) - i/n}2 , X(i) is the ith smallest Xi value and F is a particular parametric family

0

Selected F(x|θ) ~

134

Classical Estimation

 Method of Maximum Likelihood (Best Properties)

Θ(X) is obtained as that which maximizes the likelihood function, a function essentially describing the probability of observing what was observed

By selecting the values for the parameter that maximize the likelihood function, we select the parameter values which maximize the probability of observing what we oberved

There are several forms of likelihood functions

135 Formulating the Likelihood Function – Complete Samples

The Likelihood has many forms, based on the data

► Complete Samples: a random sample X1, ..., Xn L(θ|X) = Πi=1,n f(Xi|θ)

X

X

X

X

X

Exact failure times observed

136 Formulating the Likelihood Function - Censoring

► Right Censored Samples: A life test with n items that stops after time t*, if r failures are observed, let the observed failure times be denoted X(r) = X(1), ..., X(r) in addition we know X(i) > t* for i > r, L(θ|X(r),t*) ={Πi=1,r f(X(i)|θ)}R(t*|θ)n-r

~

X X

( (

X

0 t*

137 Formulating the Likelihood Function-Censoring

► Left Censored Samples: A life test with n items that begins at t = 0 but we do not get to observe the condition of the items until after time t*. Let r items be observed to be failed at t* and let the observed failure times be denoted X(n-r) = X(r+1), ..., X(n) in addition, we know X(i) ≤ t* for i ≤ r. L(θ|X) = {Πi=r+1,n f(X(i)|θ)}F(t*|θ)r

X X

)

X

0 t*

)

138 Formulating the Likelihood Function-Censoring

► Interval Censored Samples: A life test with n items begins at time t = 0 but observation of the state of the items (failed or surviving) is only at fixed time points 0 = t0 < t1< …. < tk < tk+1 = ∞. The test is stopped at tk. Let Xi, i = 1, ...,k denote the number of items observed failed in [tk-1,tk], Xk+1 is the number still surviving at tk L(θ|X) ∝ Πi=1,k+1 [F(ti|θ) - F(ti-1|θ)]Xi

( )

0 t1 t2 t3 t4

( )

( )

( )

139 Formulating the Likelihood Function

► Or any mixture

X

0 t1 t2 t3 t4

( )

)

( (

L(θ|Data)∝[F(t2|θ)]*[F(t2|θ)-F(t1|θ)]* R(t3|θ)*R(t4|θ)*f(t1|θ) Usually for numeric reasons we take the natural log and maximize

140 Formulating the Likelihood Function

► Example:Consider the following failure time data from an exponential distribution

t1=5, t2=12, t3=26, t4>10, t5>17, t6<4, t7∈[5,10], t8∈[5,10], t9∈[11,16], t10∈[20,30],

L = f(5)*f(12)*f(26)*R(10)*R(17)*F(4) *[F[10)-F(5)]2*[F(16)-F(11)]*[F(30)-F(20)]

or

141

Maximum Likelihood Estimation

- Weibull Likelihood Plots

142

Bayesian Statistical Inference for Time Dependent Model

Parameters

143

► Law of Total Probability  Given an event B and a collection of events A1, …, An which are mutually exclusive (Ai ∩ Aj =∅) and collectively exhaustive (∪Aj =Ω) then P(B) = ∑j=1,nP(B ∩ Aj) = ∑j=1,nP(B | Aj)P(Aj)

►  Bayes Law   Given an event B and a collection of events A1, …, An which are mutually exclusive (Ai ∩ Aj =∅) and collectively exhaustive (∪Aj =Ω) then P(Ai|B) = P(B | Ai)P(Ai)/ ∑j=1,nP(B | Aj)P(Aj)

Bayesian Statistical Inference

144

► Random Variables and The Law of Total Probability and Bayes Law  When a problem uses a random variable and

specifies its parameters conditioned on some physical act X~f(x|θ) where

 Unconditional questions about X – Law of TP

For example

Bayesian Statistical Inference

145

 Questions about θ given observations on X –Bayes Law

For example

Bayesian Statistical Inference

146

 Example: Products are produced by three separate machines. Machine 1,2, and 3 produce defective products with probability .1, .05 and .02 respectively and account for 10%, 40% and 50% of the total products produced.

Then X is number of defects, X|p~Bin(n,p) where n is a sample size and

Bayesian Statistical Inference

147

a. If a box of 10 product are randomly selected and we do not know which machine manufactured the products in the box, what is the probability of no defects?

Pr{X =0} = Pr{X 0|p=.10}Pr{p=.10}+Pr{X=0|p=.05}PR{p=.05}

+ Pr{X =0|p=.02} Pr{p=.02} = (.90)10(.10)+(.95)10(.40)+(.98)10(.50) = .6829

Bayesian Statistical Inference

148

b. If 1 defect is found what is the probability that machine 1 produced the box?

Bayesian Statistical Inference

149

Bayesian Statistical Inference

150

Bayesian Statistical Inference

Note: the spreadsheet works for a single observation but may be used sequentially for multiple observations

Example Pr{p=.1|X1=1,X2>3}

Pr{p=.1} Pr{p=.1|X1=1}

Pr{p=.1|X1=1,X2>3}

151

Bayesian Statistical Inference

► Bayes Theorem: Continuous Analogue  THM: Let X and Θ be continuous random vectors

with joint probability density f(x,θ) . Let f(x|θ) and f(θ|x) be the corresponding conditional densities and f(θ) = ∫ f(x,θ)dx be the marginal density of Θ. Then

f(θ|x) = f(x|θ)f(θ)/{∫ f(x|θ)f(θ)d θ}

 Proof: if f(θ) > 0 and f(x) > 0, f(θ|x) = f(x,θ)/f(x) ⇒ f(θ|x) = f(x,θ)/{∫f(x|θ)f(θ)dθ} = f(x|θ)f(θ)/{∫ f(x|θ)f(θ)dθ}

152

Bayesian Statistical Inference

► Principals of Bayesian Inference  Description of uncertainty is via probability,  Uncertainty about unknown parameters α, β, γ, etc

for statistical models is expressed via probability distributions for the parameters

 Given a model f(x|Θ) with unknown Θ, a distribution, g(θ) is specified using expert judgment. This is called the prior distribution for Θ and describes our uncertainty about Θ

153

Bayesian Statistical Inference

 If we wish to make probability statements about the random variable X, taking into account our uncertainty for Θ, we may do so using the

law of total probability f(x) = ∫ f(x|θ)g(θ)dθ This distribution is called the predictive distribution for x.

 Thus Pr{X∈A} = ∫Af(x)dx if X is continuous or using an appropriate summation if X is discrete

154

Bayesian Statistical Inference

 If data becomes available we update our uncertainty distribution for Θ using Bayes Theorem.

 We use the probability model to describe the form of the data as a function of the parameter. This is called the likelihood function.

 There are many forms of the likelihood function depending on the form of the data, however for a complete random sample X1,…, Xn from f(x|θ), the likelihood is given as

L(θ|x1,…, xn ) = Πi=1,n f(xi|θ)

155

Bayesian Statistical Inference

 Given the data, x = x1,…, xn the updated distribution which describes the uncertainty for Θ is given by Bayes Theorem as

g(θ|x) = L(θ|x)g(θ)/{∫ L(θ|x)g(θ)dθ}

This is called the posterior distribution for Θ and it describes our uncertainty for Θ in light of the data.

~

~

~ ~

156

Bayesian Statistical Inference

 If we wish to make probability statements about the random variable X, taking into account our uncertainty for Θ AFTER the a random sample is observed, we may do so using the law of total probability

f(x|x) = ∫f(x|θ)g(θ|x)dθ

This is called the predictive distribution for X after observing x.

~ ~

~

157

Bayesian Statistical Inference

BEFORE DATA AFTER DATA

OBSERVABLE

PARAMETER

Prior Predictive f(x)

Posterior Predictive f(x|x)

Prior g(θ)

Posterior g(θ|x)

~

~

158

Bayesian Statistical Inference

• Example (Effect of Prior on Posterior) Failure Data: (12,10,15,5,8)

159

Bayesian Statistical Inference

• Example (Effect of Prior on Posterior) Failure Data: (.1,.5,.3,.1,.2)

160 Bayesian Statistics: Defining the Prior

► Methods  Conjugate Priors  Noninformative Priors  Maximum Entropy Priors  Empirical Bayes Priors

161 Bayesian Statistics: Prior Selection

PRIOR ASSESSMENT

Access to Experts

Any Prior

Access to Data

Access to Partial Information Moments

Access to Computer

Conjugate Prior

Yes

No

Empirical Bayes Prior

Maximum Information Prior

Noninformative Prior

Yes

Yes

Yes

No

No

No

162

Example Bayes Analysis

The number of non serious accidents at a plant is given by a Poisson process with rate λ per year. However, since λ is unknown, a prior distribution is constructed. The gamma distribution with υ=1 and α=5 is selected. Given the above, what point estimate would you use for λ?

163

Example Bayes Analysis

The number of non serious accidents at a plant is given by a Poisson process with rate l per year. However, since λ is unknown, a prior distribution is constructed. The gamma distribution with υ=1 and α=5 is selected.

What is the probability that λ is less than .1? What is the probability of more than 2 accidents per year?

164

Example Bayes Analysis

For a year period, we observe 3 accidents. Plot the prior and posterior distribution of λ. In a year we observe 3 accidents, what is the probability of more than 2 accidents in the following year?

165

Example Bayes Analysis