Discrete Probability Distributions

42
Copyright 2010 John Wiley & Sons, Inc. 1 Copyright 2010 John Wiley & Sons, Inc. Business Statistics, 6 th ed. by Ken Black Chapter 5 Discrete Probability Distributions

description

 

Transcript of Discrete Probability Distributions

Page 1: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 1Copyright 2010 John Wiley & Sons, Inc.

Business Statistics, 6th ed.by Ken Black

Chapter 5Discrete

ProbabilityDistributions

Page 2: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 2

Learning Objectives

Distinguish between discrete random variables and continuous random variables.Know how to determine the mean and variance of a discrete distribution.Identify the type of statistical experiments that can be described by the binomial distribution, and know how to work such problems.

Page 3: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 3

Discrete vs. Continuous Distributions

Discrete distributions – constructed from discrete (individually distinct) random variablesContinuous distributions – based on continuous random variablesRandom Variable - a variable which contains the outcomes of a chance experiment

Page 4: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 4

Discrete vs. Continuous Distributions

Categories of Random VariablesDiscrete Random Variable - the set of all possible values is at most a finite or a countable infinite number of possible valuesContinuous Random Variable - takes on values at every point over a given interval

Page 5: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 5

Describing a Discrete Distribution

A discrete distribution can be described by constructing a graph of the distributionMeasures of central tendency and variability can be applied to discrete distributionsDiscrete values of outcomes are used to represent themselves

Page 6: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 6

Describing a Discrete Distribution

Mean of discrete distribution – is the long run average

If the process is repeated long enough, the average of the outcomes will approach the long run average (mean)

Requires the process to eventually have a number which is the product of many processes

Mean of a discrete distribution µ = ∑ (X * P(X))

where (X) is the long run average;X = outcome, P = Probability of X

Page 7: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 7

Describing a Discrete Distribution

Variance and Standard Deviation of a discrete distribution are solved by using the outcomes (X) and probabilities of outcomes (P(X)) in a manner similar to computing a meanStandard Deviation is computed by taking the square root of the variance

Page 8: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 8

Some Special Distributions

DiscretebinomialPoissonHypergeometric

Continuousnormaluniformexponentialtchi-squareF

Page 9: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 9

Discrete Distribution -- Example

Observe the discrete distribution in the following table. An executive is considering out-of-town business travel for a given Friday. At least one crisis could occur on the day that the executive is gone. The distribution contains the number of crises that could occur during the day the executive is gone and the probability that each number will occur. For example, there is a .37 probability that no crisis will occur, a .31 probability of one crisis, and so on.

Page 10: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 10

012345

0.370.310.180.090.040.01

Number of Crises Probability

Distribution of Daily Crises

0

0.1

0.2

0.3

0.4

0.5

0 1 2 3 4 5

Probability

Number of Crises

Discrete Distribution -- Example

Page 11: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 11

2.1)(22 XPX

212 110. .

Variance and Standard Deviation of a Discrete Distribution

X-10123

P(X).1.2.4.2.1

-2-1012

X 41014

.4

.2

.0

.2

.41.2

)(2X

2( ) ( )X P X

Page 12: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 12

Requirements for a Discrete Probability Function -- Examples

X P(X)

-10123

.1

.2

.4

.2

.11.0

X P(X)

-10123

-.1.3.4.3.1

1.0

X P(X)

-10123

.1

.3

.4

.3

.11.2

: YES NO NO

Page 13: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 13

Mean of a Discrete Distribution

E X X P X( )

X-10123

P(X).1.2.4.2.1

-.1.0.4.4.3

1.0

X P X ( )

= 1.0

Page 14: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 14

E X X P X( ) .115

Mean of the Crises Data Example

X P(X) XP(X)

0 .37 .00

1 .31 .31

2 .18 .36

3 .09 .27

4 .04 .16

5 .01 .05

1.15

0

0.1

0.2

0.3

0.4

0.5

0 1 2 3 4 5

Probability

Number of Crises

Page 15: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 15

22

141 X P X( ) . 2

141 119. .

Variance and Standard Deviationof Crises Data Example

X P(X) (X-) (X-)2 P(X)

0 .37 -1.15 1.32 .49

1 .31 -0.15 0.02 .01

2 .18 0.85 0.72 .13

3 .09 1.85 3.42 .31

4 .04 2.85 8.12 .32

5 .01 3.85 14.82 .15

1.41

(X-)2

Page 16: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 16

Binomial Distribution

nXXnX

nXP qp

XnX

0for

!!

!)(

n p

2

2

n p q

n p q

Probability function

Mean value

Variance and Standard Deviation

Page 17: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 17

According to the U.S. Census Bureau, approximately 6% of all workers in Jackson, Mississippi, are unemployed. In conducting a random telephone survey in Jackson, what is the probability of getting two or fewer unemployed workers in a sample of 20?

Binomial Distribution:Demonstration Problem 5.3

Page 18: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 18

Binomial Distribution:Demonstration Problem 5.3

In the following example, 6% are unemployed => pThe sample size is 20 => n94% are employed => qx is the number of successes desiredWhat is the probability of getting 2 or fewer unemployed workers in the sample of 20?The hard part of this problem is identifying p, n, and x – emphasis this when studying the problems.

Page 19: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 19

n

p

q

P X P X P X P X

20

06

94

2 0 1 2

2901 3703 2246 8850

.

.

( ) ( ) ( ) ( )

. . . .

2901.)2901)(.1)(1()!020(!0

!20)0( 94.06.

0200

XP

P X( )!( )!

( )(. )(. ) .. .

120!

1 20 120 06 3086 3703

1 20 1

06 94

P X( )!( )!

( )(. )(. ) .. .

220!

2 20 2190 0036 3283 2246

2 20 2

06 94

Binomial Distribution:Demonstration Problem 5.3

Page 20: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 20

Binomial Distribution Table:Demonstration Problem 5.3

n

p

q

P X P X P X P X

20

06

94

2 0 1 2

2901 3703 2246 8850

.

.

( ) ( ) ( ) ( )

. . . .

P X P X( ) ( ) . . 2 1 2 1 8850 1150

n p ( )(. ) .20 06 1 202

2

20 06 94 1 128

1 128 1 062

n p q ( )(. )(. ) .

. .

n = 20 PROBABILITY

X 0.05 0.06 0.07

0 0.3585 0.2901 0.2342

1 0.3774 0.3703 0.3526

2 0.1887 0.2246 0.2521

3 0.0596 0.0860 0.1139

4 0.0133 0.0233 0.0364

5 0.0022 0.0048 0.0088

6 0.0003 0.0008 0.0017

7 0.0000 0.0001 0.0002

8 0.0000 0.0000 0.0000

… … …

20 0.0000 0.0000 0.0000

Page 21: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 21

Excel’s Binomial Function

n = 20

p = 0.06

X P(X)

0 =BINOMDIST(A5,B$1,B$2,FALSE)

1 =BINOMDIST(A6,B$1,B$2,FALSE)

2 =BINOMDIST(A7,B$1,B$2,FALSE)

3 =BINOMDIST(A8,B$1,B$2,FALSE)

4 =BINOMDIST(A9,B$1,B$2,FALSE)

5 =BINOMDIST(A10,B$1,B$2,FALSE)

6 =BINOMDIST(A11,B$1,B$2,FALSE)

7 =BINOMDIST(A12,B$1,B$2,FALSE)

8 =BINOMDIST(A13,B$1,B$2,FALSE)

9 =BINOMDIST(A14,B$1,B$2,FALSE)

Page 22: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 22

X P(X =x)

0 0.0000001 0.0000002 0.0000003 0.0000014 0.0000065 0.0000376 0.0001997 0.0008588 0.0030519 0.00904010 0.02250011 0.04727312 0.08404113 0.12642014 0.16053315 0.17123616 0.15220917 0.11142118 0.06602719 0.03089020 0.01098321 0.00278922 0.00045123 0.000035

Binomial with n = 23 and p = 0.64

Minitab’s Binomial Function

Page 23: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 23

Mean and Std Dev of Binomial Distribution

Binomial distribution has an expected value or a long run average denoted by µ (mu)

If n items are sampled over and over for a long time and if p is the probability of success in one trial, the average long run of successes per sample is expected to be np=> Mean µ = np=> Std Dev = √(npq)

Page 24: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 24

Poisson Distribution

The Poisson distribution focuses only on the number of discrete occurrences over some interval or continuum

Poisson does not have a given number of trials (n)as a binomial experiment doesOccurrences are independent of other occurrencesOccurrences occur over an interval

Page 25: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 25

Poisson Distribution

If Poisson distribution is studied over a long periodof time, a long run average can be determined

The average is denoted by lambda (λ)Each Poisson problem contains a lambda value from which the probabilities are determinedA Poisson distribution can be described by λ alone

Page 26: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 26

Poisson Distribution

)logarithms natural of base (the ...718282.2

:

,...3,2,1,0for !

)(

e

averagerunlong

where

XX

XP eX

Probability function

Mean value

Standard deviation Variance

Page 27: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 27

Poisson Distribution:Demonstration Problem 5.7

Bank customers arrive randomly on weekday afternoons at an average of 3.2 customers every 4 minutes. What is the probability of having more than 7 customers in a 4-minute interval on a weekday afternoon?

Page 28: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 28

Poisson Distribution:Demonstration Problem 5.7

Solutionλ = 3.2 customers>minutes X > 7 customers/4 minutesThe solution requires obtaining the values of x = 8, 9, 10, 11, 12, 13, 14, . . . . Each x value is determined until the values are so far away from λ = 3.2 that the probabilities approach zero. The exact probabilities are summed to find x 7. If the bank has been averaging 3.2 customers every 4 minutes on weekday afternoons, it is unlikely that more than 7 people would randomly arrive in any one 4-minute period. This answer indicates that more than 7 people would randomly arrive in a 4-minute period only 1.69% of the time. Bank officers could use these results to help them make staffing decisions.

Page 29: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 29

0528.0!10

=)10=(

!=P(X)

minutes 8customers/ 4.6=

Adjusted

minutes 8customers/ 10 = X

minutes 4customers/ 2.3

4.610

X

6.4

e

e

XP

X

3 2

6 4

66

0 15866 4

.

!

!.

.

customers / 4 minutes

X = 6 customers / 8 minutes

Adjusted

= . customers / 8 minutes

P(X) =

( = ) =

X

66.4

e

eX

P X

Poisson Distribution:Demonstration Problem 5.7

Page 30: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 30

0060.0000.0002.0011.0047.

)9()8()7()6()5(

6.1

XPXPXPXPXP

Poisson Distribution:Using the Poisson Tables

X 0.5 1.5 1.6 3.00 0.6065 0.2231 0.2019 0.04981 0.3033 0.3347 0.3230 0.14942 0.0758 0.2510 0.2584 0.22403 0.0126 0.1255 0.1378 0.22404 0.0016 0.0471 0.0551 0.16805 0.0002 0.0141 0.0176 0.10086 0.0000 0.0035 0.0047 0.05047 0.0000 0.0008 0.0011 0.02168 0.0000 0.0001 0.0002 0.00819 0.0000 0.0000 0.0000 0.002710 0.0000 0.0000 0.0000 0.000811 0.0000 0.0000 0.0000 0.000212 0.0000 0.0000 0.0000 0.0001

Page 31: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 31

4751.3230.2019.1

)1()0(1)2(1)2(

6.1

XPXPXPXP

Poisson Distribution:Using the Poisson Tables

X 0.5 1.5 1.6 3.00 0.6065 0.2231 0.2019 0.04981 0.3033 0.3347 0.3230 0.14942 0.0758 0.2510 0.2584 0.22403 0.0126 0.1255 0.1378 0.22404 0.0016 0.0471 0.0551 0.16805 0.0002 0.0141 0.0176 0.10086 0.0000 0.0035 0.0047 0.05047 0.0000 0.0008 0.0011 0.02168 0.0000 0.0001 0.0002 0.00819 0.0000 0.0000 0.0000 0.002710 0.0000 0.0000 0.0000 0.000811 0.0000 0.0000 0.0000 0.000212 0.0000 0.0000 0.0000 0.0001

Page 32: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 32

Excel’s Poisson Function

= 1.6

X P(X)

0 =POISSON(D5,E$1,FALSE)

1 =POISSON(D6,E$1,FALSE)

2 =POISSON(D7,E$1,FALSE)

3 =POISSON(D8,E$1,FALSE)

4 =POISSON(D9,E$1,FALSE)

5 =POISSON(D10,E$1,FALSE)

6 =POISSON(D11,E$1,FALSE)

7 =POISSON(D12,E$1,FALSE)

8 =POISSON(D13,E$1,FALSE)

9 =POISSON(D14,E$1,FALSE)

Page 33: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 33

Minitab’s Poisson Function

X P(X =x)

0 0.1495691 0.2841802 0.2699713 0.1709824 0.0812165 0.0308626 0.0097737 0.0026538 0.0006309 0.00013310 0.000025

Poisson with mean = 1.9

Page 34: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 34

Mean and Std Dev of a Poisson Distribution

Mean of a Poisson Distribution is λUnderstanding the mean of a Poisson distribution gives a feel for the actual occurrences that are likely to happenVariance of a Poisson distribution is also λStd Dev = Square root of λ

Page 35: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 35

Poisson Approximation of the Binomial Distribution

Binomial problems with large sample sizes and small values of p, which then generate rare events, are potential candidates for use of the Poisson DistributionRule of thumb, if n > 20 and np < 7, the approximation is close enough to use the Poisson distribution for binomial problems

Page 36: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 36

Poisson Approximation of the Binomial Distribution

Procedure for Approximating binomial with PoissonBegin with the computation of the binomial mean distribution µ = npBecause µ is the expected value of the binomial, it becomes λ for Poisson distributionUse µ as the λ, and using the x from the binomial problem allows for the approximation of the probabilities from the Poisson table or Poisson formula

Page 37: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 37

If and the approximation is acceptable.n n p 20 7,

Use n p.

Binomial probabilities are difficult to calculate when n is large.Under certain conditions binomial probabilities may be approximated by Poisson probabilities.

Poisson approximation

Poisson Approximation of the Binomial Distribution

Page 38: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 38

Hypergeometric Distribution

Sampling without replacement from a finite populationThe number of objects in the population is denoted N.Each trial has exactly two possible outcomes, success and failure.Trials are not independentX is the number of successes in the n trialsThe binomial is an acceptable approximation,if n < 5% N. Otherwise it is not.

Page 39: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 39

Hypergeometric Distribution

A n

N

2

2

2

1

A N A n N n

NN( ) ( )

( )

P x

C C

C

A x N A n x

N n( )

Probability functionN is population sizen is sample sizeA is number of successes in populationx is number of successes in sample

MeanValue

Variance and standarddeviation

Page 40: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 40

N = 24X = 8n = 5

x0 0.10281 0.34262 0.36893 0.15814 0.02645 0.0013

P(x)

P xC C

CC C

C

A x N A n x

N n( )

,

.

3

56 120

42 504

1581

8 3 24 8 5 3

24 5

Hypergeometric Distribution:Probability Computations

Page 41: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 41

Excel’s Hypergeometric Function

N = 24

A = 8

n = 5

X P(X)

0 =HYPGEOMDIST(A6,B$3,B$2,B$1)

1 =HYPGEOMDIST(A7,B$3,B$2,B$1)

2 =HYPGEOMDIST(A8,B$3,B$2,B$1)

3 =HYPGEOMDIST(A9,B$3,B$2,B$1)

4 =HYPGEOMDIST(A10,B$3,B$2,B$1)

5 =HYPGEOMDIST(A11,B$3,B$2,B$1)

=SUM(B6:B11)

Page 42: Discrete  Probability Distributions

Copyright 2010 John Wiley & Sons, Inc. 42

Minitab’s Hypergeometric Function

X P(X =x)

0 0.1027671 0.3425562 0.3689063 0.1581034 0.0263505 0.001318

Hypergeometric with N = 24, A = 8, n = 5