Important Discrete Distributions

35
Iyer - Lecture 10 ECE 313 โ€“ Fall 2016 Important Discrete Distributions: Poisson, Geometric, & Modified Geometric ECE 313 Probability with Engineering Applications Lecture 10 Ravi K. Iyer Department of Electrical and Computer Engineering University of Illinois

Transcript of Important Discrete Distributions

Page 1: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Important Discrete Distributions: Poisson, Geometric, & Modified Geometric

ECE 313Probability with Engineering Applications

Lecture 10Ravi K. Iyer

Department of Electrical and Computer Engineering University of Illinois

Page 2: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Todayโ€™s Topics

โ€ขRandom Variablesโ€“Example: โ€“Geometric/modified Geometric Distribution โ€“ Poisson derived from Bernoulli trials; Examplesโ€“Examples: Verifying CDF/pdf/pmf;

โ€ขAnnouncements:โ€“ Homework 4 out todayโ€“ In class activity next Wednesday,.

Page 3: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Binomial Random Variable (RV) Example:Twin Engine vs 4-Engine Airplane

โ€ข Suppose that an airplane engine will fail, when in flight, withprobability 1โˆ’p independently from engine to engine. Also,suppose that the airplane makes a successful flight if at least 50percent of its engines remain operative. For what values of p isa four-engine plane preferable to a two-engine plane?

โ€ข Because each engine is assumed to fail or functionindependently: the number of engines remaining operational isa binomial random variable. Hence, the probability that a four-engine plane makes a successful flight is:

4322

04322

)1(4)1(6

)1(44

)1(34

)1(24

ppppp

pppppp

+-+-=

-รทรทรธ

รถรงรงรจ

รฆ+-รทรท

รธ

รถรงรงรจ

รฆ+-รทรท

รธ

รถรงรงรจ

รฆ

Page 4: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Binomial RV Example 3 (Contโ€™)โ€ข The corresponding probability for a two-engine plane is:

โ€ข The four-engine plane is safer if:

โ€ข Or equivalently if:

โ€ข Hence, the four-engine plane is safer when the engine successprobability is at least as large as 2/3, whereas the two-engine plane issafer when this probability falls below 2/3.

24322 )1(2)1(4)1(6 pppppppp +-ยณ+-+-

22 )1(222

)1(12

pppppp +-=รทรทรธ

รถรงรงรจ

รฆ+-รทรท

รธ

รถรงรงรจ

รฆ

pppppp -ยณ+-+- 2)1(4)1(6 322

0)23()1(02783 223 ยณ--ยณ-+- pporppp

32023 ยณยณ- porp

Page 5: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Geometric Distribution: Examples

โ€ข Some Examples where the geometric distribution occurs

1. The probability the ith item on a production line is defective is given by the geometric pmf.

2. The pmf of the random variable denoting the number of time slices needed to complete the execution of a job

Page 6: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Geometric Distribution Examples

3. Consider a repeat loop

โ€ข repeat S until B

โ€ข The number of tries until B (success) is reached (i.e.,includesB), is a geometrically distributed Random Variable with parameter p.

Page 7: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Discrete DistributionsGeometric pmf (cont.)

โ€ข To find the pmf of a geometric Random Variable (RV), Z note that the event [Z = i] occurs if and only if we have a sequence of (i โ€“ 1) โ€œfailuresโ€ followed by one success - a sequence of independent Bernoulli trials each with the probability of success equal to p and failure q.

โ€ข Hence, we have the pdffor i = 1, 2,..., (A)

โ€“ where q = 1 - p.โ€ข Using the formula for the sum of a geometric series, we have:

โ€ข CDF of Geometric distr.:

1i1iZ p)p(1pq(i)p -- -==

11

)(1 1

1 ==-

==รฅ รฅยฅ

=

ยฅ

=

-

pp

qppqip

i i

iZ

รซ รปรซ รป

รฅ=

- --=-=t

t

1i

1iZ p)(11p)p(1(t)F

Page 8: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Modified Geometric Distribution Example

โ€ข Consider the program segment consisting of a while loop:

โ€ข while ยฌ B do S

โ€ข the number of times the body (or the statement-group S) of the loop is executed: a modified geometric distribution with parameter p (probability the B is not true) โ€“ no. of failures until the first success.

Page 9: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Discrete Distributionsthe Modified Geometric pmf (cont.)

โ€ข The random variable X is said to have a modified geometric pmf, specify by

for i = 0, 1, 2,...,

โ€ข The corresponding Cumulative Distribution function is:

iX p)p(1(i)p -=

for t โ‰ฅ 0รซ รป

รซ รป

รฅ=

+--=-=t

i

tiX ppptF

0

1)1(1)1()(

Page 10: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: Geometric Random Variable

A representative from the NFL Marketing division randomly selects people on a random street in Chicago loop, until he/she finds a person who attended the last home football game. Let p, the probability that she succeeds in finding such a person, is 0.2 and X denote the number of people asked until the first success. โ€ข What is the probability that the representative must select 4

people until he finds one who attended the last home game?๐‘ƒ ๐‘‹ = 4 = 1 โˆ’ 0.2 * 0.2 = 0.1024

โ€ข What is the probability that the representative must select more than 6 people before finding one who attended the last home game?๐‘ƒ ๐‘‹ > 6 = 1 โˆ’ ๐‘ƒ ๐‘‹ โ‰ค 6 = 1 โˆ’ 1 โˆ’ 1 โˆ’ 0.2 . = 0.262

Page 11: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

The Poisson Random Variableโ€ข A random variable X, taking on one of the values 0,1,2,โ€ฆ, is said

to be a Poisson random variable with parameter ฮฑ, if for some ฮฑ>0,

defines a probability mass function since

1!

)(00

=== -ยฅ

=

ยฅ

=

- รฅรฅ aaa a eei

eipi

i

i

,...1,0,!

}{)( ==== - ii

eiXPipiaa

Page 12: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Poisson Random Variable

Consider smaller intervals, i.e., let ๐‘› โ†’ โˆž

๐‘ƒ ๐‘‹ = ๐‘˜ = lim6โ†’7

๐œ†๐‘ก:

๐‘˜!๐‘› ๐‘› โˆ’ 1 โ€ฆ (๐‘› โˆ’ ๐‘˜ + 1)

๐‘› @ ๐‘› @ ๐‘›โ€ฆ๐‘› 1 โˆ’๐œ†๐‘ก๐‘›

61 โˆ’

๐œ†๐‘ก๐‘›

A:

= lim6โ†’7

๐œ†๐‘ก:

๐‘˜! 1 @ 1 โˆ’1๐‘› 1 โˆ’

2๐‘› โ€ฆ (1 โˆ’

๐‘˜ + 1๐‘› ) 1 โˆ’

๐œ†๐‘ก๐‘›

61 โˆ’

๐œ†๐‘ก๐‘›

A:

= BCD

:!๐‘’ABC

Which is a Poisson process with ๐›ผ = ๐œ†๐‘ก

Page 13: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Geometric Distribution Examples

3. Consider the program segment consisting of a while loop:

โ€ข while ยฌ B do S

โ€ข the number of times the body (or the statement-group S) of the loop is executed: a modified geometric distribution with parameter p (probability the B is not true) โ€“ no. of failures until the first success.

4. Consider a repeat loop

โ€ข repeat S until B

โ€ข The number of tries until B (success) is reached will be a geometrically distributed random variable with parameter p.

Page 14: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Discrete DistributionsGeometric pmf (cont.)

โ€ข To find the pmf of Z note that the event [Z = i] occurs if and only if we have a sequence of (i โ€“ 1) failures followed by one success - a sequence of independent Bernoulli trials each with the probability of success equal to p and failure q.

โ€ข Hence, we havefor i = 1, 2,..., (A)

โ€“ where q = 1 - p.โ€ข Using the formula for the sum of a geometric series, we have:

โ€ข CDF of Geometric distr.:

1i1iZ p)p(1pq(i)p -- -==

11

)(1 1

1 ==-

==รฅ รฅยฅ

=

ยฅ

=

-

pp

qppqip

i i

iZ

รซ รปรซ รป

รฅ=

- --=-=t

t

1i

1iZ p)(11p)p(1(t)F

Page 15: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Discrete Distributionsthe Modified Geometric pmf (cont.)

โ€ข The random variable X is said to have a modified geometric pmf, specify by

for i = 0, 1, 2,...,

โ€ข The corresponding Cumulative Distribution function is:

iX p)p(1(i)p -=

for t โ‰ฅ 0รซ รป

รซ รป

รฅ=

+--=-=t

i

tiX ppptF

0

1)1(1)1()(

Page 16: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: Geometric Random Variable

A representative from the NFL Marketing division randomly selects people on a random street in Chicago loop, until he/she finds a person who attended the last home football game. Let p, the probability that he succeeds in finding such a person, be 0.2 and X denote the number of people he selects until he finds his first success. โ€ข What is the probability that the representative must select 4

people until he finds one who attended the last home game?๐‘ƒ ๐‘‹ = 4 = 1 โˆ’ 0.2 * 0.2 = 0.1024

โ€ข What is the probability that the representative must select more than 6 people before he finds one who attended the last home game?๐‘ƒ ๐‘‹ > 6 = 1 โˆ’ ๐‘ƒ ๐‘‹ โ‰ค 6 = 1 โˆ’ 1 โˆ’ 1 โˆ’ 0.2 . = 0.262

Page 17: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

The Poisson Random Variableโ€ข A random variable X, taking on one of the values 0,1,2,โ€ฆ, is said

to be a Poisson random variable with parameter ฮป, if for some ฮป>0,

defines a probability mass function since

1!

)(00

=== -ยฅ

=

ยฅ

=

- รฅรฅ lll l eei

eipi

i

i

,...1,0,!

}{)( ==== - ii

eiXPipill

Page 18: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Poisson Random Variable

Consider smaller intervals, i.e., let ๐‘› โ†’ โˆž

๐‘ƒ ๐‘‹ = ๐‘˜ = lim6โ†’7

๐œ†๐‘ก:

๐‘˜!๐‘› ๐‘› โˆ’ 1 โ€ฆ (๐‘› โˆ’ ๐‘˜ + 1)

๐‘› @ ๐‘› @ ๐‘›โ€ฆ๐‘› 1 โˆ’๐œ†๐‘ก๐‘›

61 โˆ’

๐œ†๐‘ก๐‘›

A:

= lim6โ†’7

๐œ†๐‘ก:

๐‘˜! 1 @ 1 โˆ’1๐‘› 1 โˆ’

2๐‘› โ€ฆ (1 โˆ’

๐‘˜ + 1๐‘› ) 1 โˆ’

๐œ†๐‘ก๐‘›

61 โˆ’

๐œ†๐‘ก๐‘›

A:

= BCD

:!๐‘’ABC

Which is a Poisson process with ๐›ผ = ๐œ†๐‘ก

Page 19: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: Poisson Random Variables

A cloud computing system failure occurs according to a Poisson distribution with an average of 3 failures every 10 weeks. I.e., the failure within t week(s) is Poisson distributed with ๐›ผ= 0.3ti. Calculate the probability that there will not be more than one

failure during a particular week

๐ฟ๐‘’๐‘ก๐‘‹C๐‘‘๐‘’๐‘›๐‘œ๐‘ก๐‘’๐‘กโ„Ž๐‘’๐‘›๐‘ข๐‘š๐‘๐‘’๐‘Ÿ๐‘œ๐‘“๐‘“๐‘Ž๐‘–๐‘™๐‘ข๐‘Ÿ๐‘’๐‘ ๐‘ค๐‘–๐‘กโ„Ž๐‘–๐‘›๐‘ก๐‘ค๐‘’๐‘’๐‘˜๐‘ 

๐‘ƒ ๐‘‹V = 0๐‘œ๐‘Ÿ1 = ๐‘ƒ ๐‘‹V = 0 + ๐‘ƒ ๐‘‹V = 1 =๐‘’AW.*0.3W

0! +๐‘’AW.*0.3V

1!ii. Calculate the probability that there will be at least one failure

during a particular month (4 weeks)

๐‘ƒ ๐‘‹Y โ‰ฅ 1 = 1 โˆ’ ๐‘ƒ ๐‘‹Y = 0 = 1 โˆ’๐‘’AW.*โˆ—Y(0.3 โˆ— 4)W

0!

Page 20: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Poisson Random Variable

A manufacturer produces VLSI chips, 1% of which are defective. Find the probability that in a box containing 100 chips, no defectives are found.

Using Poisson approximation, ฮฑ=100*0.01=1,

We can verify that the probabilities are non negative and sum to 1

]ฮฑ:

๐‘˜! ๐‘’Aฮฑ

7

:^W

= ๐‘’Aฮฑ]ฮฑ:

๐‘˜!

7

:^W

= ๐‘’Aฮฑ๐‘’ฮฑ = 1

Page 21: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: PDF, PMF

Verify whether below are valid PDF/PMF.

๐‘“ ๐‘ฅ = `VY๐‘ฅ*, 0 โ‰ค ๐‘ฅ โ‰ค 20, ๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’

Answer:1. ๐‘“ ๐‘ฅ โ‰ฅ 02. โˆซ ๐‘“ ๐‘ฅ ๐‘‘๐‘ฅ = โˆซ V

Y๐‘ฅ*๐‘‘๐‘ฅ = 1c

W7A7

Therefore, it is a valid PDF.

๐‘ ๐‘ฅ = eVVW

3๐‘ฅ โˆ’ 2 , ๐‘ฅ = 0,1,2,30, ๐‘œ๐‘กโ„Ž๐‘’๐‘Ÿ๐‘ค๐‘–๐‘ ๐‘’

Answer:1. โˆ‘ ๐‘7

:^W (x=k)=โˆ‘ ๐‘*:^W (x=k)=1

2. However, p(x=0)=AcVW< 0

Therefore, it is not a valid PMF.

Page 22: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: CDF

โ€ข Verify whether the following is a valid CDF

๐น ๐‘ฅ = e0, ๐‘ฅ < 0

๐‘ฅc, 0 โ‰ค ๐‘ฅ < 11, ๐‘ฅ โ‰ฅ 1

Answer:1. lim

rโ†’A7๐น ๐‘ฅ = 0

2. limrโ†’7

๐น ๐‘ฅ = 13. IsF(x)monotonicallynon-decreasing? Yes

Therefore, it is a valid CDF.

Page 23: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: Identify Random Variables

For each of the following random variables, i) determine if it is discrete or continuous, ii) specify the distribution of the identified random variable (from those you have learned in the class), iii) write the formula for probability mass function (pmf) or probability density function (pdf) based on the information provided:

I. A binary communication channel introduces a single bit error in each transmission with probability of 0.1. Let X be the random variable representing the number of errors in n independent transmissions.

i. Discreteii. Binomialiii. ๐‘ƒ ๐‘‹ = ๐‘– = 6

๏ฟฝ ๐‘๏ฟฝ 1 โˆ’ ๐‘ 6A๏ฟฝ

Page 24: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Example: Identify Random Variables

II. A sequence of characters is transmitted over a channel that introduces errors with probability 0.2. Let N be the random variable representing the number of error-free characters transmitted before an error occurs.

i. Discreteii. Modified Geometriciii. ๐‘ƒ ๐‘ = ๐‘– = 1 โˆ’ ๐‘ ๏ฟฝ๐‘

Page 25: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Review: Continuous Random Variables

โ€ข Continuous Random Variables:โ€“ Probability distribution function (pdf):

โ€ข Properties:

โ€ข All probability statements about X can be answered by f(x):

โ€“ Cumulative distribution function (CDF):

โ€ข Properties:โ€ข A continuous function

P{X โˆˆ B} = f (x)dxBโˆซ

รฒยฅ

ยฅ-=ยฅ-ยฅรŽ= dxxfXP )()},({1

รฒ=ยฃยฃb

adxxfbXaP )(}{

รฒ ===a

adxxfaXP 0)(}{

)()( afaFdad

=

รฒยฅ-

ยฅ<<ยฅ-=ยฃ=x

xx xdttfxXPxF , )()()(

Page 26: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution

Page 27: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution

โ€ข Extremely important in statistical application because of the central limit theorem:โ€“ Under very general assumptions, the mean of a sample of n mutually

independent random variables is normally distributed in the limit n ยฎยฅ.โ€ข Errors in measurement often follows this distribution.โ€ข During the wear-out phase, component lifetime follows a normal

distribution.โ€ข The normal density is given by:

where are two parameters of the distribution.

f (x) = 1ฯƒ 2ฯ€

exp โˆ’12x โˆ’ยตฯƒ

"

#$

%

&'

2(

)**

+

,--, โˆ’โˆž < x <โˆž

โˆ’โˆž < ยต <โˆž and ฯƒ > 0

Page 28: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution (cont.)

โ€ข Normal density with parameters ยต =2 and s =1

f x(x)

0.4

0.2

-2.00 1.00 4.00 7.00-5.00

x

s =1

ยต = 2

1ฯƒ 2ฯ€

Page 29: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution (cont.)

โ€ข The distribution function (CDF) F(x) has no closed form, so between every pair of limits a and b, probabilities relating to normal distributions are usually obtained numerically and recorded in special tables.

โ€ข These tables apply to the standard normal distribution[Z ~ N(0,1)] --- a normal distribution with parameters ยต = 0 , s = 1 --- and their entries are the values of:

รฒยฅ-

-=z

tZ dtezF 2

2

21)(p

Page 30: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution (cont.)

โ€ข Since the standard normal density is clearly symmetric, it follows that for z > 0:

โ€ข The tabulations of the normal distribution are made only for z โ‰ฅ 0 To find P(a โ‰ค Z โ‰ค b), use F(b) - F(a).

FZ (โˆ’z) = fZ (t)dtโˆ’โˆž

โˆ’ z

โˆซ

= fZz

โˆž

โˆซ (โˆ’t)dt

= fZz

โˆž

โˆซ (t)dt

= fZ (t)dt โˆ’ fZโˆ’โˆž

z

โˆซโˆ’โˆž

โˆž

โˆซ (t)dt

= 1โˆ’ FZ (z)

Page 31: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution (cont.)

โ€ข The CDF of the N(0,1) distribution ( ) is denoted in the tables by , and its complementary CDF is denoted by , so: F Q

)(zFZ

)()(1)( uuuQ -F=F-=

Page 32: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution (cont.)

โ€ข For a particular value, x, of a normal random variable X, the corresponding value of the standardized variable Z is:

โ€ข The Cumulative distribution function of X can be found by using:

alternatively:

โ€ข Similarly, if X is normally distributed with parameters ฮผ and ฯƒ2 thenZ = ฮฑX + ฮฒ is normally distributed with parameters ฮฑฮผ + ฮฒ and ฮฑ2ฯƒ2.

sยต /)( -= XZ

)( )(

)(

)()(

sยตsยต

sยต

zFzXP

zXP

zZPzF

X

Z

+=+ยฃ=

ยฃ-

=

ยฃ=

)()(sยต-

=xFxF ZX

Page 33: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution Example 1

โ€ข An analog signal received at a detector (measured in microvolts) may be modeled as a Gaussian random variable N(200, 256) at a fixed point in time. What is the probability that the signal will exceed 240 microvolts? What is the probability that the signal is larger than 240 microvolts, given that it is larger than 210 microvolts?

P(X > 240) = 1โˆ’ P(X โ‰ค 240)

= 1โˆ’ FZ240 โˆ’ 200

16# $ %

& ' (

= 1โˆ’ FZ (2.5) โ‰ˆ 0.00621

Page 34: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution Example 1 (cont.)

โ€ข Next:

P(X โ‰ฅ 240 | X โ‰ฅ 210) =P(X) โ‰ฅ 240)P(X โ‰ฅ 210)

=1โˆ’ FZ

240 โˆ’ 20016

# $ %

& ' (

1โˆ’ FZ210 โˆ’ 200

16# $ %

& ' (

=0.006210.26599

โ‰ˆ 0.02335

Page 35: Important Discrete Distributions

Iyer - Lecture 10 ECE 313 โ€“ Fall 2016

Normal or Gaussian Distribution Example 2

โ€ข Assuming that the life of a given subsystem, in the wear-out phase, is normally distributed with ยต = 10,000 hours and s = 1,000 hours, determine the reliability for an operating time of 500 hours given that

โ€“ (a) The age of the component is 9,000 hours,โ€“ (b) The age of the component is 11,000.

โ€ข The required quantity under (a) is R9,000(500) and under (b) is R11,000(500).