Chapter 3 Basic Concepts in Statistics and Probability

59
Chapter 3 Basic Concepts in Statistics and Probability

description

Chapter 3 Basic Concepts in Statistics and Probability. 3.6 Continuous Distributions. F distribution Beta distribution Uniform distribution. Normal distribution Student t-distribution Exponential distribution Lognormal distribution Weibull distribution Extreme value distribution - PowerPoint PPT Presentation

Transcript of Chapter 3 Basic Concepts in Statistics and Probability

Page 1: Chapter  3 Basic Concepts in Statistics and Probability

Chapter 3

Basic Concepts in Statistics and Probability

Page 2: Chapter  3 Basic Concepts in Statistics and Probability

3.6 Continuous Distributions

• Normal distribution• Student t-distribution• Exponential distribution• Lognormal distribution• Weibull distribution• Extreme value distribution• Gamma distribution• Chi-square distribution• Truncated normal distribution• Bivariate and multivariate

normal distribution

• F distribution• Beta distribution• Uniform distribution

Page 3: Chapter  3 Basic Concepts in Statistics and Probability

The normal distribution (also called the Gaussian distribution) is by far the most commonly used distribution in statistics. This distribution provides a good model for many, although not all, continuous populations.

The normal distribution is continuous rather than discrete. The mean of a normal population may have any value, and the variance may have any positive value.

3

3.6.1 Normal Distributions

Page 4: Chapter  3 Basic Concepts in Statistics and Probability

Probability Density Function, Mean, and Variance of Normal Dist.

The probability density function of a normal population with mean and variance 2 is given by

If X ~ N(, 2), then the mean and variance of X are given by

4

2 2

X

X

2 2( ) / 21( ) , 2

xf x e x

Page 5: Chapter  3 Basic Concepts in Statistics and Probability

68-95-99.7% Rule

This figure represents a plot of the normal probability density function with mean and standard deviation . Note that the curve is symmetric about , so that is the median as well as the mean. It is also the case for the normal population.

• About 68% of the population is in the interval . • About 95% of the population is in the interval 2.• About 99.7% of the population is in the interval 3.

5

Page 6: Chapter  3 Basic Concepts in Statistics and Probability

Standard Units

• The proportion of a normal population that is within a given number of standard deviations of the mean is the same for any normal population.

• For this reason, when dealing with normal populations, we often convert from the units in which the population items were originally measured to standard units.

• Standard units tell how many standard deviations an observation is from the population mean.

6

Page 7: Chapter  3 Basic Concepts in Statistics and Probability

Standard Normal Distribution

In general, we convert to standard units by subtracting the mean and dividing by the standard deviation. Thus, if x is an item sampled from a normal population with mean and variance 2, the standard unit equivalent of x is the number z, where

z = (x - )/.The number z is sometimes called the “z-score” of x. The z-score is an item sampled from a normal population with mean 0 and standard deviation of 1. This normal distribution is called the standard normal distribution.

7

Page 8: Chapter  3 Basic Concepts in Statistics and Probability

Finding Areas Under the Normal Curve

• The proportion of a normal population that lies within a given interval is equal to the area under the normal probability density above that interval. This would suggest integrating the normal pdf, but this integral does not have a closed form solution.

• So, the areas under the curve are approximated numerically and are available in Table B. This table provides area under the curve for the standard normal density. We can convert any normal into a standard normal so that we can compute areas under the curve.

• The table gives the area in the right-hand tail of the curve between and z. Other areas can be calculated by subtraction or by using the fact that the normal distribution is symmetrical and that the total area under the curve is 1.

8

Page 9: Chapter  3 Basic Concepts in Statistics and Probability

Normal Probabilities

Excel:NORM.DIST(x, mean, standard_dev, cumulative)NORM.INV(probability, mean, standard_dev)NORM.S.DIST(z)NORM.S.INV(probability)

Minitab:Calc Probability Distributions Normal

9

Page 10: Chapter  3 Basic Concepts in Statistics and Probability

Linear Functions of Normal Random Variables

Let X ~ N(, 2) and let a ≠ 0 and b be constants. Then aX + b ~ N(a + b, a22).

Let X1, X2, …, Xn be independent and normally distributed with means 1, 2,…, n and variances 1

2, 22,…, n

2. Let c1, c2,…, cn be constants, and c1 X1 + c2 X2 +…+ cnXn be a linear combination. Then

c1 X1 + c2 X2 +…+ cnXn ~ N(c11 + c2 2 +…+ cnn, c1

212 + c2

222 + … +cn

2n2)

10

Page 11: Chapter  3 Basic Concepts in Statistics and Probability

Distributions of Functions of Normal Random Variables

Let X1, X2, …, Xn be independent and normally distributed with mean and variance 2. Then

Let X and Y be independent, with X ~ N(X, X2) and

Y ~ N(Y, Y2). Then

11

2

~ , .σX N μn

2 2

2 2

~ ( , )

~ ( , )X Y X Y

X Y X Y

X Y N μ μ σ σ

X Y N μ μ σ σ

Page 12: Chapter  3 Basic Concepts in Statistics and Probability

3.6.2 t Distribution

• If X~N(, 2)

• If is Not known, but n30

• Let X1,…,Xn be a small (n < 30) random sample from a normal population with mean . Then the quantity

has a Student’s t distribution with n -1 degrees of freedom (denoted by tn-1).

12

nXZ

/)(

(3.7)

nsXZ

/)(

(3.8)

nsXt

/)(

(3.9)

Page 13: Chapter  3 Basic Concepts in Statistics and Probability

More on Student’s t

• The probability density of the Student’s t distribution is different for different degrees of freedom.

• The t curves are more spread out than the normal.

• Table C, called a t table, provides probabilities associated with the Student’s t distribution.

13

Page 14: Chapter  3 Basic Concepts in Statistics and Probability

t Distribution

The probability density function of t distribution is given by

14

Page 15: Chapter  3 Basic Concepts in Statistics and Probability

t Distribution

www.boost.org/.../graphs/students_t_pdf.png

Page 16: Chapter  3 Basic Concepts in Statistics and Probability

Other uses of t Distribution

Where is the parameter to be estimated, is the estimator based on a sample, is the sample estimator of

16

Page 17: Chapter  3 Basic Concepts in Statistics and Probability

3.6.3 Exponential Distribution

• The exponential distribution is a continuous distribution that is sometimes used to model the time that elapses before an event occurs (life testing and reliability).

• The probability density of the exponential distribution involves a parameter, which is the mean of the distribution, , whose value determines the density function’s location and shape.

17

Page 18: Chapter  3 Basic Concepts in Statistics and Probability

Exponential R.V.:pdf, cdf, mean and variance

The pdf of an exponential r.v. is

The cdf of an exponential r.v. is

The mean of an exponential r.v. isμx =

The variance of an exponential r.v. isσx

2 = 2.

18

(3.10)

Page 19: Chapter  3 Basic Concepts in Statistics and Probability

Exponential Probability Density Function

19

=1/

Page 20: Chapter  3 Basic Concepts in Statistics and Probability

Exponential Probabilities

Excel:EXPONDIST(x, lambda, cumulative)

Minitab:Calc Probability Distributions Exponential

20

Page 21: Chapter  3 Basic Concepts in Statistics and Probability

Example

A radioactive mass emits particles according to a Poisson process at a mean rate of 15 particles per minute. At some point, a clock is started.

1. What is the probability that more than 5 seconds will elapse before the next emission?

2. What is the mean waiting time until the next particle is emitted?

21

Page 22: Chapter  3 Basic Concepts in Statistics and Probability

Lack of Memory Property

The exponential distribution has a property known as the lack of memory property:

If T ~ Exp(1/), and t and s are positive numbers, then

P(T > t + s | T > s) = P(T > t).

22

Page 23: Chapter  3 Basic Concepts in Statistics and Probability

Example

The lifetime of a transistor in a particular circuit has an exponential distribution with mean 1.25 years.

1. Find the probability that the circuit lasts longer than 2 years.

2. Assume the transistor is now three years old and is still functioning. Find the probability that it functions for more than two additional years.

3. Compare the probability computed in 1. and 2.

23

Page 24: Chapter  3 Basic Concepts in Statistics and Probability

3.6.4 Lognormal Distribution

• For data that contain outliers, the normal distribution is generally not appropriate. The lognormal distribution, which is related to the normal distribution, is often a good choice for these data sets.

• If X ~ N(,2), then the random variable Y = eX has the lognormal distribution with parameters and 2.

• If Y has the lognormal distribution with parameters and 2, then the random variable X = lnY has the N(,2) distribution.

24

Page 25: Chapter  3 Basic Concepts in Statistics and Probability

Lognormal pdf, mean, and variance

The pdf of a lognormal random variable with parameters and 2 is

Mean:

Variance: 2/2

)( eXE

22 222)( eeYV

25

Page 26: Chapter  3 Basic Concepts in Statistics and Probability

Lognormal Probability Density Function

26

=0=1

Page 27: Chapter  3 Basic Concepts in Statistics and Probability

Lognormal Probabilities

Excel:LOGNORM.DIST(x, mean, standard_dev)

Minitab:Calc Probability Distributions Lognormal

27

Page 28: Chapter  3 Basic Concepts in Statistics and Probability

Example

When a pesticide comes into contact with the skin, a certain percentage of it is absorbed. The percentage that is absorbed during a given time period is often modeled with a lognormal distribution. Assume that for a given pesticide, the amount that is absorbed (in percent) within two hours is lognormally distributed with a mean of 1.5 and standard deviation of 0.5. Find the probability that more than 5% of the pesticide is absorbed within two hours.

28

Page 29: Chapter  3 Basic Concepts in Statistics and Probability

3.6.5 Weibull Distribution

The Weibull distribution is a continuous random variable that is used in a variety of situations. A common application of the Weibull distribution is to model the lifetimes of components. The Weibull probability density function has two parameters, both positive constants, that determine the scale and shape. We denote these parameters (scale) and (shape).

If = 1, the Weibull distribution is the same as the exponential distribution.

The case where =1 is called the standard Weibull distribution

29

Page 30: Chapter  3 Basic Concepts in Statistics and Probability

Weibull R.V.

The pdf of the Weibull distribution is

The mean of the Weibull is

30

The variance of the Weibull is

Page 31: Chapter  3 Basic Concepts in Statistics and Probability

Weibull Probability Density Function

31

=1, =5

=0.5, =1 =0.2, =5

Page 32: Chapter  3 Basic Concepts in Statistics and Probability

Weibull R.V.

The cdf of the Weibull distribution (with =1) is

32

Page 33: Chapter  3 Basic Concepts in Statistics and Probability

Weibull Probabilities

Excel:WEIBULL(x, alpha, beta, cumulative)

Minitab:Calc Probability Distributions Weibull

33

Page 34: Chapter  3 Basic Concepts in Statistics and Probability

3.6.6 Extreme Value Distribution

Extreme value distributions are often used in reliability work.

34

Page 35: Chapter  3 Basic Concepts in Statistics and Probability

Extreme Value Distribution

The pdf of the Extreme value distribution is

with a the location parameter and b the scale parameter

The cdf of the Extreme value distribution is

• The Weibull distribution is the limiting distribution of the extreme value distribution

• The natural log of a Weibull random time is an extreme value random observation.

35

Page 36: Chapter  3 Basic Concepts in Statistics and Probability

Extreme Value Distribution

36http://www.itl.nist.gov/div898/handbook/apr/section1/apr163.htm

Page 37: Chapter  3 Basic Concepts in Statistics and Probability

3.6.7 Gamma Distribution

The gamma distribution is frequently a probability model for waiting times; for instance, in life testing, the waiting time until death is a random variable that is frequently modeled with a gamma distribution.

Gamma function is defined by

37

(3.11)

Where >0 (shape) and >0 (scale)

Page 38: Chapter  3 Basic Concepts in Statistics and Probability

Gamma Distribution

• Standard gamma distribution when =1• Mean: Variance: 2

• If k is an integer, then the distribution represents an Erlang distribution.

• If = 1, the gamma distribution is the same as the exponential.

• The gamma function has the following properties:1. If r is any integer, then (r) = (r – 1)!.2. For any r, (r + 1) = r (r). 3.

38

Page 39: Chapter  3 Basic Concepts in Statistics and Probability

Gamma Probability Density Function

39

=1, =1

=3, =0.5

=5, =1

Page 40: Chapter  3 Basic Concepts in Statistics and Probability

Gamma Probabilities

Excel:GAMMA.DIST(x, alpha, beta, cumulative)GAMMA.INV(probability, alpha, beta)GAMMALN(x)

Minitab:Calc Probability Distributions Gamma

40

Page 41: Chapter  3 Basic Concepts in Statistics and Probability

• If = r/2, and =2 where r is a positive integer, the distribution is called a chi-square distribution with r degrees of freedom.

• The chi-square distribution results when independent variables with standard normal distributions are squared and summed.

• The probability density function of the chi-square distribution is

41

3.6.8 Chi-Squre Distribution

Page 42: Chapter  3 Basic Concepts in Statistics and Probability

42

Chi-Squre Distribution

Page 43: Chapter  3 Basic Concepts in Statistics and Probability

3.6.9 Truncated Normal Distribution

• Left truncated• Right truncated• Doubly truncated

43

Page 44: Chapter  3 Basic Concepts in Statistics and Probability

Truncated Normal Distribution

44

Page 45: Chapter  3 Basic Concepts in Statistics and Probability

Left Truncated Normal Distribution

• Left truncated normal distribution

Where

Where a is the truncation point, and is the cumulative area under the normal curve.

45

Page 46: Chapter  3 Basic Concepts in Statistics and Probability

Lift Truncated Normal Distribution

• Let • Expected value

Where

• Variance46

(3.12)

Page 47: Chapter  3 Basic Concepts in Statistics and Probability

Right Truncated Normal Distribution

• Right truncated normal distribution

Where

Where a’ is the truncation point, and is the cumulative area under the normal curve.

47

Page 48: Chapter  3 Basic Concepts in Statistics and Probability

Right Truncated Normal Distribution

• Let • Expected value

Where

• Variance48

Page 49: Chapter  3 Basic Concepts in Statistics and Probability

3.6.10 Bivariate and Multivariate Normal Distribution

• Normal distribution

• For 2 independent variables, x1~N(1,12),

x2~N(2,22)

-

49

2 2( ) / 21( ) , 2

xf x e x

(3.13)

Page 50: Chapter  3 Basic Concepts in Statistics and Probability

Bivariate Normal Distribution

50

Page 51: Chapter  3 Basic Concepts in Statistics and Probability

Bivariate Normal Distribution

• Bivariate Normal distribution-

• Let , , ,

51

(3.13)

Page 52: Chapter  3 Basic Concepts in Statistics and Probability

Multivariate Normal Distribution

• Let , ,

52

(3.14)

Page 53: Chapter  3 Basic Concepts in Statistics and Probability

3.6.11 F Distribution• Let W, and Y be independent 2 random variables with u and

degrees of freedom, the ratio F = (W/u)/(Y/ ) has F distribution.• Probability Density Function, with u, degrees of freedom,

• Mean

• Variance

x

xuu

xuu

xf u

uu

0

1)()2

()2

(

))(2

()(

2

122

2)2()( forxE

4)4()2(

)2(2)( 2

2

for

uuxV

Page 54: Chapter  3 Basic Concepts in Statistics and Probability

F Distribution• Table V in Appendix A.

uu f

f,,

,,11

Page 55: Chapter  3 Basic Concepts in Statistics and Probability

3.6.12 Beta Distribution

Beta function is defined by

Mean of Beta distribution: r/(r+s)Variance of Beta distribution: rs(r+s)-2(r+s+1)-1

55

Where r and s are shape parameters of Beta distribution

Page 57: Chapter  3 Basic Concepts in Statistics and Probability

3.6.12 Beta Distribution

Beta distribution and F distribution

57

Where B(; r, s) denotes the (1- ) percentile of a beta distribution with parameters r and s.

Page 58: Chapter  3 Basic Concepts in Statistics and Probability

3.6.13 Uniform Distributions

The uniform distribution has two parameters, a and b, with a < b. If X is a random variable with the continuous uniform distribution then it is uniformly distributed on the interval (a, b). We write X ~ U(a,b).

The pdf is

58

1 , ( )

0, otherwise

a x bf x b a

Page 59: Chapter  3 Basic Concepts in Statistics and Probability

Uniform Distribution: Mean and Variance

If X ~ U(a, b).

Then the mean is

and the variance is

59

2Xa bμ

22 ( ) .

12Xb aσ