Normal Distribution Introduction. Probability Density Functions.

of 27 /27
Normal Distribution Introduction

Embed Size (px)

Transcript of Normal Distribution Introduction. Probability Density Functions.

PowerPoint Presentation

Normal DistributionIntroductionProbability Density Functions8.3Probability Density FunctionsUnlike a discrete random variable which we studied in Chapter 7, a continuous random variable is one that can assume an uncountable number of values. We cannot list the possible values because there is an infinite number of them. Because there is an infinite number of values, the probability of each individual value is virtually 0.8.4Point Probabilities are ZeroBecause there is an infinite number of values, the probability of each individual value is virtually 0.

Thus, we can determine the probability of a range of values only.

E.g. with a discrete random variable like tossing a die, it is meaningful to talk about P(X=5), say.In a continuous setting (e.g. with time as a random variable), the probability the random variable of interest, say task length, takes exactly 5 minutes is infinitesimally small, hence P(X=5) = 0.It is meaningful to talk about P(X 5).8.5Probability Density FunctionA function f(x) is called a probability density function (over the range a x b if it meets the following requirements:

f(x) 0 for all x between a and b, and

The total area under the curve between a and b is 1.0f(x)xbaarea=18.6

8.7Uniform DistributionConsider the uniform probability distribution (sometimes called the rectangular probability distribution).It is described by the function:

f(x)xbaarea = width x height = (b a) x = 1

8.8ExampleThe amount of petrol sold daily at a service station is uniformly distributed with a minimum of 2,000 litres and a maximum of 5,000 litres.

What is the probability that the service station will sell at least 4,000 litres?Algebraically: what is P(X 4,000) ?P(X 4,000) = (5,000 4,000) x (1/3000) = .3333f(x)x5,0002,000

Bin width 25

Bin width 5

Bin width 1

Conditions for use of the Normal DistributionThe data must be continuous (or we can use a continuity correction to approximate the Normal)

The parameters must be established from a large number of trials8.14The Normal DistributionThe normal distribution is the most important of all probability distributions. The probability density function of a normal random variable is given by:

It looks like this:Bell shaped,Symmetrical around the mean

8.15The Normal DistributionImportant things to note:

The normal distribution is fully defined by two parameters:its standard deviation and meanUnlike the range of the uniform distribution (a x b)Normal distributions range from minus infinity to plus infinityThe normal distribution is bell shaped andsymmetrical about the mean8.16

Standard Normal DistributionA normal distribution whose mean is zero and standard deviation is one is called the standard normal distribution.

Any normal distribution can be converted to a standard normal distribution with simple algebra. This makes calculations much easier.

0118.17Normal DistributionIncreasing the mean shifts the curve to the right

8.18Normal DistributionIncreasing the standard deviation flattens the curve

8.19Calculating Normal ProbabilitiesExample: The time required to build a computer is normally distributed with a mean of 50 minutes and a standard deviation of 10 minutes:

What is the probability that a computer is assembled in a time between 45 and 60 minutes?

Algebraically speaking, what is P(45 < X < 60) ?

08.20Calculating Normal ProbabilitiesP(45 < X < 60) ?

0

mean of 50 minutes and astandard deviation of 10 minutesTripthi M. Mathew, MD, MPHDistinguishing Features The mean 1 standard deviation covers 66.7% of the area under the curve

The mean 2 standard deviation covers 95% of the area under the curve

The mean 3 standard deviation covers 99.7% of the area under the curve2168-95-99.7 Rule

68% of the data95% of the data99.7% of the data22SAY: within 1 standard deviation either way of the mean

within 2 standard deviations of the mean

within 3 standard deviations either way of the mean

WORKS FOR ALL NORMAL CURVES NO MATTER HOW SKINNY OR FATAre my data normal?Not all continuous random variables are normally distributed!!It is important to evaluate how well the data are approximated by a normal distribution23Are my data normally distributed?Look at the histogram! Does it appear bell shaped?Compute descriptive summary measuresare mean, median, and mode similar?Do 2/3 of observations lie within 1 std dev of the mean? Do 95% of observations lie within 2 std dev of the mean?24June 5, 2008Stat 111 - Lecture 7 - Normal Distribution 25Law of Large NumbersRest of course will be about using data statistics (x and s2) to estimate parameters of random variables ( and 2)Law of Large Numbers: as the size of our data sample increases, the mean x of the observed data variable approaches the mean of the populationIf our sample is large enough, we can be confident that our sample mean is a good estimate of the population mean!Points of note:Total area = 1Only have a probability from widthFor an infinite number of z scores each point has a probability of 0 (for the single point)Typically negative values are not reportedSymmetrical, therefore area below negative value = Area above its positive valueAlways draw a sketch!