Week1 GM533 Slides

Post on 05-Dec-2014

1.701 views 0 download

description

Charts for Week 1 Live Lecture for GM 533

Transcript of Week1 GM533 Slides

1

Welcome! Week 1 Live Lecture/Discussion

Applied Managerial Statistics (GM533)

Lecturer: Brent Heard

Please note that I borrowed these charts from Joni Bynum and the textbook publisher.

Thanks Joni!

I will put my touch on them (in blue) as we go along.

2

Tonight’s Agenda

• Week 1 Terminal Course Objectives (TCOs)

• Essential Questions and Problem Types• The Most Important Ideas in Statistics• Getting started with Minitab• Descriptive Statistics using Minitab• Questions?

3

Week 1 Terminal Course Objectives (TCOs)

• TCO A Descriptive Statistics: Given a managerial problem and accompanying data set, construct graphs (following principles of ethical data presentation), calculate and interpret numerical summaries appropriate for the situation. Use the graphs and numerical summaries as aids in determining a course of action relative to the problem at hand.

• TCO F Statistics Software Competency: Students should be able to perform the necessary calculations for objectives A through E using technology, whether that be a computer statistical package or the TI-83, and be able to use the output to address a problem at hand.

4

The Most Important Ideas in Statistics

• Central tendency (measures of center) and dispersion (spread)

• Quantitative (numbers) and qualitative (words and numbers with no meaning) variables

• Description and inference• One variable versus two or more

variables

5

Selected Slides from the Text Book

• The following slides from the text book are intended to complement the live demonstration and provide a bridge to Module 1

6

Population Parameters

6

A population parameter is a number calculated from all the population measurements that describes some aspect of the population (Remember “p” goes with “p”)

The population mean, denoted , is a population parameter and is the average of the population measurements (Fancy letters are used for the population)

7

Point Estimates and Sample Statistics

7

A point estimate is a one-number estimate of the value of a population parameterA sample statistic is a number calculated using sample measurements that describes some aspect of the sample (“s” goes with “s”)Use sample statistics as point

estimates of the population parameters

The sample mean, denoted x, is a sample statistic and is the average of the sample measurements (Plain letters for the sample)The sample mean is a point estimate

of the population mean

8

Measures of Central Tendency

8

Mean, The average or expected value

Median, Md The value of the middle point of the ordered measurements

Mode, Mo The most frequent value

9

The Mean

9

Population X1, X2, …, XN

m

Population Mean

N

X

N

=1ii

Sample x1, x2, …, xn

Sample Mean

x

n

x x

n

=1ii

10

The Sample Mean

10

and is a point estimate of the population mean • It is the value to expect, on average and in the long run

n

xxx

n

xx n

n

ii

...211

For a sample of size n, the sample mean is defined as

11

Example: Car Mileage Case

5554321

5

1 xxxxxx

x ii

26.315

3.156

5

1.326.311.307.318.30

x

11

Example 3.1: Sample mean for first five car mileages from Table 2.4

30.8, 31.7, 30.1, 31.6, 32.1

12

The Median

12

The population or sample median Md is a value such that 50% of all measurements, after having been arranged in numerical order, lie above (or below) it. (The median is the “center.”)

The median Md is found as follows:1. If the number of measurements is odd, the median is the middlemost measurement in the ordered values

2. If the number of measurements is even, the median is the average of the two middlemost measurements in the ordered values

13

Example: Sample Median

13

Internist’s Yearly Salaries (x$1000)

127 132 138 141 144 146 152 154 165 171 177 192 241

(Note that the values are in ascending numerical order from left to right)

Because n = 13 (odd,) then the median is the middlemost or 7th value of the ordered data, so

Md=152

• An annual salary of $180,000 is in the high end, well above the median salary of $152,000• In fact, $180,000 a very high and competitive salary

14

The Mode

14

The mode Mo of a population or sample of measurements is the measurement that occurs most frequently• Modes are the values that are observed “most typically”• Sometimes higher frequencies at two or more values

• If there are two modes, the data is bimodal• If more than two modes, the data is multimodal

• When data are in classes, the class with the highest frequency is the modal class• The tallest box in the histogram (The Tall Pole)

15

Relationships Among Mean, Medianand Mode

15

Notice tail to left

Notice tail to right

16

Central Tendency By Itself Not Enough

Knowing the measures of central tendency is not enough

Both of the distributions shown below have identical measures of central tendency

16

17

The Normal Curve

17

Symmetrical and bell-shaped curve for a normally distributed populationThe height of the normal over any point represents the relative proportion of values near that point

Example 2.4, The Car Mileages Case

18

The Empirical Rule forNormal Populations

2-18

If a population has mean m and standard deviation s and is described by a normal curve, then

68.26% of the population measurements lie within one standard deviation of the mean: [ - , + ]m s m s

95.44% of the population measurements lie within two standard deviations of the mean: [ -m 2 , +s m 2 ]s

99.73% of the population measurements lie within three standard deviations of the mean: [ -m 3 , +s m 3 ]s

19

z Scores (will be very important in our work with the Normal Distribution, beginning in Week 2 and for the entire course)

For any x in a population or sample, the associated z score is

The z score is the number of standard deviations that x is from the meanA positive z score is for x above (greater

than) the meanA negative z score is for x below (less than)

the mean

deviation standard

mean

xz

2-19

20

Measures of Variation (Spread)

20

Range

Largest minus the smallest measurement

VarianceThe average of the squared deviations of all

the population measurements from the population mean

Standard Deviation

The square root of the variance

21

The Range

21

Example:

Internist’s Salaries (in thousands of dollars)

127 132 138 141 144 146 152 154 165 171 177 192 241

Range = 241 - 127 = 114 ($114,000)

Range = largest measurement - smallest measurement

The range measures the interval spanned by all the data

22

Variance

22

and is a point estimate for s2

For a population of size N, the population variance s2 is defined as

For a sample of size n, the sample variance s2 is defined as

N

xxx

N

xN

N

ii 22

22

11

2

2

11

222

211

2

2

n

xxxxxx

n

xxs n

n

ii

23

The Standard Deviation

23

Population Standard Deviation, s: 2

Sample Standard Deviation, s: 2ss

24

Example: Population Varianceand Standard Deviation

%105

50

5

51215108

24

Population of profit margins for five big American companies:

8%, 10%, 15%, 12%, 5%

6115

58

5

25425045

52502

5

105101210151010108

22222

222222

.

%40636112 ..

25

Example: Sample Varianceand Standard Deviation

15

5

1

2

2

i

i xxs

2-25

Example 3.7: Sample variance and standard deviation for first five car mileages from Table 2.4

30.8, 31.7, 30.1, 31.6, 32.1 so = 31.26

s2 = 2.572 4 = 0.643

8019.0643.2 ss

4

26311322631631263113026317312631830 22222 ..........

x

26

Percentiles and Quartiles

26

For a set of measurements arranged in increasing order, the pth percentile is a value such that p percent of the measurements fall at or below the value and (100-p) percent of the measurements fall at or above the value

The first quartile Q1 is the 25th percentile

The second quartile (or median) Md is the 50th percentile

The third quartile Q3 is the 75th percentile

The interquartile range IQR is Q3 - Q1

27

Example: Quartiles

27

20 customer satisfaction ratings:

1 3 5 5 7 8 8 8 8 8 8 9 9 9 9 9 10 10 10 10

Md = (8+8)/2 = 8

Q1 = (7+8)/2 = 7.5 Q3 = (9+9)/2 = 9

IQR = Q3 Q1 = 9 7.5 = 1.5

28

Population and Sample Proportions

28

Population X1, X2, …, XN

p

Population Proportion

Sample x1, x2, …, xn

Sample Proportion

n

x p

n

1=ii

ˆ

p ˆ

p is the point estimate of p^

29

Example: Sample Proportion

29

Marketing Ethics Case

117 out of 205 marketing researchers disapproved of action taken in a hypothetical scenario

X = 117, number of researches who disapprove

n = 205, number of researchers surveyed

Sample Proportion: 570205

117.

n

Xp̂

30

Getting Started with Minitab

• Course Home: Minitab• Tutorial• Download• Getting help with your Minitab

installation

31

Summary of Descriptive Statistics using Minitab (concluded)

• Central tendency: mean, median, mode• Dispersion: Range, standard deviation,

interquartile range• Stem – and - leaf display• Histogram and frequency distribution

32

Essential Questions and Problem Types for the Week 1 Mastery Module

• For a given data set, use Minitab to find numbers, pictures, and tables which show the central tendency, including: the mean, median, and mode, and the skewness

• For a given data set, use Minitab to find numbers, pictures, and tables which show the variability, or dispersion, including: the range, the standard deviation the interquartile range, and the Empirical Rule

33

Closing

I will post a link to these charts where I hang out on the internet.

I call it the “Statcave.”

http://www.facebook.com/statcave

YOU DO NOT HAVE TO BE A FACEBOOK PERSON TO SEE THE LINKS. I DO IT BECAUSE IT’S FREE AND FUN.

In my spare time, I write a syndicated column (humor, life, feel goods, etc.) that appears in newspapers and magazines in the southeast. If you ever get bored, check it out at:

http://www.cranksmytractor.com

See you next week! Same Stat Time, Same Stat Channel.