Week1 GM533 Slides
-
Upload
brent-heard -
Category
Education
-
view
1.701 -
download
0
description
Transcript of Week1 GM533 Slides
1
Welcome! Week 1 Live Lecture/Discussion
Applied Managerial Statistics (GM533)
Lecturer: Brent Heard
Please note that I borrowed these charts from Joni Bynum and the textbook publisher.
Thanks Joni!
I will put my touch on them (in blue) as we go along.
2
Tonight’s Agenda
• Week 1 Terminal Course Objectives (TCOs)
• Essential Questions and Problem Types• The Most Important Ideas in Statistics• Getting started with Minitab• Descriptive Statistics using Minitab• Questions?
3
Week 1 Terminal Course Objectives (TCOs)
• TCO A Descriptive Statistics: Given a managerial problem and accompanying data set, construct graphs (following principles of ethical data presentation), calculate and interpret numerical summaries appropriate for the situation. Use the graphs and numerical summaries as aids in determining a course of action relative to the problem at hand.
• TCO F Statistics Software Competency: Students should be able to perform the necessary calculations for objectives A through E using technology, whether that be a computer statistical package or the TI-83, and be able to use the output to address a problem at hand.
4
The Most Important Ideas in Statistics
• Central tendency (measures of center) and dispersion (spread)
• Quantitative (numbers) and qualitative (words and numbers with no meaning) variables
• Description and inference• One variable versus two or more
variables
5
Selected Slides from the Text Book
• The following slides from the text book are intended to complement the live demonstration and provide a bridge to Module 1
6
Population Parameters
6
A population parameter is a number calculated from all the population measurements that describes some aspect of the population (Remember “p” goes with “p”)
The population mean, denoted , is a population parameter and is the average of the population measurements (Fancy letters are used for the population)
7
Point Estimates and Sample Statistics
7
A point estimate is a one-number estimate of the value of a population parameterA sample statistic is a number calculated using sample measurements that describes some aspect of the sample (“s” goes with “s”)Use sample statistics as point
estimates of the population parameters
The sample mean, denoted x, is a sample statistic and is the average of the sample measurements (Plain letters for the sample)The sample mean is a point estimate
of the population mean
8
Measures of Central Tendency
8
Mean, The average or expected value
Median, Md The value of the middle point of the ordered measurements
Mode, Mo The most frequent value
9
The Mean
9
Population X1, X2, …, XN
m
Population Mean
N
X
N
=1ii
Sample x1, x2, …, xn
Sample Mean
x
n
x x
n
=1ii
10
The Sample Mean
10
and is a point estimate of the population mean • It is the value to expect, on average and in the long run
n
xxx
n
xx n
n
ii
...211
For a sample of size n, the sample mean is defined as
11
Example: Car Mileage Case
5554321
5
1 xxxxxx
x ii
26.315
3.156
5
1.326.311.307.318.30
x
11
Example 3.1: Sample mean for first five car mileages from Table 2.4
30.8, 31.7, 30.1, 31.6, 32.1
12
The Median
12
The population or sample median Md is a value such that 50% of all measurements, after having been arranged in numerical order, lie above (or below) it. (The median is the “center.”)
The median Md is found as follows:1. If the number of measurements is odd, the median is the middlemost measurement in the ordered values
2. If the number of measurements is even, the median is the average of the two middlemost measurements in the ordered values
13
Example: Sample Median
13
Internist’s Yearly Salaries (x$1000)
127 132 138 141 144 146 152 154 165 171 177 192 241
(Note that the values are in ascending numerical order from left to right)
Because n = 13 (odd,) then the median is the middlemost or 7th value of the ordered data, so
Md=152
• An annual salary of $180,000 is in the high end, well above the median salary of $152,000• In fact, $180,000 a very high and competitive salary
14
The Mode
14
The mode Mo of a population or sample of measurements is the measurement that occurs most frequently• Modes are the values that are observed “most typically”• Sometimes higher frequencies at two or more values
• If there are two modes, the data is bimodal• If more than two modes, the data is multimodal
• When data are in classes, the class with the highest frequency is the modal class• The tallest box in the histogram (The Tall Pole)
15
Relationships Among Mean, Medianand Mode
15
Notice tail to left
Notice tail to right
16
Central Tendency By Itself Not Enough
Knowing the measures of central tendency is not enough
Both of the distributions shown below have identical measures of central tendency
16
17
The Normal Curve
17
Symmetrical and bell-shaped curve for a normally distributed populationThe height of the normal over any point represents the relative proportion of values near that point
Example 2.4, The Car Mileages Case
18
The Empirical Rule forNormal Populations
2-18
If a population has mean m and standard deviation s and is described by a normal curve, then
68.26% of the population measurements lie within one standard deviation of the mean: [ - , + ]m s m s
95.44% of the population measurements lie within two standard deviations of the mean: [ -m 2 , +s m 2 ]s
99.73% of the population measurements lie within three standard deviations of the mean: [ -m 3 , +s m 3 ]s
19
z Scores (will be very important in our work with the Normal Distribution, beginning in Week 2 and for the entire course)
For any x in a population or sample, the associated z score is
The z score is the number of standard deviations that x is from the meanA positive z score is for x above (greater
than) the meanA negative z score is for x below (less than)
the mean
deviation standard
mean
xz
2-19
20
Measures of Variation (Spread)
20
Range
Largest minus the smallest measurement
VarianceThe average of the squared deviations of all
the population measurements from the population mean
Standard Deviation
The square root of the variance
21
The Range
21
Example:
Internist’s Salaries (in thousands of dollars)
127 132 138 141 144 146 152 154 165 171 177 192 241
Range = 241 - 127 = 114 ($114,000)
Range = largest measurement - smallest measurement
The range measures the interval spanned by all the data
22
Variance
22
and is a point estimate for s2
For a population of size N, the population variance s2 is defined as
For a sample of size n, the sample variance s2 is defined as
N
xxx
N
xN
N
ii 22
22
11
2
2
11
222
211
2
2
n
xxxxxx
n
xxs n
n
ii
23
The Standard Deviation
23
Population Standard Deviation, s: 2
Sample Standard Deviation, s: 2ss
24
Example: Population Varianceand Standard Deviation
%105
50
5
51215108
24
Population of profit margins for five big American companies:
8%, 10%, 15%, 12%, 5%
6115
58
5
25425045
52502
5
105101210151010108
22222
222222
.
%40636112 ..
25
Example: Sample Varianceand Standard Deviation
15
5
1
2
2
i
i xxs
2-25
Example 3.7: Sample variance and standard deviation for first five car mileages from Table 2.4
30.8, 31.7, 30.1, 31.6, 32.1 so = 31.26
s2 = 2.572 4 = 0.643
8019.0643.2 ss
4
26311322631631263113026317312631830 22222 ..........
x
26
Percentiles and Quartiles
26
For a set of measurements arranged in increasing order, the pth percentile is a value such that p percent of the measurements fall at or below the value and (100-p) percent of the measurements fall at or above the value
The first quartile Q1 is the 25th percentile
The second quartile (or median) Md is the 50th percentile
The third quartile Q3 is the 75th percentile
The interquartile range IQR is Q3 - Q1
27
Example: Quartiles
27
20 customer satisfaction ratings:
1 3 5 5 7 8 8 8 8 8 8 9 9 9 9 9 10 10 10 10
Md = (8+8)/2 = 8
Q1 = (7+8)/2 = 7.5 Q3 = (9+9)/2 = 9
IQR = Q3 Q1 = 9 7.5 = 1.5
28
Population and Sample Proportions
28
Population X1, X2, …, XN
p
Population Proportion
Sample x1, x2, …, xn
Sample Proportion
n
x p
n
1=ii
ˆ
p ˆ
p is the point estimate of p^
29
Example: Sample Proportion
29
Marketing Ethics Case
117 out of 205 marketing researchers disapproved of action taken in a hypothetical scenario
X = 117, number of researches who disapprove
n = 205, number of researchers surveyed
Sample Proportion: 570205
117.
n
Xp̂
30
Getting Started with Minitab
• Course Home: Minitab• Tutorial• Download• Getting help with your Minitab
installation
31
Summary of Descriptive Statistics using Minitab (concluded)
• Central tendency: mean, median, mode• Dispersion: Range, standard deviation,
interquartile range• Stem – and - leaf display• Histogram and frequency distribution
32
Essential Questions and Problem Types for the Week 1 Mastery Module
• For a given data set, use Minitab to find numbers, pictures, and tables which show the central tendency, including: the mean, median, and mode, and the skewness
• For a given data set, use Minitab to find numbers, pictures, and tables which show the variability, or dispersion, including: the range, the standard deviation the interquartile range, and the Empirical Rule
33
Closing
I will post a link to these charts where I hang out on the internet.
I call it the “Statcave.”
http://www.facebook.com/statcave
YOU DO NOT HAVE TO BE A FACEBOOK PERSON TO SEE THE LINKS. I DO IT BECAUSE IT’S FREE AND FUN.
In my spare time, I write a syndicated column (humor, life, feel goods, etc.) that appears in newspapers and magazines in the southeast. If you ever get bored, check it out at:
http://www.cranksmytractor.com
See you next week! Same Stat Time, Same Stat Channel.