Mat 255 chapter 3 notes

56
MAT 255 Business Statistics I Chapter 3 Describing Data: Numerical Measures Text: Lind, Statistical Techniques in Business and Economics, 15e

Transcript of Mat 255 chapter 3 notes

Page 1: Mat 255 chapter 3 notes

MAT 255 Business Statistics IChapter 3

Describing Data: Numerical Measures

Text: Lind, Statistical Techniques in Business and Economics, 15e

Page 2: Mat 255 chapter 3 notes

Chapter 3 Objectives

Objective Section

Explain the concept of central tendency. 3.1

Identify and compute the arithmetic mean. 3.2, 3.3, 3.4

Compute and interpret the weighted mean. 3.5

Determine the median. 3.6

Identify the mode. 3.7

Describe the shape of a distribution based on the positions of the mean, median, and mode.

3.9

Calculate the geometric mean. 3.10

Explain and apply measures of dispersion. 3.11, 3.12

Explain Chebyshev’s Theorem and the empirical rule. 3.14

Compute the mean and standard deviation of grouped data. 3.15

Page 3: Mat 255 chapter 3 notes

What should you do?Learning Activities Details

Read Chapter 3. Pages 22 - 46

View the MAT 255 Chapter 3 Video Lecture Playlist and take notes as if you were in a lecture class.

There are videos on the following topics:• Measures of Location

Overview• Page 62, exercise 8• Page 62, exercise 10• Page 64, exercise 16• Page 67, exercise 20• The Relative Positions of the

Mean, Median, and Mode

• Page 73, exercise 28• Page 74, exercise 32• Measures of Dispersion Overview• Page 77, example• Page 79, exercise 38• Page 82, exercise 46• Page 85, exercise 50• Page 87, exercise 54• Page 87, exercise 56• Page 91, exercise 58

Complete exercise 1 – 11 odd, 15 – 29 odd, 35, 39, 41, 47 – 55 odd 59, 61, 65, 69, 73, 75, 85.

Complete homework for self-assessment purposes. The answers to odd questions are found in the back of the book.

Complete the Chapter 3 Practice Worksheet and view the worked out solutions.

The practice worksheet is for self-assessment purposes. You can view the worked out solutions to the worksheet after you complete it.

Post a relevant comment, insight or question to the MAT 255-4H1 Fall 2014 Google+ Community.

The Google+ Community allows you to communicate with classmates in an online open forum.

Complete MAT 255 Chapter 3 Survey. Communicate to the instructor the areas where you still need help.

Page 4: Mat 255 chapter 3 notes

Measures of Location Overview

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 5: Mat 255 chapter 3 notes

• The purpose of a measure of location is to pinpoint the center of a distribution of data.

• In addition to measures of locations, we should consider the dispersion – often called the variation or spread – in the data.

• Five measures of location:

1. The arithmetic mean

2. The weighted mean

3. The median

4. The mode

5. The geometric mean

Page 6: Mat 255 chapter 3 notes

Population Mean: 𝝁 = 𝑿

𝑵

• 𝜇 represents the population mean. It is the Greek lower case letter “mu.”

• 𝑁 is the number of values in the population.

• 𝑋 represents any particular value.

• is the Greek capital letter “sigma” and indicates the operation of adding.

• 𝑋 is the sum of the 𝑋 values in a population.

A parameter is a characteristic of a population.

Page 7: Mat 255 chapter 3 notes

Sample Mean: 𝑿 = 𝑿

𝒏

• 𝑋 represents the sample mean. It is read “X bar.”

• 𝑛 is the number of values in the sample.

• 𝑋 represents any particular value.

• is the Greek capital letter “sigma” and indicates the operation of adding.

• 𝑋 is the sum of the 𝑋 values in a population.

A statistic is a characteristic of a sample.

Page 8: Mat 255 chapter 3 notes

Important properties of the arithmetic mean:

1. Every set of interval- or ratio- level has a mean.2. All the values are included in computing the mean.3. The mean is unique.4. The sum of the deviations of each value from the mean is

0.

𝑋− 𝑋 = 0

One disadvantage: if one or two values are either extremely high or extremely low compared to the majority of the data, then the mean might not be an appropriate average to represent the data.

Page 9: Mat 255 chapter 3 notes

Weighted Mean:

𝑿𝒘 =𝒘𝟏𝑿𝟏+𝒘𝟐𝑿𝟐+𝒘𝟑𝑿𝟑+⋯+𝒘𝒏𝑿𝒏

𝒘𝟏+𝒘𝟐+𝒘𝟑+⋯𝒘𝒏

Pronounced “X bar sub w”

Or

𝑿𝒘 = 𝒘𝑿

𝒘

Page 10: Mat 255 chapter 3 notes

Median: The midpoint of the values after they have been ordered from the smallest to the largest, or the largest to the smallest.

The median is not affected by extremely large of small values.

The median can be computed for ordinal-level data or higher.

Page 11: Mat 255 chapter 3 notes

Mode: The value of the observation that appears most frequently.

The mode is especially useful in summarizing nominal-level data.

The mode can be determined for all levels of data –nominal, ordinal, interval, and ratio. The mode is not affected by extremely high or low values. For many data sets, though, there is no mode, causing it to be used less often than the mean and median.

There can be one mode, multiple modes, or no mode for a set of dat.

Page 12: Mat 255 chapter 3 notes

Page 62, exercise 8

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 13: Mat 255 chapter 3 notes

Page 62, exercise 8:

The accounting department at a mail-order company counted the following numbers of incoming calls per day to the company’s toll-free number during the first 7 days in May:

14, 24, 19, 31, 36, 26, 17.

a. Compute the arithmetic mean.

b. Indicate whether it is a statistic or a parameter.

Page 14: Mat 255 chapter 3 notes

Page 62, exercise 10

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 15: Mat 255 chapter 3 notes

Page 62, exercise 10:

The Human Relations Director at Ford began a study of the overtime hours in the Inspection Department. A sample of 15 showed they worked the following number of overtime hours last month:

13, 13, 12, 15, 7, 15, 5, 12, 6, 7, 12, 10, 9, 13, 12

a. Compute the arithmetic mean.

b. Indicate whether it is a statistic or a parameter.

Page 16: Mat 255 chapter 3 notes

Page 64, exercise 16

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 17: Mat 255 chapter 3 notes

Page 64, exercise 16:

Andrews and Associates specializes in corporate law. They charge $100 an hour for researching a case, $75 an hour for consultations, and $200 an hour for writing a brief. Last week one of the associates spent 10 hours consulting with her client, 10 hours researching the case, and 20 hours writing the brief. What was the weighted mean hourly charge for her legal services?

Page 18: Mat 255 chapter 3 notes

Page 67, exercise 20

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 19: Mat 255 chapter 3 notes

Page 67, exercise 20:

The following are the ages of the 10 people in the video arcade at the Southwyck Shopping Mall at 10 A.M.

12, 8, 17, 6, 11, 14, 8, 17, 10, 8

Determine the median age. Determine the mode.

Page 20: Mat 255 chapter 3 notes

The Relative Positions of the Mean, Median, and Mode

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 21: Mat 255 chapter 3 notes

The Relative Positions of the Mean, Median, and Mode

A histogram is a graphical display of a frequency distribution for quantitative data. That distribution can take various shapes. Here we will discuss characteristics for a symmetric distribution, a positively skewed distribution, and a negatively skewed distribution.

Page 22: Mat 255 chapter 3 notes

A Symmetric Distribution

0

0.5

1

1.5

2

2.5

3

3.5

4

4.5

0 up to 5 5 up to 10 10 up to 15 15 up to 20 20 up to 25 25 up to 30 30 up to 35

Page 23: Mat 255 chapter 3 notes

A Positively Skewed Distribution

0

1

2

3

4

5

6

7

8

9

0 up to 5 5 up to 10 10 up to 15 15 up to 20 20 up to 25 25 up to 30 30 up to 35

Page 24: Mat 255 chapter 3 notes

A Negatively Skewed Distribution

0

1

2

3

4

5

6

7

8

9

0 up to 5 5 up to 10 10 up to 15 15 up to 20 20 up to 25 25 up to 30 30 up to 35

Page 25: Mat 255 chapter 3 notes

Page 73, exercise 28

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 26: Mat 255 chapter 3 notes

Page 73, exercise 28:

Compute the geometric mean of the following percent increases: 2, 8, 6, 4, 10, 6, 8

Page 27: Mat 255 chapter 3 notes

Geometric Mean: 𝐺𝑀 =𝑛𝑋1 𝑋2 ⋯ 𝑋𝑛

The geometric mean is useful in finding the average change of percentages, ratios, indexes, or growth rates over time. The geometric mean will always be less than or equal to the arithmetic mean.

Page 28: Mat 255 chapter 3 notes

Page 74, exercise 32

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 29: Mat 255 chapter 3 notes

Page 74, exercise 32:

JetBlue Airways is an American low-cost airline headquartered in New York City. Its main base is John F. Kennedy International Airport. JetBlue’s revenue in 2002 was $635.2 million. By 2009, revenue had increased to $3,290.0 million. What was the geometric mean annual increase for the period?

Page 30: Mat 255 chapter 3 notes

Rate of Increase Over Time:

𝐺𝑀 =𝑛 Value at end of period

Value at start of period− 1

Page 31: Mat 255 chapter 3 notes

Measures of Dispersion Overview

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 32: Mat 255 chapter 3 notes

A measure of location only describes the center of data. You would also want to know about the variation (or dispersion) of the data as well in order to have a more complete picture.

A small value for a measure of dispersion indicates that the data are clustered closely. The mean is therefore considered representative of the data. Conversely, a large measure of dispersion indicates that he mean is not reliable.

Comparing the measures of dispersions of multiple distributions is also helpful.

Page 33: Mat 255 chapter 3 notes

Range: The difference between the largest and the smallest values in a data set.

Range = Largest Value – Smallest Value

Page 34: Mat 255 chapter 3 notes

Mean Deviation: The arithmetic mean of the absolute values of the deviations from the arithmetic mean. It measures the mean amount by which the values in a population, or sample vary from their mean.

Mean Deviation: 𝑴𝑫 = 𝑿− 𝑿

𝒏

Page 35: Mat 255 chapter 3 notes

Variance: The arithmetic mean of the squared deviations from the mean.

Population Variance: 𝝈𝟐 = 𝑿−𝝁 𝟐

𝑵

Read as “sigma squared”

Standard Deviation: The square root of the variance.

Population Standard Deviation: 𝛔 = 𝑿−𝝁 𝟐

𝑵

Page 36: Mat 255 chapter 3 notes

Sample Variance: 𝒔𝟐 = 𝑿− 𝑿 𝟐

𝒏−𝟏

Sample Standard Deviation: 𝒔 = 𝑿− 𝑿 𝟐

𝒏−𝟏

Although the use of 𝑛 is logical since 𝑋 is used to estimate 𝜇, it tends to underestimate the population variance, 𝜎2. The use of 𝑛 − 1 in the denominator provides the appropriate correction for this tendency. Because the primary use of the sample statistic 𝑠2 is to estimate population parameters like 𝜎2, 𝑛 − 1 is preferred to 𝑛 in defining the sample variance. This convention is also used when computing the sample standard deviation.

Page 37: Mat 255 chapter 3 notes

The variance and standard deviation are also based on the deviations from the mean. However, instead of using the absolute value of the deviations, the variance and the standard deviation square the deviations.

Page 38: Mat 255 chapter 3 notes

Page 77, example

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 39: Mat 255 chapter 3 notes

Page 77, example:

The chart below shows the number of cappuccinos sold at Starbucks in the Orange County airport and the Ontario, California, airport between 4 and 5 P.M. for a sample of five days last month. Determine the mean, median, range, and mean deviation for each location. Comment on the similarities and differences in these measures.

Orange County

Ontario

20 20

40 49

50 50

60 51

80 80

Page 40: Mat 255 chapter 3 notes

Page 79, exercise 38

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 41: Mat 255 chapter 3 notes

Page 79, exercise 38:

A sample of eight companies in the aerospace industry was surveyed as to their return on investment last year. The results are (in percent):

10.6, 12.6, 14.8, 18.2, 12.0, 14.8, 12.2, and 15.6

Calculate the range, arithmetic mean, mean deviation, and interpret the values.

Page 42: Mat 255 chapter 3 notes

Page 82, exercise 46

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 43: Mat 255 chapter 3 notes

Page 82, exercise 46:

The annual incomes of the five vice presidents of TMV industries are $125,000; $128,000; $122,000; $133,000; and $140,000. Consider this population.a. What is the range?b. What is the arithmetic mean?c. What is the population variance? The standard

deviation?d. The annual incomes of officers of another firm

similar to TMV industries were also studied. The mean was $129,000 and the standard deviation $8,612. Compare the means and dispersions in the two firms.

Page 44: Mat 255 chapter 3 notes

Page 85, exercise 50

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 45: Mat 255 chapter 3 notes

Page 85, exercise 50:

Compute the sample variance and the sample standard deviation.

The sample of eight companies in the aerospace industry was surveyed as to their return on investment last year. The results are: 10.6, 12.6, 14.8, 18.2, 12.0, 14.8, 12.2, and 15.6

Page 46: Mat 255 chapter 3 notes

Page 87, exercise 54

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 47: Mat 255 chapter 3 notes

Page 87, exercise 54:

The mean income of a group of sample observations is $500; the standard deviation is $40. According to Chebyshev’s theorem, at least what percent of incomes will lie between $400 and $600.

Page 48: Mat 255 chapter 3 notes

Chebyshev’s Theorem: For any set of observations (sample or population), the proportion of the values that lie within kstandard deviations of the mean is at least 𝟏 −𝟏

𝒌𝟐, where k is any constant greater than 1.

Page 49: Mat 255 chapter 3 notes

Page 87, exercise 56

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 50: Mat 255 chapter 3 notes

Page 87, exercise 56:

The distribution of a sample of the number of drinks sold per day at a nearby Wendy’s is symmetric and bell-shaped. The mean number of drinks sold per day is 91.9 with a standard deviation of 4.67. Using the empirical rule, sales will be between what two values on 68 percent of the days? Sales will be between what two values on 95 percent of the days?

Page 51: Mat 255 chapter 3 notes

Empirical Rule: For a symmetrical, bell-shaped frequency distribution, approximately 68 percent of the observations will lie within plus and minus one standard deviation of the mean; about 95 percent of the observations will lie within plus and minus two standard deviations of the mean; and 99.7 percent will lie within plus or minus three standard deviations of the mean.

Page 52: Mat 255 chapter 3 notes

(Chart 3-7, page 86)

Page 53: Mat 255 chapter 3 notes

Page 91, exercise 58

Text: Lind, Statistical Techniques in Business and Economics, 15e

Delaware Technical Community College

Page 54: Mat 255 chapter 3 notes

Page 91, exercise 58:

Determine the mean and standard deviation of the following frequency distribution.

Class Frequency

0 up to 5 2

5 up to 10 7

10 up to 15 12

15 up to 20 6

20 up to 25 3

Page 55: Mat 255 chapter 3 notes

Arithmetic Mean of Grouped Data:

𝑿 = 𝒇𝑴

𝒏

Page 56: Mat 255 chapter 3 notes

Standard Deviation, Grouped Data:

𝒔 = 𝒇(𝑴− 𝑿)𝟐

𝒏 − 𝟏