CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative...

11
The Practice of Statistics, 5th Edition Starnes, Tabor, Yates, Moore Bedford Freeman Worth Publishers CHAPTER 1 Exploring Data 1.2 Displaying Quantitative Data with Graphs

Transcript of CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative...

Page 1: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition

Starnes, Tabor, Yates, Moore

Bedford Freeman Worth Publishers

CHAPTER 1Exploring Data

1.2Displaying Quantitative Data with Graphs

Page 2: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

Learning Objectives

After this section, you should be able to:

The Practice of Statistics, 5th Edition 3

✓ MAKE and INTERPRET dotplots and stemplots of quantitative data

✓ DESCRIBE the overall pattern of a distribution and IDENTIFY any

outliers

✓ IDENTIFY the shape of a distribution

✓ MAKE and INTERPRET histograms of quantitative data

✓ COMPARE distributions of quantitative data

Displaying Quantitative Data with Graphs

Page 3: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 4

Dotplots

One of the simplest graphs to construct and interpret is a dotplot. Each

data value is shown as a dot above its location on a number line.

How to make a dotplot:

1)Draw a horizontal axis (a number line) and label it with

the variable name.

2)Scale the axis from the minimum to the maximum value.

3)Mark a dot above the location on the horizontal axis

corresponding to each data value.

Number of Goals Scored Per Game by the 2012 US Women’s Soccer Team

2 1 5 2 0 3 1 4 1 2 4 13 1

3 4 3 4 14 4 3 3 4 2 2 4

Page 4: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 5

The purpose of a graph is to help us understand the data. After you

make a graph, always ask, “What do I see?”

Examining the Distribution of a Quantitative Variable

How to Examine the Distribution of a Quantitative Variable

1)In any graph, look for the overall pattern and for striking

departures from that pattern.

2)Describe the overall pattern of a distribution by its:

• Shape

• Center

• Spread

3)Note individual values that fall outside the overall pattern.

These departures are called outliers.

Don’t forget your SOCS!

Page 5: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 6

Describing Shape

When you describe a distribution’s shape, concentrate on the main

features. Look for rough symmetry or clear skewness.

A distribution is roughly symmetric if the right and left sides of the

graph are approximately mirror images of each other.

A distribution is skewed to the right (right-skewed) if the right side of

the graph (containing the half of the observations with larger values) is

much longer than the left side.

It is skewed to the left (left-skewed) if the left side of the graph is

much longer than the right side.

Symmetric Skewed-left Skewed-right

Page 6: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 7

Frozen Pizza

Here are the number of calories per serving for 16 brands of frozen cheese

pizza, along with a dotplot of the data.

340 340 310 320 310 360 350 330

260 380 340 320 360 290 320 330

Describe the shape, center, and spread of the distribution. Are there any

outliers?

Shape: There are peaks at 320 and 340 calories and a main cluster of values

between 310 and 360 calories. The distribution is slightly left-skewed.

Center: The middle value is 330 calories.

Spread: The values vary from 260 calories to 380 calories.

Outliers: There is one pizza with an unusually small number of calories (260).

Calories

260 280 300 320 340 360 380 400

Frozen Pizza Dot Plot

Page 7: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 8

Comparing Distributions

Some of the most interesting statistics questions involve comparing two

or more groups.

Always discuss shape, center, spread, and possible outliers whenever

you compare distributions of a quantitative variable.

Shape: The distribution of household

size for the UK sample is roughly

symmetric and unimodal, while the

distribution for the South Africa sample

is skewed to the right and unimodal.

Center: Household sizes for the South

African students tended to be larger

than for the UK students. The middle

values of the household sizes for the

two groups are 6 people and 4 people,

respectively.

Page 8: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 9

Comparing Distributions

Some of the most interesting statistics questions involve comparing two

or more groups.

Always discuss shape, center, spread, and possible outliers whenever

you compare distributions of a quantitative variable.

Spread; The household sizes for the

South African students vary more

(from 3 to 26 people) than for the UK

students (from 2 to 6 people).

Outliers: There don’t appear to be any

outliers in the UK distribution. The

South African distribution seems to

have two outliers in the right tail of the

distribution – students who reported

living in households with 15 and 26

people.

Page 9: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 10

Energy Cost -- Top vs. Bottom FreezersHow do the annual energy costs (in dollars) compare for refrigerators with top

freezers and refrigerators with bottom freezers? The data below are from the

May 2010 issue of Consumer Reports.

Shape: The distribution for bottom freezers looks skewed to the right and

possibly bimodal, with modes near $58 and $70 per year. The distribution for

top freezers looks roughly symmetric, with its main peak centered near $55.

Center: The typical energy cost for the bottom freezers is greater than the

typical cost for the top freezers (middle value of $69 vs. middle value of $56).

Spread: There is much more variability in the energy costs for bottom

freezers.

Outliers: There are a couple of bottom freezers with unusually high energy

costs (over $140 per year). There are no outliers for the top freezers.

EnergyCost

Ty

pe

14012611298847056

bottom

top

Dotplot of EnergyCost vs Type

EnergyCost

Ty

pe

14012611298847056

bottom

top

Dotplot of EnergyCost vs Type

Page 10: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 11

Exercise 40: Fuel Efficiency

In an earlier example, we examined data on highway gas mileages of

model year 2012 midsize cars. The following dotplot shows the

difference (highway vs city) in EPA mileage ratings for each of the 24

car models from the earlier example.

(a) Explain what the dot above 12 represents.

(b) Describe the distribution of differences between gas mileage

ratings. Identify any major departures.

-2 0 2 4 6 8 10 12

Difference in gas mileage (highway – city)

This dot represents a car that got 12 mpg more on the highway

than it did in city driving.

Shape: This is a unimodal graph that is skewed to the left. Center: The center is at the single

peak of 9. Spread: The values vary from -3 to 12. Outliers: There is a large gap between -3 and

all other data points. This is most likely an outlier.

Page 11: CHAPTER 1 Exploring Data€¦ · COMPARE distributions of quantitative data Displaying Quantitative Data with Graphs. The Practice of Statistics, 5th Edition 4 Dotplots One of the

The Practice of Statistics, 5th Edition 12

PAGE 41

38, 44Homework