1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative...

55
1 Review • Sections 2.1-2.4 • Descriptive Statistics – Qualitative (Graphical) – Quantitative (Graphical) – Summation Notation – Qualitative (Numerical) • Central Measures (mean, median, mode and modal class) • Shape of the Data

Transcript of 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative...

Page 1: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

1

Review• Sections 2.1-2.4

• Descriptive Statistics– Qualitative (Graphical)– Quantitative (Graphical)– Summation Notation– Qualitative (Numerical)

• Central Measures (mean, median, mode and modal class)

• Shape of the Data

Page 2: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

2

Review• Sections 2.1-2.4• Descriptive Statistics

– Qualitative (Graphical)– Quantitative (Graphical)– Summation Notation– Qualitative (Numerical)

• Central Measures (mean, median, mode and modal class)• Shape of the Data• Measures of Variability

Page 3: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

3

Outlier

A data measurement which is unusually large or small compared to the rest of the data.

Usually from:– Measurement or recording error– Measurement from a different population– A rare, chance event.

Page 4: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

4

Advantages/Disadvantages Mean

• Disadvantages– is sensitive to outliers

• Advantages– always exists– very common– nice mathematical properties

Page 5: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

5

Advantages/Disadvantages Median

• Disadvantages– does not take all data into account

• Advantages– always exists– easily calculated– not affected by outliers– nice mathematical properties

Page 6: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

6

Advantages/Disadvantages Mode

• Disadvantages– does not always exist, there could be just one

of each data point– sometimes more than one

• Advantages– appropriate for qualitative data

Page 7: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

7

Review

A data set is skewed if one tail of the distribution has more extreme observations than the other.

http://www.shodor.org/interactivate/activities/SkewDistribution/

Page 8: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

8

Review

Skewed to the right: The mean is bigger than the median.

xM

Page 9: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

9

Review

Skewed to the left: The mean is less than the median.

x M

Page 10: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

10

Review

When the mean and median are equal, the data is symmetric

Mx

Page 11: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

11

Numerical Measures of Variability

These measure the variability or spread of the data.

Page 12: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

12

Numerical Measures of Variability

These measure the variability or spread of the data.

Relative Frequency

0 1 3 4 52

0.3

0.4

0.5

0.2

0.1

Mx

Page 13: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

13

Numerical Measures of Variability

These measure the variability or spread of the data.

Relative Frequency

0 1 3 4 52

0.3

0.4

0.5

0.2

0.1

Mx

Page 14: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

14

Numerical Measures of Variability

These measure the variability or spread of the data.

Relative Frequency

0 1 3 4 52

0.3

0.4

0.5

0.2

0.1

6 7

Mx

Page 15: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

15

Numerical Measures of Variability

These measure the variability, spread or relative standing of the data.

– Range– Standard Deviation– Percentile Ranking– Z-score

Page 16: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

16

Range

The range of quantitative data is denoted R and is given by:

R = Maximum – Minimum

Page 17: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

17

Range

The range of quantitative data is denoted R and is given by:

R = Maximum – Minimum

In the previous examples the first two graphs have a range of 5 and the third has a range of 7.

Page 18: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

18

Range

R = Maximum – Minimum

Disadvantages: – Since the range uses only two values in the

sample it is very sensitive to outliers.– Give you no idea about how much data is in the

center of the data.

Page 19: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

19

What else?

We want a measure which shows how far away most of the data points are from the mean.

Page 20: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

20

What else?

We want a measure which shows how far away most of the data points are from the mean.

One option is to keep track of the average distance each point is from the mean.

Page 21: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

21

Mean Deviation

The Mean Deviation is a measure of dispersion which calculates the distance between each data point and the mean, and then finds the average of these distances.

n

xx

n

xx ii

sumDeviation Mean

Page 22: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

22

Mean Deviation

Advantages: The mean deviation takes into account all values in the sample.

Disadvantages: The absolute value signs are very cumbersome in mathematical equations.

Page 23: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

23

Standard Deviation

The sample variance, denoted by s², is:

1

)( s

22

n

xxi

Page 24: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

24

Standard Deviation

The sample variance, denoted by s², is:

The sample standard deviation is

The sample standard deviation is much more commonly used as a measure of variance.

.2ss

1

)( s

22

n

xxi

Page 25: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

25

Example

Let the following be data from a sample:

2, 4, 3, 2, 5, 2, 1, 4, 5, 2.

Find:

a) The range

b) The standard deviation of this sample.

Page 26: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

26

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2.

a) The range

b) The standard deviation of this sample.

2 4 3 2 5 2 1 4 5 2

x

R

ix

)( xxi 2)( xxi

Page 27: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

27

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range

b) The standard deviation of this sample.

2 4 3 2 5 2 1 4 5 2

310

30

10

2541252342

x

415R

ix

)( xxi 2)( xxi

Page 28: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

28

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range

b) The standard deviation of this sample.

2 4 3 2 5 2 1 4 5 2

-1 1 0

310

30

10

2541252342

x

415R

ix

)( xxi 2)( xxi

Page 29: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

29

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. a) The range

b) The standard deviation of this sample.

2 4 3 2 5 2 1 4 5 2

-1 1 0 -1 2 -1 -2 1 2 -1

1 1 0 1 4 1 4 1 4 1

310

30

10

2541252342

x

415R

ix

)( xxi 2)( xxi

Page 30: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

30

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. 2 4 3 2 5 2 1 4 5 2

-1 1 0 -1 2 -1 -2 1 2 -1

1 1 0 1 4 1 4 1 4 1

ix

)( xxi 2)( xxi

1

)( s

22

n

xxi

Page 31: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

31

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. 2 4 3 2 5 2 1 4 5 2

-1 1 0 -1 2 -1 -2 1 2 -1

1 1 0 1 4 1 4 1 4 1

ix

)( xxi 2)( xxi

110

1414141011

1

)( s

22

n

xxi

Page 32: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

32

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2. 2 4 3 2 5 2 1 4 5 2

-1 1 0 -1 2 -1 -2 1 2 -1

1 1 0 1 4 1 4 1 4 1

ix

)( xxi 2)( xxi

2110

1414141011

1

)( s

22

n

xxi

Page 33: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

33

Sample: 2, 4, 3, 2, 5, 2, 1, 4, 5, 2.

2110

1414141011

1

)( s

22

n

xxi

41.12 ss 2

Standard Deviation:

Page 34: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

34

More Standard DeviationThere is a “short cut” formula for finding the variance and the standard deviation

Page 35: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

35

More Standard DeviationThere is a “short cut” formula for finding the variance and the standard deviation

1 s

2

2

2

n

n

xx ii

Page 36: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

36

More Standard Deviation

Use this to find the standard deviation of the previous example:

1 s

2

2

2

n

n

xx ii

Page 37: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

37

More Standard Deviation

Use this to find the standard deviation of the previous example:

1 s

2

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2ix2ix

Page 38: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

38

More Standard Deviation

Use this to find the standard deviation of the previous example:

1 s

2

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2

4 16 9 4 25 4 1 16 25 4

ix2ix

Page 39: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

39

More Standard Deviation

Use this to find the standard deviation of the previous example:

1 s

2

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2

4 16 9 4 25 4 1 16 25 4

ix2ix

Page 40: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

40

More Standard Deviation

Use this to find the standard deviation of the previous example:

1 s

2

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2

4 16 9 4 25 4 1 16 25 4

ix2ix

30

108

Page 41: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

41

More Standard Deviation

1 s

2

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2

4 16 9 4 25 4 1 16 25 4

ix2ix

30

108

Page 42: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

42

More Standard Deviation

2

1101030

108

1 s

22

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2

4 16 9 4 25 4 1 16 25 4

ix2ix

30

108

Page 43: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

43

More Standard Deviation

2

1101030

108

1 s

22

2

2

n

n

xx ii

2 4 3 2 5 2 1 4 5 2

4 16 9 4 25 4 1 16 25 4

ix2ix

30

108

41.12 ss 2

Page 44: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

44

More Standard DeviationLike the mean, we are also interested in the population variance (i.e. your sample is the whole population) and the population standard deviation.

The population variance and standard deviation are denoted σ and σ2 respectively.

Page 45: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

45

More Standard DeviationThe population variance and standard deviation are denoted σ and σ2 respectively.

****The formula for population variance is slightly different than sample variance

nn

xx

n

xxi

ii

2

22

2 )(

2

Page 46: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

46

Example - Calculator

Find the mean, median, mode, range and standard deviation for the following sample of data:

2.3, 2.5, 2.6, 2.7, 3.0, 3.4,

3.4, 3.5, 3.5, 3.5, 3.7, 3.8

Use your calculator

Page 47: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

47

Using your Calculator

• Change calculator to statistics mode. (SD if you have it)

• Enter in the data and then press the key, or data key.

• Keep entering data by pressing the key, or data key until complete.

• To obtain the summary data, find the key for the sample mean and the s key or n-1 key to display the sample standard deviation.

x

Page 48: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

48

2.3, 2.5, 2.6, 2.7, 3.0, 3.4,3.4, 3.5, 3.5, 3.5, 3.7, 3.8

• Change calculator to statistics mode. (SD if you have it)

• Enter in the data and then press the key, or data key.

• Keep entering data by pressing the key, or data key until complete.

• To obtain the summary data, find the key for the sample mean and the s key or n-1 key to display the sample standard deviation.

x

Page 49: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

49

Example - CalculatorFind the mean, median, mode, range and standard deviation for the following sample of data:

2.3, 2.5, 2.6, 2.7, 3.0, 3.4,

3.4, 3.5, 3.5, 3.5, 3.7, 3.8

Answer:

Mode = 3.5

M = 3.4

Range = 1.5

51.0 s

16.3 x

Page 50: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

50

Example – Using Standard Deviation

Here are eight test scores from a previous Stats 201 class:

35, 59, 70, 73, 75, 81, 84, 86.

The mean and standard deviation are 70.4 and 16.7, respectively.

Page 51: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

51

Example – Using Standard Deviation

Here are eight test scores from a previous Stats 201 class:

35, 59, 70, 73, 75, 81, 84, 86.

The mean and standard deviation are 70.4 and 16.7, respectively.

We wish to know if any of are data points are outliers. That is whether they don’t fit with the general trend of the rest of the data.

Page 52: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

52

Example – Using Standard Deviation

35, 59, 70, 73, 75, 81, 84, 86.

The mean and standard deviation are 70.4 and 16.7, respectively.

We wish to know if any of are data points are outliers. That is whether they don’t fit with the general trend of the rest of the data.

To find this we calculate the number of standard deviations each point is from the mean.

Page 53: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

53

Example – Using Standard Deviation

To find this we calculate the number of standard deviations each point is from the mean.

To simplify things for now, work out which data points are within

a) one standard deviation from the mean i.e.

b) two standard deviations from the mean i.e.

c) three standard deviations from the mean i.e.

) ,( sxsx

)2 ,2( sxsx

)3 ,3( sxsx

Page 54: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

54

Example – Using Standard Deviation

Here are eight test scores from a previous Stats 201 class:

35, 59, 70, 73, 75, 81, 84, 86.

The mean and standard deviation are 70.4 and 16.7, respectively. Work out which data points are within

a) one standard deviation from the mean i.e.

b) two standard deviations from the mean i.e.

c) three standard deviations from the mean i.e.

)1.87 ,7.53()7.160.47 ,7.164.70(

)8.301 ,0.37())7.16(20.47 ),7.16(24.70(

)5.021 ,3.21())7.16(30.47 ),7.16(34.70(

Page 55: 1 Review Sections 2.1-2.4 Descriptive Statistics –Qualitative (Graphical) –Quantitative (Graphical) –Summation Notation –Qualitative (Numerical) Central.

55

Example – Using Standard Deviation

Here are eight test scores from a previous Stats 201 class:

35, 59, 70, 73, 75, 81, 84, 86.

The mean and standard deviation are 70.4 and 16.7, respectively. Work out which data points are within

a) one standard deviation from the mean i.e.

59, 70, 73, 75, 81, 84, 86

b) two standard deviations from the mean i.e.

59, 70, 73, 75, 81, 84, 86

c) three standard deviations from the mean i.e.

35, 59, 70, 73, 75, 81, 84, 86