Statistics

Post on 18-Jan-2015

852 views 4 download

Tags:

description

 

Transcript of Statistics

S TAT I S T I C S

CHAPTER 6

is the science of the collection, organization, and interpretation of data. It deals with all aspects of this, including the planning of data collection in terms of the design of surveys and experiments.

STATISTICS

Class intervals

a set of numerical data that is grouped into several classes and the range of each class is known as class interval

There should be between 5 and 20 classes. The class width should be an odd number. This will

guarantee that the class midpoints are integers instead of decimals.

The classes must be mutually exclusive. This means that no data value can fall into two different classes

The classes must be all inclusive. This means that all data values must be included.

The classes must be continuous. There are no gaps in a frequency distribution. Classes that have no values in them must be included (unless it's the first or last class which are dropped).

The classes must be equal in width.

Class intervals

Find the largest and smallest values Compute the Range = Maximum - Minimum Select the number of classes desired. This is usually

between 5 and 20. Find the class width by dividing the range by the number

of classes and rounding up. Pick a suitable starting point less than or equal to the

minimum value.

STATISTICS

The data below shows the marks obtained by 40 students in a monthly test.

99 88 75 92 58 75 80 70

64 42 70 58 90 68 50 78

43 89 45 93 61 81 58 65

69 76 88 58 91 67 71 52

55 40 80 80 78 46 61 69

STATISTICS

The lowest value : 40

The highest value : 99

Difference : 99-40 = 59

Width of class : 5

Number of class intervals =

12

5

59

5

59

The lowest value : 40

The highest value : 99

Difference : 99-40 = 59

Width of class : 10

Number of class intervals =

610

59

Class intervals

40-49

50-59

60-69

70-79

80-89

90-99

Class Limit Lower limit : the lowest value of the class interval Upper limit: the highest value of the class interval

Class Limit

Class intervals Lower limit Upper limit

40-49 40 49

50-59 50 59

60-69 60 69

70-79 70 79

80-89 80 89

90-99 90 99

Class Boundary

-lower boundary is the midpoint between the lower limit of the class interval and the upper limit of the previous class interval

- Upper boundary is the midpoint between the upper limit of the class interval and the lower limit of the succeeding class interval

Class boundary

40 - 49 50 – 59 60 – 69

Lower boundary

5.49

)5049(2

1

5.59

)6059(2

1

Upper boundary

Class Boundary

Class intervals

Lower limit Upper limit Lower Boundary

Upper boundary

40-49 40 49 39.5 49.5

50-59 50 59 49.5 59.5

60-69 60 69 59.5 69.5

70-79 70 79 69.5 79.5

80-89 80 89 79.5 89.5

90-99 90 99 89.5 99.5

Class intervals

Lower limit

Upper limit

Lower Boundary

Upper boundary

Midpoint Frequency

40-49 40 49 39.5 49.5 44.5 5

50-59 50 59 49.5 59.5 54.5 7

60-69 60 69 59.5 69.5 64.5 8

70-79 70 79 69.5 79.5 74.5 9

80-89 80 89 79.5 89.5 84.5 6

90-99 90 99 89.5 99.5 94.5 5

40

HISTOGRAM

bar chart with

i. horizontal axis represented by the upper boundary and the vertical axis represented by the frequency

Or

ii. frequency versus midpoint

Histogram

29.5-39.5 39.5-49.5 49.5-59.5 59.5-69.5 69.5-79.5 79.5-89.5 89.5-99.50

1

2

3

4

5

6

7

8

9

10

Histogram of marks

Upper boundary(marks)

freq

uen

cy

HISTOGRAM

44.5 54.5 64.5 .74.5 84.5 94.50

1

2

3

4

5

6

7

8

9

10

Histogram of Marks

midpoints ( marks )

Fre

qu

ency

Frequency polygon

is a closed line graphTwo methods :i. From histogramii. frequency versus midpoint

29.5-39.5 39.5-49.5 49.5-59.5 59.5-69.5 69.5-79.5 79.5-89.5 89.5-99.5 99.5-109.50

1

2

3

4

5

6

7

8

9

10

Histogram and Frequency polygon

Upper boundary (marks )

Fre

quen

cy

Frequency polygon

34.5 44.5 54.5 64.5 .74.5 84.5 94.5 104.50123456789

10

Frequency polygon of marks

Midpoint (marks)

Fre

quen

cy

Cumulative Frequency Table

Class intervals

Upper boundary

Frequency Cumulative frequency

30-39 39.5 0 0

40-49 49.5 5 5

50-59 59.5 7 12

60-69 69.5 8 20

70-79 79.5 9 29

80-89 89.5 6 35

90-99 99.5 5 40

40

Must add a class interval with 0 frequency

Ogive : Cumulative frequency curve

Cumulative frequency versus upper boundaries

29.5 39.5 49.5 59.5 69.5 79.5 89.5 99.50

5

10

15

20

25

30

35

40

45

Ogive of marks

Upper boundary ( marks )

Cu

mu

lati

ve f

req

uen

cy

Measures of Dispersion

- the amount or distances the values are spread out in a set of data

i. Range :midpoint of the highest class – midpoint of the lowest class

ii. Median : the value at half of the distributioniii. First quartile(Q1): the value at the first quarter iv. Third quartile (Q3): the value at the third quarterv. Interquartile range : Q3 – Q1

29.5 39.5 49.5 59.5 69.5 79.5 89.5 99.50

5

10

15

20

25

30

35

40

45

Ogive of marks

Upper boundary ( marks )

Cu

mu

lati

ve f

req

uen

cy

Exercise 1: SPM JUN’09

a) Complete the table below

Marks Frequency Midpoint Upper boundary Cumulative frequency

60-64 0 62 64.5 0

65-69 2 67 69.5 2

70-74 6