Organizing Data
-
Upload
anjelita-ortiz -
Category
Documents
-
view
31 -
download
4
description
Transcript of Organizing Data
![Page 1: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/1.jpg)
Organizing Data
![Page 2: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/2.jpg)
Raw Data
Data collected in its original form
![Page 3: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/3.jpg)
Suppose a researcher wants to do a study on the number of miles that employees of Wal-Mart travel each day. She collects the following data: (see data set 1)
1 2 6 7 12 13 2 6 9 5
18 7 3 15 15 4 17 1 14 5
5 16 4 5 8 6 5 18 5 2
9 11 12 1 9 2 10 11 4 10
9 18 8 8 4 14 7 3 2 6
![Page 4: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/4.jpg)
Frequency Distribution
The organization of raw data in table form, using classes and frequencies
Class – a subset of the data Frequency – number of times in a
class Class limit – beginning and end of a
class
![Page 5: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/5.jpg)
1 2 6 7 12 13 2 6 9 5
18 7 3 15 15 4 17 1 14 5
5 16 4 5 8 6 5 18 5 2
9 11 12 1 9 2 10 11 4 10
9 18 8 8 4 14 7 3 2 6
Class limits TallyFrequency
1-3
4-6
7-9
10-12
13-15
16-18
![Page 6: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/6.jpg)
1 2 6 7 12 13 2 6 9 5
18 7 3 15 15 4 17 1 14 5
5 16 4 5 8 6 5 18 5 2
9 11 12 1 9 2 10 11 4 10
9 18 8 8 4 14 7 3 2 6
Class limits TallyFrequency
1-3 ////’////’ 10
4-6 ////’////’//// 14
7-9 ////’////’ 10
10-12 ////’/ 6
13-15 ////’ 5
16-18 ////’ 5
![Page 7: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/7.jpg)
Categorical Frequency Dist.
Used when data can be placed in specific categories, such as nominal or ordinal data. For example, political affiliation, religion, class…
![Page 8: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/8.jpg)
See Data set 2
Twenty five army inductees were given a blood test to determine their blood type. Data set 2 is the result.
Create a frequency distribution for the data.
![Page 9: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/9.jpg)
A B B AB O
O O B AB B
B B O A O
A O O O AB
AB A O B A
Class Tally FrequencyPercent
![Page 10: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/10.jpg)
A B B AB O
O O B AB B
B B O A O
A O O O AB
AB A O B A
Class Tally FrequencyPercent
A
B
O
AB
![Page 11: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/11.jpg)
A B B AB O
O O B AB B
B B O A O
A O O O AB
AB A O B A
Class Tally FrequencyPercent
A ////’ 5 20
B ////’// 7 28
O ////’//// 9 36
AB //// 4 16
![Page 12: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/12.jpg)
Grouped Frequency Dist.
Used when data is continuous.
For example, the number of hours that boat batteries last (See Data set 3)
![Page 13: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/13.jpg)
Class Class Cum.
Limit Boundary Tally Freq. Freq.
24-30 23.5-30.5 /// 3 3
31-37 30.5-37.5 / 1 4
38-44 37.5-44.5 ////’ 5 9
45-51 44.5-51.5 ////’//// 9 18
52-58 51.5-58.5 ////’/ 6 24
59-65 58.5-65.5 / 1 25
![Page 14: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/14.jpg)
Class Class Cum.
Limit Boundary Tally Freq. Freq.
24-30 23.5-30.5 /// 3 3
31-37 30.5-37.5 / 1 4
38-44 37.5-44.5 ////’ 5 9
45-51 44.5-51.5 ////’//// 9 18
52-58 51.5-58.5 ////’/ 6 24
59-65 58.5-65.5 / 1 25
![Page 15: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/15.jpg)
Class Limits
Each limit is the same size
![Page 16: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/16.jpg)
Class Class Cum.
Limit Boundary Tally Freq. Freq.
24-30 23.5-30.5 /// 3 3
31-37 30.5-37.5 / 1 4
38-44 37.5-44.5 ////’ 5 9
45-51 44.5-51.5 ////’//// 9 18
52-58 51.5-58.5 ////’/ 6 24
59-65 58.5-65.5 / 1 25
![Page 17: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/17.jpg)
Class Boundary
Used to close the gaps in the frequency distribution.
Class limits should have the same decimal place as the raw data, and class boundaries should have one additional decimal place and end in a 5
![Page 18: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/18.jpg)
Class Class Cum.
Limit Boundary Tally Freq. Freq.
24-30 23.5-30.5 /// 3 3
31-37 30.5-37.5 / 1 4
38-44 37.5-44.5 ////’ 5 9
45-51 44.5-51.5 ////’//// 9 18
52-58 51.5-58.5 ////’/ 6 24
59-65 58.5-65.5 / 1 25
![Page 19: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/19.jpg)
Cumulative Frequency
Used when you want to know how much data falls into 2 or more classes.
![Page 20: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/20.jpg)
Frequency Distribution Rules
![Page 21: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/21.jpg)
Between 5 and 20 classes
No hard and fast rule, but should give a clear description of the collected data
![Page 22: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/22.jpg)
Class width should be an odd number
This ensures that the midpoint of the class has the same decimal place value as the data
Not always rigorously followed, especially when computers are used to group data
![Page 23: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/23.jpg)
Class must be mutually exclusive
No overlapping class limits.
For example: Age 10-20, 20-30, etc. If you are 20, which group do you belong to?
![Page 24: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/24.jpg)
The classes must be continous
If there are no values in a class, the class must be included in the distribution. The only exception is if the first class has zero frequency or if the last class has zero frequency.
![Page 25: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/25.jpg)
The classes must be exhaustive
There should be enough classes to accommodate all the data.
![Page 26: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/26.jpg)
The classes must be equal in width
This avoids distorting the view of the data.
One exception: Open-ended distributions. For example, age. 10-20, 20-30, 30-40, 40-50, over 50
![Page 27: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/27.jpg)
Data Set 4 represents the record high temperatures for each of the 50 states.
Data Set 5 represents the number of miles per gallon that 30 selected SUV’s obtained in city driving.
![Page 28: Organizing Data](https://reader036.fdocuments.in/reader036/viewer/2022062422/56812dfc550346895d935b54/html5/thumbnails/28.jpg)
Assignment
Create a frequency distribution for data set 4 using 7 classes and for data set 5 using 8. Include: class limits, class boundaries, tallies, frequency, and cumulative frequency in your tables.