SADC Course in Statistics Analysing Data Module I3 Session 1.
-
Upload
makayla-ramos -
Category
Documents
-
view
242 -
download
1
Transcript of SADC Course in Statistics Analysing Data Module I3 Session 1.
![Page 1: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/1.jpg)
SADC Course in Statistics
Analysing Data
Module I3 Session 1
![Page 2: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/2.jpg)
2To put your footer here go to View > Header and Footer
Contents of this Module
Session
1
2 & 3
4 & 5
6 & 7
8 & 9
10 to 12
13
14 to 16
17 & 18
19 & 20
Contents
Review of concepts from Basic Level
Graphical summaries for quantitative data
Numerical summaries for quantitative data
Processing single and multiple variables
Risks and return periods
Tables for frequencies and other statistics
Introducing statistics packages
Coping with common complications
Group project
Presentation and evaluation
![Page 3: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/3.jpg)
3To put your footer here go to View > Header and Footer
Modules at basic and Intermediate levels
• Module B2• From the data to the report
• Intermediate level• Module I1 – collecting data (follows from B1)
• Modules I2 to I4 follow from B2 • Module I2 – organising the data• Module I3 – analysing the data• Module I4 – presenting results
![Page 4: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/4.jpg)
4To put your footer here go to View > Header and Footer
Module Objectives Successful students will be able to:• Use descriptive analysis tools
• to answer practical questions.
• Produce descriptive statistical analyses• including summary statistics, tables and graphs.
• Interpret common summary statistics,• particularly measures of variation.
• Produce summary statistics in a range of ways• to suit different types of user.
• Suggest ways of coping with common complications when analysing survey data.
• Work constructively in a team • to produce an analysis on time.
• Evaluate team skills of themselves and others.
![Page 5: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/5.jpg)
5To put your footer here go to View > Header and Footer
Pre-requisites - computing
Assume Basic Level or equivalent
• Mainly use of Excel:• Importance of having data in list format• Pivot tables and pivot graphs• Calculations to produce percentages and proportions
from frequencies• Familiarity with SSC-Stat add-in
• Some familiarity with Word and Powerpoint• Though manly needed for Module I4
![Page 6: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/6.jpg)
6To put your footer here go to View > Header and Footer
Pre-requisites - statistics
Assumes Basic Level or equivalent
• What is statistics?• Module B2 Session 3
• Use of CAST• Interactive statistics textbook
• Types of data and appropriate summaries• Categorical and numerical
• Enthusiasm and no fear • Module B2 showed statistics was logical and not so
difficult?
![Page 7: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/7.jpg)
7To put your footer here go to View > Header and Footer
Session Overview
• Activity 1:Introduction – This PowerPoint presentation
• Activity 2 to 5:Practical– CAST – Excel for Tables – Dot plots in CAST and Excel– The objectives of an analysis
• Activity 6: Summary of key ideas– This presentation continued
![Page 8: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/8.jpg)
8To put your footer here go to View > Header and Footer
Learning objective
• Answer questions expected of students who have taken module B2.
![Page 9: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/9.jpg)
9To put your footer here go to View > Header and Footer
Practical work
• You use CAST• At basic level• To review tables and also dot plots
• You use Excel • To produce and edit pivot tables• And to produce dot plots
• You view demonstrations• To remind you of Excel• And also statistics
• Then the key ideas are discussed• and some of the case studies are re-introduced
![Page 10: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/10.jpg)
10To put your footer here go to View > Header and Footer
This is a problem based course
• Examples are used throughout:• Skills and tools are introduced to solve problems
• Survey of Principles of Official Statistics• Used extensively in B2• Also useful to remember the principles• Are countries applying them yet?
• Rice survey data• Used in B2 to illustrate many ideas• and produced in the paddy (simulation) game in I1
• Tanzania and Swaziland agriculture data• Large surveys• Will be used again in this module
![Page 11: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/11.jpg)
11To put your footer here go to View > Header and Footer
CAST
• CAST is an electronic textbook• It was used extensively in Module B2• It covers key topics in an interactive way.
– Some from the course – Others related to but not covered by the course
• As the course progresses students are expected to– Work independently more and more– Read around – Use books to enrich the course materials
![Page 12: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/12.jpg)
12To put your footer here go to View > Header and Footer
Editing a pivot table
How did you do? Was it easy?
What questions do you have?
![Page 13: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/13.jpg)
13To put your footer here go to View > Header and Footer
Rice Survey Case Study - objectives
Overall objectives are:• To estimate the total production in the district• To examine the relationship with inputs
![Page 14: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/14.jpg)
14To put your footer here go to View > Header and Footer
Analyses corresponding to simple objectives
![Page 15: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/15.jpg)
15To put your footer here go to View > Header and Footer
More complicated objectives
Objectives require analysis of a single column or variable
Some variables are categorical
Others are numerical
Objectives require analysis of multiple
variables
![Page 16: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/16.jpg)
16To put your footer here go to View > Header and Footer
Using Excel effectively
• Dot plots are not on Excel’s menus
• Dot plots are not in Excel’s help
• But you decided to do dot plots in Excel!• You therefore need to understand them better• So you can construct them yourself• And this understanding is good anyway• And helps with effective data analysis
• It is an example• Of you controlling the software• And not being limited by it
• That applies to all software
![Page 17: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/17.jpg)
17To put your footer here go to View > Header and Footer
Jittered dot plots in CAST and Excel
CAST
EXCEL
Rainfall data: 608, 746, 767, ….. 1395, 1425, 1482
![Page 18: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/18.jpg)
18To put your footer here go to View > Header and Footer
Jittered dot plots in CAST and Excel
CAST
EXCEL
Why are the vertical heights different in the 2 cases?
![Page 19: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/19.jpg)
19To put your footer here go to View > Header and Footer
Excel for analysis and training
• Excel is not designed as a training resource• Unlike CAST – that is all CAST is for
• Excel is to support • data organisation• and analysis
• But we used it also to support training• With dot plots• And stem and leaf plots• Neither of which are in the Excel menus
![Page 20: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/20.jpg)
20To put your footer here go to View > Header and Footer
Data exploration
• Before and during formal analysis
• For all variables
• But particularly for numerical variables
• That are treated extensively in this module
• Review data exploration from Module B2
![Page 21: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/21.jpg)
21To put your footer here go to View > Header and Footer
Dot plots - yield by variety
Outliers (typing errors) are clear, but only because of the 2nd variable
They are not outliers overall
![Page 22: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/22.jpg)
22To put your footer here go to View > Header and Footer
EDA is a continuous process
• EDA effectively is a continuation of the data checking process
• The example on the previous slide shows• how some oddities only become clear once the analysis
is undertaken
• This continues into the formal analysis• where it involves looking at the “residuals”
• They are the unexplained variation• As discussed in Module B2 Session 3!
• So analysis is not just a set of rules• It is a thoughtful process• Where you become the data detective!
![Page 23: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/23.jpg)
23To put your footer here go to View > Header and Footer
Swaziland data was for checking
![Page 24: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/24.jpg)
24To put your footer here go to View > Header and Footer
Investigating the column called Presence
What does 0 mean?
Why are there blanks?
Next steps:
1. Look at the questionnaire
2. Select these records
You are becoming detectives!
![Page 25: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/25.jpg)
25To put your footer here go to View > Header and Footer
Codes for the column
Seems clear enough. Zeros and blanks still a puzzle
![Page 26: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/26.jpg)
26To put your footer here go to View > Header and Footer
Selecting the blank records
i.e. serious problems with the whole record
Missing also
Too young and all the same
Crop code not recognised Areas too large
![Page 27: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/27.jpg)
27To put your footer here go to View > Header and Footer
Dot plot of area by Presence
Odd crop areas were ALL associated with odd codes for the column PRESENCE
It was found to be a data transfer problem with one byte missing in these records
![Page 28: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/28.jpg)
28To put your footer here go to View > Header and Footer
Tanzania agriculture survey
This is the variable we wish to explore. It is a value between 0 and 100
![Page 29: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/29.jpg)
29To put your footer here go to View > Header and Footer
The data in Excel
The variable to explore before analysis
![Page 30: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/30.jpg)
30To put your footer here go to View > Header and Footer
How to explore this value
• Try a pivot table• a powerful feature in Excel• used previously on categorical data
• Used here for a numerical variable
![Page 31: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/31.jpg)
31To put your footer here go to View > Header and Footer
Some results
![Page 32: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/32.jpg)
32To put your footer here go to View > Header and Footer
Drilling down – an example
Make the 6 corresponding to 2% the active cell
Then double click to give the detail
4 of these values are from the same village – so same enumerator
![Page 33: SADC Course in Statistics Analysing Data Module I3 Session 1.](https://reader036.fdocuments.in/reader036/viewer/2022062303/5515fe85550346d46f8b5a23/html5/thumbnails/33.jpg)
33To put your footer here go to View > Header and Footer
Are you now ready for module I3?
To continue to build skills for data analysis