Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)
-
Upload
netsquared-victoria -
Category
Government & Nonprofit
-
view
79 -
download
0
description
Transcript of Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)
![Page 1: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/1.jpg)
Data Analysis for Everyone
![Page 2: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/2.jpg)
Martin Monkman
• Provincial Statistician & Director, BC Stats
• been getting paid to do data analysis in one form or another since the mid-1980s
• B.Sc. and M.A. in Geography (UVic)
• member of SABR
• bayesball.blogspot.ca
![Page 3: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/3.jpg)
![Page 4: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/4.jpg)
![Page 5: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/5.jpg)
![Page 6: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/6.jpg)
1. Start with a question
ALWAYS!
And don’t start with data!
• Five Ws
![Page 7: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/7.jpg)
Some examples of questions
• What was the population of Victoria in 1996? And what will the population of Victoria be in 2029?
• What are the demographics of Victoria?
• What do Victoria residents think about infrastructure investment?
![Page 8: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/8.jpg)
![Page 9: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/9.jpg)
2. Get some data
Remember: after your research question has been asked!
Two sources:
• Third party data
• Collect your own
![Page 10: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/10.jpg)
Sources of third party data
Open Data
• Social data: Statistics Canada
• The Census of Canada
• National Household Survey
• www.statcan.gc.ca
• DataBC
• www.data.gov.bc.ca
![Page 11: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/11.jpg)
Collect your own data
Administrative sources
• Registration information
• Transactions
Original data collection
• Survey
![Page 12: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/12.jpg)
Surveys
From the Twenty Questions:
• Who is your population?
• How are you going to reach them?
• What do you already know about them?
![Page 13: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/13.jpg)
![Page 14: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/14.jpg)
![Page 15: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/15.jpg)
• Differences
• Distributions
• Magnitude
• Patterns
• Proportions
• Relationships
• Trends
3. Data Analysis
![Page 16: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/16.jpg)
• MOOCs
• google “Making Sense of Data”
• Coursera
• https://www.coursera.org/course/introstats
• https://www.coursera.org/course/dataanalysis
• https://www.class-central.com/mooc/388/coursera-computing-for-data-analysis
Data Analysis: How-to
![Page 17: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/17.jpg)
![Page 18: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/18.jpg)
“Graphics are instruments for reasoning about quantitative information.” (Edward R. Tufte)
Purposes
• Exploratory Data Analysis
• Narrative
4. Data Visualization
![Page 19: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/19.jpg)
“The greatest value of a picture is when it forces us to notice what we never expected to see.” – John Tukey
![Page 20: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/20.jpg)
Anscombe’s Quartet
STATISTICAL MEASURES OF
EACH OF THE FOUR DATA SETS
Mean of x = 9 (exact)
Variance of x = 11 (exact)
Mean of y = 7.50
Variance of y = 4.122 or 4.127
Correlation between x and y = 0.816
Regression equation:
y = 3.00 + 0.500x
![Page 21: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/21.jpg)
![Page 22: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/22.jpg)
![Page 23: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/23.jpg)
Population pyramid
![Page 24: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/24.jpg)
![Page 25: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/25.jpg)
http://cran.r-project.org/
![Page 26: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/26.jpg)
Capital Regional District, population by municipality, 2013
Data source: Statistics Canada & BC Stats
![Page 27: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/27.jpg)
Capital Regional District, population by municipality and region, 2013
Data source: Statistics Canada & BC Stats
![Page 28: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/28.jpg)
Capital Regional District population, 1996-2013
Data source: Statistics Canada & BC Stats
![Page 29: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/29.jpg)
Year-over-year population change, Capital Regional District
Data source: Statistics Canada & BC Stats
![Page 30: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/30.jpg)
Census tracts
Data source: Statistics Canada & BC Stats
![Page 31: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/31.jpg)
Victoria CMA – median after-tax income (2005), by Census Tract
Data source: Statistics Canada & BC Stats
![Page 32: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/32.jpg)
Data source: Statistics Canada
![Page 33: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/33.jpg)
Source: Harvard Dialect Survey / Joshua Katz
Mapping
![Page 34: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/34.jpg)
How can I improve my data visualizations?
• Work with data
• Experiment
• Get feedback from others
• Look for good examples
• Look for bad examples
![Page 35: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/35.jpg)
![Page 36: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/36.jpg)
Five Degrees of Obfuscation
Debris
Garbage
Rubbish
Trash
Waste
0
5
10
15
20
25
Trash Debris Rubbish Waste GarbageU
nit
s
Five Columns of Clarity
![Page 37: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/37.jpg)
Foreshortened circles
![Page 38: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/38.jpg)
An illusion of distance and volume
![Page 39: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/39.jpg)
No 3D. Ever.
![Page 40: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/40.jpg)
![Page 41: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/41.jpg)
![Page 42: Net2Vic: Effective Data Analysis for Everyone (October 23, 2014)](https://reader033.fdocuments.in/reader033/viewer/2022060203/559df4681a28ab297d8b4697/html5/thumbnails/42.jpg)