Data Journalism 2: Interrogating, Visualising and Mashing

Online Journalism City University Paul Bradshaw Data 2: Interrogating, visualising, mashing Monday, 7 March 2011

description

Session for MA students at City University's Journalism School

Transcript of Data Journalism 2: Interrogating, Visualising and Mashing

Page 1: Data Journalism 2: Interrogating, Visualising and Mashing

Online Journalism
City University
Paul Bradshaw

Data 2: Interrogating, visualising, mashing

Monday, 7 March 2011

Page 2: Data Journalism 2: Interrogating, Visualising and Mashing

5 things you need to know about each
Data journalism in action
Walkthrough

Themes

Page 3: Data Journalism 2: Interrogating, Visualising and Mashing

Interrogating data

Page 5: Data Journalism 2: Interrogating, Visualising and Mashing

1. Data always needs cleaning up
2. Treat the ‘source’ like a source
3. Use the right ‘average’ and percentage
4. Variation over time & space: context
5. Spreadsheet tools are your friend - but always backup copies

5 things you need to know about interrogating data
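
Point 3 can be illustrated with a minimal Python sketch. The salary figures are hypothetical, invented only to show how one extreme value pulls the mean away from the median, and how a percentage change differs from a percentage point change.

```python
from statistics import mean, median

# Hypothetical salaries (in pounds), invented purely for illustration.
salaries = [18_000, 21_000, 22_000, 24_000, 250_000]

mean_salary = mean(salaries)      # pulled upwards by the one very high value
median_salary = median(salaries)  # the middle value, closer to a "typical" salary


def percentage_change(old, new):
    """Relative change from old to new, as a percentage of old."""
    return (new - old) / old * 100


def percentage_point_change(old_rate, new_rate):
    """Absolute difference between two rates that are already percentages."""
    return new_rate - old_rate


# A rate rising from 4% to 5% is a rise of 1 percentage point,
# but a 25% increase relative to where it started.
print(mean_salary, median_salary)
print(percentage_change(4, 5), percentage_point_change(4, 5))
```

Which figure is the "right" one depends on the story: the median usually describes a typical case better when the data is skewed.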

Page 7: Data Journalism 2: Interrogating, Visualising and Mashing

“What the Independent have done is confuse the UK’s deficit with our debt [making] the debt problem look around eight times worse than it is. And it used the whole of its front page to do so.”

- James Ball

Page 9: Data Journalism 2: Interrogating, Visualising and Mashing

Measurement doesn't answer anything if there's only one variable
Statistical significance
Sample size and selection
Controls and the placebo effect
Read up.

What is the data worth?
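
As a rough illustration of why sample size matters, the sketch below uses the standard approximation for the margin of error of a proportion at 95% confidence. It assumes a simple random sample, which many real surveys are not, so treat the numbers as a guide only.

```python
import math


def margin_of_error(sample_size, proportion=0.5, z=1.96):
    """Approximate 95% margin of error for a proportion.

    Assumes a simple random sample; proportion=0.5 is the worst case.
    """
    return z * math.sqrt(proportion * (1 - proportion) / sample_size)


# The margin shrinks as the sample grows, but with diminishing returns.
for n in (100, 1_000, 10_000):
    print(n, round(margin_of_error(n) * 100, 1), "percentage points")
```

A difference between two survey figures that is smaller than the margin of error may not be a story at all.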

Page 10: Data Journalism 2: Interrogating, Visualising and Mashing

1. Variance is interesting.
2. Variance is different for different variables and in different populations.
3. The amount of variance is easily quantified.

- Philip Meyer, Precision Journalism
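
To show how easily variance can be quantified, here is a small sketch using Python's standard library. The two sets of figures are hypothetical: they share the same mean but are spread very differently.

```python
from statistics import mean, pvariance, pstdev

# Hypothetical figures for two areas, invented for illustration.
area_a = [50, 50, 50, 50]
area_b = [20, 40, 60, 80]

for name, values in (("A", area_a), ("B", area_b)):
    # Population variance: the mean squared distance from the mean.
    # The standard deviation is its square root, in the original units.
    print(name, mean(values), pvariance(values), round(pstdev(values), 2))
```

Reporting only the average would make the two areas look identical; the variance is where the difference shows.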

Page 11: Data Journalism 2: Interrogating, Visualising and Mashing

Data > Text to columns
Find & replace
Conditional formulas:
=IF(condition, if met, if not)
=COUNTIF(range, test)

Getting data in the right form
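
For readers who prefer code to spreadsheet menus, the same four operations can be sketched in Python. This is an analogy rather than how the spreadsheet works internally, and the names and amounts are hypothetical.

```python
# Hypothetical cells, invented for illustration.
raw_names = ["Smith, John", "Jones, Mary", "Patel, Asha"]
raw_amounts = ["£1,200", "£950", "£3,400"]

# Data > Text to columns: split one cell into several on a delimiter.
name_columns = [cell.split(", ") for cell in raw_names]

# Find & replace: strip the currency sign and thousands separator,
# then convert the text to a number.
amounts = [int(cell.replace("£", "").replace(",", "")) for cell in raw_amounts]

# =IF(condition, if met, if not): label each amount.
labels = ["high" if amount > 1000 else "low" for amount in amounts]

# =COUNTIF(range, test): count the cells that pass a test.
high_count = sum(1 for amount in amounts if amount > 1000)

print(name_columns)
print(amounts, labels, high_count)
```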

Page 12: Data Journalism 2: Interrogating, Visualising and Mashing

Edit cells > common transforms
Edit cells > split multi-valued cells
Facet > text facet
Export...

Walkthrough: cleaning data in Google Refine
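
The steps in this walkthrough can be approximated in plain Python to show what each one does to the data. This is a simplified sketch, not Google Refine's own behaviour in every detail, and the company rows are hypothetical.

```python
import csv
import io
from collections import Counter

# Hypothetical messy rows, invented for illustration.
rows = [
    {"name": "  Acme Ltd ", "sectors": "energy; transport"},
    {"name": "ACME LTD", "sectors": "energy"},
    {"name": "Bolt  plc", "sectors": "retail;transport"},
]


def common_transforms(value):
    """Trim, collapse repeated whitespace and convert to title case."""
    return " ".join(value.split()).title()


def split_multi_valued(value, separator=";"):
    """Split one cell holding several values into a list of clean values."""
    return [part.strip() for part in value.split(separator) if part.strip()]


# One output row per (name, sector) pair, roughly what splitting
# multi-valued cells produces.
cleaned = [
    {"name": common_transforms(row["name"]), "sector": sector}
    for row in rows
    for sector in split_multi_valued(row["sectors"])
]

# A text facet is essentially a count of each distinct value in a column.
name_facet = Counter(common_transforms(row["name"]) for row in rows)
sector_facet = Counter(row["sector"] for row in cleaned)

# Export the cleaned rows as CSV text.
buffer = io.StringIO()
writer = csv.DictWriter(buffer, fieldnames=["name", "sector"], lineterminator="\n")
writer.writeheader()
writer.writerows(cleaned)
exported_csv = buffer.getvalue()

print(name_facet)
print(sector_facet)
print(exported_csv)
```

The facet counts are what reveal inconsistencies: before cleaning, the two spellings of the same company would have appeared as separate values.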

Page 13: Data Journalism 2: Interrogating, Visualising and Mashing

Visualising data

Page 14: Data Journalism 2: Interrogating, Visualising and Mashing

1. Choose the chart for the purpose
2. It can be used to spot a lead
3. Good design is when there’s nothing more to take away
4. It should be self-contained & have refs
5. Be careful with scales and classes

5 things you need to know about visualising data
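
One way to see why scales need care (point 5) is to work out how a truncated axis changes the apparent difference between two bars. The values below are hypothetical, and the function is only a sketch of the arithmetic.

```python
def apparent_ratio(smaller, larger, axis_start=0):
    """Ratio of the drawn bar heights when the value axis begins at axis_start."""
    return (larger - axis_start) / (smaller - axis_start)


# Hypothetical values: 50 and 55, a real difference of 10%.
honest = apparent_ratio(50, 55)                    # axis starts at zero
truncated = apparent_ratio(50, 55, axis_start=45)  # axis starts at 45

# With the truncated axis the second bar is drawn twice as tall as the first.
print(round(honest, 2), round(truncated, 2))
```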

Page 15: Data Journalism 2: Interrogating, Visualising and Mashing

or http://chartchooser.juiceanalytics.com/

Page 18: Data Journalism 2: Interrogating, Visualising and Mashing

What is wrong with this picture?

Page 20: Data Journalism 2: Interrogating, Visualising and Mashing

http://simplecomplexity.net/statistics-without-context/

Page 21: Data Journalism 2: Interrogating, Visualising and Mashing

http://junkcharts.typepad.com/junk_charts/trifecta-checkup/

Page 22: Data Journalism 2: Interrogating, Visualising and Mashing

ManyEyes
Tableau
Wordle, Tagxedo
BatchGeo
Gephi
Delicious.com/paulb/visualisation+tools

Visualisation tools

Page 23: Data Journalism 2: Interrogating, Visualising and Mashing

Walkthrough: visualising data with Google Gadgets

Page 24: Data Journalism 2: Interrogating, Visualising and Mashing

Walkthrough: visualising data in ManyEyes

Page 25: Data Journalism 2: Interrogating, Visualising and Mashing

Mashing data

Page 26: Data Journalism 2: Interrogating, Visualising and Mashing

1. It is what a journalist does best
2. Look for a point of connection: place? Person? Company? Date?
3. What an API can do
4. What APIs there are
5. Mashups can be live, updated or static

5 things you need to know about mashing data
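
The "point of connection" in point 2 is what a programmer would call a join key. The sketch below combines two hypothetical datasets on a shared company name; the datasets, field names and the normalising rule are all assumptions made for illustration.

```python
# Two hypothetical datasets that share a point of connection: the company.
donations = [
    {"company": "Acme Ltd", "donation": 5000},
    {"company": "Bolt plc", "donation": 1200},
]
contracts = [
    {"company": "acme ltd ", "contract_value": 250000},
    {"company": "Cog & Co", "contract_value": 80000},
]


def connection_key(name):
    """Normalise a name so small differences in case and spacing still match."""
    return " ".join(name.lower().split())


# Index one dataset by the key, then look up each row of the other.
contracts_by_company = {connection_key(row["company"]): row for row in contracts}

mashed = [
    {**row, "contract_value": contracts_by_company[connection_key(row["company"])]["contract_value"]}
    for row in donations
    if connection_key(row["company"]) in contracts_by_company
]

print(mashed)
```

Only companies present in both datasets survive the join, which is often exactly the list worth investigating.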

Page 29: Data Journalism 2: Interrogating, Visualising and Mashing

Yahoo! Pipes
OpenHeatMap
Mapalist
xFruits
Scraperwiki
Maptube

Mashup tools

Page 30: Data Journalism 2: Interrogating, Visualising and Mashing

Inputs - Fetch Feed, CSV, Data, Page, YQL, Flickr, Form
Operators - Filter, Sort, Unique, Union, Count, Split, Rename, Regex, Location extractor, URL Builder
Outputs - Map, Gallery, List, XML, KML

Walkthrough: making mashups with Yahoo! Pipes
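
The input, operator and output idea can be sketched as a chain of small Python functions. This is an analogy for how a pipe is wired together, not Yahoo! Pipes' own code; the feed items and function names are hypothetical.

```python
# Input: hypothetical feed items, as a Fetch Feed module might supply them.
items = [
    {"title": "Council cuts library budget", "link": "http://example.com/1", "published": "2011-03-01"},
    {"title": "Match report", "link": "http://example.com/2", "published": "2011-03-02"},
    {"title": "Council cuts library budget", "link": "http://example.com/1", "published": "2011-03-01"},
    {"title": "Budget cuts hit youth services", "link": "http://example.com/3", "published": "2011-03-03"},
]


def filter_items(items, keyword):
    """Filter: keep only items whose title contains the keyword."""
    return [item for item in items if keyword.lower() in item["title"].lower()]


def unique_items(items, field):
    """Unique: drop items that repeat an earlier item's value for the field."""
    seen = set()
    result = []
    for item in items:
        if item[field] not in seen:
            seen.add(item[field])
            result.append(item)
    return result


def sort_items(items, field, descending=True):
    """Sort: order items by the field, newest first by default."""
    return sorted(items, key=lambda item: item[field], reverse=descending)


# Wire the operators together; the output here is a simple list.
pipe_output = sort_items(unique_items(filter_items(items, "cuts"), "link"), "published")

for item in pipe_output:
    print(item["published"], item["title"])
```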

Page 31: Data Journalism 2: Interrogating, Visualising and Mashing

Format the spreadsheet
Publish it as CSV
Copy link
Paste it at OpenHeatMap
Fix any problems

Walkthrough: making mashups with OpenHeatMap
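
Many of the problems in the last step come from blank locations or values that are not numbers. The sketch below checks a published CSV for both before it is pasted anywhere. The column names "location" and "value" are assumptions for illustration, so check which headers OpenHeatMap actually expects; the data is hypothetical.

```python
import csv
import io

# Hypothetical CSV text, as it might look after publishing the spreadsheet.
csv_text = """location,value
Birmingham,12
,7
Leeds,n/a
"""


def find_problems(text, location_column="location", value_column="value"):
    """Return (line number, message) pairs for rows a mapping tool may reject."""
    problems = []
    reader = csv.DictReader(io.StringIO(text))
    for line_number, row in enumerate(reader, start=2):  # line 1 is the header
        if not (row.get(location_column) or "").strip():
            problems.append((line_number, "missing location"))
        try:
            float(row.get(value_column) or "")
        except ValueError:
            problems.append((line_number, "value is not a number"))
    return problems


for line_number, message in find_problems(csv_text):
    print(f"line {line_number}: {message}")
```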

Page 32: Data Journalism 2: Interrogating, Visualising and Mashing

Edit column > Add column by fetching URLs
Use GREL (Google Refine Expression Language)
Search web for help & examples

Walkthrough: grabbing geo data with Google Refine
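
Fetching geo data comes down to two steps: build a URL for each cell, then pull the coordinates out of the response. In Refine both steps would be written as GREL expressions; the Python sketch below shows the same logic. The geocoder address, its parameters and the response format are hypothetical, so substitute those of whichever geocoding service you use.

```python
import json
from urllib.parse import urlencode

# Hypothetical geocoding endpoint; not a real service.
GEOCODER = "https://geocoder.example.com/search"


def build_url(place):
    """Build the URL to fetch for one cell, escaping the place name."""
    return GEOCODER + "?" + urlencode({"q": place, "format": "json"})


# Hypothetical response text; a real service will use its own structure.
sample_response = '{"results": [{"lat": 52.48, "lng": -1.89}]}'


def extract_lat_lng(response_text):
    """Parse the JSON response and return the first result's coordinates."""
    data = json.loads(response_text)
    first = data["results"][0]
    return first["lat"], first["lng"]


print(build_url("Birmingham, UK"))
print(extract_lat_lng(sample_response))
```

No request is actually sent here; in Refine the fetching itself is handled by the "Add column by fetching URLs" command.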

Page 33: Data Journalism 2: Interrogating, Visualising and Mashing

Questions?

Page 34: Data Journalism 2: Interrogating, Visualising and Mashing

Links

OnlineJournalismClasses.tumblr.com
Delicious.com/paulb/cityoj09
Delicious.com/paulb/datajournalism
Delicious.com/paulb/visualisation
Delicious.com/paulb/statistics
Delicious.com/paulb/mashups

Page 35: Data Journalism 2: Interrogating, Visualising and Mashing

Before the lab: play with these techniques yourself, have problems, find solutions, raise questions. Install Google Refine and Tableau on your laptop.
- Visualise, interrogate or mash data

Lab

Page 36: Data Journalism 2: Interrogating, Visualising and Mashing

Books

Kaiser Fung - Numbers Rule Your World
Ben Goldacre - Bad Science
Donna Wong - The WSJ Guide to Information Graphics
Brian Suda - A Practical Guide to Designing with Data
