Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

21
Jukka-Pekka Onnela Harvard University Big Data for Development United Nations Headquarters, New York City July 10, 2012 Big Data, Social Networks, and Human Behavior 1 Tuesday, July 10, 2012

description

Presentation by Jukka-Pekka Onnela, Assistant Professor of Biostatistics at Harvard University's School of Public Health. Presented at roundtable on "BIg Data for Development" hosted by Global Pulse, an innovation initiative of the United Nations (www.unglobalpulse.org).

Transcript of Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Page 1: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Jukka-Pekka Onnela

Harvard University

Big Data for Development

United Nations Headquarters, New York City

July 10, 2012

Big Data, Social Networks, and Human Behavior

1Tuesday, July 10, 2012

Page 2: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Overview

• Progress in science has always been driven by data

• Explosion in the amount and type of data

• “Big data” refers to large and complex data sets

• Often multidimensional, longitudinal, digitally generated

• The big data phenomenon has its origin in Moore’s law:

• The number of transistors on integrated circuits doubles every 18 months

• Sensors are cheaper, smaller, everywhere

• Enhanced computational capacity

http://en.wikipedia.org/wiki/Moore%27s_law

2Tuesday, July 10, 2012

Page 3: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

http://www.boston.com/bigpicture/2008/08/the_large_hadron_collider.html

3Tuesday, July 10, 2012

Page 4: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Large Hadron Collider (LCH)

• Large Hadron Collider (LHC) at CERN is the biggest machine ever built

• Largest underground ring has a circumference of 27 kilometers (17 miles)

• 1232 dipole magnets, each 15 meters long weighing 30 tons

• Vacuum is 10 trillionth of an atmosphere

• Experiments generate 100MB of data (particle trajectories) each second

• Higgs boson

http://www.runfam.com/2011/10/why-the-hare-may-never-beat-the-tortoise-zenos-paradox-the-paradox-of-motion/

4Tuesday, July 10, 2012

Page 5: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

5Tuesday, July 10, 2012

Page 6: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

• Mobile phones have been used in the past few years to study the structure of human social and communication networks

• Networks consist of nodes (actors) and ties (interactions)

• The field of research that studies networks, their structure and function, is called network science (in physics and mathematics) or social network analysis (in sociology and statistics)

Networks and mobile sensing

6Tuesday, July 10, 2012

Page 7: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

• Network theory, when applied to social networks, has a simple premise

• People are connected, therefore our health is connected

• People are connected, therefore our economic wellbeing is connected

• Mobile phones have enormous potential for the study of human social networks and human behavior “in vivo,” in a natural context outside laboratories

• Social behavior has remained essentially unchanged for millennia, but now, for the first time, we have the opportunity to study it at large scale

Networks and mobile sensing

7Tuesday, July 10, 2012

Page 8: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

• Besides communication, smartphones have sensors and computing capabilities

• These possibilities have led to a new research field called mobile phone sensing

• Mobile sensing has evolved in the past few years for several reasons

(1) Availability of cheap embedded sensors

• Gyroscope, compass, accelerometer, proximity sensor, ambient light sensor, two cameras, microphone, GPS, WiFi, Bluetooth

(2) Smartphones are programmable

(3) Software can be easily distributed

(4) Significant computational power (phone & cloud)

• Each phone can generate 1kB of data / second (conservative)

Networks and mobile sensing

8Tuesday, July 10, 2012

Page 9: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

http://desktopwallpaperdownload.files.wordpress.com/2012/02/network-space-lights-planets-high-wallpapers-full-hd.jpg

9Tuesday, July 10, 2012

Page 10: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

• Expect 6 billion phone subscriptions by the end of 2012

• This results in 6 million MB / second, or 6 TB / second, of data

• This is 60,000 more data than CERN generates (conservative)

• Twofold opportunity:

• Use mobile phone sensing to learn about the individuals (nodes)

• Use mobile phone communication patterns and network theory to learn about the structural connections between individuals (ties)

Networks and mobile sensing

10Tuesday, July 10, 2012

Page 11: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Phone calls and texts in a European network

• 20% market share • 18 weeks (126 days)• Private subscriptions• N = 7M; L = 23M

Animation by Mikko Kivelä, Aalto University

11Tuesday, July 10, 2012

Page 12: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Tie strengths in social networks

The weak ties hypothesis

Mark Granovetter, The strength of weak ties, American Journal of Sociology 78, 1360, 1973

12Tuesday, July 10, 2012

Page 13: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Tie strengths in social networks

Revisiting the hypothesis with aggregated cell phone data

• Tie strength

• Fraction of friends in common

Onnela, Saramäki, Hyvönen, Szabó, Lazer, Kaski, Kertész, BarabásiStructure and tie strengths in mobile communication networks, PNAS 104, 7332, 2007

15 min (3 calls)5 min

7 min

3 min

13Tuesday, July 10, 2012

Page 14: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Tie strengths in social networks

Onnela, Saramäki, Hyvönen, Szabó, Lazer, Kaski, Kertész, BarabásiStructure and tie strengths in mobile communication networks, PNAS 104, 7332, 2007

14Tuesday, July 10, 2012

Page 15: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

15

Tie strengths in social networks

15Tuesday, July 10, 2012

Page 16: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

16

Tie strengths in social networks

16Tuesday, July 10, 2012

Page 17: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

17

Tie strengths in social networks

17Tuesday, July 10, 2012

Page 18: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

18

Tie strengths in social networks

18Tuesday, July 10, 2012

Page 19: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

19Tuesday, July 10, 2012

Page 20: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Understanding the Demographics of Twitter Users; A Mislove, S Lehmann, YY Ahn, JP Onnela, JN Rosenquist; Proceedings of the Fifth International AAAI Conference on Weblogs and Social Media, Barcelona, Spain, 2011

Pulse of the nation: Mood from Twitter

20Tuesday, July 10, 2012

Page 21: Big Data, Social Networks & Human Behavior (Jukka-Pekka Onnela)

Thank you

jponnela.com

22Tuesday, July 10, 2012