Tracking discourse on social media

Post on 21-Mar-2017

168 views 3 download

Transcript of Tracking discourse on social media

Tracking Discourse on Social Media

Archives Unleashed: Web Archive HackathonToronto, Ontario

Team Critical Load Average

Two events:

● Charlie Hebdo shooting (Jan 7, 2015)● Bataclan attack (Nov 13, 2015)

Two social media sites:

● Reddit● Twitter

TRACKING DISCOURSE ON SOCIAL MEDIA

Four approaches:

● Attention span● Information flow● Topic modeling● Network analysis

REDDIT COMMENT:

TWEET:

REDDIT DATA

~50M comments a month on Reddit

13M comments the week following Hebdo shooting

25M comments the week following Bataclan attack

48,840 comments about the Hebdo shooting

110,520 comments about the Bataclan attack

6,964,831 bataclan tweets (english, nov. 13 - nov. 19)

4,280,030 hebdo tweets (english, jan. 7 - jan 13)

All our data

Use ALL available resources

60GB of JSON -> 1GB of txt

Attention on social media

Longitudinal Analysis (spread of information/misinformation)

(http://www.cs.odu.edu/~anwala/files/temp/archivesUnleashedHackathon/Bataclan_Twitter.html)

Longitudinal Analysis (evolution of conversation)

day 1 day 2 day 3 day 4 day 5 day 6 day 7

(http://www.cs.odu.edu/~anwala/files/temp/archivesUnleashedHackathon/Bataclan_Twitter.html)

Topic Modeling

Topic Modeling

NETWORK ANALYSIS: Word co-occurrence pattern for Charlie Hebdo

NETWORK ANALYSIS: Word co-occurrence pattern for Bataclan

FURTHER RESEARCH

● Longer time spans● Other types of events● Categorization (hashtags or subreddits)

This project brought to you by Team Critical Load Average:

Alexander Nwala, Old Dominion UniversityAllison Hegel, UCLAFederico Nanni, University of BolognaJonathan Armoza, NYUKelsey Utne, Cornell UniversityNick Ruest, York UniversityYu Xu, USC