Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

15
Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages Soudip Roy Chowdhury, Muhammad Imran , Muhammad Rizwan Asghar, Sihem Amer-Yahia, Carlos Castillo

description

This work describes our work presented at the ISCRAM-2013 conference. We presented Tweet4act system, which is used to detect and classify crisis-related messages communicated over a microblogging platform. Our system relies on extracting content features from each message. These features and the use of an incident-specific dictionary allow us to determine the period type of an incident that each message belongs to.

Transcript of Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Page 1: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Soudip Roy Chowdhury, Muhammad Imran, Muhammad Rizwan Asghar, Sihem Amer-Yahia, Carlos Castillo

Page 2: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Disaster & Social Media

Disaster Strikes, Social Media Responds

Page 3: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Virtual Collaboration, Information Sharing

• Valuable information

• Contribute to situational awareness

• Highly useful, if analyzed timely and effectively(Starbird et al., 2010; Latonero and Shklovski,

2010)

Page 4: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Social Media Response to Disaster Phases

Before

During

After

Page 5: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Disaster Management, Crisis Informatics

- Caution, warnings- Alerts etc.

- Damage- Causalities etc.

- Request for help- Donations etc.

• The main goals of our research:1. Identify messages related to an incident.2. Classify incident-messages with the corresponding

period (PRE, DURING, POST).

Page 6: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Datasets & Examples

1. Joplin Tornado on May 22, 2011

2. Nesat Typhoon in Philipines on Sep 27, 2011

3. Haiti Earthquake on Jan 12, 2010

• [PRE] New #tropical storm forms in the West #Pacific. #Nesat may hit the #Philippines & #China as a #typhoon next week

• [DURING] @Yahoo News: Powerful #typhoon with winds up to 106 mph makes landfall in #Philippines as 100,000 odered to fless homes

• [POST] News5 Action center is now accepting donations for the victims of Typhoon “pedring. Drop boxes are located @ TV5 Office :)

Page 7: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Tweet4Act System• Collection -> Filtering -> Period Classification

Page 8: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

1. Filtering Process

• Normalization: remove “RT @username” and “@username” prefixes and remove duplicate messages

• Apply the k-mediod method with the manhattan distance between medoids and messages in each cluster

• Discard all cluster having a negative number or zero as silhouette coefficient

• Select from each cluster the fraction m messages closer to the mediod

Page 9: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Filtering Process Validation

• Using CrowdFlower crowdsourcing platform

Page 10: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

2. Dictionary Based Period Classification• Most frequent words across datasets

• “warning” & “alert” typically found in the Pre

• “now”, “sweeps” etc. typically found in During

• “aftermath”, “donate” etc. typically found in Post

Page 11: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

3. NLP-Based Period Classification

• Tense of verbs can help identify period. (A. Iyengar

et al., 2011)

POS tagging

1. Dictionary based verbs get +1 (ignore below)

2. Aux verbs get +1(e.g., could-PRE, are-DURING, did-POST)

3. If a main verb in future/present/past tense, add +0.5 to pre/during/post period, respectively.

Ties: PRE > DURING > POST

Page 12: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Manual Period Classification

• CrowdFlower crowdsourcing period labeling

Page 13: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Performance of Tweet4Act

PRE

PRE

PRE

DURING

DURING

DURING

POST

POST

POST

AVG

AVG

AVG

Page 14: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

References

• A. Iyengar, T. Finin, and A. Joshi (2011) Content-based prediction of temporal boundaries for events in Twitter. In Proceedings of the Third IEEE International Conference on Social Computing.

• K. Starbird, L. Palen, A. Hughes, and S. Vieweg (2010) Chatter on the red: what hazards threat reveals about the social life of microblogged information. In Proceedings of the 2010 ACM conference on Computer supported cooperative work, pages 241–250. ACM.

• Latonero, Mark, and Irina Shklovski. "“Respectfully Yours in Safety and Service”: Emergency Management & Social Media Evangelism.” Proceedings of the 7th International ISCRAM Conference– Seattle. Vol. 1. 2010.

Page 15: Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages

Thank you!Muhammad [email protected]