Identifying rumours on Twitter

23
Kim Holmberg, PhD (e) [email protected] (w3) http://kimholmberg.fi Research plan/proposal on April 10, 2014 Identifying rumours on Twitter

description

Some thoughts about

Transcript of Identifying rumours on Twitter

Page 1: Identifying rumours on Twitter

Kim Holmberg, PhD(e) [email protected]

(w3) http://kimholmberg.fi

Research plan/proposal on April 10, 2014

Identifying rumours on Twitter

Page 2: Identifying rumours on Twitter

”We define rumor as ’unverified and instrumentally relevant information statements in circulation that arise in contexts of ambiguity, danger or potential threat, and that function to help people make sense and manage risk’ (DiFonzo and Bordia, 2007: 13).

This definition highlights three facets of social discourse: The context (the situation and/or psychological need out of which discourse arises), function (what people are trying to accomplish by engaging in the discourse), and content (the types of statements uttered).”

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

Page 3: Identifying rumours on Twitter

“Rumors arise from situational contexts that are ambiguous, threatening or potentially threatening, and where people feel a psychological need for understanding or security.”

• “physical insecurity and desire to enhance their sense of security”

• “psychological insecurity and desire to enhance the sense of self”

Context of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

Page 4: Identifying rumours on Twitter

Function of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

Rumors function to make collective sense (sense-making) in an ambiguous situation and/or manage a threat

• “Opportunity to actively avoid or neutralize the threat”• “Interpreting the dreaded negative event in a way that

reduces its emotional impact”• “To build oneself up by putting other groups down”• “Rumors can perform other functions, such as

entertainment, communication of group norms, and defining social network power structure and boundaries, but this is not their primary role.”

Page 5: Identifying rumours on Twitter

Content of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

• Rumours are information statements (they are declarative rather than directive or interrogatory)

• A rumour is transmitted, the information statements are communicated through people

• These information statements are perceived as relatively useful in some way

• Rumours are unverified

Page 6: Identifying rumours on Twitter

How can we identify rumours, verify information and track their origin?

Page 7: Identifying rumours on Twitter

Context – Function – Content

Identifying rumours

Page 8: Identifying rumours on Twitter

Content of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

• Rumours are information statements (they are declarative rather than directive or interrogatory)

• A rumour is transmitted, the information statements are communicated through people

• These information statements are perceived as relatively useful in some way

• Rumours are unverified

Page 9: Identifying rumours on Twitter

A rumour is transmitted

Page 10: Identifying rumours on Twitter

“There are measurable differences in the way microblog messages propagate. The paper shows that these differences are related to the newsworthiness and credibility of the information conveyed, and describes features [e.g. number of followers, URLs, sentiment, etc.] in the tweets, that are effective for classifying information automatically as credible or not credible.”

Castillo, C., Mendoza, M. & Poblete, B. (2012). Predicting information credibility in time-sensitive social media. Internet Research, vol. 23, no. 5, pp. 560-588.

A rumour is transmitted

Page 11: Identifying rumours on Twitter

A rumour is transmittedData about the tweet (e.g. number of retweets, URLs), tweeter (e.g. activity on Twitter) and tweeter’s network (number of following/followers)

Page 12: Identifying rumours on Twitter

Bla bla bla bla bla bla bla ...

Bla bla bla bla bla bla bla ...

Bla bla bla bla bla bla bla ...

Rumours are information statements

Page 13: Identifying rumours on Twitter

Rumours are information statements

Bordia, P. & Difonzo, N. (2004). Problem Solving in Social Interactions on the Internet: Rumor As Social Cognition. Social Psychology Quarterly, vol. 67, no. 33. DOI: 10.1177/019027250406700105

Most prevalent types of statements identified by RIAS (Rumor Interaction Analysis System):

• Interrogatory: questions seeking information• Sensemaking: implied attempts at solving a problem, namely

whether or not the rumor was true• Providing information: were statements that included

material relevant to the rumor under discussion• Disbelief: indicated disbelief a person’s disbelief in the rumor• Digressive: were not directly relevant to the original rumor,

as in narrating some experience or seeking information that was not relevant.

Page 14: Identifying rumours on Twitter

• Interrogatory

• Sensemaking

• Providing information

• Disbelief

• Digressive

Page 15: Identifying rumours on Twitter

Rumours are information statements

http://www.pheme.eu/

Four types of phemes (rumorous memes) or online rumours based on their level of harmfullness:

• Speculation: the forming of a theory or conjecture without firm evidence

• Controversy: prolonged public disagreement or heated discussion

• Misinformation: false or inaccurate information that is spread unintentionally

• Disinformation: intentionally false or inaccurate information that is spread deliberately

Page 16: Identifying rumours on Twitter

Identifying rumours: Research proposal

• Spike and/or Burst detectiono detect sudden increases in word frequencies

• Co-word analysiso map connections between frequent words

• Network analysiso identify word clusters, central words and outliers,

but perhaps also data about the tweeters and their social networks to identify the most likely tweeters to start/spread rumours (see Castillo et al., 2012)

To identify relevant words and phrases, that may be unique for rumours:

Page 17: Identifying rumours on Twitter

Identifying rumours: Research proposal

Context

Information

Rumours

Page 18: Identifying rumours on Twitter

Identifying rumours: Research proposal

• Annotation scheme• Tweet level vs word level annotation• Ideally something in between or a combination

• Level of ambiguity• Level of uncertainty/skepticism• Level of sensationalism• Sentiment of tweet (see Castillo et al., 2012)• and more...

Page 19: Identifying rumours on Twitter

Identifying rumours: Research proposal

Annotation scheme: Example

Context

Function

Content (that may indicate a possible rumour)

Vague, ambiguous, uncertainty

Page 20: Identifying rumours on Twitter

cat1 cat2 cat3 cat4 cat505

10152025303540

cat1 cat2 cat3 cat4 cat505

10152025303540

cat1 cat2 cat3 cat4 cat505

10152025303540

Identifying rumours: Research proposalUsing the chosen measures categorize tweets into speculative, controversal, misinformation and disinformation

cat1 cat2 cat3 cat4 cat505

10152025303540

Speculation Controversy

Misinformation Disinformation

Page 21: Identifying rumours on Twitter

Identifying rumours: Research proposal

Speculation

Controversy

Misinformation

Disinformation

Verified information

Unclear

0

200

400

Perhaps a tweet can simultaneously score on different measures?

Page 22: Identifying rumours on Twitter

World Economic Forum

Top 10 trends of 2014:10. The rapid spread of misinformation online

“Finally, we should remember that every case of misinformation is unique and should be considered independently, paying attention to the complexities of the ecosystem it circulates within. In terms of interpreting misinformation, human evaluation will remain essential to put information into context, and context is ultimately what this is all about.”

http://reports.weforum.org/outlook-14/top-ten-trends-category-page/10-the-rapid-spread-of-misinformation-online/

Page 23: Identifying rumours on Twitter

Kim [email protected] http://kimholmberg.fi @kholmber

Thank you for your attention