Identifying rumours on Twitter

Post on 28-Nov-2014

496 views 0 download

description

Some thoughts about

Transcript of Identifying rumours on Twitter

Kim Holmberg, PhD(e) kim.holmberg@abo.fi

(w3) http://kimholmberg.fi

Research plan/proposal on April 10, 2014

Identifying rumours on Twitter

”We define rumor as ’unverified and instrumentally relevant information statements in circulation that arise in contexts of ambiguity, danger or potential threat, and that function to help people make sense and manage risk’ (DiFonzo and Bordia, 2007: 13).

This definition highlights three facets of social discourse: The context (the situation and/or psychological need out of which discourse arises), function (what people are trying to accomplish by engaging in the discourse), and content (the types of statements uttered).”

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

“Rumors arise from situational contexts that are ambiguous, threatening or potentially threatening, and where people feel a psychological need for understanding or security.”

• “physical insecurity and desire to enhance their sense of security”

• “psychological insecurity and desire to enhance the sense of self”

Context of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

Function of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

Rumors function to make collective sense (sense-making) in an ambiguous situation and/or manage a threat

• “Opportunity to actively avoid or neutralize the threat”• “Interpreting the dreaded negative event in a way that

reduces its emotional impact”• “To build oneself up by putting other groups down”• “Rumors can perform other functions, such as

entertainment, communication of group norms, and defining social network power structure and boundaries, but this is not their primary role.”

Content of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

• Rumours are information statements (they are declarative rather than directive or interrogatory)

• A rumour is transmitted, the information statements are communicated through people

• These information statements are perceived as relatively useful in some way

• Rumours are unverified

How can we identify rumours, verify information and track their origin?

Context – Function – Content

Identifying rumours

Content of a rumour

DiFonzo, N. & Bordia, P. (2007). Rumor, Gossip and Urban Legends. Diogenes, vol. 54, no. 19. DOI: 10.1177/0392192107073433

• Rumours are information statements (they are declarative rather than directive or interrogatory)

• A rumour is transmitted, the information statements are communicated through people

• These information statements are perceived as relatively useful in some way

• Rumours are unverified

A rumour is transmitted

“There are measurable differences in the way microblog messages propagate. The paper shows that these differences are related to the newsworthiness and credibility of the information conveyed, and describes features [e.g. number of followers, URLs, sentiment, etc.] in the tweets, that are effective for classifying information automatically as credible or not credible.”

Castillo, C., Mendoza, M. & Poblete, B. (2012). Predicting information credibility in time-sensitive social media. Internet Research, vol. 23, no. 5, pp. 560-588.

A rumour is transmitted

A rumour is transmittedData about the tweet (e.g. number of retweets, URLs), tweeter (e.g. activity on Twitter) and tweeter’s network (number of following/followers)

Bla bla bla bla bla bla bla ...

Bla bla bla bla bla bla bla ...

Bla bla bla bla bla bla bla ...

Rumours are information statements

Rumours are information statements

Bordia, P. & Difonzo, N. (2004). Problem Solving in Social Interactions on the Internet: Rumor As Social Cognition. Social Psychology Quarterly, vol. 67, no. 33. DOI: 10.1177/019027250406700105

Most prevalent types of statements identified by RIAS (Rumor Interaction Analysis System):

• Interrogatory: questions seeking information• Sensemaking: implied attempts at solving a problem, namely

whether or not the rumor was true• Providing information: were statements that included

material relevant to the rumor under discussion• Disbelief: indicated disbelief a person’s disbelief in the rumor• Digressive: were not directly relevant to the original rumor,

as in narrating some experience or seeking information that was not relevant.

• Interrogatory

• Sensemaking

• Providing information

• Disbelief

• Digressive

Rumours are information statements

http://www.pheme.eu/

Four types of phemes (rumorous memes) or online rumours based on their level of harmfullness:

• Speculation: the forming of a theory or conjecture without firm evidence

• Controversy: prolonged public disagreement or heated discussion

• Misinformation: false or inaccurate information that is spread unintentionally

• Disinformation: intentionally false or inaccurate information that is spread deliberately

Identifying rumours: Research proposal

• Spike and/or Burst detectiono detect sudden increases in word frequencies

• Co-word analysiso map connections between frequent words

• Network analysiso identify word clusters, central words and outliers,

but perhaps also data about the tweeters and their social networks to identify the most likely tweeters to start/spread rumours (see Castillo et al., 2012)

To identify relevant words and phrases, that may be unique for rumours:

Identifying rumours: Research proposal

Context

Information

Rumours

Identifying rumours: Research proposal

• Annotation scheme• Tweet level vs word level annotation• Ideally something in between or a combination

• Level of ambiguity• Level of uncertainty/skepticism• Level of sensationalism• Sentiment of tweet (see Castillo et al., 2012)• and more...

Identifying rumours: Research proposal

Annotation scheme: Example

Context

Function

Content (that may indicate a possible rumour)

Vague, ambiguous, uncertainty

cat1 cat2 cat3 cat4 cat505

10152025303540

cat1 cat2 cat3 cat4 cat505

10152025303540

cat1 cat2 cat3 cat4 cat505

10152025303540

Identifying rumours: Research proposalUsing the chosen measures categorize tweets into speculative, controversal, misinformation and disinformation

cat1 cat2 cat3 cat4 cat505

10152025303540

Speculation Controversy

Misinformation Disinformation

Identifying rumours: Research proposal

Speculation

Controversy

Misinformation

Disinformation

Verified information

Unclear

0

200

400

Perhaps a tweet can simultaneously score on different measures?

World Economic Forum

Top 10 trends of 2014:10. The rapid spread of misinformation online

“Finally, we should remember that every case of misinformation is unique and should be considered independently, paying attention to the complexities of the ecosystem it circulates within. In terms of interpreting misinformation, human evaluation will remain essential to put information into context, and context is ultimately what this is all about.”

http://reports.weforum.org/outlook-14/top-ten-trends-category-page/10-the-rapid-spread-of-misinformation-online/

Kim Holmbergkim.holmberg@abo.fi http://kimholmberg.fi @kholmber

Thank you for your attention