Cikm2011 doallbirdstweetthesame

18
Do All Birds Tweet the Same? Characterizing Twitter around the World Barbara Poblete, Ruth García , Marcelo Mendoza and Alejandro Jaimes University of Chile, Yahoo! Research Barcelona Universitat Pompeu de Fabra, Universidad Federico Santa Maria October 26, 2011, Glasgow, CIKM

Transcript of Cikm2011 doallbirdstweetthesame

Do All Birds Tweet the Same?Characterizing Twitter around the World

Barbara Poblete, Ruth García, Marcelo Mendoza and Alejandro Jaimes

University of Chile, Yahoo! Research BarcelonaUniversitat Pompeu de Fabra, Universidad Federico

Santa Maria

October 26, 2011,Glasgow, CIKM

Objective

Identify differences and similarities in the use of social media by analyzing tweets and

network of friends in different countries.

What we did

Data: analysis of one year of Tweets for 10 most active countries

Content: languages, sentiment, structure (retweets, hashtags,..)

Structure: network (modularity, average path length, reciprocity, connectivity)

Crawling Strategy

12,964,735 active users -> 6,263,457 with valid location

4,736, 629 users belonging to 10 most active countries.

5,270,609,213 tweets during 2010

.

United StatesBrazil

United KingdomJapan

IndonesiaCanadaMéxico

NetherlandsSouth Korea

Australia

0.00

%

10.0

0%

20.0

0%

30.0

0%

40.0

0%

50.0

0%

TWEETS(%)USERS(%)

Engagement

Indonesia Japan Brazil

Netherlands UK US

Australia Mexico

South Korea Canada

0 200 400 600 800 100012001400160018002000

(Tweets/User)

Languages

English Portuguese Japanese Spanish Dutch German Indonesian and Malay

Korean0

200,000,000

400,000,000

600,000,000

800,000,000

1,000,000,000

1,200,000,000

1,400,000,000

1,600,000,000

1,800,000,000

Netherlands >10%,Indonesia >10%,Mexico >10%,South Korea >10%

How do Tweets differ in function?

Macro average

Country (Tweets/User) URL (%) Hashtag (%) Mention(%) Retweet(%)

Indonesia 1813.53 14.95 7.63 58.24 9.71

Japan 1617.35 16.30 6.81 39.14 5.65

Brazil 1370.27 19.23 13.41 45.57 12.80

Netherlands 1026.44 24.40 18.24 42.33 9.12

UK 930.58 27.11 13.03 45.61 11.65

US 900.79 32.64 14.32 40.03 11.78

Australia 897.41 31.37 14.89 43.27 11.73

Mexico 865.70 17.49 12.38 49.79 12.61

South Korea 853.92 19.67 5.83 58.02 9.02

Canada 806.00 31.09 14.68 42.50 12.50

How do Tweets differ in function?

Macro average

Country (Tweets/User) URL (%) Hashtag (%) Mention(%) Retweet(%)

Indonesia 1813.53 14.95 7.63 58.24 9.71

Japan 1617.35 16.30 6.81 39.14 5.65

Brazil 1370.27 19.23 13.41 45.57 12.80

Netherlands 1026.44 24.40 18.24 42.33 9.12

UK 930.58 27.11 13.03 45.61 11.65

US 900.79 32.64 14.32 40.03 11.78

Australia 897.41 31.37 14.89 43.27 11.73

Mexico 865.70 17.49 12.38 49.79 12.61

South Korea 853.92 19.67 5.83 58.02 9.02

Canada 806.00 31.09 14.68 42.50 12.50

How do Tweets differ in function?

Macro average

Country (Tweets/User) URL (%) Hashtag (%) Mention(%) Retweet(%)

Indonesia 1813.53 14.95 7.63 58.24 9.71

Japan 1617.35 16.30 6.81 39.14 5.65

Brazil 1370.27 19.23 13.41 45.57 12.80

Netherlands 1026.44 24.40 18.24 42.33 9.12

UK 930.58 27.11 13.03 45.61 11.65

US 900.79 32.64 14.32 40.03 11.78

Australia 897.41 31.37 14.89 43.27 11.73

Mexico 865.70 17.49 12.38 49.79 12.61

South Korea 853.92 19.67 5.83 58.02 9.02

Canada 806.00 31.09 14.68 42.50 12.50

Happiness Level

ENGLISH TWEETS SPANISH TWEETS

01 02 03 04 05 06 07 08 09 10 11 125

5.2

5.4

5.6

5.8

6

6.2

6.4

6.6

6.8

7

AustraliaBrazilCanadaIndonesiaJapanMexicoNether-landsSouth Ko-reaUKUSA

01 02 03 04 05 06 07 08 09 10 11 125

5.2

5.4

5.6

5.8

6

6.2

6.4

6.6

6.8

7

Dodds et al. “Temporal patters of happiness and information in a global social network: Hedonometrics and Twitter”, 2011

Network

Coverage with regard to the initial complete graphActive nodesEdges within the same location

Modularity

13.49 17.22 17.27 18.91 22.11 23.51 26.11 26.79 28.14 32.01 (%)

06:02 AM

Connectivity

Connectivity

Connectivity

Conclusions

Some use Twitter more for conversation and others for formal information dissemination (links of news, pics, etc).

Higher conversational level seems to be related to more happy tweets (less formality?).

Twitter networks seems to be less reciprocal and more hierarchical

Smaller networks tend to have more reciprocity

High reciprocity seems to lead to more activity per user

Reciprocity does not mean more connectivity: Indonesia has reciprocity in small communities with low connectivity among them

THANK [email protected]