Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online...

19
Formation and evolution mechanisms in online network of students: the Vkontakte case Sofia Dokuka, Diliara Valeeva, Maria Yudkevich Center for Institutional Studies, NRU HSE, Moscow, Russia [email protected] April 10, 2015 S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 1 / 19

Transcript of Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online...

Page 1: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Formation and evolution mechanisms in online networkof students: the Vkontakte case

Sofia Dokuka, Diliara Valeeva, Maria Yudkevich

Center for Institutional Studies, NRU HSE, Moscow, Russia

[email protected]

April 10, 2015

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 1 / 19

Page 2: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Backgroung

Computer Science stream: Analysis of node-level characteristics andtheir changes (Backstrom 2006, Leskovec 2008, Kairam 2012)

Sociological stream: Coevolution of social network and actors’behavior (Snijders 2010, Lewis 2012)

No studies about the evolution of the social network from the verybeginning (no ties between actors) to the end of the network growth.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 2 / 19

Page 3: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Network growth models

Erdos-Renyi model is obtained by starting with a set of n isolatednodes and adding successive edges between them at random. Themodel does not adequately describe properties of real networks.

Barabasi-Albert model begins with an initial connected network andthen new nodes are added to it. The probability of connecting to theparticular node is proportional to the number of edges that theexisting nodes already have. The model captures well the degreedistribution, but cannot describe high clustering.

Watts-Strogatz model begins from a regular ring lattice with nnodes which is connected to k neighbors. The number of closedtriads in the network will be relatively high. To provide the shortAPL, the option of edge rewiring is added. The edge disconnectsfrom one of its nodes and randomly connects to another node.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 3 / 19

Page 4: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

This study

Research question: How does the social network form and evolve?

Network has a restricted and known number of participants (onecohort of students);

Online network from the very formation till stabilization.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 4 / 19

Page 5: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Case description

We investigated the formation and evolution of student online socialnetwork in one of the top Russian universities.

Students get accepted to university in 3 groups of applicants:according to Olympiad results, USE results, and others get acceptedat full-tuition places;

Overall, in 2014, 97 students who studied both at tuition-free and fulltuition places were matriculated;

In this university, students study in groups of up to 30 students. Thedivision into study groups is random;

Lectures are usually delivered to several groups simultaneously, whileseminar classes are delivered to each group separately;

A very small number of students in our sample knew each otherbefore the university.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 5 / 19

Page 6: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Data collection procedure

We found students Vkontakte profiles according to their names in apublic list of enrolled students. Overall - 97 students, we collectednetworks of 71 students;

We gathered publicly open data about student ego-networks usingVkminer;

13 waves with an average time period between waves in one week andfixing for important events.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 6 / 19

Page 7: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Descriptive statistics

Figure: Number of nodes dynamics Figure: Number of edges dynamics

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 7 / 19

Page 8: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Vkontakte networks

Figure: The network growth. Nodes are students Vkontakte accounts, edges arefriendship links between accounts. The size of the node reflects the degreecentrality. Nodes are fixed at the same place

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 8 / 19

Page 9: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Method

Conditional Uniform Graph test (CUG-test)(Anderson et al, 1999)compares network graph-level indices with the same measures of therandom networks;

We compare the observed network at each timepoint with threerandom networks of the same size (based on Erdos-Renyi,Barabasi-Albert, and Watts-Strogatz models);

The graph-level indices of the random networks were calculated as amean value from the distribution of 10000 random networks;

Indices for comparison: clustering coefficient, size of giant component,average path length, network diameter, and degree assortativity.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 9 / 19

Page 10: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Results (1): Clustering coefficient

Figure: Clustering coefficient dynamics. Vertical line corresponds to September 1,start of the academic year.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 10 / 19

Page 11: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Results (2): Network APL and diameter

Figure: Average path length dynamics.Vertical line corresponds to September1, start of the academic year.

Figure: Network diameter dynamics.Vertical line corresponds to September1, start of the academic year.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 11 / 19

Page 12: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Results (3): Network assortativity and giant component

Figure: Network assortativity dynamics.Vertical line corresponds to September1, start of the academic year.

Figure: Giant compotent size dynamics.Vertical line corresponds to September1, start of the academic year.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 12 / 19

Page 13: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Results: description

The clustering coefficient of the observed network was much higher(sig at 0.01%) than for any random network.

The dynamics of average path length and diameter for theobserved network well-described by random networks. At thebeginning, these indices are small because there are few edges in thenetwork. After the September 1 both measures increased. Asstudents create many links, indices decrease.

The degree assortativity for the observed network during the firstobservations was much higher (0.01%) than can be expected byrandom. After the students offline acquaintance, assortativitydecreased.

The size of the giant component evolves in a similar way for bothobserved and simulated networks. The number of nodes in therandom networks tends to be higher than in an observed one (sig at0.01%).

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 13 / 19

Page 14: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Summary

We trace a tendency of friends of friends to be connected in realworld social networks;

The average path length, network diameter and giantcomponent dynamics shows that at the beginning students tend tojoin giant component and arrange connections with their peers. Atthe late moments of observation students form dense groups;

Degree assortativity dynamics shows the absence of degree-basedhierarchy after September 1. Students befriend each other withoutpreselection. The potential aim of this process is to add themaximum amount of classmates to gather information from the newenvironment.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 14 / 19

Page 15: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Formation and evolution mechanisms in online network ofstudents

Figure: Connected componentattachment mechanism

Figure: Brokerage mechanism

The connected component mechanism implies that nodes look for anoption to join the connected component. Connectedness with peersmight be useful for receiving new information (Vaquero, 2013);

The brokerage mechanism based on the intuition that two people witha common friend tend to become friends with each othe (Burt, 2004).

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 15 / 19

Page 16: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Thank you for attention!Questions?

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 16 / 19

Page 17: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Additional materials

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 17 / 19

Page 18: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

University admission procedure

In Russia students might enroll in universities by several differenttrajectories.

1 Olympiads. Olympiads are all-Russian competitions for talentedschool students in a wide range of subjects. Students that becomewinners and awardees of these Olympiads can be accepted to anyuniversity of their choice without additional exams free of tuition;

2 USE. The USE is a standardized test in various subject areas and hasa unified grading scale throughout all of Russia. Universities acceptstudents with the highest USE scores free of tuition;

3 Students that are not Olympiad winners and have low USE scores,might study with full tuition.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 18 / 19

Page 19: Sofia Dokuka, Diliara Valeeva, Maria Yudkevich - Formation and evolution mechanisms in online network of students: the Vkontakte case

Vkontakte

Vkontakte is a popular Russian social networking site with monthlyaudience 55 million users. Vkontakte interface and options are very similarto Facebook, it is often called ”Russian Facebook”.

1 Vkontakte is very popular among youth and students;

2 Vkontakte has an open API system which allows to download datadirectly from the site;

3 VKminer software (http://linis.hse.ru/en/soft-linis) allows todownload publicly available data from the Vkontakte site. Developedby Laboratory for Internet Studies.

S. Dokuka, D. Valeeva, M. Yudkevich (HSE) AIST 2015, Yekaterinburg April 10, 2015 19 / 19