DISTINGUISHING TOPICAL AND SOCIAL GROUPS BASED ON COMMON IDENTITY AND BOND THEORY


Page 1: DISTINGUISHING TOPICAL AND SOCIAL GROUPS BASED ON COMMON IDENTITY AND BOND THEORY

Przemyslaw A. Grabowicz, Luca M. Aiello, Víctor M. Eguíluz, Alejandro Jaimes

Page 2:

We built a classifier that distinguishes whether a given set of people is a social or a topical group.

Page 3:

What are social and topical groups?

Page 4:

Social groups

[Image: friends. Source: www.news.com.au]

Page 5:

Characteristics of social groups

• Direct reciprocity of interactions

• Small talk (broad range of topics in conversations)

Page 6:

Topical groups

[Image: a camera club. Source: http://www.flickr.com/photos/59571907@N03/5545401056/]

Page 7:

Characteristics of topical groups

• General reciprocity of interactions

• Conversations on a narrow range of topics

Page 8:

Two types of metrics

• Based on reciprocity of interactions

• Based on diversity of topics (Shannon’s entropy)

Page 9:

Reciprocity metrics

1. Intra-reciprocity (intra-group reciprocity): [formula not preserved in the transcript]

2. Inter-reciprocity: [formula not preserved in the transcript]
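The formulas on this slide did not survive extraction, so the following is only a minimal sketch of one plausible reading, not the authors' definitions. It assumes reciprocity is the fraction of directed interaction links that are returned, computed over links inside the group (intra) and over links crossing the group boundary (inter). The function names and the edge-list representation are hypothetical.

```python
from typing import List, Set, Tuple

# A directed interaction, e.g. "source commented on a photo of target".
Edge = Tuple[str, str]


def reciprocity(edges: List[Edge]) -> float:
    """Fraction of distinct directed edges whose reverse edge also exists."""
    edge_set = {(u, v) for u, v in edges if u != v}  # ignore self-loops
    if not edge_set:
        return 0.0
    returned = sum(1 for (u, v) in edge_set if (v, u) in edge_set)
    return returned / len(edge_set)


def intra_reciprocity(edges: List[Edge], members: Set[str]) -> float:
    """Reciprocity over edges with both endpoints inside the group (assumed definition)."""
    inside = [(u, v) for u, v in edges if u in members and v in members]
    return reciprocity(inside)


def inter_reciprocity(edges: List[Edge], members: Set[str]) -> float:
    """Reciprocity over edges with exactly one endpoint inside the group (assumed definition)."""
    crossing = [(u, v) for u, v in edges if (u in members) != (v in members)]
    return reciprocity(crossing)


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    toy_edges = [("a", "b"), ("b", "a"), ("a", "c"),
                 ("c", "d"), ("d", "c"), ("b", "d"), ("a", "d")]
    toy_group = {"a", "b", "c"}
    print(intra_reciprocity(toy_edges, toy_group))
    print(inter_reciprocity(toy_edges, toy_group))
```

Under this reading, a social group would be expected to show high intra-reciprocity (direct reciprocity between members), while a topical group would show less of it.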

Page 10:

Diversity of topics’ metrics

1. H(g): Shannon’s entropy of terms/tags

2. H(g) normalized by the average for all groups having the same number of terms
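As a rough illustration of the two entropy metrics, the sketch below computes Shannon's entropy of a group's tags, H = sum over tags of p * log2(1/p), and then divides it by the mean entropy of all groups with the same number of terms. It assumes "number of terms" means the total count of tag occurrences; the slide does not say whether distinct or total terms are meant. Function names and sample tags are hypothetical.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, List


def shannon_entropy(tags: List[str]) -> float:
    """Shannon entropy (in bits) of the tag frequency distribution."""
    if not tags:
        return 0.0
    counts = Counter(tags)
    total = len(tags)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def normalized_entropies(groups: Dict[str, List[str]]) -> Dict[str, float]:
    """Entropy of each group divided by the mean entropy of same-size groups.

    Assumption: "same number of terms" is read as the same total number
    of tag occurrences, i.e. len(tags).
    """
    raw = {name: shannon_entropy(tags) for name, tags in groups.items()}
    by_size = defaultdict(list)
    for name, tags in groups.items():
        by_size[len(tags)].append(raw[name])
    mean_by_size = {n: sum(vals) / len(vals) for n, vals in by_size.items()}
    result = {}
    for name, tags in groups.items():
        mean = mean_by_size[len(tags)]
        result[name] = raw[name] / mean if mean > 0 else 0.0
    return result


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    toy_groups = {
        "camera_club": ["nikon", "nikon", "nikon", "lens"],
        "friends": ["beach", "party", "dog", "lens"],
    }
    print(normalized_entropies(toy_groups))
```

In this toy case the narrowly focused group ends up below 1 and the broad one above 1, which matches the intuition that topical groups talk about a narrow range of topics.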

Page 11:

Dataset – Flickr, 2008

Tags extracted from photos:

• from a group pool

• commented in a group

• favorited in a group

Page 12:

Human labeling of groups

Consists of exploring:

• text of comments

• group profiles

• photos

• tags

• maps

Page 13:

Results

[Figures: normalized entropy; reciprocity]

Page 14:

Classifier

1. Score S_g

Features: t_g, u_g, h_g

AUC: 0.75

Accuracy: 0.76

2. Classifier

Features: t_g, u_g, h_g, H_g, a_g, b_g, size, E_g

AUC: 0.88

Accuracy: 0.80

Feature ranking:

1. h_g for comments

2. t_g for comments

3. u_g for comments

4. h_g for favorites

5. b_g for comments

AUC: 0.87

Accuracy: 0.80
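For readers unfamiliar with the two evaluation numbers, the sketch below shows how AUC and accuracy can be computed for any per-group score. It does not reproduce S_g, whose formula is not given in the transcript. The scores, labels, threshold, and the choice of 1 = social, 0 = topical are all invented for illustration.

```python
from typing import Sequence


def auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a random positive example gets a higher
    score than a random negative one (ties count as one half).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def accuracy(scores: Sequence[float], labels: Sequence[int], threshold: float) -> float:
    """Fraction of examples classified correctly when score >= threshold means 1."""
    predictions = [1 if s >= threshold else 0 for s in scores]
    correct = sum(1 for p, y in zip(predictions, labels) if p == y)
    return correct / len(labels)


if __name__ == "__main__":
    # Hypothetical scores and human labels (1 = social, 0 = topical).
    toy_scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.1]
    toy_labels = [1, 1, 1, 0, 0, 0]
    print(auc(toy_scores, toy_labels))
    print(accuracy(toy_scores, toy_labels, threshold=0.5))
```

AUC does not depend on a threshold, whereas accuracy does, which is one reason the slide reports both.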

Page 15:

Conclusions

Findings:

• The metrics work as the theory predicts

• Agreement and accuracy depend on the value of the score

• Groups found with a community detection algorithm are more social than declared groups

Future work:

• Entropy is a simple measure; it could be replaced with something that understands text

  o NLP

• A binary classifier has its limitations

  o multi-label classification