Quantifying the Invisible Audience in Social Networks

58
stanford hci group Quantifying the Invisible Audience in Social Networks Eytan Bakshy, Moira Burke, Brian Karrer Facebook Data Science Team Michael Bernstein Stanford Computer Science Department

description

Presented at CHI 2013 When you share content in an online social network, who is listening? Users have scarce information about who actually sees their content, making their audience seem invisible and difficult to estimate. However, understanding this invisible audience can impact both science and design, since perceived audiences influence content production and self-presentation online. In this paper, we combine survey and large-scale log data to examine how well users’ perceptions of their audience match their actual audience on Facebook. We find that social media users consistently underestimate their audience size for their posts, guessing that their audience is just 27% of its true size. Qualitative coding of survey responses reveals folk theories that attempt to reverse-engineer audience size using feedback and friend count, though none of these approaches are particularly accurate. We analyze audience logs for 222,000 Facebook users’ posts over the course of one month and find that publicly visible signals — friend count, likes, and comments — vary widely and do not strongly indicate the audience of a single post. Despite the variation, users typically reach 61% of their friends each month. Together, our results begin to reveal the invisible undercurrents of audience attention and behavior in online social networks.

Transcript of Quantifying the Invisible Audience in Social Networks

Page 1: Quantifying the Invisible Audience in Social Networks

stanford hci group

Quantifying the Invisible Audience in Social Networks

Eytan Bakshy, Moira Burke, Brian KarrerFacebook Data Science Team

Michael BernsteinStanford Computer Science Department

Page 2: Quantifying the Invisible Audience in Social Networks

Sharing on a social network is like giving a talk from behind a curtain.

Page 3: Quantifying the Invisible Audience in Social Networks

Sharing on a social network is like giving a talk from behind a curtain.

2

Page 4: Quantifying the Invisible Audience in Social Networks
Page 5: Quantifying the Invisible Audience in Social Networks

Quantify the difference between users’ estimated and actual audience

Page 6: Quantifying the Invisible Audience in Social Networks

Quantify the difference between users’ estimated and actual audience

Measure audience size uncertainty for 220,000 Facebook users

Page 7: Quantifying the Invisible Audience in Social Networks

Our perception of audience size affects our behaviorWe guide our audience’s impression of us[Go!man 1959]

We manage the boundaries of when to engage [Altman 1975]

On social media, we speak to the audience that we expect is listening[Marwick and boyd 2011, Viégas 1999]

Page 8: Quantifying the Invisible Audience in Social Networks

Our perception of audience size affects our behaviorWe guide our audience’s impression of us[Go!man 1959]

We manage the boundaries of when to engage [Altman 1975]

On social media, we speak to the audience that we expect is listening[Marwick and boyd 2011, Viégas 1999]

What if our audience size estimates are inaccurate?

Page 9: Quantifying the Invisible Audience in Social Networks

Perceived audience vs. reality - survey - folk theories of audience - desired audience size

Predictability of audience size - using friend count - using feedback

Page 10: Quantifying the Invisible Audience in Social Networks

Perceived audience vs. reality - survey - folk theories of audience - desired audience size

Predictability of audience size - using friend count - using feedback

Page 11: Quantifying the Invisible Audience in Social Networks

MethodData

220,000 U.S. Facebook users who share with friends-only privacy

Collected audience information for their status updates and link shares over 30 days

150,000,000 viewer-story pairs

Page 12: Quantifying the Invisible Audience in Social Networks

MethodAudience size measurement

Javascript tracking whether a story remained in the browser viewport for at least 900ms

Page 13: Quantifying the Invisible Audience in Social Networks

MethodAudience size measurement

Javascript tracking whether a story remained in the browser viewport for at least 900ms

Page 14: Quantifying the Invisible Audience in Social Networks

MethodAudience size measurement

Javascript tracking whether a story remained in the browser viewport for at least 900ms

900ms

Page 15: Quantifying the Invisible Audience in Social Networks

MethodAudience size measurement

Javascript tracking whether a story remained in the browser viewport for at least 900ms

Page 16: Quantifying the Invisible Audience in Social Networks

MethodAudience size measurement

Javascript tracking whether a story remained in the browser viewport for at least 900ms

Not a direct measure of attention: users remember ~70% of posts they see[Counts and Fisher 2011]

Page 17: Quantifying the Invisible Audience in Social Networks

MethodSurvey

Recruited users with recent content (2-90 days ago) via a request at the top of news feed

N=589; 61% female; mean age 33

Audience size surveyShow participants their most recent story“How many people do you think saw it?”“Describe how you came up with that number.”“How many people do you wish saw this content?”

Page 18: Quantifying the Invisible Audience in Social Networks

MethodAnalysis

Compare participants’ actual audience size totheir estimated audience size

Page 19: Quantifying the Invisible Audience in Social Networks

MethodAnalysis

Compare participants’ actual audience size totheir estimated audience size

Consider your own most recent status update: What percentage of your social network do you think saw it?

Page 20: Quantifying the Invisible Audience in Social Networks

Users underestimate by 4xResults

Page 21: Quantifying the Invisible Audience in Social Networks

Users underestimate by 4xResults

0%

20%

40%

60%

80%

100%

0% 20% 40% 60% 80% 100%Actual audience (% of friends)

Page 22: Quantifying the Invisible Audience in Social Networks

0%

20%

40%

60%

80%

100%

0% 20% 40% 60% 80% 100%Actual audience (% of friends)

Perc

eive

d au

dien

ce (%

of f

riend

s)

Users underestimate by 4xResults

Page 23: Quantifying the Invisible Audience in Social Networks

Users underestimate by 4xResults

Accurate estimations along the diagonal

0%

20%

40%

60%

80%

100%

0% 20% 40% 60% 80% 100%Actual audience (% of friends)

Perc

eive

d au

dien

ce (%

of f

riend

s)

Page 24: Quantifying the Invisible Audience in Social Networks

Users underestimate by 4xResults

Accurate estimations along the diagonal

0%

20%

40%

60%

80%

100%

0% 20% 40% 60% 80% 100%Actual audience (% of friends)

Perc

eive

d au

dien

ce (%

of f

riend

s)

overestimates

underestimates

Page 25: Quantifying the Invisible Audience in Social Networks

0%

20%

40%

60%

80%

100%

0% 20% 40% 60% 80% 100%Actual audience (% of friends)

Perc

eive

d au

dien

ce (%

of f

riend

s)

Users underestimate by 4xResults

Estimated20 friends = 6% of network

Actual78 friends = 24% of network

R2 = 0.04

overestimates

underestimates

Page 26: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

Inductive coding on participants’ reasons for how estimating their audience (Fleiss’s Kappa = 0.72)

Page 27: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

Inductive coding on participants’ reasons for how estimating their audience (Fleiss’s Kappa = 0.72)

Random guess 23%

Page 28: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

Inductive coding on participants’ reasons for how estimating their audience (Fleiss’s Kappa = 0.72)

Random guess 23%Feedback — likes and comments 21%

“I figured about half of the people who see it will ‘like’ it, or comment on it”

Page 29: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

Inductive coding on participants’ reasons for how estimating their audience (Fleiss’s Kappa = 0.72)

Random guess 23%Feedback — likes and comments 21%Fraction of friend count 15%

“Maybe a third of my friends saw it.”

Page 30: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

Inductive coding on participants’ reasons for how estimating their audience (Fleiss’s Kappa = 0.72)

Random guess 23%Feedback — likes and comments 21%Fraction of friend count 15%Login timing 9%

“Not a lot of people stay up late at night”

Page 31: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

Inductive coding on participants’ reasons for how estimating their audience (Fleiss’s Kappa = 0.72)

Random guess 23%Feedback — likes and comments 21%Fraction of friend count 15%Login timing 9%Friends seen active on the site 5%Number of close friends and family 3%Who might be interested in the topic 2%Other 10%

Page 32: Quantifying the Invisible Audience in Social Networks

Folk theories of audienceResults

No folk theory was more accurate than a random guess

Random guess 23%Feedback — likes and comments 21%Fraction of friend count 15%Login timing 9%Friends seen active on the site 5%Number of close friends and family 3%Who might be interested in the topic 2%Other 10%

Page 33: Quantifying the Invisible Audience in Social Networks

Users want larger audiencesResults

same more far morefewerfar fewer

“How many people do you wish saw this content?”

50% 25% 22%

Page 34: Quantifying the Invisible Audience in Social Networks

Users want larger audiencesResults

Roughly half want a larger audience...but they already have it.

same more far morefewerfar fewer

“How many people do you wish saw this content?”

50% 25% 22%

Page 35: Quantifying the Invisible Audience in Social Networks

Users underestimate their audience by 4x

Common folk theories use feedback and friend count

Users want larger audiences, but already have them

Page 36: Quantifying the Invisible Audience in Social Networks

Perceived audience vs. reality - survey - folk theories of audience - desired audience size

Predictability of audience size - using friend count - using feedback

Page 37: Quantifying the Invisible Audience in Social Networks

Perceived audience vs. reality - survey - folk theories of audience - desired audience size

Predictability of audience size - using friend count - using feedback

Page 38: Quantifying the Invisible Audience in Social Networks

Can we predict a post’s audience using public signals?using the full 220,000 user and 150,000,000 view dataset

Page 39: Quantifying the Invisible Audience in Social Networks

35% of friends see median postResults

More friends means higher variability in audience

0.00

0.01

0.02

0.03

0 50 100 150 200Number of friends who saw post

Den

sity

50th percentile by friend count (266)

Page 40: Quantifying the Invisible Audience in Social Networks

0.00

0.01

0.02

0.03

0 50 100 150 200Number of friends who saw post

Den

sity

35% of friends see median postResults

More friends means higher variability in audience

25th percentile by friend count (138)

Page 41: Quantifying the Invisible Audience in Social Networks

35% of friends see median postResults

More friends means higher variability in audience

0.00

0.01

0.02

0.03

0 50 100 150 200Number of friends who saw post

Den

sity

75th percentile by friend count (484)

Page 42: Quantifying the Invisible Audience in Social Networks

Audience size is highly variableResults

Highly variable: 50th percentile range is 20% of friends

0%

20%

40%

60%

80%

100%

lllllllllllllllllllllllllllllllll

0 200 400 600 800Number of friends

Perc

ent o

f frie

nds

who

saw

pos

t

Page 43: Quantifying the Invisible Audience in Social Networks

Audience size is highly variableResults

Highly variable: 50th percentile range is 20% of friends

50th percentile range90th percentile range

0%

20%

40%

60%

80%

100%

lllllllllllllllllllllllllllllllll

0 200 400 600 800Number of friends

Perc

ent o

f frie

nds

who

saw

pos

t

Page 44: Quantifying the Invisible Audience in Social Networks

Audience size predictionOLS regression

Model predictors R2 Mean absolute error

Friend count 0.12 8% of friend count

Feedback 0.13 8%

Friend count and feedback

0.27 7%

Page 45: Quantifying the Invisible Audience in Social Networks

0%

20%

40%

60%

80%

100%

ll

l l l l l l l l l l l l l l l l l l

0 5 10 15Unique friends liking the post

Perc

ent o

f frie

nds

who

saw

pos

tFeedback is not predictiveResults

Rapid audience growth until the post receives feedback from five unique friends

Posts with no likes or comments have especially large variance: 90th percentile is 2%–55%

Page 46: Quantifying the Invisible Audience in Social Networks

Model predictors R2 Mean absolute error

Friend count 0.12 8% of friend count

Feedback 0.13 8% of friend count

Friend count and feedback

0.27 7% of friend count

Audience size predictionOLS regression

Page 47: Quantifying the Invisible Audience in Social Networks

Model predictors R2 Mean absolute error

Friend count 0.12 8% of friend count

Feedback 0.13 8% of friend count

Friend count and feedback

0.27 7% of friend count

Audience size predictionOLS regression

Page 48: Quantifying the Invisible Audience in Social Networks

Model predictors R2 Mean absolute error

Friend count 0.12 8% of friend count

Feedback 0.13 8% of friend count

Friend count and feedback

0.27 7% of friend count

Even with access to all user-visible signals, audience size is still unpredictable.

Audience size predictionOLS regression

Page 49: Quantifying the Invisible Audience in Social Networks

How predictable is a user’s cumulative audience?Consider the audience for all of a user’s posts over 30 days instead of a single post

50% of the users in our sample produced five or more pieces of content during the month

Page 50: Quantifying the Invisible Audience in Social Networks

0%

20%

40%

60%

80%

100%

ll

lllllllllllllllllllllllllllllll

0 200 400 600 800Number of friends

Cum

ulat

ive a

udie

nce

(% o

f frie

nds)

61% of friends see at least one of the user’s posts each month

However, actual audience size is still highly variable

Page 51: Quantifying the Invisible Audience in Social Networks

Discussion

Page 52: Quantifying the Invisible Audience in Social Networks

Fundamental mismatch between perceived and actual audienceHow might a 4x underestimate be impacting user behavior?

Type of content shared, sharing volume, motivation

Ambiguous whether a more socially transparent design would be desirable

Page 53: Quantifying the Invisible Audience in Social Networks

Fundamental mismatch between perceived and actual audienceHow might a 4x underestimate be impacting user behavior?

Type of content shared, sharing volume, motivation

Ambiguous whether a more socially transparent design would be desirable

Page 54: Quantifying the Invisible Audience in Social Networks

Why underestimate audience size?

The wishful thinking hypothesis: more comfortable to blame a noisy distribution channel than to blame yourself for writing bad content

Page 55: Quantifying the Invisible Audience in Social Networks

Why underestimate audience size?

The wishful thinking hypothesis: more comfortable to blame a noisy distribution channel than to blame yourself for writing bad content

What role might be played by...The availability heuristic?Algorithmic feed filtering?

Page 56: Quantifying the Invisible Audience in Social Networks

Your invisible audience is larger than you probably think.

Page 57: Quantifying the Invisible Audience in Social Networks

Your invisible audience is larger than you probably think.Users underestimate audience size by 4xMedian reach is 35% per post and 61% per monthMany want larger audiences but already have them

Page 58: Quantifying the Invisible Audience in Social Networks

stanford hci group

Quantifying the Invisible Audience in Social Networks

Eytan Bakshy, Moira Burke, Brian KarrerFacebook Data Science Team

Michael BernsteinStanford Computer Science Department