(Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be...
Transcript of (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be...
![Page 1: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/1.jpg)
licensed from the C
artoon Bank
(Online) Discussion Dynamics
Can we better manage conversations on Twitter, Facebook, Reddit, etc.?
Lillian LeeCornell University
#DSRD19
![Page 2: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/2.jpg)
licensed from the C
artoon Bank
Something's brewing! Early prediction of controversy-causing posts from discussion features
Looks like the proposal will be controversial.
Jack Hessel and Lillian Lee, NAACL 2019
#DSRD19
![Page 3: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/3.jpg)
… …. ….. ……
Task: predict whether a social media post will get many positive and negative responses, or no?
✖Yes, controversial
✔✔✔✔✔✔
✖✖✖✖
No, not controversial✖ ✔✖✖ ✖✖
#DSRD19
![Page 4: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/4.jpg)
Utility to site moderators and administrators
• Monitoring for “bad” controversy can prevent harm to the group
• Bringing “productive” controversy to the community’s attention can help the group solve problems
Controversy (as we have defined it) is not necessarily a bad thing.
#DSRD19
![Page 5: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/5.jpg)
Observation: controversy is community-specific
“break up”: controversial in the Reddit group on relationships,but not in the group for posing questions to women
“my parents”: controversial for the personal-finance group(example: “live with my parents”)
but not in the relationships group
#DSRD19
![Page 6: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/6.jpg)
Our datasets ("fill-in" of Baumgartner's crawl)
- 6 communities on www.reddit.com:
- two QA subreddits: AskMen, AskWomen- a special interest community: Fitness- three advice communities:
LifeProTips, personalfinance, relationships- Posts and comments mostly web-English
- Up/downvote information: eventual percent-upvoted
(we can’t use early votes: no timestamps)
#DSRD19
![Page 7: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/7.jpg)
Observation: we can use early reactions• Early opinions can greatly affect subsequent opinion dynamics (Salganik et al. MusicLab experiment, Science 2006, inter alia)
• Both the content and structure of the early discussion tree may prove helpful.
was controversial
wasn’t controversial
#DSRD19
![Page 8: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/8.jpg)
Early comments: how many?
=15% of eventual
=32% of eventual
#DSRD19
![Page 9: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/9.jpg)
We predict community-specific controversy of a post, examining domain transferability of features, using an early detection paradigm.
Predicting controversy from posting-time-only features(Dori-Hacohen and Allan, 2013; Mejova et al., 2014; Klenner et al., 2014; Dori-Hacohen et al., 2016; Jang and Allan, 2016; Jang et al., 2017; Addawood et al., 2017; Timmermans et al., 2017; Rethmeier et al., 2018; Kaplun et al., 2018)
Retrospective analyses: was a given hashtag/entity/word controversial previously?(Popescu and Pennacchiotti, 2010; Choi et al., 2010; Rad and Barbosa, 2012; Cao et al., 2015; Lourentzouet al., 2015; Chen et al., 2016; Addawood et al., 2017; Beelen et al., 2017; Al-Ayyoub et al., 2017; Garimella et al., 2018)
Disagreement or antisocial behavior(Mishne and Glance, 2006; Yin et al., 2012; Awadallah et al., 2012; Allen et al., 2014; Wang and Cardie, 2014; Marres, 2015; Borra et al., 2015; Jang et al., 2017; Basile et al., 2017; Liu et al., 2018; Zhang et al., 2018; Chang & Danescu-Niculescu-Mizil., 2019)
#DSRD19
![Page 10: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/10.jpg)
Prediction results incorporating comment features:One community
AskWomen
4 comments,on average
Best baseline on original post: MeanpoolBERT 1st 512 words, L2 normalize, PCA-> 100 dims, linear classifier
*Significant diff over baseline at 45 mins#DSRD19
![Page 11: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/11.jpg)
AskMen AskWomen Fitness
LifeProTips personalfinance relationships
* * *
* *
![Page 12: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/12.jpg)
Tree/Rate features transfer better than content
Training Subreddit
Testing Subreddit
#DSRD19
![Page 13: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/13.jpg)
Takeaways (modulo caveats! see paper)● We advocate an early-detection, community-specific approach to
controversial-post detection
○ Early detection outperforms posting-time-only features in 5 of 6 Reddit communities tested, even, sometimes, for quite small early-time windows
○ Early comment content is most effective, but tree-shape and rate features transfer across domains better
#DSRD19
![Page 14: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/14.jpg)
licensed from the C
artoon Bank
Content removal as a moderation strategy
What if I censure that "oink"?
Kumar Bhargav SrinivasanCristian Danescu-Niculescu-MizilLillian LeeChenhao Tan
rule-breaking comment
CSCW 2019}#DSRD19
![Page 15: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/15.jpg)
Test case: ChangeMyView subreddit:
Known to be surprisingly productive
● CMV moderators manually removed 22,788 comments between January
2015 and March 2018.
● Users consider moderator intervention to be one of the main factors
behind the quality of discussions in CMV.
○ “I’ve seen threads go ugly so fast [on other subreddits], and I think
that having active mods helps CMV not get bogged down by trolls.”
[Jhaver, Vora, Bruckman 2017]
● We have moderator-log access through previous CMV work.
#DSRD19
![Page 16: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/16.jpg)
comment deletion for rule violation on CMV
[username]
#DSRD19
![Page 17: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/17.jpg)
Comment deletion and future activity(or lack thereof)
![Page 18: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/18.jpg)
The effect of comment deletion on those who stay?
Possible reasons that comment deletion may not causecompliant behavior:
○ Comment deletion can “backfire” [Chancellor, Pater, Clear, Gilbert, De Choudhury 2016 vs. Chandrasekharan, Pavalanthan, Srinivasan, Glynn, Eisenstein, Gilbert 2017 vs Chang and Danescu-Niculescu-Mizil WWW 2019]
○ (and see two slides from now)#DSRD19
![Page 19: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/19.jpg)
In this work, we don't do A/B testing
• Randomizing comment dele/on may disrupt a popular and produc/ve community.
• Randomizing comment removals seems wrong for non-viola/ng comments.
![Page 20: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/20.jpg)
Interrupted time-series analysis at removal?
-10-8 -6 -4 -2 0 2 4 6 8 10comment index
-0.5%
0.0%
0.5%
1.0%
1.5%
2.0%
fract
ion
rem
oved
�2 =-1.31e+0(***)�3 =-8.35e-2(***)
8.4K discussion trees with total 22K mod-removed comments, 73K trees and 4M comments total
"Comment 0" made, then deleted by mod
Stat. sig change in slope, level?
"Comment 0" user's comment timeline
Confound: effect of having made a removal-meriting comment. (Drop an "F-bomb", then self-censorregardless of moderator action?)
![Page 21: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/21.jpg)
Observational delayed-feedback paradigmDelay (>2 hours in 40% of cases)
User's comment timeline
Comment that will be removed is made
That comment is removed by a mod
#DSRD19
![Page 22: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/22.jpg)
Delayed-feedback paradigm
C1C-1
Delay (pre-removal window)
If c-1 is not rule-abiding, but c1 is, now do we know dele?on is the cause?
Alas, no – cannot rule out temporal effects. #DSRD19
![Page 23: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/23.jpg)
Delayed-feedback paradigm
C1C-1
Delay (pre-removal window)
C’1C’-1
Matched delay
pseudo removal
"treatment" user
"control" user
#DSRD19
![Page 24: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/24.jpg)
Less non-compliance (non-targe-deletion trees)?
-10-8 -6 -4 -2 0 2 4 6 8 10comment index
-0.5%
0.0%
0.5%
1.0%
1.5%
2.0%
fract
ion
rem
oved
�2 =-1.31e+0(***)�3 =-8.35e-2(***)
Interrupted *me series ✅ Delayed feedback ✅
before(c�1 or c0
�1)after
(c1 or c01)
0.0%
2.5%
5.0%
7.5%
frac
tion
rem
oved
DF treatment
DF control
p < 0.001
#DSRD19
![Page 25: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/25.jpg)
Increased engagement (comment length)?
-10 -8 -6 -4 -2 0 2 4 6 8 10comment index
95.0
100.0
105.0
110.0
115.0
120.0
wor
dco
unt
�2 =6.22e+0(*)�3 =9.18e-2
Interrupted *me series ✅ Delayed feedback
before(c�1 or c0
�1)after
(c1 or c01)
80
100
120
wor
dco
unt
DF treatment
DF control
⊝#DSRD19
![Page 26: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/26.jpg)
Takeaways (modulo caveats! see paper)● "Delayed feedback" observational paradigm – better
controls compared to "standard" ITS application
○ Limitation: only applicable to users active enough to post in the delay window
● For applicable users, comment moderator-deletion causes immediate non-compliance drop with no significant change in "post effort" (length)
#DSRD19
![Page 27: (Online) Discussion Dynamics - Cornell University · Test case: ChangeMyViewsubreddit: Known to be surprisingly productive CMV moderators manually removed 22,788 comments between](https://reader034.fdocuments.in/reader034/viewer/2022042316/5f04e0b27e708231d41028dc/html5/thumbnails/27.jpg)
licensed from the C
artoon Bank
Summary: "Movie trailers" of controversy, comment removal
Thanks for listening!
Please see the NAACL 2019 and CSCW 2019 paper for (many more) details:http://www.cs.cornell.edu/home/llee/papers.html
#DSRD19