Comparing Automated Factual Claim Detection Against...

Comparing Automated Factual Claim Detection Against Judgments of Journalism OrganizationsNaeemul Hassan#, Mark Tremayne, Fatma Arslan, Chengkai Li

University of Texas at Arlington 2016 Computation + Journalism Symposium, Sept. 30th, 2016

# The author is now with University of Mississippi

Fact-checking at Peak

Fact-checking the Presidential Debates

http://www.politifact.com/

Closed-caption decoder

Live Coverage of Debates

Presidential Debate

Transcripts (1960-2012)

Ground Truth

Finding Important Factual Claims: A Supervised Learning Task

Human Annotation Feature

Vectors

Feature Extraction

Learning Algorithm

2016 Presidential Debates

Important factual claims

Training Dataset o Transcripts of all 30 debates (11 general elections) in history o 20k sentences by candidates: removed very short (< 5 words) sentences o Ground truth collection website

Experiments 2016 U.S. Presidential Election Primary Debates o Transcripts of 21 debate (12 Republican, 9 Democratic) o 30737 sentences o Collected fact-checks by PolitiFact and CNN o Detected topics of the sentences 12 Most Important Topics (MIP) used by Gallup* For each topic, we manually created a set of keywords representing that

topic. E.g., keywords for topic “Abortion”: {abortion, pregnancy, planned parenthood}

* M. E. McCombs and D. L. Shaw. The agenda-setting function of mass media. Public opinion quarterly, 1972 * T. W. Smith. America’s most important problem-a trend analysis. Public opinion quarterly, 1980

(1) Sentences selected by Journalism organizations (CNN and PolitiFact) for checking had ClaimBuster scores significantly higher (were more check-worthy) than sentences not selected for checking.

Distribution of topics over sentences scored low (≤ 0.3) by ClaimBuster

Distribution of topics over sentences scored high (≥ 0.5) by ClaimBuster

(2) ClaimBuster is similar to Journalism organizations (CNN and PolitiFact) in the distribution of topics among highly scored sentences (≥ 0.5) and fact-checked sentences.

new factual claims: solicit analyses from professionals; algorithmic fact-checking

o debates o interviews o speeches o political ads o social media o web o news

repository of fact-checks by professionals

matched existing fact-checks: delivered via browser extensions, mobile apps, and smart-TV apps

The Quest to Automate Fact-Checking

claimed detected and ranked by check-worthiness

One Day Robot Fact-checkers will Dominate

o Less subjective

o Restless and more efficient

o More intelligent

Acknowledgment UTA Students o Naeemul Hassan (graduated) o Fatma Arslan o Siddhant Gawsane o Aaditya Kulkarni o Joseph Minumol (graduated) o Vikas Sable o Gensheng Zhang o Many more students who are

currently working on it

Collaborators o Bill Adair (Duke) o Pankaj Agarwal (Duke) o Peter Fray (UTS) o James Hamilton (Stanford) o Mark Tremayne (UTA) o Jun Yang (Duke) o Cong Yu (Google Research)

Acknowledgment Funding sponsors

Disclaimer: This material is based upon work partially supported by the National Science Foundation Grants 1117369, 1408928 and 1565699 and a Knight Prototype Fund from the Knight Foundation. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the funding agencies.

Thank You! Questions? o http://ranger.uta.edu/~cli cli@uta.edu o Twitter @ClaimBusterTM o Prototype http://idir.uta.edu/claimbuster o You are invited to contribute http://bit.ly/claimbusters

Comparing Automated Factual Claim Detection Against...

Documents

Transcript of Comparing Automated Factual Claim Detection Against...

Yanni Li 1, a, Zhijuan Li 2, Jiayin Li 3, Likun Liu 1 ...

LI - SIVACON Verteileranschluss / LI - SIVACON switchboard ...

Li Exovedes - Li Exovede & Métis Names List

Jiaxin Xie, Chenyang Lei, Zhuwen Li, Li Erran Li, and ... · Jiaxin Xie, Chenyang Lei, Zhuwen Li, Li Erran Li, and Qifeng Chen Abstract—Depth from a monocular video can enable billions

Chi-Li-Chi-Li-Bon !

DuraBand - XS BANDED BELTS Belt Catalogue.pdfIS 2494, BS 3790, ISO 4184 IS 2494 Li Li Li Li Li Li Li Li Li 39" 9.5 13" 16" 31.5" 31" 57" 44.5" 90" " 174" 176 357 900 900 900 900 ”

ClaimBuster - Rangerranger.uta.edu/~cli/talks/2015/claimbuster_Dec2_2015_ATA.pdfImplementation: Python NLP/ML Tools . Data wrangling . o Use NLTK (Natural Language Toolkit) to transform

TEMPERATURE TRANSMITTERS LI-24L LI-24G

Visualization & Journalism: Four Vignettestmm/talks/cj16/cj16.pdf · Brehmer, Ingram, Stray, ... Car-X-Ray in-car networks Cerebral genomics RelEx in-car networks AutobahnVis ...

Mascot 2215-2216 Power Supplies · 2215 NiMH battery charger, max 35W 2216 9940 LI Li-Ion batt. charger, max 2,3A 9941 LI 2541 LI Li-Ion batt. charger, max 2,7A 2542 LI Additional

Monitoru de Cj16

Seminar 5520 (Li Li)

S5700-LI Series Gigabit Enterprise Switches S5700-LI Switch Datasheet (25... · S5700-28P-LI-AC S5700-28X-LI-AC S5700-10P-LI-AC S5700-10P-PWR-LI-AC S5700-28X-PWR-LI-AC The S5700-LI

Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang ...

Hui Li and Jing Li - Atlantis Press

Instructor: Li Erran Li ( lierranli@cs.columbia )

FR:TI-LI TO:TI-LI C.

LI-COR Radiation Sensors Instruction Manual · LI-COR Radiation Sensors Instruction Manual Terrestrial Type SA: LI-190SA Quantum Sensor LI-200SA Pyranometer Sensor LI-210SA Photometric

2.3 He Li (Rui Zhen Li)

LI KACOLBAL Q’UEBIL XBAN LI DIOS Hu a'in "Ll KACOLBAL Q'UEBIL XBAN Ll DIOS" xc'aba', cuan na'bal rakal li Ratin li Dios chi sa'. Natauman sa' li Santil Hu. Li junjunk chi página