Automatically Identifying Fake News in Popular Twitter Threads

Buntain, Cody; Golbeck, Jennifer

doi:10.1109/SmartCloud.2017.40

arXiv:1705.01613 (cs)

[Submitted on 3 May 2017 (v1), last revised 30 May 2018 (this version, v2)]

Abstract: Information quality in social media is an increasingly important issue, but web-scale data hinders experts' ability to assess and correct much of the inaccurate content, or "fake news," present in these platforms. This paper develops a method for automating fake news detection on Twitter by learning to predict accuracy assessments in two credibility-focused Twitter datasets: CREDBANK, a crowdsourced dataset of accuracy assessments for events in Twitter, and PHEME, a dataset of potential rumors in Twitter and journalistic assessments of their accuracies. We apply this method to Twitter content sourced from BuzzFeed's fake news dataset and show models trained against crowdsourced workers outperform models based on journalists' assessment and models trained on a pooled dataset of both crowdsourced workers and journalists. All three datasets, aligned into a uniform format, are also publicly available. A feature analysis then identifies features that are most predictive for crowdsourced and journalistic accuracy assessments, results of which are consistent with prior work. We close with a discussion contrasting accuracy and credibility and why models of non-experts outperform models of journalists for fake news detection in Twitter.

Subjects:	Social and Information Networks (cs.SI)
Cite as:	arXiv:1705.01613 [cs.SI]
	(or arXiv:1705.01613v2 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.1705.01613
Journal reference:	2017 IEEE International Conference on Smart Cloud (SmartCloud)
Related DOI:	https://doi.org/10.1109/SmartCloud.2017.40

Submission history

From: Cody Buntain
[v1] Wed, 3 May 2017 20:34:19 UTC (326 KB)
[v2] Wed, 30 May 2018 21:08:44 UTC (401 KB)
