What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Feng, Shangbin; Wan, Herun; Wang, Ningnan; Tan, Zhaoxuan; Luo, Minnan; Tsvetkov, Yulia

Computer Science > Computation and Language

arXiv:2402.00371v1 (cs)

[Submitted on 1 Feb 2024 (this version), latest version 4 Jul 2024 (v2)]

Title:What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Authors:Shangbin Feng, Herun Wan, Ningnan Wang, Zhaoxuan Tan, Minnan Luo, Yulia Tsvetkov

View PDF

Abstract:Social media bot detection has always been an arms race between advancements in machine learning bot detectors and adversarial bot strategies to evade detection. In this work, we bring the arms race to the next level by investigating the opportunities and risks of state-of-the-art large language models (LLMs) in social bot detection. To investigate the opportunities, we design novel LLM-based bot detectors by proposing a mixture-of-heterogeneous-experts framework to divide and conquer diverse user information modalities. To illuminate the risks, we explore the possibility of LLM-guided manipulation of user textual and structured information to evade detection. Extensive experiments with three LLMs on two datasets demonstrate that instruction tuning on merely 1,000 annotated examples produces specialized LLMs that outperform state-of-the-art baselines by up to 9.1% on both datasets, while LLM-guided manipulation strategies could significantly bring down the performance of existing bot detectors by up to 29.6% and harm the calibration and reliability of bot detection systems.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2402.00371 [cs.CL]
	(or arXiv:2402.00371v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2402.00371

Submission history

From: Shangbin Feng [view email]
[v1] Thu, 1 Feb 2024 06:21:19 UTC (595 KB)
[v2] Thu, 4 Jul 2024 23:37:40 UTC (604 KB)

Computer Science > Computation and Language

Title:What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:What Does the Bot Say? Opportunities and Risks of Large Language Models in Social Media Bot Detection

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators