Evaluation of an LLM in Identifying Logical Fallacies: A Call for Rigor When Adopting LLMs in HCI Research

Lim, Gionnieve; Perrault, Simon T.


arXiv:2404.05213 (cs)

[Submitted on 8 Apr 2024]



Abstract: There is increasing interest in the adoption of LLMs in HCI research. However, LLMs may often be regarded as a panacea because of their powerful capabilities with an accompanying oversight on whether they are suitable for their intended tasks. We contend that LLMs should be adopted in a critical manner following rigorous evaluation. Accordingly, we present the evaluation of an LLM in identifying logical fallacies that will form part of a digital misinformation intervention. By comparing to a labeled dataset, we found that GPT-4 achieves an accuracy of 0.79, and for our intended use case that excludes invalid or unidentified instances, an accuracy of 0.90. This gives us the confidence to proceed with the application of the LLM while keeping in mind the areas where it still falls short. The paper describes our evaluation approach, results and reflections on the use of the LLM for our intended task.
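
The two accuracy figures in the abstract differ only in which instances are scored. As a minimal sketch of that distinction, and not the authors' evaluation code, the snippet below computes accuracy over all instances and again after dropping predictions marked invalid or unidentified. The sentinel labels, the function name, and the toy data are hypothetical and chosen for illustration only; the paper's actual label set and filtering rule may differ.

```python
from typing import Optional, Sequence

# Hypothetical sentinel labels for model outputs that name no usable fallacy.
EXCLUDED = frozenset({"invalid", "unidentified"})


def accuracy(
    gold: Sequence[str],
    pred: Sequence[str],
    exclude: frozenset = frozenset(),
) -> Optional[float]:
    """Fraction of predictions matching the gold label.

    Instances whose prediction is in `exclude` are dropped before scoring.
    Returns None when no instances remain.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    pairs = [(g, p) for g, p in zip(gold, pred) if p not in exclude]
    if not pairs:
        return None
    return sum(g == p for g, p in pairs) / len(pairs)


# Toy data, invented for illustration; not taken from the paper's dataset.
gold = ["ad hominem", "false dilemma", "circular reasoning",
        "ad hominem", "false dilemma"]
pred = ["ad hominem", "unidentified", "circular reasoning",
        "false dilemma", "false dilemma"]

overall = accuracy(gold, pred)             # scores all 5 instances
filtered = accuracy(gold, pred, EXCLUDED)  # scores the 4 with a usable prediction

print(f"overall={overall:.2f} filtered={filtered:.2f}")
```

On this toy input the filtered score is higher than the overall score because the one unidentified prediction counts as an error in the first case and is ignored in the second, which is the same direction of change the abstract reports.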

Subjects:	Human-Computer Interaction (cs.HC); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.05213 [cs.HC]
	(or arXiv:2404.05213v1 [cs.HC] for this version)
	https://doi.org/10.48550/arXiv.2404.05213

Submission history

From: Gionnieve Lim
[v1] Mon, 8 Apr 2024 06:00:14 UTC (120 KB)
