Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain

Sayin, Burcu; Minervini, Pasquale; Staiano, Jacopo; Passerini, Andrea

Computer Science > Computation and Language

arXiv:2403.20288 (cs)

[Submitted on 29 Mar 2024 (v1), last revised 6 May 2024 (this version, v2)]

Title:Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain

Authors:Burcu Sayin, Pasquale Minervini, Jacopo Staiano, Andrea Passerini

View PDF HTML (experimental)

Abstract:We explore the potential of Large Language Models (LLMs) to assist and potentially correct physicians in medical decision-making tasks. We evaluate several LLMs, including Meditron, Llama2, and Mistral, to analyze the ability of these models to interact effectively with physicians across different scenarios. We consider questions from PubMedQA and several tasks, ranging from binary (yes/no) responses to long answer generation, where the answer of the model is produced after an interaction with a physician. Our findings suggest that prompt design significantly influences the downstream accuracy of LLMs and that LLMs can provide valuable feedback to physicians, challenging incorrect diagnoses and contributing to more accurate decision-making. For example, when the physician is accurate 38% of the time, Mistral can produce the correct answer, improving accuracy up to 74% depending on the prompt being used, while Llama2 and Meditron models exhibit greater sensitivity to prompt choice. Our analysis also uncovers the challenges of ensuring that LLM-generated suggestions are pertinent and useful, emphasizing the need for further research in this area.

Comments:	Accepted for oral presentation at NAACL 2024, The 6th Clinical Natural Language Processing Workshop
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2403.20288 [cs.CL]
	(or arXiv:2403.20288v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2403.20288

Submission history

From: Burcu Sayin Günel [view email]
[v1] Fri, 29 Mar 2024 16:59:13 UTC (3,551 KB)
[v2] Mon, 6 May 2024 14:13:51 UTC (4,646 KB)

Computer Science > Computation and Language

Title:Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Can LLMs Correct Physicians, Yet? Investigating Effective Interaction Methods in the Medical Domain

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators