Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models

Sim, Shamus; Chen, Tyrone

Computer Science > Computation and Language

arXiv:2412.15748 (cs)

[Submitted on 20 Dec 2024]

Title:Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models

Authors:Shamus Sim, Tyrone Chen

View PDF HTML (experimental)

Abstract:Background: Despite the current ubiquity of Large Language Models (LLMs) across the medical domain, there is a surprising lack of studies which address their reasoning behaviour. We emphasise the importance of understanding reasoning behaviour as opposed to high-level prediction accuracies, since it is equivalent to explainable AI (XAI) in this context. In particular, achieving XAI in medical LLMs used in the clinical domain will have a significant impact across the healthcare sector. Results: Therefore, we define the concept of reasoning behaviour in the specific context of medical LLMs. We then categorise and discuss the current state of the art of methods which evaluate reasoning behaviour in medical LLMs. Finally, we propose theoretical frameworks which can empower medical professionals or machine learning engineers to gain insight into the low-level reasoning operations of these previously obscure models. Conclusion: The subsequent increased transparency and trust in medical machine learning models by clinicians as well as patients will accelerate the integration, application as well as further development of medical AI for the healthcare system as a whole

Comments:	16 pages, 5 figures, 2 tables. Conceptualization, both authors. formal analysis, both authors. funding acquisition, both authors. investigation, both authors. resources, both authors. supervision, T.C.. validation, both authors. visualization, both authors. writing original draft, both authors. writing review and editing, both authors
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2412.15748 [cs.CL]
	(or arXiv:2412.15748v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.15748

Submission history

From: Shamus Zi Yang Sim [view email]
[v1] Fri, 20 Dec 2024 10:06:52 UTC (429 KB)

Computer Science > Computation and Language

Title:Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Critique of Impure Reason: Unveiling the reasoning behaviour of medical Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators