Probabilistic Medical Predictions of Large Language Models

Gu, Bowen; Desai, Rishi J.; Lin, Kueiyu Joshua; Yang, Jie

arXiv:2408.11316 (cs)

[Submitted on 21 Aug 2024 (v1), last revised 3 Dec 2024 (this version, v2)]

Abstract: Large Language Models (LLMs) have shown promise in clinical applications through prompt engineering, allowing flexible clinical predictions. However, they struggle to produce reliable prediction probabilities, which are crucial for transparency and decision-making. While explicit prompts can lead LLMs to generate probability estimates, their numerical reasoning limitations raise concerns about reliability. We compared explicit probabilities from text generation to implicit probabilities derived from the likelihood of predicting the correct label token. Across six advanced open-source LLMs and five medical datasets, explicit probabilities consistently underperformed implicit probabilities in discrimination, precision, and recall. This discrepancy is more pronounced with smaller LLMs and imbalanced datasets, highlighting the need for cautious interpretation, improved probability estimation methods, and further research for clinical use of LLMs.
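To make the distinction in the abstract concrete, the following is a minimal sketch of the two kinds of probability, not the authors' implementation. It assumes a binary task with the hypothetical label tokens "Yes" and "No", assumes the model's next-token log-probabilities are already available as a plain dictionary, and assumes the implicit probability is renormalized over just those two tokens. The function names and the parsing rule for the explicit answer are likewise illustrative assumptions.

```python
import math
import re


def implicit_probability(token_logprobs, positive="Yes", negative="No"):
    """Implicit probability of the positive label.

    token_logprobs maps candidate next tokens to log-probabilities
    (hypothetical input; a real model would supply these values).
    Assumption: renormalize over the two label tokens only.
    """
    p_pos = math.exp(token_logprobs[positive])
    p_neg = math.exp(token_logprobs[negative])
    return p_pos / (p_pos + p_neg)


def explicit_probability(generated_text):
    """Explicit probability parsed from the model's generated text.

    Takes the first number in the text, accepting forms such as "0.8"
    or "80%". Returns None when no number is found or when the value
    lies outside [0, 1].
    """
    match = re.search(r"(\d+(?:\.\d+)?)\s*(%?)", generated_text)
    if match is None:
        return None
    value = float(match.group(1))
    if match.group(2) == "%":
        value /= 100.0
    return value if 0.0 <= value <= 1.0 else None


# Hypothetical values for illustration only, not results from the paper.
example_logprobs = {"Yes": -0.1, "No": -3.0}
print(implicit_probability(example_logprobs))
print(explicit_probability("The probability is 80%."))
```

The implicit route reads a number the model already computes when it generates a token, while the explicit route depends on the model writing a sensible number as text, which is where the numerical reasoning concern raised in the abstract applies.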

Comments:	Preprint. Under review
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2408.11316 [cs.AI]
	(or arXiv:2408.11316v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2408.11316

Submission history

From: Bowen Gu
[v1] Wed, 21 Aug 2024 03:47:17 UTC (4,189 KB)
[v2] Tue, 3 Dec 2024 21:54:39 UTC (970 KB)
