HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Chen, Junying; Cai, Zhenyang; Ji, Ke; Wang, Xidong; Liu, Wanlong; Wang, Rongsheng; Hou, Jianye; Wang, Benyou

Computer Science > Computation and Language

arXiv:2412.18925 (cs)

[Submitted on 25 Dec 2024]

Title:HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Authors:Junying Chen, Zhenyang Cai, Ke Ji, Xidong Wang, Wanlong Liu, Rongsheng Wang, Jianye Hou, Benyou Wang

View PDF

Abstract:The breakthrough of OpenAI o1 highlights the potential of enhancing reasoning to improve LLM. Yet, most research in reasoning has focused on mathematical tasks, leaving domains like medicine underexplored. The medical domain, though distinct from mathematics, also demands robust reasoning to provide reliable answers, given the high standards of healthcare. However, verifying medical reasoning is challenging, unlike those in mathematics. To address this, we propose verifiable medical problems with a medical verifier to check the correctness of model outputs. This verifiable nature enables advancements in medical reasoning through a two-stage approach: (1) using the verifier to guide the search for a complex reasoning trajectory for fine-tuning LLMs, (2) applying reinforcement learning (RL) with verifier-based rewards to enhance complex reasoning further. Finally, we introduce HuatuoGPT-o1, a medical LLM capable of complex reasoning, which outperforms general and medical-specific baselines using only 40K verifiable problems. Experiments show complex reasoning improves medical problem-solving and benefits more from RL. We hope our approach inspires advancements in reasoning across medical and other specialized domains.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2412.18925 [cs.CL]
	(or arXiv:2412.18925v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.18925

Submission history

From: Junying Chen [view email]
[v1] Wed, 25 Dec 2024 15:12:34 UTC (1,151 KB)

Computer Science > Computation and Language

Title:HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators