Long Is More Important Than Difficult for Training Reasoning Models

Shen, Si; Huang, Fei; Zhao, Zhixiao; Liu, Chang; Zheng, Tiansheng; Zhu, Danhao

arXiv:2503.18069 (cs)

[Submitted on 23 Mar 2025]

Abstract: Difficult problems, which often result in long reasoning traces, are widely recognized as key factors for enhancing the performance of reasoning models. However, such high-challenge problems are scarce, limiting the size of available datasets. In this paper, we propose a simple method to decouple the reliance on problem difficulty. First, we empirically demonstrate that reasoning length, rather than problem difficulty, primarily influences the performance of trained models. Second, we identify a scaling law on reasoning length, showing that model performance increases in a log-linear fashion as the reasoning data length grows. Finally, we introduce a straightforward technique to generate reasoning data of arbitrary length, and show that the synthesized data is effective for training reasoning models. After fine-tuning the Qwen2.5-32B-Instruct language model on our Long1K dataset, we present our model, Long1K-32B, which reaches 95.6% accuracy on MATH and 71.1% on GPQA with only 1,000 training samples, outperforming DeepSeek-R1-Distill-Qwen-32B. The model, code, and dataset are all open-sourced, available at this https URL.
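
As a reading aid for the "log-linear" claim in the abstract: one plausible reading is that accuracy grows roughly as a + b * ln(L), where L is the reasoning length of the training data. The sketch below fits that form by ordinary least squares. The function name `fit_log_linear`, the choice of natural log, and the data points are all assumptions made for illustration; the numbers are synthetic placeholders and are not results from the paper.

```python
import math

def fit_log_linear(lengths, scores):
    """Least-squares fit of score = a + b * ln(length).

    Hypothetical helper for illustration; not taken from the paper's code.
    """
    xs = [math.log(n) for n in lengths]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(scores) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, scores))
    b = sxy / sxx
    a = mean_y - b * mean_x
    return a, b

# Synthetic placeholder points (NOT measurements from the paper):
# each doubling of reasoning length adds the same number of points.
lengths = [1000, 2000, 4000, 8000]
scores = [50.0, 55.0, 60.0, 65.0]

a, b = fit_log_linear(lengths, scores)

# Under a log-linear law, every doubling of length yields a constant gain.
gain_per_doubling = b * math.log(2)
print(f"gain per doubling of length: {gain_per_doubling:.2f} points")
```

Under this form, equal multiplicative increases in reasoning length correspond to equal additive gains in accuracy, which is what distinguishes a log-linear trend from a linear one.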

Comments:	15 pages, 6 figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2503.18069 [cs.CL]
	(or arXiv:2503.18069v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.18069

Submission history

From: Fei Huang
[v1] Sun, 23 Mar 2025 13:33:59 UTC (213 KB)
