Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning

Wang, Xinyi; Tan, Shawn; Jin, Mingyu; Wang, William Yang; Panda, Rameswar; Shen, Yikang

arXiv:2504.03635 (cs)

[Submitted on 4 Apr 2025]

Abstract: Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide range of tasks requiring complex reasoning. However, the effects of scaling on their reasoning abilities remain insufficiently understood. In this paper, we introduce a synthetic multi-hop reasoning environment designed to closely replicate the structure and distribution of real-world large-scale knowledge graphs. Our reasoning task involves completing missing edges in the graph, which requires advanced multi-hop reasoning and mimics real-world reasoning scenarios. To evaluate this, we pretrain language models (LMs) from scratch solely on triples from the incomplete graph and assess their ability to infer the missing edges. Interestingly, we observe that overparameterization can impair reasoning performance due to excessive memorization, producing a U-shaped loss curve as model size grows. We investigate different factors that affect this U-shaped loss curve, including graph structure, model size, and training steps. To predict the optimal model size for a specific knowledge graph, we find an empirical scaling law that linearly maps the knowledge graph search entropy to the optimal model size. This work provides new insights into the relationship between scaling and reasoning in LLMs, shedding light on possible ways to optimize their performance for reasoning tasks.

Subjects:	Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2504.03635 [cs.AI]
	(or arXiv:2504.03635v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2504.03635

Submission history

From: Xinyi Wang
[v1] Fri, 4 Apr 2025 17:57:22 UTC (4,864 KB)
