A Simple Model of Inference Scaling Laws

Levi, Noam

arXiv:2410.16377 (stat)

[Submitted on 21 Oct 2024 (v1), last revised 7 Dec 2024 (this version, v2)]

Authors: Noam Levi

Abstract: Neural scaling laws have garnered significant interest because they predict model performance as a function of parameters, data, and compute. In this work, we propose a simple statistical ansatz based on memorization to study scaling laws in the context of inference, specifically how performance improves with multiple inference attempts. We explore the coverage, or pass@k metric, which measures the chance of success over repeated attempts, and provide a motivation for the observed functional form of the inference scaling behavior of the coverage in large language models (LLMs) on reasoning tasks. We then define an "inference loss", which exhibits a power-law decay as the number of trials increases, and connect this result with prompting costs. We further test our construction by conducting experiments on a simple generative model and find that our predictions agree with the empirical coverage curves in a controlled setting. Our simple framework lays the groundwork for incorporating inference scaling with other known scaling laws.
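As a rough illustration of the coverage (pass@k) metric described in the abstract, the following minimal sketch computes it in two ways. The function names, the independence assumption between attempts, and the Beta(0.5, 2.0) distribution of per-problem success probabilities are illustrative assumptions and are not taken from the paper. The first function is the standard unbiased pass@k estimator commonly used in code-generation evaluation; the second averages 1 - (1 - p)^k over problems, under the assumption that each problem has a fixed per-attempt success probability p.

```python
import random
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate for one problem.

    n: total attempts sampled, c: number of successful attempts,
    k: attempt budget being evaluated (requires k <= n).
    """
    if k > n:
        raise ValueError("k must not exceed n")
    if n - c < k:
        # Fewer than k failures exist, so any k-subset contains a success.
        return 1.0
    # Probability that a random k-subset of the n attempts has no success.
    return 1.0 - comb(n - c, k) / comb(n, k)


def expected_coverage(success_probs, k: int) -> float:
    """Average over problems of 1 - (1 - p)^k, assuming independent attempts."""
    return sum(1.0 - (1.0 - p) ** k for p in success_probs) / len(success_probs)


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical per-problem success probabilities (assumption, not from the paper).
    probs = [rng.betavariate(0.5, 2.0) for _ in range(2000)]
    for k in (1, 10, 100, 1000):
        cov = expected_coverage(probs, k)
        # 1 - coverage is one simple way to track the remaining failure rate.
        print(f"k={k:5d}  coverage={cov:.4f}  uncovered={1.0 - cov:.4f}")
```

Running the script prints how the uncovered fraction shrinks as the number of attempts k grows. How fast it shrinks depends entirely on the assumed distribution of success probabilities, so the numbers should not be read as reproducing the paper's results.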

Comments:	12 pages, 7 figures
Subjects:	Machine Learning (stat.ML); Artificial Intelligence (cs.AI); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2410.16377 [stat.ML]
	(or arXiv:2410.16377v2 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.2410.16377

Submission history

From: Noam Levi
[v1] Mon, 21 Oct 2024 18:00:06 UTC (3,378 KB)
[v2] Sat, 7 Dec 2024 22:50:41 UTC (3,384 KB)