A Resource Model For Neural Scaling Law

Song, Jinyeop; Liu, Ziming; Tegmark, Max; Gore, Jeff

arXiv:2402.05164 (cs)

[Submitted on 7 Feb 2024 (v1), last revised 15 May 2024 (this version, v2)]

Abstract: Neural scaling laws characterize how model performance improves as the model size scales up. Inspired by empirical observations, we introduce a resource model of neural scaling. A task is usually composite and hence can be decomposed into many subtasks, which compete for resources (measured by the number of neurons allocated to each subtask). On toy problems, we empirically find that: (1) The loss of a subtask is inversely proportional to the number of neurons allocated to it. (2) When multiple subtasks are present in a composite task, the resources acquired by each subtask grow uniformly as models get larger, keeping the ratios of acquired resources constant. We hypothesize that these findings hold generally and build a model to predict neural scaling laws for general composite tasks, which successfully replicates the neural scaling law of Chinchilla models reported in arXiv:2203.15556. We believe that the notion of resource used in this paper will be a useful tool for characterizing and diagnosing neural networks.
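
The two empirical findings in the abstract can be illustrated with a small numerical sketch. This is not the authors' code; it only assumes what the abstract states (subtask loss inversely proportional to allocated neurons, and constant allocation ratios as the model grows) plus one further assumption made here for illustration: that the composite loss is the sum of the subtask losses. The function names `allocate` and `composite_loss`, the ratios, and the coefficients are all hypothetical.

```python
import math


def allocate(total_neurons, ratios):
    """Split neurons among subtasks in fixed proportions (finding 2)."""
    norm = sum(ratios)
    return [total_neurons * r / norm for r in ratios]


def composite_loss(total_neurons, ratios, coeffs):
    """Sum of subtask losses, each c_i / N_i (finding 1).

    Summing the subtask losses is an illustrative assumption,
    not something stated in the abstract.
    """
    allocated = allocate(total_neurons, ratios)
    return sum(c / n for c, n in zip(coeffs, allocated))


# Hypothetical allocation ratios and per-subtask loss coefficients.
ratios = [0.5, 0.3, 0.2]
coeffs = [1.0, 2.0, 0.5]

for n in (100, 200, 400):
    print(n, composite_loss(n, ratios, coeffs))

# Under these assumptions, doubling the neuron count halves the loss.
assert math.isclose(
    composite_loss(200, ratios, coeffs) * 2,
    composite_loss(100, ratios, coeffs),
)
```

Under these assumptions the composite loss itself is inversely proportional to the total number of neurons, because each subtask's allocation scales linearly with model size while the ratios stay fixed.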

Comments:	10 pages, 8 figures, Published as a workshop paper at ICLR 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2402.05164 [cs.LG]
	(or arXiv:2402.05164v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.05164

Submission history

From: Jinyeop Song
[v1] Wed, 7 Feb 2024 18:58:18 UTC (4,112 KB)
[v2] Wed, 15 May 2024 15:39:38 UTC (9,835 KB)
