How to Upscale Neural Networks with Scaling Law? A Survey and Practical Guidelines

Sengupta, Ayan; Goel, Yash; Chakraborty, Tanmoy

arXiv:2502.12051 (cs)

[Submitted on 17 Feb 2025]

Abstract: Neural scaling laws have revolutionized the design and optimization of large-scale AI models by revealing predictable relationships between model size, dataset volume, and computational resources. Early research established power-law relationships in model performance, leading to compute-optimal scaling strategies. However, recent studies have highlighted their limitations across architectures, modalities, and deployment contexts. Sparse models, mixture-of-experts, retrieval-augmented learning, and multimodal models often deviate from traditional scaling patterns. Moreover, scaling behaviors vary across domains such as vision, reinforcement learning, and fine-tuning, underscoring the need for more nuanced approaches. In this survey, we synthesize insights from over 50 studies, examining the theoretical foundations, empirical findings, and practical implications of scaling laws. We also explore key challenges, including data efficiency, inference scaling, and architecture-specific constraints, advocating for adaptive scaling strategies tailored to real-world applications. We suggest that while scaling laws provide a useful guide, they do not always generalize across all architectures and training strategies.

Comments:	20 pages, 8 tables, 4 figures
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:2502.12051 [cs.CL]
	(or arXiv:2502.12051v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.12051

Submission history

From: Ayan Sengupta
[v1] Mon, 17 Feb 2025 17:20:41 UTC (312 KB)
