IterNorm: Fast Iterative Normalization

Ye, ChangMin; Sim, Yonguk; Kim, Youngchae; Jin, SeongMin; Jeong, Doo Seok

Computer Science > Machine Learning

arXiv:2412.04778 (cs)

[Submitted on 6 Dec 2024]

Title:IterNorm: Fast Iterative Normalization

Authors:ChangMin Ye, Yonguk Sim, Youngchae Kim, SeongMin Jin, Doo Seok Jeong

View PDF HTML (experimental)

Abstract:Transformer-based large language models are a memory-bound model whose operation is based on a large amount of data that are marginally reused. Thus, the data movement between a host and accelerator likely dictates the total wall-clock time. Layer normalization is one of the key workloads in the transformer model, following each of multi-head attention and feed-forward network blocks. To reduce data movement, layer normalization needs to be performed on the same chip as the matrix-matrix multiplication engine. To this end, we introduce an iterative L2-normalization method for 1D input (IterNorm), ensuring fast convergence to the steady-state solution within five iteration steps and high precision, outperforming the fast inverse square root algorithm in six out of nine cases for FP32 and five out of nine for BFloat16 across the embedding lengths used in the OPT models. Implemented in 32/28nm CMOS, the IterNorm macro normalizes $d$-dimensional vectors, where $64 \leq d \leq 1024$, with a latency of 112-227 cycles at 100MHz/1.05V.

Comments:	Design, Automation & Test in Europe Conference 2025
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2412.04778 [cs.LG]
	(or arXiv:2412.04778v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2412.04778

Submission history

From: Doo Seok Jeong [view email]
[v1] Fri, 6 Dec 2024 05:00:01 UTC (5,135 KB)

Computer Science > Machine Learning

Title:IterNorm: Fast Iterative Normalization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:IterNorm: Fast Iterative Normalization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators