Fine-Grained Embedding Dimension Optimization During Training for Recommender Systems

Luo, Qinyi; Wang, Penghan; Zhang, Wei; Lai, Fan; Mao, Jiachen; Wei, Xiaohan; Song, Jun; Tsai, Wei-Yu; Yang, Shuai; Hu, Yuxi; Qian, Xuehai

arXiv:2401.04408 (cs)

[Submitted on 9 Jan 2024]

Abstract: Huge embedding tables in modern Deep Learning Recommender Models (DLRM) require prohibitively large amounts of memory during training and inference. To reduce the memory footprint of training, this paper proposes FIne-grained In-Training Embedding Dimension optimization (FIITED). Based on the observation that embedding vectors are not equally important, FIITED adjusts the dimension of each individual embedding vector continuously during training, assigning larger dimensions to more important embeddings while adapting to dynamic changes in the data. A novel embedding storage system based on virtually-hashed physically-indexed hash tables is designed to implement the dimension adjustment efficiently and to turn it into actual memory savings. Experiments on two industry models show that FIITED can reduce the size of embeddings by more than 65% while maintaining the trained model's quality, saving significantly more memory than a state-of-the-art in-training embedding pruning method. On public click-through rate prediction datasets, FIITED can prune up to 93.75%-99.75% of embeddings without significant accuracy loss.
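The core idea of giving each embedding vector its own dimension can be illustrated with a minimal sketch. This is not the FIITED algorithm: the abstract does not say how importance is measured or how dimensions are chosen, so the importance scores, the candidate dimensions, the rank-based bucketing policy, and the function names `assign_dimensions` and `size_reduction` are all assumptions made for illustration. The storage system based on virtually-hashed physically-indexed hash tables is not modeled here.

```python
from typing import List, Sequence


def assign_dimensions(scores: Sequence[float],
                      candidate_dims: Sequence[int],
                      fractions: Sequence[float]) -> List[int]:
    """Give each embedding row a dimension based on its importance rank.

    scores[i]      : hypothetical importance of row i (higher = more important)
    candidate_dims : allowed dimensions, largest first, e.g. [64, 32, 0]
    fractions      : share of rows that receives each dimension; must sum to 1

    The bucketing policy is an illustrative assumption, not the paper's method.
    """
    if len(candidate_dims) != len(fractions):
        raise ValueError("candidate_dims and fractions must have equal length")
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")

    n = len(scores)
    # Row indices sorted from most to least important.
    order = sorted(range(n), key=lambda i: scores[i], reverse=True)

    dims = [candidate_dims[-1]] * n  # default: smallest dimension
    start = 0
    cumulative = 0.0
    for dim, frac in zip(candidate_dims, fractions):
        cumulative += frac
        end = min(n, round(cumulative * n))
        for i in order[start:end]:
            dims[i] = dim
        start = end
    return dims


def size_reduction(dims: Sequence[int], full_dim: int) -> float:
    """Fraction of embedding entries removed relative to a uniform full_dim table."""
    return 1.0 - sum(dims) / (full_dim * len(dims))


# Four rows, full dimension 8: the top quarter keeps 8 entries,
# the next quarter keeps 4, and the remaining half is pruned entirely.
scores = [0.9, 0.1, 0.5, 0.3]
dims = assign_dimensions(scores, candidate_dims=[8, 4, 0],
                         fractions=[0.25, 0.25, 0.5])
print(dims)                     # per-row dimensions
print(size_reduction(dims, 8))  # fraction of entries saved
```

In an in-training setting, an assignment of this kind would be recomputed periodically as the scores change, which is what lets the dimensions follow shifts in the data.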

Comments:	16 pages, 9 figures
Subjects:	Information Retrieval (cs.IR); Machine Learning (cs.LG)
ACM classes:	I.2.6; H.3.3
Cite as:	arXiv:2401.04408 [cs.IR]
	(or arXiv:2401.04408v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2401.04408

Submission history

From: Qinyi Luo
[v1] Tue, 9 Jan 2024 08:04:11 UTC (1,411 KB)
