QINCODEC: Neural Audio Compression with Implicit Neural Codebooks

Lahrichi, Zineb; Hadjeres, Gaëtan; Richard, Gael; Peeters, Geoffroy

Computer Science > Sound

arXiv:2503.19597 (cs)

[Submitted on 19 Mar 2025]

Title:QINCODEC: Neural Audio Compression with Implicit Neural Codebooks

Authors:Zineb Lahrichi (S2A), Gaëtan Hadjeres, Gael Richard (S2A), Geoffroy Peeters (S2A)

View PDF

Abstract:Neural audio codecs, neural networks which compress a waveform into discrete tokens, play a crucial role in the recent development of audio generative models. State-of-the-art codecs rely on the end-to-end training of an autoencoder and a quantization bottleneck. However, this approach restricts the choice of the quantization methods as it requires to define how gradients propagate through the quantizer and how to update the quantization parameters online. In this work, we revisit the common practice of joint training and propose to quantize the latent representations of a pre-trained autoencoder offline, followed by an optional finetuning of the decoder to mitigate degradation from quantization. This strategy allows to consider any off-the-shelf quantizer, especially state-of-the-art trainable quantizers with implicit neural codebooks such as QINCO2. We demonstrate that with the latter, our proposed codec termed QINCODEC, is competitive with baseline codecs while being notably simpler to train. Finally, our approach provides a general framework that amortizes the cost of autoencoder pretraining, and enables more flexible codec design.

Subjects:	Sound (cs.SD); Signal Processing (eess.SP)
Cite as:	arXiv:2503.19597 [cs.SD]
	(or arXiv:2503.19597v1 [cs.SD] for this version)
	https://doi.org/10.48550/arXiv.2503.19597

Submission history

From: Zineb Lahrichi [view email] [via CCSD proxy]
[v1] Wed, 19 Mar 2025 09:06:13 UTC (494 KB)

Computer Science > Sound

Title:QINCODEC: Neural Audio Compression with Implicit Neural Codebooks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Sound

Title:QINCODEC: Neural Audio Compression with Implicit Neural Codebooks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators