Dual-Pipeline with Low-Rank Adaptation for New Language Integration in Multilingual ASR

Khassanov, Yerbolat; Chen, Zhipeng; Chen, Tianfeng; Chong, Tze Yuang; Li, Wei; Zhang, Jun; Lu, Lu; Wang, Yuxuan


arXiv:2406.07842 (eess)

[Submitted on 12 Jun 2024]



Abstract: This paper addresses challenges in integrating new languages into a pre-trained multilingual automatic speech recognition (mASR) system, particularly in scenarios where training data for existing languages is limited or unavailable. The proposed method employs a dual-pipeline with low-rank adaptation (LoRA). It maintains two data flow pipelines: one for existing languages and another for new languages. The primary pipeline follows the standard flow through the pre-trained parameters of the mASR system, while the secondary pipeline additionally uses language-specific parameters, represented by LoRA, and a separate output decoder module. Importantly, the proposed approach minimizes the performance degradation of existing languages and enables a language-agnostic operation mode, facilitated by a decoder selection strategy. We validate the effectiveness of the proposed method by extending the pre-trained Whisper model to 19 new languages from the FLEURS dataset.
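
The routing idea in the abstract can be illustrated with a minimal sketch of a single linear layer. This is not the authors' implementation: the class name `DualPipelineLinear`, the `use_lora` flag, the layer sizes, and the rank and scaling values are all hypothetical, and the separate output decoder and the decoder selection strategy are not modeled because the abstract does not describe them. The sketch assumes the standard LoRA form, in which a frozen weight W is augmented by a scaled low-rank product B·A, with B initialized to zero.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class DualPipelineLinear:
    """One linear layer with a frozen base weight and an optional LoRA branch.

    Hypothetical illustration only; names, shapes, and values are assumptions.
    """

    def __init__(self, d_in, d_out, rank=2, alpha=4.0, seed=0):
        rng = random.Random(seed)
        # Stand-in for a pre-trained weight: frozen and shared by both pipelines.
        self.weight = [[rng.gauss(0.0, 0.1) for _ in range(d_in)]
                       for _ in range(d_out)]
        # LoRA factors: A is small and random, B starts at zero, so the
        # secondary pipeline initially matches the primary one.
        self.lora_a = [[rng.gauss(0.0, 0.01) for _ in range(d_in)]
                       for _ in range(rank)]
        self.lora_b = [[0.0] * rank for _ in range(d_out)]
        self.scale = alpha / rank

    def forward(self, x, use_lora):
        base = matvec(self.weight, x)
        if not use_lora:
            # Primary pipeline (existing languages): pre-trained weights only.
            return base
        # Secondary pipeline (new languages): add the low-rank update B(Ax).
        delta = matvec(self.lora_b, matvec(self.lora_a, x))
        return [b + self.scale * d for b, d in zip(base, delta)]


layer = DualPipelineLinear(d_in=4, d_out=3)
x = [1.0, -0.5, 0.25, 2.0]
primary_before = layer.forward(x, use_lora=False)

# Stand-in for fine-tuning on a new language: only the LoRA factor B changes.
layer.lora_b = [[0.5, -0.5] for _ in range(3)]

primary_after = layer.forward(x, use_lora=False)
secondary_after = layer.forward(x, use_lora=True)
```

Because the base weight is never modified, the primary pipeline's output is identical before and after the LoRA factors change. In this simplified setting, that is why existing languages are unaffected when a new language is added.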

Comments:	5 pages, 2 figures, 4 tables
Subjects:	Audio and Speech Processing (eess.AS); Computation and Language (cs.CL)
Cite as:	arXiv:2406.07842 [eess.AS]
	(or arXiv:2406.07842v1 [eess.AS] for this version)
	https://doi.org/10.48550/arXiv.2406.07842

Submission history

From: Yerbolat Khassanov
[v1] Wed, 12 Jun 2024 03:17:57 UTC (902 KB)
