Simple and Effective Unsupervised Speech Translation

Wang, Changhan; Inaguma, Hirofumi; Chen, Peng-Jen; Kulikov, Ilia; Tang, Yun; Hsu, Wei-Ning; Auli, Michael; Pino, Juan

arXiv:2210.10191 (cs)

[Submitted on 18 Oct 2022]

Abstract: The amount of labeled data available to train models for speech tasks is limited for most languages. This scarcity is exacerbated for speech translation, which requires labeled data covering two different languages. To address this issue, we study a simple and effective approach to building speech translation systems without labeled data by leveraging recent advances in unsupervised speech recognition, machine translation and speech synthesis, either in a pipeline approach or to generate pseudo-labels for training end-to-end speech translation models. Furthermore, we present an unsupervised domain adaptation technique for pre-trained speech models that improves the performance of downstream unsupervised speech recognition, especially in low-resource settings. Experiments show that unsupervised speech-to-text translation outperforms the previous unsupervised state of the art by 3.2 BLEU on the Libri-Trans benchmark. On CoVoST 2, our best systems outperform the best supervised end-to-end models (without pre-training) from only two years ago by an average of 5.0 BLEU over five X-En directions. We also report competitive results on the MuST-C and CVSS benchmarks.
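
The two usage modes named in the abstract (a pipeline of unsupervised components, and pseudo-label generation for end-to-end training) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `cascade` and `make_pseudo_labels`, the `Audio` type, and the toy `toy_asr` and `toy_mt` stand-ins are all hypothetical, chosen only to show how the pieces compose.

```python
from typing import Callable, List, Sequence, Tuple

# Hypothetical placeholder type: a waveform as a list of samples.
Audio = List[float]


def cascade(asr: Callable[[Audio], str],
            mt: Callable[[str], str]) -> Callable[[Audio], str]:
    """Pipeline mode: transcribe the audio, then translate the transcript."""
    def translate(audio: Audio) -> str:
        return mt(asr(audio))
    return translate


def make_pseudo_labels(unlabeled_audio: Sequence[Audio],
                       asr: Callable[[Audio], str],
                       mt: Callable[[str], str]) -> List[Tuple[Audio, str]]:
    """Pseudo-label mode: pair each unlabeled utterance with the cascade's
    output, giving (audio, translation) pairs that an end-to-end model
    could be trained on."""
    translate = cascade(asr, mt)
    return [(audio, translate(audio)) for audio in unlabeled_audio]


# Toy stand-ins (assumptions for illustration, not real models).
def toy_asr(audio: Audio) -> str:
    return "bonjour" if sum(audio) > 0 else "merci"


def toy_mt(text: str) -> str:
    return {"bonjour": "hello", "merci": "thank you"}[text]


pairs = make_pseudo_labels([[0.1, 0.2], [-0.3, 0.1]], toy_asr, toy_mt)
for audio, translation in pairs:
    print(audio, "->", translation)
```

In a real system the two callables would be an unsupervised speech recognizer and an unsupervised machine translation model; the sketch only shows that both modes share the same composition and differ in whether the output is returned to the user or stored as training targets.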

Subjects: Computation and Language (cs.CL); Sound (cs.SD); Audio and Speech Processing (eess.AS)
Cite as: arXiv:2210.10191 [cs.CL] (or arXiv:2210.10191v1 [cs.CL] for this version)
DOI: https://doi.org/10.48550/arXiv.2210.10191

Submission history

From: Changhan Wang
[v1] Tue, 18 Oct 2022 22:26:13 UTC (82 KB)
