What Do Compressed Multilingual Machine Translation Models Forget?

Mohammadshahi, Alireza; Nikoulina, Vassilina; Berard, Alexandre; Brun, Caroline; Henderson, James; Besacier, Laurent

Computer Science > Computation and Language

arXiv:2205.10828 (cs)

[Submitted on 22 May 2022 (v1), last revised 27 Jun 2023 (this version, v4)]

Title:What Do Compressed Multilingual Machine Translation Models Forget?

Authors:Alireza Mohammadshahi, Vassilina Nikoulina, Alexandre Berard, Caroline Brun, James Henderson, Laurent Besacier

View PDF

Abstract:Recently, very large pre-trained models achieve state-of-the-art results in various natural language processing (NLP) tasks, but their size makes it more challenging to apply them in resource-constrained environments. Compression techniques allow to drastically reduce the size of the models and therefore their inference time with negligible impact on top-tier metrics. However, the general performance averaged across multiple tasks and/or languages may hide a drastic performance drop on under-represented features, which could result in the amplification of biases encoded by the models. In this work, we assess the impact of compression methods on Multilingual Neural Machine Translation models (MNMT) for various language groups, gender, and semantic biases by extensive analysis of compressed models on different machine translation benchmarks, i.e. FLORES-101, MT-Gender, and DiBiMT. We show that the performance of under-represented languages drops significantly, while the average BLEU metric only slightly decreases. Interestingly, the removal of noisy memorization with compression leads to a significant improvement for some medium-resource languages. Finally, we demonstrate that compression amplifies intrinsic gender and semantic biases, even in high-resource languages. Code: this https URL

Comments:	Accepted to Findings of EMNLP 2022, presented at WMT 2022
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2205.10828 [cs.CL]
	(or arXiv:2205.10828v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2205.10828
Journal reference:	https://aclanthology.org/2022.findings-emnlp.317/

Submission history

From: Alireza Mohammadshahi [view email]
[v1] Sun, 22 May 2022 13:54:44 UTC (6,823 KB)
[v2] Thu, 20 Oct 2022 22:14:23 UTC (12,570 KB)
[v3] Tue, 27 Dec 2022 15:56:57 UTC (12,573 KB)
[v4] Tue, 27 Jun 2023 09:34:34 UTC (3,857 KB)

Computer Science > Computation and Language

Title:What Do Compressed Multilingual Machine Translation Models Forget?

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:What Do Compressed Multilingual Machine Translation Models Forget?

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators