DialUp! Modeling the Language Continuum by Adapting Models to Dialects and Dialects to Models

Bafna, Niyati; Chang, Emily; Robinson, Nathaniel R.; Mortensen, David R.; Murray, Kenton; Yarowsky, David; Sirin, Hale

arXiv:2501.16581 (cs)

[Submitted on 27 Jan 2025]

Abstract: Most of the world's languages and dialects are low-resource and lack support in mainstream machine translation (MT) models. However, many of them have a closely related high-resource language (HRL) neighbor, and differ in linguistically regular ways from it. This underscores the importance of model robustness to dialectical variation and cross-lingual generalization to the HRL dialect continuum. We present DialUp, consisting of a training-time technique for adapting a pretrained model to dialectical data (M->D), and an inference-time intervention adapting dialectical data to the model expertise (D->M). M->D induces model robustness to potentially unseen and unknown dialects by exposure to synthetic data exemplifying linguistic mechanisms of dialectical variation, whereas D->M treats dialectical divergence for known target dialects. These methods show considerable performance gains for several dialects from four language families, and modest gains for two other language families. We also conduct feature and error analyses, which show that language varieties with low baseline MT performance are more likely to benefit from these approaches.

Comments:	9 pages, 46 incl. appendix
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2501.16581 [cs.CL]
	(or arXiv:2501.16581v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.16581

Submission history

From: Niyati Bafna
[v1] Mon, 27 Jan 2025 23:53:04 UTC (4,981 KB)
