Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning

Rukhovich, Alexey; Podolskiy, Alexander; Piontkovskaya, Irina

Computer Science > Machine Learning

arXiv:2501.15556 (cs)

[Submitted on 26 Jan 2025]

Title: Commute Your Domains: Trajectory Optimality Criterion for Multi-Domain Learning

Authors: Alexey Rukhovich, Alexander Podolskiy, Irina Piontkovskaya


Abstract: In multi-domain learning, a single model is trained on diverse data domains to leverage shared knowledge and improve generalization. The order in which data from these domains is used for training can significantly affect the model's performance on each domain, yet this dependence is under-studied. In this paper, we investigate the influence of training order (or data mixing) in multi-domain learning using the concept of the Lie bracket of gradient vector fields. By analyzing the infinitesimal effects of changing the training order, we identify regions of the parameter space where altering the order between two training domains can benefit the target loss. We validate the predictions of our theoretical framework on both a toy example and bilingual LLM pre-training.
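The role of the Lie bracket can be illustrated on a small hypothetical case that is not taken from the paper. The sketch below assumes two quadratic "domain" losses with made-up matrices `A1`, `A2` and vectors `b1`, `b2`, plain gradient descent with a shared learning rate `lr`, and one common sign convention for the bracket. Under those assumptions, the gap between training on domain 1 then domain 2 and training in the reverse order equals `lr**2` times the bracket of the two gradient fields.

```python
# Illustrative sketch only (all names and numbers are hypothetical).
# Each domain loss is L_i(theta) = 0.5 * theta^T A_i theta - b_i^T theta,
# so grad L_i(theta) = A_i theta - b_i and the Hessian is the constant A_i.

def matvec(A, x):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def sub(x, y):
    """Elementwise difference x - y."""
    return [a - b for a, b in zip(x, y)]

def scale(c, x):
    """Multiply a vector by a scalar."""
    return [c * a for a in x]

def grad(A, b, theta):
    """Gradient of the quadratic domain loss at theta."""
    return sub(matvec(A, theta), b)

def gd_step(A, b, theta, lr):
    """One gradient-descent step on a single domain."""
    return sub(theta, scale(lr, grad(A, b, theta)))

def lie_bracket(A1, b1, A2, b2, theta):
    """Bracket of the gradient fields, using the convention
    [g1, g2] = H2 g1 - H1 g2, where H_i is the Hessian of L_i."""
    g1 = grad(A1, b1, theta)
    g2 = grad(A2, b2, theta)
    return sub(matvec(A2, g1), matvec(A1, g2))

# Two hypothetical domains whose Hessians do not commute.
A1 = [[2.0, 0.5], [0.5, 1.0]]
b1 = [1.0, 0.0]
A2 = [[1.0, -0.3], [-0.3, 3.0]]
b2 = [0.0, 1.0]
theta0 = [0.7, -0.4]
lr = 0.1

# Domain 1 first, then domain 2, and the reverse order.
theta_12 = gd_step(A2, b2, gd_step(A1, b1, theta0, lr), lr)
theta_21 = gd_step(A1, b1, gd_step(A2, b2, theta0, lr), lr)

order_gap = sub(theta_12, theta_21)
predicted_gap = scale(lr ** 2, lie_bracket(A1, b1, A2, b2, theta0))

print("order gap:    ", order_gap)
print("lr^2 * bracket:", predicted_gap)
```

For quadratic losses the match is exact up to floating-point error; for general losses it would hold only to second order in the learning rate. Where the bracket vanishes, the two orders lead to the same parameters at this order, so swapping the domains has no effect on any downstream loss.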

Comments: NeurIPS 2024 Workshop on Mathematics of Modern Machine Learning
Subjects: Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as: arXiv:2501.15556 [cs.LG] (or arXiv:2501.15556v1 [cs.LG] for this version)
https://doi.org/10.48550/arXiv.2501.15556

Submission history

From: Alexander Podolskiy Vadimovich
[v1] Sun, 26 Jan 2025 15:12:06 UTC (672 KB)

