DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks

Chen, Zhiliang; Lau, Gregory Kang Ruey; Foo, Chuan-Sheng; Low, Bryan Kian Hsiang

Computer Science > Machine Learning

arXiv:2502.00270 (cs)

[Submitted on 1 Feb 2025]

Title:DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks

Authors:Zhiliang Chen, Gregory Kang Ruey Lau, Chuan-Sheng Foo, Bryan Kian Hsiang Low

View PDF HTML (experimental)

Abstract:The performance of a machine learning (ML) model depends heavily on the relevance of its training data to the domain of the downstream evaluation task. However, in practice, the data involved in an unseen evaluation task is often not known to us (e.g., conversations between an LLM and a user are end-to-end encrypted). So, it is not obvious what data would be relevant for training/fine-tuning the ML model to maximize its task performance. Instead, one can only deploy the ML model in the unseen evaluation task to gather multiple rounds of coarse feedback on how well the model has performed. This paper presents a novel global-to-local algorithm called DUET that can exploit the feedback loop by interleaving a data selection method with Bayesian optimization. As a result, DUET can efficiently refine the training data mixture from a pool of data domains to maximize the model's performance on the unseen evaluation task and its convergence to the optimal data mixture can be theoretically guaranteed by analyzing its cumulative regret. Empirical evaluation on image and LLM evaluation tasks shows that DUET finds better training data mixtures than conventional baselines.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
Cite as:	arXiv:2502.00270 [cs.LG]
	(or arXiv:2502.00270v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.00270

Submission history

From: Zhiliang Chen [view email]
[v1] Sat, 1 Feb 2025 01:52:32 UTC (6,976 KB)

Computer Science > Machine Learning

Title:DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators