Faster Machine Translation Ensembling with Reinforcement Learning and Competitive Correction

Prasad, Kritarth; Zaki, Mohammadi; Singh, Pratik; Wasnik, Pankaj

arXiv:2501.15219 (cs)

[Submitted on 25 Jan 2025]

Abstract: Ensembling $L$ neural machine translation (NMT) models to produce translations of higher quality than any individual model has been extensively studied. Recent methods typically employ a candidate selection block (CSB) and an encoder-decoder fusion block (FB), which requires inference across all $L$ candidate models and incurs significant computational overhead, generally $\Omega(L)$. This paper introduces SmartGen, a reinforcement learning (RL)-based strategy that improves the CSB by selecting a small, fixed number of candidates and identifying optimal groups to pass to the fusion block for each input sentence. In earlier work, the CSB and FB were also trained independently, leading to suboptimal NMT performance. Our DQN-based SmartGen addresses this by using feedback from the FB as a reward during training. We also resolve a key issue in earlier methods, where candidates were passed to the FB without modification, by introducing a Competitive Correction Block (CCB). Finally, we validate our approach with extensive experiments on English-Hindi translation tasks in both directions.
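
The selection idea in the abstract (pick a small, fixed-size group of candidate models and learn which group to use from fusion-block feedback) can be illustrated with a deliberately simplified sketch. This is not the authors' implementation: it replaces the DQN with a tabular, bandit-style value estimate that ignores the input sentence, and `toy_fusion_reward` is a hypothetical stand-in for whatever quality score the fusion block returns. All function names and parameter values here are assumptions made for illustration.

```python
import itertools
import random


def candidate_groups(num_models, group_size):
    """Enumerate every way to pick `group_size` of the `num_models` systems."""
    return list(itertools.combinations(range(num_models), group_size))


def select_group(q_values, epsilon, rng):
    """Epsilon-greedy choice: explore a random group, else take the best so far."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])


def q_update(q_values, action, reward, lr):
    """Move the chosen group's value estimate toward the observed reward."""
    q_values[action] += lr * (reward - q_values[action])


def toy_fusion_reward(group):
    # Hypothetical placeholder for fusion-block feedback:
    # pretend that fusing models 1 and 3 gives the best translation.
    return 1.0 if group == (1, 3) else 0.2


groups = candidate_groups(num_models=4, group_size=2)
q = [0.0] * len(groups)
rng = random.Random(0)

for _ in range(2000):
    a = select_group(q, epsilon=0.2, rng=rng)
    q_update(q, a, toy_fusion_reward(groups[a]), lr=0.1)

best = groups[max(range(len(groups)), key=lambda i: q[i])]
print(len(groups), best)
```

In this toy setting only `group_size` models are run per step rather than all `num_models`, which is the source of the cost saving the abstract describes. A DQN, as named in the abstract, would replace the table `q` with a network that maps a representation of the input sentence to one value per group.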

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2501.15219 [cs.CL]
	(or arXiv:2501.15219v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.15219

Submission history

From: Mohammadi Zaki
[v1] Sat, 25 Jan 2025 13:50:18 UTC (2,301 KB)
