Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Zhang, Yizhe; Bai, Richard; Gu, Zijin; Zhang, Ruixiang; Gu, Jiatao; Abbe, Emmanuel; Bengio, Samy; Jaitly, Navdeep

Computer Science > Computation and Language

arXiv:2502.18435 (cs)

[Submitted on 25 Feb 2025]

Title:Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Authors:Yizhe Zhang, Richard Bai, Zijin Gu, Ruixiang Zhang, Jiatao Gu, Emmanuel Abbe, Samy Bengio, Navdeep Jaitly

View PDF HTML (experimental)

Abstract:Language models usually use left-to-right (L2R) autoregressive factorization. However, L2R factorization may not always be the best inductive bias. Therefore, we investigate whether alternative factorizations of the text distribution could be beneficial in some tasks. We investigate right-to-left (R2L) training as a compelling alternative, focusing on multiple-choice questions (MCQs) as a test bed for knowledge extraction and reasoning. Through extensive experiments across various model sizes (2B-8B parameters) and training datasets, we find that R2L models can significantly outperform L2R models on several MCQ benchmarks, including logical reasoning, commonsense understanding, and truthfulness assessment tasks. Our analysis reveals that this performance difference may be fundamentally linked to multiple factors including calibration, computability and directional conditional entropy. We ablate the impact of these factors through controlled simulation studies using arithmetic tasks, where the impacting factors can be better disentangled. Our work demonstrates that exploring alternative factorizations of the text distribution can lead to improvements in LLM capabilities and provides theoretical insights into optimal factorization towards approximating human language distribution, and when each reasoning order might be more advantageous.

Subjects:	Computation and Language (cs.CL); Information Theory (cs.IT); Machine Learning (cs.LG)
Cite as:	arXiv:2502.18435 [cs.CL]
	(or arXiv:2502.18435v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.18435

Submission history

From: Yizhe Zhang [view email]
[v1] Tue, 25 Feb 2025 18:30:25 UTC (2,462 KB)

Computer Science > Computation and Language

Title:Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators