Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

Yu, Yijiong

Computer Science > Computation and Language

arXiv:2503.20533 (cs)

[Submitted on 26 Mar 2025 (v1), last revised 2 Apr 2025 (this version, v2)]

Title:Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

Authors:Yijiong Yu

View PDF

Abstract:Recent advances in reasoning models have demonstrated significant improvements in accuracy, particularly for complex tasks such as mathematical reasoning, by employing detailed and comprehensive reasoning processes. However, generating these lengthy reasoning sequences is computationally expensive and time-consuming. To address this inefficiency, we leverage the inherent parallelizability of certain tasks to accelerate the reasoning process. Specifically, when multiple parallel reasoning branches exist, we decode multiple tokens per step using a specialized attention mask, processing them within a single sequence, avoiding additional memory usage. Experimental results show that our method achieves over 100% speedup in decoding time while maintaining the answer quality.

Comments:	Our code is available in this https URL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2503.20533 [cs.CL]
	(or arXiv:2503.20533v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2503.20533

Submission history

From: Yijiong Yu [view email]
[v1] Wed, 26 Mar 2025 13:28:57 UTC (418 KB)
[v2] Wed, 2 Apr 2025 08:29:16 UTC (420 KB)

Computer Science > Computation and Language

Title:Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Accelerate Parallelizable Reasoning via Parallel Decoding within One Sequence

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators