Efficient Reasoning with Hidden Thinking

Shen, Xuan; Wang, Yizhou; Shi, Xiangxi; Wang, Yanzhi; Zhao, Pu; Gu, Jiuxiang

arXiv:2501.19201 (cs)

[Submitted on 31 Jan 2025]


Abstract: Chain-of-Thought (CoT) reasoning has become a powerful framework for improving complex problem-solving capabilities in Multimodal Large Language Models (MLLMs). However, the verbose nature of textual reasoning introduces significant inefficiencies. In this work, we propose $\textbf{Heima}$ (as hidden llama), an efficient reasoning framework that carries out CoT reasoning in a hidden latent space. We design the Heima Encoder to condense each intermediate CoT into a compact, higher-level hidden representation using a single thinking token, effectively minimizing verbosity and reducing the overall number of tokens required during the reasoning process. Meanwhile, we design a corresponding Heima Decoder, built on traditional Large Language Models (LLMs), that adaptively interprets the hidden representations as variable-length textual sequences, reconstructing reasoning processes that closely resemble the original CoTs. Experimental results across diverse reasoning MLLM benchmarks demonstrate that the Heima model achieves higher generation efficiency while maintaining or even improving zero-shot task accuracy. Moreover, the effective reconstruction of multimodal reasoning processes with the Heima Decoder validates both the robustness and interpretability of our approach.
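To make the token-saving idea concrete, the following is a minimal sketch of the bookkeeping only, not the authors' implementation. It assumes a CoT split into textual stages and replaces each stage with one placeholder thinking token. The names `condense_cot`, `count_tokens`, and the `<THINK_i>` token format are hypothetical, and the whitespace split stands in for a real tokenizer. In the actual framework each thinking token would carry a learned hidden representation, which this sketch does not model.

```python
from typing import List


def condense_cot(stages: List[str], token_format: str = "<THINK_{i}>") -> List[str]:
    """Replace each textual CoT stage with one placeholder thinking token.

    Hypothetical helper: only the sequence-length effect is illustrated here.
    """
    return [token_format.format(i=i) for i, _ in enumerate(stages)]


def count_tokens(texts: List[str]) -> int:
    """Count whitespace-separated tokens (an assumed stand-in for a tokenizer)."""
    return sum(len(text.split()) for text in texts)


# Two illustrative CoT stages (invented example text, not from the paper).
cot_stages = [
    "The image shows three red apples and two green apples on a table.",
    "Adding the red and green apples gives three plus two, which is five.",
]

condensed = condense_cot(cot_stages)
verbose_len = count_tokens(cot_stages)
condensed_len = count_tokens(condensed)

print(condensed)
print(f"verbose tokens: {verbose_len}, condensed tokens: {condensed_len}")
```

Under these assumptions the condensed sequence has one token per reasoning stage regardless of how long the original stage text was, which is the source of the efficiency gain the abstract describes.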

Comments:	Preprint version
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2501.19201 [cs.CL]
	(or arXiv:2501.19201v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2501.19201

Submission history

From: Xuan Shen
[v1] Fri, 31 Jan 2025 15:10:29 UTC (576 KB)
