Learning to Stop Overthinking at Test Time

Bao, Hieu Tran; Dat, Nguyen Cong; Anh, Nguyen Duc; Thanh-Tung, Hoang

arXiv:2502.10954 (cs)

[Submitted on 16 Feb 2025 (v1), last revised 18 Feb 2025 (this version, v2)]

Abstract: Test time scaling is currently one of the most active research areas that shows promise after training time scaling has reached its limits. Deep-thinking (DT) models are a class of recurrent models that can perform easy-to-hard generalization by assigning more compute to harder test samples. However, due to their inability to determine the complexity of a test sample, DT models have to use a large amount of computation for both easy and hard test samples. Excessive test time computation is wasteful and can cause the "overthinking" problem where more test time computation leads to worse results. In this paper, we introduce a test time training method for determining the optimal amount of computation needed for each sample during test time. We also propose Conv-LiGRU, a novel recurrent architecture for efficient and robust visual reasoning. Extensive experiments demonstrate that Conv-LiGRU is more stable than DT, effectively mitigates the "overthinking" phenomenon, and achieves superior accuracy.
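
The idea of spending fewer iterations on easy inputs and more on hard ones can be illustrated with a toy sketch. This is not the authors' method (the abstract describes a test time training procedure whose details are not given here); it only shows a generic convergence-based halting rule applied to an iterative update. The names `run_with_early_stop` and `babylonian_step`, the tolerance, and the iteration budget are all hypothetical, and a scalar square-root iteration stands in for a recurrent model.

```python
from typing import Callable, Tuple


def run_with_early_stop(
    step: Callable[[float, float], float],
    x: float,
    init_state: float,
    max_iters: int = 100,
    tol: float = 1e-9,
) -> Tuple[float, int]:
    """Apply `step` repeatedly; stop once the state barely changes.

    Returns the final state and the number of iterations actually used.
    The halting rule (absolute change below `tol`) is an assumption made
    for illustration, not the criterion proposed in the paper.
    """
    state = init_state
    for n in range(1, max_iters + 1):
        new_state = step(state, x)
        if abs(new_state - state) < tol:
            return new_state, n  # converged: stop spending compute
        state = new_state
    return state, max_iters  # budget exhausted without convergence


def babylonian_step(state: float, x: float) -> float:
    # Toy stand-in for one recurrent update: a Newton step toward sqrt(x).
    return 0.5 * (state + x / state)


# An "easy" input (close to the initial guess) and a "hard" one (far from it).
easy_answer, easy_iters = run_with_early_stop(babylonian_step, 1.21, 1.0)
hard_answer, hard_iters = run_with_early_stop(babylonian_step, 1.0e6, 1.0)

print(f"easy: {easy_answer:.6f} after {easy_iters} iterations")
print(f"hard: {hard_answer:.6f} after {hard_iters} iterations")
```

With a fixed budget of `max_iters` for every input, the easy case would run far longer than it needs to. The halting check lets each input use only as many steps as it requires, which is the efficiency concern the abstract raises.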

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2502.10954 [cs.CV]
	(or arXiv:2502.10954v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2502.10954

Submission history

From: Thanh-Tung Hoang
[v1] Sun, 16 Feb 2025 02:17:05 UTC (18,113 KB)
[v2] Tue, 18 Feb 2025 03:41:03 UTC (18,113 KB)
