Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Lin, Jingyang; Wong, Andy; Xia, Tian; He, Shenghua; Wei, Hui; Han, Mei; Luo, Jiebo

Computer Science > Computation and Language

arXiv:2502.13127 (cs)

[Submitted on 18 Feb 2025]

Title:Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Authors:Jingyang Lin, Andy Wong, Tian Xia, Shenghua He, Hui Wei, Mei Han, Jiebo Luo

View PDF HTML (experimental)

Abstract:Recent advances in Large Language Models (LLMs) have enabled them to process increasingly longer sequences, ranging from 2K to 2M tokens and even beyond. However, simply extending the input sequence length does not necessarily lead to effective long-context understanding. In this study, we integrate Chain-of-Thought (CoT) reasoning into LLMs in a supervised manner to facilitate effective long-context understanding. To achieve this, we introduce LongFinanceQA, a synthetic dataset in the financial domain designed to improve long-context reasoning. Unlike existing long-context synthetic data, LongFinanceQA includes intermediate CoT reasoning before the final conclusion, which encourages LLMs to perform explicit reasoning, improving accuracy and interpretability in long-context understanding. To generate synthetic CoT reasoning, we propose Property-driven Agentic Inference (PAI), an agentic framework that simulates human-like reasoning steps, including property extraction, retrieval, and summarization. We evaluate PAI's reasoning capabilities by assessing GPT-4o-mini w/ PAI on the Loong benchmark, outperforming standard GPT-4o-mini by 20.0%. Furthermore, we fine-tune LLaMA-3.1-8B-Instruct on LongFinanceQA, achieving a 24.6% gain on Loong's financial subset.

Comments:	15 Pages, 6 Tables, 8 Figures
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2502.13127 [cs.CL]
	(or arXiv:2502.13127v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2502.13127

Submission history

From: Jingyang Lin [view email]
[v1] Tue, 18 Feb 2025 18:50:06 UTC (2,271 KB)

Computer Science > Computation and Language

Title:Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators