LLM$\times$MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources

Wang, Haoyu; Fu, Yujia; Zhang, Zhu; Wang, Shuo; Ren, Zirui; Wang, Xiaorong; Li, Zhili; He, Chaoqun; An, Bo; Liu, Zhiyuan; Sun, Maosong

Computer Science > Computation and Language

arXiv:2504.05732 (cs)

[Submitted on 8 Apr 2025 (v1), last revised 15 Apr 2025 (this version, v2)]

Title:LLM$\times$MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources

Authors:Haoyu Wang, Yujia Fu, Zhu Zhang, Shuo Wang, Zirui Ren, Xiaorong Wang, Zhili Li, Chaoqun He, Bo An, Zhiyuan Liu, Maosong Sun

View PDF HTML (experimental)

Abstract:Long-form generation is crucial for a wide range of practical applications, typically categorized into short-to-long and long-to-long generation. While short-to-long generations have received considerable attention, generating long texts from extremely long resources remains relatively underexplored. The primary challenge in long-to-long generation lies in effectively integrating and analyzing relevant information from extensive inputs, which remains difficult for current large language models (LLMs). In this paper, we propose LLM$\times$MapReduce-V2, a novel test-time scaling strategy designed to enhance the ability of LLMs to process extremely long inputs. Drawing inspiration from convolutional neural networks, which iteratively integrate local features into higher-level global representations, LLM$\times$MapReduce-V2 utilizes stacked convolutional scaling layers to progressively expand the understanding of input materials. Both quantitative and qualitative experimental results demonstrate that our approach substantially enhances the ability of LLMs to process long inputs and generate coherent, informative long-form articles, outperforming several representative baselines. Both LLM$\times$MapReduce-V2 and SurveyEval are publicly available at this https URL .

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2504.05732 [cs.CL]
	(or arXiv:2504.05732v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2504.05732

Submission history

From: Haoyu Wang [view email]
[v1] Tue, 8 Apr 2025 07:03:48 UTC (1,974 KB)
[v2] Tue, 15 Apr 2025 03:28:58 UTC (1,974 KB)

Computer Science > Computation and Language

Title:LLM$\times$MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:LLM$\times$MapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators