Auto-Evolve: Enhancing Large Language Model's Performance via Self-Reasoning Framework

Aswani, Krishna; Lu, Huilin; Patankar, Pranav; Dhalwani, Priya; Tan, Iris; Ganeshmohan, Jayant; Lacasse, Simon

arXiv:2410.06328 (cs)

[Submitted on 8 Oct 2024 (v1), last revised 11 Oct 2024 (this version, v2)]

Abstract: Recent advancements in prompt engineering strategies, such as Chain-of-Thought (CoT) and Self-Discover, have demonstrated significant potential in improving the reasoning abilities of Large Language Models (LLMs). However, these state-of-the-art (SOTA) prompting strategies rely on a single or fixed set of static seed reasoning modules, such as "think step by step" or "break down this problem", intended to simulate a human approach to problem-solving. This constraint limits the flexibility of models in tackling diverse problems effectively. In this paper, we introduce Auto-Evolve, a novel framework that enables LLMs to self-create dynamic reasoning modules and a downstream action plan, resulting in significant improvements over current SOTA methods. We evaluate Auto-Evolve on the challenging BigBench-Hard (BBH) dataset with Claude 2.0, Claude 3 Sonnet, Mistral Large, and GPT 4, where it consistently outperforms the SOTA prompt strategies. Auto-Evolve outperforms CoT by up to 10.4%, and by 7% on average across these four models. Our framework introduces two innovations: a) Auto-Evolve dynamically generates reasoning modules for each task while aligning with the human reasoning paradigm, thus eliminating the need for predefined templates. b) We introduce an iterative refinement component that incrementally refines instruction guidance for LLMs and helps boost performance by an average of 2.8% compared to doing it in a single step.
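The two ideas named in the abstract (task-specific reasoning modules generated by the model itself, followed by iterative refinement of the resulting plan) can be illustrated with a minimal sketch. This is not the authors' implementation: the prompt wording, the function names `generate_modules`, `refine_plan`, and `solve`, the `rounds` parameter, and the `LLM` callable type are all hypothetical, chosen only to show the general shape of such a loop.

```python
from typing import Callable, List

# Hypothetical interface: any function that maps a prompt to a completion.
LLM = Callable[[str], str]


def generate_modules(llm: LLM, task_description: str) -> List[str]:
    """Ask the model to propose its own reasoning modules for this task."""
    prompt = (
        "Propose reasoning modules (one per line) for solving this task:\n"
        f"{task_description}"
    )
    # Keep non-empty lines; each line is treated as one module.
    return [line.strip() for line in llm(prompt).splitlines() if line.strip()]


def refine_plan(llm: LLM, task_description: str, modules: List[str], rounds: int) -> str:
    """Draft a plan from the modules, then refine it `rounds` times."""
    plan = llm(
        "Draft a step-by-step plan for the task using these modules:\n"
        + "\n".join(modules)
        + f"\nTask: {task_description}"
    )
    for _ in range(rounds):
        # Each round feeds the previous plan back for incremental refinement.
        plan = llm(f"Refine this plan so it is more specific:\n{plan}")
    return plan


def solve(llm: LLM, plan: str, question: str) -> str:
    """Answer one question by following the refined plan."""
    return llm(f"Follow this plan:\n{plan}\nQuestion: {question}")
```

Setting `rounds=0` in this sketch corresponds to producing the plan in a single step, which is the baseline the abstract compares the iterative refinement component against. Any real use would replace the `LLM` callable with a client for an actual model.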

Comments:	Accepted at EMNLP 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2410.06328 [cs.CL]
	(or arXiv:2410.06328v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.06328

Submission history

From: Huilin Lu
[v1] Tue, 8 Oct 2024 20:07:47 UTC (5,260 KB)
[v2] Fri, 11 Oct 2024 20:39:00 UTC (5,260 KB)
