ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Zhai, Bohan; Xu, Canwen; He, Yuxiong; Yao, Zhewei

Computer Science > Machine Learning

arXiv:2503.19988 (cs)

[Submitted on 25 Mar 2025]

Title:ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Authors:Bohan Zhai, Canwen Xu, Yuxiong He, Zhewei Yao

View PDF HTML (experimental)

Abstract:Text-to-SQL demands precise reasoning to convert natural language questions into structured queries. While large language models (LLMs) excel in many reasoning tasks, their ability to leverage Chain-of-Thought (CoT) reasoning for text-to-SQL remains underexplored. We identify critical limitations: zero-shot CoT offers minimal gains, and Direct Preference Optimization (DPO) applied without CoT yields marginal improvements. We propose ExCoT, a novel framework that iteratively optimizes open-source LLMs by combining CoT reasoning with off-policy and on-policy DPO, relying solely on execution accuracy as feedback. This approach eliminates the need for reward models or human-annotated preferences.
Our experimental results demonstrate significant performance gains: ExCoT improves execution accuracy on BIRD dev set from 57.37% to 68.51% and on Spider test set from 78.81% to 86.59% for LLaMA-3 70B, with Qwen-2.5-Coder demonstrating similar improvements. Our best model achieves state-of-the-art performance in the single-model setting on both BIRD and Spider datasets, notably achieving 68.53% on the BIRD test set.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as:	arXiv:2503.19988 [cs.LG]
	(or arXiv:2503.19988v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.19988

Submission history

From: Bohan Zhai [view email]
[v1] Tue, 25 Mar 2025 18:17:36 UTC (190 KB)

Computer Science > Machine Learning

Title:ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:ExCoT: Optimizing Reasoning for Text-to-SQL with Execution Feedback

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators