A Survey of Query Optimization in Large Language Models

Song, Mingyang; Zheng, Mao

arXiv:2412.17558 (cs)

[Submitted on 23 Dec 2024]

Abstract: Query Optimization (QO) refers to techniques aimed at enhancing the efficiency and quality of Large Language Models (LLMs) in understanding and answering queries, especially complex ones in scenarios like Retrieval-Augmented Generation (RAG). Specifically, RAG mitigates the limitations of LLMs by dynamically retrieving and leveraging up-to-date relevant information, which provides a cost-effective solution to the challenge of LLMs producing plausible but potentially inaccurate responses. Recently, as RAG evolves and incorporates multiple components that influence its performance, QO has emerged as a critical element, playing a pivotal role in determining the effectiveness of RAG's retrieval stage in accurately sourcing the necessary multiple pieces of evidence to answer queries correctly. In this paper, we trace the evolution of QO techniques by summarizing and analyzing significant studies. Through an organized framework and categorization, we aim to consolidate existing QO techniques in RAG, elucidate their technological foundations, and highlight their potential to enhance the versatility and applications of LLMs.

Comments:	Ongoing Work
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.17558 [cs.CL]
	(or arXiv:2412.17558v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.17558

Submission history

From: Mingyang Song
[v1] Mon, 23 Dec 2024 13:26:04 UTC (1,340 KB)
