Learning to Rank in Generative Retrieval

Li, Yongqi; Yang, Nan; Wang, Liang; Wei, Furu; Li, Wenjie

Computer Science > Computation and Language

arXiv:2306.15222 (cs)

[Submitted on 27 Jun 2023 (v1), last revised 16 Dec 2023 (this version, v2)]

Title:Learning to Rank in Generative Retrieval

Authors:Yongqi Li, Nan Yang, Liang Wang, Furu Wei, Wenjie Li

View PDF HTML (experimental)

Abstract:Generative retrieval stands out as a promising new paradigm in text retrieval that aims to generate identifier strings of relevant passages as the retrieval target. This generative paradigm taps into powerful generative language models, distinct from traditional sparse or dense retrieval methods. However, only learning to generate is insufficient for generative retrieval. Generative retrieval learns to generate identifiers of relevant passages as an intermediate goal and then converts predicted identifiers into the final passage rank list. The disconnect between the learning objective of autoregressive models and the desired passage ranking target leads to a learning gap. To bridge this gap, we propose a learning-to-rank framework for generative retrieval, dubbed LTRGR. LTRGR enables generative retrieval to learn to rank passages directly, optimizing the autoregressive model toward the final passage ranking target via a rank loss. This framework only requires an additional learning-to-rank training phase to enhance current generative retrieval systems and does not add any burden to the inference stage. We conducted experiments on three public benchmarks, and the results demonstrate that LTRGR achieves state-of-the-art performance among generative retrieval methods. The code and checkpoints are released at this https URL.

Comments:	AAAI 2024
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR)
Cite as:	arXiv:2306.15222 [cs.CL]
	(or arXiv:2306.15222v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.15222

Submission history

From: Yongqi Li [view email]
[v1] Tue, 27 Jun 2023 05:48:14 UTC (7,097 KB)
[v2] Sat, 16 Dec 2023 13:26:02 UTC (2,299 KB)

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computation and Language

Title:Learning to Rank in Generative Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

✅2024-10-01: arxiv.org is back to normal.✅

Computer Science > Computation and Language

Title:Learning to Rank in Generative Retrieval

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators