Two-stage Conformal Risk Control with Application to Ranked Retrieval

Xu, Yunpeng; Ying, Mufang; Guo, Wenge; Wei, Zhi

Computer Science > Information Retrieval

arXiv:2404.17769 (cs)

[Submitted on 27 Apr 2024 (v1), last revised 2 Nov 2024 (this version, v2)]

Title:Two-stage Conformal Risk Control with Application to Ranked Retrieval

Authors:Yunpeng Xu, Mufang Ying, Wenge Guo, Zhi Wei

View PDF HTML (experimental)

Abstract:Many practical machine learning systems, such as ranking and recommendation systems, consist of two concatenated stages: retrieval and ranking. These systems present significant challenges in accurately assessing and managing the uncertainty inherent in their predictions. To address these challenges, we extend the recently developed framework of conformal risk control, originally designed for single-stage problems, to accommodate the more complex two-stage setup. We first demonstrate that a straightforward application of conformal risk control, treating each stage independently, may fail to maintain risk at their pre-specified levels. Therefore, we propose an integrated approach that considers both stages simultaneously, devising algorithms to control the risk of each stage by jointly identifying thresholds for both stages. Our algorithm further optimizes for a weighted combination of prediction set sizes across all feasible thresholds, resulting in more effective prediction sets. Finally, we apply the proposed method to the critical task of two-stage ranked retrieval. We validate the efficacy of our method through extensive experiments on two large-scale public datasets, MSLR-WEB and MS MARCO, commonly used for ranked retrieval tasks.

Comments:	13 pages, 3 figures; 5 supplementary pages, 3 supplementary figures
Subjects:	Information Retrieval (cs.IR); Methodology (stat.ME); Machine Learning (stat.ML)
Cite as:	arXiv:2404.17769 [cs.IR]
	(or arXiv:2404.17769v2 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.2404.17769

Submission history

From: Wenge Guo [view email]
[v1] Sat, 27 Apr 2024 03:37:12 UTC (775 KB)
[v2] Sat, 2 Nov 2024 08:06:32 UTC (845 KB)

Computer Science > Information Retrieval

Title:Two-stage Conformal Risk Control with Application to Ranked Retrieval

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Two-stage Conformal Risk Control with Application to Ranked Retrieval

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators