Enhancing Hypergradients Estimation: A Study of Preconditioning and Reparameterization

Ye, Zhenzhang; Peyré, Gabriel; Cremers, Daniel; Ablin, Pierre

Computer Science > Machine Learning

arXiv:2402.16748 (cs)

[Submitted on 26 Feb 2024]

Abstract: Bilevel optimization aims to optimize an outer objective function that depends on the solution to an inner optimization problem. It is routinely used in machine learning, notably for hyperparameter tuning. The conventional method to compute the so-called hypergradient of the outer problem is to use the Implicit Function Theorem (IFT). We study the error of the IFT method as a function of the error in solving the inner problem. We analyze two strategies to reduce this error: preconditioning the IFT formula and reparameterizing the inner problem. We give a detailed account of the impact of these two modifications on the error, highlighting the role played by higher-order derivatives of the functionals at stake. Our theoretical findings explain when super efficiency, namely reaching an error on the hypergradient that depends quadratically on the error on the inner problem, is achievable, and compare the two approaches when this is impossible. Numerical evaluations on hyperparameter tuning for regression problems substantiate our theoretical findings.
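
To make the IFT hypergradient and its sensitivity to the inner error concrete, here is a minimal sketch on a toy scalar ridge-type problem. The problem, the constants `A`, `B`, `C`, `D`, and all function names are illustrative assumptions and are not taken from the paper. The sketch evaluates the plain IFT formula at the exact inner solution, checks it against a finite difference, and then evaluates it at perturbed inner solutions to show how the hypergradient error scales with the inner error in this toy case.

```python
# Toy scalar bilevel problem (an illustrative assumption, not the paper's setup):
#   inner:  g(z, lam) = 0.5 * (A*z - B)**2 + 0.5 * lam * z**2
#   outer:  h(lam)    = f(z_star(lam)),  with f(z) = 0.5 * (C*z - D)**2
A, B, C, D = 2.0, 3.0, 1.5, 1.0


def inner_solution(lam):
    """Closed-form minimizer of g(., lam)."""
    return A * B / (A * A + lam)


def outer_value(lam):
    """Outer objective evaluated at the exact inner solution."""
    z = inner_solution(lam)
    return 0.5 * (C * z - D) ** 2


def ift_hypergradient(z, lam):
    """Plain IFT formula evaluated at z; exact only if z is the inner solution."""
    grad_f = C * (C * z - D)   # df/dz
    hess_zz = A * A + lam      # d^2 g / dz^2
    cross = z                  # d^2 g / (dz dlam)
    return -grad_f * cross / hess_zz


lam = 0.5
z_star = inner_solution(lam)
exact = ift_hypergradient(z_star, lam)

# Independent check: central finite difference of the outer objective.
eps = 1e-6
fd = (outer_value(lam + eps) - outer_value(lam - eps)) / (2 * eps)
print("IFT hypergradient:", exact, "finite difference:", fd)

# Plug an inexact inner solution z_star + delta into the same formula.
errors = []
for delta in (1e-2, 1e-3, 1e-4):
    err = abs(ift_hypergradient(z_star + delta, lam) - exact)
    errors.append((delta, err))
    print("inner error:", delta, "hypergradient error:", err)
```

In this toy problem the hypergradient error shrinks roughly in proportion to the inner error, which is the linear regime that the preconditioning and reparameterization strategies studied in the paper aim to improve upon.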

Comments:	Accepted in AISTATS 2024
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.16748 [cs.LG]
	(or arXiv:2402.16748v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.16748

Submission history

From: Zhenzhang Ye
[v1] Mon, 26 Feb 2024 17:09:18 UTC (1,123 KB)
