Bilevel Optimization without Lower-Level Strong Convexity from the Hyper-Objective Perspective

Chen, Lesi; Xu, Jing; Zhang, Jingzhao

Mathematics > Optimization and Control

arXiv:2301.00712v3 (math)

[Submitted on 2 Jan 2023 (v1), revised 29 May 2023 (this version, v3), latest version 5 Jan 2025 (v7)]

Title:Bilevel Optimization without Lower-Level Strong Convexity from the Hyper-Objective Perspective

Authors:Lesi Chen, Jing Xu, Jingzhao Zhang

View PDF

Abstract:Bilevel optimization reveals the inner structure of otherwise oblique optimization problems, such as hyperparameter tuning and meta-learning. A common goal in bilevel optimization is to find stationary points of the hyper-objective function. Although this hyper-objective approach is widely used, its theoretical properties have not been thoroughly investigated in cases where the lower-level functions lack strong convexity. In this work, we take a step forward and study the hyper-objective approach without the typical lower-level strong convexity assumption. Our hardness results show that the hyper-objective of general convex lower-level functions can be intractable either to evaluate or to optimize. To tackle this challenge, we introduce the gradient dominant condition, which strictly relaxes the strong convexity assumption by allowing the lower-level solution set to be non-singleton. Under the gradient dominant condition, we propose the Inexact Gradient-Free Method (IGFM), which uses the Switching Gradient Method (SGM) as the zeroth order oracle, to find an approximate stationary point of the hyper-objective. We also extend our results to nonsmooth lower-level functions under the weak sharp minimum condition.

Subjects:	Optimization and Control (math.OC); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2301.00712 [math.OC]
	(or arXiv:2301.00712v3 [math.OC] for this version)
	https://doi.org/10.48550/arXiv.2301.00712

Submission history

From: Lesi Chen [view email]
[v1] Mon, 2 Jan 2023 15:09:12 UTC (45 KB)
[v2] Fri, 26 May 2023 08:59:55 UTC (798 KB)
[v3] Mon, 29 May 2023 01:07:24 UTC (798 KB)
[v4] Thu, 8 Feb 2024 07:49:07 UTC (835 KB)
[v5] Tue, 14 May 2024 10:32:46 UTC (75 KB)
[v6] Sat, 28 Dec 2024 05:44:48 UTC (75 KB)
[v7] Sun, 5 Jan 2025 06:43:46 UTC (75 KB)

Mathematics > Optimization and Control

Title:Bilevel Optimization without Lower-Level Strong Convexity from the Hyper-Objective Perspective

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Mathematics > Optimization and Control

Title:Bilevel Optimization without Lower-Level Strong Convexity from the Hyper-Objective Perspective

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators