On Penalty-based Bilevel Gradient Descent Method
From MaRDI portal
Publication:6426038
arXiv2302.05185MaRDI QIDQ6426038
Author name not available (Why is that?)
Publication date: 10 February 2023
Abstract: Bilevel optimization enjoys a wide range of applications in hyper-parameter optimization, meta-learning and reinforcement learning. However, bilevel optimization problems are difficult to solve. Recent progress on scalable bilevel algorithms mainly focuses on bilevel optimization problems where the lower-level objective is either strongly convex or unconstrained. In this work, we tackle the bilevel problem through the lens of the penalty method. We show that under certain conditions, the penalty reformulation recovers the solutions of the original bilevel problem. Further, we propose the penalty-based bilevel gradient descent (PBGD) algorithm and establish its finite-time convergence for the constrained bilevel problem without lower-level strong convexity. Experiments showcase the efficiency of the proposed PBGD algorithm.
Has companion code repository: https://github.com/hanshen95/penalized-bilevel-gradient-descent
This page was built for publication: On Penalty-based Bilevel Gradient Descent Method
Report a bug (only for logged in users!)Click here to report a bug for this page (MaRDI item Q6426038)