Back to Search
Start Over
A General Descent Aggregation Framework for Gradient-Based Bi-Level Optimization
- Source :
- IEEE Transactions on Pattern Analysis and Machine Intelligence. 45:38-57
- Publication Year :
- 2023
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2023.
-
Abstract
- In recent years, a variety of gradient-based methods have been developed to solve Bi-Level Optimization (BLO) problems in machine learning and computer vision areas. However, the theoretical correctness and practical effectiveness of these existing approaches always rely on some restrictive conditions (e.g., Lower-Level Singleton, LLS), which could hardly be satisfied in real-world applications. Moreover, previous literature only proves theoretical results based on their specific iteration strategies, thus lack a general recipe to uniformly analyze the convergence behaviors of different gradient-based BLOs. In this work, we formulate BLOs from an optimistic bi-level viewpoint and establish a new gradient-based algorithmic framework, named Bi-level Descent Aggregation (BDA), to partially address the above issues. Specifically, BDA provides a modularized structure to hierarchically aggregate both the upper- and lower-level subproblems to generate our bi-level iterative dynamics. Theoretically, we establish a general convergence analysis template and derive a new proof recipe to investigate the essential theoretical properties of gradient-based BLO methods. Furthermore, this work systematically explores the convergence behavior of BDA in different optimization scenarios, i.e., considering various solution qualities (i.e., global/local/stationary solution) returned from solving approximation subproblems. Extensive experiments justify our theoretical results and demonstrate the superiority of the proposed algorithm for hyper-parameter optimization and meta-learning tasks. Source code is available at https://github.com/vis-opt-group/BDA.<br />18 pages, accepted for publication in IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
- Subjects :
- FOS: Computer and information sciences
Computer Science - Machine Learning
Computer Vision and Pattern Recognition (cs.CV)
Applied Mathematics
Computer Science - Computer Vision and Pattern Recognition
Dynamical Systems (math.DS)
Machine Learning (cs.LG)
Computational Theory and Mathematics
Optimization and Control (math.OC)
Artificial Intelligence
FOS: Mathematics
Computer Vision and Pattern Recognition
Mathematics - Dynamical Systems
Mathematics - Optimization and Control
Software
Subjects
Details
- ISSN :
- 19393539 and 01628828
- Volume :
- 45
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Pattern Analysis and Machine Intelligence
- Accession number :
- edsair.doi.dedup.....a84492b738b95d8d9dfdb677497a084a