Back to Search
Start Over
On the convergence of a stochastic approximation method for structured bi-level optimization
- Publication Year :
- 2018
- Publisher :
- HAL CCSD, 2018.
-
Abstract
- We analyze the convergence of stochastic gradient methods for well structured bi-level optimization problems. We address two specific cases: first when the outer objective function can be expressed as a finite sum of independent terms, and next when both the outer and inner objective functions can be expressed as finite sums of independent terms. We assume Lipschitz continuity and differentiability of both objectives as well as convexity of the inner objective and consider diminishing steps sizes. We show that, under these conditions and some other assumptions on the implicit function and the variance of the gradient errors, both methods converge in expectation to a stationary point of the problem if gradient approximations are chosen so as to satisfy a sufficient decrease condition. We also discuss the satisfaction of our assumptions in machine learning problems where these methods can be nicely applied to automatically tune hyperparameters when the loss functions are very large sums of error terms.
- Subjects :
- [ MATH ] Mathematics [math]
[ MATH.MATH-OC ] Mathematics [math]/Optimization and Control [math.OC]
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
[MATH.MATH-OC]Mathematics [math]/Optimization and Control [math.OC]
[ INFO.INFO-LG ] Computer Science [cs]/Machine Learning [cs.LG]
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Accession number :
- edsair.dedup.wf.001..994281fb37862094cff9a61b33e49778