1. Conjugate-Gradient-like Based Adaptive Moment Estimation Optimization Algorithm for Deep Learning
- Author
- Tian, Jiawu; Xu, Liwei; Zhang, Xiaowei; Li, Yongqi
- Subjects
- Computer Science - Machine Learning, Computer Science - Artificial Intelligence, Computer Science - Computer Vision and Pattern Recognition, Mathematics - Optimization and Control
- Abstract
- Training deep neural networks is a challenging task. In order to speed up training and enhance the performance of deep neural networks, we rectify the vanilla conjugate gradient as conjugate-gradient-like and incorporate it into the generic Adam, and thus propose a new optimization algorithm named CG-like-Adam for deep learning. Specifically, both the first-order and the second-order moment estimation of generic Adam are replaced by the conjugate-gradient-like. Convergence analysis handles the cases where the exponential moving average coefficient of the first-order moment estimation is constant and the first-order moment estimation is unbiased. Numerical experiments show the superiority of the proposed algorithm based on the CIFAR10/100 datasets.
- Comment
- 32 pages, 13 figures
- Published
- 2024
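
The abstract describes feeding a conjugate-gradient-like direction, rather than the raw gradient, into both of Adam's moment estimates. The following is a minimal sketch of that idea, not the authors' exact formulation, since the abstract does not give the update rule. The Fletcher-Reeves coefficient, the `1/t` damping of the previous direction, the sign convention, and the names `init_state`, `cg_like_adam_step`, and `damping_power` are all assumptions made for illustration.

```python
import numpy as np


def init_state(theta):
    """Create the optimizer state (hypothetical helper, not from the paper)."""
    zeros = np.zeros_like(theta, dtype=float)
    return {"t": 0, "m": zeros.copy(), "v": zeros.copy(),
            "g_prev": zeros.copy(), "d_prev": zeros.copy()}


def cg_like_adam_step(theta, grad, state, lr=1e-3, beta1=0.9, beta2=0.999,
                      eps=1e-8, damping_power=1.0):
    """One Adam-style step driven by a conjugate-gradient-like direction.

    Assumed form: d_t = g_t + (gamma_t / t**damping_power) * d_{t-1},
    with gamma_t the Fletcher-Reeves ratio. The paper may differ.
    """
    t = state["t"] + 1
    if t == 1:
        # No previous direction yet: fall back to the plain gradient.
        d = np.array(grad, dtype=float)
    else:
        g_prev = state["g_prev"]
        # Fletcher-Reeves coefficient (assumption); eps guards against 0/0.
        gamma = np.sum(grad * grad) / (np.sum(g_prev * g_prev) + eps)
        d = grad + (gamma / t ** damping_power) * state["d_prev"]

    # Both moment estimates use d in place of the raw gradient.
    m = beta1 * state["m"] + (1 - beta1) * d
    v = beta2 * state["v"] + (1 - beta2) * d * d

    # Standard Adam bias correction.
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)

    new_theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    new_state = {"t": t, "m": m, "v": v,
                 "g_prev": np.array(grad, dtype=float), "d_prev": d}
    return new_theta, new_state
```

A caller would create the state once with `init_state(theta)` and then pass the returned state back into `cg_like_adam_step` on every iteration, alongside the current gradient. With the damping term the conjugate correction fades as `t` grows, so under these assumptions the step approaches a plain Adam update late in training.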