Structure and Gradient Dynamics Near Global Minima of Two-layer Neural Networks

Authors :
Zhang, Leyang
Zhang, Yaoyu
Luo, Tao
Publication Year :
2023

Abstract

Under mild assumptions, we investigate the structure of the loss landscape of two-layer neural networks near global minima, determine the set of parameters that give perfect generalization, and fully characterize the gradient flows around this set. With novel techniques, our work uncovers some simple aspects of the complicated loss landscape and reveals how the model, target function, samples, and initialization affect the training dynamics differently. Based on these results, we also explain why (overparametrized) neural networks can generalize well.

Details

Database :
OAIster
Publication Type :
Electronic Resource
Accession number :
edsoai.on1438477187
Document Type :
Electronic Resource