Back to Search
Start Over
Linear Frequency Principle Model to Understand the Absence of Overfitting in Neural Networks
- Publication Year :
- 2021
-
Abstract
- Why heavily parameterized neural networks (NNs) do not overfit the data is an important long standing open question. We propose a phenomenological model of the NN training to explain this non-overfitting puzzle. Our linear frequency principle (LFP) model accounts for a key dynamical feature of NNs: they learn low frequencies first, irrespective of microscopic details. Theory based on our LFP model shows that low frequency dominance of target functions is the key condition for the non-overfitting of NNs and is verified by experiments. Furthermore, through an ideal two-layer NN, we unravel how detailed microscopic NN training dynamics statistically gives rise to a LFP model with quantitative prediction power.<br />Comment: to appear in Chinese Physics Letters
Details
- Database :
- arXiv
- Publication Type :
- Report
- Accession number :
- edsarx.2102.00200
- Document Type :
- Working Paper
- Full Text :
- https://doi.org/10.1088/0256-307X/38/3/038701