Effect of the initial configuration of weights on the training and function of artificial neural networks

Authors :: Jesus, R. J.
Antunes, M. L.
da Costa, R. A.
Dorogovtsev, S. N.
Mendes, J. F. F.
Aguiar, R. L.
Source :: Mathematics 9, 2246 (2021)
Publication Year :: 2020
Abstract: The function and performance of neural networks are largely determined by the evolution of their weights and biases in the process of training, starting from the initial configuration of these parameters to one of the local minima of the loss function. We perform the quantitative statistical characterization of the deviation of the weights of two-hidden-layer ReLU networks of various sizes trained via Stochastic Gradient Descent (SGD) from their initial random configuration. We compare the evolution of the distribution function of this deviation with the evolution of the loss during training. We observed that successful training via SGD leaves the network in the close neighborhood of the initial configuration of its weights. For each initial weight of a link we measured the distribution function of the deviation from this value after training and found how the moments of this distribution and its peak depend on the initial weight. We explored the evolution of these deviations during training and observed an abrupt increase within the overfitting region. This jump occurs simultaneously with a similarly abrupt increase recorded in the evolution of the loss function. Our results suggest that SGD's ability to efficiently find local minima is restricted to the vicinity of the random initial configuration of weights.
Comment: 12 pages, 8 figures

Subjects :: Computer Science - Machine Learning

Details

Database :: arXiv
Journal :: Mathematics 9, 2246 (2021)
Publication Type :: Report
Accession number :: edsarx.2012.02550
Document Type :: Working Paper
Full Text :: https://doi.org/10.3390/math9182246
