Descriptor: "Stochastic Gradient Estimation" / Journal: informs journal on computing - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Stochastic Gradient Estimation"' showing total 3 results

Start Over Descriptor "Stochastic Gradient Estimation" Journal informs journal on computing

3 results on '"Stochastic Gradient Estimation"'

1. A New Likelihood Ratio Method for Training Artificial Neural Networks.

Author: Peng, Yijie, Xiao, Li, Heidergott, Bernd, Hong, L. Jeff, and Lam, Henry
Subjects: *ARTIFICIAL neural networks, *DEEP learning, *ARTIFICIAL intelligence, *MACHINE learning, *COMPUTATIONAL complexity
Abstract: We investigate a new approach to compute the gradients of artificial neural networks (ANNs), based on the so-called push-out likelihood ratio method. Unlike the widely used backpropagation (BP) method that requires continuity of the loss function and the activation function, our approach bypasses this requirement by injecting artificial noises into the signals passed along the neurons. We show how this approach has a similar computational complexity as BP, and moreover is more advantageous in terms of removing the backward recursion and eliciting transparent formulas. We also formalize the connection between BP, a pivotal technique for training ANNs, and infinitesimal perturbation analysis, a classic path-wise derivative estimation approach, so that both our new proposed methods and BP can be better understood in the context of stochastic gradient estimation. Our approach allows efficient training for ANNs with more flexibility on the loss and activation functions, and shows empirical improvements on the robustness of ANNs under adversarial attacks and corruptions of natural noises. Summary of Contribution: Stochastic gradient estimation has been studied actively in simulation for decades and becomes more important in the era of machine learning and artificial intelligence. The stochastic gradient descent is a standard technique for training the artificial neural networks (ANNs), a pivotal problem in deep learning. The most popular stochastic gradient estimation technique is the backpropagation method. We find that the backpropagation method lies in the family of infinitesimal perturbation analysis, a path-wise gradient estimation technique in simulation. Moreover, we develop a new likelihood ratio-based method, another popular family of gradient estimation technique in simulation, for training more general ANNs, and demonstrate that the new training method can improve the robustness of the ANN. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

2. A New Likelihood Ratio Method for Training Artificial Neural Networks

Author: Li Xiao, Bernd Heidergott, Henry Lam, Yijie Peng, L. Jeff Hong, Operations Analytics, Amsterdam Business Research Institute, and Tinbergen Institute
Subjects: image identification, SDG 16 - Peace, Artificial neural network, business.industry, Computer science, Likelihood ratio method, SDG 16 - Peace, Justice and Strong Institutions, Computer Science::Neural and Evolutionary Computation, General Engineering, Training (meteorology), Pattern recognition, Backpropagation, Justice and Strong Institutions, Image identification, stochastic gradient estimation, Artificial intelligence, business, artificial neural network
Abstract: We investigate a new approach to compute the gradients of artificial neural networks (ANNs), based on the so-called push-out likelihood ratio method. Unlike the widely used backpropagation (BP) method that requires continuity of the loss function and the activation function, our approach bypasses this requirement by injecting artificial noises into the signals passed along the neurons. We show how this approach has a similar computational complexity as BP, and moreover is more advantageous in terms of removing the backward recursion and eliciting transparent formulas. We also formalize the connection between BP, a pivotal technique for training ANNs, and infinitesimal perturbation analysis, a classic path-wise derivative estimation approach, so that both our new proposed methods and BP can be better understood in the context of stochastic gradient estimation. Our approach allows efficient training for ANNs with more flexibility on the loss and activation functions, and shows empirical improvements on the robustness of ANNs under adversarial attacks and corruptions of natural noises. Summary of Contribution: Stochastic gradient estimation has been studied actively in simulation for decades and becomes more important in the era of machine learning and artificial intelligence. The stochastic gradient descent is a standard technique for training the artificial neural networks (ANNs), a pivotal problem in deep learning. The most popular stochastic gradient estimation technique is the backpropagation method. We find that the backpropagation method lies in the family of infinitesimal perturbation analysis, a path-wise gradient estimation technique in simulation. Moreover, we develop a new likelihood ratio-based method, another popular family of gradient estimation technique in simulation, for training more general ANNs, and demonstrate that the new training method can improve the robustness of the ANN.
Published: 2022

3. Regression Models Augmented with Direct Stochastic Gradient Estimators.

Author: Fu, Michael C. and Huashuai Qu
Subjects: *STOCHASTIC processes, *REGRESSION analysis, *MONTE Carlo method, *DERIVATIVES (Mathematics), *INFORMATION theory, *MAXIMUM likelihood statistics, *PARAMETER estimation
Abstract: Traditional regression assumes that the only data available are measurements of the value of the dependent variable for each combination of values for the independent variable. However, in many settings in stochastic (Monte Carlo) simulation, directly estimated derivative information is also available via techniques such as perturbation analysis or the likelihood ratio method. In this paper, we investigate potential modeling improvements that can be achieved by exploiting this additional gradient information in the regression setting. Using least squares and maximum likelihood estimation, we propose various direct gradient augmented regression (DiGAR) models that incorporate direct gradient estimators, starting with a one-dimensional independent variable and then extending to multidimensional input. For some special settings, we are able to characterize the variance of the estimated parameters in DiGAR and compare them analytically with the standard regression model. For a more typical stochastic simulation setting, we investigate the potential effectiveness of the augmented model by comparing it with standard regression in fitting a functional relationship for a simple queueing model, including both one-dimensional and four-dimensional examples. The preliminary empirical results are quite encouraging, as they indicate how DiGAR can capture trends that the standard model would miss. Even in queueing examples where there is a high correlation between the output and the gradient estimators, the basic DiGAR model that does not explicitly account for these correlations performs significantly better than the standard regression model. [ABSTRACT FROM AUTHOR]
Published: 2014
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

3 results on '"Stochastic Gradient Estimation"'

1. A New Likelihood Ratio Method for Training Artificial Neural Networks.

2. A New Likelihood Ratio Method for Training Artificial Neural Networks

3. Regression Models Augmented with Direct Stochastic Gradient Estimators.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

Publisher

3 results on '"Stochastic Gradient Estimation"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources