11 results for "Stochastic Gradient Estimation"
Search Results
2. Learning to refine source representations for neural machine translation.
- Author
-
Geng, Xinwei, Wang, Longyue, Wang, Xing, Yang, Mingtao, Feng, Xiaocheng, Qin, Bing, and Tu, Zhaopeng
- Abstract
Machine translation is one of the most classic applications of artificial intelligence and natural language processing. Neural machine translation models generally adopt an encoder–decoder architecture to model the entire translation process. However, without target-side context (e.g., the decoding state) to guide the encoding, the encoded source representations struggle to emphasize the information that matters for predicting a given target word, which weakens their ability to generate discriminative attentive representations across decoding steps. To tackle this issue, we propose a novel encoder–refiner–decoder framework, which dynamically refines the source representations based on the generated target-side information at each decoding step. Since the refining operations are time-consuming, we propose a policy network to decide when to refine at specific decoding steps. We solve this problem using the Gumbel-Softmax reparameterization, which makes our network differentiable and trainable through standard stochastic gradient methods. Experimental results on both Chinese–English and English–German translation tasks show that the proposed approach significantly and consistently improves translation performance over the standard encoder–decoder framework. Furthermore, when the refining strategy is applied selectively, experimental results still show a reasonable improvement over the baseline with a much smaller decrease in decoding speed. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
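The Gumbel-Softmax reparameterization named in this abstract can be sketched independently of the paper's model. The following is a generic illustration of the trick itself; the two-way refine-vs-skip probabilities and the temperature value are illustrative assumptions, not the authors' settings:

```python
import numpy as np

def gumbel_softmax(logits, tau=1.0, rng=None):
    """Draw a differentiable (relaxed) sample from a categorical distribution.
    As tau -> 0 the sample approaches a one-hot vector; larger tau smooths it."""
    rng = np.random.default_rng() if rng is None else rng
    u = rng.uniform(1e-10, 1.0, size=logits.shape)
    g = -np.log(-np.log(u))        # Gumbel(0, 1) noise
    z = (logits + g) / tau
    z = z - z.max()                # numerical stability before exponentiating
    e = np.exp(z)
    return e / e.sum()

# Relaxed "refine vs. skip" decision for one decoding step (probabilities assumed)
sample = gumbel_softmax(np.log(np.array([0.7, 0.3])), tau=0.5)
```

Because the noise enters additively and the softmax is smooth, gradients can flow through the sampling step, which is what makes a discrete refine/skip policy trainable by ordinary stochastic gradient methods.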
3. A New Likelihood Ratio Method for Training Artificial Neural Networks.
- Author
-
Peng, Yijie, Xiao, Li, Heidergott, Bernd, Hong, L. Jeff, and Lam, Henry
- Subjects
- ARTIFICIAL neural networks, DEEP learning, ARTIFICIAL intelligence, MACHINE learning, COMPUTATIONAL complexity
- Abstract
We investigate a new approach to computing the gradients of artificial neural networks (ANNs), based on the so-called push-out likelihood ratio method. Unlike the widely used backpropagation (BP) method, which requires continuity of the loss function and the activation function, our approach bypasses this requirement by injecting artificial noise into the signals passed along the neurons. We show that this approach has computational complexity similar to BP, and moreover is more advantageous in that it removes the backward recursion and yields transparent formulas. We also formalize the connection between BP, a pivotal technique for training ANNs, and infinitesimal perturbation analysis, a classic path-wise derivative estimation approach, so that both our newly proposed methods and BP can be better understood in the context of stochastic gradient estimation. Our approach allows efficient training of ANNs with more flexibility in the loss and activation functions, and shows empirical improvements in the robustness of ANNs under adversarial attacks and corruption by natural noise. Summary of Contribution: Stochastic gradient estimation has been studied actively in simulation for decades and has become more important in the era of machine learning and artificial intelligence. Stochastic gradient descent is a standard technique for training artificial neural networks (ANNs), a pivotal problem in deep learning. The most popular stochastic gradient estimation technique is the backpropagation method. We show that the backpropagation method lies in the family of infinitesimal perturbation analysis, a path-wise gradient estimation technique from simulation. Moreover, we develop a new likelihood ratio-based method, from another popular family of gradient estimation techniques in simulation, for training more general ANNs, and demonstrate that the new training method can improve the robustness of the ANN. [ABSTRACT FROM AUTHOR]
- Published
- 2022
- Full Text
- View/download PDF
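The likelihood ratio (score-function) family that this paper extends can be illustrated on a toy problem. This is a generic sketch of the classical estimator, not the paper's push-out construction for ANNs; the Gaussian model and the test function are illustrative assumptions:

```python
import numpy as np

def lr_gradient(f, theta, n=100_000, rng=None):
    """Score-function (likelihood ratio) estimate of d/dtheta E[f(X)] where
    X ~ Normal(theta, 1).  The score is d log p(x; theta)/dtheta = x - theta,
    so the estimator averages f(x) * (x - theta) over i.i.d. samples."""
    rng = np.random.default_rng(0) if rng is None else rng
    x = rng.normal(theta, 1.0, size=n)
    return np.mean(f(x) * (x - theta))

# E[X^2] = theta^2 + 1, so the true gradient at theta = 1 is 2*theta = 2
est = lr_gradient(lambda x: x ** 2, theta=1.0)
```

Note that `f` is never differentiated, which is exactly why likelihood ratio methods tolerate discontinuous losses and activations where backpropagation does not.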
4. Solving Bayesian risk optimization via nested stochastic gradient estimation.
- Author
-
Cakmak, Sait, Wu, Di, and Zhou, Enlu
- Subjects
- STOCHASTIC approximation, PROBLEM solving, APPROXIMATION algorithms, VALUE at risk, ALGORITHMS
- Abstract
In this article, we aim to solve Bayesian Risk Optimization (BRO), which is a recently proposed framework that formulates simulation optimization under input uncertainty. In order to efficiently solve the BRO problem, we derive nested stochastic gradient estimators and propose corresponding stochastic approximation algorithms. We show that our gradient estimators are asymptotically unbiased and consistent, and that the algorithms converge asymptotically. We demonstrate the empirical performance of the algorithms on a two-sided market model. Our estimators are of independent interest in extending the literature of stochastic gradient estimation to the case of nested risk measures. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
5. Robust integral sliding mode controller for optimisation of measurable cost functions with constraints.
- Author
-
Solis, C. U., Clempner, J. B., and Poznyak, A. S.
- Subjects
- SLIDING mode control, UNCERTAIN systems, INTEGRALS, CONVEX functions, DYNAMICAL systems, CONSTRAINED optimization
- Abstract
This paper proposes an online constrained extremum-seeking approach for an unknown convex function with unknown constraints, within a class of uncertain dynamical systems whose available output is disturbed by stochastic noise. It is assumed that the objective function and the constraints are available for measurement. The main contributions of the paper are the formulation of the problem using the penalty method and the development of an extremum-seeking algorithm, based on a modified synchronous detection method, for computing a stochastic gradient descent procedure. To reject the undesirable uncertainties and perturbations of the dynamic plant from the beginning of the process, we employ standard deterministic Integral Sliding Mode Control, transforming the initial dynamic plant into a static one. Then we apply the gradient descent technique. We consider time-varying parameters of the suggested procedure to compensate for the unknown dynamics. To validate the exposition, we present a numerical simulation example. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
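The penalty-method-plus-stochastic-gradient recipe can be sketched in a much simplified form. This is not the paper's sliding-mode controller or synchronous detection scheme; the cost, constraint, noise level, and step sizes below are illustrative assumptions:

```python
import numpy as np

def penalized_descent(x0, rho=10.0, steps=4000, c=0.05, rng=None):
    """Minimize f(x) = (x - 3)^2 subject to x <= 2 via the penalty method:
    descend on F(x) = f(x) + rho * max(0, x - 2)^2 using only noisy
    measurements of F, a two-point finite-difference gradient, and
    diminishing steps 0.5/k."""
    rng = np.random.default_rng(4) if rng is None else rng
    F = lambda t: (t - 3.0) ** 2 + rho * max(0.0, t - 2.0) ** 2 + 0.01 * rng.normal()
    x = x0
    for k in range(1, steps + 1):
        g = (F(x + c) - F(x - c)) / (2 * c)   # noisy finite-difference gradient
        x -= (0.5 / k) * g
    return x

# The penalized optimum is (3 + 2*rho) / (1 + rho) = 23/11, slightly above 2;
# larger rho pushes it closer to the constraint boundary x = 2.
x_con = penalized_descent(0.0)
```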
6. Stochastic mutual information gradient estimation for dimensionality reduction networks.
- Author
-
Özdenizci, Ozan and Erdoğmuş, Deniz
- Subjects
- FEATURE selection, MACHINE learning, NETWORK analysis (Planning), MENTAL representation
- Abstract
Feature ranking and selection is a widely used approach in various applications of supervised dimensionality reduction in discriminative machine learning. Nevertheless, there is significant evidence that feature ranking and selection algorithms based on any single criterion can lead to sub-optimal solutions for class separability. In that regard, we introduce emerging information-theoretic feature transformation protocols as an end-to-end neural network training approach. We present a dimensionality reduction network (MMINet) training procedure based on a stochastic estimate of the mutual information gradient. The network projects high-dimensional features onto an output feature space where lower-dimensional representations of the features carry maximum mutual information with their associated class labels. Furthermore, we formulate the training objective to be estimated non-parametrically, with no distributional assumptions. We experimentally evaluate our method on high-dimensional biological data sets, and relate it to conventional feature selection algorithms, which form a special case of our approach. [ABSTRACT FROM AUTHOR]
- Published
- 2021
- Full Text
- View/download PDF
7. A New Likelihood Ratio Method for Training Artificial Neural Networks
- Author
-
Li Xiao, Bernd Heidergott, Henry Lam, Yijie Peng, and L. Jeff Hong (Operations Analytics, Amsterdam Business Research Institute, Tinbergen Institute)
- Subjects
Artificial neural network, Likelihood ratio method, Backpropagation, Image identification, Stochastic gradient estimation, Pattern recognition, Artificial intelligence, General Engineering - Abstract
We investigate a new approach to computing the gradients of artificial neural networks (ANNs), based on the so-called push-out likelihood ratio method. Unlike the widely used backpropagation (BP) method, which requires continuity of the loss function and the activation function, our approach bypasses this requirement by injecting artificial noise into the signals passed along the neurons. We show that this approach has computational complexity similar to BP, and moreover is more advantageous in that it removes the backward recursion and yields transparent formulas. We also formalize the connection between BP, a pivotal technique for training ANNs, and infinitesimal perturbation analysis, a classic path-wise derivative estimation approach, so that both our newly proposed methods and BP can be better understood in the context of stochastic gradient estimation. Our approach allows efficient training of ANNs with more flexibility in the loss and activation functions, and shows empirical improvements in the robustness of ANNs under adversarial attacks and corruption by natural noise. Summary of Contribution: Stochastic gradient estimation has been studied actively in simulation for decades and has become more important in the era of machine learning and artificial intelligence. Stochastic gradient descent is a standard technique for training artificial neural networks (ANNs), a pivotal problem in deep learning. The most popular stochastic gradient estimation technique is the backpropagation method. We show that the backpropagation method lies in the family of infinitesimal perturbation analysis, a path-wise gradient estimation technique from simulation. Moreover, we develop a new likelihood ratio-based method, from another popular family of gradient estimation techniques in simulation, for training more general ANNs, and demonstrate that the new training method can improve the robustness of the ANN.
- Published
- 2022
8. Regression Models Augmented with Direct Stochastic Gradient Estimators.
- Author
-
Fu, Michael C. and Qu, Huashuai
- Subjects
- STOCHASTIC processes, REGRESSION analysis, MONTE Carlo method, DERIVATIVES (Mathematics), INFORMATION theory, MAXIMUM likelihood statistics, PARAMETER estimation
- Abstract
Traditional regression assumes that the only data available are measurements of the value of the dependent variable for each combination of values for the independent variable. However, in many settings in stochastic (Monte Carlo) simulation, directly estimated derivative information is also available via techniques such as perturbation analysis or the likelihood ratio method. In this paper, we investigate potential modeling improvements that can be achieved by exploiting this additional gradient information in the regression setting. Using least squares and maximum likelihood estimation, we propose various direct gradient augmented regression (DiGAR) models that incorporate direct gradient estimators, starting with a one-dimensional independent variable and then extending to multidimensional input. For some special settings, we are able to characterize the variance of the estimated parameters in DiGAR and compare them analytically with the standard regression model. For a more typical stochastic simulation setting, we investigate the potential effectiveness of the augmented model by comparing it with standard regression in fitting a functional relationship for a simple queueing model, including both one-dimensional and four-dimensional examples. The preliminary empirical results are quite encouraging, as they indicate how DiGAR can capture trends that the standard model would miss. Even in queueing examples where there is a high correlation between the output and the gradient estimators, the basic DiGAR model that does not explicitly account for these correlations performs significantly better than the standard regression model. [ABSTRACT FROM AUTHOR]
- Published
- 2014
- Full Text
- View/download PDF
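The idea of augmenting regression with direct gradient estimates admits a compact closed-form sketch. The following is a hypothetical minimal variant for a one-dimensional linear model, not the paper's DiGAR estimators; the penalty weight `lam` and the synthetic data are illustrative assumptions:

```python
import numpy as np

def gradient_augmented_fit(x, y, g, lam=1.0):
    """Least-squares fit of y ~ a + b*x that also uses direct gradient
    observations g (estimates of dy/dx, which equal b for a linear model).
    Minimizes sum (y - a - b*x)^2 + lam * sum (g - b)^2 in closed form via
    the normal equations:
        n*a       + sum(x)*b              = sum(y)
        sum(x)*a  + (sum(x^2) + lam*n)*b  = sum(x*y) + lam*sum(g)"""
    n = len(x)
    A = np.array([[n, x.sum()],
                  [x.sum(), (x ** 2).sum() + lam * n]])
    rhs = np.array([y.sum(), (x * y).sum() + lam * g.sum()])
    return np.linalg.solve(A, rhs)       # returns (a_hat, b_hat)

rng = np.random.default_rng(2)
x = np.linspace(0.0, 1.0, 50)
y = 1.0 + 2.0 * x + 0.1 * rng.normal(size=50)   # noisy value observations
g = 2.0 + 0.1 * rng.normal(size=50)             # noisy direct gradient observations
a_hat, b_hat = gradient_augmented_fit(x, y, g)
```

The gradient term acts like extra observations of the slope, so the slope estimate is pinned down far more tightly than value measurements alone would allow, which is the effect the abstract reports.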
9. Stochastic Simulation: New Stochastic Approximation Methods and Sensitivity Analyses
- Author
-
Chau, Marie
- Subjects
Sensitivity Analysis, Stochastic Approximation, Stochastic Gradient Estimation, Monte Carlo Simulation, Operations research, Applied mathematics, Simulation Optimization - Abstract
In this dissertation, we propose two new types of stochastic approximation (SA) methods and study the sensitivity of SA and of a stochastic gradient method to various input parameters. First, we summarize the most common stochastic gradient estimation techniques, both direct and indirect, as well as the two classical SA algorithms, Robbins-Monro (RM) and Kiefer-Wolfowitz (KW), followed by some well-known modifications to the step size, output, gradient, and projection operator. Second, we introduce two new stochastic gradient methods in SA for univariate and multivariate stochastic optimization problems. Under a setting where both direct and indirect gradients are available, our new SA algorithms estimate the gradient using a hybrid estimator, which is a convex combination of a symmetric finite-difference-type gradient estimate and an average of two associated direct gradient estimates. We derive variance-minimizing weights that lead to desirable theoretical properties and prove convergence of the SA algorithms. Next, we study the finite-time performance of the KW algorithm and its sensitivity to the step size parameter, along with two of its adaptive variants, namely Kesten's rule and scale-and-shifted KW (SSKW). We conduct a sensitivity analysis of KW and explore the tightness of a mean-squared error (MSE) bound for quadratic functions, a relevant issue for determining how long to run an SA algorithm. Then, we propose two new adaptive step size sequences inspired by both Kesten's rule and SSKW, which address some of their weaknesses. Instead of using one step size sequence, our adaptive step size is based on two deterministic sequences, and the step size used in the current iteration depends on the perceived proximity of the current iterate to the optimum. In addition, we introduce a method to adaptively adjust the two deterministic sequences.
Lastly, we investigate the performance of a modified pathwise gradient estimation method that is applied to financial options with discontinuous payoffs, and in particular, used to estimate the Greeks, which measure the rate of change of (financial) derivative prices with respect to underlying market parameters and are central to financial risk management. The newly proposed kernel estimator relies on a smoothing bandwidth parameter. We explore the accuracy of the Greeks with varying bandwidths and investigate the sensitivity of a proposed iterative scheme that generates an estimate of the optimal bandwidth.
- Published
- 2015
- Full Text
- View/download PDF
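The classical Kiefer-Wolfowitz scheme that the dissertation builds on can be sketched as follows. This is a textbook version with illustrative step-size and perturbation-width sequences, not the dissertation's hybrid or adaptive variants:

```python
import numpy as np

def kiefer_wolfowitz(f, x0, steps=500, rng=None):
    """Kiefer-Wolfowitz stochastic approximation: minimize E[f(x, noise)]
    using symmetric finite differences of noisy function evaluations, with
    diminishing step sizes a_k = 1/k and perturbation widths c_k = k**(-1/4)
    (illustrative choices satisfying the usual KW conditions)."""
    rng = np.random.default_rng(3) if rng is None else rng
    x = x0
    for k in range(1, steps + 1):
        a_k, c_k = 1.0 / k, k ** -0.25
        g = (f(x + c_k, rng) - f(x - c_k, rng)) / (2 * c_k)
        x -= a_k * g
    return x

# Noisy quadratic with minimum at x = 5
noisy = lambda x, rng: (x - 5.0) ** 2 + 0.1 * rng.normal()
x_min = kiefer_wolfowitz(noisy, x0=0.0)
```

The interaction visible here (large early steps, widths shrinking slowly enough to control gradient noise) is precisely the step-size sensitivity the dissertation analyzes.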
10. Optimal Policies Search for Sensor Management
- Author
-
Bréhard, Thomas, Duflos, Emmanuel, Vanheeghe, Philippe, and Coquelin, Pierre-Arnaud (Sequential Learning (SEQUEL), Inria Lille - Nord Europe; Laboratoire d'Informatique Fondamentale de Lille (LIFL); Laboratoire d'Automatique, Génie Informatique et Signal (LAGIS), Centrale Lille, CNRS)
- Subjects
FOS: Computer and information sciences, Probabilistic algorithms (including Monte Carlo), Stochastic Gradient Estimation, Partially Observable Markov Decision Process, Machine Learning, Signal and Image Processing, AESA Radar, Sensor(s) Management
This paper introduces a new approach to solving sensor management problems. Classically, sensor management problems can be formalized as Partially Observable Markov Decision Processes (POMDPs). The original approach developed here consists of deriving the optimal parameterized policy based on a stochastic gradient estimate. We assume in this work that it is possible to learn the optimal policy off-line (in simulation) using models of the environment and of the sensor(s). The learned policy can then be used to manage the sensor(s). In order to approximate the gradient in a stochastic context, we introduce a new method based on Infinitesimal Perturbation Analysis (IPA). The effectiveness of this general framework is illustrated by the management of an Electronically Scanned Array radar. First simulation results are presented.
- Published
- 2008
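The IPA (pathwise) estimator underlying this policy-gradient approach can be illustrated on a toy expectation. This is a generic sketch of pathwise differentiation, not the paper's POMDP policy search; the quadratic sample function is an illustrative assumption:

```python
import numpy as np

def ipa_gradient(theta, n=100_000, rng=None):
    """Infinitesimal perturbation analysis (pathwise) estimate of
    d/dtheta E[(theta + Z)^2] with Z ~ N(0, 1): differentiate inside the
    expectation along each sample path, then average 2*(theta + z)."""
    rng = np.random.default_rng(5) if rng is None else rng
    z = rng.normal(size=n)
    return np.mean(2.0 * (theta + z))

# True value: d/dtheta (theta^2 + 1) = 2*theta, i.e. 2 at theta = 1
est = ipa_gradient(1.0)
```

Unlike the likelihood ratio approach, IPA differentiates the sample path itself, which typically gives low-variance estimates but requires the path to be differentiable in the parameter.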
11. Optimal policies search for sensor management : Application to the ESA radar
- Author
-
Bréhard, Thomas, Coquelin, Pierre-Arnaud, Duflos, Emmanuel, and Vanheeghe, Philippe (Sequential Learning (SEQUEL), Inria Lille - Nord Europe; Laboratoire d'Informatique Fondamentale de Lille (LIFL); Laboratoire d'Automatique, Génie Informatique et Signal (LAGIS), Centrale Lille, CNRS)
- Subjects
[INFO.INFO-AU]Computer Science [cs]/Automatic Control Engineering ,AESA Radar ,Stochastic Gradient Estimation ,Sensor(s) Management ,Partially Observable Markov Decision Process - Abstract
This paper introduces a new approach to solving sensor management problems. Classically, sensor management problems can be formalized as partially observable Markov decision processes (POMDPs). The original approach developed here consists of deriving the optimal parameterized policy based on a stochastic gradient estimate. We assume in this work that it is possible to learn the optimal policy off-line (in simulation) using models of the environment and of the sensor(s). The learned policy can then be used to manage the sensor(s). In order to approximate the gradient in a stochastic context, we introduce a new method based on infinitesimal perturbation analysis (IPA). The effectiveness of this general framework is illustrated by the management of an electronically scanned array radar. First simulation results are presented.
- Published
- 2008
Discovery Service for Jio Institute Digital Library