Author: "Meyn, Sean P." / Topic: mathematical model - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"Meyn, Sean P."' showing total 10 results

Start Over Author "Meyn, Sean P." Topic mathematical model

10 results on '"Meyn, Sean P."'

1. Differential Temporal Difference Learning.

Author: Devraj, Adithya M., Kontoyiannis, Ioannis, and Meyn, Sean P.
Subjects: MACHINE learning, MARKOV processes, STOCHASTIC control theory, CENTRAL limit theorem, KEY performance indicators (Management), REINFORCEMENT learning
Abstract: Value functions derived from Markov decision processes arise as a central component of algorithms as well as performance metrics in many statistics and engineering applications of machine learning. Computation of the solution to the associated Bellman equations is challenging in most practical cases of interest. A popular class of approximation techniques, known as temporal difference (TD) learning algorithms, are an important subclass of general reinforcement learning methods. The algorithms introduced in this article are intended to resolve two well-known issues with TD-learning algorithms. Their slow convergence due to very high central limit theorem variance, and the fact that, for the problem of computing the relative value function, consistent algorithms exist only in special cases. First we show that the gradients of these value functions admit a representation that lends itself to algorithm design. Based on this result, a new class of differential TD-learning algorithms is introduced. For Markovian models on Euclidean space with smooth dynamics, the algorithms are shown to be consistent under general conditions. Numerical results show dramatic variance reduction in comparison to standard methods. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

2. State Estimation for the Individual and the Population in Mean Field Control With Application to Demand Dispatch.

Author: Chen, Yue, Busic, Ana, and Meyn, Sean P.
Subjects: ESTIMATION theory, FINITE element method, ALGORITHM software, QUALITY of service, ELECTRIC power distribution grids
Abstract: This paper concerns state estimation problems in a mean field control setting. In a finite population model, the goal is to estimate the joint distribution of the population state and the state of a typical individual. The observation equations are a noisy measurement of the population. The general results are applied to demand dispatch for regulation of the power grid, based on randomized local control algorithms. In prior work by the authors it is shown that local control can be designed so that the aggregate of loads behaves as a controllable resource, with accuracy matching or exceeding traditional sources of frequency regulation. The operational cost is nearly zero in many cases. The information exchange between grid and load is minimal, but it is assumed in the overall control architecture that the aggregate power consumption of loads is available to the grid operator. It is shown that the Kalman filter can be constructed to reduce these communication requirements, and to provide the grid operator with accurate estimates of the mean and variance of quality of service (QoS) for an individual load. [ABSTRACT FROM AUTHOR]
Published: 2017
Full Text: View/download PDF

3. Multivariable feedback particle filter.

Author: Yang, Tao, Laugesen, Richard S., Mehta, Prashant G., and Meyn, Sean P.
Abstract: In recent work it is shown that importance sampling can be avoided in the particle filter through an innovation structure inspired by traditional nonlinear filtering combined with Mean-Field Game formalisms [9], [19]. The resulting feedback particle filter (FPF) offers significant variance improvements; in particular, the algorithm can be applied to systems that are not stable. The filter comes with an up-front computational cost to obtain the filter gain. This paper describes new representations and algorithms to compute the gain in the general multivariable setting. The main contributions are, (i) Theory surrounding the FPF is improved: Consistency is established in the multivariate setting, as well as well-posedness of the associated PDE to obtain the filter gain. (ii) The gain can be expressed as the gradient of a function, which is precisely the solution to Poisson's equation for a related MCMC diffusion (the Smoluchowski equation). This provides a bridge to MCMC as well as to approximate optimal filtering approaches such as TD-learning, which can in turn be used to approximate the gain. (iii) Motivated by a weak formulation of Poisson's equation, a Galerkin finite-element algorithm is proposed for approximation of the gain. Its performance is illustrated in numerical experiments. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

4. Bifurcation analysis of a heterogeneous mean-field oscillator game model.

Author: Yin, Huibing, Mehta, Prashant G., Meyn, Sean P., and Shanbhag, Uday V.
Abstract: This paper studies the phase transition in a heterogeneous mean-field oscillator game model using methods from bifurcation theory. In our earlier paper [1], we had obtained a coupled PDE model using mean-field approximation and described linear analysis of the PDEs that suggested possibility of a Hamiltonian Hopf bifurcation. In this paper, we simplify the analysis somewhat by relating the solutions of the PDE model to the solutions of a certain nonlinear eigenvalue problem. Both analysis and computations are much easier for the nonlinear eigenvalue problem. Apart from the bifurcation analysis that shows existence of a phase transition, we also describe a Lyapunov-Schmidt perturbation method to obtain asymptotic formulae for the small amplitude bifurcated solutions. For comparison, we also depict numerical solutions that are obtained using the continuation software AUTO. [ABSTRACT FROM PUBLISHER]
Published: 2011
Full Text: View/download PDF

5. Feedback particle filter with mean-field coupling.

Author: Yang, Tao, Mehta, Prashant G., and Meyn, Sean P.
Abstract: A new formulation of the particle filter for nonlinear filtering is presented, based on concepts from optimal control, and from the mean-field game theory. The optimal control is chosen so that the posterior distribution of a particle matches as closely as possible the posterior distribution of the true state given the observations: This is achieved by introducing a cost function, defined by the Kullback-Leibler (K-L) divergence between the actual posterior, and the posterior of any particle. [ABSTRACT FROM PUBLISHER]
Published: 2011
Full Text: View/download PDF

6. TD-learning with exploration.

Author: Meyn, Sean P. and Surana, Amit
Abstract: We introduce exploration in the TD-learning algorithm to approximate the value function for a given policy. In this way we can modify the norm used for approximation, “zooming in” to a region of interest in the state space. We also provide extensions to SARSA to eliminate the need for numerical integration in policy improvement. Construction of the algorithm and its analysis build on recent general results concerning the spectral theory of Markov chains and positive operators. [ABSTRACT FROM PUBLISHER]
Published: 2011
Full Text: View/download PDF

7. Feedback Particle Filter for a Continuous-Time Markov Chain.

Author: Yang, Tao, Mehta, Prashant G., and Meyn, Sean P.
Subjects: MARKOV processes, ESTIMATION theory, INFORMATION filtering, STOCHASTIC systems, ALGORITHMS
Abstract: This technical note extends the feedback particle filter (FPF) methodology and algorithms to the problem of filtering a continuous-time Markov chain. The main contribution is the development of a feedback control-based transformation of the Wonham filter, where the control input is realized via a time-modulated Poisson counter process. A complete characterization of the feedback mechanism that defines the FPF is obtained, which leads to tractable algorithms for the nonlinear filtering problem even for large state spaces. Numerical examples are introduced to help illustrate the application of these techniques. [ABSTRACT FROM PUBLISHER]
Published: 2016
Full Text: View/download PDF

8. Ancillary Service to the Grid Using Intelligent Deferrable Loads.

Author: Meyn, Sean P., Barooah, Prabir, Busic, Ana, Chen, Yue, and Ehren, Jordan
Subjects: *LINEAR time invariant systems, *MARKOV processes, *APPROXIMATION theory, *NONLINEAR statistical models, *COMPUTER architecture
Abstract: Renewable energy sources such as wind and solar power have a high degree of unpredictability and time-variation, which makes balancing demand and supply challenging. One possible way to address this challenge is to harness the inherent flexibility in demand of many types of loads. Introduced in this paper is a technique for decentralized control for automated demand response that can be used by grid operators as ancillary service for maintaining demand-supply balance. A randomized control architecture is proposed, motivated by the need for decentralized decision making, and the need to avoid synchronization that can lead to large and detrimental spikes in demand. An aggregate model for a large number of loads is then developed by examining the mean field limit. A key innovation is a linear time-invariant (LTI) system approximation of the aggregate nonlinear model, with a scalar signal as the input and a measure of the aggregate demand as the output. This makes the approximation particularly convenient for control design at the grid level. [ABSTRACT FROM AUTHOR]
Published: 2015
Full Text: View/download PDF

9. Learning in Mean-Field Games.

Author: Yin, Huibing, Mehta, Prashant G., Meyn, Sean P., and Shanbhag, Uday V.
Subjects: APPROXIMATION error, NONLINEAR systems, DYNAMIC programming, SYNCHRONIZATION, PHASE transitions, STOCHASTIC learning models
Abstract: The purpose of this paper is to show how insight obtained from a mean-field model can be used to create an architecture for approximate dynamic programming (ADP) for a certain class of games comprising of a large number of agents. The general technique is illustrated with the aid of a mean-field oscillator game model introduced in our prior work. The states of the model are interpreted as the phase angles for a collection of nonhomogeneous oscillators, and in this way the model may be regarded as an extension of the classical coupled oscillator model of Kuramoto. The paper introduces ADP techniques for design and adaptation (learning) of approximately optimal control laws for this model. For this purpose, a parameterization is proposed, based on an analysis of the mean-field PDE model for the game. In an offline setting, a Galerkin procedure is introduced to choose the optimal parameters while in an online setting, a steepest descent algorithm is proposed. The paper provides a detailed analysis of the optimal parameter values as well as the Bellman error with both the Galerkin approximation and the online algorithm. Finally, a phase transition result is described for the large population limit when each oscillator uses the approximately optimal control law. A critical value of the control penalty parameter is identified: above this value, the oscillators are incoherent; and below this value (when control is sufficiently cheap) the oscillators synchronize. These conclusions are illustrated with results from numerical experiments. [ABSTRACT FROM AUTHOR]
Published: 2014
Full Text: View/download PDF

10. Synchronization of Coupled Oscillators is a Game.

Author: Yin, Huibing, Mehta, Prashant G., Meyn, Sean P., and Shanbhag, Uday V.
Subjects: SYNCHRONIZATION, ELECTRIC oscillators, PHASE transitions, NONCOOPERATIVE games (Mathematics), DYNAMICS, H2 control, STOCHASTIC differential equations
Abstract: The purpose of this paper is to understand phase transition in noncooperative dynamic games with a large number of agents. Applications are found in neuroscience, biology, and economics, as well as traditional engineering applications. The focus of analysis is a variation of the large population linear quadratic Gaussian (LQG) model of Huang 2007, comprised here of a controlled N-dimensional stochastic differential equation model, coupled only through a cost function. The states are interpreted as phase angles for a collection of heterogeneous oscillators, and in this way the model may be regarded as an extension of the classical coupled oscillator model of Kuramoto. A deterministic PDE model is proposed, which is shown to approximate the stochastic system as the population size approaches infinity. Key to the analysis of the PDE model is the existence of a particular Nash equilibrium in which the agents ‘opt out’ of the game, setting their controls to zero, resulting in the ‘incoherence’ equilibrium. Methods from dynamical systems theory are used in a bifurcation analysis, based on a linearization of the partial differential equation (PDE) model about the incoherence equilibrium. A critical value of the control cost parameter is identified: above this value, the oscillators are incoherent; and below this value (when control is sufficiently cheap) the oscillators synchronize. These conclusions are illustrated with results from numerical experiments. [ABSTRACT FROM PUBLISHER]
Published: 2012
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

10 results on '"Meyn, Sean P."'

1. Differential Temporal Difference Learning.

2. State Estimation for the Individual and the Population in Mean Field Control With Application to Demand Dispatch.

3. Multivariable feedback particle filter.

4. Bifurcation analysis of a heterogeneous mean-field oscillator game model.

5. Feedback particle filter with mean-field coupling.

6. TD-learning with exploration.

7. Feedback Particle Filter for a Continuous-Time Markov Chain.

8. Ancillary Service to the Grid Using Intelligent Deferrable Loads.

9. Learning in Mean-Field Games.

10. Synchronization of Coupled Oscillators is a Game.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Journal

Database

10 results on '"Meyn, Sean P."'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources