Descriptor: "stochastic optimal control" / Journal: systems & control letters / Topic: stochastic control theory - Searchworks@Jio Institute Digital Library Search Results

Your search keyword '"stochastic optimal control"' showing total 6 results

Start Over Descriptor "stochastic optimal control" Topic stochastic control theory Journal systems & control letters

6 results on '"stochastic optimal control"'

1. A Q-learning algorithm for Markov decision processes with continuous state spaces.

Author: Hu, Jiaqiao, Yang, Xiangyu, Hu, Jian-Qiang, and Peng, Yijie
Subjects: *MARKOV processes, *CONTINUOUS processing, *ASYNCHRONOUS learning, *OPTIMIZATION algorithms, *ALGORITHMS, *STOCHASTIC control theory
Abstract: We propose an online algorithm for solving a class of continuous-state Markov decision processes. The algorithm combines classical Q-learning with an asynchronous averaging procedure, which allows Q-function estimates at sampled state–action pairs to be adaptively updated based on observations collected along a single sample trajectory. These estimates are then used to iteratively construct an interpolation-based function approximator of the Q-function. We prove the convergence of the algorithm and provide numerical results to illustrate its performance. • Proposed a model-free Q-learning algorithm for solving a class of infinite horizon Markov decision processes with continuous state spaces. • Proved the strong convergence of the algorithm when used in conjunction with a class of non-linear function approximators. • Illustrated the performance of the proposed algorithm through simulation comparison studies. [ABSTRACT FROM AUTHOR]
Published: 2024
Full Text: View/download PDF

2. FBSDEs involving time delays and advancements on infinite horizon and LQ problems with delays.

Author: Yang, Xueyang and Yu, Zhiyong
Subjects: *STOCHASTIC differential equations, *TIME delay systems, *STOCHASTIC control theory, *DELAY differential equations
Abstract: This paper is concerned with a class of coupled forward–backward stochastic differential equations (FBSDEs, for short) involving time delays and time advancements on infinite horizon. By introducing a randomized Lipschitz condition and a randomized monotonicity condition, the unique solvability of FBSDEs is obtained. Then the theoretical result is applied to a linear–quadratic (LQ, for short) problem of a time-delayed system with random coefficients. An explicit expression of the unique optimal control is obtained. [ABSTRACT FROM AUTHOR]
Published: 2022
Full Text: View/download PDF

3. Forward–backward linear quadratic stochastic optimal control problem with delay

Author: Huang, Jianhui, Li, Xun, and Shi, Jingtao
Subjects: *FORWARD-backward algorithm, *STOCHASTIC control theory, *DELAY differential equations, *RICCATI equation, *MATHEMATICAL variables, *FEEDBACK control systems, *MATHEMATICAL functions, *REGULATORS (Mathematics)
Abstract: Abstract: This paper is concerned with one kind of forward–backward linear quadratic stochastic control problem whose system is described by a linear anticipated forward–backward stochastic differential delayed equation. The explicit form of the optimal control is derived. Optimal state feedback regulators are studied in two special cases. For the case with delay in just the control variable, the optimal state feedback regulator is obtained by the Riccati equation. For the other case with delay in just the state variable, the optimal state feedback regulator is analyzed by the value function approach. [Copyright &y& Elsevier]
Published: 2012
Full Text: View/download PDF

4. Non-equivalence of stochastic optimal control problems with open and closed loop controls.

Author: Yong, Jiongmin and Zhang, Jianfeng
Subjects: *STOCHASTIC control theory, *STOCHASTIC differential equations
Abstract: For an optimal control problem of an Itô's type stochastic differential equation, the control process could be taken in open-loop or closed-loop forms. In the standard literature, provided appropriate regularity, the value functions under these two types of controls are equal and are the unique (viscosity) solution to the corresponding (path-dependent) HJB equation. In this short note, we provide a counterexample in the path dependent setting showing that these value functions can be different in general. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

5. On tight bounds for function approximation error in risk-sensitive reinforcement learning.

Author: Karmakar, Prasenjit and Bhatnagar, Shalabh
Subjects: *ERROR functions, *APPROXIMATION error, *STOCHASTIC control theory, *MARKOV processes, *REINFORCEMENT learning, *STOCHASTIC systems
Abstract: In this letter we provide several informative tight error bounds when using value function approximators for the risk-sensitive cost setting for a given policy represented using exponential utility. The novelty of our approach is that we make use of the irreducibility of the underlying Markov chain (resulting in better bounds using Perron–Frobenius eigenvectors) to derive new bounds whereas the earlier work used primarily the spectral variation bound which holds for any matrix, hence did not make use of the irreducibility. All our bounds have a perturbation term for large state spaces. We also present examples where we show that the new bounds perform 90-100% better than the earlier proposed spectral variation bound. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

6. A global maximum principle for stochastic optimal control problems with delay and applications.

Author: Meng, Weijun and Shi, Jingtao
Subjects: *STOCHASTIC control theory, *STOCHASTIC differential equations, *STOCHASTIC systems, *MAXIMUM principles (Mathematics), *DELAY differential equations
Abstract: In this paper, an open problem is solved, for the stochastic optimal control problem with delay where the control domain is nonconvex and the diffusion term contains both control and its delayed term. Inspired by previous results about delayed stochastic control systems, Peng's global stochastic maximum principle is generalized to the time delayed case. A special backward stochastic differential equation is introduced to deal with the cross terms, when applying the duality technique. Comparing with the classical result, the maximum condition contains an indicator function, which in fact is the characteristic of the stochastic optimal control problem with delay. Furthermore, to illustrate the applications of our theoretical results, three dynamic optimization problems are addressed based on the global maximum principle. [ABSTRACT FROM AUTHOR]
Published: 2021
Full Text: View/download PDF

Catalog

Books, media, physical & digital resources

See catalog results

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Refine your results

6 results on '"stochastic optimal control"'

1. A Q-learning algorithm for Markov decision processes with continuous state spaces.

2. FBSDEs involving time delays and advancements on infinite horizon and LQ problems with delays.

3. Forward–backward linear quadratic stochastic optimal control problem with delay

4. Non-equivalence of stochastic optimal control problems with open and closed loop controls.

5. On tight bounds for function approximation error in risk-sensitive reinforcement learning.

6. A global maximum principle for stochastic optimal control problems with delay and applications.

Catalog

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Search

Search Constraints

Refine your results

Search Limiters

Topic

Publication Year Range

Language

Publication Type

Database

6 results on '"stochastic optimal control"'

Search Results

Catalog

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources