Back to Search Start Over

Safe Optimal Control of Dynamic Systems: Learning from Experts and Safely Exploring New Policies.

Authors :
Candelieri, Antonio
Ponti, Andrea
Fersini, Elisabetta
Messina, Enza
Archetti, Francesco
Source :
Mathematics (2227-7390); Oct2023, Vol. 11 Issue 20, p4347, 16p
Publication Year :
2023

Abstract

Many real-life systems are usually controlled through policies replicating experts' knowledge, typically favouring "safety" at the expense of optimality. Indeed, these control policies are usually aimed at avoiding a system's disruptions or deviations from a target behaviour, leading to suboptimal performances. This paper proposes a statistical learning approach to exploit the historical safe experience—collected through the application of a safe control policy based on experts' knowledge— to "safely explore" new and more efficient policies. The basic idea is that performances can be improved by facing a reasonable and quantifiable risk in terms of safety. The proposed approach relies on Gaussian Process regression to obtain a probabilistic model of both a system's dynamics and performances, depending on the historical safe experience. The new policy consists of solving a constrained optimization problem, with two Gaussian Processes modelling, respectively, the safety constraints and the performance metric (i.e., objective function). As a probabilistic model, Gaussian Process regression provides an estimate of the target variable and the associated uncertainty; this property is crucial for dealing with uncertainty while new policies are safely explored. Another important benefit is that the proposed approach does not require any implementation of an expensive digital twin of the original system. Results on two real-life systems are presented, empirically proving the ability of the approach to improve performances with respect to the initial safe policy without significantly affecting safety. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
22277390
Volume :
11
Issue :
20
Database :
Complementary Index
Journal :
Mathematics (2227-7390)
Publication Type :
Academic Journal
Accession number :
173316874
Full Text :
https://doi.org/10.3390/math11204347