Start Over

Deep Imitation Learning for Optimal Trajectory Planning and Initial Condition Optimization for an Unstable Dynamic System.

Authors :: Chen, Bo-Hsun
Lin, Pei-Chun
Source :: Advanced Intelligent Systems (2640-4567); Jan2024, Vol. 6 Issue 1, p1-24, 24p
Publication Year :: 2024
Abstract: In this article, an innovative offline deep imitation learning algorithm for optimal trajectory planning is proposed. While many state‐of‐the‐art works achieved optimal trajectory planning, their systems were stable or quasistable, and their approaches rarely optimized the system's initial conditions (ICs). Here, a new unstable dynamic system task called "internal sliding object stabilization control" is proposed, modeled, and solved by deep imitation learning. Given the system's ICs, the neural networks (NNs) can imitate the iterative linear quadratic regulator (iLQR), generate optimal trajectories, and compute faster. A proportional–integral–derivative (PID) controller is used to track the unstable trajectories. Leveraging on the gradients of NNs, it can optimize the system's ICs, avoid obstacles stepwise, and ensure the worst bounds of NNs for safety. Subsequently, thorough simulations are conducted, including comparing the iLQR and PID controllers in the task, optimizing the system's different ICs by gradient descent, and finding the worst bound of the performance by gradient ascent. Results show that the proposed algorithm achieves considerably improved performance. Finally, experiments are conducted with a real manipulator to compare the proposed structure with the original iLQR. Results indicate that the proposed algorithm resembles the iLQR well. Program code and experiment results are in https://github.com/DanielYamChen/ISOSC.git. [ABSTRACT FROM AUTHOR]

Subjects :: MACHINE learning
DYNAMICAL systems
PID controllers
TRAJECTORY optimization

Details

Language :: English
ISSN :: 26404567
Volume :: 6
Issue :: 1
Database :: Complementary Index
Journal :: Advanced Intelligent Systems (2640-4567)
Publication Type :: Academic Journal
Accession number :: 174935973
Full Text :: https://doi.org/10.1002/aisy.202300379

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Deep Imitation Learning for Optimal Trajectory Planning and Initial Condition Optimization for an Unstable Dynamic System.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Deep Imitation Learning for Optimal Trajectory Planning and Initial Condition Optimization for an Unstable Dynamic System.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources