1. Learning Hybrid Policies for MPC with Application to Drone Flight in Unknown Dynamic Environments.
- Author
-
Feng, Zhaohan, Chen, Jie, Xiao, Wei, Sun, Jian, Xin, Bin, and Wang, Gang
- Subjects
BLENDED learning ,ARTIFICIAL neural networks ,ARTIFICIAL intelligence ,MULTIAGENT systems ,CONTROL theory (Engineering) ,EVOLUTIONARY computation ,MECHANICS (Physics) - Abstract
The article discusses the use of model predictive control (MPC) for drone flight control in unknown dynamic environments. MPC is a practical method for drone flight control due to its robustness against modeling errors and uncertainties. However, it can degrade in performance when faced with unknown environmental dynamics. The paper introduces a parameterized MPC approach called hyMPC that adapts to uncertain environmental conditions. It uses a novel policy search framework to train a high-level Gaussian policy and neural network policies to provide real-time decision variables. The effectiveness of hyMPC is validated through numerical simulations, achieving a 100% success rate in drone flight tests. The text discusses a framework for maneuvering drones through swinging gates using reinforcement learning (RL) and model predictive control (MPC). The framework utilizes episode-based policy search methods to acquire parameterized motor primitives and skills for controlling the drone. The RL-driven MPC framework incorporates a multi-layer perceptron to predict the gate's future state and a hybrid MPC cost function for trajectory optimization. The policy parameters are updated using a reward function that evaluates the quality of sampled trajectories. The proposed framework is validated through numerical simulations. This document presents a novel framework for navigating drones through swinging gates with unknown dynamics. The framework combines real-time Model Predictive Control (MPC) and offline Reinforcement Learning (RL) techniques. The approach involves training a neural network to predict gate motions and using an MLP to guide trajectory planning. The framework divides the traversal task into gate-following and gate-traversing [Extracted from the article]
- Published
- 2024
- Full Text
- View/download PDF