强化学习
稳健性(进化)
计算机科学
控制理论(社会学)
人工神经网络
控制(管理)
人工智能
控制工程
工程类
生物化学
基因
化学
作者
Yuanda Wang,Jia Lin Sun,Haibo He,Changyin Sun
标识
DOI:10.1109/tsmc.2018.2884725
摘要
In this paper, a deep reinforcement learning-based robust control strategy for quadrotor helicopters is proposed. The quadrotor is controlled by a learned neural network which directly maps the system states to control commands in an end-to-end style. The learning algorithm is developed based on the deterministic policy gradient algorithm. By introducing an integral compensator to the actor-critic structure, the tracking accuracy and robustness have been greatly enhanced. Moreover, a two-phase learning protocol which includes both offline and online learning phase is proposed for practical implementation. An offline policy is first learned based on a simplified quadrotor model. Then, the policy is online optimized in actual flight. The proposed approach is evaluated in the flight simulator. The results demonstrate that the offline learned policy is highly robust to model errors and external disturbances. It also shows that the online learning could significantly improve the control performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI