强化学习
计算机科学
跟踪误差
跟踪(教育)
转化(遗传学)
约束(计算机辅助设计)
人工神经网络
方案(数学)
最优控制
人工智能
控制器(灌溉)
控制(管理)
控制工程
控制理论(社会学)
数学优化
工程类
数学
化学
基因
数学分析
农学
生物
机械工程
生物化学
教育学
心理学
作者
Ning Wang,Ying Gao,Xuefeng Zhang
出处
期刊:IEEE transactions on neural networks and learning systems
[Institute of Electrical and Electronics Engineers]
日期:2021-02-20
卷期号:32 (12): 5456-5467
被引量:146
标识
DOI:10.1109/tnnls.2021.3056444
摘要
An unmanned surface vehicle (USV) under complicated marine environments can hardly be modeled well such that model-based optimal control approaches become infeasible. In this article, a self-learning-based model-free solution only using input–output signals of the USV is innovatively provided. To this end, a data-driven performance-prescribed reinforcement learning control (DPRLC) scheme is created to pursue control optimality and prescribed tracking accuracy simultaneously. By devising state transformation with prescribed performance, constrained tracking errors are substantially converted into constraint-free stabilization of tracking errors with unknown dynamics. Reinforcement learning paradigm using neural network-based actor–critic learning framework is further deployed to directly optimize controller synthesis deduced from the Bellman error formulation such that transformed tracking errors evolve a data-driven optimal controller. Theoretical analysis eventually ensures that the entire DPRLC scheme can guarantee prescribed tracking accuracy, subject to optimal cost. Both simulations and virtual-reality experiments demonstrate the remarkable effectiveness and superiority of the proposed DPRLC scheme.
科研通智能强力驱动
Strongly Powered by AbleSci AI