弹道
强化学习
跟踪(教育)
计算机科学
人工智能
机器学习
物理
天文
心理学
教育学
作者
Haoran Xu,Jianyin Fan,Hongxu Ma,Qiang Wang
标识
DOI:10.1109/tim.2024.3370760
摘要
This paper aims to solve the trajectory tracking task of the pneumatic musculoskeletal robot within a model-based reinforcement learning framework. Considering the limited sensors and short lifespan of self-made pneumatic artificial muscles, physics priors are encoded into Gaussian process regression to implement a semi-parametric model for micro-data system identification and the identified model is combined with cross-entropy method (CEM)-based model predictive control to plan for the optimal action online. To further compensate for the model imperfection and improve the control performance, a hybrid feedforward and feedback controller-like strategy is proposed to guide the search space of the original CEM solver. The effectiveness of our approach is verified on a real musculoskeletal manipulator with two degrees of freedom and the results show that only 50 s of interacting with the environment is enough for the robot to learn writing alphabet letters from scratch.
科研通智能强力驱动
Strongly Powered by AbleSci AI