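Reinforcement learning
Computer science
Active suspension
Nonlinear system
Controller (irrigation)
Control theory (sociology)
Convergence (economics)
Control engineering
Artificial intelligence
Parametric statistics
PID controller
Control (management)
Engineering
Mathematics
Actuator
Temperature control
Physics
Statistics
Economics
Biology
Quantum mechanics
Economic growth
Agronomy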
Authors
Zhao Tan, Guilin Wen, Zebang Pan, Shan Yin, Xiaojian Wu, Gulbahar Tohti
Identifier
DOI:10.1177/09544070231191842
Abstract
A well-controlled active suspension system can provide better ride comfort. Thanks to their powerful feature extraction and nonlinear generalization capabilities, deep reinforcement learning (DRL) methods such as deep deterministic policy gradient (DDPG) have shown great potential for making adaptive, intelligent decisions in active suspension control. However, DDPG suffers from low training efficiency because a high proportion of the strategies it explores are illegal. This paper proposes a novel DDPG controller for a nonlinear uncertain active suspension system that combines DRL with expert demonstrations. Specifically, an improved training method that integrates a pre-training mechanism based on PID expert samples with an adaptive experience replay mechanism is put forward, so that DDPG both imitates the expert and trains more efficiently. Moreover, with ride comfort and state constraints as targets, a mixed reward function is designed to guide the RL agent toward effective actions. The proposed training methods are shown to effectively accelerate the convergence of DDPG. Furthermore, comparison experiments demonstrate that the proposed controller provides strong vibration attenuation and adapts better to various working conditions and parametric uncertainty.
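The abstract names three ingredients: PID expert samples for pre-training, an adaptive experience replay, and a mixed reward built from ride comfort and state constraints. The sketch below illustrates how such pieces could fit together; it is not the authors' implementation. Everything in it is an assumption made for illustration: the linear quarter-car model and its parameter values, the PID gains, the form and weights of `mixed_reward`, and the linearly decaying `expert_ratio` schedule (the abstract does not say how the replay adapts). The DDPG actor and critic themselves are omitted.

```python
import random

# Hypothetical quarter-car parameters (illustrative values, not from the paper).
MS, MU = 300.0, 40.0         # sprung / unsprung mass [kg]
KS, KT = 16000.0, 190000.0   # suspension / tire stiffness [N/m]
CS = 1000.0                  # suspension damping [N s/m]
DT = 0.001                   # integration step [s]
DEFLECTION_LIMIT = 0.08      # assumed suspension travel limit [m]


def step(state, force, road):
    """Advance a linear quarter-car model by one semi-implicit Euler step."""
    zs, vs, zu, vu = state
    f_susp = KS * (zu - zs) + CS * (vu - vs) + force
    a_s = f_susp / MS                          # body (sprung mass) acceleration
    a_u = (-f_susp + KT * (road - zu)) / MU    # wheel (unsprung mass) acceleration
    vs, vu = vs + a_s * DT, vu + a_u * DT
    zs, zu = zs + vs * DT, zu + vu * DT
    return (zs, vs, zu, vu), a_s


def mixed_reward(a_s, deflection, penalty=100.0):
    """Assumed reward: comfort term plus a penalty when travel exceeds the limit."""
    comfort = -(a_s ** 2)
    violation = -penalty if abs(deflection) > DEFLECTION_LIMIT else 0.0
    return comfort + violation


def collect_expert_samples(n_steps, kp=2000.0, ki=500.0, kd=10.0):
    """Roll out a PID controller (body velocity -> 0) and record transitions."""
    state = (0.0, 0.0, 0.0, 0.0)
    integral, prev_err = 0.0, 0.0
    samples = []
    for k in range(n_steps):
        road = 0.05 if k * DT >= 0.1 else 0.0  # 5 cm step bump at t = 0.1 s
        err = -state[1]
        integral += err * DT
        force = kp * err + ki * integral + kd * (err - prev_err) / DT
        prev_err = err
        next_state, a_s = step(state, force, road)
        reward = mixed_reward(a_s, next_state[0] - next_state[2])
        samples.append((state, force, reward, next_state))
        state = next_state
    return samples


def expert_ratio(episode, decay_episodes):
    """Assumed schedule: share of expert samples decays linearly to zero."""
    return max(0.0, 1.0 - episode / decay_episodes)


def sample_batch(expert_buf, agent_buf, batch_size, ratio):
    """Draw a mini-batch mixing expert and agent transitions at the given ratio."""
    n_expert = min(len(expert_buf), round(batch_size * ratio))
    n_agent = min(len(agent_buf), batch_size - n_expert)
    return random.sample(expert_buf, n_expert) + random.sample(agent_buf, n_agent)
```

In a setup like this, the expert buffer would be filled once with `collect_expert_samples` before training, and each training step would call `sample_batch` with `expert_ratio(episode, decay_episodes)`, so early updates lean on the PID demonstrations and later updates rely on the agent's own experience.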