强化学习
非线性系统
控制理论(社会学)
钢筋
振动
计算机科学
振动控制
控制(管理)
控制工程
人工智能
工程类
结构工程
声学
物理
量子力学
作者
Jiyuan Jiang,Jie Tang,Kun Zhao,Meng Li,Yinghui Li,Dengqing Cao
标识
DOI:10.1142/s0219455425500798
摘要
Optimal control of nonlinear vibration requires precise knowledge of the system and the solution to Hamilton–Jacobi–Bellman (HJB) equation. However, in practical engineering applications, acquiring precise system parameters poses challenges, and the analytical solutions for the HJB equation are difficult to obtain. In this paper, two reinforcement learning algorithms, Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm and Soft Actor-Critic (SAC) algorithm, are employed to train neural network-based optimal controllers for the van der Pol vibration system in the presence of unknown system parameters. To validate their performance, the controllers undergo testing in a series of experiments, including assessments of free vibration, frequency sweep excitation, and Gaussian noise excitation. The results indicate that both the TD3-trained and SAC-trained neural network-based controller are capable of proficiently suppress the vibration of the van der Pol oscillator. Additionally, these two model-free controllers can approximate the optimal control law which solved based on the dynamic model of the nonlinear system.
科研通智能强力驱动
Strongly Powered by AbleSci AI