强化学习
计算机科学
维数(图论)
过程(计算)
控制(管理)
增强学习
国家(计算机科学)
最优控制
能量(信号处理)
数学优化
控制理论(社会学)
算法
人工智能
数学
统计
纯数学
操作系统
作者
Fei Chen,Peng Mei,Hehui Xie,Shichun Yang,Bin Xu,Cong Huang
标识
DOI:10.1109/iccar55106.2022.9782662
摘要
This article is aimed at developing a control strategy based on the Q-learning algorithm for HEVs. The Q-learning algorithm deals with high-dimensional state space problems, and the agent will have a "dimension disaster" problem during the training process. Then a control strategy based on the Deep Q Network (DQN) algorithm is introduced. Since DQN can only output discrete actions, in order to achieve continuous action control, an optimized control strategy based on the Deep Deterministic Policy Gradient (DDPG) algorithm is proposed. Simulation results show that compared with Q-learning and DQN algorithms, the DDPG algorithm converges faster, and the training process is more robust. Besides, the energy optimization control strategy based on the DDPG algorithm can better control the energy of HEVs.
科研通智能强力驱动
Strongly Powered by AbleSci AI