强化学习
弹道
高超音速
钢筋
计算机科学
航空航天工程
人工智能
航空学
工程类
物理
结构工程
天文
作者
Jianfeng Li,Shenmin Song,Xiaoping Shi
标识
DOI:10.1177/09544100241278023
摘要
To overcome the shortcomings of traditional NLP methods for trajectory planning problems, an intelligent trajectory real-time planning method is designed for hypersonic gliding vehicles (HGVs), which is composed of two stages: the agent training stage and the real-time trajectory generation stage. During the training stage, the HGV model is considered as an agent, and an environment containing flight information and relative information is constructed. Given the trajectory planning problem possessing continuous state-action space, the twin delayed deep deterministic policy gradient (TD3) is employed, based on which the HGV agent is trained in the environment. To match the real flight environment for HGVs, the process and terminal constraints are taken into consideration, such as the limit of dynamic pressure, overload, and the terminal miss distance, etc. The reward shaping technique is adopted to tackle the multiple constraints. The second stage is the real-time trajectory generation stage, during which a trajectory satisfying the multiple constraints is generated online by the TD3-based method. The simulation results verify the effectiveness of the proposed method.
科研通智能强力驱动
Strongly Powered by AbleSci AI