克星
托卡马克
前馈
BETA(编程语言)
钢筋
计算机科学
物理
核物理学
材料科学
等离子体
工程类
控制工程
复合材料
程序设计语言
作者
Jaemin Seo,Yong-Su Na,B. Kim,Chanyoung Lee,M.S. Park,Seong‐Jik Park,Y.H. Lee
出处
期刊:Nuclear Fusion
[IOP Publishing]
日期:2021-07-07
卷期号:61 (10): 106010-106010
被引量:38
标识
DOI:10.1088/1741-4326/ac121b
摘要
In this work, we address a new feedforward control scheme of the normalized beta (βN) in tokamak plasmas, using the deep reinforcement learning (RL) technique. The deep RL algorithm optimizes an artificial decision-making agent that adjusts the discharge scenario to obtain the given target βN, from the state-action-reward sets explored by trials and errors of itself in the virtual tokamak environment. The virtual environment for the RL training is constructed with the LSTM network that imitates the plasma responses by external actuator controls, which is trained from 5-year KSTAR experimental data. Then, the RL agent experiences tons of discharges with different actuator controls in the LSTM simulator, and its internal parameters are optimized in the direction of maximizing the reward. We analyze a series of KSTAR experiments conducted with the RL-determined scenarios to validate the feasibility of the beta control scheme in a real device. We discuss the successes and limitations of the feedforward beta control by RL, and suggest our future works about it.
科研通智能强力驱动
Strongly Powered by AbleSci AI