Yi Zhong,Weijie Zheng,Zhongyong Chen,Fan Xia,Liming Yu,Qiqi Wu,Xinkun Ai,Chengshuo Shen,Zhiyou Yang,W. Yan,Yonghua Ding,Yunjia Liang,Zhipeng Chen,R. H. Tong,Wei Bai,Jiarong Fang,Fang Li
出处
期刊:Plasma Physics and Controlled Fusion [IOP Publishing] 日期:2021-05-21卷期号:63 (7): 075008-075008被引量:15
标识
DOI:10.1088/1361-6587/abfa74
摘要
Abstract Using machine learning (ML) techniques to develop disruption predictors is an effective way to avoid or mitigate the disruption in a large-scale tokamak. The recent ML-based disruption predictors have made great progress regarding accuracy, but most of them have not achieved acceptable cross-machine performance. Before we develop a cross-machine predictor, it is very important to investigate the method of developing a cross-tokamak ML-based disruption prediction model. To ascertain the elements which impact the model’s performance and achieve a deep understanding of the predictor, multiple models are trained using data from two different tokamaks, J-TEXT and HL-2A, based on an implementation of the gradient-boosted decision trees algorithm called LightGBM, which can provide detailed information about the model and input features. The predictor models are not only built and tested for performance, but also analyzed from a feature importance perspective as well as for model performance variation. The relative feature importance ranking of two tokamaks is caused by differences in disruption types between different tokamaks. The result of two models with seven inputs showed that common diagnostics is very important in building a cross-machine predictor. This provided a strategy for selecting diagnostics and shots data for developing cross-machine predictors.