结冰
聚类分析
Boosting(机器学习)
过采样
人工智能
算法
风洞
机器学习
计算机科学
数学
数据挖掘
工程类
物理
航空航天工程
气象学
计算机网络
带宽(计算)
作者
Sai Li,Junsheng Cheng,Guangfu Bin
出处
期刊:IEEE Sensors Journal
[Institute of Electrical and Electronics Engineers]
日期:2023-09-01
卷期号:23 (17): 19726-19736
被引量:1
标识
DOI:10.1109/jsen.2023.3296086
摘要
Supervisory control and data acquisition (SCADA) is widely used in wind farms as an effective data acquisition system for wind turbines (WTs). However, in practical engineering applications, it is difficult for us to have adequate conditions to collect enough WT blade icing data, which leads to data imbalance and uneven distribution in the feature space. Using the classical synthetic minority oversampling technique (SMOTE) to balance the data may increase the overlap of positive and negative samples, or produce some redundant samples without useful information. A center jumping boosting machine (CJBM) method is proposed that combines an improved clustering-based oversampling (γ mini density peaks clustering SMOTE, γMiniDPC-SMOTE) and light gradient boosting machine (LightGBM) for blade icing prediction. First, to solve the problem of imbalanced and uneven distribution of WT data, a ${\gamma }$ MiniDPC-SMOTE method is proposed, which divides icing samples into multiple clusters, then increases icing samples, and alleviates uneven distribution in feature space. Second, calculating the intercept distance ${d}_{c}$ based on the binary search method and the adaptive selection of DPC parameters based on the step phenomenon of $\gamma $ parameters and verified by $\gamma $ -step of two WT icing data are proposed. Then, for the problem of low operating efficiency of the model under a large amount of imbalanced data, LightGBM is used for model training and icing prediction. Finally, validation was performed on two SCADA datasets. The results showed that the accuracy, precision, recall, F1-measure, and running times increased significantly, proving the superiority of the CJBM.
科研通智能强力驱动
Strongly Powered by AbleSci AI