结冰
聚类分析
Boosting(机器学习)
过采样
人工智能
算法
风洞
SCADA系统
机器学习
计算机科学
数学
数据挖掘
工程类
物理
航空航天工程
气象学
计算机网络
带宽(计算)
电气工程
作者
Sai Li,Yanfeng Peng,Guangfu Bin
标识
DOI:10.1109/jsen.2023.3296086
摘要
Supervisory control and data acquisition (SCADA) is widely used in wind farms as an effective data acquisition system for wind turbines (WTs). However, in practical engineering applications, it is difficult for us to have adequate conditions to collect enough WT blade icing data, which leads to data imbalance and uneven distribution in the feature space. Using the classical synthetic minority oversampling technique (SMOTE) to balance the data may increase the overlap of positive and negative samples, or produce some redundant samples without useful information. A center jumping boosting machine (CJBM) method is proposed that combines an improved clustering-based oversampling (γ mini density peaks clustering SMOTE, γMiniDPC-SMOTE) and light gradient boosting machine (LightGBM) for blade icing prediction. First, to solve the problem of imbalanced and uneven distribution of WT data, a ${\gamma }$ MiniDPC-SMOTE method is proposed, which divides icing samples into multiple clusters, then increases icing samples, and alleviates uneven distribution in feature space. Second, calculating the intercept distance ${d}_{c}$ based on the binary search method and the adaptive selection of DPC parameters based on the step phenomenon of $\gamma $ parameters and verified by $\gamma $ -step of two WT icing data are proposed. Then, for the problem of low operating efficiency of the model under a large amount of imbalanced data, LightGBM is used for model training and icing prediction. Finally, validation was performed on two SCADA datasets. The results showed that the accuracy, precision, recall, F1-measure, and running times increased significantly, proving the superiority of the CJBM.
科研通智能强力驱动
Strongly Powered by AbleSci AI