民用航空
集合预报
航空
集成学习
航空安全
计算机科学
随机森林
Boosting(机器学习)
数据建模
航空事故
商用航空
机器学习
人工智能
梯度升压
特征(语言学)
飞行安全
数据挖掘
工程类
数据库
航空学
航空航天工程
哲学
语言学
标识
DOI:10.1109/tsczh58792.2023.10233447
摘要
In recent times, aviation accidents have emerged as a significant contributor to severe injuries and fatalities globally. This has prompted the research community to explore aviation safety using advanced machine learning algorithms and data analysis techniques. To address the safety concerns in aviation, a novel ensemble classification model has been proposed, leveraging the Aviation Safety Reporting System (ASRS) data. The focus of this model is to analyze and assess the safety aspects pertaining to individuals affected within the aviation system. The ensemble classification model shall contain two modules: the data-driven module consisting of data cleaning, feature selection, and imbalanced data division and reorganization, and the modeldriven module stacked by Random Forest (RF), XGBoost (XGB), and Light Gradient Boosting Machine (LGBM) separately. The results indicate that the ensemble model could solve the data imbalance while vastly improving accuracy. LGBM illustrates higher accuracy and faster run in the analysis of a single model of the ASRS-based imbalanced data, while the ensemble model has the best performance in classification at the same time. The ensemble model proposed for imbalanced data classification can provide a certain reference for similar data processing while improving the safety of civil aviation.
科研通智能强力驱动
Strongly Powered by AbleSci AI