Cross-project Defect Prediction with An Enhanced Transfer Boosting Algorithm

Boosting(机器学习) 计算机科学 加权 数据挖掘 软件 机器学习 学习迁移 人工智能 算法 数据建模 软件错误 预测建模 数据库 医学 放射科 程序设计语言
作者
Nazgol Nikravesh,Mohammad Reza Keyvanpour
标识
DOI:10.1109/iccke57176.2022.9960103
摘要

A growing number of software projects makes it increasingly crucial to predict software defects. If adequate historical data are accessible, within-project defect prediction models can be reliable. However, during the early phases of software development, sufficient data are not available to train an effective predictor. Cross-project defect prediction (CPDP) utilizes information from previous mature projects (source data) for predicting whether new software modules (target data) will be defective. CPDP models must take into account the fact that data distributions between source and target projects are different. Cross-project defect prediction often reduces distribution differences by either selecting training data or using transfer learning methods. Using transfer learning effectively reduces distribution differences in recent CPDP models, yet none of them have taken into account the possibility that negative transfer may occur as a result of the imbalanced nature of defect data. In this paper, a four-step model is proposed, of which three steps are dedicated to the preparation of training data and their initial weights for use in the fourth step, which involves an enhanced version of the transfer boosting algorithm. In this algorithm, the imbalance nature of data is considered and the weighting of the source data is updated to enhance the prediction performance. Therefore, aside from reducing distribution discrepancy between source and target data, this model also addresses the issues related to defect data class imbalance. As compared to four state-of-the-art CPDP models, this model provided consistent and accurate predictions for fifteen projects from PROMISE, AEEEM, and SOFTLAB. Our proposed model provided the best average results for both AUC and F-measure and in some datasets, the improvements were more than 5%.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
香芋应助猛犸象冲冲冲采纳,获得10
刚刚
如意厉完成签到,获得积分10
1秒前
1秒前
自乳化系统又creaming了完成签到 ,获得积分10
8秒前
Slemon发布了新的文献求助10
8秒前
jin发布了新的文献求助10
8秒前
子车茗应助ikea1984采纳,获得10
12秒前
12秒前
Jasper应助勤恳的若翠采纳,获得10
14秒前
zyt关闭了zyt文献求助
14秒前
Fred发布了新的文献求助50
15秒前
七月夏栀完成签到,获得积分10
15秒前
顾矜应助ljy采纳,获得10
16秒前
sunshine发布了新的文献求助10
17秒前
17秒前
顾矜应助jin采纳,获得10
19秒前
大闲鱼铭一完成签到 ,获得积分10
19秒前
20秒前
FashionBoy应助棋子一小枚采纳,获得10
22秒前
害羞的裘完成签到 ,获得积分10
22秒前
倪璞清完成签到,获得积分10
23秒前
w_应助禀受采纳,获得10
23秒前
大模型应助lcd采纳,获得10
26秒前
moyawen发布了新的文献求助10
26秒前
27秒前
我是老大应助小白兔采纳,获得10
29秒前
子舟完成签到,获得积分10
30秒前
斯文败类应助Mryuan采纳,获得10
30秒前
拾忆完成签到 ,获得积分10
30秒前
31秒前
31秒前
34秒前
李建涛完成签到,获得积分10
37秒前
1257应助快乐的90后fjk采纳,获得10
38秒前
39秒前
lcd发布了新的文献求助10
39秒前
39秒前
40秒前
李爱国应助moyawen采纳,获得30
40秒前
41秒前
高分求助中
Impact of Mitophagy-Related Genes on the Diagnosis and Development of Esophageal Squamous Cell Carcinoma via Single-Cell RNA-seq Analysis and Machine Learning Algorithms 2000
Die Elektra-Partitur von Richard Strauss : ein Lehrbuch für die Technik der dramatischen Komposition 1000
How to Create Beauty: De Lairesse on the Theory and Practice of Making Art 1000
Gerard de Lairesse : an artist between stage and studio 670
大平正芳: 「戦後保守」とは何か 550
LNG地下タンク躯体の構造性能照査指針 500
Cathodoluminescence and its Application to Geoscience 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 生物化学 物理 内科学 纳米技术 计算机科学 化学工程 复合材料 基因 遗传学 催化作用 物理化学 免疫学 量子力学 细胞生物学
热门帖子
关注 科研通微信公众号,转发送积分 3000581
求助须知:如何正确求助?哪些是违规求助? 2660351
关于积分的说明 7205018
捐赠科研通 2296234
什么是DOI,文献DOI怎么找? 1217586
科研通“疑难数据库(出版商)”最低求助积分说明 593826
版权声明 592931