Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review

Matloob, F, Ghazal, T M, Taleb, N, Aftab, S, Ahmad, M, Khan, M A, Abbas, S and Soomro, T R (2021) Software Defect Prediction Using Ensemble Learning: A Systematic Literature Review. IEEE Access, 9. pp. 98754-98771. ISSN 2169-3536

[thumbnail of 13.pdf] Text
13.pdf - Published Version

Download (3MB)

Abstract

Recent advances in the domain of software defect prediction (SDP) include the integration of multiple classification techniques to create an ensemble or hybrid approach. This technique was introduced to improve the prediction performance by overcoming the limitations of any single classification technique. This research provides a systematic literature review on the use of the ensemble learning approach for software defect prediction. The review is conducted after critically analyzing research papers published since 2012 in four well-known online libraries: ACM, IEEE, Springer Link, and Science Direct. In this study, five research questions covering the different aspects of research progress on the use of ensemble learning for software defect prediction are addressed. To extract the answers to identified questions, 46 most relevant papers are shortlisted after a thorough systematic research process. This study will provide compact information regarding the latest trends and advances in ensemble learning for software defect prediction and provide a baseline for future innovations and further reviews. Through our study, we discovered that frequently employed ensemble methods by researchers are the random forest, boosting, and bagging. Less frequently employed methods include stacking, voting and Extra Trees. Researchers proposed many promising frameworks, such as EMKCA, SMOTE-Ensemble, MKEL, SDAEsTSE, TLEL, and LRCR, using ensemble learning methods. The AUC, accuracy, F-measure, Recall, Precision, and MCC were mostly utilized to measure the prediction performance of models. WEKA was widely adopted as a platform for machine learning. Many researchers showed through empirical analysis that features selection, and data sampling was necessary pre-processing steps that improve the performance of ensemble classifiers.

Affiliation: Skyline University College
SUC Author(s): Ghazal, T M ORCID: https://orcid.org/0000-0003-0672-7924
All Author(s): Matloob, F, Ghazal, T M, Taleb, N, Aftab, S, Ahmad, M, Khan, M A, Abbas, S and Soomro, T R
Item Type: Article
Uncontrolled Keywords: Systematic literature review (SLR), ensemble classifier, hybrid classifier, software defect prediction.
Subjects: B Information Technology > BL Machine Learning
Divisions: Skyline University College > School of IT
Depositing User: Mr Veeramani Rasu
Date Deposited: 02 Feb 2022 14:26
Last Modified: 02 Feb 2022 14:26
URI: https://research.skylineuniversity.ac.ae/id/eprint/74
Publisher URL: https://doi.org/10.1109/ACCESS.2021.3095559
Publisher OA policy: https://v2.sherpa.ac.uk/id/publication/24685
Related URLs:

Actions (login required)

View Item
View Item
Statistics for SkyRep ePrint 74 Statistics for this ePrint Item