IJML 2024 Vol.14(2): 59-64
DOI: 10.18178/ijml.2024.14.2.1159

Software Defect Prediction Based on Tree-structured Parzen Estimator Using Machine Learning Classifiers

Faiza Khan1, Sultan Almari2,*, Muhammad Haseeb Khan3, and Summrina Kanwal4
1Riphah International University, Faculty of Computing, Islamabad 45211, Pakistan
2Department of Computing and Informatics, Saudi Electronic University, Riyadh 11673, Saudi Arabia
3Pak-Austria Fachhochschule Institute of Applied Sciences and Technology, Haripur, Pakistan
4Center for Applied Intelligent Systems Research, Halmstad University, Sweden
Email: monikacies@gmail.com(M.C.); clairefbonilla@gmail.com(C.F.B.); carolode@alumni.uv.es(C.O.-L.A.)
*Corresponding author

Manuscript received January 10, 2023; revised January 25, 2023; accepted February 9, 2023; published June 21, 2024.

Abstract—Software testing is the most significant task in software development and consumes the most time, cost, and effort. To reduce these costs, software defect prediction (SDP) is used to support the software quality assurance (SQA) process by predicting faulty or defective components. Researchers have proposed numerous methods for predicting defective components, but these methods produce biased results when applied to imbalanced datasets. An imbalanced dataset has a nonuniform class distribution, with very few instances of one class compared to the other. Using imbalanced datasets leads to inaccurate predictions for the minority class, which is usually considered more significant than the majority class. Thus, handling imbalanced data and performing hyperparameter optimization (HPO) efficiently are important for developing a capable defect prediction model. This paper proposes an SDP model that uses different machine learning classifiers, with the Tree-structured Parzen Estimator (TPE) method as the hyperparameter optimizer to improve defect prediction accuracy and the Synthetic Minority Oversampling Technique (SMOTE) to address the class imbalance problem. The proposed method was evaluated on eighteen software defect datasets from the PROMISE repository. Experimental results showed that the proposed technique achieved higher accuracy than the same classifiers used with default parameters.
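The abstract does not show how the TPE step works, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a single continuous hyperparameter, a fixed kernel bandwidth, and a toy loss standing in for a classifier's validation error; the names `parzen_density`, `tpe_minimize`, and `toy_loss` are hypothetical. Past trials are split into a "good" group (lowest losses) and a "bad" group, a Parzen (Gaussian kernel) density is built for each, and the next trial is the candidate that maximizes the ratio of the good density to the bad density.

```python
import math
import random


def parzen_density(x, centers, bandwidth):
    """Parzen-window estimate at x: average of Gaussian kernels on the centers."""
    norm = bandwidth * math.sqrt(2.0 * math.pi)
    total = sum(math.exp(-0.5 * ((x - c) / bandwidth) ** 2) for c in centers)
    return total / (len(centers) * norm)


def tpe_minimize(objective, low, high, n_trials=60, n_startup=10,
                 gamma=0.25, n_candidates=24, seed=0):
    """Simplified one-dimensional TPE loop; returns (best_x, best_loss)."""
    rng = random.Random(seed)
    bandwidth = (high - low) / 10.0  # fixed bandwidth: an illustrative simplification
    trials = []  # (loss, x) pairs observed so far
    for t in range(n_trials):
        if t < max(2, n_startup):
            # Warm-up phase: plain random search.
            x = rng.uniform(low, high)
        else:
            ranked = sorted(trials)  # ascending loss
            n_good = min(len(ranked) - 1,
                         max(1, int(math.ceil(gamma * len(ranked)))))
            good = [x_i for _, x_i in ranked[:n_good]]
            bad = [x_i for _, x_i in ranked[n_good:]]
            # Draw candidates from the "good" density l(x), clamped to the range.
            candidates = []
            for _ in range(n_candidates):
                c = rng.gauss(rng.choice(good), bandwidth)
                candidates.append(min(high, max(low, c)))
            # Keep the candidate with the largest ratio l(x) / g(x).
            x = max(candidates,
                    key=lambda c: parzen_density(c, good, bandwidth)
                    / (parzen_density(c, bad, bandwidth) + 1e-12))
        trials.append((objective(x), x))
    best_loss, best_x = min(trials)
    return best_x, best_loss


def toy_loss(x):
    """Hypothetical stand-in for a validation error with its minimum at x = 2."""
    return (x - 2.0) ** 2


if __name__ == "__main__":
    best_x, best_loss = tpe_minimize(toy_loss, -5.0, 5.0)
    print("best x:", best_x, "loss:", best_loss)
```

In a setting like the one the abstract describes, `toy_loss` would be replaced by a function that trains a classifier on the oversampled training data with the given hyperparameter value and returns its validation error. Production TPE implementations also handle multiple, categorical, and conditional hyperparameters and adapt the kernel bandwidths, which this sketch leaves out.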

Keywords—Software bug prediction (SDP), Tree-structured Parzen Estimator Method (TPE), Synthetic Minority Oversampling Technique (SMOTE), Hyperparameter optimization (HPO)


Cite: Faiza Khan, Sultan Almari, Muhammad Haseeb Khan, and Summrina Kanwal, "Software Defect Prediction Based on Tree-structured Parzen Estimator Using Machine Learning Classifiers," International Journal of Machine Learning, vol. 14, no. 2, pp. 59-64, 2024.

Copyright © 2024 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

General Information

  • E-ISSN: 2972-368X
  • Abbreviated Title: Int. J. Mach. Learn.
  • Frequency: Quarterly
  • DOI: 10.18178/IJML
  • Editor-in-Chief: Dr. Lin Huang
  • Executive Editor: Ms. Cherry L. Chen
  • Abstracting/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals Library, CNKI.
  • E-mail: ijml@ejournal.net

