Open Access iconOpen Access

ARTICLE

Graph-Based Feature Learning for Cross-Project Software Defect Prediction

Ahmed Abdu1, Zhengjun Zhai1,2, Hakim A. Abdo3, Redhwan Algabri4,*, Sungon Lee5,*

1 School of Software, Northwestern Polytechnical University, Xi’an, China
2 School of Computer Science, Northwestern Polytechnical University, Xi’an, China
3 Department of Computer Science, Hodeidah University, PO Box 3114, Al-Hudaydah, Yemen
4 Research Institute of Engineering and Technology, Hanyang University, Ansan, Korea
5 Department of Robotics, Hanyang University, Ansan, Korea

* Corresponding Authors: Redhwan Algabri. Email: email; Sungon Lee. Email: email

Computers, Materials & Continua 2023, 77(1), 161-180. https://doi.org/10.32604/cmc.2023.043680

Abstract

Cross-project software defect prediction (CPDP) aims to enhance defect prediction in target projects with limited or no historical data by leveraging information from related source projects. The existing CPDP approaches rely on static metrics or dynamic syntactic features, which have shown limited effectiveness in CPDP due to their inability to capture higher-level system properties, such as complex design patterns, relationships between multiple functions, and dependencies in different software projects, that are important for CPDP. This paper introduces a novel approach, a graph-based feature learning model for CPDP (GB-CPDP), that utilizes NetworkX to extract features and learn representations of program entities from control flow graphs (CFGs) and data dependency graphs (DDGs). These graphs capture the structural and data dependencies within the source code. The proposed approach employs Node2Vec to transform CFGs and DDGs into numerical vectors and leverages Long Short-Term Memory (LSTM) networks to learn predictive models. The process involves graph construction, feature learning through graph embedding and LSTM, and defect prediction. Experimental evaluation using nine open-source Java projects from the PROMISE dataset demonstrates that GB-CPDP outperforms state-of-the-art CPDP methods in terms of F1-measure and Area Under the Curve (AUC). The results showcase the effectiveness of GB-CPDP in improving the performance of cross-project defect prediction.

Keywords


Cite This Article

APA Style
Abdu, A., Zhai, Z., Abdo, H.A., Algabri, R., Lee, S. (2023). Graph-based feature learning for cross-project software defect prediction. Computers, Materials & Continua, 77(1), 161-180. https://doi.org/10.32604/cmc.2023.043680
Vancouver Style
Abdu A, Zhai Z, Abdo HA, Algabri R, Lee S. Graph-based feature learning for cross-project software defect prediction. Comput Mater Contin. 2023;77(1):161-180 https://doi.org/10.32604/cmc.2023.043680
IEEE Style
A. Abdu, Z. Zhai, H.A. Abdo, R. Algabri, and S. Lee "Graph-Based Feature Learning for Cross-Project Software Defect Prediction," Comput. Mater. Contin., vol. 77, no. 1, pp. 161-180. 2023. https://doi.org/10.32604/cmc.2023.043680



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 877

    View

  • 262

    Download

  • 1

    Like

Share Link