Atif Raza Zaidi1, Tahir Abbas1,*, Ali Daud2,*, Omar Alghushairy3, Hussain Dawood4, Nadeem Sarwar5
CMC-Computers, Materials & Continua, Vol.84, No.2, pp. 3281-3304, 2025, DOI:10.32604/cmc.2025.063646
- 03 July 2025
Abstract Safeguarding against malware requires precise machine-learning algorithms to classify harmful apps. The Drebin dataset of 15,036 samples and 215 features yielded significant and reliable results for two hybrid models, CNN + XGBoost and KNN + XGBoost. To address the class imbalance issue, SMOTE (Synthetic Minority Over-sampling Technique) was used to preprocess the dataset, creating synthetic samples of the minority class (malware) to balance the training set. XGBoost was then used to choose the most essential features for separating malware from benign programs. The models were trained and tested using 6-fold cross-validation, measuring accuracy, precision, recall,… More >