Open Access iconOpen Access

ARTICLE

crossmark

High Performance Classification of Android Malware Using Ensemble Machine Learning

Pagnchakneat C. Ouk1, Wooguil Pak2,*

1 Department of Computer Engineering, Keimyung University, Daegu, 42601, Korea
2 Department of Information and Communication Engineering, Yeungnam University, Gyeongsan, Gyeongbuk, 38541, Korea

* Corresponding Author: Wooguil Pak. Email: email

Computers, Materials & Continua 2022, 72(1), 381-398. https://doi.org/10.32604/cmc.2022.024540

Abstract

Although Android becomes a leading operating system in market, Android users suffer from security threats due to malwares. To protect users from the threats, the solutions to detect and identify the malware variant are essential. However, modern malware evades existing solutions by applying code obfuscation and native code. To resolve this problem, we introduce an ensemble-based malware classification algorithm using malware family grouping. The proposed family grouping algorithm finds the optimal combination of families belonging to the same group while the total number of families is fixed to the optimal total number. It also adopts unified feature extraction technique for handling seamless both bytecode and native code. We propose a unique feature selection algorithm that improves classification performance and time simultaneously. 2-gram based features are generated from the instructions and segments, and then selected by using multiple filters to choose most effective features. Through extensive simulation with many obfuscated and native code malware applications, we confirm that it can classify malwares with high accuracy and short processing time. Most existing approaches failed to achieve classification speed and detection time simultaneously. Therefore, the approach can help Android users to keep themselves safe from various and evolving cyber-attacks very effectively.

Keywords


Cite This Article

P. C. Ouk and W. Pak, "High performance classification of android malware using ensemble machine learning," Computers, Materials & Continua, vol. 72, no.1, pp. 381–398, 2022. https://doi.org/10.32604/cmc.2022.024540



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1160

    View

  • 940

    Download

  • 0

    Like

Share Link