Open Access iconOpen Access

ARTICLE

crossmark

An Enhanced Memetic Algorithm for Feature Selection in Big Data Analytics with MapReduce

Umanesan Ramakrishnan1,*, Nandhagopal Nachimuthu2

1 Anna University, Chennai, 600025, India
2 Department of Electronics and Communication Engineering, Excel Engineering College, Namakkal, 637303, India

* Corresponding Author: Umanesan Ramakrishnan. Email: email

Intelligent Automation & Soft Computing 2022, 31(3), 1547-1559. https://doi.org/10.32604/iasc.2022.017123

Abstract

Recently, various research fields have begun dealing with massive datasets forseveral functions. The main aim of a feature selection (FS) model is to eliminate noise, repetitive, and unnecessary featuresthat reduce the efficiency of classification. In a limited period, traditional FS models cannot manage massive datasets and filterunnecessary features. It has been discovered from the state-of-the-art literature that metaheuristic algorithms perform better compared to other FS wrapper-based techniques. Common techniques such as the Genetic Algorithm (GA) andParticle Swarm Optimization (PSO) algorithm, however, suffer from slow convergence and local optima problems. Even with new generation algorithms such as Firefly heuristic and Fish Swarm Heuristic, these questions have been shown to overcome. This paper introduces an improved memetic optimization (EMO) algorithm for FS in this perspective by using conditional criteria in large datasets. The proposed EMO algorithm divides the entire dataset into sample blocksandconducts the task of learning in the map steps. The partial result obtained is combined into a final vector of feature weights in the reductionprocess which defines the appropriate collection of characteristics. Finally, the method of grouping based on the support vector machine (SVM) takes place. Within the Spark system, the proposed EMO algorithm is applied and the experimental results claim that it is superior to other approaches. The simulation results show that the maximum AUC values of 0.79 and 0.74 respectively are obtained by the EMO-FS model.

Keywords


Cite This Article

U. Ramakrishnan and N. Nachimuthu, "An enhanced memetic algorithm for feature selection in big data analytics with mapreduce," Intelligent Automation & Soft Computing, vol. 31, no.3, pp. 1547–1559, 2022. https://doi.org/10.32604/iasc.2022.017123



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1562

    View

  • 887

    Download

  • 0

    Like

Share Link