Open Access

ARTICLE

Metaheuristic Based Clustering with Deep Learning Model for Big Data Classification

R. Krishnaswamy1, Kamalraj Subramaniam2, V. Nandini3, K. Vijayalakshmi4, Seifedine Kadry5, Yunyoung Nam6,*
1 Department of Electronics and Communication Engineering, University College of Engineering Ariyalur, Ariyalur, 621704, India
2 Department of Biomedical Engineering, Faculty of Engineering, Karpagam Academy of Higher Education, Coimbatore, 641021, India
3 Department of Computer Science and Engineering, Sona College of Technology, Salem, 636 005, India
4 Department of Electronics and Communication Engineering, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, 600077, India
5 Deparmtent of Applied Data Science, Noroff University College, Kristiansand, Norway
6 Department of Computer Science and Engineering, Soonchunhyang University, Asan, Korea
* Corresponding Author: Yunyoung Nam. Email:

Computer Systems Science and Engineering 2023, 44(1), 391-406. https://doi.org/10.32604/csse.2023.024901

Received 03 November 2021; Accepted 20 December 2021; Issue published 01 June 2022

Abstract

Recently, a massive quantity of data is being produced from a distinct number of sources and the size of the daily created on the Internet has crossed two Exabytes. At the same time, clustering is one of the efficient techniques for mining big data to extract the useful and hidden patterns that exist in it. Density-based clustering techniques have gained significant attention owing to the fact that it helps to effectively recognize complex patterns in spatial dataset. Big data clustering is a trivial process owing to the increasing quantity of data which can be solved by the use of Map Reduce tool. With this motivation, this paper presents an efficient Map Reduce based hybrid density based clustering and classification algorithm for big data analytics (MR-HDBCC). The proposed MR-HDBCC technique is executed on Map Reduce tool for handling the big data. In addition, the MR-HDBCC technique involves three distinct processes namely pre-processing, clustering, and classification. The proposed model utilizes the Density-Based Spatial Clustering of Applications with Noise (DBSCAN) technique which is capable of detecting random shapes and diverse clusters with noisy data. For improving the performance of the DBSCAN technique, a hybrid model using cockroach swarm optimization (CSO) algorithm is developed for the exploration of the search space and determine the optimal parameters for density based clustering. Finally, bidirectional gated recurrent neural network (BGRNN) is employed for the classification of big data. The experimental validation of the proposed MR-HDBCC technique takes place using the benchmark dataset and the simulation outcomes demonstrate the promising performance of the proposed model interms of different measures.

Keywords

Big data; data classification; clustering; mapreduce; dbscan algorithm

Cite This Article

R. Krishnaswamy, K. Subramaniam, V. Nandini, K. Vijayalakshmi, S. Kadry et al., "Metaheuristic based clustering with deep learning model for big data classification," Computer Systems Science and Engineering, vol. 44, no.1, pp. 391–406, 2023.



This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 655

    View

  • 389

    Download

  • 0

    Like

Share Link

WeChat scan