Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

Hybrid Grey Wolf and Dipper Throated Optimization in Network Intrusion Detection Systems

Reem Alkanhel^1,*, Doaa Sami Khafaga², El-Sayed M. El-kenawy³, Abdelaziz A. Abdelhamid^4,5, Abdelhameed Ibrahim⁶, Rashid Amin⁷, Mostafa Abotaleb⁸, B. M. El-den⁶

CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 2695-2709, 2023, DOI:10.32604/cmc.2023.033153

Abstract The Internet of Things (IoT) is a modern approach that enables connection with a wide variety of devices remotely. Due to the resource constraints and open nature of IoT nodes, the routing protocol for low power and lossy (RPL) networks may be vulnerable to several routing attacks. That’s why a network intrusion detection system (NIDS) is needed to guard against routing assaults on RPL-based IoT networks. The imbalance between the false and valid attacks in the training set degrades the performance of machine learning employed to detect network attacks. Therefore, we propose in this paper… More >

Open Access

ARTICLE

MCBC-SMOTE: A Majority Clustering Model for Classification of Imbalanced Data

Jyoti Arora¹, Meena Tushir², Keshav Sharma¹, Lalit Mohan¹, Aman Singh^3,*, Abdullah Alharbi⁴, Wael Alosaimi⁴

CMC-Computers, Materials & Continua, Vol.73, No.3, pp. 4801-4817, 2022, DOI:10.32604/cmc.2022.025960

Abstract Datasets with the imbalanced class distribution are difficult to handle with the standard classification algorithms. In supervised learning, dealing with the problem of class imbalance is still considered to be a challenging research problem. Various machine learning techniques are designed to operate on balanced datasets; therefore, the state of the art, different under-sampling, over-sampling and hybrid strategies have been proposed to deal with the problem of imbalanced datasets, but highly skewed datasets still pose the problem of generalization and noise generation during resampling. To over-come these problems, this paper proposes a majority clustering model for… More >

Open Access

ARTICLE

Water Quality Index Using Modified Random Forest Technique: Assessing Novel Input Features

Wen Yee Wong¹, Ayman Khallel Ibrahim Al-Ani¹, Khairunnisa Hasikin^1,*, Anis Salwa Mohd Khairuddin², Sarah Abdul Razak³, Hanee Farzana Hizaddin⁴, Mohd Istajib Mokhtar⁵, Muhammad Mokhzaini Azizan⁶

CMES-Computer Modeling in Engineering & Sciences, Vol.132, No.3, pp. 1011-1038, 2022, DOI:10.32604/cmes.2022.019244

Abstract Water quality analysis is essential to understand the ecological status of aquatic life. Conventional water quality index (WQI) assessment methods are limited to features such as water acidic or basicity (pH), dissolved oxygen (DO), biological oxygen demand (BOD), chemical oxygen demand (COD), ammoniacal nitrogen (NH₃-N), and suspended solids (SS). These features are often insufficient to represent the water quality of a heavy metal–polluted river. Therefore, this paper aims to explore and analyze novel input features in order to formulate an improved WQI. In this work, prospective insights on the feasibility of alternative water quality input variables… More >

Open Access

ARTICLE

An Imbalanced Dataset and Class Overlapping Classification Model for Big Data

Mini Prince^1,*, P. M. Joe Prathap²

Computer Systems Science and Engineering, Vol.44, No.2, pp. 1009-1024, 2023, DOI:10.32604/csse.2023.024277

Abstract Most modern technologies, such as social media, smart cities, and the internet of things (IoT), rely on big data. When big data is used in the real-world applications, two data challenges such as class overlap and class imbalance arises. When dealing with large datasets, most traditional classifiers are stuck in the local optimum problem. As a result, it’s necessary to look into new methods for dealing with large data collections. Several solutions have been proposed for overcoming this issue. The rapid growth of the available data threatens to limit the usefulness of many traditional methods.… More >

Open Access

ARTICLE

Hyper-Parameter Optimization of Semi-Supervised GANs Based-Sine Cosine Algorithm for Multimedia Datasets

Anas Al-Ragehi¹, Said Jadid Abdulkadir^1,2,*, Amgad Muneer^1,2, Safwan Sadeq³, Qasem Al-Tashi^4,5

CMC-Computers, Materials & Continua, Vol.73, No.1, pp. 2169-2186, 2022, DOI:10.32604/cmc.2022.027885

Abstract Generative Adversarial Networks (GANs) are neural networks that allow models to learn deep representations without requiring a large amount of training data. Semi-Supervised GAN Classifiers are a recent innovation in GANs, where GANs are used to classify generated images into real and fake and multiple classes, similar to a general multi-class classifier. However, GANs have a sophisticated design that can be challenging to train. This is because obtaining the proper set of parameters for all models-generator, discriminator, and classifier is complex. As a result, training a single GAN model for different datasets may not produce… More >

Open Access

ARTICLE

SMOTEDNN: A Novel Model for Air Pollution Forecasting and AQI Classification

Mohd Anul Haq^*

CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 1403-1425, 2022, DOI:10.32604/cmc.2022.021968

Abstract Rapid industrialization and urbanization are rapidly deteriorating ambient air quality, especially in the developing nations. Air pollutants impose a high risk on human health and degrade the environment as well. Earlier studies have used machine learning (ML) and statistical modeling to classify and forecast air pollution. However, these methods suffer from the complexity of air pollution dataset resulting in a lack of efficient classification and forecasting of air pollution. ML-based models suffer from improper data pre-processing, class imbalance issues, data splitting, and hyperparameter tuning. There is a gap in the existing ML-based studies on air… More >

Open Access

ARTICLE

Improving Routine Immunization Coverage Through Optimally Designed Predictive Models

Fareeha Sameen¹, Abdul Momin Kazi², Majida Kazmi^1,*, Munir A Abbasi³, Saad Ahmed Qazi^1,4, Lampros K Stergioulas^3,5

CMC-Computers, Materials & Continua, Vol.70, No.1, pp. 375-395, 2022, DOI:10.32604/cmc.2022.019167

Abstract Routine immunization (RI) of children is the most effective and timely public health intervention for decreasing child mortality rates around the globe. Pakistan being a low-and-middle-income-country (LMIC) has one of the highest child mortality rates in the world occurring mainly due to vaccine-preventable diseases (VPDs). For improving RI coverage, a critical need is to establish potential RI defaulters at an early stage, so that appropriate interventions can be targeted towards such population who are identified to be at risk of missing on their scheduled vaccine uptakes. In this paper, a machine learning (ML) based predictive… More >

Open Access

ARTICLE

Multi-Class Sentiment Analysis of Social Media Data with Machine Learning Algorithms

Galimkair Mutanov, Vladislav Karyukin^*, Zhanl Mamykova

CMC-Computers, Materials & Continua, Vol.69, No.1, pp. 913-930, 2021, DOI:10.32604/cmc.2021.017827

Abstract The volume of social media data on the Internet is constantly growing. This has created a substantial research field for data analysts. The diversity of articles, posts, and comments on news websites and social networks astonishes imagination. Nevertheless, most researchers focus on posts on Twitter that have a specific format and length restriction. The majority of them are written in the English language. As relatively few works have paid attention to sentiment analysis in the Russian and Kazakh languages, this article thoroughly analyzes news posts in the Kazakhstan media space. The amassed datasets include texts… More >

Open Access

ARTICLE

Oversampling Methods Combined Clustering and Data Cleaning for Imbalanced Network Data

Yang Yang^1,*, Qian Zhao¹, Linna Ruan², Zhipeng Gao¹, Yonghua Huo³, Xuesong Qiu¹

Intelligent Automation & Soft Computing, Vol.26, No.5, pp. 1139-1155, 2020, DOI:10.32604/iasc.2020.011705

Abstract In network anomaly detection, network traffic data are often imbalanced, that is, certain classes of network traffic data have a large sample data volume while other classes have few, resulting in reduced overall network traffic anomaly detection on a minority class of samples. For imbalanced data, researchers have proposed the use of oversampling techniques to balance data sets; in particular, an oversampling method called the SMOTE provides a simple and effective solution for balancing data sets. However, current oversampling methods suffer from the generation of noisy samples and poor information quality. Hence, this study proposes More >

Open Access

ARTICLE

Improving Performance Prediction on Education Data with Noise and Class Imbalance

Akram M. Radwan^a,b, Zehra Cataltepe^a,c

Intelligent Automation & Soft Computing, Vol.24, No.4, pp. 777-783, 2018, DOI:10.1080/10798587.2017.1337673

Abstract This paper proposes to apply machine learning techniques to predict students’ performance on two real-world educational data-sets. The first data-set is used to predict the response of students with autism while they learn a specific task, whereas the second one is used to predict students’ failure at a secondary school. The two data-sets suffer from two major problems that can negatively impact the ability of classification models to predict the correct label; class imbalance and class noise. A series of experiments have been carried out to improve the quality of training data, and hence improve… More >

Displaying 11-20 on page 2 of 21. Per Page

View

877

Download

526

Like

1

View

1247

Download

706

Like

2

View

2457

Download

963

Like

0

View

1889

Download

928

Like

0

View

1253

Download

754

Like

0

View

2990

Download

1619

Like

0

View

2111

Download

1069

Like

2

View

3236

Download

1757

Like

0

Cited by

1

View

1650

Download

1126

Like

1

Cited by

2

View

1424

Download

952

Like

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: