Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

A Hybrid CNN-Transformer Framework for Normal Blood Cell Classification: Towards Automated Hematological Analysis

Osama M. Alshehri¹, Ahmad Shaf^2,*, Muhammad Irfan^3,*, Mohammed M. Jalal⁴, Malik A. Altayar⁴, Mohammed H. Abu-Alghayth⁵, Humood Al Shmrany⁶, Tariq Ali⁷, Toufique A. Soomro⁸, Ali G. Alkhathami⁹

CMES-Computer Modeling in Engineering & Sciences, Vol.144, No.1, pp. 1165-1196, 2025, DOI:10.32604/cmes.2025.067150 - 31 July 2025

Abstract Background: Accurate classification of normal blood cells is a critical foundation for automated hematological analysis, including the detection of pathological conditions like leukemia. While convolutional neural networks (CNNs) excel in local feature extraction, their ability to capture global contextual relationships in complex cellular morphologies is limited. This study introduces a hybrid CNN-Transformer framework to enhance normal blood cell classification, laying the groundwork for future leukemia diagnostics. Methods: The proposed architecture integrates pre-trained CNNs (ResNet50, EfficientNetB3, InceptionV3, CustomCNN) with Vision Transformer (ViT) layers to combine local and global feature modeling. Four hybrid models were evaluated on… More >

Open Access

ARTICLE

Enhancing Phoneme Labeling in Dysarthric Speech with Digital Twin-Driven Multi-Modal Architecture

Saeed Alzahrani¹, Nazar Hussain², Farah Mohammad^3,*

CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 4825-4849, 2025, DOI:10.32604/cmc.2025.066322 - 30 July 2025

Abstract Digital twin technology is revolutionizing personalized healthcare by creating dynamic virtual replicas of individual patients. This paper presents a novel multi-modal architecture leveraging digital twins to enhance precision in predictive diagnostics and treatment planning of phoneme labeling. By integrating real-time images, electronic health records, and genomic information, the system enables personalized simulations for disease progression modeling, treatment response prediction, and preventive care strategies. In dysarthric speech, which is characterized by articulation imprecision, temporal misalignments, and phoneme distortions, existing models struggle to capture these irregularities. Traditional approaches, often relying solely on audio features, fail to address… More >

Open Access

ARTICLE

Improving Fashion Sentiment Detection on X through Hybrid Transformers and RNNs

Bandar Alotaibi^1,*, Aljawhara Almutarie², Shuaa Alotaibi³, Munif Alotaibi⁴

CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 4451-4467, 2025, DOI:10.32604/cmc.2025.066050 - 30 July 2025

Abstract X (formerly known as Twitter) is one of the most prominent social media platforms, enabling users to share short messages (tweets) with the public or their followers. It serves various purposes, from real-time news dissemination and political discourse to trend spotting and consumer engagement. X has emerged as a key space for understanding shifting brand perceptions, consumer preferences, and product-related sentiment in the fashion industry. However, the platform’s informal, dynamic, and context-dependent language poses substantial challenges for sentiment analysis, mainly when attempting to detect sarcasm, slang, and nuanced emotional tones. This study introduces a hybrid… More >

Transformers for Multi-Modal Image Analysis in Healthcare

Sameera V Mohd Sagheer^1,*, Meghana K H², P M Ameer³, Muneer Parayangat⁴, Mohamed Abbas⁴

CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 4259-4297, 2025, DOI:10.32604/cmc.2025.063726 - 30 July 2025

Abstract Integrating multiple medical imaging techniques, including Magnetic Resonance Imaging (MRI), Computed Tomography, Positron Emission Tomography (PET), and ultrasound, provides a comprehensive view of the patient health status. Each of these methods contributes unique diagnostic insights, enhancing the overall assessment of patient condition. Nevertheless, the amalgamation of data from multiple modalities presents difficulties due to disparities in resolution, data collection methods, and noise levels. While traditional models like Convolutional Neural Networks (CNNs) excel in single-modality tasks, they struggle to handle multi-modal complexities, lacking the capacity to model global relationships. This research presents a novel approach for… More >

Open Access

ARTICLE

Integrating Speech-to-Text for Image Generation Using Generative Adversarial Networks

Smita Mahajan¹, Shilpa Gite^1,2, Biswajeet Pradhan^3,*, Abdullah Alamri⁴, Shaunak Inamdar⁵, Deva Shriyansh⁵, Akshat Ashish Shah⁵, Shruti Agarwal⁵

CMES-Computer Modeling in Engineering & Sciences, Vol.143, No.2, pp. 2001-2026, 2025, DOI:10.32604/cmes.2025.058456 - 30 May 2025

Abstract The development of generative architectures has resulted in numerous novel deep-learning models that generate images using text inputs. However, humans naturally use speech for visualization prompts. Therefore, this paper proposes an architecture that integrates speech prompts as input to image-generation Generative Adversarial Networks (GANs) model, leveraging Speech-to-Text translation along with the CLIP + VQGAN model. The proposed method involves translating speech prompts into text, which is then used by the Contrastive Language-Image Pretraining (CLIP) + Vector Quantized Generative Adversarial Network (VQGAN) model to generate images. This paper outlines the steps required to implement such a… More >

Open Access

ARTICLE

Leveraging Transformers for Detection of Arabic Cyberbullying on Social Media: Hybrid Arabic Transformers

Amjad A. Alsuwaylimi^1,*, Zaid S. Alenezi²

CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 3165-3185, 2025, DOI:10.32604/cmc.2025.061674 - 16 April 2025

Abstract Cyberbullying is a remarkable issue in the Arabic-speaking world, affecting children, organizations, and businesses. Various efforts have been made to combat this problem through proposed models using machine learning (ML) and deep learning (DL) approaches utilizing natural language processing (NLP) methods and by proposing relevant datasets. However, most of these endeavors focused predominantly on the English language, leaving a substantial gap in addressing Arabic cyberbullying. Given the complexities of the Arabic language, transfer learning techniques and transformers present a promising approach to enhance the detection and classification of abusive content by leveraging large and pretrained… More >

Open Access

ARTICLE

Leveraging Unlabeled Corpus for Arabic Dialect Identification

Mohammed Abdelmajeed^1,*, Jiangbin Zheng¹, Ahmed Murtadha¹, Youcef Nafa¹, Mohammed Abaker², Muhammad Pervez Akhter³

CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 3471-3491, 2025, DOI:10.32604/cmc.2025.059870 - 16 April 2025

Abstract Arabic Dialect Identification (DID) is a task in Natural Language Processing (NLP) that involves determining the dialect of a given piece of text in Arabic. The state-of-the-art solutions for DID are built on various deep neural networks that commonly learn the representation of sentences in response to a given dialect. Despite the effectiveness of these solutions, the performance heavily relies on the amount of labeled examples, which is labor-intensive to attain and may not be readily available in real-world scenarios. To alleviate the burden of labeling data, this paper introduces a novel solution that leverages… More >

Open Access

ARTICLE

Token Masked Pose Transformers Are Efficient Learners

Xinyi Song¹, Haixiang Zhang^1,*, Shaohua Li²

CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 2735-2750, 2025, DOI:10.32604/cmc.2025.059006 - 16 April 2025

Abstract In recent years, Transformer has achieved remarkable results in the field of computer vision, with its built-in attention layers effectively modeling global dependencies in images by transforming image features into token forms. However, Transformers often face high computational costs when processing large-scale image data, which limits their feasibility in real-time applications. To address this issue, we propose Token Masked Pose Transformers (TMPose), constructing an efficient Transformer network for pose estimation. This network applies semantic-level masking to tokens and employs three different masking strategies to optimize model performance, aiming to reduce computational complexity. Experimental results show More >

Open Access

ARTICLE

Semi-Supervised New Intention Discovery for Syntactic Elimination and Fusion in Elastic Neighborhoods

Di Wu^*, Liming Feng, Xiaoyu Wang

CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 977-999, 2025, DOI:10.32604/cmc.2025.060319 - 26 March 2025

Abstract Semi-supervised new intent discovery is a significant research focus in natural language understanding. To address the limitations of current semi-supervised training data and the underutilization of implicit information, a Semi-supervised New Intent Discovery for Elastic Neighborhood Syntactic Elimination and Fusion model (SNID-ENSEF) is proposed. Syntactic elimination contrast learning leverages verb-dominant syntactic features, systematically replacing specific words to enhance data diversity. The radius of the positive sample neighborhood is elastically adjusted to eliminate invalid samples and improve training efficiency. A neighborhood sample fusion strategy, based on sample distribution patterns, dynamically adjusts neighborhood size and fuses sample More >

Open Access

ARTICLE

LEGF-DST: LLMs-Enhanced Graph-Fusion Dual-Stream Transformer for Fine-Grained Chinese Malicious SMS Detection

Xin Tong¹, Jingya Wang^1,*, Ying Yang², Tian Peng³, Hanming Zhai¹, Guangming Ling⁴

CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 1901-1924, 2025, DOI:10.32604/cmc.2024.059018 - 17 February 2025

Abstract With the widespread use of SMS (Short Message Service), the proliferation of malicious SMS has emerged as a pressing societal issue. While deep learning-based text classifiers offer promise, they often exhibit suboptimal performance in fine-grained detection tasks, primarily due to imbalanced datasets and insufficient model representation capabilities. To address this challenge, this paper proposes an LLMs-enhanced graph fusion dual-stream Transformer model for fine-grained Chinese malicious SMS detection. During the data processing stage, Large Language Models (LLMs) are employed for data augmentation, mitigating dataset imbalance. In the data input stage, both word-level and character-level features are More >

Displaying 11-20 on page 2 of 44. Per Page

View

1745

Download

805

View

1177

Download

627

View

1313

Download

723

View

1960

Download

869

View

1979

Download

723

View

1349

Download

486

View

1207

Download

442

View

1356

Download

677

View

969

Download

669

View

1459

Download

642

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: