Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (44)
  • Open Access

    ARTICLE

    A Hybrid CNN-Transformer Framework for Normal Blood Cell Classification: Towards Automated Hematological Analysis

    Osama M. Alshehri1, Ahmad Shaf2,*, Muhammad Irfan3,*, Mohammed M. Jalal4, Malik A. Altayar4, Mohammed H. Abu-Alghayth5, Humood Al Shmrany6, Tariq Ali7, Toufique A. Soomro8, Ali G. Alkhathami9

    CMES-Computer Modeling in Engineering & Sciences, Vol.144, No.1, pp. 1165-1196, 2025, DOI:10.32604/cmes.2025.067150 - 31 July 2025

    Abstract Background: Accurate classification of normal blood cells is a critical foundation for automated hematological analysis, including the detection of pathological conditions like leukemia. While convolutional neural networks (CNNs) excel in local feature extraction, their ability to capture global contextual relationships in complex cellular morphologies is limited. This study introduces a hybrid CNN-Transformer framework to enhance normal blood cell classification, laying the groundwork for future leukemia diagnostics. Methods: The proposed architecture integrates pre-trained CNNs (ResNet50, EfficientNetB3, InceptionV3, CustomCNN) with Vision Transformer (ViT) layers to combine local and global feature modeling. Four hybrid models were evaluated on… More >

  • Open Access

    ARTICLE

    Enhancing Phoneme Labeling in Dysarthric Speech with Digital Twin-Driven Multi-Modal Architecture

    Saeed Alzahrani1, Nazar Hussain2, Farah Mohammad3,*

    CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 4825-4849, 2025, DOI:10.32604/cmc.2025.066322 - 30 July 2025

    Abstract Digital twin technology is revolutionizing personalized healthcare by creating dynamic virtual replicas of individual patients. This paper presents a novel multi-modal architecture leveraging digital twins to enhance precision in predictive diagnostics and treatment planning of phoneme labeling. By integrating real-time images, electronic health records, and genomic information, the system enables personalized simulations for disease progression modeling, treatment response prediction, and preventive care strategies. In dysarthric speech, which is characterized by articulation imprecision, temporal misalignments, and phoneme distortions, existing models struggle to capture these irregularities. Traditional approaches, often relying solely on audio features, fail to address… More >

  • Open Access

    ARTICLE

    Improving Fashion Sentiment Detection on X through Hybrid Transformers and RNNs

    Bandar Alotaibi1,*, Aljawhara Almutarie2, Shuaa Alotaibi3, Munif Alotaibi4

    CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 4451-4467, 2025, DOI:10.32604/cmc.2025.066050 - 30 July 2025

    Abstract X (formerly known as Twitter) is one of the most prominent social media platforms, enabling users to share short messages (tweets) with the public or their followers. It serves various purposes, from real-time news dissemination and political discourse to trend spotting and consumer engagement. X has emerged as a key space for understanding shifting brand perceptions, consumer preferences, and product-related sentiment in the fashion industry. However, the platform’s informal, dynamic, and context-dependent language poses substantial challenges for sentiment analysis, mainly when attempting to detect sarcasm, slang, and nuanced emotional tones. This study introduces a hybrid… More >

  • Open Access

    REVIEW

    Transformers for Multi-Modal Image Analysis in Healthcare

    Sameera V Mohd Sagheer1,*, Meghana K H2, P M Ameer3, Muneer Parayangat4, Mohamed Abbas4

    CMC-Computers, Materials & Continua, Vol.84, No.3, pp. 4259-4297, 2025, DOI:10.32604/cmc.2025.063726 - 30 July 2025

    Abstract Integrating multiple medical imaging techniques, including Magnetic Resonance Imaging (MRI), Computed Tomography, Positron Emission Tomography (PET), and ultrasound, provides a comprehensive view of the patient health status. Each of these methods contributes unique diagnostic insights, enhancing the overall assessment of patient condition. Nevertheless, the amalgamation of data from multiple modalities presents difficulties due to disparities in resolution, data collection methods, and noise levels. While traditional models like Convolutional Neural Networks (CNNs) excel in single-modality tasks, they struggle to handle multi-modal complexities, lacking the capacity to model global relationships. This research presents a novel approach for… More >

  • Open Access

    ARTICLE

    Integrating Speech-to-Text for Image Generation Using Generative Adversarial Networks

    Smita Mahajan1, Shilpa Gite1,2, Biswajeet Pradhan3,*, Abdullah Alamri4, Shaunak Inamdar5, Deva Shriyansh5, Akshat Ashish Shah5, Shruti Agarwal5

    CMES-Computer Modeling in Engineering & Sciences, Vol.143, No.2, pp. 2001-2026, 2025, DOI:10.32604/cmes.2025.058456 - 30 May 2025

    Abstract The development of generative architectures has resulted in numerous novel deep-learning models that generate images using text inputs. However, humans naturally use speech for visualization prompts. Therefore, this paper proposes an architecture that integrates speech prompts as input to image-generation Generative Adversarial Networks (GANs) model, leveraging Speech-to-Text translation along with the CLIP + VQGAN model. The proposed method involves translating speech prompts into text, which is then used by the Contrastive Language-Image Pretraining (CLIP) + Vector Quantized Generative Adversarial Network (VQGAN) model to generate images. This paper outlines the steps required to implement such a… More >

  • Open Access

    ARTICLE

    Leveraging Transformers for Detection of Arabic Cyberbullying on Social Media: Hybrid Arabic Transformers

    Amjad A. Alsuwaylimi1,*, Zaid S. Alenezi2

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 3165-3185, 2025, DOI:10.32604/cmc.2025.061674 - 16 April 2025

    Abstract Cyberbullying is a remarkable issue in the Arabic-speaking world, affecting children, organizations, and businesses. Various efforts have been made to combat this problem through proposed models using machine learning (ML) and deep learning (DL) approaches utilizing natural language processing (NLP) methods and by proposing relevant datasets. However, most of these endeavors focused predominantly on the English language, leaving a substantial gap in addressing Arabic cyberbullying. Given the complexities of the Arabic language, transfer learning techniques and transformers present a promising approach to enhance the detection and classification of abusive content by leveraging large and pretrained… More >

  • Open Access

    ARTICLE

    Leveraging Unlabeled Corpus for Arabic Dialect Identification

    Mohammed Abdelmajeed1,*, Jiangbin Zheng1, Ahmed Murtadha1, Youcef Nafa1, Mohammed Abaker2, Muhammad Pervez Akhter3

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 3471-3491, 2025, DOI:10.32604/cmc.2025.059870 - 16 April 2025

    Abstract Arabic Dialect Identification (DID) is a task in Natural Language Processing (NLP) that involves determining the dialect of a given piece of text in Arabic. The state-of-the-art solutions for DID are built on various deep neural networks that commonly learn the representation of sentences in response to a given dialect. Despite the effectiveness of these solutions, the performance heavily relies on the amount of labeled examples, which is labor-intensive to attain and may not be readily available in real-world scenarios. To alleviate the burden of labeling data, this paper introduces a novel solution that leverages… More >

  • Open Access

    ARTICLE

    Token Masked Pose Transformers Are Efficient Learners

    Xinyi Song1, Haixiang Zhang1,*, Shaohua Li2

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 2735-2750, 2025, DOI:10.32604/cmc.2025.059006 - 16 April 2025

    Abstract In recent years, Transformer has achieved remarkable results in the field of computer vision, with its built-in attention layers effectively modeling global dependencies in images by transforming image features into token forms. However, Transformers often face high computational costs when processing large-scale image data, which limits their feasibility in real-time applications. To address this issue, we propose Token Masked Pose Transformers (TMPose), constructing an efficient Transformer network for pose estimation. This network applies semantic-level masking to tokens and employs three different masking strategies to optimize model performance, aiming to reduce computational complexity. Experimental results show More >

  • Open Access

    ARTICLE

    Semi-Supervised New Intention Discovery for Syntactic Elimination and Fusion in Elastic Neighborhoods

    Di Wu*, Liming Feng, Xiaoyu Wang

    CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 977-999, 2025, DOI:10.32604/cmc.2025.060319 - 26 March 2025

    Abstract Semi-supervised new intent discovery is a significant research focus in natural language understanding. To address the limitations of current semi-supervised training data and the underutilization of implicit information, a Semi-supervised New Intent Discovery for Elastic Neighborhood Syntactic Elimination and Fusion model (SNID-ENSEF) is proposed. Syntactic elimination contrast learning leverages verb-dominant syntactic features, systematically replacing specific words to enhance data diversity. The radius of the positive sample neighborhood is elastically adjusted to eliminate invalid samples and improve training efficiency. A neighborhood sample fusion strategy, based on sample distribution patterns, dynamically adjusts neighborhood size and fuses sample More >

  • Open Access

    ARTICLE

    LEGF-DST: LLMs-Enhanced Graph-Fusion Dual-Stream Transformer for Fine-Grained Chinese Malicious SMS Detection

    Xin Tong1, Jingya Wang1,*, Ying Yang2, Tian Peng3, Hanming Zhai1, Guangming Ling4

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 1901-1924, 2025, DOI:10.32604/cmc.2024.059018 - 17 February 2025

    Abstract With the widespread use of SMS (Short Message Service), the proliferation of malicious SMS has emerged as a pressing societal issue. While deep learning-based text classifiers offer promise, they often exhibit suboptimal performance in fine-grained detection tasks, primarily due to imbalanced datasets and insufficient model representation capabilities. To address this challenge, this paper proposes an LLMs-enhanced graph fusion dual-stream Transformer model for fine-grained Chinese malicious SMS detection. During the data processing stage, Large Language Models (LLMs) are employed for data augmentation, mitigating dataset imbalance. In the data input stage, both word-level and character-level features are More >

Displaying 11-20 on page 2 of 44. Per Page