Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (115)
  • Open Access

    ARTICLE

    Improving Hornet Detection with the YOLOv7-Tiny Model: A Case Study on Asian Hornets

    Yung-Hsiang Hung, Chuen-Kai Fan, Wen-Pai Wang*

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 2323-2349, 2025, DOI:10.32604/cmc.2025.063270 - 16 April 2025

    Abstract Bees play a crucial role in the global food chain, pollinating over 75% of food and producing valuable products such as bee pollen, propolis, and royal jelly. However, the Asian hornet poses a serious threat to bee populations by preying on them and disrupting agricultural ecosystems. To address this issue, this study developed a modified YOLOv7tiny (You Only Look Once) model for efficient hornet detection. The model incorporated space-to-depth (SPD) and squeeze-and-excitation (SE) attention mechanisms and involved detailed annotation of the hornet’s head and full body, significantly enhancing the detection of small objects. The Taguchi… More >

  • Open Access

    ARTICLE

    Deep Learning Algorithm for Person Re-Identification Based on Dual Network Architecture

    Meng Zhu1,2, Xingyue Wang3, Honge Ren3,4,*, Abeer Hakeem5, Linda Mohaisen5,*

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 2889-2905, 2025, DOI:10.32604/cmc.2025.061421 - 16 April 2025

    Abstract Changing a person’s posture and low resolution are the key challenges for person re-identification (ReID) in various deep learning applications. In this paper, we introduce an innovative architecture using a dual attention network that includes an attention module and a joint measurement module of spatial-temporal information. The proposed approach can be classified into two main tasks. Firstly, the spatial attention feature map is formed by aggregating features in the spatial dimension. Additionally, the same operation is carried out on the channel dimension to form channel attention feature maps. Therefore, the receptive field size is adjusted… More >

  • Open Access

    ARTICLE

    Dynamic Characteristic Testing of Wind Turbine Structure Based on Visual Monitoring Data Fusion

    Wenhai Zhao1, Wanrun Li1,2,*, Ximei Li1,2, Shoutu Li3, Yongfeng Du1,2

    Structural Durability & Health Monitoring, Vol.19, No.3, pp. 593-611, 2025, DOI:10.32604/sdhm.2024.057759 - 03 April 2025

    Abstract Addressing the current challenges in transforming pixel displacement into physical displacement in visual monitoring technologies, as well as the inability to achieve precise full-field monitoring, this paper proposes a method for identifying the structural dynamic characteristics of wind turbines based on visual monitoring data fusion. Firstly, the Lucas-Kanade Tomasi (LKT) optical flow method and a multi-region of interest (ROI) monitoring structure are employed to track pixel displacements, which are subsequently subjected to band pass filtering and resampling operations. Secondly, the actual displacement time history is derived through double integration of the acquired acceleration data and… More >

  • Open Access

    ARTICLE

    A Novel CAPTCHA Recognition System Based on Refined Visual Attention

    Zaid Derea1,2,*, Beiji Zou1, Xiaoyan Kui1,*, Monir Abdullah3, Alaa Thobhani1, Amr Abdussalam4

    CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 115-136, 2025, DOI:10.32604/cmc.2025.062729 - 26 March 2025

    Abstract Improving website security to prevent malicious online activities is crucial, and CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) has emerged as a key strategy for distinguishing human users from automated bots. Text-based CAPTCHAs, designed to be easily decipherable by humans yet challenging for machines, are a common form of this verification. However, advancements in deep learning have facilitated the creation of models adept at recognizing these text-based CAPTCHAs with surprising efficiency. In our comprehensive investigation into CAPTCHA recognition, we have tailored the renowned UpDown image captioning model specifically for this… More >

  • Open Access

    REVIEW

    A Comprehensive Review of Pill Image Recognition

    Linh Nguyen Thi My1,2,*, Viet-Tuan Le3, Tham Vo1, Vinh Truong Hoang3,*

    CMC-Computers, Materials & Continua, Vol.82, No.3, pp. 3693-3740, 2025, DOI:10.32604/cmc.2025.060793 - 06 March 2025

    Abstract Pill image recognition is an important field in computer vision. It has become a vital technology in healthcare and pharmaceuticals due to the necessity for precise medication identification to prevent errors and ensure patient safety. This survey examines the current state of pill image recognition, focusing on advancements, methodologies, and the challenges that remain unresolved. It provides a comprehensive overview of traditional image processing-based, machine learning-based, deep learning-based, and hybrid-based methods, and aims to explore the ongoing difficulties in the field. We summarize and classify the methods used in each article, compare the strengths and More >

  • Open Access

    ARTICLE

    A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention

    Zaid Derea1,2, Beiji Zou1, Xiaoyan Kui1,*, Alaa Thobhani1, Amr Abdussalam3

    CMES-Computer Modeling in Engineering & Sciences, Vol.142, No.3, pp. 2841-2867, 2025, DOI:10.32604/cmes.2025.059586 - 03 March 2025

    Abstract Enhancing website security is crucial to combat malicious activities, and CAPTCHA (Completely Automated Public Turing tests to tell Computers and Humans Apart) has become a key method to distinguish humans from bots. While text-based CAPTCHAs are designed to challenge machines while remaining human-readable, recent advances in deep learning have enabled models to recognize them with remarkable efficiency. In this regard, we propose a novel two-layer visual attention framework for CAPTCHA recognition that builds on traditional attention mechanisms by incorporating Guided Visual Attention (GVA), which sharpens focus on relevant visual features. We have specifically adapted the… More >

  • Open Access

    ARTICLE

    Plant Disease Detection Algorithm Based on Efficient Swin Transformer

    Wei Liu1,*, Ao Zhang

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 3045-3068, 2025, DOI:10.32604/cmc.2024.058640 - 17 February 2025

    Abstract Plant diseases present a significant threat to global agricultural productivity, endangering both crop yields and quality. Traditional detection methods largely rely on manual inspection, a process that is not only labor-intensive and time-consuming but also subject to subjective biases and dependent on operators’ expertise. Recent advancements in Transformer-based architectures have shown substantial progress in image classification tasks, particularly excelling in global feature extraction. However, despite their strong performance, the high computational complexity and large parameter requirements of Transformer models limit their practical application in plant disease detection. To address these constraints, this study proposes an… More >

  • Open Access

    ARTICLE

    Steel Surface Defect Detection Using Learnable Memory Vision Transformer

    Syed Tasnimul Karim Ayon1,#, Farhan Md. Siraj1,#, Jia Uddin2,*

    CMC-Computers, Materials & Continua, Vol.82, No.1, pp. 499-520, 2025, DOI:10.32604/cmc.2025.058361 - 03 January 2025

    Abstract This study investigates the application of Learnable Memory Vision Transformers (LMViT) for detecting metal surface flaws, comparing their performance with traditional CNNs, specifically ResNet18 and ResNet50, as well as other transformer-based models including Token to Token ViT, ViT without memory, and Parallel ViT. Leveraging a widely-used steel surface defect dataset, the research applies data augmentation and t-distributed stochastic neighbor embedding (t-SNE) to enhance feature extraction and understanding. These techniques mitigated overfitting, stabilized training, and improved generalization capabilities. The LMViT model achieved a test accuracy of 97.22%, significantly outperforming ResNet18 (88.89%) and ResNet50 (88.90%), as well… More >

  • Open Access

    ARTICLE

    Congruent Feature Selection Method to Improve the Efficacy of Machine Learning-Based Classification in Medical Image Processing

    Mohd Anjum1, Naoufel Kraiem2, Hong Min3,*, Ashit Kumar Dutta4, Yousef Ibrahim Daradkeh5

    CMES-Computer Modeling in Engineering & Sciences, Vol.142, No.1, pp. 357-384, 2025, DOI:10.32604/cmes.2024.057889 - 17 December 2024

    Abstract Machine learning (ML) is increasingly applied for medical image processing with appropriate learning paradigms. These applications include analyzing images of various organs, such as the brain, lung, eye, etc., to identify specific flaws/diseases for diagnosis. The primary concern of ML applications is the precise selection of flexible image features for pattern detection and region classification. Most of the extracted image features are irrelevant and lead to an increase in computation time. Therefore, this article uses an analytical learning paradigm to design a Congruent Feature Selection Method to select the most relevant image features. This process… More >

  • Open Access

    ARTICLE

    Automated Angle Detection for Industrial Production Lines Using Combined Image Processing Techniques

    Pawat Chunhachatrachai1,*, Chyi-Yeu Lin1,2

    Intelligent Automation & Soft Computing, Vol.39, No.4, pp. 599-618, 2024, DOI:10.32604/iasc.2024.055385 - 06 September 2024

    Abstract Angle detection is a crucial aspect of industrial automation, ensuring precise alignment and orientation of components in manufacturing processes. Despite the widespread application of computer vision in industrial settings, angle detection remains an underexplored domain, with limited integration into production lines. This paper addresses the need for automated angle detection in industrial environments by presenting a methodology that eliminates training time and higher computation cost on Graphics Processing Unit (GPU) from machine learning in computer vision (e.g., Convolutional Neural Networks (CNN)). Our approach leverages advanced image processing techniques and a strategic combination of algorithms, including More >

Displaying 1-10 on page 1 of 115. Per Page