Search Results (235)
  • Open Access

    ARTICLE

    Defect Detection of Wind Turbine Blades Using Multiscale Feature Extraction and Attention Mechanism

    Yajuan Lu*, Yongtao Hu, Jie Li, Jinping Zhang, Jingjing Si

    Structural Durability & Health Monitoring, Vol.20, No.2, 2026, DOI:10.32604/sdhm.2025.071110 - 31 March 2026

    Abstract To address challenges in wind turbine blade defect detection models, primarily due to insufficient feature extraction capabilities and the difficulty of deploying models on drone-type edge devices, this study proposes a wind turbine blade defect detection model, WtCS-YOLO11, that incorporates multiscale feature extraction and an attention mechanism. Firstly, the cross-stage partial with two kernels and a wavelet convolution module (C3k2_WTConv) is proposed by introducing wavelet convolution into the module. The cross-stage partial with two kernels (C3k2) module in the necking network is replaced with the C3k2_WTConv module to increase the model’s receptive field, enable multiscale…

  • Open Access

    ARTICLE

    Robust Human Pose Estimation and Action Recognition Utilizing Feature Extraction

    Sheng Luo1, Rashid Abbasi1,*, Hao Wang2, Jinghua Xu3, Dongyang Lyu4, Aaron Zhang1, Farhan Amin5,*, Isabel de la Torre6, Gerardo Mendez Mezquita7, Henry Fabian Gongora7

    CMES-Computer Modeling in Engineering & Sciences, Vol.146, No.3, 2026, DOI:10.32604/cmes.2026.075080 - 30 March 2026

    Abstract Human pose estimation is crucial across diverse applications, from healthcare to human–computer interaction. Integrating inertial measurement units (IMUs) with monocular vision methods holds great potential for leveraging complementary modalities; however, existing approaches are often limited by IMU drift, noise, and underutilization of visual information. To address these limitations, we propose a novel dual-stream feature extraction framework that effectively combines temporal IMU data and single-view image features for improved pose estimation. Short-term dependencies in IMU sequences are captured with convolutional layers, while a Transformer-based architecture models long-range temporal dynamics. To mitigate IMU drift and inter-sensor inconsistencies…

  • Open Access

    ARTICLE

    A Deep Learning Approach for Three-Dimensional Thyroid Nodule Detection from Ultrasound Images

    Huda F. Al-Shahad1,2, Razali Yaakob1,*, Nurfadhlina Mohd Sharef1, Hazlina Hamdan1, Hasyma Abu Hassan3, Xiaoyi Jiang4

    CMES-Computer Modeling in Engineering & Sciences, Vol.146, No.3, 2026, DOI:10.32604/cmes.2025.074109 - 30 March 2026

    Abstract Currently, thyroid diseases are prevalent worldwide; therefore, it is necessary to develop techniques that help doctors improve their diagnostic skills for such diseases. In previous studies, 2-dimensional convolutional neural network (2D CNN) techniques were employed to classify thyroid nodules as benign and malignant without detecting the presence of thyroid nodules in the obtained ultrasound images. To address this issue, we propose a 3-dimensional convolutional neural network (3D CNN) for thyroid nodule detection. The proposed CNN exploits the 3D information and spatial features contained in ultrasound images and generates distinctive features during its training using multiple…

  • Open Access

    ARTICLE

    MSC-DeepLabV3+: A Segmentation Model for Slender Fabric Roll Seam Detection

    Weimin Shi1,*, Kuntao Lv1, Chang Xuan1, Ji Wu2

    CMC-Computers, Materials & Continua, Vol.87, No.2, 2026, DOI:10.32604/cmc.2025.075203 - 12 March 2026

    Abstract The application of deep learning in fabric defect detection has become increasingly widespread. To address false positives and false negatives in fabric roll seam detection, and to improve automation efficiency and product quality, we propose the Multi-scale Context DeepLabV3+ (MSC-DeepLabV3+), a semantic segmentation network designed for fabric roll seam detection, based on DeepLabV3+. The model improvements include enhancing the backbone performance through optimization of the UIB-MobileNetV2 network; designing the Dynamic Atrous and Sliding-window Fusion (DASF) module to improve adaptability to multi-scale seam structures with dynamic dilation rates and a sliding-window mechanism; and utilizing the Progressive…

  • Open Access

    ARTICLE

    DL-YOLO: A Multi-Scale Feature Fusion Detection Algorithm for Low-Light Environments

    Yuanmeng Chang, Hongmei Liu*

    CMC-Computers, Materials & Continua, Vol.87, No.2, 2026, DOI:10.32604/cmc.2026.074204 - 12 March 2026

    Abstract Driven by rapid advances in deep learning, object detection has been widely adopted across diverse application scenarios. However, in low-light conditions, critical visual cues of target objects are severely degraded, posing a significant challenge for accurate low-light object detection. Existing methods struggle to preserve discriminative features while maintaining semantic consistency between low-light and normal-light images. For this purpose, this study proposes a DL-YOLO model specially tailored for low-light detection. To mitigate target feature attenuation introduced by repeated downsampling, we design a Multi-Scale Feature Convolution (MSF-Conv) module that captures rich, multi-level details via multi-scale feature learning…

  • Open Access

    ARTICLE

    Fuzzy C-Means Clustering-Driven Pooling for Robust and Generalizable Convolutional Neural Networks

    Seunggyu Byeon1, Jung-hun Lee2, Jong-Deok Kim3,*

    CMC-Computers, Materials & Continua, Vol.87, No.2, 2026, DOI:10.32604/cmc.2025.074033 - 12 March 2026

    Abstract This paper introduces a fuzzy C-means-based pooling layer for convolutional neural networks that explicitly models local uncertainty and ambiguity. Conventional pooling operations, such as max and average, apply rigid aggregation and often discard fine-grained boundary information. In contrast, our method computes soft memberships within each receptive field and aggregates cluster-wise responses through membership-weighted pooling, thereby preserving informative structure while reducing dimensionality. Being differentiable, the proposed layer operates as standard two-dimensional pooling. We evaluate our approach across various CNN backbones and open datasets, including CIFAR-10/100, STL-10, LFW, and ImageNette, and further probe small training set restrictions…

  • Open Access

    ARTICLE

    Securing Restricted Zones with a Novel Face Recognition Approach Using Face Feature Descriptors and Evidence Theory

    Rafika Harrabi1,2,*, Slim Ben Chaabane1,2, Hassene Seddik2

    CMC-Computers, Materials & Continua, Vol.87, No.2, 2026, DOI:10.32604/cmc.2026.072054 - 12 March 2026

    Abstract Securing restricted zones such as airports, research facilities, and military bases requires robust and reliable access control mechanisms to prevent unauthorized entry and safeguard critical assets. Face recognition has emerged as a key biometric approach for this purpose; however, existing systems are often sensitive to variations in illumination, occlusion, and pose, which degrade their performance in real-world conditions. To address these challenges, this paper proposes a novel hybrid face recognition method that integrates complementary feature descriptors such as Fuzzy-Gabor 2D Fisher Linear Discriminant (FG-2DFLD), Generalized 2D Linear Discriminant Analysis (G2DLDA), and Modular-Local Binary Patterns (Modular-LBP)…

  • Open Access

    ARTICLE

    Intelligent Human Interaction Recognition with Multi-Modal Feature Extraction and Bidirectional LSTM

    Muhammad Hamdan Azhar1,2,#, Yanfeng Wu1,#, Nouf Abdullah Almujally3, Shuaa S. Alharbi4, Asaad Algarni5, Ahmad Jalal2,6, Hui Liu1,7,8,*

    CMC-Computers, Materials & Continua, Vol.87, No.1, 2026, DOI:10.32604/cmc.2025.071988 - 10 February 2026

    Abstract Recognizing human interactions in RGB videos is a critical task in computer vision, with applications in video surveillance. Existing deep learning-based architectures have achieved strong results, but are computationally intensive, sensitive to video resolution changes and often fail in crowded scenes. We propose a novel hybrid system that is computationally efficient, robust to degraded video quality and able to filter out irrelevant individuals, making it suitable for real-life use. The system leverages multi-modal handcrafted features for interaction representation and a deep learning classifier for capturing complex dependencies. Using Mask R-CNN and YOLO11-Pose, we extract grayscale…

  • Open Access

    ARTICLE

    Enhanced Multi-Scale Feature Extraction Lightweight Network for Remote Sensing Object Detection

    Xiang Luo1, Yuxuan Peng2, Renghong Xie1, Peng Li3, Yuwen Qian3,*

    CMC-Computers, Materials & Continua, Vol.86, No.3, 2026, DOI:10.32604/cmc.2025.073700 - 12 January 2026

    Abstract Deep learning has made significant progress in the field of oriented object detection for remote sensing images. However, existing methods still face challenges when dealing with difficult tasks such as multi-scale targets, complex backgrounds, and small objects in remote sensing. Maintaining model lightweight to address resource constraints in remote sensing scenarios while improving task completion for remote sensing tasks remains a research hotspot. Therefore, we propose an enhanced multi-scale feature extraction lightweight network EM-YOLO based on the YOLOv8s architecture, specifically optimized for the characteristics of large target scale variations, diverse orientations, and numerous small objects…

  • Open Access

    ARTICLE

    Industrial EdgeSign: NAS-Optimized Real-Time Hand Gesture Recognition for Operator Communication in Smart Factories

    Meixi Chu1, Xinyu Jiang1,*, Yushu Tao2

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-23, 2026, DOI:10.32604/cmc.2025.071533 - 09 December 2025

    Abstract Industrial operators need reliable communication in high-noise, safety-critical environments where speech or touch input is often impractical. Existing gesture systems either miss real-time deadlines on resource-constrained hardware or lose accuracy under occlusion, vibration, and lighting changes. We introduce Industrial EdgeSign, a dual-path framework that combines hardware-aware neural architecture search (NAS) with large multimodal model (LMM) guided semantics to deliver robust, low-latency gesture recognition on edge devices. The searched model uses a truncated ResNet50 front end, a dimensional-reduction network that preserves spatiotemporal structure for tubelet-based attention, and localized Transformer layers tuned for on-device inference. To reduce…

Displaying results 1-10 of 235 (page 1).