Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (232)
  • Open Access

    ARTICLE

    A UAV Image Object Detection Algorithm Based on Deep Diverse Branch Block and Multi-Scale Auxiliary Feature

    Wenfeng Wang1,*, Wenjie Fan1, Fang Dong1, Bin Zeng1, Wenxin Yu1, Xiangping Deng2

    CMC-Computers, Materials & Continua, Vol.88, No.1, 2026, DOI:10.32604/cmc.2026.078416 - 08 May 2026

    Abstract Unmanned Aerial Vehicle (UAV) image object detection has been widely applied in many fields. However, compared with ordinary natural images, UAV images often exhibit complex backgrounds, a predominance of small objects, and significant variations in target scales, which cause traditional detection algorithms to easily suffer from missed or false detections with insufficient accuracy. To address these issues, this paper proposes a novel UAV image object detection algorithm named DMA-YOLO based on the YOLOv8s model, incorporating a deep diverse branch block and multi-scale auxiliary feature. First, a DF-C2f module integrating a deep diverse branch block and… More >

  • Open Access

    REVIEW

    Applying Deep Learning to Defect Detection in Steel Manufacturing

    Duane G. Noé1, Ku-Chin Lin2, Chang-Lin Chuang3, Yung-Tsung Cheng3,*

    CMC-Computers, Materials & Continua, Vol.88, No.1, 2026, DOI:10.32604/cmc.2026.077838 - 08 May 2026

    Abstract Steel manufacturing requires high-throughput and high-reliability surface inspection to minimize safety risks, scrap rates, and downstream quality reductions. Conventional rule-based vision and manual inspection are often impeded in real production environments by variable illumination, complex textures, subtle defect morphology, and stringent latency constraints imposed by production-line operation. Deep learning (DL) has become a dominant paradigm for the detection and classification of defects when inspecting steel, but many previous studies have performed broad architectural overviews without explicitly connecting model and pipeline choices to deployment-critical factors such as processing speed, hardware availability, annotation cost, and robustness during… More >

  • Open Access

    ARTICLE

    FSS: Focusing on Suboptimal Samples for Detector-Agnostic Label Assignment in Object Detection

    Lijuan Huang1,2, Zhixian Liu3, Xinyu Zhou4, Jinping Liu4,*, Kunyi Zheng4, Yimei Yang2,4,*

    CMC-Computers, Materials & Continua, Vol.88, No.1, 2026, DOI:10.32604/cmc.2026.077655 - 08 May 2026

    Abstract Many occluded and ambiguous ground truths exist in object detection, making detectors unable to obtain optimal training samples. In this article, we revisit the suboptimal sample issue in label assignment for object detection and propose a novel detector-agnostic strategy, termed FSS, to address it. FSS reformulates label assignment as the process of selecting high-quality sub-optimal samples and progressively transforming them into optimal ones. Specifically, for each candidate, we estimate the probability of being an optimal sample by jointly considering localization quality and classification confidence, thereby constructing an instance-wise probability matrix. Based on the spatial distribution More >

  • Open Access

    ARTICLE

    DA-T3D: Distribution-Aware Cross-Modal Distillation Framework for Temporal 3D Object Detection

    Tianzhe Jiao, Yuming Chen, Xiaoyue Feng, Chaopeng Guo, Jie Song*

    CMES-Computer Modeling in Engineering & Sciences, Vol.147, No.1, 2026, DOI:10.32604/cmes.2026.080595 - 27 April 2026

    Abstract Knowledge distillation bridges the performance gap between camera-based and LiDAR-based 3D detectors by leveraging the precise geometric information from LiDAR. However, cross-modal knowledge transfer remains challenging due to the inherent modality heterogeneity between LiDAR and camera data, which often leads to instability during training. In this work, we find that these instabilities are closely related to distribution mismatch in the cross-modal feature space and noisy teacher signals. To address this issue, we propose a novel distribution-aware cross-modal distillation framework, named DA-T3D. Specifically, we first explicitly model the LiDAR teacher’s Bird’s-Eye-View (BEV) feature distribution and use… More >

  • Open Access

    ARTICLE

    Multi-Scene Traffic Light Detection and Fault Identification via Dual-Attention Image Fusion

    Yuxiao Shi1, Jinglin Zhang2, Yuxia Li2,*

    CMES-Computer Modeling in Engineering & Sciences, Vol.147, No.1, 2026, DOI:10.32604/cmes.2026.078601 - 27 April 2026

    Abstract Traffic light detection and fault identification using images from road traffic cameras are important for intelligent traffic management and urban safety monitoring. However, images collected in real traffic environments show clear differences in camera view, lighting conditions, weather, and background complexity. As a result, traffic lights vary greatly in scale, spatial location, and appearance, which reduces detection accuracy in complex scenes. To deal with this problem, this paper presents a multi-scene traffic light detection and fault identification framework based on dual-attention image fusion. Large-scale road camera data from the Chengdu Traffic Management Bureau are used,… More >

  • Open Access

    ARTICLE

    Optimizing YOLOv11 for Rice Disease Detection: Integrating RepViT Backbone, BiFPN, and CBAM Attention

    Sang-Hyun Lee*, Qingtao Meng

    CMC-Computers, Materials & Continua, Vol.87, No.3, 2026, DOI:10.32604/cmc.2026.077207 - 09 April 2026

    Abstract Accurate and timely detection of rice leaf diseases is critical for ensuring global food security and maximizing agricultural yields. However, existing deep learning methods often struggle to balance the high accuracy required for detecting multi-scale lesions in complex field environments with the computational efficiency necessary for edge device deployment. This paper proposes You Only Look Once for Lightweight Detection (YOLOv11-LD), a lightweight object detection model for multi-scale rice leaf disease detection in real paddy field environments. The model is built on YOLOv11n and integrates a Re-parameterized Vision Transformer (RepViT) backbone, a Bidirectional Feature Pyramid Network… More >

  • Open Access

    ARTICLE

    AugTrans: Boosting Adversarial Transferability in Object Detection with a Dynamic, Object-Aware Augmentation Pipeline

    Sudhir Kumar Pandey1, Jian-Xun Mi1,*, Zahid Ullah2, Mona Jamjoom3

    CMC-Computers, Materials & Continua, Vol.87, No.3, 2026, DOI:10.32604/cmc.2026.074811 - 09 April 2026

    Abstract Adversarial examples in object detection frequently fail to transfer between different models because attacks overfit to the source model’s architecture and feature space. We propose AugTrans, a framework that addresses this limitation through input-space regularization. Our key innovation is a multi-stage augmentation pipeline that incorporates object-level semantic awareness into transformation design. The pipeline comprises three novel components: dynamic object-centric rotation with adaptive scheduling, multi-box aware resizing based on ground-truth annotations, and composite noise injection. These transformations are integrated within the Expectation over Transformation (EOT) framework. By optimizing perturbations to remain effective across semantically meaningful transformations, our… More >

  • Open Access

    ARTICLE

    Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception

    Yaguan Wang1, Linlin Kou2, Yang Gao3,*, Qiang Sun1, Yong Qin3, Genwang Peng3

    Structural Durability & Health Monitoring, Vol.20, No.2, 2026, DOI:10.32604/sdhm.2025.072538 - 31 March 2026

    Abstract The fasteners employed in the railway tracks are susceptible to defects arising from their intricate composition. Foreign objects are frequently observed on the track bed in an open environment. These two types of defects pose potential threats to high-speed trains, thus necessitating timely and accurate track inspection. The majority of extant automatic inspection methods are predicated on the utilization of single visible light data, and the efficacy of the algorithmic processes is influenced by complex environments. Furthermore, due to the single information dimension, the detection accuracy of defects in similar, occluded, and small object categories… More >

  • Open Access

    ARTICLE

    Defect Detection of Wind Turbine Blades Using Multiscale Feature Extraction and Attention Mechanism

    Yajuan Lu*, Yongtao Hu, Jie Li, Jinping Zhang, Jingjing Si

    Structural Durability & Health Monitoring, Vol.20, No.2, 2026, DOI:10.32604/sdhm.2025.071110 - 31 March 2026

    Abstract To address challenges in wind turbine blade defect detection models, primarily due to insufficient feature extraction capabilities and the difficulty of deploying models on drone-type edge devices, this study proposes a wind turbine blade defect detection model, WtCS-YOLO11, that incorporates multiscale feature extraction and an attention mechanism. Firstly, the cross-stage partial with two kernels and a wavelet convolution module (C3k2_WTConv) is proposed by introducing wavelet convolution into the module. The cross-stage partial with two kernels (C3k2) module in the necking network is replaced with the C3k2_WTConv module to increase the model’s receptive field, enable multiscale… More >

  • Open Access

    ARTICLE

    TQU-GraspingObject: 3D Common Objects Detection, Recognition, and Localization on Point Cloud for Hand Grasping in Sharing Environments

    Thi-Loan Nguyen1,2,*, Huy-Nam Chu3, The-Thanh Hua3, Trung-Nghia Phung2, Van-Hung Le3,*

    CMC-Computers, Materials & Continua, Vol.87, No.2, 2026, DOI:10.32604/cmc.2026.076732 - 12 March 2026

    Abstract To support the process of grasping objects on a tabletop for the blind or robotic arm, it is necessary to address fundamental computer vision tasks, such as detecting, recognizing, and locating objects in space, and determining the position of the grasping information. These results can then be used to guide the visually impaired or to execute grasping tasks with a robotic arm. In this paper, we collected, annotated, and published the benchmark TQU-GraspingObject dataset for testing, validation, and evaluation of deep learning (DL) models for detecting, recognizing, and localizing grasping objects in 2D and 3D… More >

Displaying 1-10 on page 1 of 232. Per Page