Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (326)
  • Open Access

    ARTICLE

    PIDINet-MC: Real-Time Multi-Class Edge Detection with PiDiNet

    Mingming Huang1, Yunfan Ye1,*, Zhiping Cai2

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-17, 2026, DOI:10.32604/cmc.2025.072399 - 09 December 2025

    Abstract As a fundamental component in computer vision, edges can be categorized into four types based on discontinuities in reflectance, illumination, surface normal, or depth. While deep CNNs have significantly advanced generic edge detection, real-time multi-class semantic edge detection under resource constraints remains challenging. To address this, we propose a lightweight framework based on PiDiNet that enables fine-grained semantic edge detection. Our model simultaneously predicts background and four edge categories from full-resolution inputs, balancing accuracy and efficiency. Key contributions include: a multi-channel output structure expanding binary edge prediction to five classes, supported by a deep supervision More >

  • Open Access

    ARTICLE

    Zero-Shot Vision-Based Robust 3D Map Reconstruction and Obstacle Detection in Geometry-Deficient Room-Scale Environments

    Taehoon Kim, Sehun Lee, Junho Ahn*

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-30, 2026, DOI:10.32604/cmc.2025.071597 - 09 December 2025

    Abstract As large, room-scale environments become increasingly common, their spatial complexity increases due to variable, unstructured elements. Consequently, demand for room-scale service robots is surging, yet most technologies remain corridor-centric, and autonomous navigation in expansive rooms becomes unstable even around static obstacles. Existing approaches face several structural limitations. These include the labor-intensive requirement for large-scale object annotation and continual retraining, as well as the vulnerability of vanishing point or line-based methods when geometric cues are insufficient. In addition, the high cost of LiDAR and 3D perception errors caused by limited wall cues and dense interior clutter… More >

  • Open Access

    ARTICLE

    Research on Automated Game QA Reporting Based on Natural Language Captions

    Jun Myeong Kim, Jang Young Jeong, Shin Jin Kang, Beomjoo Seo*

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-16, 2026, DOI:10.32604/cmc.2025.071084 - 09 December 2025

    Abstract Game Quality Assurance (QA) currently relies heavily on manual testing, a process that is both costly and time-consuming. Traditional script- and log-based automation tools are limited in their ability to detect unpredictable visual bugs, especially those that are context-dependent or graphical in nature. As a result, many issues go unnoticed during manual QA, which reduces overall game quality, degrades the user experience, and creates inefficiencies throughout the development cycle. This study proposes two approaches to address these challenges. The first leverages a Large Language Model (LLM) to directly analyze gameplay videos, detect visual bugs, and… More >

  • Open Access

    ARTICLE

    Lightweight Airborne Vision Abnormal Behavior Detection Algorithm Based on Dual-Path Feature Optimization

    Baixuan Han1, Yueping Peng1,*, Zecong Ye2, Hexiang Hao1, Xuekai Zhang1, Wei Tang1, Wenchao Kang1, Qilong Li1

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-31, 2026, DOI:10.32604/cmc.2025.071071 - 09 December 2025

    Abstract Aiming at the problem of imbalance between detection accuracy and algorithm model lightweight in UAV aerial image target detection algorithm, a lightweight multi-category abnormal behavior detection algorithm based on improved YOLOv11n is designed. By integrating multi-head grouped self-attention mechanism and Partial-Conv, a two-way feature grouping fusion module (DFPF) was designed, which carried out effective channel segmentation and fusion strategies to reduce redundant calculations and memory access. C3K2 module was improved, and then unstructured pruning and feature distillation technology were used. The algorithm model is lightweight, and the feature extraction ability for airborne visual abnormal behavior… More >

  • Open Access

    ARTICLE

    Improving Person Recognition for Single-Person-in-Photos: Intimacy in Photo Collections

    Xiaoyi Duan, Tianqi Zou, Chenyang Wang, Yu Gu, Xiuying Li*

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-24, 2026, DOI:10.32604/cmc.2025.070683 - 09 December 2025

    Abstract Person recognition in photo collections is a critical yet challenging task in computer vision. Previous studies have used social relationships within photo collections to address this issue. However, these methods often fail when performing single-person-in-photos recognition in photo collections, as they cannot rely on social connections for recognition. In this work, we discard social relationships and instead measure the relationships between photos to solve this problem. We designed a new model that includes a multi-parameter attention network for adaptively fusing visual features and a unified formula for measuring photo intimacy. This model effectively recognizes individuals More >

  • Open Access

    ARTICLE

    A Hybrid Deep Learning Approach Using Vision Transformer and U-Net for Flood Segmentation

    Cyreneo Dofitas1, Yong-Woon Kim2, Yung-Cheol Byun3,*

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-19, 2026, DOI:10.32604/cmc.2025.069374 - 09 December 2025

    Abstract Recent advances in deep learning have significantly improved flood detection and segmentation from aerial and satellite imagery. However, conventional convolutional neural networks (CNNs) often struggle in complex flood scenarios involving reflections, occlusions, or indistinct boundaries due to limited contextual modeling. To address these challenges, we propose a hybrid flood segmentation framework that integrates a Vision Transformer (ViT) encoder with a U-Net decoder, enhanced by a novel Flood-Aware Refinement Block (FARB). The FARB module improves boundary delineation and suppresses noise by combining residual smoothing with spatial-channel attention mechanisms. We evaluate our model on a UAV-acquired flood More >

  • Open Access

    ARTICLE

    RetinexWT: Retinex-Based Low-Light Enhancement Method Combining Wavelet Transform

    Hongji Chen, Jianxun Zhang*, Tianze Yu, Yingzhu Zeng, Huan Zeng

    CMC-Computers, Materials & Continua, Vol.86, No.2, pp. 1-20, 2026, DOI:10.32604/cmc.2025.067041 - 09 December 2025

    Abstract Low-light image enhancement aims to improve the visibility of severely degraded images captured under insufficient illumination, alleviating the adverse effects of illumination degradation on image quality. Traditional Retinex-based approaches, inspired by human visual perception of brightness and color, decompose an image into illumination and reflectance components to restore fine details. However, their limited capacity for handling noise and complex lighting conditions often leads to distortions and artifacts in the enhanced results, particularly under extreme low-light scenarios. Although deep learning methods built upon Retinex theory have recently advanced the field, most still suffer from insufficient interpretability… More >

  • Open Access

    REVIEW

    Deep Learning for Brain Tumor Segmentation and Classification: A Systematic Review of Methods and Trends

    Ameer Hamza, Robertas Damaševičius*

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-41, 2026, DOI:10.32604/cmc.2025.069721 - 10 November 2025

    Abstract This systematic review aims to comprehensively examine and compare deep learning methods for brain tumor segmentation and classification using MRI and other imaging modalities, focusing on recent trends from 2022 to 2025. The primary objective is to evaluate methodological advancements, model performance, dataset usage, and existing challenges in developing clinically robust AI systems. We included peer-reviewed journal articles and high-impact conference papers published between 2022 and 2025, written in English, that proposed or evaluated deep learning methods for brain tumor segmentation and/or classification. Excluded were non-open-access publications, books, and non-English articles. A structured search was… More >

  • Open Access

    ARTICLE

    CAFE-GAN: CLIP-Projected GAN with Attention-Aware Generation and Multi-Scale Discrimination

    Xuanhong Wang1, Hongyu Guo1, Jiazhen Li1, Mingchen Wang1, Xian Wang1, Yijun Zhang2,*

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-19, 2026, DOI:10.32604/cmc.2025.069482 - 10 November 2025

    Abstract Over the past decade, large-scale pre-trained autoregressive and diffusion models rejuvenated the field of text-guided image generation. However, these models require enormous datasets and parameters, and their multi-step generation processes are often inefficient and difficult to control. To address these challenges, we propose CAFE-GAN, a CLIP-Projected GAN with Attention-Aware Generation and Multi-Scale Discrimination, which incorporates a pre-trained CLIP model along with several key architectural innovations. First, we embed a coordinate attention mechanism into the generator to capture long-range dependencies and enhance feature representation. Second, we introduce a trainable linear projection layer after the CLIP text… More >

  • Open Access

    ARTICLE

    Intelligent Semantic Segmentation with Vision Transformers for Aerial Vehicle Monitoring

    Moneerah Alotaibi*

    CMC-Computers, Materials & Continua, Vol.86, No.1, pp. 1-20, 2026, DOI:10.32604/cmc.2025.069195 - 10 November 2025

    Abstract Advanced traffic monitoring systems encounter substantial challenges in vehicle detection and classification due to the limitations of conventional methods, which often demand extensive computational resources and struggle with diverse data acquisition techniques. This research presents a novel approach for vehicle classification and recognition in aerial image sequences, integrating multiple advanced techniques to enhance detection accuracy. The proposed model begins with preprocessing using Multiscale Retinex (MSR) to enhance image quality, followed by Expectation-Maximization (EM) Segmentation for precise foreground object identification. Vehicle detection is performed using the state-of-the-art YOLOv10 framework, while feature extraction incorporates Maximally Stable Extremal… More >

Displaying 1-10 on page 1 of 326. Per Page