Special Issues
Table of Content

Recent Advances in Signal Processing and Computer Vision

Submission Deadline: 31 January 2026 View: 3161 Submit to Special Issue

Guest Editors

Dr. Bo Yang

Email: boyang@uestc.edu.cn

Affiliation: School of Automation Engineering, University of Electronic Science and Technology of China, China

Homepage:

Research Interests: Computer Vision, Surgical Robotics, Surgical (endoscopic) Vison, and Medical Image Processing

图片6.png


Dr. Chao Liu

Email: liu@lirmm.fr

Affiliation: CNRS (French National Center for Scientific Research), France

Homepage:

Research Interests: Visual Augmentation and Reconstruction, 3D Reconstruction of Deformable Surface, Haptics in Human-Machine Interaction, Multimodal Sensor-Based Analysis of Manipulation Skills, Surgical Robot, Medical Image Processing

图片7.png


Summary

Over the past decade or so, artificial intelligence technologies represented by deep learning have made remarkable progress, especially in the domains of signal processing and computer vision. In these domains, deep learning-based methods are being iterated and commercialized at an unprecedented rate, dramatically changing the way humans live, learn, and work.

 

Since the resurgence of convolutional neural networks in 2010, computer vision has been one of the most dynamic areas for deep learning technology. Recently, image and video synthesis and generation, 3D vision, visual language model, and multimodal learning have gradually been the research hotspots in the field. The transformative wave of Large Language Models (LLMs) in the field of Natural Language Processing (NLP) has inspired further exploration of their potential in computer vision. On the other hand, AI technologies are increasingly transitioning from the virtual realm to the physical, integrating with automated devices or machinery to create embodied, intelligent entities capable of physical interaction, known as Embodied Artificial Intelligence (EAI). In this context, AI needs to sink down to deal with more underlying signal or hardware information. In the field of signal processing, AI intersects with traditional control, automation, and robotics technologies, leading to a fusion of these disciplines.

 

In short, AI has made great strides from initially recognizing the world (traditional vision tasks such as classification and recognition) to simulating the world (generative models) and changing the world (embodied intelligence). This special issue hopes to document and advance this trend by focusing on the latest advances in AI technology in the areas of signal processing and computer vision. We seek original research articles, reviews, and survey papers that explore the latest developments, challenges, and solutions in these rapidly evolving areas. The potential topics encompassed may include, but are not limited to, the following topics:

 

· Multimodal artificial intelligence

· 2D&3D generative modes

· Image & video segmentation

· 3D reconstruction

· Large models and their applications in signal processing and computer vision

· Visual question and answer (VQA), visual reasoning

· Meta-learning, transfer learning, few-shot learning.

· Embodied Artificial Intelligence

· Reinforcement Learning

· Medical image processing

· Medical robot

· Vision Foundation Models

· Efficient and robust AI



Published Papers


  • Open Access

    ARTICLE

    Novel Quantum-Integrated CNN Model for Improved Human Activity Recognition in Smart Surveillance

    Tanvir Fatima Naik Bukht, Yanfeng Wu, Nouf Abdullah Almujally, Shuoa S. AItarbi, Hameedur Rahman, Ahmad Jalal, Hui Liu
    CMES-Computer Modeling in Engineering & Sciences, DOI:10.32604/cmes.2025.071850
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract Human activity recognition (HAR) is crucial in fields like robotics, surveillance, and healthcare, enabling systems to understand and respond to human actions. Current models often struggle with complex datasets, making accurate recognition challenging. This study proposes a quantum-integrated Convolutional Neural Network (QI-CNN) to enhance HAR performance. The traditional models demonstrate weak performance in transferring learned knowledge between diverse complex data collections, including D3D-HOI and Sysu 3D HOI. HAR requires better extraction models and techniques that must address current challenges to achieve improved accuracy and scalability. The model aims to enhance HAR task performance by combining… More >

  • Open Access

    ARTICLE

    A Computational Modeling Approach for Joint Calibration of Low-Deviation Surgical Instruments

    Bo Yang, Yu Zhou, Jiawei Tian, Xiang Zhang, Fupei Guo, Shan Liu
    CMES-Computer Modeling in Engineering & Sciences, Vol.145, No.2, pp. 2253-2276, 2025, DOI:10.32604/cmes.2025.072031
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract Accurate calibration of surgical instruments and ultrasound probes is essential for achieving high precision in image guided minimally invasive procedures. However, existing methods typically treat the calibration of the needle tip and the ultrasound probe as two independent processes, lacking an integrated calibration mechanism, which often leads to cumulative errors and reduced spatial consistency. To address this challenge, we propose a joint calibration model that unifies the calibration of the surgical needle tip and the ultrasound probe within a single coordinate system. The method formulates the calibration process through a series of mathematical models and… More >

  • Open Access

    ARTICLE

    Optimizing Haze Removal: A Variable Scattering Approach to Transmission Mapping

    Gaurav Saxena, Kiran Napte, Neeraj Kumar Shukla, Sushma Parihar
    CMES-Computer Modeling in Engineering & Sciences, Vol.144, No.2, pp. 2307-2323, 2025, DOI:10.32604/cmes.2025.067530
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract The ill-posed character of haze or fog makes it difficult to remove from a single image. While most existing methods rely on a transmission map refined through depth estimation and assume a constant scattering coefficient, this assumption limits their effectiveness. In this paper, we propose an enhanced transmission map that incorporates spatially varying scattering information inherent in hazy images. To improve linearity, the model utilizes the ratio of the difference between intensity and saturation to their sum. Our approach also addresses critical issues such as edge preservation and color fidelity. In terms of qualitative as More >

  • Open Access

    ARTICLE

    An Efficient Explainable AI Model for Accurate Brain Tumor Detection Using MRI Images

    Fatma M. Talaat, Mohamed Salem, Mohamed Shehata, Warda M. Shaban
    CMES-Computer Modeling in Engineering & Sciences, Vol.144, No.2, pp. 2325-2358, 2025, DOI:10.32604/cmes.2025.067195
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract The diagnosis of brain tumors is an extended process that significantly depends on the expertise and skills of radiologists. The rise in patient numbers has substantially elevated the data processing volume, making conventional methods both costly and inefficient. Recently, Artificial Intelligence (AI) has gained prominence for developing automated systems that can accurately diagnose or segment brain tumors in a shorter time frame. Many researchers have examined various algorithms that provide both speed and accuracy in detecting and classifying brain tumors. This paper proposes a new model based on AI, called the Brain Tumor Detection (BTD)… More >

  • Open Access

    REVIEW

    Bridging 2D and 3D Object Detection: Advances in Occlusion Handling through Depth Estimation

    Zainab Ouardirhi, Mostapha Zbakh, Sidi Ahmed Mahmoudi
    CMES-Computer Modeling in Engineering & Sciences, Vol.143, No.3, pp. 2509-2571, 2025, DOI:10.32604/cmes.2025.064283
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract Object detection in occluded environments remains a core challenge in computer vision (CV), especially in domains such as autonomous driving and robotics. While Convolutional Neural Network (CNN)-based two-dimensional (2D) and three-dimensional (3D) object detection methods have made significant progress, they often fall short under severe occlusion due to depth ambiguities in 2D imagery and the high cost and deployment limitations of 3D sensors such as Light Detection and Ranging (LiDAR). This paper presents a comparative review of recent 2D and 3D detection models, focusing on their occlusion-handling capabilities and the impact of sensor modalities such More >

  • Open Access

    ARTICLE

    Non-Neural 3D Nasal Reconstruction: A Sparse Landmark Algorithmic Approach for Medical Applications

    Nguyen Khac Toan, Ho Nguyen Anh Tuan, Nguyen Truong Thinh
    CMES-Computer Modeling in Engineering & Sciences, Vol.143, No.2, pp. 1273-1295, 2025, DOI:10.32604/cmes.2025.064218
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract This paper presents a novel method for reconstructing a highly accurate 3D nose model of the human from 2D images and pre-marked landmarks based on algorithmic methods. The study focuses on the reconstruction of a 3D nose model tailored for applications in healthcare and cosmetic surgery. The approach leverages advanced image processing techniques, 3D Morphable Models (3DMM), and deformation techniques to overcome the limitations of deep learning models, particularly addressing the interpretability issues commonly encountered in medical applications. The proposed method estimates the 3D coordinates of landmark points using a 3D structure estimation algorithm. Sub-landmarks… More >

    Graphic Abstract

    Non-Neural 3D Nasal Reconstruction: A Sparse Landmark Algorithmic Approach for Medical Applications

  • Open Access

    ARTICLE

    A Dual-Layer Attention Based CAPTCHA Recognition Approach with Guided Visual Attention

    Zaid Derea, Beiji Zou, Xiaoyan Kui, Alaa Thobhani, Amr Abdussalam
    CMES-Computer Modeling in Engineering & Sciences, Vol.142, No.3, pp. 2841-2867, 2025, DOI:10.32604/cmes.2025.059586
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract Enhancing website security is crucial to combat malicious activities, and CAPTCHA (Completely Automated Public Turing tests to tell Computers and Humans Apart) has become a key method to distinguish humans from bots. While text-based CAPTCHAs are designed to challenge machines while remaining human-readable, recent advances in deep learning have enabled models to recognize them with remarkable efficiency. In this regard, we propose a novel two-layer visual attention framework for CAPTCHA recognition that builds on traditional attention mechanisms by incorporating Guided Visual Attention (GVA), which sharpens focus on relevant visual features. We have specifically adapted the… More >

  • Open Access

    REVIEW

    A Survey on Enhancing Image Captioning with Advanced Strategies and Techniques

    Alaa Thobhani, Beiji Zou, Xiaoyan Kui, Amr Abdussalam, Muhammad Asim, Sajid Shah, Mohammed ELAffendi
    CMES-Computer Modeling in Engineering & Sciences, Vol.142, No.3, pp. 2247-2280, 2025, DOI:10.32604/cmes.2025.059192
    (This article belongs to the Special Issue: Recent Advances in Signal Processing and Computer Vision)
    Abstract Image captioning has seen significant research efforts over the last decade. The goal is to generate meaningful semantic sentences that describe visual content depicted in photographs and are syntactically accurate. Many real-world applications rely on image captioning, such as helping people with visual impairments to see their surroundings. To formulate a coherent and relevant textual description, computer vision techniques are utilized to comprehend the visual content within an image, followed by natural language processing methods. Numerous approaches and models have been developed to deal with this multifaceted problem. Several models prove to be state-of-the-art solutions… More >

Share Link