Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (12)
  • Open Access

    ARTICLE

    MSA-ViT: A Multi-Scale Vision Transformer for Robust Malware Image Classification

    Bofan Yang, Bingbing Li, Chuanping Hu*

    CMC-Computers, Materials & Continua, Vol.87, No.3, 2026, DOI:10.32604/cmc.2026.077697 - 09 April 2026

    Abstract The rapid evolution of malware obfuscation and packing techniques significantly undermines the effectiveness of traditional static detection approaches. Transforming malware binaries into grayscale or RGB images enables learning-based classification, yet existing CNN- and ViT-based models depend heavily on fixed-resolution inputs and exhibit poor robustness under cross-resolution distortions. This study proposes a lightweight and sample-adaptive Multi-Scale Vision Transformer (MSA-ViT) for efficient and robust malware image classification. MSA-ViT leverages a fixed set of input scales and integrates them using a Scale-Attention Fusion (SAF) module, where the largest-scale CLS token serves as the query to dynamically aggregate cross-scale More >

  • Open Access

    ARTICLE

    LCDM-Mono: Lightweight Conditional Diffusion Model for Self-Supervised Monocular Depth Estimation

    Hao Li1,2, Zhoujingzi Qiu1,2, Jianxiao Zou1,2, Haojie Wu1, Shicai Fan1,2,*

    CMC-Computers, Materials & Continua, Vol.87, No.3, 2026, DOI:10.32604/cmc.2026.076784 - 09 April 2026

    Abstract Self-supervised monocular depth estimation has attracted considerable attention due to its ability to learn without ground-truth depth annotations and its strong scalability. However, existing approaches still suffer from inaccurate object boundaries and limited inference efficiency. To address these issues, we present a Lightweight Conditional Diffusion Model for Monocular Depth Estimation (LCDM-Mono). The proposed framework integrates an efficient diffusion inference strategy with a knowledge distillation scheme, enabling the model to generate high-quality depth maps with only two sampling steps during inference. This design substantially reduces computational overhead and ensures real-time performance on resource-constrained platforms. In addition, More >

  • Open Access

    ARTICLE

    A Cooperative Hybrid Learning Framework for Automated Dandruff Severity Grading

    Sin-Ye Jhong1, Hui-Che Hsu1,2, Hsin-Hua Huang2, Chih-Hsien Hsia3,4,*, Yulius Harjoseputro2,5, Yung-Yao Chen2,*

    CMC-Computers, Materials & Continua, Vol.87, No.1, 2026, DOI:10.32604/cmc.2026.072633 - 10 February 2026

    Abstract Automated grading of dandruff severity is a clinically significant but challenging task due to the inherent ordinal nature of severity levels and the high prevalence of label noise from subjective expert annotations. Standard classification methods fail to address these dual challenges, limiting their real-world performance. In this paper, a novel, three-phase training framework is proposed that learns a robust ordinal classifier directly from noisy labels. The approach synergistically combines a rank-based ordinal regression backbone with a cooperative, semi-supervised learning strategy to dynamically partition the data into clean and noisy subsets. A hybrid training objective is… More >

  • Open Access

    ARTICLE

    DeblurTomo: Self-Supervised Computed Tomography Reconstruction from Blurry Images

    Qingyang Zhou1, Guofeng Lu2, Yunfan Ye3,*, Zhiping Cai1

    CMC-Computers, Materials & Continua, Vol.84, No.2, pp. 2411-2427, 2025, DOI:10.32604/cmc.2025.066810 - 03 July 2025

    Abstract Computed Tomography (CT) reconstruction is essential in medical imaging and other engineering fields. However, blurring of the projection during CT imaging can lead to artifacts in the reconstructed images. Projection blur combines factors such as larger ray sources, scattering and imaging system vibration. To address the problem, we propose DeblurTomo, a novel self-supervised learning-based deblurring and reconstruction algorithm that efficiently reconstructs sharp CT images from blurry input without needing external data and blur measurement. Specifically, we constructed a coordinate-based implicit neural representation reconstruction network, which can map the coordinates to the attenuation coefficient in the… More >

  • Open Access

    ARTICLE

    A Self-Supervised Hybrid Similarity Framework for Underwater Coral Species Classification

    Yu-Shiuan Tsai*, Zhen-Rong Wu, Jian-Zhi Liu

    CMC-Computers, Materials & Continua, Vol.84, No.2, pp. 3431-3457, 2025, DOI:10.32604/cmc.2025.066509 - 03 July 2025

    Abstract Few-shot learning has emerged as a crucial technique for coral species classification, addressing the challenge of limited labeled data in underwater environments. This study introduces an optimized few-shot learning model that enhances classification accuracy while minimizing reliance on extensive data collection. The proposed model integrates a hybrid similarity measure combining Euclidean distance and cosine similarity, effectively capturing both feature magnitude and directional relationships. This approach achieves a notable accuracy of 71.8% under a 5-way 5-shot evaluation, outperforming state-of-the-art models such as Prototypical Networks, FEAT, and ESPT by up to 10%. Notably, the model demonstrates high… More >

  • Open Access

    ARTICLE

    Transient Stability Assessment Model and Its Updating Based on Dual-Tower Transformer

    Nan Li1,2,*, Jingxiong Dong2, Liang Tao3, Liang Huang3

    Energy Engineering, Vol.122, No.7, pp. 2957-2975, 2025, DOI:10.32604/ee.2025.062667 - 27 June 2025

    Abstract With the continuous expansion and increasing complexity of power system scales, the binary classification for transient stability assessment in power systems can no longer meet the safety requirements of power system control and regulation. Therefore, this paper proposes a multi-class transient stability assessment model based on an improved Transformer. The model is designed with a dual-tower encoder structure: one encoder focuses on the time dependency of data, while the other focuses on the dynamic correlations between variables. Feature extraction is conducted from both time and variable perspectives to ensure the completeness of the feature extraction… More > Graphic Abstract

    Transient Stability Assessment Model and Its Updating Based on Dual-Tower Transformer

  • Open Access

    ARTICLE

    Self-Supervised Monocular Depth Estimation with Scene Dynamic Pose

    Jing He1, Haonan Zhu2, Chenhao Zhao1, Minrui Zhao3,*

    CMC-Computers, Materials & Continua, Vol.83, No.3, pp. 4551-4573, 2025, DOI:10.32604/cmc.2025.062437 - 19 May 2025

    Abstract Self-supervised monocular depth estimation has emerged as a major research focus in recent years, primarily due to the elimination of ground-truth depth dependence. However, the prevailing architectures in this domain suffer from inherent limitations: existing pose network branches infer camera ego-motion exclusively under static-scene and Lambertian-surface assumptions. These assumptions are often violated in real-world scenarios due to dynamic objects, non-Lambertian reflectance, and unstructured background elements, leading to pervasive artifacts such as depth discontinuities (“holes”), structural collapse, and ambiguous reconstruction. To address these challenges, we propose a novel framework that integrates scene dynamic pose estimation into… More >

  • Open Access

    ARTICLE

    Robust Deep One-Class Classification Time Series Anomaly Detection

    Zhengdao Yang1, Xuewei Wang2, Yuling Chen1,*, Hui Dou1, Haiwei Sang3

    CMC-Computers, Materials & Continua, Vol.83, No.3, pp. 5181-5197, 2025, DOI:10.32604/cmc.2025.060564 - 19 May 2025

    Abstract Anomaly detection (AD) in time series data is widely applied across various industries for monitoring and security applications, emerging as a key research focus within the field of deep learning. While many methods based on different normality assumptions perform well in specific scenarios, they often neglected the overall normality issue. Some feature extraction methods incorporate pre-training processes but they may not be suitable for time series anomaly detection, leading to decreased performance. Additionally, real-world time series samples are rarely free from noise, making them susceptible to outliers, which further impacts detection accuracy. To address these More >

  • Open Access

    ARTICLE

    From Imperfection to Perfection: Advanced 3D Facial Reconstruction Using MICA Models and Self-Supervision Learning

    Thinh D. Le, Duong Q. Nguyen, Phuong D. Nguyen, H. Nguyen-Xuan*

    CMES-Computer Modeling in Engineering & Sciences, Vol.142, No.2, pp. 1459-1479, 2025, DOI:10.32604/cmes.2024.056753 - 27 January 2025

    Abstract Research on reconstructing imperfect faces is a challenging task. In this study, we explore a data-driven approach using a pre-trained MICA (MetrIC fAce) model combined with 3D printing to address this challenge. We propose a training strategy that utilizes the pre-trained MICA model and self-supervised learning techniques to improve accuracy and reduce the time needed for 3D facial structure reconstruction. Our results demonstrate high accuracy, evaluated by the geometric loss function and various statistical measures. To showcase the effectiveness of the approach, we used 3D printing to create a model that covers facial wounds. The More >

  • Open Access

    ARTICLE

    A Novel Self-Supervised Learning Network for Binocular Disparity Estimation

    Jiawei Tian1, Yu Zhou1, Xiaobing Chen2, Salman A. AlQahtani3, Hongrong Chen4, Bo Yang4,*, Siyu Lu4, Wenfeng Zheng3,4,*

    CMES-Computer Modeling in Engineering & Sciences, Vol.142, No.1, pp. 209-229, 2025, DOI:10.32604/cmes.2024.057032 - 17 December 2024

    Abstract Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination, hindering accurate three-dimensional lesion reconstruction by surgical robots. This study proposes a novel end-to-end disparity estimation model to address these challenges. Our approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions, integrating multi-scale image information to enhance robustness against lighting interferences. This study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison, improving accuracy and efficiency. The model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot, comprising More >

Displaying 1-10 on page 1 of 12. Per Page