Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (407)
  • Open Access

    ARTICLE

    Deepfake Detection Method Based on Spatio-Temporal Information Fusion

    Xinyi Wang*, Wanru Song, Chuanyan Hao, Feng Liu

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 3351-3368, 2025, DOI:10.32604/cmc.2025.062922 - 16 April 2025

    Abstract As Deepfake technology continues to evolve, the distinction between real and fake content becomes increasingly blurred. Most existing Deepfake video detection methods rely on single-frame facial image features, which limits their ability to capture temporal differences between frames. Current methods also exhibit limited generalization capabilities, struggling to detect content generated by unknown forgery algorithms. Moreover, the diversity and complexity of forgery techniques introduced by Artificial Intelligence Generated Content (AIGC) present significant challenges for traditional detection frameworks, which must balance high detection accuracy with robust performance. To address these challenges, we propose a novel Deepfake detection… More >

  • Open Access

    ARTICLE

    Deep Learning Algorithm for Person Re-Identification Based on Dual Network Architecture

    Meng Zhu1,2, Xingyue Wang3, Honge Ren3,4,*, Abeer Hakeem5, Linda Mohaisen5,*

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 2889-2905, 2025, DOI:10.32604/cmc.2025.061421 - 16 April 2025

    Abstract Changing a person’s posture and low resolution are the key challenges for person re-identification (ReID) in various deep learning applications. In this paper, we introduce an innovative architecture using a dual attention network that includes an attention module and a joint measurement module of spatial-temporal information. The proposed approach can be classified into two main tasks. Firstly, the spatial attention feature map is formed by aggregating features in the spatial dimension. Additionally, the same operation is carried out on the channel dimension to form channel attention feature maps. Therefore, the receptive field size is adjusted… More >

  • Open Access

    ARTICLE

    TIPS: Tailored Information Extraction in Public Security Using Domain-Enhanced Large Language Model

    Yue Liu1, Qinglang Guo2, Chunyao Yang1, Yong Liao1,*

    CMC-Computers, Materials & Continua, Vol.83, No.2, pp. 2555-2572, 2025, DOI:10.32604/cmc.2025.060318 - 16 April 2025

    Abstract Processing police incident data in public security involves complex natural language processing (NLP) tasks, including information extraction. This data contains extensive entity information—such as people, locations, and events—while also involving reasoning tasks like personnel classification, relationship judgment, and implicit inference. Moreover, utilizing models for extracting information from police incident data poses a significant challenge—data scarcity, which limits the effectiveness of traditional rule-based and machine-learning methods. To address these, we propose TIPS. In collaboration with public security experts, we used de-identified police incident data to create templates that enable large language models (LLMs) to populate data More >

  • Open Access

    ARTICLE

    Performance vs. Complexity Comparative Analysis of Multimodal Bilinear Pooling Fusion Approaches for Deep Learning-Based Visual Arabic-Question Answering Systems

    Sarah M. Kamel1,*, Mai A. Fadel2, Lamiaa Elrefaei1,3, Shimaa I. Hassan1,4

    CMES-Computer Modeling in Engineering & Sciences, Vol.143, No.1, pp. 373-411, 2025, DOI:10.32604/cmes.2025.062837 - 11 April 2025

    Abstract Visual question answering (VQA) is a multimodal task, involving a deep understanding of the image scene and the question’s meaning and capturing the relevant correlations between both modalities to infer the appropriate answer. In this paper, we propose a VQA system intended to answer yes/no questions about real-world images, in Arabic. To support a robust VQA system, we work in two directions: (1) Using deep neural networks to semantically represent the given image and question in a fine-grained manner, namely ResNet-152 and Gated Recurrent Units (GRU). (2) Studying the role of the utilized multimodal bilinear… More >

  • Open Access

    ARTICLE

    Rolling Bearing Fault Diagnosis Method Based on FFT-VMD Multiscale Information Fusion and SE-TCN Model

    Chaozhi Cai, Yuqi Ren, Yingfang Xue*, Jianhua Ren

    Structural Durability & Health Monitoring, Vol.19, No.3, pp. 665-682, 2025, DOI:10.32604/sdhm.2025.059044 - 03 April 2025

    Abstract Rolling bearings are important parts of industrial equipment, and their fault diagnosis is crucial to maintaining these equipment’s regular operations. With the goal of improving the fault diagnosis accuracy of rolling bearings under complex working conditions and noise, this study proposes a multiscale information fusion method for fault diagnosis of rolling bearings based on fast Fourier transform (FFT) and variational mode decomposition (VMD), as well as the Senet (SE)-TCNnet (TCN) model. FFT is used to transform the original one-dimensional time domain vibration signal into a frequency domain signal, while VMD is used to decompose the… More >

  • Open Access

    ARTICLE

    Multi-Scale Feature Fusion Network for Accurate Detection of Cervical Abnormal Cells

    Chuanyun Xu1,#, Die Hu1,#, Yang Zhang1,*, Shuaiye Huang1, Yisha Sun1, Gang Li2

    CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 559-574, 2025, DOI:10.32604/cmc.2025.061579 - 26 March 2025

    Abstract Detecting abnormal cervical cells is crucial for early identification and timely treatment of cervical cancer. However, this task is challenging due to the morphological similarities between abnormal and normal cells and the significant variations in cell size. Pathologists often refer to surrounding cells to identify abnormalities. To emulate this slide examination behavior, this study proposes a Multi-Scale Feature Fusion Network (MSFF-Net) for detecting cervical abnormal cells. MSFF-Net employs a Cross-Scale Pooling Model (CSPM) to effectively capture diverse features and contextual information, ranging from local details to the overall structure. Additionally, a Multi-Scale Fusion Attention (MSFA)… More >

  • Open Access

    ARTICLE

    A Generative Image Steganography Based on Disentangled Attribute Feature Transformation and Invertible Mapping Rule

    Xiang Zhang1,2,*, Shenyan Han1,2, Wenbin Huang1,2, Daoyong Fu1,2

    CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 1149-1171, 2025, DOI:10.32604/cmc.2025.060876 - 26 March 2025

    Abstract Generative image steganography is a technique that directly generates stego images from secret information. Unlike traditional methods, it theoretically resists steganalysis because there is no cover image. Currently, the existing generative image steganography methods generally have good steganography performance, but there is still potential room for enhancing both the quality of stego images and the accuracy of secret information extraction. Therefore, this paper proposes a generative image steganography algorithm based on attribute feature transformation and invertible mapping rule. Firstly, the reference image is disentangled by a content and an attribute encoder to obtain content features… More >

  • Open Access

    ARTICLE

    Amalgamation of Classical and Large Language Models for Duplicate Bug Detection: A Comparative Study

    Sai Venkata Akhil Ammu1, Sukhjit Singh Sehra1,*, Sumeet Kaur Sehra2, Jaiteg Singh3

    CMC-Computers, Materials & Continua, Vol.83, No.1, pp. 435-453, 2025, DOI:10.32604/cmc.2025.057792 - 26 March 2025

    Abstract Duplicate bug reporting is a critical problem in the software repositories’ mining area. Duplicate bug reports can lead to redundant efforts, wasted resources, and delayed software releases. Thus, their accurate identification is essential for streamlining the bug triage process mining area. Several researchers have explored classical information retrieval, natural language processing, text and data mining, and machine learning approaches. The emergence of large language models (LLMs) (ChatGPT and Huggingface) has presented a new line of models for semantic textual similarity (STS). Although LLMs have shown remarkable advancements, there remains a need for longitudinal studies to… More >

  • Open Access

    ARTICLE

    Efficient Spatiotemporal Information Utilization for Video Camouflaged Object Detection

    Dongdong Zhang, Chunping Wang, Huiying Wang, Qiang Fu*

    CMC-Computers, Materials & Continua, Vol.82, No.3, pp. 4319-4338, 2025, DOI:10.32604/cmc.2025.060653 - 06 March 2025

    Abstract Video camouflaged object detection (VCOD) has become a fundamental task in computer vision that has attracted significant attention in recent years. Unlike image camouflaged object detection (ICOD), VCOD not only requires spatial cues but also needs motion cues. Thus, effectively utilizing spatiotemporal information is crucial for generating accurate segmentation results. Current VCOD methods, which typically focus on exploring motion representation, often ineffectively integrate spatial and motion features, leading to poor performance in diverse scenarios. To address these issues, we design a novel spatiotemporal network with an encoder-decoder structure. During the encoding stage, an adjacent space-time More >

  • Open Access

    ARTICLE

    DIEONet: Domain-Invariant Information Extraction and Optimization Network for Visual Place Recognition

    Shaoqi Hou1,2,3,*, Zebang Qin2, Chenyu Wu2, Guangqiang Yin2, Xinzhong Wang1, Zhiguo Wang2,*

    CMC-Computers, Materials & Continua, Vol.82, No.3, pp. 5019-5033, 2025, DOI:10.32604/cmc.2025.058233 - 06 March 2025

    Abstract Visual Place Recognition (VPR) technology aims to use visual information to judge the location of agents, which plays an irreplaceable role in tasks such as loop closure detection and relocation. It is well known that previous VPR algorithms emphasize the extraction and integration of general image features, while ignoring the mining of salient features that play a key role in the discrimination of VPR tasks. To this end, this paper proposes a Domain-invariant Information Extraction and Optimization Network (DIEONet) for VPR. The core of the algorithm is a newly designed Domain-invariant Information Mining Module (DIMM)… More >

Displaying 1-10 on page 1 of 407. Per Page