Search Results (654)
  • Open Access

    ARTICLE

    Point-Based Fusion for Multimodal 3D Detection in Autonomous Driving

    Xinxin Liu, Bin Ye*

    Computer Systems Science and Engineering, Vol.49, pp. 287-300, 2025, DOI:10.32604/csse.2025.061655 - 20 February 2025

    Abstract In the broader field of mechanical technology, and particularly in the context of self-driving vehicles, cameras and Light Detection and Ranging (LiDAR) sensors provide complementary modalities that hold significant potential for sensor fusion. However, directly merging multi-sensor data through point projection often results in information loss due to quantization, and managing the differing data formats from multiple sensors remains a persistent challenge. To address these issues, we propose a new fusion method that leverages continuous convolution, point-pooling, and a learned Multilayer Perceptron (MLP) to achieve superior detection performance. Our approach integrates the segmentation mask with…

  • Open Access

    ARTICLE

    LEGF-DST: LLMs-Enhanced Graph-Fusion Dual-Stream Transformer for Fine-Grained Chinese Malicious SMS Detection

    Xin Tong1, Jingya Wang1,*, Ying Yang2, Tian Peng3, Hanming Zhai1, Guangming Ling4

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 1901-1924, 2025, DOI:10.32604/cmc.2024.059018 - 17 February 2025

    Abstract With the widespread use of SMS (Short Message Service), the proliferation of malicious SMS has emerged as a pressing societal issue. While deep learning-based text classifiers offer promise, they often exhibit suboptimal performance in fine-grained detection tasks, primarily due to imbalanced datasets and insufficient model representation capabilities. To address this challenge, this paper proposes an LLMs-enhanced graph fusion dual-stream Transformer model for fine-grained Chinese malicious SMS detection. During the data processing stage, Large Language Models (LLMs) are employed for data augmentation, mitigating dataset imbalance. In the data input stage, both word-level and character-level features are…

  • Open Access

    ARTICLE

    ASL-OOD: Hierarchical Contextual Feature Fusion with Angle-Sensitive Loss for Oriented Object Detection

    Kexin Wang1,#, Jiancheng Liu1,#,*, Yuqing Lin2,*, Tuo Wang1, Zhipeng Zhang1, Wanlong Qi1, Xingye Han1, Runyuan Wen3

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 1879-1899, 2025, DOI:10.32604/cmc.2024.058952 - 17 February 2025

    Abstract Detecting oriented targets in remote sensing images amidst complex and heterogeneous backgrounds remains a formidable challenge in the field of object detection. Current frameworks for oriented detection modules are constrained by intrinsic limitations, including excessive computational and memory overheads, discrepancies between predefined anchors and ground truth bounding boxes, intricate training processes, and feature alignment inconsistencies. To overcome these challenges, we present ASL-OOD (Angle-based SIOU Loss for Oriented Object Detection), a novel, efficient, and robust one-stage framework tailored for oriented object detection. The ASL-OOD framework comprises three core components: the Transformer-based Backbone (TB), the Transformer-based Neck…

  • Open Access

    ARTICLE

    External Knowledge-Enhanced Cross-Attention Fusion Model for Tobacco Sentiment Analysis

    Lihua Xie1, Ni Tang1, Qing Chen1,*, Jun Li2,*

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 3381-3397, 2025, DOI:10.32604/cmc.2024.058950 - 17 February 2025

    Abstract In the age of information explosion and artificial intelligence, sentiment analysis tailored for the tobacco industry has emerged as a pivotal avenue for cigarette manufacturers to enhance their tobacco products. Existing solutions have primarily focused on intrinsic features within consumer reviews and achieved significant progress through deep feature extraction models. However, they still face these two key limitations: (1) neglecting the influence of fundamental tobacco information on analyzing the sentiment inclination of consumer reviews, resulting in a lack of consistent sentiment assessment criteria across thousands of tobacco brands; (2) overlooking the syntactic dependencies between Chinese…

  • Open Access

    ARTICLE

    Multisource Data Fusion Using MLP for Human Activity Recognition

    Sujittra Sarakon1, Wansuree Massagram1,2, Kreangsak Tamee1,3,*

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 2109-2136, 2025, DOI:10.32604/cmc.2025.058906 - 17 February 2025

    Abstract This research investigates the application of multisource data fusion using a Multi-Layer Perceptron (MLP) for Human Activity Recognition (HAR). The study integrates four distinct open-source datasets—WISDM, DaLiAc, MotionSense, and PAMAP2—to develop a generalized MLP model for classifying six human activities. Performance analysis of the fused model for each dataset reveals accuracy rates of 95.83 for WISDM, 97 for DaLiAc, 94.65 for MotionSense, and 98.54 for PAMAP2. A comparative evaluation was conducted between the fused MLP model and the individual dataset models, with the latter tested on separate validation sets. The results indicate that the MLP…

  • Open Access

    ARTICLE

    Telecontext-Enhanced Recursive Interactive Attention Fusion Method for Line-Level Defect Prediction

    Haitao He1, Bingjian Yan1, Ke Xu1,*, Lu Yu1,2

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 2077-2108, 2025, DOI:10.32604/cmc.2024.058779 - 17 February 2025

    Abstract Software defect prediction aims to use measurement data of code and historical defects to predict potential problems, optimize testing resources and defect management. However, current methods face challenges: (1) Coarse-grained file level detection cannot accurately locate specific defects. (2) Fine-grained line-level defect prediction methods rely solely on local information of a single line of code, failing to deeply analyze the semantic context of the code line and ignoring the heuristic impact of line-level context on the code line, making it difficult to capture the interaction between global and local information. Therefore, this paper proposes a…

  • Open Access

    ARTICLE

    Multi-Scale Feature Fusion Network Model for Wireless Capsule Endoscopic Intestinal Lesion Detection

    Shiren Ye, Qi Meng, Shuo Zhang, Hui Wang*

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 2415-2429, 2025, DOI:10.32604/cmc.2024.058250 - 17 February 2025

    Abstract WCE (Wireless Capsule Endoscopy) is a new technology that combines computer vision and medicine, allowing doctors to visualize the conditions inside the intestines, achieving good diagnostic results. However, due to the complex intestinal environment and limited pixel resolution of WCE videos, lesions are not easily detectable, and it takes an experienced doctor 1–2 h to analyze a complete WCE video. The use of computer-aided diagnostic methods, assisting or even replacing manual WCE diagnosis, has significant application value. In response to the issue of intestinal lesion detection in WCE videos, this paper proposes a multi-scale feature…

  • Open Access

    ARTICLE

    Research on Multimodal Brain Tumor Segmentation Algorithm Based on Feature Decoupling and Information Bottleneck Theory

    Xuemei Yang1, Yuting Zhou2, Shiqi Liu1, Junping Yin2,3,*

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 3281-3307, 2025, DOI:10.32604/cmc.2024.057991 - 17 February 2025

    Abstract Aiming at the problems of information loss and the relationship between features and target tasks in multimodal medical image segmentation, a multimodal medical image segmentation algorithm based on feature decoupling and information bottleneck theory is proposed in this paper. Based on the reversible network, the bottom-up learning method for different modal information is constructed, which enhances the features’ expression ability and the network’s learning ability. The feature fusion module is designed to balance multi-directional information flow. To retain the information relevant to the target task to the maximum extent and suppress the information irrelevant to…

  • Open Access

    ARTICLE

    Lip-Audio Modality Fusion for Deep Forgery Video Detection

    Yong Liu1,4, Zhiyu Wang2,*, Shouling Ji3, Daofu Gong1,5, Lanxin Cheng1, Ruosi Cheng1

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 3499-3515, 2025, DOI:10.32604/cmc.2024.057859 - 17 February 2025

    Abstract In response to the problem of traditional methods ignoring audio modality tampering, this study aims to explore an effective deep forgery video detection technique that improves detection precision and reliability by fusing lip images and audio signals. The main method used is lip-audio matching detection technology based on the Siamese neural network, combined with MFCC (Mel Frequency Cepstrum Coefficient) feature extraction of band-pass filters, an improved dual-branch Siamese network structure, and a two-stream network structure design. Firstly, the video stream is preprocessed to extract lip images, and the audio stream is preprocessed to extract MFCC…

  • Open Access

    ARTICLE

    ACSF-ED: Adaptive Cross-Scale Fusion Encoder-Decoder for Spatio-Temporal Action Detection

    Wenju Wang1, Zehua Gu1,*, Bang Tang1, Sen Wang2, Jianfei Hao2

    CMC-Computers, Materials & Continua, Vol.82, No.2, pp. 2389-2414, 2025, DOI:10.32604/cmc.2024.057392 - 17 February 2025

    Abstract Current spatio-temporal action detection methods lack sufficient capabilities in extracting and comprehending spatio-temporal information. This paper introduces an end-to-end Adaptive Cross-Scale Fusion Encoder-Decoder (ACSF-ED) network to predict the action and locate the object efficiently. In the Adaptive Cross-Scale Fusion Spatio-Temporal Encoder (ACSF ST-Encoder), the Asymptotic Cross-scale Feature-fusion Module (ACCFM) is designed to address the issue of information degradation caused by the propagation of high-level semantic information, thereby extracting high-quality multi-scale features to provide superior features for subsequent spatio-temporal information modeling. Within the Shared-Head Decoder structure, a shared classification and regression detection head is constructed. A…

Displaying results 11-20 of 654 (page 2).