Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (271)
  • Open Access

    ARTICLE

    3D Vehicle Detection Algorithm Based on Multimodal Decision-Level Fusion

    Peicheng Shi1,*, Heng Qi1, Zhiqiang Liu1, Aixi Yang2

    CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.3, pp. 2007-2023, 2023, DOI:10.32604/cmes.2023.022304

    Abstract 3D vehicle detection based on LiDAR-camera fusion is becoming an emerging research topic in autonomous driving. The algorithm based on the Camera-LiDAR object candidate fusion method (CLOCs) is currently considered to be a more effective decision-level fusion algorithm, but it does not fully utilize the extracted features of 3D and 2D. Therefore, we proposed a 3D vehicle detection algorithm based on multimodal decision-level fusion. First, project the anchor point of the 3D detection bounding box into the 2D image, calculate the distance between 2D and 3D anchor points, and use this distance as a new fusion feature to enhance the… More > Graphic Abstract

    3D Vehicle Detection Algorithm Based on Multimodal Decision-Level Fusion

  • Open Access

    ARTICLE

    Profiling of Urban Noise Using Artificial Intelligence

    Le Quang Thao1,2,*, Duong Duc Cuong2, Tran Thi Tuong Anh3, Tran Duc Luong4

    Computer Systems Science and Engineering, Vol.45, No.2, pp. 1309-1321, 2023, DOI:10.32604/csse.2023.031010

    Abstract Noise pollution tends to receive less awareness compared to other types of pollution, however, it greatly impacts the quality of life for humans such as causing sleep disruption, stress or hearing impairment. Profiling urban sound through the identification of noise sources in cities could help to benefit livability by reducing exposure to noise pollution through methods such as noise control, planning of the soundscape environment, or selection of safe living space. In this paper, we proposed a self-attention long short-term memory (LSTM) method that can improve sound classification compared to previous baselines. An attention mechanism will be designed solely to… More >

  • Open Access

    ARTICLE

    An End-to-End Transformer-Based Automatic Speech Recognition for Qur’an Reciters

    Mohammed Hadwan1,2,*, Hamzah A. Alsayadi3,4, Salah AL-Hagree5

    CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3471-3487, 2023, DOI:10.32604/cmc.2023.033457

    Abstract The attention-based encoder-decoder technique, known as the trans-former, is used to enhance the performance of end-to-end automatic speech recognition (ASR). This research focuses on applying ASR end-to-end transformer-based models for the Arabic language, as the researchers’ community pays little attention to it. The Muslims Holy Qur’an book is written using Arabic diacritized text. In this paper, an end-to-end transformer model to building a robust Qur’an vs. recognition is proposed. The acoustic model was built using the transformer-based model as deep learning by the PyTorch framework. A multi-head attention mechanism is utilized to represent the encoder and decoder in the acoustic… More >

  • Open Access

    ARTICLE

    Residual Attention Deep SVDD for COVID-19 Diagnosis Using CT Scans

    Akram Ali Alhadad1,2,*, Omar Tarawneh3, Reham R. Mostafa1, Hazem M. El-Bakry1

    CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3333-3350, 2023, DOI:10.32604/cmc.2023.033413

    Abstract COVID-19 is the common name of the disease caused by the novel coronavirus (2019-nCoV) that appeared in Wuhan, China in 2019. Discovering the infected people is the most important factor in the fight against the disease. The gold-standard test to diagnose COVID-19 is polymerase chain reaction (PCR), but it takes 5–6 h and, in the early stages of infection, may produce false-negative results. Examining Computed Tomography (CT) images to diagnose patients infected with COVID-19 has become an urgent necessity. In this study, we propose a residual attention deep support vector data description SVDD (RADSVDD) approach to diagnose COVID-19. It is… More >

  • Open Access

    ARTICLE

    A Dual Attention Encoder-Decoder Text Summarization Model

    Nada Ali Hakami1, Hanan Ahmed Hosni Mahmoud2,*

    CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3697-3710, 2023, DOI:10.32604/cmc.2023.031525

    Abstract A worthy text summarization should represent the fundamental content of the document. Recent studies on computerized text summarization tried to present solutions to this challenging problem. Attention models are employed extensively in text summarization process. Classical attention techniques are utilized to acquire the context data in the decoding phase. Nevertheless, without real and efficient feature extraction, the produced summary may diverge from the core topic. In this article, we present an encoder-decoder attention system employing dual attention mechanism. In the dual attention mechanism, the attention algorithm gathers main data from the encoder side. In the dual attention model, the system… More >

  • Open Access

    ARTICLE

    Forecasting Future Trajectories with an Improved Transformer Network

    Wei Wu1, Weigong Zhang1,*, Dong Wang1, Lydia Zhu2, Xiang Song3

    CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3811-3828, 2023, DOI:10.32604/cmc.2023.029787

    Abstract An increase in car ownership brings convenience to people’s life. However, it also leads to frequent traffic accidents. Precisely forecasting surrounding agents’ future trajectories could effectively decrease vehicle-vehicle and vehicle-pedestrian collisions. Long-short-term memory (LSTM) network is often used for vehicle trajectory prediction, but it has some shortages such as gradient explosion and low efficiency. A trajectory prediction method based on an improved Transformer network is proposed to forecast agents’ future trajectories in a complex traffic environment. It realizes the transformation from sequential step processing of LSTM to parallel processing of Transformer based on attention mechanism. To perform trajectory prediction more… More >

  • Open Access

    ARTICLE

    A Multi-Level Circulant Cross-Modal Transformer for Multimodal Speech Emotion Recognition

    Peizhu Gong1, Jin Liu1, Zhongdai Wu2, Bing Han2, Y. Ken Wang3, Huihua He4,*

    CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 4203-4220, 2023, DOI:10.32604/cmc.2023.028291

    Abstract Speech emotion recognition, as an important component of human-computer interaction technology, has received increasing attention. Recent studies have treated emotion recognition of speech signals as a multimodal task, due to its inclusion of the semantic features of two different modalities, i.e., audio and text. However, existing methods often fail in effectively represent features and capture correlations. This paper presents a multi-level circulant cross-modal Transformer (MLCCT) for multimodal speech emotion recognition. The proposed model can be divided into three steps, feature extraction, interaction and fusion. Self-supervised embedding models are introduced for feature extraction, which give a more powerful representation of the… More >

  • Open Access

    ARTICLE

    CEMA-LSTM: Enhancing Contextual Feature Correlation for Radar Extrapolation Using Fine-Grained Echo Datasets

    Zhiyun Yang1,#, Qi Liu1,#,*, Hao Wu1, Xiaodong Liu2, Yonghong Zhang3

    CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 45-64, 2023, DOI:10.32604/cmes.2022.022045

    Abstract Accurate precipitation nowcasting can provide great convenience to the public so they can conduct corresponding arrangements in advance to deal with the possible impact of upcoming heavy rain. Recent relevant research activities have shown their concerns on various deep learning models for radar echo extrapolation, where radar echo maps were used to predict their consequent moment, so as to recognize potential severe convective weather events. However, these approaches suffer from an inaccurate prediction of echo dynamics and unreliable depiction of echo aggregation or dissipation, due to the size limitation of convolution filter, lack of global feature, and less attention to… More > Graphic Abstract

    CEMA-LSTM: Enhancing Contextual Feature Correlation for Radar Extrapolation Using Fine-Grained Echo Datasets

  • Open Access

    ARTICLE

    Image Color Rendering Based on Hinge-Cross-Entropy GAN in Internet of Medical Things

    Hong’an Li1, Min Zhang1,*, Dufeng Chen2, Jing Zhang1, Meng Yang3, Zhanli Li1

    CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 779-794, 2023, DOI:10.32604/cmes.2022.022369

    Abstract Computer-aided diagnosis based on image color rendering promotes medical image analysis and doctor-patient communication by highlighting important information of medical diagnosis. To overcome the limitations of the color rendering method based on deep learning, such as poor model stability, poor rendering quality, fuzzy boundaries and crossed color boundaries, we propose a novel hinge-cross-entropy generative adversarial network (HCEGAN). The self-attention mechanism was added and improved to focus on the important information of the image. And the hinge-cross-entropy loss function was used to stabilize the training process of GAN models. In this study, we implement the HCEGAN model for image color rendering… More > Graphic Abstract

    Image Color Rendering Based on Hinge-Cross-Entropy GAN in Internet of Medical Things

  • Open Access

    ARTICLE

    Facial Expression Recognition Based on Multi-Channel Attention Residual Network

    Tongping Shen1,2,*, Huanqing Xu1

    CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 539-560, 2023, DOI:10.32604/cmes.2022.022312

    Abstract For the problems of complex model structure and too many training parameters in facial expression recognition algorithms, we proposed a residual network structure with a multi-headed channel attention (MCA) module. The migration learning algorithm is used to pre-train the convolutional layer parameters and mitigate the overfitting caused by the insufficient number of training samples. The designed MCA module is integrated into the ResNet18 backbone network. The attention mechanism highlights important information and suppresses irrelevant information by assigning different coefficients or weights, and the multi-head structure focuses more on the local features of the pictures, which improves the efficiency of facial… More >

Displaying 151-160 on page 16 of 271. Per Page