Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

3D Vehicle Detection Algorithm Based on Multimodal Decision-Level Fusion

Peicheng Shi^1,*, Heng Qi¹, Zhiqiang Liu¹, Aixi Yang²

CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.3, pp. 2007-2023, 2023, DOI:10.32604/cmes.2023.022304

Abstract 3D vehicle detection based on LiDAR-camera fusion is becoming an emerging research topic in autonomous driving. The algorithm based on the Camera-LiDAR object candidate fusion method (CLOCs) is currently considered to be a more effective decision-level fusion algorithm, but it does not fully utilize the extracted features of 3D and 2D. Therefore, we proposed a 3D vehicle detection algorithm based on multimodal decision-level fusion. First, project the anchor point of the 3D detection bounding box into the 2D image, calculate the distance between 2D and 3D anchor points, and use this distance as a new fusion feature to enhance the… More > Graphic Abstract

3D Vehicle Detection Algorithm Based on Multimodal Decision-Level Fusion

Open Access

ARTICLE

Profiling of Urban Noise Using Artificial Intelligence

Le Quang Thao^1,2,*, Duong Duc Cuong², Tran Thi Tuong Anh³, Tran Duc Luong⁴

Computer Systems Science and Engineering, Vol.45, No.2, pp. 1309-1321, 2023, DOI:10.32604/csse.2023.031010

Abstract Noise pollution tends to receive less awareness compared to other types of pollution, however, it greatly impacts the quality of life for humans such as causing sleep disruption, stress or hearing impairment. Profiling urban sound through the identification of noise sources in cities could help to benefit livability by reducing exposure to noise pollution through methods such as noise control, planning of the soundscape environment, or selection of safe living space. In this paper, we proposed a self-attention long short-term memory (LSTM) method that can improve sound classification compared to previous baselines. An attention mechanism will be designed solely to… More >

Open Access

ARTICLE

An End-to-End Transformer-Based Automatic Speech Recognition for Qur’an Reciters

Mohammed Hadwan^1,2,*, Hamzah A. Alsayadi^3,4, Salah AL-Hagree⁵

CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3471-3487, 2023, DOI:10.32604/cmc.2023.033457

Abstract The attention-based encoder-decoder technique, known as the trans-former, is used to enhance the performance of end-to-end automatic speech recognition (ASR). This research focuses on applying ASR end-to-end transformer-based models for the Arabic language, as the researchers’ community pays little attention to it. The Muslims Holy Qur’an book is written using Arabic diacritized text. In this paper, an end-to-end transformer model to building a robust Qur’an vs. recognition is proposed. The acoustic model was built using the transformer-based model as deep learning by the PyTorch framework. A multi-head attention mechanism is utilized to represent the encoder and decoder in the acoustic… More >

Open Access

ARTICLE

Residual Attention Deep SVDD for COVID-19 Diagnosis Using CT Scans

Akram Ali Alhadad^1,2,*, Omar Tarawneh³, Reham R. Mostafa¹, Hazem M. El-Bakry¹

CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3333-3350, 2023, DOI:10.32604/cmc.2023.033413

Abstract COVID-19 is the common name of the disease caused by the novel coronavirus (2019-nCoV) that appeared in Wuhan, China in 2019. Discovering the infected people is the most important factor in the fight against the disease. The gold-standard test to diagnose COVID-19 is polymerase chain reaction (PCR), but it takes 5–6 h and, in the early stages of infection, may produce false-negative results. Examining Computed Tomography (CT) images to diagnose patients infected with COVID-19 has become an urgent necessity. In this study, we propose a residual attention deep support vector data description SVDD (RADSVDD) approach to diagnose COVID-19. It is… More >

Open Access

ARTICLE

A Dual Attention Encoder-Decoder Text Summarization Model

Nada Ali Hakami¹, Hanan Ahmed Hosni Mahmoud^2,*

CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3697-3710, 2023, DOI:10.32604/cmc.2023.031525

Abstract A worthy text summarization should represent the fundamental content of the document. Recent studies on computerized text summarization tried to present solutions to this challenging problem. Attention models are employed extensively in text summarization process. Classical attention techniques are utilized to acquire the context data in the decoding phase. Nevertheless, without real and efficient feature extraction, the produced summary may diverge from the core topic. In this article, we present an encoder-decoder attention system employing dual attention mechanism. In the dual attention mechanism, the attention algorithm gathers main data from the encoder side. In the dual attention model, the system… More >

Open Access

ARTICLE

Forecasting Future Trajectories with an Improved Transformer Network

Wei Wu¹, Weigong Zhang^1,*, Dong Wang¹, Lydia Zhu², Xiang Song³

CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 3811-3828, 2023, DOI:10.32604/cmc.2023.029787

Abstract An increase in car ownership brings convenience to people’s life. However, it also leads to frequent traffic accidents. Precisely forecasting surrounding agents’ future trajectories could effectively decrease vehicle-vehicle and vehicle-pedestrian collisions. Long-short-term memory (LSTM) network is often used for vehicle trajectory prediction, but it has some shortages such as gradient explosion and low efficiency. A trajectory prediction method based on an improved Transformer network is proposed to forecast agents’ future trajectories in a complex traffic environment. It realizes the transformation from sequential step processing of LSTM to parallel processing of Transformer based on attention mechanism. To perform trajectory prediction more… More >

Open Access

ARTICLE

A Multi-Level Circulant Cross-Modal Transformer for Multimodal Speech Emotion Recognition

Peizhu Gong¹, Jin Liu¹, Zhongdai Wu², Bing Han², Y. Ken Wang³, Huihua He^4,*

CMC-Computers, Materials & Continua, Vol.74, No.2, pp. 4203-4220, 2023, DOI:10.32604/cmc.2023.028291

Abstract Speech emotion recognition, as an important component of human-computer interaction technology, has received increasing attention. Recent studies have treated emotion recognition of speech signals as a multimodal task, due to its inclusion of the semantic features of two different modalities, i.e., audio and text. However, existing methods often fail in effectively represent features and capture correlations. This paper presents a multi-level circulant cross-modal Transformer (MLCCT) for multimodal speech emotion recognition. The proposed model can be divided into three steps, feature extraction, interaction and fusion. Self-supervised embedding models are introduced for feature extraction, which give a more powerful representation of the… More >

Open Access

ARTICLE

CEMA-LSTM: Enhancing Contextual Feature Correlation for Radar Extrapolation Using Fine-Grained Echo Datasets

Zhiyun Yang^1,#, Qi Liu^1,#,*, Hao Wu¹, Xiaodong Liu², Yonghong Zhang³

CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 45-64, 2023, DOI:10.32604/cmes.2022.022045

Abstract Accurate precipitation nowcasting can provide great convenience to the public so they can conduct corresponding arrangements in advance to deal with the possible impact of upcoming heavy rain. Recent relevant research activities have shown their concerns on various deep learning models for radar echo extrapolation, where radar echo maps were used to predict their consequent moment, so as to recognize potential severe convective weather events. However, these approaches suffer from an inaccurate prediction of echo dynamics and unreliable depiction of echo aggregation or dissipation, due to the size limitation of convolution filter, lack of global feature, and less attention to… More > Graphic Abstract

CEMA-LSTM: Enhancing Contextual Feature Correlation for Radar Extrapolation Using Fine-Grained Echo Datasets

Open Access

ARTICLE

Image Color Rendering Based on Hinge-Cross-Entropy GAN in Internet of Medical Things

Hong’an Li¹, Min Zhang^1,*, Dufeng Chen², Jing Zhang¹, Meng Yang³, Zhanli Li¹

CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 779-794, 2023, DOI:10.32604/cmes.2022.022369

Abstract Computer-aided diagnosis based on image color rendering promotes medical image analysis and doctor-patient communication by highlighting important information of medical diagnosis. To overcome the limitations of the color rendering method based on deep learning, such as poor model stability, poor rendering quality, fuzzy boundaries and crossed color boundaries, we propose a novel hinge-cross-entropy generative adversarial network (HCEGAN). The self-attention mechanism was added and improved to focus on the important information of the image. And the hinge-cross-entropy loss function was used to stabilize the training process of GAN models. In this study, we implement the HCEGAN model for image color rendering… More > Graphic Abstract

Image Color Rendering Based on Hinge-Cross-Entropy GAN in Internet of Medical Things

Open Access

ARTICLE

Facial Expression Recognition Based on Multi-Channel Attention Residual Network

Tongping Shen^1,2,*, Huanqing Xu¹

CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.1, pp. 539-560, 2023, DOI:10.32604/cmes.2022.022312

Abstract For the problems of complex model structure and too many training parameters in facial expression recognition algorithms, we proposed a residual network structure with a multi-headed channel attention (MCA) module. The migration learning algorithm is used to pre-train the convolutional layer parameters and mitigate the overfitting caused by the insufficient number of training samples. The designed MCA module is integrated into the ResNet18 backbone network. The attention mechanism highlights important information and suppresses irrelevant information by assigning different coefficients or weights, and the multi-head structure focuses more on the local features of the pictures, which improves the efficiency of facial… More >

Displaying 151-160 on page 16 of 271. Per Page

View

1094

Download

561

Like

2

View

1155

Download

466

Like

1

View

953

Download

435

Like

0

View

1298

Download

410

Like

0

View

690

Download

388

Like

0

View

749

Download

381

Like

0

View

897

Download

424

Like

1

View

1887

Download

886

Like

0

View

1056

Download

755

Like

2

View

1087

Download

700

Like

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: