Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

WMA: A Multi-Scale Self-Attention Feature Extraction Network Based on Weight Sharing for VQA

Yue Li, Jin Liu^*, Shengjie Shang

Journal on Big Data, Vol.3, No.3, pp. 111-118, 2021, DOI:10.32604/jbd.2021.017169

Abstract Visual Question Answering (VQA) has attracted extensive research focus and has become a hot topic in deep learning recently. The development of computer vision and natural language processing technology has contributed to the advancement of this research area. Key solutions to improve the performance of VQA system exist in feature extraction, multimodal fusion, and answer prediction modules. There exists an unsolved issue in the popular VQA image feature extraction module that extracts the fine-grained features from objects of different scale difficultly. In this paper, a novel feature extraction network that combines multi-scale convolution and self-attention More >

Open Access

ARTICLE

Effective Video Summarization Approach Based on Visual Attention

Hilal Ahmad¹, Habib Ullah Khan², Sikandar Ali^3,*, Syed Ijaz Ur Rahman¹, Fazli Wahid³, Hizbullah Khattak⁴

CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 1427-1442, 2022, DOI:10.32604/cmc.2022.021158

Abstract Video summarization is applied to reduce redundancy and develop a concise representation of key frames in the video, more recently, video summaries have been used through visual attention modeling. In these schemes, the frames that stand out visually are extracted as key frames based on human attention modeling theories. The schemes for modeling visual attention have proven to be effective for video summaries. Nevertheless, the high cost of computing in such techniques restricts their usability in everyday situations. In this context, we propose a method based on KFE (key frame extraction) technique, which is recommended… More >

Open Access

ARTICLE

The Acute Effects of Aerobic Dance Exercise with and without Face Mask Use on Attention, Perceived Exertion and Mood States

Maamer Slimani^1,2,*, Nicola Bragazzi³, Amri Hammami², Hela Znazen⁴, Qian Yu^5,6, Zhaowei Kong⁶, Liye Zou⁵

International Journal of Mental Health Promotion, Vol.23, No.4, pp. 513-520, 2021, DOI:10.32604/IJMHP.2021.017639

Abstract The present study aimed to determine the effect of wearing a face mask during aerobic dance exercise on cognitive function, more specifically on attention, as well as on perceived exertion and mood states. Thirteen healthy college students (9 males and 4 females: mean age = 17.5 years, height = 1.72 m, weight = 71.00 kg) volunteered to participate in this study. They were randomized to perform aerobic dance exercise while wearing a cloth face mask or no mask or a control condition (sitting on a comfortable chair and reading information about the health benefits of aerobic dance exercise) on three separate occasions (with… More >

Open Access

ARTICLE

Facial Expression Recognition Using Enhanced Convolution Neural Network with Attention Mechanism

K. Prabhu^1,*, S. SathishKumar², M. Sivachitra³, S. Dineshkumar², P. Sathiyabama⁴

Computer Systems Science and Engineering, Vol.41, No.1, pp. 415-426, 2022, DOI:10.32604/csse.2022.019749

Abstract Facial Expression Recognition (FER) has been an interesting area of research in places where there is human-computer interaction. Human psychology, emotions and behaviors can be analyzed in FER. Classifiers used in FER have been perfect on normal faces but have been found to be constrained in occluded faces. Recently, Deep Learning Techniques (DLT) have gained popularity in applications of real-world problems including recognition of human emotions. The human face reflects emotional states and human intentions. An expression is the most natural and powerful way of communicating non-verbally. Systems which form communications between the two are More >

Open Access

ARTICLE

Artifacts Reduction Using Multi-Scale Feature Attention Network in Compressed Medical Images

Seonjae Kim, Dongsan Jun^*

CMC-Computers, Materials & Continua, Vol.70, No.2, pp. 3267-3279, 2022, DOI:10.32604/cmc.2022.020651

Abstract Medical image compression is one of the essential technologies to facilitate real-time medical data transmission in remote healthcare applications. In general, image compression can introduce undesired coding artifacts, such as blocking artifacts and ringing effects. In this paper, we proposed a Multi-Scale Feature Attention Network (MSFAN) with two essential parts, which are multi-scale feature extraction layers and feature attention layers to efficiently remove coding artifacts of compressed medical images. Multi-scale feature extraction layers have four Feature Extraction (FE) blocks. Each FE block consists of five convolution layers and one CA block for weighted skip connection. More >

Open Access

ARTICLE

An improved CRNN for Vietnamese Identity Card Information Recognition

Trinh Tan Dat¹, Le Tran Anh Dang^1,2, Nguyen Nhat Truong^1,2, Pham Cung Le Thien Vu¹, Vu Ngoc Thanh Sang¹, Pham Thi Vuong¹, Pham The Bao^1,*

Computer Systems Science and Engineering, Vol.40, No.2, pp. 539-555, 2022, DOI:10.32604/csse.2022.019064

Abstract This paper proposes an enhancement of an automatic text recognition system for extracting information from the front side of the Vietnamese citizen identity (CID) card. First, we apply Mask-RCNN to segment and align the CID card from the background. Next, we present two approaches to detect the CID card’s text lines using traditional image processing techniques compared to the EAST detector. Finally, we introduce a new end-to-end Convolutional Recurrent Neural Network (CRNN) model based on a combination of Connectionist Temporal Classification (CTC) and attention mechanism for Vietnamese text recognition by jointly train the CTC and… More >

Open Access

ARTICLE

A Position-Aware Transformer for Image Captioning

Zelin Deng^1,*, Bo Zhou¹, Pei He², Jianfeng Huang³, Osama Alfarraj⁴, Amr Tolba^4,5

CMC-Computers, Materials & Continua, Vol.70, No.1, pp. 2065-2081, 2022, DOI:10.32604/cmc.2022.019328

Abstract Image captioning aims to generate a corresponding description of an image. In recent years, neural encoder-decoder models have been the dominant approaches, in which the Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) are used to translate an image into a natural language description. Among these approaches, the visual attention mechanisms are widely used to enable deeper image understanding through fine-grained analysis and even multiple steps of reasoning. However, most conventional visual attention mechanisms are based on high-level image features, ignoring the effects of other image features, and giving insufficient consideration to the… More >

Open Access

ARTICLE

Attention-Based and Time Series Models for Short-Term Forecasting of COVID-19 Spread

Jurgita Markevičiūtė^1,*, Jolita Bernatavičienė², Rūta Levulienė¹, Viktor Medvedev², Povilas Treigys², Julius Venskus²

CMC-Computers, Materials & Continua, Vol.70, No.1, pp. 695-714, 2022, DOI:10.32604/cmc.2022.018735

Abstract The growing number of COVID-19 cases puts pressure on healthcare services and public institutions worldwide. The pandemic has brought much uncertainty to the global economy and the situation in general. Forecasting methods and modeling techniques are important tools for governments to manage critical situations caused by pandemics, which have negative impact on public health. The main purpose of this study is to obtain short-term forecasts of disease epidemiology that could be useful for policymakers and public institutions to make necessary short-term decisions. To evaluate the effectiveness of the proposed attention-based method combining certain data mining… More >

Open Access

ARTICLE

A Multi-Feature Learning Model with Enhanced Local Attention for Vehicle Re-Identification

Wei Sun^1,2,*, Xuan Chen³, Xiaorui Zhang^1,3, Guangzhao Dai², Pengshuai Chang², Xiaozheng He⁴

CMC-Computers, Materials & Continua, Vol.69, No.3, pp. 3549-3561, 2021, DOI:10.32604/cmc.2021.021627

Abstract Vehicle re-identification (ReID) aims to retrieve the target vehicle in an extensive image gallery through its appearances from various views in the cross-camera scenario. It has gradually become a core technology of intelligent transportation system. Most existing vehicle re-identification models adopt the joint learning of global and local features. However, they directly use the extracted global features, resulting in insufficient feature expression. Moreover, local features are primarily obtained through advanced annotation and complex attention mechanisms, which require additional costs. To solve this issue, a multi-feature learning model with enhanced local attention for vehicle re-identification (MFELA)… More >

Open Access

ARTICLE

Visual Saliency Prediction Using Attention-based Cross-modal Integration Network in RGB-D Images

Xinyue Zhang¹, Ting Jin^1,*, Mingjie Han¹, Jingsheng Lei², Zhichao Cao³

Intelligent Automation & Soft Computing, Vol.30, No.2, pp. 439-452, 2021, DOI:10.32604/iasc.2021.018643

Abstract Saliency prediction has recently gained a large number of attention for the sake of the rapid development of deep neural networks in computer vision tasks. However, there are still dilemmas that need to be addressed. In this paper, we design a visual saliency prediction model using attention-based cross-model integration strategies in RGB-D images. Unlike other symmetric feature extraction networks, we exploit asymmetric networks to effectively extract depth features as the complementary information of RGB information. Then we propose attention modules to integrate cross-modal feature information and emphasize the feature representation of salient regions, meanwhile neglect… More >

Displaying 251-260 on page 26 of 309. Per Page

View

1379

Download

853

Like

0

View

1874

Download

1076

Like

0

View

2016

Download

1070

Like

2

View

2433

Download

1660

Like

0

View

1699

Download

1029

Like

0

View

3098

Download

1090

Like

0

View

1801

Download

1480

Like

0

View

2034

Download

1216

Like

0

View

2302

Download

1185

Like

0

View

1493

Download

914

Like

0

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: