Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (309)
  • Open Access

    ARTICLE

    WMA: A Multi-Scale Self-Attention Feature Extraction Network Based on Weight Sharing for VQA

    Yue Li, Jin Liu*, Shengjie Shang

    Journal on Big Data, Vol.3, No.3, pp. 111-118, 2021, DOI:10.32604/jbd.2021.017169

    Abstract Visual Question Answering (VQA) has attracted extensive research focus and has become a hot topic in deep learning recently. The development of computer vision and natural language processing technology has contributed to the advancement of this research area. Key solutions to improve the performance of VQA system exist in feature extraction, multimodal fusion, and answer prediction modules. There exists an unsolved issue in the popular VQA image feature extraction module that extracts the fine-grained features from objects of different scale difficultly. In this paper, a novel feature extraction network that combines multi-scale convolution and self-attention More >

  • Open Access

    ARTICLE

    Effective Video Summarization Approach Based on Visual Attention

    Hilal Ahmad1, Habib Ullah Khan2, Sikandar Ali3,*, Syed Ijaz Ur Rahman1, Fazli Wahid3, Hizbullah Khattak4

    CMC-Computers, Materials & Continua, Vol.71, No.1, pp. 1427-1442, 2022, DOI:10.32604/cmc.2022.021158

    Abstract Video summarization is applied to reduce redundancy and develop a concise representation of key frames in the video, more recently, video summaries have been used through visual attention modeling. In these schemes, the frames that stand out visually are extracted as key frames based on human attention modeling theories. The schemes for modeling visual attention have proven to be effective for video summaries. Nevertheless, the high cost of computing in such techniques restricts their usability in everyday situations. In this context, we propose a method based on KFE (key frame extraction) technique, which is recommended… More >

  • Open Access

    ARTICLE

    The Acute Effects of Aerobic Dance Exercise with and without Face Mask Use on Attention, Perceived Exertion and Mood States

    Maamer Slimani1,2,*, Nicola Bragazzi3, Amri Hammami2, Hela Znazen4, Qian Yu5,6, Zhaowei Kong6, Liye Zou5

    International Journal of Mental Health Promotion, Vol.23, No.4, pp. 513-520, 2021, DOI:10.32604/IJMHP.2021.017639

    Abstract The present study aimed to determine the effect of wearing a face mask during aerobic dance exercise on cognitive function, more specifically on attention, as well as on perceived exertion and mood states. Thirteen healthy college students (9 males and 4 females: mean age = 17.5 years, height = 1.72 m, weight = 71.00 kg) volunteered to participate in this study. They were randomized to perform aerobic dance exercise while wearing a cloth face mask or no mask or a control condition (sitting on a comfortable chair and reading information about the health benefits of aerobic dance exercise) on three separate occasions (with… More >

  • Open Access

    ARTICLE

    Facial Expression Recognition Using Enhanced Convolution Neural Network with Attention Mechanism

    K. Prabhu1,*, S. SathishKumar2, M. Sivachitra3, S. Dineshkumar2, P. Sathiyabama4

    Computer Systems Science and Engineering, Vol.41, No.1, pp. 415-426, 2022, DOI:10.32604/csse.2022.019749

    Abstract Facial Expression Recognition (FER) has been an interesting area of research in places where there is human-computer interaction. Human psychology, emotions and behaviors can be analyzed in FER. Classifiers used in FER have been perfect on normal faces but have been found to be constrained in occluded faces. Recently, Deep Learning Techniques (DLT) have gained popularity in applications of real-world problems including recognition of human emotions. The human face reflects emotional states and human intentions. An expression is the most natural and powerful way of communicating non-verbally. Systems which form communications between the two are More >

  • Open Access

    ARTICLE

    Artifacts Reduction Using Multi-Scale Feature Attention Network in Compressed Medical Images

    Seonjae Kim, Dongsan Jun*

    CMC-Computers, Materials & Continua, Vol.70, No.2, pp. 3267-3279, 2022, DOI:10.32604/cmc.2022.020651

    Abstract Medical image compression is one of the essential technologies to facilitate real-time medical data transmission in remote healthcare applications. In general, image compression can introduce undesired coding artifacts, such as blocking artifacts and ringing effects. In this paper, we proposed a Multi-Scale Feature Attention Network (MSFAN) with two essential parts, which are multi-scale feature extraction layers and feature attention layers to efficiently remove coding artifacts of compressed medical images. Multi-scale feature extraction layers have four Feature Extraction (FE) blocks. Each FE block consists of five convolution layers and one CA block for weighted skip connection. More >

  • Open Access

    ARTICLE

    An improved CRNN for Vietnamese Identity Card Information Recognition

    Trinh Tan Dat1, Le Tran Anh Dang1,2, Nguyen Nhat Truong1,2, Pham Cung Le Thien Vu1, Vu Ngoc Thanh Sang1, Pham Thi Vuong1, Pham The Bao1,*

    Computer Systems Science and Engineering, Vol.40, No.2, pp. 539-555, 2022, DOI:10.32604/csse.2022.019064

    Abstract This paper proposes an enhancement of an automatic text recognition system for extracting information from the front side of the Vietnamese citizen identity (CID) card. First, we apply Mask-RCNN to segment and align the CID card from the background. Next, we present two approaches to detect the CID card’s text lines using traditional image processing techniques compared to the EAST detector. Finally, we introduce a new end-to-end Convolutional Recurrent Neural Network (CRNN) model based on a combination of Connectionist Temporal Classification (CTC) and attention mechanism for Vietnamese text recognition by jointly train the CTC and… More >

  • Open Access

    ARTICLE

    A Position-Aware Transformer for Image Captioning

    Zelin Deng1,*, Bo Zhou1, Pei He2, Jianfeng Huang3, Osama Alfarraj4, Amr Tolba4,5

    CMC-Computers, Materials & Continua, Vol.70, No.1, pp. 2065-2081, 2022, DOI:10.32604/cmc.2022.019328

    Abstract Image captioning aims to generate a corresponding description of an image. In recent years, neural encoder-decoder models have been the dominant approaches, in which the Convolutional Neural Network (CNN) and Long Short Term Memory (LSTM) are used to translate an image into a natural language description. Among these approaches, the visual attention mechanisms are widely used to enable deeper image understanding through fine-grained analysis and even multiple steps of reasoning. However, most conventional visual attention mechanisms are based on high-level image features, ignoring the effects of other image features, and giving insufficient consideration to the… More >

  • Open Access

    ARTICLE

    Attention-Based and Time Series Models for Short-Term Forecasting of COVID-19 Spread

    Jurgita Markevičiūtė1,*, Jolita Bernatavičienė2, Rūta Levulienė1, Viktor Medvedev2, Povilas Treigys2, Julius Venskus2

    CMC-Computers, Materials & Continua, Vol.70, No.1, pp. 695-714, 2022, DOI:10.32604/cmc.2022.018735

    Abstract The growing number of COVID-19 cases puts pressure on healthcare services and public institutions worldwide. The pandemic has brought much uncertainty to the global economy and the situation in general. Forecasting methods and modeling techniques are important tools for governments to manage critical situations caused by pandemics, which have negative impact on public health. The main purpose of this study is to obtain short-term forecasts of disease epidemiology that could be useful for policymakers and public institutions to make necessary short-term decisions. To evaluate the effectiveness of the proposed attention-based method combining certain data mining… More >

  • Open Access

    ARTICLE

    A Multi-Feature Learning Model with Enhanced Local Attention for Vehicle Re-Identification

    Wei Sun1,2,*, Xuan Chen3, Xiaorui Zhang1,3, Guangzhao Dai2, Pengshuai Chang2, Xiaozheng He4

    CMC-Computers, Materials & Continua, Vol.69, No.3, pp. 3549-3561, 2021, DOI:10.32604/cmc.2021.021627

    Abstract Vehicle re-identification (ReID) aims to retrieve the target vehicle in an extensive image gallery through its appearances from various views in the cross-camera scenario. It has gradually become a core technology of intelligent transportation system. Most existing vehicle re-identification models adopt the joint learning of global and local features. However, they directly use the extracted global features, resulting in insufficient feature expression. Moreover, local features are primarily obtained through advanced annotation and complex attention mechanisms, which require additional costs. To solve this issue, a multi-feature learning model with enhanced local attention for vehicle re-identification (MFELA)… More >

  • Open Access

    ARTICLE

    Visual Saliency Prediction Using Attention-based Cross-modal Integration Network in RGB-D Images

    Xinyue Zhang1, Ting Jin1,*, Mingjie Han1, Jingsheng Lei2, Zhichao Cao3

    Intelligent Automation & Soft Computing, Vol.30, No.2, pp. 439-452, 2021, DOI:10.32604/iasc.2021.018643

    Abstract Saliency prediction has recently gained a large number of attention for the sake of the rapid development of deep neural networks in computer vision tasks. However, there are still dilemmas that need to be addressed. In this paper, we design a visual saliency prediction model using attention-based cross-model integration strategies in RGB-D images. Unlike other symmetric feature extraction networks, we exploit asymmetric networks to effectively extract depth features as the complementary information of RGB information. Then we propose attention modules to integrate cross-modal feature information and emphasize the feature representation of salient regions, meanwhile neglect… More >

Displaying 251-260 on page 26 of 309. Per Page