Search Results (271)
  • Open Access

    ARTICLE

    Apex Frame Spotting Using Attention Networks for Micro-Expression Recognition System

    Ng Lai Yee1, Mohd Asyraf Zulkifley2,*, Adhi Harmoko Saputro3, Siti Raihanah Abdani4

    CMC-Computers, Materials & Continua, Vol.73, No.3, pp. 5331-5348, 2022, DOI:10.32604/cmc.2022.028801

    Abstract: Micro-expression is manifested through subtle and brief facial movements that relay a person’s genuine hidden emotion. In a sequence of videos, there is a frame that captures the maximum facial differences, which is called the apex frame. Therefore, apex frame spotting is a crucial sub-module in a micro-expression recognition system. However, this spotting task is very challenging due to the characteristics of micro-expressions, which occur over a short duration with low-intensity muscle movements. Moreover, most of the existing automated works face difficulties in differentiating micro-expressions from other facial movements. Therefore, this paper presents a deep learning model with an attention…

  • Open Access

    ARTICLE

    Triple Multimodal Cyclic Fusion and Self-Adaptive Balancing for Video Q&A Systems

    Xiliang Zhang1, Jin Liu1,*, Yue Li1, Zhongdai Wu2,3, Y. Ken Wang4

    CMC-Computers, Materials & Continua, Vol.73, No.3, pp. 6407-6424, 2022, DOI:10.32604/cmc.2022.027097

    Abstract: The performance of Video Question and Answer (VQA) systems relies on capturing key information from both visual images and natural language in context to generate relevant answers to questions. However, traditional linear combinations of multimodal features focus only on shallow feature interactions and fall far short of the need for deep feature fusion. Attention mechanisms have been used to perform deep fusion, but most of them can only process weight assignment of single-modal information, leading to attention imbalance across different modalities. To address the above problems, we propose a novel VQA model based on Triple Multimodal feature Cyclic Fusion (TMCF) and Self-Adaptive Multimodal Balancing…

  • Open Access

    ARTICLE

    Enhanced Attention-Based Encoder-Decoder Framework for Text Recognition

    S. Prabu, K. Joseph Abraham Sundar*

    Intelligent Automation & Soft Computing, Vol.35, No.2, pp. 2071-2086, 2023, DOI:10.32604/iasc.2023.029105

    Abstract: Recognizing irregular text in natural images is a challenging task in computer vision. The existing approaches still face difficulties in recognizing irregular text because of its diverse shapes. In this paper, we propose a simple yet powerful irregular text recognition framework based on an encoder-decoder architecture. The proposed framework is divided into four main modules. Firstly, in the image transformation module, a Thin Plate Spline (TPS) transformation is employed to transform the irregular text image into a readable text image. Secondly, we propose a novel Spatial Attention Module (SAM) to compel the model to concentrate on text regions and obtain…

  • Open Access

    ARTICLE

    Low-Cost Real-Time Automated Optical Inspection Using Deep Learning and Attention Map

    Yu Shih, Chien-Chih Kuo, Ching-Hung Lee*

    Intelligent Automation & Soft Computing, Vol.35, No.2, pp. 2087-2099, 2023, DOI:10.32604/iasc.2023.027659

    Abstract: The recent trends in Industry 4.0 and Internet of Things have encouraged many factory managers to improve inspection processes to achieve automation and high detection rates. However, the corresponding cost results of sample tests are still used for quality control. A low-cost automated optical inspection system that can be integrated with production lines to fully inspect products without adjustments is introduced herein. The corresponding mechanism design enables each product to maintain a fixed position and orientation during inspection to accelerate the inspection process. The proposed system combines image recognition and deep learning to measure the dimensions of the thread and…

  • Open Access

    ARTICLE

    Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet

    Helong Yu, Xianhe Cheng, Ziqing Li, Qi Cai, Chunguang Bi*

    CMES-Computer Modeling in Engineering & Sciences, Vol.132, No.3, pp. 711-738, 2022, DOI:10.32604/cmes.2022.020263

    Abstract: To address the difficulty of identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks, a lightweight ResNet (LW-ResNet) model for apple disease recognition is proposed. Based on the deep residual network (ResNet18), the multi-scale feature extraction layer is constructed by group convolution to compress the model and improve its ability to extract lesion features of different sizes. The identity mapping structure is improved to reduce information loss, and the efficient channel attention module (ECANet) is introduced to suppress noise from a complex background. The experimental results show that the average…

  • Open Access

    ARTICLE

    Visual Object Tracking via Cascaded RPN Fusion and Coordinate Attention

    Jianming Zhang1,2,*, Kai Wang1,2, Yaoqi He1,2, Lidan Kuang1,2

    CMES-Computer Modeling in Engineering & Sciences, Vol.132, No.3, pp. 909-927, 2022, DOI:10.32604/cmes.2022.020471

    Abstract: Recently, Siamese-based trackers have achieved excellent performance in object tracking. However, the high speed and deformation of objects in the movement process make tracking difficult. Therefore, we have incorporated cascaded region-proposal-network (RPN) fusion and coordinate attention into Siamese trackers. The proposed network framework consists of three parts: a feature-extraction sub-network, a coordinate attention block, and a cascaded RPN block. We exploit the coordinate attention block, which can embed location information into channel attention, to establish long-term spatial location dependence while maintaining channel associations. Thus, the features of different layers are enhanced by the coordinate attention block. We then send these features separately into…

  • Open Access

    ARTICLE

    Criss-Cross Attentional Siamese Networks for Object Tracking

    Zhangdong Wang1, Jiaohua Qin1,*, Xuyu Xiang1, Yun Tan1, Neal N. Xiong2

    CMC-Computers, Materials & Continua, Vol.73, No.2, pp. 2931-2946, 2022, DOI:10.32604/cmc.2022.028896

    Abstract: Visual object tracking has been a hot topic in recent years. Meanwhile, Siamese networks have attracted extensive attention in this field because of their balanced precision and speed. However, most Siamese network methods can only distinguish the foreground from the non-semantic background. The fine-tuning and retraining of fully-convolutional Siamese networks for object tracking (SiamFC) can achieve higher precision under interference, but the tracking accuracy is still not ideal, especially in environments with more target interference, dim light, and shadows. In this paper, we propose criss-cross attentional Siamese networks for object tracking (SiamCC). To solve the imbalance between foreground…

  • Open Access

    ARTICLE

    Multi-Scale Network with Integrated Attention Unit for Crowd Counting

    Adel Hafeezallah1, Ahlam Al-Dhamari2,3,*, Syed Abd Rahman Abu-Bakar2

    CMC-Computers, Materials & Continua, Vol.73, No.2, pp. 3879-3903, 2022, DOI:10.32604/cmc.2022.028289

    Abstract: Estimating the crowd count and density of highly dense scenes witnessed in Muslim gatherings at religious sites in Makkah and Madinah is critical for developing control strategies and organizing such a large gathering. Moreover, since the crowd images in this case can range from low density to high density, detection-based approaches are hard to apply for crowd counting. Recently, deep learning-based regression has become the prominent approach for crowd counting problems, where a density-map is estimated, and its integral is further computed to acquire the final count result. In this paper, we put forward a novel multi-scale network (named 2U-Net)…

  • Open Access

    ARTICLE

    EEG Emotion Recognition Using an Attention Mechanism Based on an Optimized Hybrid Model

    Huiping Jiang1,*, Demeng Wu1, Xingqun Tang1, Zhongjie Li1, Wenbo Wu2

    CMC-Computers, Materials & Continua, Vol.73, No.2, pp. 2697-2712, 2022, DOI:10.32604/cmc.2022.027856

    Abstract: Emotions serve various functions. The traditional emotion recognition methods are based primarily on readily accessible facial expressions, gestures, and voice signals. However, it is often challenging to ensure that these non-physical signals are valid and reliable in practical applications. Electroencephalogram (EEG) signals are more successful than other signal recognition methods in recognizing these characteristics in real-time since they are difficult to camouflage. Although EEG signals are commonly used in current emotional recognition research, the accuracy is low when using traditional methods. Therefore, this study presented an optimized hybrid pattern with an attention mechanism (FFT_CLA) for EEG emotional recognition. First, the…

  • Open Access

    ARTICLE

    TP-MobNet: A Two-pass Mobile Network for Low-complexity Classification of Acoustic Scene

    Soonshin Seo1, Junseok Oh2, Eunsoo Cho2, Hosung Park2, Gyujin Kim2, Ji-Hwan Kim2,*

    CMC-Computers, Materials & Continua, Vol.73, No.2, pp. 3291-3303, 2022, DOI:10.32604/cmc.2022.026259

    Abstract: Acoustic scene classification (ASC) is a method of recognizing and classifying environments that employ acoustic signals. Various ASC approaches based on deep learning have been developed, with convolutional neural networks (CNNs) proving to be the most reliable and commonly utilized in ASC systems due to their suitability for constructing lightweight models. When using ASC systems in the real world, model complexity and device robustness are essential considerations. In this paper, we propose a two-pass mobile network for low-complexity classification of the acoustic scene, named TP-MobNet. With inverse residuals and linear bottlenecks, TP-MobNet is based on MobileNetV2, and following mobile blocks,…

Displaying 171-180 of 271 results (page 18).