Search Results (17)
  • Open Access

    ARTICLE

    TEAM: Transformer Encoder Attention Module for Video Classification

    Hae Sung Park1, Yong Suk Choi2,*

    Computer Systems Science and Engineering, Vol.48, No.2, pp. 451-477, 2024, DOI:10.32604/csse.2023.043245

    Abstract Much as humans focus solely on object movement to understand actions, directing a deep learning model’s attention to the core contexts within videos is crucial for improving video comprehension. In a recent study, the Video Masked Auto-Encoder (VideoMAE) employs a pre-training approach with a high ratio of tube masking and reconstruction, effectively mitigating spatial bias due to temporal redundancy in full video frames. This steers the model’s focus toward detailed temporal contexts. However, because VideoMAE still relies on full video frames during the action recognition stage, it may exhibit a progressive shift in attention towards spatial contexts, deteriorating its ability…

  • Open Access

    ARTICLE

    Detection Algorithm of Laboratory Personnel Irregularities Based on Improved YOLOv7

    Yongliang Yang, Linghua Xu*, Maolin Luo, Xiao Wang, Min Cao

    CMC-Computers, Materials & Continua, Vol.78, No.2, pp. 2741-2765, 2024, DOI:10.32604/cmc.2024.046768

    Abstract University laboratories are complex environments with heavy personnel traffic, so irregular personnel behavior can easily create security risks. Monitoring with mainstream detection algorithms suffers from low detection accuracy and slow speed. Therefore, the current management of personnel behavior relies mainly on institutional constraints, education and training, on-site supervision, etc., which is time-consuming and ineffective. To address this, this paper proposes an improved You Only Look Once version 7 (YOLOv7) that quickly detects irregular behaviors of laboratory personnel while ensuring high detection accuracy. First, to better capture the shape features of the target,…

  • Open Access

    ARTICLE

    Traffic Sign Recognition for Autonomous Vehicle Using Optimized YOLOv7 and Convolutional Block Attention Module

    P. Kuppusamy1,*, M. Sanjay1, P. V. Deepashree1, C. Iwendi2

    CMC-Computers, Materials & Continua, Vol.77, No.1, pp. 445-466, 2023, DOI:10.32604/cmc.2023.042675

    Abstract The infrastructure and construction of roads are crucial for the economic and social development of a region, but traffic-related challenges like accidents and congestion persist. Artificial Intelligence (AI) and Machine Learning (ML) have been used in road infrastructure and construction, particularly with Internet of Things (IoT) devices. Object detection in Computer Vision also plays a key role in improving road infrastructure and addressing traffic-related problems. This study aims to use You Only Look Once version 7 (YOLOv7), a highly optimized object-detection algorithm, together with the Convolutional Block Attention Module (CBAM) to detect and identify traffic signs, and analyze effective combinations of adaptive…

  • Open Access

    ARTICLE

    Siamese Dense Pixel-Level Fusion Network for Real-Time UAV Tracking

    Zhenyu Huang1,2, Gun Li2, Xudong Sun1, Yong Chen1, Jie Sun1, Zhangsong Ni1,*, Yang Yang1,*

    CMC-Computers, Materials & Continua, Vol.76, No.3, pp. 3219-3238, 2023, DOI:10.32604/cmc.2023.039489

    Abstract Onboard visual object tracking in unmanned aerial vehicles (UAVs) has attracted much interest due to its versatility. Meanwhile, owing to their high precision, Siamese networks have become a research hotspot in visual object tracking. However, most Siamese trackers fail to balance tracking accuracy and time within the limited onboard computational resources of UAVs. To meet the tracking precision and real-time requirements, this paper proposes a Siamese dense pixel-level network for UAV object tracking named SiamDPL. Specifically, the Siamese network extracts features of the search region and the template region through a parameter-shared backbone network, then performs correlation matching to obtain the candidate…

  • Open Access

    ARTICLE

    Single Image Deraining Using Dual Branch Network Based on Attention Mechanism for IoT

    Di Wang, Bingcai Wei, Liye Zhang*

    CMES-Computer Modeling in Engineering & Sciences, Vol.137, No.2, pp. 1989-2000, 2023, DOI:10.32604/cmes.2023.028529

    Abstract Extracting useful details from images is essential for Internet of Things projects. However, in real life, various external conditions, such as bad weather, occlude key target information and distort images. This makes key information difficult to extract, affects the judgment of the real situation in Internet of Things processes, and causes system decision-making errors and accidents. In this paper, we mainly address the problem of rain occluding the image: we remove the rain streaks and recover a clear, rain-free image. Therefore, the single image deraining algorithm…

  • Open Access

    ARTICLE

    Classifying Hematoxylin and Eosin Images Using a Super-Resolution Segmentor and a Deep Ensemble Classifier

    P. Sabitha*, G. Meeragandhi

    Intelligent Automation & Soft Computing, Vol.37, No.2, pp. 1983-2000, 2023, DOI:10.32604/iasc.2023.034402

    Abstract Developing an automatic and credible diagnostic system to analyze the type, stage, and level of liver cancer from Hematoxylin and Eosin (H&E) images is a very challenging and time-consuming endeavor, even for experienced pathologists, due to non-uniform illumination and artifacts. Although several Machine Learning (ML) and Deep Learning (DL) approaches have been employed to increase the performance of automatic liver cancer diagnostic systems, the classification accuracy of these systems still needs significant improvement to satisfy the real-time requirement of diagnostic situations. In this work, we present a new Ensemble Classifier (hereafter called ECNet) to classify the H&E stained…

  • Open Access

    ARTICLE

    PF-YOLOv4-Tiny: Towards Infrared Target Detection on Embedded Platform

    Wenbo Li, Qi Wang*, Shang Gao

    Intelligent Automation & Soft Computing, Vol.37, No.1, pp. 921-938, 2023, DOI:10.32604/iasc.2023.038257

    Abstract Infrared target detection models increasingly need to be deployed on embedded platforms, which requires models with less memory consumption and better real-time performance while still accounting for accuracy. To address these challenges, we propose a modified You Only Look Once (YOLO) algorithm, PF-YOLOv4-Tiny. The algorithm incorporates spatial pyramid pooling (SPP) and squeeze-and-excitation (SE) visual attention modules to enhance the target localization capability. PANet-based feature pyramid networks (P-FPN) are proposed to transfer semantic information and location information simultaneously to improve detection accuracy. To lighten the network, the standard convolutions other than the backbone network are replaced with depthwise…

  • Open Access

    ARTICLE

    An Efficient Indoor Localization Based on Deep Attention Learning Model

    Amr Abozeid1,*, Ahmed I. Taloba1,2, Rasha M. Abd El-Aziz1,3, Alhanoof Faiz Alwaghid1, Mostafa Salem3, Ahmed Elhadad1,4

    Computer Systems Science and Engineering, Vol.46, No.2, pp. 2637-2650, 2023, DOI:10.32604/csse.2023.037761

    Abstract Indoor localization methods can help many sectors, such as healthcare centers, smart homes, museums, warehouses, and retail malls, improve their service areas. As a result, it is crucial to look for low-cost methods that can provide exact localization in indoor locations. In this context, image-based localization methods can play an important role in estimating both the position and the orientation of cameras relative to an object. Image-based localization faces many issues, such as image scale and rotation variance. Also, image-based localization’s accuracy and speed (latency) are two critical factors. This paper proposes an efficient 6-DoF deep-learning model for image-based localization. This…

  • Open Access

    ARTICLE

    3D Vehicle Detection Algorithm Based on Multimodal Decision-Level Fusion

    Peicheng Shi1,*, Heng Qi1, Zhiqiang Liu1, Aixi Yang2

    CMES-Computer Modeling in Engineering & Sciences, Vol.135, No.3, pp. 2007-2023, 2023, DOI:10.32604/cmes.2023.022304

    Abstract 3D vehicle detection based on LiDAR-camera fusion is becoming an emerging research topic in autonomous driving. The algorithm based on the Camera-LiDAR object candidate fusion method (CLOCs) is currently considered to be a more effective decision-level fusion algorithm, but it does not fully utilize the extracted 3D and 2D features. Therefore, we propose a 3D vehicle detection algorithm based on multimodal decision-level fusion. First, we project the anchor point of the 3D detection bounding box into the 2D image, calculate the distance between 2D and 3D anchor points, and use this distance as a new fusion feature to enhance the…

  • Open Access

    ARTICLE

    Disease Recognition of Apple Leaf Using Lightweight Multi-Scale Network with ECANet

    Helong Yu, Xianhe Cheng, Ziqing Li, Qi Cai, Chunguang Bi*

    CMES-Computer Modeling in Engineering & Sciences, Vol.132, No.3, pp. 711-738, 2022, DOI:10.32604/cmes.2022.020263

    Abstract To address the difficulty of identifying apple diseases in the natural environment and the low application rate of deep learning recognition networks, a lightweight ResNet (LW-ResNet) model for apple disease recognition is proposed. Based on the deep residual network (ResNet18), the multi-scale feature extraction layer is constructed by group convolution to compress the model and improve its ability to extract lesion features of different sizes. The identity mapping structure is improved to reduce information loss, and the efficient channel attention module (ECANet) is introduced to suppress noise from complex backgrounds. The experimental results show that the average…

Displaying 1-10 of 17 results (page 1).