Search Results (164)
  • Open Access

    ARTICLE

    Multi-Stream Temporally Enhanced Network for Video Salient Object Detection

    Dan Xu*, Jiale Ru, Jinlong Shi

    CMC-Computers, Materials & Continua, Vol.78, No.1, pp. 85-104, 2024, DOI:10.32604/cmc.2023.045258

    Abstract Video salient object detection (VSOD) aims at locating the most attractive objects in a video by exploring the spatial and temporal features. VSOD poses a challenging task in computer vision, as it involves processing complex spatial data that is also influenced by temporal dynamics. Despite the progress made in existing VSOD models, they still struggle in scenes of great background diversity within and between frames. Additionally, they encounter difficulties related to accumulated noise and high time consumption during the extraction of temporal features over a long-term duration. We propose a multi-stream temporal enhanced network (MSTENet) to address these problems. It…

  • Open Access

    ARTICLE

    Design of a Lightweight Compressed Video Stream-Based Patient Activity Monitoring System

    Sangeeta Yadav1, Preeti Gulia1,*, Nasib Singh Gill1,*, Piyush Kumar Shukla2, Arfat Ahmad Khan3, Sultan Alharby4, Ahmed Alhussen4, Mohd Anul Haq5

    CMC-Computers, Materials & Continua, Vol.78, No.1, pp. 1253-1274, 2024, DOI:10.32604/cmc.2023.042869

    Abstract Inpatient falls from beds in hospitals are a common problem. Such falls may result in severe injuries. This problem can be addressed by continuous monitoring of patients using cameras. Recent advancements in deep learning-based video analytics have made this task of fall detection more effective and efficient. Along with fall detection, monitoring of different activities of the patients is also of significant concern to assess the improvement in their health. Highly computation-intensive models are required to monitor every action of the patient precisely. This requirement limits the applicability of such networks. Hence, to keep the model lightweight, the already designed…

  • Open Access

    REVIEW

    Exploring the Latest Applications of OpenAI and ChatGPT: An In-Depth Survey

    Hong Zhang1,*, Haijian Shao2

    CMES-Computer Modeling in Engineering & Sciences, Vol.138, No.3, pp. 2061-2102, 2024, DOI:10.32604/cmes.2023.030649

    Abstract OpenAI and ChatGPT, as state-of-the-art language models driven by cutting-edge artificial intelligence technology, have gained widespread adoption across diverse industries. In the realm of computer vision, these models have been employed for intricate tasks including object recognition, image generation, and image processing, leveraging their advanced capabilities to fuel transformative breakthroughs. Within the gaming industry, they have found utility in crafting virtual characters and generating plots and dialogues, thereby enabling immersive and interactive player experiences. Furthermore, these models have been harnessed in the realm of medical diagnosis, providing invaluable insights and support to healthcare professionals in the realm of disease detection.…

  • Open Access

    ARTICLE

    Automated Video Generation of Moving Digits from Text Using Deep Deconvolutional Generative Adversarial Network

    Anwar Ullah1, Xinguo Yu1,*, Muhammad Numan2

    CMC-Computers, Materials & Continua, Vol.77, No.2, pp. 2359-2383, 2023, DOI:10.32604/cmc.2023.041219

    Abstract Generating realistic and synthetic video from text is a highly challenging task due to the multitude of issues involved, including digit deformation, noise interference between frames, blurred output, and the need for temporal coherence across frames. In this paper, we propose a novel approach for generating coherent videos of moving digits from textual input using a Deep Deconvolutional Generative Adversarial Network (DD-GAN). The DD-GAN comprises a Deep Deconvolutional Neural Network (DDNN) as a Generator (G) and a modified Deep Convolutional Neural Network (DCNN) as a Discriminator (D) to ensure temporal coherence between adjacent frames. The proposed research involves several steps.…

  • Open Access

    ARTICLE

    HybridHR-Net: Action Recognition in Video Sequences Using Optimal Deep Learning Fusion Assisted Framework

    Muhammad Naeem Akbar1,*, Seemab Khan2, Muhammad Umar Farooq1, Majed Alhaisoni3, Usman Tariq4, Muhammad Usman Akram1

    CMC-Computers, Materials & Continua, Vol.76, No.3, pp. 3275-3295, 2023, DOI:10.32604/cmc.2023.039289

    Abstract The combination of spatiotemporal videos and essential features can improve the performance of human action recognition (HAR); however, individual feature types usually degrade performance due to similar actions and complex backgrounds. The deep convolutional neural network has improved performance in recent years for several computer vision applications due to its spatial information. This article proposes a new framework for human action recognition in video surveillance, dubbed HybridHR-Net. On a few selected datasets, deep transfer learning is used to train the pre-trained EfficientNet-b0 deep learning model. Bayesian optimization is employed for the tuning of hyperparameters of the fine-tuned deep…

  • Open Access

    ARTICLE

    New Fragile Watermarking Technique to Identify Inserted Video Objects Using H.264 and Color Features

    Raheem Ogla1,*, Eman Shakar Mahmood1, Rasha I. Ahmed1, Abdul Monem S. Rahma2

    CMC-Computers, Materials & Continua, Vol.76, No.3, pp. 3075-3096, 2023, DOI:10.32604/cmc.2023.039818

    Abstract The transmission of video content over a network raises various issues relating to copyright authenticity, ethics, legality, and privacy. The protection of copyrighted video content is a significant issue in the video industry, and it is essential to find effective solutions to prevent tampering and modification of digital video content during its transmission through digital media. However, there are still many unresolved challenges. This paper aims to address those challenges by proposing a new technique for detecting moving objects in digital videos, which can help prove the credibility of video content by detecting any fake objects inserted by hackers. The…

  • Open Access

    ARTICLE

    Deep Learning-Based Action Classification Using One-Shot Object Detection

    Hyun Yoo1, Seo-El Lee2, Kyungyong Chung3,*

    CMC-Computers, Materials & Continua, Vol.76, No.2, pp. 1343-1359, 2023, DOI:10.32604/cmc.2023.039263

    Abstract Deep learning-based action classification technology has been applied to various fields, such as social safety, medical services, and sports. Analyzing an action on a practical level requires tracking multiple human bodies in an image in real-time and simultaneously classifying their actions. There are various related studies on the real-time classification of actions in an image. However, existing deep learning-based action classification models have slow response times, which limits real-time analysis. In addition, they classify the action of each object with low accuracy when multiple objects appear in the image. Also, they need to be improved since it…

  • Open Access

    ARTICLE

    Broad Federated Meta-Learning of Damaged Objects in Aerial Videos

    Zekai Li1, Wenfeng Wang2,3,4,5,6,*

    CMES-Computer Modeling in Engineering & Sciences, Vol.137, No.3, pp. 2881-2899, 2023, DOI:10.32604/cmes.2023.028670

    Abstract We advanced an emerging federated learning technology in city intelligentization to tackle a real challenge: learning damaged objects in aerial videos. A meta-learning system was integrated with the fuzzy broad learning system to further develop the theory of federated learning. Both the mixed picture set of aerial video segmentation and the 3D-reconstructed mixed-reality data were employed in the performance of the broad federated meta-learning system. The study results indicated that the object classification accuracy is up to 90% and the average time cost in damage detection is only 0.277 s. Consequently, the broad federated meta-learning system is efficient and…

  • Open Access

    ARTICLE

    Suspicious Activities Recognition in Video Sequences Using DarkNet-NasNet Optimal Deep Features

    Safdar Khan1, Muhammad Attique Khan2, Jamal Hussain Shah1,*, Faheem Shehzad2, Taerang Kim3, Jae-Hyuk Cha3

    Computer Systems Science and Engineering, Vol.47, No.2, pp. 2337-2360, 2023, DOI:10.32604/csse.2023.040410

    Abstract Human Suspicious Activity Recognition (HSAR) is a critical and active research area in computer vision that relies on artificial intelligence reasoning. Significant advances have been made in this field recently due to important applications such as video surveillance. In video surveillance, humans are monitored through video cameras when doing suspicious activities such as kidnapping, fighting, snatching, and a few more. Although numerous techniques have been introduced in the literature for routine human action recognition (HAR), very few studies are available for HSAR. This study proposes a deep convolutional neural network (CNN) and optimal features-based framework for HSAR in video frames. The…

  • Open Access

    ARTICLE

    SlowFast Based Real-Time Human Motion Recognition with Action Localization

    Gyu-Il Kim1, Hyun Yoo2, Kyungyong Chung3,*

    Computer Systems Science and Engineering, Vol.47, No.2, pp. 2135-2152, 2023, DOI:10.32604/csse.2023.041030

    Abstract Artificial intelligence is increasingly being applied in the field of video analysis, particularly in the area of public safety where video surveillance equipment such as closed-circuit television (CCTV) is used and automated analysis of video information is required. However, various issues such as data size limitations and low processing speeds make real-time extraction of video data challenging. Video analysis technology applies object classification, detection, and relationship analysis to continuous 2D frame data, and the various meanings within the video are thus analyzed based on the extracted basic data. Motion recognition is key in this analysis. Motion recognition is a challenging…
