Home / Advanced Search

  • Title/Keywords

  • Author/Affliations

  • Journal

  • Article Type

  • Start Year

  • End Year

Update SearchingClear
  • Articles
  • Online
Search Results (22)
  • Open Access

    REVIEW

    Overview of 3D Human Pose Estimation

    Jianchu Lin1,2, Shuang Li3, Hong Qin3,4, Hongchang Wang3, Ning Cui6, Qian Jiang7, Haifang Jian3,*, Gongming Wang5,*

    CMES-Computer Modeling in Engineering & Sciences, Vol.134, No.3, pp. 1621-1651, 2023, DOI:10.32604/cmes.2022.020857 - 20 September 2022

    Abstract 3D human pose estimation is a major focus area in the field of computer vision, which plays an important role in practical applications. This article summarizes the framework and research progress related to the estimation of monocular RGB images and videos. An overall perspective of methods integrated with deep learning is introduced. Novel image-based and video-based inputs are proposed as the analysis framework. From this viewpoint, common problems are discussed. The diversity of human postures usually leads to problems such as occlusion and ambiguity, and the lack of training datasets often results in poor generalization… More >

  • Open Access

    ARTICLE

    Research on Multi-View Image Reconstruction Technology Based on Auto-Encoding Learning

    Tao Zhang1, Shaokui Gu1, Jinxing Niu1,*, Yi Cao2

    CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 4603-4614, 2022, DOI:10.32604/cmc.2022.027079 - 21 April 2022

    Abstract Traditional three-dimensional (3D) image reconstruction method, which highly dependent on the environment and has poor reconstruction effect, is easy to lead to mismatch and poor real-time performance. The accuracy of feature extraction from multiple images affects the reliability and real-time performance of 3D reconstruction technology. To solve the problem, a multi-view image 3D reconstruction algorithm based on self-encoding convolutional neural network is proposed in this paper. The algorithm first extracts the feature information of multiple two-dimensional (2D) images based on scale and rotation invariance parameters of Scale-invariant feature transform (SIFT) operator. Secondly, self-encoding learning neural… More >

  • Open Access

    ARTICLE

    Multi-View Auxiliary Diagnosis Algorithm for Lung Nodules

    Shi Qiu1, Bin Li2,*, Tao Zhou3, Feng Li4, Ting Liang5

    CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 4897-4910, 2022, DOI:10.32604/cmc.2022.026855 - 21 April 2022

    Abstract Lung is an important organ of human body. More and more people are suffering from lung diseases due to air pollution. These diseases are usually highly infectious. Such as lung tuberculosis, novel coronavirus COVID-19, etc. Lung nodule is a kind of high-density globular lesion in the lung. Physicians need to spend a lot of time and energy to observe the computed tomography image sequences to make a diagnosis, which is inefficient. For this reason, the use of computer-assisted diagnosis of lung nodules has become the current main trend. In the process of computer-aided diagnosis, how… More >

  • Open Access

    ARTICLE

    An Innovative Approach Utilizing Binary-View Transformer for Speech Recognition Task

    Muhammad Babar Kamal1, Arfat Ahmad Khan2, Faizan Ahmed Khan3, Malik Muhammad Ali Shahid4, Chitapong Wechtaisong2,*, Muhammad Daud Kamal5, Muhammad Junaid Ali6, Peerapong Uthansakul2

    CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 5547-5562, 2022, DOI:10.32604/cmc.2022.024590 - 21 April 2022

    Abstract The deep learning advancements have greatly improved the performance of speech recognition systems, and most recent systems are based on the Recurrent Neural Network (RNN). Overall, the RNN works fine with the small sequence data, but suffers from the gradient vanishing problem in case of large sequence. The transformer networks have neutralized this issue and have shown state-of-the-art results on sequential or speech-related data. Generally, in speech recognition, the input audio is converted into an image using Mel-spectrogram to illustrate frequencies and intensities. The image is classified by the machine learning mechanism to generate a… More >

  • Open Access

    ARTICLE

    Brain Tumor Segmentation using Multi-View Attention based Ensemble Network

    Noreen Mushtaq1, Arfat Ahmad Khan2, Faizan Ahmed Khan3, Muhammad Junaid Ali4, Malik Muhammad Ali Shahid5, Chitapong Wechtaisong2,*, Peerapong Uthansakul2

    CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 5793-5806, 2022, DOI:10.32604/cmc.2022.024316 - 21 April 2022

    Abstract Astrocytoma IV or glioblastoma is one of the fatal and dangerous types of brain tumors. Early detection of brain tumor increases the survival rate and helps in reducing the fatality rate. Various imaging modalities have been used for diagnosing by expert radiologists, and Medical Resonance Image (MRI) is considered a better option for detecting brain tumors as MRI is a non-invasive technique and provides better visualization of the brain region. One of the challenging issues is to identify the tumorous region from the MRI scans correctly. Manual segmentation is performed by medical experts, which is… More >

  • Open Access

    ARTICLE

    Multi-View Multi-Modal Head-Gaze Estimation for Advanced Indoor User Interaction

    Jung-Hwa Kim1, Jin-Woo Jeong2,*

    CMC-Computers, Materials & Continua, Vol.70, No.3, pp. 5107-5132, 2022, DOI:10.32604/cmc.2022.021107 - 11 October 2021

    Abstract Gaze estimation is one of the most promising technologies for supporting indoor monitoring and interaction systems. However, previous gaze estimation techniques generally work only in a controlled laboratory environment because they require a number of high-resolution eye images. This makes them unsuitable for welfare and healthcare facilities with the following challenging characteristics: 1) users’ continuous movements, 2) various lighting conditions, and 3) a limited amount of available data. To address these issues, we introduce a multi-view multi-modal head-gaze estimation system that translates the user’s head orientation into the gaze direction. The proposed system captures the… More >

  • Open Access

    ARTICLE

    Trade-Off between Efficiency and Effectiveness: A Late Fusion Multi-View Clustering Algorithm

    Yunping Zhao1, Weixuan Liang1, Jianzhuang Lu1,*, Xiaowen Chen1, Nijiwa Kong2

    CMC-Computers, Materials & Continua, Vol.66, No.3, pp. 2709-2722, 2021, DOI:10.32604/cmc.2021.013389 - 28 December 2020

    Abstract Late fusion multi-view clustering (LFMVC) algorithms aim to integrate the base partition of each single view into a consensus partition. Base partitions can be obtained by performing kernel k-means clustering on all views. This type of method is not only computationally efficient, but also more accurate than multiple kernel k-means, and is thus widely used in the multi-view clustering context. LFMVC improves computational efficiency to the extent that the computational complexity of each iteration is reduced from O(n3) to O(n) (where n is the number of samples). However, LFMVC also limits the search space of the… More >

  • Open Access

    ARTICLE

    A Multi-View Gait Recognition Method Using Deep Convolutional Neural Network and Channel Attention Mechanism

    Jiabin Wang*, Kai Peng

    CMES-Computer Modeling in Engineering & Sciences, Vol.125, No.1, pp. 345-363, 2020, DOI:10.32604/cmes.2020.011046 - 18 September 2020

    Abstract In many existing multi-view gait recognition methods based on images or video sequences, gait sequences are usually used to superimpose and synthesize images and construct energy-like template. However, information may be lost during the process of compositing image and capture EMG signals. Errors and the recognition accuracy may be introduced and affected respectively by some factors such as period detection. To better solve the problems, a multi-view gait recognition method using deep convolutional neural network and channel attention mechanism is proposed. Firstly, the sliding time window method is used to capture EMG signals. Then, the… More >

  • Open Access

    ARTICLE

    Bandwidth-Efficient Transmission Method for User View-Oriented Video Services

    Minjae Seo1, Jong-Ho Paik2, *

    CMC-Computers, Materials & Continua, Vol.65, No.3, pp. 2571-2589, 2020, DOI:10.32604/cmc.2020.011347 - 16 September 2020

    Abstract The trend in video viewing has been evolving beyond simply providing a multiview option. Recently, a function that allows selection and viewing of a clip from a multiview service that captures a specific range or object has been added. In particular, the freeview service is an extended concept of multi-view and provides a freer viewpoint. However, since numerous videos and additional data are required for its construction, all of the clips constituting the content cannot be simultaneously provided. Only certain clips are selected and provided to the user. If the video is not the preferred… More >

  • Open Access

    ARTICLE

    Multi-Index Image Retrieval Hash Algorithm Based on Multi-View Feature Coding

    Rong Duan1, Junshan Tan1, *, Jiaohua Qin1, Xuyu Xiang1, Yun Tan1, Neal N. Xiong2

    CMC-Computers, Materials & Continua, Vol.65, No.3, pp. 2335-2350, 2020, DOI:10.32604/cmc.2020.012161 - 16 September 2020

    Abstract In recent years, with the massive growth of image data, how to match the image required by users quickly and efficiently becomes a challenge. Compared with single-view feature, multi-view feature is more accurate to describe image information. The advantages of hash method in reducing data storage and improving efficiency also make us study how to effectively apply to large-scale image retrieval. In this paper, a hash algorithm of multi-index image retrieval based on multi-view feature coding is proposed. By learning the data correlation between different views, this algorithm uses multi-view data with deeper level image More >

Displaying 11-20 on page 2 of 22. Per Page