Tech Science Press - Publisher of Open Access Journals

Overview of 3D Human Pose Estimation

Jianchu Lin^1,2, Shuang Li³, Hong Qin^3,4, Hongchang Wang³, Ning Cui⁶, Qian Jiang⁷, Haifang Jian^3,*, Gongming Wang^5,*

CMES-Computer Modeling in Engineering & Sciences, Vol.134, No.3, pp. 1621-1651, 2023, DOI:10.32604/cmes.2022.020857 - 20 September 2022

Abstract 3D human pose estimation is a major focus area in the field of computer vision, which plays an important role in practical applications. This article summarizes the framework and research progress related to the estimation of monocular RGB images and videos. An overall perspective of methods integrated with deep learning is introduced. Novel image-based and video-based inputs are proposed as the analysis framework. From this viewpoint, common problems are discussed. The diversity of human postures usually leads to problems such as occlusion and ambiguity, and the lack of training datasets often results in poor generalization… More >

Open Access

ARTICLE

Research on Multi-View Image Reconstruction Technology Based on Auto-Encoding Learning

Tao Zhang¹, Shaokui Gu¹, Jinxing Niu^1,*, Yi Cao²

CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 4603-4614, 2022, DOI:10.32604/cmc.2022.027079 - 21 April 2022

Abstract Traditional three-dimensional (3D) image reconstruction method, which highly dependent on the environment and has poor reconstruction effect, is easy to lead to mismatch and poor real-time performance. The accuracy of feature extraction from multiple images affects the reliability and real-time performance of 3D reconstruction technology. To solve the problem, a multi-view image 3D reconstruction algorithm based on self-encoding convolutional neural network is proposed in this paper. The algorithm first extracts the feature information of multiple two-dimensional (2D) images based on scale and rotation invariance parameters of Scale-invariant feature transform (SIFT) operator. Secondly, self-encoding learning neural… More >

Open Access

ARTICLE

Multi-View Auxiliary Diagnosis Algorithm for Lung Nodules

Shi Qiu¹, Bin Li^2,*, Tao Zhou³, Feng Li⁴, Ting Liang⁵

CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 4897-4910, 2022, DOI:10.32604/cmc.2022.026855 - 21 April 2022

Abstract Lung is an important organ of human body. More and more people are suffering from lung diseases due to air pollution. These diseases are usually highly infectious. Such as lung tuberculosis, novel coronavirus COVID-19, etc. Lung nodule is a kind of high-density globular lesion in the lung. Physicians need to spend a lot of time and energy to observe the computed tomography image sequences to make a diagnosis, which is inefficient. For this reason, the use of computer-assisted diagnosis of lung nodules has become the current main trend. In the process of computer-aided diagnosis, how… More >

Open Access

ARTICLE

An Innovative Approach Utilizing Binary-View Transformer for Speech Recognition Task

Muhammad Babar Kamal¹, Arfat Ahmad Khan², Faizan Ahmed Khan³, Malik Muhammad Ali Shahid⁴, Chitapong Wechtaisong^2,*, Muhammad Daud Kamal⁵, Muhammad Junaid Ali⁶, Peerapong Uthansakul²

CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 5547-5562, 2022, DOI:10.32604/cmc.2022.024590 - 21 April 2022

Abstract The deep learning advancements have greatly improved the performance of speech recognition systems, and most recent systems are based on the Recurrent Neural Network (RNN). Overall, the RNN works fine with the small sequence data, but suffers from the gradient vanishing problem in case of large sequence. The transformer networks have neutralized this issue and have shown state-of-the-art results on sequential or speech-related data. Generally, in speech recognition, the input audio is converted into an image using Mel-spectrogram to illustrate frequencies and intensities. The image is classified by the machine learning mechanism to generate a… More >

Open Access

ARTICLE

Brain Tumor Segmentation using Multi-View Attention based Ensemble Network

Noreen Mushtaq¹, Arfat Ahmad Khan², Faizan Ahmed Khan³, Muhammad Junaid Ali⁴, Malik Muhammad Ali Shahid⁵, Chitapong Wechtaisong^2,*, Peerapong Uthansakul²

CMC-Computers, Materials & Continua, Vol.72, No.3, pp. 5793-5806, 2022, DOI:10.32604/cmc.2022.024316 - 21 April 2022

Abstract Astrocytoma IV or glioblastoma is one of the fatal and dangerous types of brain tumors. Early detection of brain tumor increases the survival rate and helps in reducing the fatality rate. Various imaging modalities have been used for diagnosing by expert radiologists, and Medical Resonance Image (MRI) is considered a better option for detecting brain tumors as MRI is a non-invasive technique and provides better visualization of the brain region. One of the challenging issues is to identify the tumorous region from the MRI scans correctly. Manual segmentation is performed by medical experts, which is… More >

Open Access

ARTICLE

Multi-View Multi-Modal Head-Gaze Estimation for Advanced Indoor User Interaction

Jung-Hwa Kim¹, Jin-Woo Jeong^2,*

CMC-Computers, Materials & Continua, Vol.70, No.3, pp. 5107-5132, 2022, DOI:10.32604/cmc.2022.021107 - 11 October 2021

Abstract Gaze estimation is one of the most promising technologies for supporting indoor monitoring and interaction systems. However, previous gaze estimation techniques generally work only in a controlled laboratory environment because they require a number of high-resolution eye images. This makes them unsuitable for welfare and healthcare facilities with the following challenging characteristics: 1) users’ continuous movements, 2) various lighting conditions, and 3) a limited amount of available data. To address these issues, we introduce a multi-view multi-modal head-gaze estimation system that translates the user’s head orientation into the gaze direction. The proposed system captures the… More >

Open Access

ARTICLE

Trade-Off between Efficiency and Effectiveness: A Late Fusion Multi-View Clustering Algorithm

Yunping Zhao¹, Weixuan Liang¹, Jianzhuang Lu^1,*, Xiaowen Chen¹, Nijiwa Kong²

CMC-Computers, Materials & Continua, Vol.66, No.3, pp. 2709-2722, 2021, DOI:10.32604/cmc.2021.013389 - 28 December 2020

Abstract Late fusion multi-view clustering (LFMVC) algorithms aim to integrate the base partition of each single view into a consensus partition. Base partitions can be obtained by performing kernel k-means clustering on all views. This type of method is not only computationally efficient, but also more accurate than multiple kernel k-means, and is thus widely used in the multi-view clustering context. LFMVC improves computational efficiency to the extent that the computational complexity of each iteration is reduced from O(n³) to O(n) (where n is the number of samples). However, LFMVC also limits the search space of the… More >

Open Access

ARTICLE

A Multi-View Gait Recognition Method Using Deep Convolutional Neural Network and Channel Attention Mechanism

Jiabin Wang^*, Kai Peng

CMES-Computer Modeling in Engineering & Sciences, Vol.125, No.1, pp. 345-363, 2020, DOI:10.32604/cmes.2020.011046 - 18 September 2020

Abstract In many existing multi-view gait recognition methods based on images or video sequences, gait sequences are usually used to superimpose and synthesize images and construct energy-like template. However, information may be lost during the process of compositing image and capture EMG signals. Errors and the recognition accuracy may be introduced and affected respectively by some factors such as period detection. To better solve the problems, a multi-view gait recognition method using deep convolutional neural network and channel attention mechanism is proposed. Firstly, the sliding time window method is used to capture EMG signals. Then, the… More >

Open Access

ARTICLE

Bandwidth-Efficient Transmission Method for User View-Oriented Video Services

Minjae Seo¹, Jong-Ho Paik^{2, *}

CMC-Computers, Materials & Continua, Vol.65, No.3, pp. 2571-2589, 2020, DOI:10.32604/cmc.2020.011347 - 16 September 2020

Abstract The trend in video viewing has been evolving beyond simply providing a multiview option. Recently, a function that allows selection and viewing of a clip from a multiview service that captures a specific range or object has been added. In particular, the freeview service is an extended concept of multi-view and provides a freer viewpoint. However, since numerous videos and additional data are required for its construction, all of the clips constituting the content cannot be simultaneously provided. Only certain clips are selected and provided to the user. If the video is not the preferred… More >

Open Access

ARTICLE

Multi-Index Image Retrieval Hash Algorithm Based on Multi-View Feature Coding

Rong Duan¹, Junshan Tan^{1, *}, Jiaohua Qin¹, Xuyu Xiang¹, Yun Tan¹, Neal N. Xiong²

CMC-Computers, Materials & Continua, Vol.65, No.3, pp. 2335-2350, 2020, DOI:10.32604/cmc.2020.012161 - 16 September 2020

Abstract In recent years, with the massive growth of image data, how to match the image required by users quickly and efficiently becomes a challenge. Compared with single-view feature, multi-view feature is more accurate to describe image information. The advantages of hash method in reducing data storage and improving efficiency also make us study how to effectively apply to large-scale image retrieval. In this paper, a hash algorithm of multi-index image retrieval based on multi-view feature coding is proposed. By learning the data correlation between different views, this algorithm uses multi-view data with deeper level image More >

Displaying 11-20 on page 2 of 22. Per Page

View

2654

Download

1252

View

1712

Download

973

View

1520

Download

921

View

1647

Download

1000

View

2156

Download

1226

View

1933

Download

1617

Cited by

1

View

1880

Download

1658

View

4073

Download

2670

Like

1

Cited by

1

View

2164

Download

1551

View

2030

Download

1356

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp: