Tech Science Press - Publisher of Open Access Journals

Open Access

ARTICLE

Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks

Xianyu Wu¹, Chao Luo¹, Qian Zhang², Jiliu Zhou¹, Hao Yang^{1, 3, *}, Yulian Li¹

CMC-Computers, Materials & Continua, Vol.61, No.1, pp. 289-300, 2019, DOI:10.32604/cmc.2019.05990

Abstract Words are the most indispensable information in human life. It is very important to analyze and understand the meaning of words. Compared with the general visual elements, the text conveys rich and high-level moral information, which enables the computer to better understand the semantic content of the text. With the rapid development of computer technology, great achievements have been made in text information detection and recognition. However, when dealing with text characters in natural scene images, there are still some limitations in the detection and recognition of natural scene images. Because natural scene image has more interference and complexity than… More >

Open Access

ARTICLE

Tibetan Multi-Dialect Speech and Dialect Identity Recognition

Yue Zhao¹, Jianjian Yue¹, Wei Song^1,*, Xiaona Xu¹, Xiali Li¹, Licheng Wu¹, Qiang Ji²

CMC-Computers, Materials & Continua, Vol.60, No.3, pp. 1223-1235, 2019, DOI:10.32604/cmc.2019.05636

Abstract Tibetan language has very limited resource for conventional automatic speech recognition so far. It lacks of enough data, sub-word unit, lexicons and word inventories for some dialects. And speech content recognition and dialect classification have been treated as two independent tasks and modeled respectively in most prior works. But the two tasks are highly correlated. In this paper, we present a multi-task WaveNet model to perform simultaneous Tibetan multi-dialect speech recognition and dialect identification. It avoids processing the pronunciation dictionary and word segmentation for new dialects, while, in the meantime, allows training speech recognition and dialect identification in a single… More >

Open Access

ARTICLE

Attention-Aware Network with Latent Semantic Analysis for Clothing Invariant Gait Recognition

Hefei Ling¹, Jia Wu¹, Ping Li^1,*, Jialie Shen²

CMC-Computers, Materials & Continua, Vol.60, No.3, pp. 1041-1054, 2019, DOI:10.32604/cmc.2019.05605

Abstract Gait recognition is a complicated task due to the existence of co-factors like carrying conditions, clothing, viewpoints, and surfaces which change the appearance of gait more or less. Among those co-factors, clothing analysis is the most challenging one in the area. Conventional methods which are proposed for clothing invariant gait recognition show the body parts and the underlying relationships from them are important for gait recognition. Fortunately, attention mechanism shows dramatic performance for highlighting discriminative regions. Meanwhile, latent semantic analysis is known for the ability of capturing latent semantic variables to represent the underlying attributes and capturing the relationships from… More >

Open Access

ARTICLE

A Novel Scene Text Recognition Method Based on Deep Learning

Maosen Wang¹, Shaozhang Niu^1,*, Zhenguang Gao²

CMC-Computers, Materials & Continua, Vol.60, No.2, pp. 781-794, 2019, DOI:10.32604/cmc.2019.05595

Abstract Scene text recognition is one of the most important techniques in pattern recognition and machine intelligence due to its numerous practical applications. Scene text recognition is also a sequence model task. Recurrent neural network (RNN) is commonly regarded as the default starting point for sequential models. Due to the non-parallel prediction and the gradient disappearance problem, the performance of the RNN is difficult to improve substantially. In this paper, a new TRDD network architecture which base on dilated convolution and residual block is proposed, using Convolutional Neural Networks (CNN) instead of RNN realizes the recognition task of sequence texts. Our… More >

Open Access

ARTICLE

Adaptive Median Filtering Algorithm Based on Divide and Conquer and Its Application in CAPTCHA Recognition

Wentao Ma¹, Jiaohua Qin^1,*, Xuyu Xiang¹, Yun Tan¹, Yuanjing Luo¹, Neal N. Xiong²

CMC-Computers, Materials & Continua, Vol.58, No.3, pp. 665-677, 2019, DOI:10.32604/cmc.2019.05683

Abstract As the first barrier to protect cyberspace, the CAPTCHA has made significant contributions to maintaining Internet security and preventing malicious attacks. By researching the CAPTCHA, we can find its vulnerability and improve the security of CAPTCHA. Recently, many studies have shown that improving the image preprocessing effect of the CAPTCHA, which can achieve a better recognition rate by the state-of-the-art machine learning algorithms. There are many kinds of noise and distortion in the CAPTCHA images of this experiment. We propose an adaptive median filtering algorithm based on divide and conquer in this paper. Firstly, the filtering window data quickly sorted… More >

Open Access

ARTICLE

Detecting Iris Liveness with Batch Normalized Convolutional Neural Network

Min Long^1,2,*, Yan Zeng¹

CMC-Computers, Materials & Continua, Vol.58, No.2, pp. 493-504, 2019, DOI:10.32604/cmc.2019.04378

Abstract Aim to countermeasure the presentation attack for iris recognition system, an iris liveness detection scheme based on batch normalized convolutional neural network (BNCNN) is proposed to improve the reliability of the iris authentication system. The BNCNN architecture with eighteen layers is constructed to detect the genuine iris and fake iris, including convolutional layer, batch-normalized (BN) layer, Relu layer, pooling layer and full connected layer. The iris image is first preprocessed by iris segmentation and is normalized to 256×256 pixels, and then the iris features are extracted by BNCNN. With these features, the genuine iris and fake iris are determined by… More >

Open Access

ARTICLE

Cross-Lingual Non-Ferrous Metals Related News Recognition Method Based on CNN with A Limited Bi-Lingual Dictionary

Xudong Hong¹, Xiao Zheng^1,*, Jinyuan Xia¹, Linna Wei¹, Wei Xue¹

CMC-Computers, Materials & Continua, Vol.58, No.2, pp. 379-389, 2019, DOI:10.32604/cmc.2019.04059

Abstract To acquire non-ferrous metals related news from different countries’ internet, we proposed a cross-lingual non-ferrous metals related news recognition method based on CNN with a limited bilingual dictionary. Firstly, considering the lack of related language resources of non-ferrous metals, we use a limited bilingual dictionary and CCA to learn cross-lingual word vector and to represent news in different languages uniformly. Then, to improve the effect of recognition, we use a variant of the CNN to learn recognition features and construct the recognition model. The experimental results show that our proposed method acquires better results. More >

Open Access

ARTICLE

ia-PNCC: Noise Processing Method for Underwater Target Recognition Convolutional Neural Network

Nianbin Wang¹, Ming He^1,2, Jianguo Sun^1,*, Hongbin Wang¹, Lianke Zhou¹, Ci Chu¹, Lei Chen³

CMC-Computers, Materials & Continua, Vol.58, No.1, pp. 169-181, 2019, DOI:10.32604/cmc.2019.03709

Abstract Underwater target recognition is a key technology for underwater acoustic countermeasure. How to classify and recognize underwater targets according to the noise information of underwater targets has been a hot topic in the field of underwater acoustic signals. In this paper, the deep learning model is applied to underwater target recognition. Improved anti-noise Power-Normalized Cepstral Coefficients (ia-PNCC) is proposed, based on PNCC applied to underwater noises. Multitaper and normalized Gammatone filter banks are applied to improve the anti-noise capacity. The method is combined with a convolutional neural network in order to recognize the underwater target. Experiment results show that the… More >

Open Access

ARTICLE

A Method for Improving CNN-Based Image Recognition Using DCGAN

Wei Fang^1,2, Feihong Zhang^1,*, Victor S. Sheng³, Yewen Ding¹

CMC-Computers, Materials & Continua, Vol.57, No.1, pp. 167-178, 2018, DOI:10.32604/cmc.2018.02356

Abstract Image recognition has always been a hot research topic in the scientific community and industry. The emergence of convolutional neural networks(CNN) has made this technology turned into research focus on the field of computer vision, especially in image recognition. But it makes the recognition result largely dependent on the number and quality of training samples. Recently, DCGAN has become a frontier method for generating images, sounds, and videos. In this paper, DCGAN is used to generate sample that is difficult to collect and proposed an efficient design method of generating model. We combine DCGAN with CNN for the second time.… More >

Open Access

ARTICLE

Improved VGG Model for Road Traffic Sign Recognition

Shuren Zhou^1,2,*, Wenlong Liang^1,2, Junguo Li^1,2, Jeong-Uk Kim³

CMC-Computers, Materials & Continua, Vol.57, No.1, pp. 11-24, 2018, DOI:10.32604/cmc.2018.02617

Abstract Road traffic sign recognition is an important task in intelligent transportation system. Convolutional neural networks (CNNs) have achieved a breakthrough in computer vision tasks and made great success in traffic sign classification. In this paper, it presents a road traffic sign recognition algorithm based on a convolutional neural network. In natural scenes, traffic signs are disturbed by factors such as illumination, occlusion, missing and deformation, and the accuracy of recognition decreases, this paper proposes a model called Improved VGG (IVGG) inspired by VGG model. The IVGG model includes 9 layers, compared with the original VGG model, it is added max-pooling… More >

Displaying 511-520 on page 52 of 521. Per Page

View

4264

Download

1943

Like

0

Cited by

6

View

2558

Download

1544

Like

0

Cited by

2

View

2350

Download

1334

Like

0

Cited by

2

View

2352

Download

1568

Like

0

Cited by

6

View

3288

Download

1507

Like

0

Cited by

8

View

3829

Download

1851

Like

1

Cited by

47

View

2935

Download

1339

Like

0

Cited by

6

View

3936

Download

1650

Like

1

Cited by

12

View

8399

Download

3557

Like

1

Cited by

30

View

6906

Download

2783

Like

2

Cited by

27