Table of Content

Open AccessOpen Access


Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks

Xianyu Wu1, Chao Luo1, Qian Zhang2, Jiliu Zhou1, Hao Yang1, 3, *, Yulian Li1

School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
School of Computer Science, University of Nottingham Jubilee Campus, NG8 1BB, UK.
School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, China.

* Corresponding Author: Hao Yang. Email: .

Computers, Materials & Continua 2019, 61(1), 289-300.


Words are the most indispensable information in human life. It is very important to analyze and understand the meaning of words. Compared with the general visual elements, the text conveys rich and high-level moral information, which enables the computer to better understand the semantic content of the text. With the rapid development of computer technology, great achievements have been made in text information detection and recognition. However, when dealing with text characters in natural scene images, there are still some limitations in the detection and recognition of natural scene images. Because natural scene image has more interference and complexity than text, these factors make the detection and recognition of natural scene image text face many challenges. To solve this problem, a new text detection and recognition method based on depth convolution neural network is proposed for natural scene image in this paper. In text detection, this method obtains high-level visual features from the bottom pixels by ResNet network, and extracts the context features from character sequences by BLSTM layer, then introduce to the idea of faster R-CNN vertical anchor point to find the bounding box of the detected text, which effectively improves the effect of text object detection. In addition, in text recognition task, DenseNet model is used to construct character recognition based on Kares. Finally, the output of Softmax is used to classify each character. Our method can replace the artificially defined features with automatic learning and context-based features. It improves the efficiency and accuracy of recognition, and realizes text detection and recognition of natural scene images. And on the PAC2018 competition platform, the experimental results have achieved good results.


Cite This Article

X. Wu, C. Luo, Q. Zhang, J. Zhou, H. Yang et al., "Text detection and recognition for natural scene images using deep convolutional neural networks," Computers, Materials & Continua, vol. 61, no.1, pp. 289–300, 2019.


This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 3685


  • 1757


  • 0


Share Link