Open Access iconOpen Access

ARTICLE

crossmark

End-to-end Handwritten Chinese Paragraph Text Recognition Using Residual Attention Networks

Yintong Wang1,2,*, Yingjie Yang2, Haiyan Chen3, Hao Zheng1, Heyou Chang1

1 School of Information Engineering, Nanjing Xiaozhuang University, Nanjing, 211171, China
2 Institute of Artificial Intelligence, De Montfort University, Leicester, LE1 9BH, United Kingdom
3 College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, 210016, China

* Corresponding Author: Yintong Wang. Email: email

Intelligent Automation & Soft Computing 2022, 34(1), 371-388. https://doi.org/10.32604/iasc.2022.027146

Abstract

Handwritten Chinese recognition which involves variant writing style, thousands of character categories and monotonous data mark process is a long-term focus in the field of pattern recognition research. The existing methods are facing huge challenges including the complex structure of character/line-touching, the discriminate ability of similar characters and the labeling of training datasets. To deal with these challenges, an end-to-end residual attention handwritten Chinese paragraph text recognition method is proposed, which uses fully convolutional neural networks as the main structure of feature extraction and employs connectionist temporal classification as a loss function. The novel residual attention gate block is more helpful in extracting essential features and making the training of deep convolutional neural networks more effective. In addition, we introduce the operations of batch bilinear interpolation which implement the mapping of two dimension text representation to one dimension text line representation without any position information of characters or text lines, and greatly reduce the labeling workload in preparing training datasets. In experimental, the proposed method is verified with two widely adopted handwritten Chinese text datasets, and achieves competitive results to the current state-of-the-art methods. Without using any position information of characters and text line, an accuracy rate of 90.53% is obtained in CASIA-HWDB test set.

Keywords


Cite This Article

Y. Wang, Y. Yang, H. Chen, H. Zheng and H. Chang, "End-to-end handwritten chinese paragraph text recognition using residual attention networks," Intelligent Automation & Soft Computing, vol. 34, no.1, pp. 371–388, 2022. https://doi.org/10.32604/iasc.2022.027146



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1331

    View

  • 766

    Download

  • 0

    Like

Share Link