Open Access iconOpen Access

ARTICLE

crossmark

Multi-Domain Deep Convolutional Neural Network for Ancient Urdu Text Recognition System

K. O. Mohammed Aarif1,*, P. Sivakumar2

1 Department of Electronics and Communication Engineering, C. Abdul Hakeem College of Engineering and Technology, Melvisharam, 632509, India
2 Department of Electronics and Communication Engineering, Dr. NGP Institute of Technology, Coimbatore, 614048, India

* Corresponding Author: K. O. Mohammed Aarif. Email: email

Intelligent Automation & Soft Computing 2022, 33(1), 275-289. https://doi.org/10.32604/iasc.2022.022805

Abstract

Deep learning has achieved magnificent success in the field of pattern recognition. In recent years Urdu character recognition system has significantly benefited from the effectiveness of the deep convolutional neural network. Majority of the research on Urdu text recognition are concentrated on formal handwritten and printed Urdu text document. In this paper, we experimented the Challenging issue of text recognition in Urdu ancient literature documents. Due to its cursiveness, complex word formation (ligatures), and context-sensitivity, and inadequate benchmark dataset, recognition of Urdu text from the literature document is very difficult to process compared to the formal Urdu text document. In this work, first, we generated a dataset by extracting the recurrent ligatures from an ancient Urdu fatawa book. Secondly, we categorized and augment the ligatures to generate batches of augmented images that improvise the training efficiency and classification accuracy. Finally, we proposed a multi-domain deep Convolutional Neural Network which integrates a spatial domain and a frequency domain CNN to learn the modular relations between features originating from the two different domain networks to train and improvise the classification accuracy. The experimental results show that the proposed network with the augmented dataset achieves an averaged accuracy of 97.8% which outperforms the other CNN models in this class. The experimental results also show that for the recognition of ancient Urdu literature, well-known benchmark datasets are not appropriate which is also verified with our prepared dataset.

Keywords


Cite This Article

APA Style
Aarif, K.O.M., Sivakumar, P. (2022). Multi-domain deep convolutional neural network for ancient urdu text recognition system. Intelligent Automation & Soft Computing, 33(1), 275-289. https://doi.org/10.32604/iasc.2022.022805
Vancouver Style
Aarif KOM, Sivakumar P. Multi-domain deep convolutional neural network for ancient urdu text recognition system. Intell Automat Soft Comput . 2022;33(1):275-289 https://doi.org/10.32604/iasc.2022.022805
IEEE Style
K.O.M. Aarif and P. Sivakumar, "Multi-Domain Deep Convolutional Neural Network for Ancient Urdu Text Recognition System," Intell. Automat. Soft Comput. , vol. 33, no. 1, pp. 275-289. 2022. https://doi.org/10.32604/iasc.2022.022805



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1471

    View

  • 968

    Download

  • 0

    Like

Share Link