Open Access


Menu Text Recognition of Few-shot Learning

Xiaoyu1,2, Tian Zhenzhen2, Xin Zihao2, Liu Suolan2, Chen Fuhua3, Wang Hongyuan2,*

1 School of Computer Science and Artificial Intelligence, Changzhou, 213164, China
2 Changzhou University, Changzhou, Jiangsu, 213164, China
3 West Liberty University, 208 University Drive, West Liberty, 26074, USA

* Corresponding Author: Wang Hongyuan.

Journal of New Media 2022, 4(3), 137-143.


Recent advances in OCR show that end-to-end (E2E) training pipelines that combine text detection and recognition achieve the best results. However, most existing methods focus on case-insensitive English characters. In this paper, we apply an E2E approach, the Multiplexed Multilingual Mask TextSpotter, which performs script identification at the word level and uses different recognition heads to handle different scripts while maintaining a unified loss, so that script identification and the multiple recognition heads are optimized simultaneously. Experiments show that this method outperforms a single-head model with a similar number of parameters on end-to-end recognition tasks.
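
The routing idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`route_word`, `unified_loss`), the weighting parameter `alpha`, the toy heads, and the particular way the two loss terms are summed are all assumptions made for illustration. The sketch assumes a script classifier emits one logit per script for each word crop, the highest-scoring script selects the recognition head, and training adds a script-identification cross-entropy term to the recognition loss of the ground-truth script's head.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route_word(script_logits, heads):
    """Pick the recognition head for one word crop from its script scores.

    `heads` is a hypothetical list of callables, one per script.
    """
    probs = softmax(script_logits)
    script_idx = max(range(len(probs)), key=probs.__getitem__)
    return script_idx, heads[script_idx]


def unified_loss(script_logits, true_script, head_losses, alpha=1.0):
    """Assumed combined objective: script cross-entropy plus the
    recognition loss of the head that matches the true script."""
    probs = softmax(script_logits)
    script_ce = -math.log(probs[true_script])
    return alpha * script_ce + head_losses[true_script]


# Toy stand-ins for per-script recognition heads (hypothetical).
heads = [
    lambda crop: "latin:" + crop,
    lambda crop: "chinese:" + crop,
]

idx, head = route_word([0.2, 2.5], heads)
print(idx, head("crop_0"))
print(unified_loss([0.2, 2.5], true_script=1, head_losses=[0.9, 0.4]))
```

Because both terms enter a single scalar, one backward pass in a real framework would update the script classifier and the selected head together, which is the sense in which the two are optimized simultaneously.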


Cite This Article

. Xiaoyu, T. Zhenzhen, X. Zihao, L. Suolan, C. Fuhua et al., "Menu text recognition of few-shot learning," Journal of New Media, vol. 4, no. 3, pp. 137–143, 2022.

This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.