KurdSet: A Kurdish Handwritten Characters Recognition Dataset Using Convolutional Neural Network

Sardar Ali; Maiwan Abdulrazzaq

doi:10.32604/cmc.2024.048356

Open Access

ARTICLE

Sardar Hasen Ali^*, Maiwan Bahjat Abdulrazzaq

Computer Science Department, University of Zakho, Duhok, Iraq

* Corresponding Author: Sardar Hasen Ali.

(This article belongs to the Special Issue: Advanced Artificial Intelligence and Machine Learning Frameworks for Signal and Image Processing Applications)

Computers, Materials & Continua 2024, 79(1), 429-448. https://doi.org/10.32604/cmc.2024.048356

Received 05 December 2023; Accepted 14 February 2024; Issue published 25 April 2024

Abstract

Handwritten character recognition (HCR) involves identifying characters in images, documents, and various sources such as forms, surveys, questionnaires, and signatures, and transforming them into a machine-readable format for subsequent processing. Recognizing complex and intricately shaped handwritten characters remains a significant obstacle. Recent work using convolutional neural networks (CNNs) has notably advanced HCR by leveraging their ability to extract discriminative features from large sets of raw data. Because no pre-existing handwritten datasets were available for the Kurdish language, we created a Kurdish handwritten dataset called KurdSet. The dataset consists of Kurdish characters, digits, texts, and symbols. It was collected from 1560 participants and contains 45,240 characters. In this study, we use only the character portion of the dataset for handwritten character recognition. The study evaluates several models: InceptionV3, Xception, DenseNet121, and a custom CNN model. To assess KurdSet, we compared it with the Arabic handwritten character recognition dataset (AHCD) by applying the same models to both datasets. Model performance is evaluated using test accuracy, which measures the percentage of correctly classified characters in the evaluation phase. All models performed well in the training phase. DenseNet121 exhibited the highest accuracy among the models, achieving 99.80% on the Kurdish dataset, while the Xception model achieved 98.66% on the Arabic dataset.
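As an illustration of the test accuracy metric described in the abstract, the following is a minimal sketch, not the authors' evaluation code. The function name `accuracy_percent` and the example labels are hypothetical and chosen only for illustration.

```python
from typing import Sequence


def accuracy_percent(true_labels: Sequence[int], predicted_labels: Sequence[int]) -> float:
    """Return the percentage of predictions that match the true class labels."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("true_labels and predicted_labels must have the same length")
    if len(true_labels) == 0:
        raise ValueError("at least one sample is required")
    # Count the positions where the predicted class equals the true class.
    correct = sum(1 for t, p in zip(true_labels, predicted_labels) if t == p)
    return 100.0 * correct / len(true_labels)


# Hypothetical class indices for four test characters (not data from KurdSet).
example_true = [0, 1, 2, 2]
example_pred = [0, 1, 1, 2]
print(f"{accuracy_percent(example_true, example_pred):.2f}%")  # prints "75.00%"
```

In this sketch, three of the four hypothetical predictions match, so the reported accuracy is 75%. A figure such as 99.80% corresponds to the same ratio computed over a full test split.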

Keywords

CNN models; Kurdish handwritten recognition; KurdSet dataset; Arabic handwritten recognition; DenseNet121 model; InceptionV3 model; Xception model

Cite This Article

APA Style

Ali, S.H., Abdulrazzaq, M.B. (2024). KurdSet: A Kurdish Handwritten Characters Recognition Dataset Using Convolutional Neural Network. Computers, Materials & Continua, 79(1), 429–448. https://doi.org/10.32604/cmc.2024.048356

Vancouver Style

Ali SH, Abdulrazzaq MB. KurdSet: A Kurdish Handwritten Characters Recognition Dataset Using Convolutional Neural Network. Comput Mater Contin. 2024;79(1):429–448. https://doi.org/10.32604/cmc.2024.048356

IEEE Style

S. H. Ali and M. B. Abdulrazzaq, “KurdSet: A Kurdish Handwritten Characters Recognition Dataset Using Convolutional Neural Network,” Comput. Mater. Contin., vol. 79, no. 1, pp. 429–448, 2024. https://doi.org/10.32604/cmc.2024.048356

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
