Vol.66, No.2, 2021, pp.2061-2076, doi:10.32604/cmc.2020.013905
Intelligent Dynamic Gesture Recognition Using CNN Empowered by Edit Distance
  • Shazia Saqib1, Allah Ditta2, Muhammad Adnan Khan1,*, Syed Asad Raza Kazmi3, Hani Alquhayz4
1 Department of Computer Science, Lahore Garrison University, Lahore, 54000, Pakistan
2 Department of Information Sciences, Division of Science & Technology, University of Education, Lahore, 54000, Pakistan
3 GC University, Lahore, 54000, Pakistan
4 Department of Computer Science and Information, College of Science in Zulfi, Majmaah University, Al-Majmaah, 11952, Saudi Arabia
* Corresponding Author: Muhammad Adnan Khan. Email:
Received 26 August 2020; Accepted 28 September 2020; Issue published 26 November 2020
Human activity detection and recognition is a challenging task. Video surveillance can benefit greatly by advances in Internet of Things (IoT) and cloud computing. Artificial intelligence IoT (AIoT) based devices form the basis of a smart city. The research presents Intelligent dynamic gesture recognition (IDGR) using a Convolutional neural network (CNN) empowered by edit distance for video recognition. The proposed system has been evaluated using AIoT enabled devices for static and dynamic gestures of Pakistani sign language (PSL). However, the proposed methodology can work efficiently for any type of video. The proposed research concludes that deep learning and convolutional neural networks give a most appropriate solution retaining discriminative and dynamic information of the input action. The research proposes recognition of dynamic gestures using image recognition of the keyframes based on CNN extracted from the human activity. Edit distance is used to find out the label of the word to which those sets of frames belong to. The simulation results have shown that at 400 videos per human action, 100 epochs, 234 × 234 image size, the accuracy of the system is 90.79%, which is a reasonable accuracy for a relatively small dataset as compared to the previously published techniques.
Sign languages; keyframe; edit distance; misrate; accuracy
Cite This Article
S. Saqib, A. Ditta, M. A. Khan, S. A. R. Kazmi and H. Alquhayz, "Intelligent dynamic gesture recognition using cnn empowered by edit distance," Computers, Materials & Continua, vol. 66, no.2, pp. 2061–2076, 2021.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.