Semantic Analysis of Urdu English Tweets Empowered by Machine Learning

Nadia Tabassum; Tahir Alyas; Muhammad Hamid; Muhammad Saleem; Saadia Malik; Zain Ali; Umer Farooq

doi:10.32604/iasc.2021.018998

Open Access

ARTICLE

Nadia Tabassum¹, Tahir Alyas², Muhammad Hamid³,*, Muhammad Saleem⁴, Saadia Malik⁵, Zain Ali², Umer Farooq²

1 Department of Computer Science, Virtual University of Pakistan, Lahore, 54000, Pakistan
2 Department of Computer Science, Lahore Garrison University, Lahore, 54000, Pakistan
3 Department of Statistics and Computer Science, University of Veterinary and Animal Sciences, Lahore, 54000, Pakistan
4 Department of Industrial Engineering, Faculty of Engineering, Rabigh, King Abdulaziz University, Jeddah, 21589, Saudi Arabia
5 Department of Information Systems, Faculty of Computing and Information Technology - Rabigh, King Abdulaziz University, Jeddah, 21589, Saudi Arabia

* Corresponding Author: Muhammad Hamid. Email: email

Intelligent Automation & Soft Computing 2021, 30(1), 175-186. https://doi.org/10.32604/iasc.2021.018998

Received 28 March 2021; Accepted 29 April 2021; Issue published 26 July 2021

Abstract

Opinion mining and sentiment analysis have developed rapidly; the field aims to explore the views expressed in text on social media sites using machine learning techniques for sentiment analysis, subjectivity analysis, and polarity calculation. Sentiment analysis is a natural language processing technique used to decide whether a piece of text is positive, negative, or neutral. It is frequently applied to textual data to help organizations monitor brand and product sentiment in customer feedback and understand customer needs. In this paper, two strategies for sentiment analysis of Urdu and English tweets are proposed: word embedding and bag of words. Word embedding is a well-known family of techniques that capture word semantics based on the distributional hypothesis, which states that words used in the same contexts tend to have similar meanings. Bag of words is an approach used in natural language processing to extract information and features from written documents. For bag of words, machine learning techniques such as naive Bayes, decision tree, k-nearest neighbor, and support vector machine are used to enhance accuracy. For word embedding, a neural network technique is proposed that combines a recurrent neural network (RNN) with long short-term memory (LSTM) for sentiment analysis of tweets. Datasets of Urdu and English tweets are used to classify tweets as negative or positive with machine learning techniques. The contribution of this paper is the implementation of a hybrid approach focused on a sentiment analyzer that addresses social network challenges, together with a comparative analysis of different machine learning algorithms. The results indicate an improvement: the combination of RNN with LSTM achieved an accuracy of 87% on the Urdu dataset and 92% on the English dataset.
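
The bag-of-words strategy paired with a naive Bayes classifier can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenizer, the function names (tokenize, train_naive_bayes, predict), and the four toy training sentences are all assumptions made up for illustration, and real Urdu tweets would need language-appropriate tokenization and a far larger dataset.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately simple, assumed tokenizer)."""
    return text.lower().split()


def train_naive_bayes(docs, labels):
    """Build per-class bag-of-words counts for a multinomial naive Bayes model."""
    class_docs = Counter(labels)                      # number of documents per class
    word_counts = {c: Counter() for c in class_docs}  # word frequencies per class
    for text, label in zip(docs, labels):
        word_counts[label].update(tokenize(text))
    vocab = {w for counts in word_counts.values() for w in counts}
    return class_docs, word_counts, vocab


def predict(model, text):
    """Return the class with the highest log-posterior under Laplace smoothing."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    scores = {}
    for c in class_docs:
        score = math.log(class_docs[c] / total_docs)  # log prior
        total_words = sum(word_counts[c].values())
        for w in tokenize(text):
            if w not in vocab:
                continue  # ignore words never seen during training
            # Add-one (Laplace) smoothing avoids zero probabilities.
            score += math.log((word_counts[c][w] + 1) / (total_words + len(vocab)))
        scores[c] = score
    return max(scores, key=scores.get)


# Hypothetical toy data, not taken from the paper's datasets.
train_texts = [
    "good great phone",
    "love this great service",
    "bad slow service",
    "hate this bad phone",
]
train_labels = ["positive", "positive", "negative", "negative"]

model = train_naive_bayes(train_texts, train_labels)
print(predict(model, "great service"))  # positive
print(predict(model, "bad phone"))      # negative
```

Word order is discarded here, which is the defining simplification of bag of words; the word-embedding strategy with RNN and LSTM described above is motivated by keeping that sequential context.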

Keywords

Short term memory; natural language processing; tweets; support vector machine; word embedding

Cite This Article

APA Style

Tabassum, N., Alyas, T., Hamid, M., Saleem, M., Malik, S. et al. (2021). Semantic Analysis of Urdu English Tweets Empowered by Machine Learning. Intelligent Automation & Soft Computing, 30(1), 175–186. https://doi.org/10.32604/iasc.2021.018998

Vancouver Style

Tabassum N, Alyas T, Hamid M, Saleem M, Malik S, Ali Z, et al. Semantic Analysis of Urdu English Tweets Empowered by Machine Learning. Intell Automat Soft Comput. 2021;30(1):175–186. https://doi.org/10.32604/iasc.2021.018998

IEEE Style

N. Tabassum et al., “Semantic Analysis of Urdu English Tweets Empowered by Machine Learning,” Intell. Automat. Soft Comput., vol. 30, no. 1, pp. 175–186, 2021. https://doi.org/10.32604/iasc.2021.018998

Copyright © 2021 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
