Open Access iconOpen Access

ARTICLE

crossmark

Developing Lexicons for Enhanced Sentiment Analysis in Software Engineering: An Innovative Multilingual Approach for Social Media Reviews

Zohaib Ahmad Khan1, Yuanqing Xia1,*, Ahmed Khan2, Muhammad Sadiq2, Mahmood Alam3, Fuad A. Awwad4, Emad A. A. Ismail4

1 School of Automation, Beijing Institute of Technology, Beijing, 100081, China
2 Department of Computer Science and Technology, University of Science and Technology Bannu, KPK, Bannu, 28100, Pakistan
3 School of Computer Science and Engineering, Central South University, Changsha, 410083, China
4 Department of Quantitative Analysis, College of Business Administration, King Saud University, P.O. Box 71115, Riyadh, 11587, Saudi Arabia

* Corresponding Author: Yuanqing Xia. Email: email

(This article belongs to the Special Issue: Requirements Engineering: Bridging Theory, Research and Practice)

Computers, Materials & Continua 2024, 79(2), 2771-2793. https://doi.org/10.32604/cmc.2024.046897

Abstract

Sentiment analysis is becoming increasingly important in today’s digital age, with social media being a significant source of user-generated content. The development of sentiment lexicons that can support languages other than English is a challenging task, especially for analyzing sentiment analysis in social media reviews. Most existing sentiment analysis systems focus on English, leaving a significant research gap in other languages due to limited resources and tools. This research aims to address this gap by building a sentiment lexicon for local languages, which is then used with a machine learning algorithm for efficient sentiment analysis. In the first step, a lexicon is developed that includes five languages: Urdu, Roman Urdu, Pashto, Roman Pashto, and English. The sentiment scores from SentiWordNet are associated with each word in the lexicon to produce an effective sentiment score. In the second step, a naive Bayesian algorithm is applied to the developed lexicon for efficient sentiment analysis of Roman Pashto. Both the sentiment lexicon and sentiment analysis steps were evaluated using information retrieval metrics, with an accuracy score of 0.89 for the sentiment lexicon and 0.83 for the sentiment analysis. The results showcase the potential for improving software engineering tasks related to user feedback analysis and product development.

Keywords


Cite This Article

APA Style
Khan, Z.A., Xia, Y., Khan, A., Sadiq, M., Alam, M. et al. (2024). Developing lexicons for enhanced sentiment analysis in software engineering: an innovative multilingual approach for social media reviews. Computers, Materials & Continua, 79(2), 2771-2793. https://doi.org/10.32604/cmc.2024.046897
Vancouver Style
Khan ZA, Xia Y, Khan A, Sadiq M, Alam M, Awwad FA, et al. Developing lexicons for enhanced sentiment analysis in software engineering: an innovative multilingual approach for social media reviews. Comput Mater Contin. 2024;79(2):2771-2793 https://doi.org/10.32604/cmc.2024.046897
IEEE Style
Z.A. Khan et al., "Developing Lexicons for Enhanced Sentiment Analysis in Software Engineering: An Innovative Multilingual Approach for Social Media Reviews," Comput. Mater. Contin., vol. 79, no. 2, pp. 2771-2793. 2024. https://doi.org/10.32604/cmc.2024.046897



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 355

    View

  • 131

    Download

  • 0

    Like

Share Link