Open Access
ARTICLE
Chinese DeepSeek: Performance of Various Oversampling Techniques on Public Perceptions Using Natural Language Processing
1 Artificial Intelligence & Data Analytics Lab, CCIS, Prince Sultan University, Riyadh, 11586, Saudi Arabia
2 Department of Information Systems, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, Riyadh, 11671, Saudi Arabia
* Corresponding Author: Amal Al-Rasheed. Email:
(This article belongs to the Special Issue: Advancements and Challenges in Artificial Intelligence, Data Analysis and Big Data)
Computers, Materials & Continua 2025, 84(2), 2717-2731. https://doi.org/10.32604/cmc.2025.065566
Received 16 March 2025; Accepted 28 April 2025; Issue published 03 July 2025
Abstract
DeepSeek, a Chinese open-source artificial intelligence (AI) model, has attracted considerable attention for its economical training and efficient inference. Trained with large-scale reinforcement learning, without supervised fine-tuning as a preliminary step, DeepSeek demonstrates remarkable reasoning capabilities across a wide range of tasks. It is a prominent AI-driven chatbot that assists individuals in learning and improves responses by generating insightful solutions to inquiries. Users hold divergent views about advanced models such as DeepSeek, posting about both their merits and shortcomings on several social media platforms. This research presents a new framework for predicting public sentiment to evaluate perceptions of DeepSeek. To convert the unstructured data into a suitable form, we first collect DeepSeek-related tweets from Twitter and apply various preprocessing steps. We then annotate the tweets using the Valence Aware Dictionary and sEntiment Reasoner (VADER) methodology and the lexicon-driven TextBlob. Next, we classify the sentiments obtained from the cleaned data with the proposed hybrid model, which combines long short-term memory (LSTM) and bidirectional gated recurrent unit (BiGRU) layers and is strengthened with multi-head attention, regularization, activation, and dropout units to enhance performance. Topic modeling based on KMeans clustering and Latent Dirichlet Allocation (LDA) was used to analyze public behavior concerning DeepSeek. The results show that 82.5% of the tweets are positive, 15.2% negative, and 2.3% neutral according to TextBlob, and 82.8% positive, 16.1% negative, and 1.2% neutral according to VADER. The slight difference indicates that the two analyses agree on the overall perception while handling language peculiarities somewhat differently. The results indicate that the proposed model surpasses previous state-of-the-art approaches.
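As a rough illustration of the annotation and classification steps described above, the following Python sketch labels tweet text with TextBlob and VADER and assembles a hybrid LSTM-BiGRU classifier with multi-head attention in Keras. The thresholds, layer sizes, and hyperparameters here are illustrative assumptions, not the configuration reported in the paper.

```python
# Illustrative sketch only: lexicon-based labeling (TextBlob, VADER) and a
# hybrid LSTM + BiGRU classifier with multi-head attention. All thresholds,
# sizes, and hyperparameters are assumptions for demonstration purposes.
from textblob import TextBlob
from vaderSentiment.vaderSentiment import SentimentIntensityAnalyzer
from tensorflow.keras import layers, models, regularizers

def label_textblob(text: str) -> str:
    polarity = TextBlob(text).sentiment.polarity          # range [-1, 1]
    return "positive" if polarity > 0 else "negative" if polarity < 0 else "neutral"

_vader = SentimentIntensityAnalyzer()

def label_vader(text: str) -> str:
    compound = _vader.polarity_scores(text)["compound"]   # range [-1, 1]
    if compound >= 0.05:
        return "positive"
    if compound <= -0.05:
        return "negative"
    return "neutral"

# Hybrid LSTM + BiGRU classifier with multi-head attention, dropout, and an
# L2-regularized dense layer; outputs three sentiment classes.
VOCAB_SIZE, MAX_LEN, EMBED_DIM = 20_000, 60, 128           # assumed values

inputs = layers.Input(shape=(MAX_LEN,))
x = layers.Embedding(VOCAB_SIZE, EMBED_DIM)(inputs)
x = layers.LSTM(64, return_sequences=True)(x)
x = layers.Bidirectional(layers.GRU(64, return_sequences=True))(x)
x = layers.MultiHeadAttention(num_heads=4, key_dim=32)(x, x)
x = layers.GlobalAveragePooling1D()(x)
x = layers.Dropout(0.3)(x)
x = layers.Dense(64, activation="relu",
                 kernel_regularizer=regularizers.l2(1e-4))(x)
outputs = layers.Dense(3, activation="softmax")(x)

model = models.Model(inputs, outputs)
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])

print(label_textblob("DeepSeek gives surprisingly good answers!"),
      label_vader("DeepSeek gives surprisingly good answers!"))
```

The lexicon labels produced this way (positive/negative/neutral) can serve as training targets for the neural classifier; the ±0.05 VADER cutoff is the commonly used default, not a value stated in the abstract.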
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.