Deep-BERT: Transfer Learning for Classifying Multilingual Offensive Texts on Social Media

Md. Anwar; M. Mridha; Jungpil Shin; Kamruddin Nur; Aloke Saha

doi:10.32604/csse.2023.027841

Open Access icon Open Access

ARTICLE

Deep-BERT: Transfer Learning for Classifying Multilingual Offensive Texts on Social Media

Md. Anwar Hussen Wadud¹, M. F. Mridha¹, Jungpil Shin^2,*, Kamruddin Nur³, Aloke Kumar Saha⁴

1 Department of Computer Science and Engineering, Bangladesh University of Business and Technology, Dhaka, Bangladesh
2 School of Computer Science and Engineering, University of Aizu, Aizuwakamatsu, Japan
3 Department of Computer Science, American International University-Bangladesh, Dhaka, Bangladesh
4 Department of Computer Science and Engineering, University of Asia Pacific, Dhaka, Bangladesh

* Corresponding Author: Jungpil Shin. Email: email

Computer Systems Science and Engineering 2023, 44(2), 1775-1791. https://doi.org/10.32604/csse.2023.027841

Received 26 January 2022; Accepted 11 March 2022; Issue published 15 June 2022

Abstract

Offensive messages on social media, have recently been frequently used to harass and criticize people. In recent studies, many promising algorithms have been developed to identify offensive texts. Most algorithms analyze text in a unidirectional manner, where a bidirectional method can maximize performance results and capture semantic and contextual information in sentences. In addition, there are many separate models for identifying offensive texts based on monolingual and multilingual, but there are a few models that can detect both monolingual and multilingual-based offensive texts. In this study, a detection system has been developed for both monolingual and multilingual offensive texts by combining deep convolutional neural network and bidirectional encoder representations from transformers (Deep-BERT) to identify offensive posts on social media that are used to harass others. This paper explores a variety of ways to deal with multilingualism, including collaborative multilingual and translation-based approaches. Then, the Deep-BERT is tested on the Bengali and English datasets, including the different bidirectional encoder representations from transformers (BERT) pre-trained word-embedding techniques, and found that the proposed Deep-BERT’s efficacy outperformed all existing offensive text classification algorithms reaching an accuracy of 91.83%. The proposed model is a state-of-the-art model that can classify both monolingual-based and multilingual-based offensive texts.

Keywords

Offensive text classification; deep convolutional neural network (DCNN); bidirectional encoder representations from transformers (BERT); natural language processing (NLP)

Cite This Article

APA Style

Wadud, M.A.H., Mridha, M.F., Shin, J., Nur, K., Saha, A.K. (2023). Deep-bert: transfer learning for classifying multilingual offensive texts on social media. Computer Systems Science and Engineering, 44(2), 1775-1791. https://doi.org/10.32604/csse.2023.027841

Vancouver Style

Wadud MAH, Mridha MF, Shin J, Nur K, Saha AK. Deep-bert: transfer learning for classifying multilingual offensive texts on social media. Comput Syst Sci Eng. 2023;44(2):1775-1791 https://doi.org/10.32604/csse.2023.027841

IEEE Style

M.A.H. Wadud, M.F. Mridha, J. Shin, K. Nur, and A.K. Saha "Deep-BERT: Transfer Learning for Classifying Multilingual Offensive Texts on Social Media," Comput. Syst. Sci. Eng., vol. 44, no. 2, pp. 1775-1791. 2023. https://doi.org/10.32604/csse.2023.027841

BibTex EndNote RIS

This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Deep-BERT: Transfer Learning for Classifying Multilingual Offensive Texts on Social Media

Abstract

Keywords

Cite This Article

2830

1023

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Share Link