Fethi Fkih1,2,*, Mohammed Alsuhaibani1, Delel Rhouma1,2, Ali Mustafa Qamar1
CMC-Computers, Materials & Continua, Vol.75, No.3, pp. 5871-5886, 2023, DOI:10.32604/cmc.2023.035910
Abstract: Text classification is an essential task for many applications in the Natural Language Processing domain. It can be applied in many fields, such as Information Retrieval, Knowledge Extraction, and Knowledge Modeling. Despite the importance of this task, Arabic Text Classification tools still suffer from many problems and remain unable to keep up with the increasing volume of Arabic content that circulates on the web or resides in large databases. This paper introduces a novel machine learning-based approach that exclusively uses hybrid (stylistic and semantic) features. First, we clean the Arabic documents and translate them to English using translation tools.…