Home / Journals / CMC / Online First / doi:10.32604/cmc.2025.061702
Special Issues
Table of Content

Open Access

ARTICLE

Multi-Label Movie Genre Classification with Attention Mechanism on Movie Plots

Faheem Shaukat1, Naveed Ejaz1,2, Rashid Kamal3,4, Tamim Alkhalifah5,*, Sheraz Aslam6,7,*, Mu Mu4
1 Department of Computing and Technology, Iqra University, H-9 Campus, Islamabad, 04436, Pakistan
2 School of Computing, Queens University, Kingston, ON K7L2N8, Canada
3 School of Computing, Ulster University, Belfast, BT15 1ED, Northern Ireland, UK
4 Faculty of Arts, Science and Technology, University of Northampton, Waterside Campus, Northampton Northamptonshire, NN1 5PH, UK
5 Department of Computer Engineering, College of Computer, Qassim University, Buraydah, Saudi Arabia
6 Department of Computer Science, CTL Eurocollege, Limassol, 3077, Cyprus
7 Department of Electrical Engineering, Computer Engineering, and Informatics, Cyprus University of Technology, Limassol, 3036, Cyprus
* Corresponding Author: Tamim Alkhalifah. Email: email; Sheraz Aslam. Email: email

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.061702

Received 01 December 2024; Accepted 25 March 2025; Published online 22 April 2025

Abstract

Automated and accurate movie genre classification is crucial for content organization, recommendation systems, and audience targeting in the film industry. Although most existing approaches focus on audiovisual features such as trailers and posters, the text-based classification remains underexplored despite its accessibility and semantic richness. This paper introduces the Genre Attention Model (GAM), a deep learning architecture that integrates transformer models with a hierarchical attention mechanism to extract and leverage contextual information from movie plots for multi-label genre classification. In order to assess its effectiveness, we assess multiple transformer-based models, including Bidirectional Encoder Representations from Transformers (BERT), A Lite BERT (ALBERT), Distilled BERT (DistilBERT), Robustly Optimized BERT Pretraining Approach (RoBERTa), Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA), eXtreme Learning Network (XLNet) and Decoding-enhanced BERT with Disentangled Attention (DeBERTa). Experimental results demonstrate the superior performance of DeBERTa-based GAM, which employs a two-tier hierarchical attention mechanism: word-level attention highlights key terms, while sentence-level attention captures critical narrative segments, ensuring a refined and interpretable representation of movie plots. Evaluated on three benchmark datasets Trailers12K, Large Movie Trailer Dataset-9 (LMTD-9), and MovieLens37K. GAM achieves micro-average precision scores of 83.63%, 83.32%, and 83.34%, respectively, surpassing state-of-the-art models. Additionally, GAM is computationally efficient, requiring just 6.10 Giga Floating Point Operations Per Second (GFLOPS), making it a scalable and cost-effective solution. These results highlight the growing potential of text-based deep learning models in genre classification and GAM’s effectiveness in improving predictive accuracy while maintaining computational efficiency. With its robust performance, GAM offers a versatile and scalable framework for content recommendation, film indexing, and media analytics, providing an interpretable alternative to traditional audiovisual-based classification techniques.

Keywords

Multi-label classification; artificial intelligence; movie genre classification; hierarchical attention mechanisms; natural language processing; content recommendation; text-based genre classification; explainable AI (Artificial Intelligence); transformer models; BERT
  • 485

    View

  • 249

    Download

  • 0

    Like

Share Link