Open Access iconOpen Access

ARTICLE

crossmark

Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion

Chuanchuan Wang1,2, Ahmad Sufril Azlan Mohamed2,*, Xiao Yang 2, Hao Zhang 2, Xiang Li1, Mohd Halim Bin Mohd Noor 2

1 School of Engineering, Guangzhou College of Technology and Business, Guangzhou, 510850, China
2 School of Computer Sciences, Universiti Sains Malaysia, Minden, 11800, Penang, Malaysia

* Corresponding Author: Ahmad Sufril Azlan Mohamed. Email: email

Computers, Materials & Continua 2025, 85(1), 855-874. https://doi.org/10.32604/cmc.2025.066343

Abstract

Classroom behavior recognition is a hot research topic, which plays a vital role in assessing and improving the quality of classroom teaching. However, existing classroom behavior recognition methods have challenges for high recognition accuracy with datasets with problems such as scenes with blurred pictures, and inconsistent objects. To address this challenge, we proposed an effective, lightweight object detector method called the RFNet model (YOLO-FR). The YOLO-FR is a lightweight and effective model. Specifically, for efficient multi-scale feature extraction, effective feature pyramid shared convolutional (FPSC) was designed to improve the feature extract performance by leveraging convolutional layers with varying dilation rates from the input image in the backbone. Secondly, to address the problem of multi-scale variability in the scene, we design the Rep Ghost fusion Cross Stage Partial and Efficient Layer Aggregation Network (RGCSPELAN) to improve the network performance further and reduce the amount of computation and the number of parameters. In addition, by conducting experimental valuation on the SCB dataset3 and STBD-08 dataset. Experimental results indicate that, compared to the baseline model, the RFNet model has increased mean accuracy precision (mAP@50) from 69.6% to 71.0% on the SCB dataset3 and from 91.8% to 93.1% on the STBD-08 dataset. The RFNet approach has effectiveness precision at 68.6%, surpassing the baseline method (YOLOv11) at 3.3% and archieve the minimal size (4.9 M) on the SCB dataset3. Finally, comparing it with other algorithms, it accurately detects student behavior in complex classroom environments results confirmed that RFNet is well-suited for real-time and efficiently recognizing classroom behaviors.

Keywords

Classroom action recognition; YOLO-FR; feature pyramid shared convolutional; rep ghost cross stage partial efficient layer aggregation network (RGCSPELAN)

Cite This Article

APA Style
Wang, C., Mohamed, A.S.A., Yang, X., Zhang, H., Li, X. et al. (2025). Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion. Computers, Materials & Continua, 85(1), 855–874. https://doi.org/10.32604/cmc.2025.066343
Vancouver Style
Wang C, Mohamed ASA, Yang X, Zhang H, Li X, Noor MHBM. Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion. Comput Mater Contin. 2025;85(1):855–874. https://doi.org/10.32604/cmc.2025.066343
IEEE Style
C. Wang, A. S. A. Mohamed, X. Yang, H. Zhang, X. Li, and M.H.B.M. Noor, “Enhancing Classroom Behavior Recognition with Lightweight Multi-Scale Feature Fusion,” Comput. Mater. Contin., vol. 85, no. 1, pp. 855–874, 2025. https://doi.org/10.32604/cmc.2025.066343



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 2767

    View

  • 2112

    Download

  • 0

    Like

Share Link