Home / Journals / CMC / Online First / doi:10.32604/cmc.2025.074899
Special Issues
Table of Content

Open Access

ARTICLE

YOLO-Drive: Robust Driver Distraction Recognition under Fine-Grained and Overlapping Behaviors

Zhichao Yu1, Jiahui Yu1, Simon James Fong1,*, Yaoyang Wu1,2
1 Department of Computer and Information Science, Faculty of Science and Technology, University of Macau, Macau SAR, 999078, China
2 Faculty of Digital Science and Technology, Macau Millennium College, Macau SAR, 999078, China
* Corresponding Author: Simon James Fong. Email: ccfong@um.edu.mo

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.074899

Received 21 October 2025; Accepted 12 December 2025; Published online 30 December 2025

Abstract

Accurately recognizing driver distraction is critical for preventing traffic accidents, yet current detection models face two persistent challenges. First, distractions are often fine-grained, involving subtle cues such as brief eye closures or partial yawns, which are easily missed by conventional detectors. Second, in real-world scenarios, drivers frequently exhibit overlapping behaviors, such as simultaneously holding a cup, closing their eyes, and yawning, leading to multiple detection boxes and degraded model performance. Existing approaches fail to robustly address these complexities, resulting in limited reliability in safety-critical applications. To overcome these pain points, we propose YOLO-Drive, a novel framework that enhances YOLO-based driver monitoring with EfficientViM and Polarized Spectral–Spatial Attention (PSSA) modules. EfficientViM provides lightweight yet powerful global–local feature extraction, enabling accurate recognition of subtle driver states. PSSA further amplifies discriminative features across spatial and spectral domains, ensuring robust separation of concurrent distraction cues. By explicitly modeling fine-grained and overlapping behaviors, our approach delivers significant improvements in both precision and robustness. Extensive experiments on benchmark driver distraction datasets demonstrate that YOLO-Drive consistently outperforms state-of-the-art models, achieving higher detection accuracy while maintaining real-time efficiency. These results validate YOLO-Drive as a practical and reliable solution for advanced driver monitoring systems, addressing long-standing challenges of subtle cue recognition and multi-cue distraction detection.

Keywords

Driver distraction recognition; attention mechanism; fine-grained feature modeling; object detection; overlapping behavior detection; state space model; YOLO extensions
  • 124

    View

  • 18

    Download

  • 0

    Like

Share Link