Open Access iconOpen Access

ARTICLE

crossmark

ASL-OOD: Hierarchical Contextual Feature Fusion with Angle-Sensitive Loss for Oriented Object Detection

Kexin Wang1,#, Jiancheng Liu1,#,*, Yuqing Lin2,*, Tuo Wang1, Zhipeng Zhang1, Wanlong Qi1, Xingye Han1, Runyuan Wen3

1 Northwest Institute of Mechanical and Electrical Engineering, Xianyang, 712099, China
2 School of Information Engineering, Chang’an University, Xi’an, 710064, China
3 School of Computer Science and Technology, Xidian University, Xi’an, 710071, China

* Corresponding Authors: Jiancheng Liu. Email: email; Yuqing Lin. Email: email
# Kexin Wang and Jiancheng Liu contributed equally to this work

(This article belongs to the Special Issue: Advances in Object Detection: Methods and Applications)

Computers, Materials & Continua 2025, 82(2), 1879-1899. https://doi.org/10.32604/cmc.2024.058952

Abstract

Detecting oriented targets in remote sensing images amidst complex and heterogeneous backgrounds remains a formidable challenge in the field of object detection. Current frameworks for oriented detection modules are constrained by intrinsic limitations, including excessive computational and memory overheads, discrepancies between predefined anchors and ground truth bounding boxes, intricate training processes, and feature alignment inconsistencies. To overcome these challenges, we present ASL-OOD (Angle-based SIOU Loss for Oriented Object Detection), a novel, efficient, and robust one-stage framework tailored for oriented object detection. The ASL-OOD framework comprises three core components: the Transformer-based Backbone (TB), the Transformer-based Neck (TN), and the Angle-SIOU (Scylla Intersection over Union) based Decoupled Head (ASDH). By leveraging the Swin Transformer, the TB and TN modules offer several key advantages, such as the capacity to model long-range dependencies, preserve high-resolution feature representations, seamlessly integrate multi-scale features, and enhance parameter efficiency. These improvements empower the model to accurately detect objects across varying scales. The ASDH module further enhances detection performance by incorporating angle-aware optimization based on SIOU, ensuring precise angular consistency and bounding box coherence. This approach effectively harmonizes shape loss and distance loss during the optimization process, thereby significantly boosting detection accuracy. Comprehensive evaluations and ablation studies on standard benchmark datasets such as DOTA with an mAP (mean Average Precision) of 80.16 percent, HRSC2016 with an mAP of 91.07 percent, MAR20 with an mAP of 85.45 percent, and UAVDT with an mAP of 39.7 percent demonstrate the clear superiority of ASL-OOD over state-of-the-art oriented object detection models. These findings underscore the model’s efficacy as an advanced solution for challenging remote sensing object detection tasks.

Keywords

Oriented object detection; transformer; deep learning

Cite This Article

APA Style
Wang, K., Liu, J., Lin, Y., Wang, T., Zhang, Z. et al. (2025). ASL-OOD: Hierarchical Contextual Feature Fusion with Angle-Sensitive Loss for Oriented Object Detection. Computers, Materials & Continua, 82(2), 1879–1899. https://doi.org/10.32604/cmc.2024.058952
Vancouver Style
Wang K, Liu J, Lin Y, Wang T, Zhang Z, Qi W, et al. ASL-OOD: Hierarchical Contextual Feature Fusion with Angle-Sensitive Loss for Oriented Object Detection. Comput Mater Contin. 2025;82(2):1879–1899. https://doi.org/10.32604/cmc.2024.058952
IEEE Style
K. Wang et al., “ASL-OOD: Hierarchical Contextual Feature Fusion with Angle-Sensitive Loss for Oriented Object Detection,” Comput. Mater. Contin., vol. 82, no. 2, pp. 1879–1899, 2025. https://doi.org/10.32604/cmc.2024.058952



cc Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 1159

    View

  • 2555

    Download

  • 0

    Like

Share Link