TY - EJOU AU - Wu, Jianping AU - Li, Nian AU - Dong, Libin AU - Zhang, Ping TI - Camera-LiDAR Fusion for Enhanced Object Detection T2 - Journal on Artificial Intelligence PY - 2026 VL - 8 IS - 1 SN - 2579-003X AB - This paper presents a static fusion framework that enhances object detection by integrating camera and LiDAR-based detection results. The proposed method focuses on associating 2D candidate bounding boxes from a camera detector with 3D candidate boxes from a LiDAR detector using an Intersection over Union (IoU)-based matching approach. To enhance the quality of 2D detection, we refine the baseline Cascade R-CNN detector by incorporating a dual self-attention mechanism into both the backbone and the region proposal network (RPN), resulting in the DA-Cascade R-CNN. This enhancement strengthens the network’s ability to detect small or distant objects by improving feature sensitivity and localization accuracy. Once 2D and 3D candidate boxes are obtained, they are associated through IoU-aware matching and subsequently refined using non-maximum suppression (NMS) to remove redundant or conflicting hypotheses across modalities, effectively preserving positive detection results to improve accuracy. Experimental results on the KITTI dataset demonstrate that the proposed static fusion method yields improved detection average precision for three different levels of difficulty compared to single-sensor baselines. KW - Camera object detection; LiDAR object detection; fused object detection; attention mechanism DO - 10.32604/jai.2026.075753