Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review

Peicheng Shi; Li Yang; Xinlong Dong; Heng Qi; Aixi Yang

doi:10.32604/cmc.2025.063205

Open Access icon Open Access

REVIEW

Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review

Peicheng Shi^1,*, Li Yang¹, Xinlong Dong¹, Heng Qi², Aixi Yang³

1 School of Mechanical and Automotive Engineering, Anhui Polytechnic University, Wuhu, 241000, China
2 State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing, Wuhan University, Wuhan, 430072, China
3 Polytechnic Institute, Zhejiang University, Hangzhou, 310015, China

* Corresponding Author: Peicheng Shi. Email: email

(This article belongs to the Special Issue: Advances in Object Detection: Methods and Applications)

Computers, Materials & Continua 2025, 83(3), 3877-3917. https://doi.org/10.32604/cmc.2025.063205

Received 08 January 2025; Accepted 11 March 2025; Issue published 19 May 2025

Abstract

As the number and complexity of sensors in autonomous vehicles continue to rise, multimodal fusion-based object detection algorithms are increasingly being used to detect 3D environmental information, significantly advancing the development of perception technology in autonomous driving. To further promote the development of fusion algorithms and improve detection performance, this paper discusses the advantages and recent advancements of multimodal fusion-based object detection algorithms. Starting from single-modal sensor detection, the paper provides a detailed overview of typical sensors used in autonomous driving and introduces object detection methods based on images and point clouds. For image-based detection methods, they are categorized into monocular detection and binocular detection based on different input types. For point cloud-based detection methods, they are classified into projection-based, voxel-based, point cluster-based, pillar-based, and graph structure-based approaches based on the technical pathways for processing point cloud features. Additionally, multimodal fusion algorithms are divided into Camera-LiDAR fusion, Camera-Radar fusion, Camera-LiDAR-Radar fusion, and other sensor fusion methods based on the types of sensors involved. Furthermore, the paper identifies five key future research directions in this field, aiming to provide insights for researchers engaged in multimodal fusion-based object detection algorithms and to encourage broader attention to the research and application of multimodal fusion-based object detection.

Keywords

Multi-modal fusion; 3D object detection; deep learning; autonomous driving

Cite This Article

APA Style

Shi, P., Yang, L., Dong, X., Qi, H., Yang, A. (2025). Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review. Computers, Materials & Continua, 83(3), 3877–3917. https://doi.org/10.32604/cmc.2025.063205

Vancouver Style

Shi P, Yang L, Dong X, Qi H, Yang A. Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review. Comput Mater Contin. 2025;83(3):3877–3917. https://doi.org/10.32604/cmc.2025.063205

IEEE Style

P. Shi, L. Yang, X. Dong, H. Qi, and A. Yang, “Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review,” Comput. Mater. Contin., vol. 83, no. 3, pp. 3877–3917, 2025. https://doi.org/10.32604/cmc.2025.063205

BibTex EndNote RIS

Copyright © 2025 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Research Progress on Multi-Modal Fusion Object Detection Algorithms for Autonomous Driving: A Review

Abstract

Keywords

Cite This Article

3424

1512

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link