Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception

Yaguan Wang; Linlin Kou; Yang Gao; Qiang Sun; Yong Qin; Genwang Peng

doi:10.32604/sdhm.2025.072538

Open Access icon Open Access

ARTICLE

Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception

Yaguan Wang¹, Linlin Kou², Yang Gao^3,*, Qiang Sun¹, Yong Qin³, Genwang Peng³

1 Institute of Technological Innovation, Beijing Subway Operation Co. Ltd., Beijing, 100044, China
2 Technical Department, Beijing Subway Operation Co. Ltd., Beijing, 100044, China
3 State Key Laboratory of Advanced Rail Autonomous Operation, Beijing Jiaotong University, Beijing, 100044, China

* Corresponding Author: Yang Gao. Email: email

(This article belongs to the Special Issue: AI-Enhanced Low-Altitude Technology Applications in Structural Integrity Evaluation and Safety Management of Transportation Infrastructure Systems)

Structural Durability & Health Monitoring 2026, 20(2), 10 https://doi.org/10.32604/sdhm.2025.072538

Received 29 August 2025; Accepted 27 October 2025; Issue published 31 March 2026

Abstract

The fasteners employed in the railway tracks are susceptible to defects arising from their intricate composition. Foreign objects are frequently observed on the track bed in an open environment. These two types of defects pose potential threats to high-speed trains, thus necessitating timely and accurate track inspection. The majority of extant automatic inspection methods are predicated on the utilization of single visible light data, and the efficacy of the algorithmic processes is influenced by complex environments. Furthermore, due to the single information dimension, the detection accuracy of defects in similar, occluded, and small object categories is low. To address the aforementioned issues, this paper proposes a track defect detection method based on dynamic multi-modal fusion and challenging object enhanced perception. First, in light of the variances in the representation dimensions of multimodal information, this paper proposes a dynamic weighted multi-modal feature fusion module. The fused multi-modal features are assigned weights, and then multiplied with the extracted single-modal features at multiple levels, achieving adaptive adjustment of the response degree of fusion features. Second, a novel stepwise multi-scale convolution feature aggregation module is proposed for challenging objects. The proposed method employs depth separable convolution and cross-scale aggregation operations of different receptive fields to enhance feature extraction and reuse, thereby reducing the degree of progressive loss of effective information. The experimental results demonstrate the efficacy of the proposed method in comparison to eight established methods, encompassing both single-modal and multi-modal methods, as evidenced by the extensive findings within the constructed RGBD dataset.

Keywords

Railway safety; track defect detection; multi-modal data; object detection

Cite This Article

APA Style

Wang, Y., Kou, L., Gao, Y., Sun, Q., Qin, Y. et al. (2026). Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception. Structural Durability & Health Monitoring, 20(2), 10. https://doi.org/10.32604/sdhm.2025.072538

Vancouver Style

Wang Y, Kou L, Gao Y, Sun Q, Qin Y, Peng G. Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception. Structural Durability Health Monit. 2026;20(2):10. https://doi.org/10.32604/sdhm.2025.072538

IEEE Style

Y. Wang, L. Kou, Y. Gao, Q. Sun, Y. Qin, and G. Peng, “Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception,” Structural Durability Health Monit., vol. 20, no. 2, pp. 10, 2026. https://doi.org/10.32604/sdhm.2025.072538

BibTex EndNote RIS

Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Railway Track Defect Detection Based on Dynamic Multi-Modal Fusion and Challenging Object Enhanced Perception

Abstract

Keywords

Cite This Article

2061

883

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link