Open Access

ARTICLE

A Dual-Stream Framework for Landslide Segmentation with Cross-Attention Enhancement and Gated Multimodal Fusion

Md Minhazul Islam1,2, Yunfei Yin1,2,*, Md Tanvir Islam1,2, Zheng Yuan1,2, Argho Dey1,2
1 College of Computer Science, Chongqing University, Chongqing, 400044, China
2 SUGON Industrial Control and Security Center, Chengdu, 610225, China
* Corresponding Author: Yunfei Yin.

Computers, Materials & Continua https://doi.org/10.32604/cmc.2025.072550

Received 29 August 2025; Accepted 04 November 2025; Published online 22 December 2025

Abstract

Automatic segmentation of landslides from remote sensing imagery is challenging because traditional machine learning and early CNN-based models often fail to generalize across heterogeneous landscapes, where landslide regions are sparse and fragmented and geographical conditions vary widely. To address these issues, we propose a lightweight dual-stream siamese deep learning framework that fuses optical and topographical data and combines an adaptive decoder, guided multimodal fusion, and deep supervision. The framework brings together cross-attention, gated fusion, and sub-pixel upsampling within a unified dual-stream architecture optimized for landslide segmentation, enabling efficient context modeling and robust feature exchange between modalities. The decoder captures long-range context at deeper levels using lightweight cross-attention and refines spatial details at shallower levels through attention-gated skip fusion, which sharpens boundary delineation and reduces false positives. Gated fusion further enhances the integration of optical and topographical cues, while deep supervision stabilizes training and improves generalization. Moreover, to mitigate checkerboard artifacts, a learnable sub-pixel upsampling layer replaces the traditional transposed convolution. Despite its compact design with fewer parameters, the model consistently outperforms state-of-the-art baselines. Experiments on two benchmark datasets, Landslide4Sense and Bijie, confirm the effectiveness of the framework. On the Bijie dataset, it achieves an F1-score of 0.9110 and an intersection over union (IoU) of 0.8839. These results highlight its potential for accurate large-scale landslide inventory mapping and real-time disaster response. The implementation is publicly available at https://github.com/mishaown/DiGATe-UNet-LandSlide-Segmentation (accessed on 3 November 2025).

Keywords

Landslide segmentation; remote sensing; dual-stream lightweight networks; digital elevation model (DEM); gated fusion
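
For readers unfamiliar with the two metrics reported in the abstract, the following is a minimal sketch of how an F1-score and an IoU are commonly computed for the positive (landslide) class of a binary segmentation mask. It is an illustration only and is not taken from the authors' repository: the function names, the toy masks, and the convention of returning 1.0 when both masks are empty are assumptions, and the paper may aggregate its scores differently (for example, by averaging over classes or images).

```python
from typing import Sequence, Tuple

# A binary mask as nested sequences of 0/1 (1 = landslide pixel). Hypothetical type alias.
Mask = Sequence[Sequence[int]]


def confusion_counts(pred: Mask, target: Mask) -> Tuple[int, int, int]:
    """Return (TP, FP, FN) for the positive class of two equally sized masks."""
    tp = fp = fn = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            if p == 1 and t == 1:
                tp += 1
            elif p == 1 and t == 0:
                fp += 1
            elif p == 0 and t == 1:
                fn += 1
    return tp, fp, fn


def f1_score(pred: Mask, target: Mask) -> float:
    """F1 = 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn = confusion_counts(pred, target)
    denom = 2 * tp + fp + fn
    # Assumed convention: two empty masks count as a perfect match.
    return 2 * tp / denom if denom else 1.0


def iou(pred: Mask, target: Mask) -> float:
    """IoU = TP / (TP + FP + FN)."""
    tp, fp, fn = confusion_counts(pred, target)
    denom = tp + fp + fn
    # Assumed convention: two empty masks count as a perfect match.
    return tp / denom if denom else 1.0


# Toy 3x4 masks, invented for illustration only.
pred = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 0],
]
target = [
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 1, 1],
]

if __name__ == "__main__":
    print("TP, FP, FN:", confusion_counts(pred, target))
    print("F1 :", round(f1_score(pred, target), 4))
    print("IoU:", round(iou(pred, target), 4))
```

Both metrics ignore true negatives, which is why they are preferred over pixel accuracy when landslide regions are sparse relative to the background.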