Open Access iconOpen Access

ARTICLE

Late-Fusion of Heterogeneous Maritime Data Using Self-Attention for Interpretable Anomaly Detection

Raza Hasan*, Shakeel Ahmad, Ismet Gocer, Zakirul Bhuiyan

School of Technology and Maritime Industries, Southampton Solent University, East Park Terrace, Southampton, Hampshire, UK

* Corresponding Author: Raza Hasan. Email: email

(This article belongs to the Special Issue: Artificial Intelligence in Visual and Audio Signal Processing)

Computers, Materials & Continua 2026, 88(1), 20 https://doi.org/10.32604/cmc.2026.079708

Abstract

Maritime Domain Awareness (MDA) is critical for global security and economic stability, yet it is increasingly challenged by sophisticated adversarial tactics such as signal spoofing and “dark vessel” activities. Traditional surveillance systems, often reliant on single-sensor modalities, are ill-equipped to handle these deceptive behaviors. To address this, we propose the Multimodal Attention-based Fusion Transformer (MAFT), a novel deep learning architecture that integrates four distinct data modalities—Aerial imagery, Synthetic Aperture Radar (SAR), acoustic signatures, and Automatic Identification System (AIS) data—to achieve robust and interpretable maritime anomaly detection. A key contribution of our work is a principled synthetic data generation pipeline that creates a large-scale, labeled dataset (16,000 samples) for four critical anomaly types: Correlated Activity, Dark Vessels, AIS Spoofing, and Kinematic Anomalies. MAFT architecture employs modality-specific encoders to project heterogeneous data into a common 320-dimensional embedding space. These embeddings are then tokenized and supplied to a multi-layer Transformer Encoder, which leverages a self-attention mechanism for late-fusion, learning complex, non-linear inter-modal relationships. We also introduce “modality dropout” (p = 0.3) as a regularization technique to enhance model robustness against sensor failure or data unavailability. Quantitative analysis shows our model achieves a 97.02% F1-score and a significantly improved Expected Calibration Error (ECE) of 0.011, outperforming Early Fusion CNN, Mid-Fusion MLP, and Decision-Ensemble baselines. Furthermore, computational profiling confirms an inference latency of 26.54 ms, demonstrating operational readiness for real-time deployment. Analysis of the model’s attention weights suggests that MAFT not only accurately classifies maritime activities but also provides a high degree of model interpretability, offering crucial, data-driven insights for maritime security operators.

Graphic Abstract

Late-Fusion of Heterogeneous Maritime Data Using Self-Attention for Interpretable Anomaly Detection

Keywords

Multimodal AI; transformer architecture; sensor fusion; maritime domain awareness (MDA); anomaly detection; interpretability

Cite This Article

APA Style
Hasan, R., Ahmad, S., Gocer, I., Bhuiyan, Z. (2026). Late-Fusion of Heterogeneous Maritime Data Using Self-Attention for Interpretable Anomaly Detection. Computers, Materials & Continua, 88(1), 20. https://doi.org/10.32604/cmc.2026.079708
Vancouver Style
Hasan R, Ahmad S, Gocer I, Bhuiyan Z. Late-Fusion of Heterogeneous Maritime Data Using Self-Attention for Interpretable Anomaly Detection. Comput Mater Contin. 2026;88(1):20. https://doi.org/10.32604/cmc.2026.079708
IEEE Style
R. Hasan, S. Ahmad, I. Gocer, and Z. Bhuiyan, “Late-Fusion of Heterogeneous Maritime Data Using Self-Attention for Interpretable Anomaly Detection,” Comput. Mater. Contin., vol. 88, no. 1, pp. 20, 2026. https://doi.org/10.32604/cmc.2026.079708



cc Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 432

    View

  • 66

    Download

  • 0

    Like

Share Link