TY - EJOU
AU - Tumrani, Saifullah
AU - Siddiqui, Abdul Jabbar
TI - Robust Swin Transformer for Vehicle Re-Identification with Dynamic Feature Fusion
T2 - Computers, Materials \& Continua
PY - 2026
VL - 87
IS - 2
SN - 1546-2226
AB - Vehicle re-identification (ReID) is a challenging task in intelligent transportation, and urban surveillance systems due to its complications in camera viewpoints, vehicle scales, and environmental conditions. Recent transformer-based approaches have shown impressive performance by utilizing global dependencies, these models struggle with aspect ratio distortions and may overlook fine-grained local attributes crucial for distinguishing visually similar vehicles. We introduce a framework based on Swin Transformers that addresses these challenges by implementing three components. First, to improve feature robustness and maintain vehicle proportions, our Aspect Ratio-Aware Swin Transformer (AR-Swin) preserve the native ratio via letterbox, uses a non-square (16 × 8) patch-embedding stem, and keeps fixed 7 × 7 token windows. Second, we introduce a Dynamic Feature Fusion Network (DFFNet) that adaptively integrates global Swin features with local attribute embeddings; such as color and vehicle type enabling more discriminative representations. Third, our Regional Attention Blocks incorporate regional masks into the transformer’s windowed attention mechanism, effectively highlighting critical details like manufacturer logos or lights. On VeRi-776, we obtain 82.55 mAP, 97.26 Rank-1 and 99.23 Rank-5, and on VehicleID we obtain 91.8 Rank-1 and 97.75 Rank-5. The design is drop-in for Swin backbones and emphasizes robustness without increasing architectural complexity. Code: https://github.com/sft110/Swinvreid.
KW - Vehicle ReID; swin transformer; aspect ratio robustness; multi-attribute learning
DO - 10.32604/cmc.2025.075152