Adversarial Example Transfer Method for Vision-Language Pre-Training Models Based on Negative Sample Feature Perturbation

Zhichao Pei; Ou Ye; Panyu Yang; Kaiwen He

doi:10.32604/cmc.2026.081490

Open Access icon Open Access

ARTICLE

Adversarial Example Transfer Method for Vision-Language Pre-Training Models Based on Negative Sample Feature Perturbation

Zhichao Pei, Ou Ye^*, Panyu Yang, Kaiwen He

College of Artificial Intelligence & Computer Science, Xi’an University of Science and Technology, Xi’an, China

* Corresponding Author: Ou Ye. Email: email

Computers, Materials & Continua 2026, 88(2), 48 https://doi.org/10.32604/cmc.2026.081490

Received 03 March 2026; Accepted 17 April 2026; Issue published 15 June 2026

Abstract

To address the issue of insufficient transferability of existing adversarial example generation methods for vision-language pre-training (VLP) models, this paper proposes an adversarial example transfer method for VLP models based on negative sample feature perturbation. First, a novel cross-modal collaborative perturbation strategy is constructed. By introducing negative samples into the cross-modal perturbation mechanism, the strategy explores more perturbation directions, breaks the original modal alignment constraints and avoids the local focus of adversarial perturbations. Then, to reduce the computational cost, a dynamic threshold attack strategy is built to measure the modal similarity of the generated adversarial examples. Finally, with the help of a multi-modal fusion encoder, a cross-modal fusion semantic attack (CFSA) module is designed. This module extracts the middle-layer features of image-text pairs and improves the transfer attack effect of adversarial examples. The proposed attack method is experimentally evaluated on the Flickr30K and MSCOCO datasets. The results show that for the adversarial examples generated on the Flickr30K dataset, the attack success rate (ASR) of the proposed method reaches up to 95.3% on multiple black-box models; for those generated on the MSCOCO dataset, the maximum attack success rate on multiple black-box models reaches 70.17%. Compared with the current methods, the adversarial examples generated by the proposed method achieve better attack performance.

Keywords

Vision-language pre-training model; multimodal; adversarial attack transferability; cross-modality perturbation; negative samples

Cite This Article

APA Style

Pei, Z., Ye, O., Yang, P., He, K. (2026). Adversarial Example Transfer Method for Vision-Language Pre-Training Models Based on Negative Sample Feature Perturbation. Computers, Materials & Continua, 88(2), 48. https://doi.org/10.32604/cmc.2026.081490

Vancouver Style

Pei Z, Ye O, Yang P, He K. Adversarial Example Transfer Method for Vision-Language Pre-Training Models Based on Negative Sample Feature Perturbation. Comput Mater Contin. 2026;88(2):48. https://doi.org/10.32604/cmc.2026.081490

IEEE Style

Z. Pei, O. Ye, P. Yang, and K. He, “Adversarial Example Transfer Method for Vision-Language Pre-Training Models Based on Negative Sample Feature Perturbation,” Comput. Mater. Contin., vol. 88, no. 2, pp. 48, 2026. https://doi.org/10.32604/cmc.2026.081490

BibTex EndNote RIS

Copyright © 2026 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Table of Content

Adversarial Example Transfer Method for Vision-Language Pre-Training Models Based on Negative Sample Feature Perturbation

Abstract

Keywords

Cite This Article

637

241

0

Related articles

Further Information

Guidelines

Follow Us

Join Us

Contact Us

WhatsApp:

Share Link