# Efficient Generation of Targeted and Transferable Adversarial Examples for Vision-Language Models Via Diffusion Models
This repository is an official implementation of the paper "Efficient Generation of Targeted and Transferable Adversarial Examples for Vision-Language Models Via Diffusion Models".
- Release arXiv paper
- Release core code
- Release adversarial example generation code
- Release test code
We propose AdvDiffVLM, which uses diffusion models to generate natural, unrestricted, and targeted adversarial examples via score matching. Specifically, AdvDiffVLM uses Adaptive Ensemble Gradient Estimation (AEGE) to modify the score during the diffusion model's reverse generation process, ensuring that the generated adversarial examples carry natural, targeted adversarial semantics and therefore transfer better. Simultaneously, to improve the quality of the adversarial examples, we use GradCAM-guided Mask Generation (GCMG) to disperse adversarial semantics throughout the image rather than concentrating them in a single area.
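Below is a minimal sketch of an AEGE-style guidance step, assuming PyTorch, CLIP-like surrogate encoders exposing an `encode_image` method, a precomputed target embedding `target_emb`, and a softmax-based adaptive weighting rule. These names and the exact weighting scheme are illustrative assumptions, not the paper's exact implementation.

```python
import torch

def aege_guidance(x_t, eps, alpha_bar_t, surrogates, target_emb, scale=1.0):
    """Fold an ensemble gradient estimate into the predicted noise of one
    DDIM reverse step (classifier-guidance style). A sketch, not the
    authors' code."""
    x_t = x_t.detach().requires_grad_(True)
    eps = eps.detach()
    # Predict the clean image from the current noisy sample (DDIM relation).
    x0_hat = (x_t - (1 - alpha_bar_t).sqrt() * eps) / alpha_bar_t.sqrt()

    # Cosine similarity between the predicted image and the target semantics,
    # measured by each surrogate encoder (preprocessing omitted for brevity).
    sims = []
    for model in surrogates:
        emb = model.encode_image(x0_hat)
        emb = emb / emb.norm(dim=-1, keepdim=True)
        sims.append((emb * target_emb).sum())
    sims = torch.stack(sims)

    # Adaptive weighting (an assumption): give more weight to surrogates the
    # example does not yet fool, so the ensemble gradient balances them.
    weights = torch.softmax(-sims.detach(), dim=0)
    loss = (weights * sims).sum()
    grad = torch.autograd.grad(loss, x_t)[0]

    # Shift the noise prediction so the reverse step moves toward the target.
    return eps - scale * (1 - alpha_bar_t).sqrt() * grad
```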
The target data and evaluation methods can be found in AttackVLM.
The weights of the adversarial example generation model are available at https://github.com/CompVis/latent-diffusion.
The GradCAM code is available at https://github.com/ramprs/grad-cam/.
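A minimal sketch of the GCMG idea, assuming a saliency map in [0, 1] produced by the GradCAM repository above; the exact blending rule used by the paper may differ:

```python
import torch.nn.functional as F

def gcmg_blend(x_adv, x_orig, cam):
    """Blend adversarial and clean images with a GradCAM saliency mask so
    adversarial content is dispersed away from the most attended region."""
    # cam: (H, W) saliency tensor in [0, 1]; upsample to image resolution.
    mask = cam.unsqueeze(0).unsqueeze(0)
    mask = F.interpolate(mask, size=x_orig.shape[-2:], mode="bilinear",
                         align_corners=False)
    # High-saliency pixels keep the original content; low-saliency pixels
    # receive the adversarial content, spreading it over the whole image.
    return mask * x_orig + (1.0 - mask) * x_adv
```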
The hyperparameters in ldm/models/diffusion/ddim_main can be adjusted to trade off attack capability against image quality.
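For reference, the trade-off typically comes down to a couple of knobs; the names below are hypothetical placeholders, not the actual hyperparameter names in ldm/models/diffusion/ddim_main:

```python
# Hypothetical knob names; see ldm/models/diffusion/ddim_main for the real ones.
guidance_scale = 2.0  # larger -> stronger targeted semantics, lower image quality
ddim_steps = 50       # more steps -> better fidelity, slower generation
```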
```bash
python demo.py
python main.py
```
If you find this work useful, please cite:

```bibtex
@article{guo2024efficient,
  title={Efficient Generation of Targeted and Transferable Adversarial Examples for Vision-Language Models Via Diffusion Models},
  author={Guo, Qi and Pang, Shanmin and Jia, Xiaojun and Liu, Yang and Guo, Qing},
  journal={IEEE Transactions on Information Forensics and Security},
  year={2024},
  publisher={IEEE}
}
```