OT-Attack: Enhancing Adversarial Transferability of Vision-Language Models via Optimal Transport Optimization

The AI Digest

optimal transport

adversarial attack

vision-language models

This paper introduces OT-Attack, an Optimal Transport-based Adversarial Attack, to enhance the transferability of adversarial examples in vision-language models. By optimally mapping features of image and text sets, OT-Attack improves the generation of adversarial examples, addressing issues of overfitting to source models. Highlights of this method include:

Leverages optimal transport theory for feature mapping between image-text pairs.
Demonstrates improvement in adversarial transferability across network architectures.
Proposes data augmentation as a strategy to align image-text pairs more effectively.

The introduction of OT-Attack marks a significant advancement in the field of adversarial machine learning by effectively tackling the overfitting problem and opening new possibilities for robust multimodal model evaluations.

Personalized AI news from scientific papers.