Sept. 4, 2023, 1:10 a.m. | Luke Bailey, Euan Ong, Stuart Russell, Scott Emmons

cs.CR updates on arXiv.org

Are foundation models secure from malicious actors? In this work, we focus on
the image input to a vision-language model (VLM). We discover image hijacks,
adversarial images that control generative models at runtime. We introduce
Behavior Matching, a general method for creating image hijacks, and we use it
to explore three types of attacks. Specific string attacks generate arbitrary
output of the adversary's choosing. Leak context attacks leak information from
the context window into the output. Jailbreak attacks circumvent a …
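
The abstract names Behavior Matching as the general recipe behind these attacks: optimize an adversarial image so that the VLM's generated output matches a target behavior chosen by the adversary. Below is a minimal, hedged sketch of that idea in PyTorch. The `vlm` callable and its signature, the teacher-forced cross-entropy loss, and the signed-gradient step with an L-infinity projection are illustrative assumptions for this sketch, not the paper's exact method.

```python
import torch

def image_hijack_sketch(vlm, image, prompt_ids, target_ids,
                        steps=500, step_size=1 / 255, eps=8 / 255):
    """Hedged sketch of optimizing an image to force a target VLM output.

    Assumptions (not taken from the abstract): `vlm(image, input_ids)`
    returns per-token logits for the concatenated prompt + target sequence,
    `image` is a float tensor in [0, 1], and the perturbation is kept inside
    an L-infinity ball of radius `eps`.
    """
    delta = torch.zeros_like(image, requires_grad=True)
    loss_fn = torch.nn.CrossEntropyLoss()

    for _ in range(steps):
        adv_image = (image + delta).clamp(0, 1)
        # Teacher-forced pass: score the target continuation given the prompt.
        logits = vlm(adv_image, torch.cat([prompt_ids, target_ids]))
        # Positions P-1 .. P+T-2 predict the T target tokens.
        start = len(prompt_ids) - 1
        loss = loss_fn(logits[start : start + len(target_ids)], target_ids)

        loss.backward()
        with torch.no_grad():
            # Signed-gradient descent step, then project back into the eps-ball.
            delta -= step_size * delta.grad.sign()
            delta.clamp_(-eps, eps)
            delta.grad = None

    return (image + delta).clamp(0, 1).detach()
```

Under this framing, a specific string attack would set `target_ids` to the adversary's chosen output, while a leak context attack would instead target text that echoes the model's context window; both are just different target behaviors plugged into the same matching loop.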
