all InfoSec news
Indirect Instruction Injection in Multi-Modal LLMs
Schneier on Security www.schneier.com
Interesting research: “(Ab)using Images and Sounds for Indirect Instruction Injection in Multi-Modal LLMs“:
Abstract: We demonstrate how images and sounds can be used for indirect prompt and instruction injection in multi-modal LLMs. An attacker generates an adversarial perturbation corresponding to the prompt and blends it into an image or audio recording. When the user asks the (unmodified, benign) model about the perturbed image or audio, the perturbation steers the model to output the attacker-chosen text and/or make the …
academic papers adversarial artificial intelligence audio image images injection llm llms machine learning modal recording research