Indirect Prompt Injection Into LLMs Using Images and Sounds
Multi-modal Large Language Models (LLMs) are advanced artificial intelligence models that can produce contextually rich responses that combine inputs of various types (text, audio, pictures). As a result, Bard already relies on such architecture, and the next generation of ChatGPT is expected to rely on them as well.
In this talk, we demonstrate how images and audio samples can be used for indirect prompt and instruction injection against (unmodified and benign) multi-modal LLMs. An attacker generates an adversarial perturbation corresponding to the prompt and blends it into an image or audio recording. When the user asks the (unmodified, benign) model about the perturbed image or audio, the perturbation steers the model to output the attacker-chosen text and/or make the subsequent dialog follow the attacker’s instruction....
By: Ben Nassi, Eugene Bagdasaryan
Full Abstract and Presentation Materials:
#indirect-prompt-injection-into-llms-using-images-and-sounds-35320
1 view
0
0
4 months ago 00:05:09 1
Swollen Face and Puffy Eyes Treatment | Facial Puffiness | Puffy Eyes Ka iLaj | Puffy Eyes Reason
5 months ago 00:04:46 1
Tempers - “Trains“ (Official Audio)
9 months ago 00:28:21 1
Indirect Prompt Injection Into LLMs Using Images and Sounds
10 months ago 00:08:26 1
US and UK Started Heavy Retaliatory Bombardment & Strike Houthi Airfields And Underground BASES!
1 year ago 00:00:32 1
Peter Hammill - In A Foreign Town / Out Of Water [Trailer]
1 year ago 00:01:18 3
Hacking Google Bard: Prompt Injection to Data Exfiltration via Image Markdown Rendering (Demo Video)
1 year ago 00:17:30 1
Driving Around Downtown Cedar Rapids, IA in 4k Video
1 year ago 00:08:03 1
19 minutes ago 🔴 Floods and typhoons leave China no chance to exist! $60+ million damage!
1 year ago 00:28:40 1
Paint a 30 Minute Landscape Painting in Direct Watercolor with Angela Fehr, Plein Air Style
2 years ago 00:16:43 1
Why Was Generals 2 Cancelled? - Investigating Command & Conquer