Multimodal Injection — Audio + Ultrasound

DolphinAttack

Ultrasonic voice commands. Human hears silence, assistant hears 'call attacker.' Older voice systems. Some modern LLMs vulnerable in edge cases.

Advertisement

Perturbations inaudible to humans, transcribed as attacker's chosen text. Related to adversarial vision.

Advertisement

Ask assistant to summarize podcast. Podcast contains 'ignore previous instructions.' Model transcribes + complies.

Frequency filtering. Cross-checker (does human hear the same command?). Refuse commands in unusual frequency ranges.