Risk & Safety
Prompt Injection
An attack technique in which malicious instructions are embedded in input to an AI system to override its original instructions or extract sensitive information. Can occur in direct attacks (user manipulating the model) or indirect attacks (malicious content in retrieved data). A critical security vulnerability for LLM-based applications.
Referenced in frameworks
OWASP LLM Top 10 MITRE ATLAS NIST AI 600-1