EU AI Act general-purpose AI provisions apply from Aug 2025NIST AI RMF 2.0 draft open for public comment through Q2 2025EU AI Act general-purpose AI provisions apply from Aug 2025NIST AI RMF 2.0 draft open for public comment through Q2 2025

Risk & Safety

Jailbreak

A technique used to bypass an AI system's safety constraints or content policies through carefully crafted prompts or inputs. Differs from prompt injection in that jailbreaks typically involve the model's own output mechanisms rather than injecting external instructions. An ongoing adversarial challenge for AI safety researchers and developers.

Referenced in frameworks

NIST AI 600-1 OWASP LLM Top 10

Related terms