Risk & Safety
AI Safety
The field concerned with ensuring that AI systems behave as intended and do not cause unintended harm. Encompasses both near-term safety (preventing failures in deployed AI systems) and long-term safety (preventing catastrophic outcomes from advanced AI). Increasingly a policy priority following advances in frontier AI capabilities.
Referenced in frameworks
NIST AI RMF, Bletchley Declaration