Technical
Adversarial Robustness
The ability of an AI system to maintain correct performance when exposed to adversarial inputs designed to cause errors or unexpected behaviour. Adversarial examples are inputs crafted to fool AI systems while appearing normal to humans. Critical for AI systems used in security-sensitive contexts.
Referenced in frameworks
NIST AI RMF MITRE ATLAS