Ethics & Fairness
Interpretability
The degree to which a human can understand the internal mechanisms of an AI model and predict its behaviour. More interpretable models (e.g., decision trees, linear regression) are inherently easier to audit but may be less accurate than complex models. Interpretability is also a research area that seeks to make complex models easier to understand.
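As a rough illustration of why a linear model is easy to audit, the sketch below fits a one-variable linear regression in plain Python and reads off what the model has learned. The function names (`fit_line`, `predict`) and the toy data are hypothetical, chosen only for this example.

```python
def fit_line(xs, ys):
    """Fit y = slope * x + intercept by ordinary least squares."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Covariance of x and y, and variance of x (both unnormalised).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(slope, intercept, x):
    """Apply the fitted line to a new input."""
    return slope * x + intercept


# Hypothetical toy data, made up for illustration only.
inputs = [0, 1, 2, 3]
outputs = [1, 3, 5, 7]

slope, intercept = fit_line(inputs, outputs)

# The whole model is two numbers, so a reviewer can state its behaviour
# directly: each extra unit of input changes the prediction by `slope`.
print(f"prediction = {slope} * input + {intercept}")
```

Because the fitted model consists of only a slope and an intercept, a reviewer can predict its output for any input by hand, which is not possible in the same direct way for a model with millions of parameters.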
Referenced in frameworks
NIST AI RMF, EU AI Act