AI Governance Institute logo
AI Governance Institute

Practical Governance for Enterprise AI

← All news

Topic

AI Red Teaming

Red teaming, adversarial testing of AI systems by teams tasked with finding failures, is emerging as a core component of AI safety and compliance programs. It involves probing models for harmful outputs, testing for prompt injection and jailbreaks, assessing bias and fairness, and evaluating behavior at the edges of intended use cases.

The US government has mandated red teaming for frontier AI systems under executive order requirements. The EU AI Act requires conformity assessments for high-risk systems that include testing against foreseeable misuse. NIST's AI RMF includes red teaming in its Measure function. Several major AI providers now publish red team reports as part of their responsible disclosure programs.

This hub tracks red teaming requirements, methodologies, findings, and the standards taking shape around adversarial AI testing.

5 items