AI Governance Institute logo
AI Governance Institute

Practical Governance for Enterprise AI

Agentic AI
AGT · Agentic AIAGT-012Medium effortAgent-relevant

Agent Kill Switch and Emergency Stop

Maintain the operational capability to halt any running agent session, workflow, or agent class immediately — without relying on the agent itself to stop — and recover to a known-safe state.

Objective

Preserve human control over in-flight agent executions by ensuring a reliable, tested mechanism exists to stop agents that are behaving unexpectedly, consuming excessive resources, or taking unauthorized actions.

Maturity Levels

1

Initial

No kill mechanism exists; stopping a runaway agent requires manually terminating infrastructure with significant lead time.

2

Developing

Infrastructure-level termination is possible but requires engineering access and is not documented or tested.

3

Defined

A documented stop procedure exists for each agent class with named responsible parties; procedures are known to the on-call team.

4

Managed

Stop procedures are tested quarterly; recovery to known-safe state is verified after each test; response time targets are met.

5

Optimizing

Automated circuit breakers pause agents when behavioral thresholds are exceeded; human-triggered stop is a single, audited action available to designated operators 24/7.

Evidence Requirements

What an auditor or assessor would expect to see for this control.

  • Stop procedure documentation covering individual session, agent class, and full deployment scopes, with named responsible parties, trigger criteria, and target response times
  • Quarterly stop procedure test records showing time-to-stop for each scope and verification that recovery to known-safe state succeeded
  • Authority matrix confirming who can trigger each stop scope and under what conditions, without requiring additional approval
  • Partial-completion recovery playbook documenting how in-flight state is assessed and handled after an emergency stop
  • On-call contact list for stop-authorized personnel reviewed and updated at least quarterly, with evidence of last review

Implementation Notes

Key steps

  • Design stop capability into agents at the architecture level: every agent execution environment must support immediate session termination without waiting for the agent to reach a natural stopping point.
  • Define three stop scopes and document each separately: (1) individual session termination, (2) agent class pause (all instances of one agent type), and (3) full deployment suspension.
  • Assign explicit stop authority: who can trigger each scope, under what conditions, and without requiring additional approval — ambiguity in a crisis costs time.
  • Test stop procedures on a schedule and document the result: an untested kill switch is not a kill switch.
  • Plan for partial-completion states before you need them: if an agent is stopped mid-workflow, what data was written, what external calls were made, and how do you recover or roll back to a safe state?

Example Implementation

Healthcare organization running scheduling and medical records agents over patient data

Agent Emergency Stop Runbook — Healthcare Agent Platform

Stop scope levels:

ScopeTrigger CriteriaStop AuthorityTarget ResponseMethod
Session stopSingle agent behaving unexpectedlyAny authorized operator<60 secondsSession termination API; session flagged in audit log
Agent class pausePattern of anomalies across agent typeAI Eng lead or on-call<5 minutesFeature flag disable; queued sessions rejected
Full deployment stopActive patient harm or data breachCISO or designee<15 minutesInfrastructure shutdown; incident declared

On-call stop authority: AI Eng on-call + CISO (or delegate) — contacts reviewed monthly

Partial-completion recovery:

  • Any session terminated mid-workflow: state snapshot written before termination; human reviewer notified within 15 minutes to assess and complete manually if needed
  • No external write (EHR update, message send) confirmed without human verification after emergency stop

Test cadence: Quarterly — session stop tested monthly; class pause and full stop tested quarterly; results logged and signed off by AI Governance lead