Agent Kill Switch and Emergency Stop

Maintain the operational capability to halt any running agent session, workflow, or agent class immediately — without relying on the agent itself to stop — and recover to a known-safe state.

Objective

Preserve human control over in-flight agent executions by ensuring a reliable, tested mechanism exists to stop agents that are behaving unexpectedly, consuming excessive resources, or taking unauthorized actions.

Maturity Levels

Initial

No kill mechanism exists; stopping a runaway agent requires manually terminating infrastructure with significant lead time.

Developing

Infrastructure-level termination is possible but requires engineering access and is not documented or tested.

Defined

A documented stop procedure exists for each agent class with named responsible parties; procedures are known to the on-call team.

Managed

Stop procedures are tested quarterly; recovery to known-safe state is verified after each test; response time targets are met.

Optimizing

Automated circuit breakers pause agents when behavioral thresholds are exceeded; human-triggered stop is a single, audited action available to designated operators 24/7.

Evidence Requirements

What an auditor or assessor would expect to see for this control.

—Stop procedure documentation covering individual session, agent class, and full deployment scopes, with named responsible parties, trigger criteria, and target response times

—Quarterly stop procedure test records showing time-to-stop for each scope and verification that recovery to known-safe state succeeded

—Authority matrix confirming who can trigger each stop scope and under what conditions, without requiring additional approval

—Partial-completion recovery playbook documenting how in-flight state is assessed and handled after an emergency stop

—On-call contact list for stop-authorized personnel reviewed and updated at least quarterly, with evidence of last review

Implementation Notes

Key steps

Design stop capability into agents at the architecture level: every agent execution environment must support immediate session termination without waiting for the agent to reach a natural stopping point.
Define three stop scopes and document each separately: (1) individual session termination, (2) agent class pause (all instances of one agent type), and (3) full deployment suspension.
Assign explicit stop authority: who can trigger each scope, under what conditions, and without requiring additional approval — ambiguity in a crisis costs time.
Test stop procedures on a schedule and document the result: an untested kill switch is not a kill switch.
Plan for partial-completion states before you need them: if an agent is stopped mid-workflow, what data was written, what external calls were made, and how do you recover or roll back to a safe state?

Healthcare organization running scheduling and medical records agents over patient data

Agent Emergency Stop Runbook — Healthcare Agent Platform

Stop scope levels:

Scope	Trigger Criteria	Stop Authority	Target Response	Method
Session stop	Single agent behaving unexpectedly	Any authorized operator	<60 seconds	Session termination API; session flagged in audit log
Agent class pause	Pattern of anomalies across agent type	AI Eng lead or on-call	<5 minutes	Feature flag disable; queued sessions rejected
Full deployment stop	Active patient harm or data breach	CISO or designee	<15 minutes	Infrastructure shutdown; incident declared

On-call stop authority: AI Eng on-call + CISO (or delegate) — contacts reviewed monthly

Partial-completion recovery:

Any session terminated mid-workflow: state snapshot written before termination; human reviewer notified within 15 minutes to assess and complete manually if needed
No external write (EHR update, message send) confirmed without human verification after emergency stop

Test cadence: Quarterly — session stop tested monthly; class pause and full stop tested quarterly; results logged and signed off by AI Governance lead

Agent Kill Switch and Emergency Stop

Maturity Levels

Evidence Requirements

Implementation Notes

Key steps

Example Implementation

Agent Emergency Stop Runbook — Healthcare Agent Platform

Control Details

Tags

Mapped Regulations

Related Controls

Related Playbook

Recent Coverage