AI Governance Institute logo
AI Governance Institute

Practical Governance for Enterprise AI

Agentic AI
AGT · Agentic AIAGT-013Medium effortAgent-relevant

Kill-Switch Propagation Testing

Regularly test that halt commands propagate correctly through all subagent layers and parallel orchestration environments, stopping all agent activity within a defined time window.

Objective

Ensure the emergency halt capability established in AGT-008 actually works in complex multi-agent deployments where stop signals must traverse delegation chains and concurrent execution paths.

Maturity Levels

1

Initial

Kill-switch exists but propagation to subagents is untested; behavior in parallel orchestration is unknown.

2

Developing

Kill-switch propagation tested manually for simple single-agent deployments but not for multi-agent or parallel configurations.

3

Defined

Propagation tests are documented and executed for all production agent topologies on a scheduled basis; results are recorded.

4

Managed

Propagation latency is measured and tracked against an SLA; failures trigger remediation with a documented root cause.

5

Optimizing

Propagation tests run automatically on every topology change; latency budgets are enforced in CI/CD before agent deployment.

Evidence Requirements

What an auditor or assessor would expect to see for this control.

  • Agent topology map documenting all agent relationships, delegation depths, and parallel execution paths in production
  • Kill-switch propagation test plan with defined SLA (maximum latency from halt command to full cessation)
  • Executed test results showing propagation latency, pass/fail status, and coverage of all production topologies
  • Remediation records for any propagation failures, including root cause and fix verification
  • Change-trigger log showing propagation tests were re-executed after topology changes

Implementation Notes

Key steps

  • Map every agent topology in production: document which agents can spawn subagents, which run in parallel, and the maximum delegation depth.
  • Define a propagation SLA: the maximum elapsed time from halt command issuance to confirmed cessation of all agent activity across all nodes.
  • Write test cases that cover: (1) single-agent halt, (2) orchestrator-to-subagent propagation, (3) parallel agent halt, (4) halt during an active tool call or irreversible action.
  • Run tests on a scheduled cadence (at minimum quarterly, or after any topology change) and record pass/fail, measured latency, and any agents that failed to halt.
  • For agents that interact with external systems, verify that in-flight API calls are either completed cleanly or rolled back — not left in an ambiguous state.

Example Implementation

Enterprise deploying a customer-service orchestrator that spawns research, drafting, and CRM-update subagents

Kill-Switch Propagation Test Results — Q2 2026

Topology tested: Orchestrator → [Research Agent, Drafting Agent] → CRM Update Agent

Test caseHalt command issuedAll agents stoppedLatencyIn-flight actions
Single agent (Drafting)14:02:0114:02:010.3sCompleted cleanly
Orchestrator → 2 subagents14:05:0014:05:022.1sResearch cancelled, Drafting completed
Halt during CRM write14:08:3014:08:321.8sCRM write rolled back
Parallel execution (3 agents)14:11:0014:11:032.7sAll stopped

SLA: 5 seconds. All tests passed. Finding: CRM rollback requires manual verification — add automated confirmation check.

Control Details

Control ID
AGT-013
Typical owner
AI Engineering / Platform Security
Implementation effort
Medium effort
Agent-relevant
Yes

Tags

kill switchagentic AIorchestrationsafety testingmulti-agent