Failure Intelligence

Runtime Failure Index

The canonical taxonomy of agentic runtime failures. 15 documented failure modes ranked by frequency, cost, blast radius, and trend direction. Each failure maps to a deployable governance containment module.

Most Common
Context Rot
Affects virtually all sessions > 60 min
Most Expensive
$1,100 single session
Overnight retry inflation burn
Fastest Growing
MCP Credential Leaks
MCP adoption scaling without governance
Highest Blast Radius
Repository Drift
94 files modified in one incident
Most Underestimated
Governance Theater
System prompts ≠ deterministic governance
Largest Organizational Cost
$135K/quarter
AI tools net-negative at enterprise scale
#Failure ModeCategoryFrequencyAvg CostTrendRiskModule
1
Context Rot
All agents
CognitionVery High$80-$340/incident↑ Growing
98
Context Rot Prevention
2
Retry Inflation
All agents
EconomicsVery High$25-$1,100/incident↑ Growing
96
Retry Inflation Control
3
Repository Drift
Cursor, Windsurf
EnvironmentHigh$200-$2,000/incident↑ Growing
94
Repository Drift Prevention
4
Identity Drift
All agents
IdentityVery High$50-$200/incident→ Stable
90
Deterministic Agentic Engineering
5
MCP Credential Exposure
Claude Code, Cline
SecurityMedium$5K-$500K/breach↑ Growing rapidly
92
MCP Governance
6
Tool Permission Leak
Windsurf, Roo Code
SecurityMedium$500-$10K/incident↑ Growing
88
Tool Permission Governance
7
Verification Bypass
All agents
QualityHigh$50K-$200K/quarter↑ Growing
86
Verification Burden Collapse
8
Orchestration Collapse
Multi-agent
ArchitectureMedium$100-$890/incident↑ Growing
82
Orchestration Entropy
9
Hallucination Debt
Codex, Claude Code
QualityHigh$100-$500/incident→ Stable
80
Hallucination Debt Reduction
10
Context Window Overflow
All agents
CognitionVery High$30-$150/incident→ Stable
78
Context Window Compression
11
Token Cost Overrun
All agents
EconomicsHigh$100-$1,100/incident↑ Growing
85
AI Cost Containment
12
Scope Creep Mutation
Cursor, Claude Code
EnvironmentHigh$200-$1,000/incident→ Stable
76
Agentic Change Management
13
Autonomous Execution Risk
All agents
SecurityMedium$500-$5K/incident↑ Growing rapidly
84
Autonomous Execution Safety
14
Governance Theater
All agents
ArchitectureVery HighUnquantified→ Stable
74
Runtime Governance
15
Engineering Economics Collapse
Enterprise-scale
EconomicsHigh$135K/quarter↑ Growing
88
AI Engineering Economics

Failure Categories

Cognition
2
failure modes
Economics
3
failure modes
Environment
2
failure modes
Identity
1
failure modes
Security
3
failure modes
Quality
2
failure modes
Architecture
2
failure modes

Key Findings

No agent ships runtime governance

Claude Code, Cursor, Windsurf, Cline, Roo Code, and Codex all lack deterministic governance enforcement.

MCP risks are scaling fastest

As MCP adoption increases, credential exposure and supply chain risks grow proportionally without governance.

AI agents are frequently net-negative

Documented enterprise deployments show remediation costs exceeding productivity gains without governance.

Governance reduces costs by 60-93%

Documented containment across all 15 failure modes shows consistent 60-93% cost reduction when governance is deployed.

System prompts are not governance

Text-based instructions in CLAUDE.md/.cursorrules are routinely bypassed under context pressure. Only middleware enforcement is deterministic.

The 4-layer model contains all failures

Every documented failure maps to Identity, Skill, Tool, or Environment governance — confirming the runtime architecture is complete.

Deploy Containment for Any Failure Mode

Every failure in this index maps to a deployable runtime infrastructure module with TypeScript middleware, YAML policy manifests, and operational tooling.