They shipped an update on a Friday evening. Over the weekend, one AI agent got stuck retrying malformed patient records for nearly 62 hours straight. There was no token ceiling, so the cost compounded silently until a US$340K invoice landed on Monday — by then the tokens were spent. A post-mortem traced the damage to four governance gaps. The eight-week rebuild brought monthly AI spend from US$412K down to US$71K.
budget_usd_month ceiling and tier-aware model
routing (ADR-014). A runaway agent hits a cap measured in dollars, not discovered in an invoice.cc_audit_log with a SHA-256 hash
chain. "Which agent touched this record, with which model, when?" is a query, not an investigation.ask-before-execute guardrails mean
sensitive or production actions require explicit sign-off. Nothing high-stakes goes out unattended.