Catalog
Curated runbooks for major incident types and mitigation flows.
| Name | Version | Last reviewed | Owner | Status | Summary |
|---|---|---|---|---|---|
| Telemetry pipeline backlog mitigation | v6.2 | Oct 10, 2025 | Observability Core | Validated | Steps to triage ingestion slowdowns, scale throughput, and validate catch-up. |
| Contoso checkout failover | v4.9 | Sep 28, 2025 | Commerce Ops | Validated | Guidance for promoting the standby region and rebalancing queue workers. |
| Identity broker hard reset | v3.4 | Aug 30, 2025 | Identity Platform | Needs review | Protocol for coordinated cache eviction and certificate rotation. |
| Service bus poison queue cleanup | v2.1 | Jul 18, 2025 | Messaging | Needs review | Automation for isolating bad payloads and replaying validated messages. |

Update reminders
Top follow-ups to keep the catalog fresh.