Incident Management
Native incident timeline, AI-powered investigation, and automated post-mortem generation.
Incident Management
Koalr provides native incident management that correlates deployment events with service health — so you can answer "did this deploy cause the incident?" instantly.
What Koalr tracks
| Signal | Source |
|---|---|
| Incident created / resolved | PagerDuty, OpsGenie, incident.io |
| Severity & priority | Synced from your alert provider |
| Affected services | Correlated via service catalog |
| Linked deployments | Auto-matched within ±2 hours of incident start |
| Time to detection | Alert fired → human acknowledged |
| MTTR | Incident opened → resolved |
AI-Powered Investigation
When an incident opens, Koalr's AI panel automatically generates an investigation summary:
- Root cause candidates — which recent deployments touched the affected service
- Risk signal audit — what the deploy risk score was at merge time
- Runbook suggestions — pulled from previous similar incidents
- Blast radius estimate — which other services depend on the affected component
To view the investigation, open the incident detail page and click AI Investigation in the right panel.
Post-Mortem Generator
After an incident resolves, Koalr generates a structured post-mortem draft:
- Navigate to Incidents → [incident name] → Post-Mortem
- Review the auto-populated timeline, contributing factors, and remediation steps
- Edit directly in the browser or export to Markdown / PDF
- Publish internally — Koalr tracks open action items and nudges owners
The post-mortem template follows the Google SRE format with sections for impact, timeline, root cause, contributing factors, and action items.
Connecting an alert provider
To enable incident management, connect at least one alert provider:
Once connected, incidents sync automatically via webhook. Historical incidents are backfilled up to 90 days.
MTTR & DORA correlation
Koalr surfaces MTTR (Mean Time to Restore) as a first-class DORA metric on the DORA dashboard. Each data point links to the underlying incident so you can drill into outliers.
See DORA Metrics for benchmarks and trend analysis.