**TL;DR:** A structured AI agent for coordinating real-time incident response across your entire stack — with built-in rollback triggers, stakeholder dashboards, and audit trails.
This skill implements a tier-ARCHITECT incident management protocol with five distinct phases:
1. Detection — Monitors your alerting pipeline and fires on-threshold events from your APM
2. Triage — Runs root-cause analysis using your system's existing runbooks
3. Coordination — Assigns tasks, updates status pages, pings on-call via webhooks
4. Resolution — Executes predefined rollback playbooks
5. Review — Generates a postmortem with timeline reconstruction
```bash
npm install @mrtech/incident-commander
```
```json
{
"incidentCommander": {
"sources": [
{ "type": "pagerduty", "apiKey": "$PD_API_KEY" },
{ "type": "prometheus", "url": "http://prometheus:9090" }
]
}
}
```
```bash
npx incident-commander drill --scenario=memory-leak --target=api-service-07
```
```
Run an incident drill for a memory leak scenario targeting api-service-07.
Assume full prod access. Generate a postmortem markdown in /tmp/postmortem.md.
```
| Dimension | Rating | Notes |
|-----------|--------|-------|
| Speed | 4/5 | Sub-5-minute mean time to respond |
| Coverage | 5/5 | Handles 90%+ of standard SRE playbook scenarios |
| Integrations | 4/5 | Supports major APM tools, webhook-flexible |
| Learning Curve | 3/5 | Requires some YAML config |
| Cost | 4/5 | Open source core |
Incident Commander is the agent you want between your alerting system and your human on-call. It removes the cognitive overhead during a P0. The audit trail alone (timestamped decisions, attributed actions, auto-generated postmortems) makes it worth the setup investment.
Skill: incident-commander | Ecosystem: claude-code | Runtime: node | Auth: api-key