Skip to content

Labs

Collection of short interactive stories about IT, DevOps, and systems in operation. Hypothetical scenarios inspired by real-world problems. It is not meant to be a production simulator.

Available labs

Adaptive 5 min

The noisy alert

Triage a noisy latency alert, preserve context, and decide when recovery is not enough.

Topics
Incident response Observability Communication
First decision
What do you do first?
Start lab
Deterministic 7 min

When production failed

Follow the first production failure behind the post and decide what to learn from it.

Topics
Recovery Replicability Automation
First decision
The requirement is simple: publish open geographic data. What do you optimize for first?
Start lab