Operational Excellence for Engineering Managers
A practical guide for engineering managers building operational excellence on their own team — reliability, SLOs, on-call, incident response, and bug management — while aligning to a standard practice shared across teams. Covers the technical mechanics, the safety and learning culture underneath them, and how to win buy-in from PMs and the wider org.
How this plan was made
Each plan on learnings is built by a hand-crafted agentic pipeline: research agents gather primary sources, a claim reviewer verifies facts against them, and a sequencer orders modules for how people actually learn. The curation — topic selection, framing, editorial standards — is Nicolas's. The research and writing is AI-assembled.