Incident Response doesn’t have to be painful: Common pitfalls and recommendations


This post challenges misconceptions about chaotic on-call and livesite practices, offering lessons from extensive experience. It introduces common red flags such as call hell, hero worship, and the wild west, and provides solutions for each. These include customer-focused monitoring, monitoring pruning, the 1-2-3 troubleshooting rule, follow-the-sun schedules, and repair item deadlines. As services mature, standardized incident response and efficient toil control practices become crucial.

The cow and the chicken: overcoming resistance to change


A cow and a chicken are journeying through the countryside and see a diner with a sign that reads "Steak and Eggs". The chicken nudges the cow and cackles delightedly: "Look! Look, Cow!! We're famous!!!" The cow looks at the same sign and snorts derisively: "You're involved! I am committed!!"

Help! I’m stuck! I want to get to the next level!!


This article clarifies how promotions work and highlights potential pitfalls. It describes a career-growth framework based on technical skills, interpersonal relationships, and influence. As one becomes more senior, the balance between these skills shifts, demanding more focus on relationships and influence.

Keep calm and carry on: Taming the siren call of overwhelm and overwork


If you are constantly overwhelmed and too busy, something, somewhere, somehow is wrong. Read on to learn more about my journey from an overwhelmed, feisty, flustered newbie to an equanimous, calm, collected professional with arguably more impact.

The complicated parts of leadership: Eliminate chaos


"How can I trust you? You keep telling me about a new direction every other month." That was my report, after the third organizational pivot within three months. I was dumbstruck; there was nothing I could say. As a line manager, I was responsible for delivering the message even though I didn't make some …

Evaluating Managers: 5 heuristics to measure managerial impact


Measuring a manager's impact is hard since outcomes take time. The manager takes full responsibility for the team - be it stagnation, execution woes, poor collaboration, churn, or a lack of focus. This post provides early evaluation metrics as well as tips for course correction.

Scaling past Port Exhaustion: when 100k+ ports were not enough


One of my favorite technical projects involved overcoming a network constraint. The virtual machines (VMs) hosting the core services kept exhausting available ports. Once all ports were used up, new connections would fail, tanking our availability and reliability. Read on to learn how we overcame this issue and opened up opportunities to reduce costs by a third.

Book Review: A Philosophy of Software Design


The book’s core thesis is that software development should minimize complexity, and it presents design approaches for doing so. The upfront investment in learning and adopting better designs pays off because it leads to high-quality software. A recommended read for software developers and line managers.