Book: Engineering Highly Available Systems, Mark A. Guyton

Engineering Highly Available Systems

A Guide to Fault Tolerance

Author: Mark A. Guyton
Language: English
Binding: Paperback
Availability: Restock expected
Ships on 29 June 2026
11 893 Ft

About the book

Author: Mark A. Guyton
Language: English
Binding: Book - Paperback
Published: 2026
Pages: 230
EAN: 9798184275246
Enbook ID: 53017331
Weight: 375 g
Dimensions: 170 x 244 x 12 mm

Full description

I know the feeling perfectly. You are staring at a blinding monitor at 3:00 AM. Alarms are blaring in the incident channel, thousands of dollars in revenue are vanishing by the minute, and executive leadership is demanding immediate answers. Your microservices are trapped in a cascading death spiral, and nobody knows why. We have all been there. It is a terrifying, gut-wrenching experience.

But what if you could completely rewrite that narrative? What if, the next time a critical database node exploded, your architecture simply caught the error, tripped a circuit breaker, rerouted the global traffic, and seamlessly healed itself before a single customer even noticed the glitch? I wrote this book to pull you out of the frantic, reactive firefighting business and place you firmly in the proactive, architectural driver's seat.

What's inside
  • Defensive Design Patterns: Master the implementation of circuit breakers, architectural bulkheads, load shedding, and intelligent retry loops.
  • Deep Observability: Transform operational blind spots into crystal-clear insights using distributed tracing, structured logging, and eBPF technology.
  • Incident Command Systems: Discover proven, military-grade strategies to bring absolute order to the chaos of a SEV-1 production outage.
  • The Culture of Reliability: Learn the psychological frameworks required to execute truly blameless postmortems, manage technical debt, and enforce strict error budgets.
  • The Production Checklist: Gain access to a comprehensive, uncompromising gatekeeper checklist to validate your system before it ever touches the public internet.

Who it's meant for

If you are a backend developer tired of writing fragile code, a cloud architect tasked with safely scaling a platform to millions of concurrent users, or an engineering manager desperately trying to protect your operations team from alert fatigue and burnout, this is for you. Whether you are actively transitioning into a dedicated Site Reliability Engineering role or you are a veteran technical lead looking to modernize your infrastructure, you will find actionable, real-world wisdom on every single page.

Call to action

The next catastrophic outage is already ticking like a time bomb inside your legacy codebase. The only question is whether you will be a victim of the chaos, or the architect who engineered the cure. Stop waiting for your systems to break. Grab your copy today, master the art of fault tolerance, and start building distributed software that flat-out refuses to fail.