the_site_reliability_workbook.pdf

(11770 KB) Pobierz
Praise for
The Site Reliability Workbook
This new workbook will help people to take the sometimes theoretical and abstract
concepts covered in
Site Reliability Engineering
out of the special context of the
Googleplex and see how the same concepts work in other organizations. I’m especially
excited to see more detail in the analysis of toil, how to apply SRE principles to data
pipelines, and the case study reports discussing practical service level management.
—Kurt Andersen, Site Reliability Engineer, LinkedIn
This practical hands-on guide to implementing SRE is valuable for engineers at
companies of all sizes. It’s excellent to see this workbook being shared so that we
can all move forward and build more reliable systems together. I was impressed
with the level of detail shared; you can pick this book up and get started
implementing SRE practices today.
—Tammy Bütow, Principal SRE, Gremlin
A timely reminder, from the team that made SRE a required practice for everyone
operating at scale, that reliability is created by people. This book is full of practical
examples of how to optimize for reliability by focusing on the interactions between
users and engineers and between technology and tools, without losing sight of feature
velocity. The result is a compelling, interesting, and thought-provoking companion to
Site Reliability Engineering.
—Casey Rosenthal, CTO, Backplane.io
Google’s first book explained the what and why of SRE. This book shows you how to
implement SRE at any company, startup or giant. Great work by the editorial team.
—Jonah Horowitz, SRE at Stripe
In 2016, Google dropped
Site Reliability Engineering
on the operations world, and the
operations world was never the same. For the first time people had access to over 500
pages of distilled information on what Google does to run its planet-wide infrastructure.
Most people liked the book, a handful didn’t, but nobody ignored it. It became a seminal
work and an important touchstone for how people thought about SRE (especially the
Google implementation of it) from that point on. But it was missing something….
Now in 2018, Google returns to fill in a crucial piece of the puzzle: in their first
volume they described what they do, but that didn’t help those who couldn’t see
themselves in Google’s story. This book aims to demonstrate
how
Google does SRE—
and how you can do it, too.
—David N. Blank-Edelman, editor of
Seeking SRE:
Conversations about Running Production Systems at Scale
and cofounder of the global set of SREcon conferences
The Site Reliability Workbook
Practical Ways to Implement SRE
Edited by Betsy Beyer, Niall Richard Murphy,
David K. Rensin, Kent Kawahara,
and Stephen Thorne
Beijing
Boston Farnham Sebastopol
Tokyo
Zgłoś jeśli naruszono regulamin