Preface

Chapter 1: Introduction
A brief history; What is SRE?; What is in the book?; SRE as a framework for new projects; Summary; References

Chapter 2: Monitoring
Why monitoring?; Instrumenting an application; What should we measure?; A short introduction to SLIs, SLOs, and error budgets; Service levels; Error budgets; Collecting and saving monitoring data; Polling applications; Nagios; Prometheus; Cacti; Sensu; Push applications; StatsD; Telegraf; ELK; Displaying monitoring information; Arbitrary queries; Graphs; Dashboards; Chatbots; Managing and maintaining monitoring data; Communicating about monitoring; Do they even know there is monitoring?; References and related reading; Future reading; Summary

Chapter 3: Incident Response
What is an incident?; What is incident response?; Alerting; When do you alert?; How do you alert?; Alerting services; What is in an alert?; Who do you alert?; Being on call; Communication; Incident Command System (ICS); Where do you communicate?; Recovering the system; Calling all clear; Summary

Chapter 4: Postmortems
What is a postmortem?; Why write a postmortem?; When to write a postmortem document; Carrying out incident analysis; How to write a postmortem document; Summary; Impact; Timeline; Root cause; Action items; Postmortems without action items; Appendix; Blameless postmortems; Holding a postmortem meeting; Analyzing past postmortems; MTFR and MTBF; Alert fatigue; Discussing past outages; Summary; References

Chapter 5: Testing and Releasing
Testing; What do you test?; Testing code; Testing infrastructure; Testing processes; Releasing; When to release; Releasing to production; Validating your release; Rollbacks; Automation; Continuous everything; Summary

Chapter 6: Capacity Planning
A quick introduction to business finance; Why plan?; Managing risk and managing expectations; Defining a plan; What is our current capacity?; When are we going to run out of capacity?; How should we change our capacity?; State and concurrency; Is your service limited by another service?; Scaling for events; Unpredictable growth - user-generated content; Preplanned versus autoscaling; Delivering; Execute the plan; Architecture - where performance changes come from; Tech as a profit center and procurement; Summary

Chapter 7: Building Tools
Finding projects; Defining projects; RDD; Example; Design documents; Planning projects; Example; Retrospectives and standups; Allocation; Building projects; Advice for writing code; Separation of concerns; Long-term work; Example OKRs; Notebooks; Documenting and maintaining projects; Summary

Chapter 8: User Experience
An introduction to design and UX; Real-world interaction design; User testing; Picking an experience; Designing the test; Finding people to test; Developer experience; Experience of tools; Performance budgets; Security; Authentication; Authorization; Risk profile; Phishing; ACM code of ethics; Summary; References

Chapter 9: Networking Foundations
The internet; Sending an HTTP request; DNS; dig; Ethernet and TCP/IP; Ethernet; IP; CIDR notation; ICMP; UDP; TCP; curl and wget; Tools for watching the network; netstat; nc; tcpdump; Summary

Chapter 10: Linux and Cloud Foundations
Linux fundamentals; Everything is a file; Files, directories, and inodes; Sockets; Devices; /proc; Filesystem layout; What is a process?; Zombies; Orphans; What is nice?; Syscalls; How to trace; Watching processes; Build your own; Cloud fundamentals; VMs; Containers; Load balancing; Autoscaling; Storage; Queues and pub/sub; Units of scale; Example architecture interview; Summary; References

Other Books You May Enjoy

Index