Foreword
Preface

Part I. Introduction

1. Introduction
    The Sysadmin Approach to Service Management
    Google's Approach to Service Management: Site Reliability Engineering
    Tenets of SRE
    The End of the Beginning
2. The Production Environment at Google, from the Viewpoint of an SRE
    Hardware
    System Software That "Organizes" the Hardware
    Other System Software
    Our Software Infrastructure
    Our Development Environment
    Shakespeare: A Sample Service

Part II. Principles

3. Embracing Risk
    Managing Risk
    Measuring Service Risk
    Risk Tolerance of Services
    Motivation for Error Budgets
4. Service Level Objectives
    Service Level Terminology
    Indicators in Practice
    Objectives in Practice
    Agreements in Practice
5. Eliminating Toil
    Toil Defined
    Why Less Toil Is Better
    What Qualifies as Engineering?
    Is Toil Always Bad?
    Conclusion
6. Monitoring Distributed Systems
    Definitions
    Why Monitor?
    Setting Reasonable Expectations for Monitoring
    Symptoms Versus Causes
    Black-Box Versus White-Box
    The Four Golden Signals
    Worrying About Your Tail (or, Instrumentation and Performance)
    Choosing an Appropriate Resolution for Measurements
    As Simple as Possible, No Simpler
    Tying These Principles Together
    Monitoring for the Long Term
    Conclusion
7. The Evolution of Automation at Google
    The Value of Automation
    The Value for Google SRE
    The Use Cases for Automation
    Automate Yourself Out of a Job: Automate ALL the Things!
    Soothing the Pain: Applying Automation to Cluster Turnups
    Borg: Birth of the Warehouse-Scale Computer
    Reliability Is the Fundamental Feature
    Recommendations
8. Release Engineering
    The Role of a Release Engineer
    Philosophy
……