消息首页搜索举报

sre实战(影印版)(英文版) 英文原版书 (美)纳特·韦尔奇

正版书籍支持7天无理由

62.7 6.5折 96 九五品

库存4件

河北保定

认证卖家担保交易快速发货售后保障

作者(美)纳特·韦尔奇

出版社东南大学

ISBN9787564182939

出版时间2019-03

版次1

装帧平装

开本16

页数323页

定价96元

货号xhwx_1201871661

上书时间2024-09-14

典则俊雅图书专营店

五年老店

已实名已认证进店收藏店铺

在售商品暂无
平均发货时间 29小时
好评率暂无

最新上架

中国舞蹈史话戏剧、舞蹈常 ¥14.80

梓翁说园(精装本) 园林艺术陈从周 ¥14.80

弟子规(图文版) (清)李毓秀 ¥9.20

森林报少儿中外名著 (苏)维·比安基 ¥5.90

有机化学(中等职业学校规划教材) 化工技术黎春南 ¥11.80

汽车构造(底盘部分) 大中专高职交通沈沉编 ¥15.80

建筑结构大中专理科建筑刘雁主编 ¥30.90

文徵明书千字文三种毛笔书法孙宝文编 ¥18.10

史记中国古典小说、诗词何永丰编 ¥6.90

商品详情

品相描述：九五品: 正版特价书籍

商品描述: 目录：

preface
chapter 1: introduction
a brief history
what is sre?
what is in the book?
sre as a framework for new projects
summary
references
chapter 2: monitoring
why monitoring?
instrumenting an application
what should we measure?
a short introduction to slis, slos, and error budgets
service levels
error budgets
collecting and saving monitoring data
polling applications
nagios
prometheus
cacti
sensu
push applications
statsd
telegraf
elk
disying monitoring information
arbitrary queries
graphs
dashboards
chatbots
managing and maintaining monitoring data
municating about monitoring
do they even know there is monitoring?
references and related rea
future rea
summary
chapter 3: incident response
what is an incident?
what is incident response?
alerting
when do you alert?
how do you alert?
alerting services
what is in an alert?
who do you alert?
being on call
munication
incident mand system (ics)
where do you municate?
recovering the system
calling all clear
summary
chapter 4: tmortems
what is a tmortem?
why write a tmortem?
when to write a tmortem document
carrying out incident analysis
how to write a tmortem document
summary
impact
timeline
root cause
action items
tmortems without action items
appendix
blameless tmortems
hol a tmortem meeting
analyzing past tmortems
mtfr and mtbf
alert fatigue
discussing past outages
summary
references
chapter 5: testing_and releasing_
testing
what do you test?
testing code
testing infrastructure
testing processes
releasing
when to release
releasing to production
validating your release
rollbacks
automation
continuous everything
summary
chapter 6: capacity nning
a quick introduction to business finance
why n?
managing risk and managing expectations
defining a n
what is our current capacity?
when are we going to run out of capacity?
how should we change our capacity?
state and concurrency
is your service limited by another service?
scaling for events
unpredictable growth-user-generated content
prenned versus autoscaling
delivering
execute the n
architecture--where performance changes e from
tech as a profit center and procurement
summary
chapter 7: buil tools
fin projects
defining projects
rdd
example
design documents
nning projects
example
retrospectives and standu
allocation
buil projects
advice for writing code
separation of concerns
long-term work
example okrs
notebooks
documenting and maintaining projects
summary
chapter 8: user experience
an introduction to design and ux
real-world interaction design
user testing
picking an experience
designing the test
fin people to test
developer experience
experience of tools
performance budgets
security
authentication
authorization
risk profile
phishing
acm code of ethics
summary
references
chapter 9: working foundations
the inter
sen an request
dns
dig
ether and tcp/ip
ether
ip
cidr notation
icmp
udp
tcp

curl and wget
tools for watching the work
stat
nc
tcpdump
summary
chapter 10: linux and cloud foundations
linux fundamentals
everything is a file
files, directories, and inodes
sockets
devices
/proc
filesystem layout
what is a process?
zombies
orphans
what is nice?
syscalls
how to trace
watching processes
build your own
cloud fundamentals
vms
containers
load balancing
autoscaling
storage
queues and pub/sub
units of scale
example architecture interview
summary
references
other books you may enjoy
index

内容简介：

本书是软件开发人员在灾难故障中的优选生存指南。随着企业力求实现正常运行时间的很大化，站点可靠工程（ite reliability engineering，re）首当其冲。当你的站点出现问题，修复故障已经迫在眉睫的时候，本书可以作为一个手把手的作框架。nat welch在可靠工程方面丰富的实战经验源自于inter上某些很大的公司，这些公司对于系统中断事件极为敏感。他所用于监控现代web服务、设置警报和评估事件响应的方法都经过了实践的验，学会这些必将助你一臂之力。

— 没有更多了 —