treat operations as if it’s a software problem. Our mission is to protect, provide for, and progress the software and systems behind all of Google’s public services with an ever-watchful eye on their availability, latency, performance, and capacity. SREɺӡ༻্ͷΛιϑτΣΞతʹղܾ͢ΔͨΊͷΤϯδχΞϦϯάͰ͢ɻ ࢲͨͪͷ໋ɺGoogleͷαʔϏεͷՄ༻ੑɺϨΠςϯγɺύϑΥʔϚϯεɺ ΩϟύγςΟΛ ৗʹࢹ͠ͳ͕ΒकΓɺਐาͤ͞Δ͜ͱͰ͢ɻ
for the availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of their service(s). ҰൠతʹɺSREνʔϜɺαʔϏεͷՄ༻ੑɺϨΠςϯγɺ ύϑΥʔϚϯεɺޮੑɺมߋཧɺϞχλϦϯάɺۓٸରԠɺ ΩϟύγςΟϓϥϯχϯάʹΛ࣋ͪ·͢ɻ
https://landing.google.com/sre/sre-book/toc/ • The Site Reliability Workbook Chapter 2 - Implementing SLOs, “this is the most important chapter in this book” https://landing.google.com/sre/workbook/toc/ • SRE νʔϜͷධՁʹཱͭϨϕϧผνΣοΫ Ϧετ SREͷجຊͱͯ͠ɺ࠷ॳͷ߲ͱͯ͠հ͞Ε͍ͯΔ https://cloudplatform-jp.googleblog.com/2019/02/how-to-start-and-assess-your-sre-journey.html • Google͕ղઆ - ଞࣾͷSRE࣮ફͳͥޡΓͳͷ͔ https://www.infoq.com/jp/news/2018/08/google-explains-sre ඇৗʹॏཁͳࣄͱͯ͠ड़ΒΕ͍ͯΔ