Upgrade to Pro — share decks privately, control downloads, hide ads and more …

RPC Metrics at Google

Avatar for JBD JBD
August 09, 2018

RPC Metrics at Google

Avatar for JBD

JBD

August 09, 2018
Tweet

More Decks by JBD

Other Decks in Programming

Transcript

  1. @rakyll "100% is the wrong reliability target for basically everything."

    -- Benjamin Treynor Sloss, VP of Engineering, Google
  2. @rakyll Principled way of saying what level of downtime is

    acceptable. • Error rate • Latency expectations SLOs
  3. @rakyll Questions infra teams want to ask: • Are we

    meeting the SLO for the other team? • What’s the impact of a product on infra? • How much do we need to scale up if product grows 10%?
  4. @rakyll Query the collected data in various ways: • Latency

    distribution for RPCs originated at Google Analytics. • Requests take took more than 100ms for the customer #123. • Compare the request latency initiated at web vs mobile frontend.