Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Reliability in a Serverless world

Avatar for Driss Amri Driss Amri
February 24, 2023

Reliability in a Serverless world

Serverless @ Nederlandse Spoorwegen

Avatar for Driss Amri

Driss Amri

February 24, 2023
Tweet

More Decks by Driss Amri

Other Decks in Technology

Transcript

  1. Availability Downtime Per Year Downtime Per Month 99% (“Two Nines”)

    99.9% (“Three Nines”) 99.99% (“Four Nines”) 7,2 hours 3,65 days 43,2 minutes 8,76 hours 4,32 minutes 52,6 minutes
  2. Challenges • Protecting downstream services that don’t scale as well

    • Service Limit Quotas • More granular architectures • Denial Of Wallet • Per Function and Service (mis)configuration • Lots of services to choose from
  3. Chaos Engineering is the discipline of experimenting on a system

    in order to build confidence in the system’s capability to withstand turbulent conditions in production.
  4. Chaos Engineering is the discipline of experimenting on a system

    in order to build confidence in the system’s capability to withstand turbulent conditions in production.
  5. Chaos Engineering is the discipline of experimenting on a system

    in order to build confidence in the system’s capability to withstand turbulent conditions in production.
  6. Chaos Engineering is the discipline of experimenting on a system

    in order to build confidence in the system’s capability to withstand turbulent conditions in production.
  7. Common faults • Network ◦ Latency ◦ Bandwidth ◦ Failure

    to connect ◦ 4XX/5XX HTTP Response • Resource Exhaustion: ◦ CPU Stress ◦ Memory ◦ Disk Space • Weaknesses ◦ Error handling ◦ Timeout values ◦ Events ◦ Fallbacks ◦ Fail overs
  8. Static stability using Availability Zones https://aws.amazon.com/builders-library/static-stability-using-availability-zones/ Beyond five 9s: Lessons

    from our highest available data planes https://www.youtube.com/watch?v=2L1S0zfnIzo Chaos testen voor betrouwbaarheid https://nsdigitaal.sharepoint.com/sites/TestenBijNS/SitePages/Chaos-testen-voor-betrouwbaarheid.as px?source=https%3A%2F%2Fnsdigitaal.sharepoint.com%2Fsites%2FTestenBijNS