Monitoring and Debugging Containers
JBD
December 04, 2018
Transcript
@rakyll monitoring and debugging containerized systems Jaana B. Dogan, Google
@rakyll me: overly frustrated engineer, 15+ years in networking systems, making systems more reliable
@rakyll the new old monitoring? (maybe)
@rakyll systems are growing... and you are not in control
@rakyll bare metal, kernel, network stack, cloud stack, libraries, frameworks, your code
@rakyll complexity is inevitable
@rakyll container
@rakyll container container
@rakyll container container message queue
@rakyll container container storage/database
@rakyll container container load balancer location=us-west location=europe-central
@rakyll host host container container load balancer
@rakyll container container container container container orchestrated hot mess
@rakyll areas of issues:
- lack of locality
- networking
- scheduling
- dependencies
@rakyll bare metal, kernel, network stack, cloud stack, libraries, frameworks, your code
@rakyll “my job is done here”
@rakyll after going to production...
1. monitor
2. alert
3. troubleshoot
4. fix
@rakyll load balancer
@rakyll load balancer critical path
@rakyll discovering critical paths; making them reliable, then fast; making them debuggable
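One way to picture "discovering critical paths" is as a walk over the spans of a single trace. The sketch below is not taken from the talk: the `Span` type, its field names, and the rule "follow the child that finishes last" are all assumptions made for illustration, and real traces with asynchronous or overlapping work need a more careful definition.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass(frozen=True)
class Span:
    """Hypothetical span record; names are assumed to be unique per trace."""
    name: str
    parent: Optional[str]  # name of the parent span, None for the root
    start_ms: float
    end_ms: float


def critical_path(spans: List[Span]) -> List[str]:
    """Return span names from the root down, following the slowest child.

    Simplifying assumption: the child that ends last is the one holding
    its parent open, so it is treated as part of the critical path.
    """
    children: Dict[Optional[str], List[Span]] = {}
    for span in spans:
        children.setdefault(span.parent, []).append(span)

    roots = children.get(None, [])
    if len(roots) != 1:
        raise ValueError("expected exactly one root span")

    path: List[str] = []
    current: Optional[Span] = roots[0]
    while current is not None:
        path.append(current.name)
        kids = children.get(current.name, [])
        # Descend into the child that finishes last, if there is one.
        current = max(kids, key=lambda s: s.end_ms) if kids else None
    return path


# Made-up timings for a request entering through a load balancer.
example_trace = [
    Span("load balancer", None, 0, 100),
    Span("frontend", "load balancer", 2, 98),
    Span("auth", "frontend", 5, 20),
    Span("storage", "frontend", 10, 90),
]
print(critical_path(example_trace))
```

With these made-up timings the walk goes through the storage call rather than the auth call, so under this simplification storage is the dependency to make reliable first.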
@rakyll Latency Numbers Every Programmer Should Know by Jeff Dean
@rakyll ping pong pongservice:6996 project: ping the pong server.
@rakyll opencensus.io
@rakyll not my team!
@rakyll where is the source code?
@rakyll who to page?
@rakyll give me the logs, runtime events, profiles...
@rakyll http://server:9999/tracez
@rakyll challenges...
@rakyll no wire standards
@rakyll traceparent: <version>-<traceid>-<spanid>-<opts>
Example: traceparent: 00-0af7651916cd43dd8448eb211c80319c-b7ad6b7169203331-01
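The header layout on this slide can be read mechanically. The sketch below splits a `traceparent` value into the four fields named on the slide. The helper names (`TraceParent`, `parse_traceparent`, `is_sampled`) are hypothetical. The field widths, the all-zero checks, and the meaning of the lowest flag bit follow the W3C Trace Context layout for version 00 as I understand it. Treat it as a strict illustration of that one layout, not a complete implementation.

```python
import re
from typing import NamedTuple


class TraceParent(NamedTuple):
    """Fields named as on the slide: version, traceid, spanid, opts."""
    version: str
    trace_id: str
    span_id: str
    opts: int


# Lowercase hex fields separated by dashes: 2, 32, 16 and 2 characters.
_TRACEPARENT = re.compile(
    r"([0-9a-f]{2})-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})"
)


def parse_traceparent(value: str) -> TraceParent:
    """Parse a traceparent header value, raising ValueError if malformed."""
    match = _TRACEPARENT.fullmatch(value.strip())
    if match is None:
        raise ValueError(f"malformed traceparent: {value!r}")
    version, trace_id, span_id, opts = match.groups()
    if version == "ff":
        raise ValueError("version ff is not allowed")
    if trace_id == "0" * 32 or span_id == "0" * 16:
        raise ValueError("all-zero trace or span id is not allowed")
    return TraceParent(version, trace_id, span_id, int(opts, 16))


def is_sampled(tp: TraceParent) -> bool:
    """The lowest bit of the options byte marks the trace as sampled."""
    return bool(tp.opts & 0x01)


tp = parse_traceparent(
    "00-0af7651916cd43dd8448eb211c80319c-b7ad6b7169203331-01"
)
print(tp.trace_id, tp.span_id, is_sampled(tp))
```

A service receiving this header would keep the trace id, create a new span id for its own work, and forward both downstream, which is what lets spans from different teams' services join into one trace.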
@rakyll no export standards
@rakyll areas of issues:
- locality
- networking
- scheduling
- dependencies
@rakyll fin