Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Monitoring is dead
Search
Sebastian Montini
September 21, 2018
Technology
0
220
Monitoring is dead
Sebastian Montini
September 21, 2018
Tweet
Share
More Decks by Sebastian Montini
See All by Sebastian Montini
AWS Community Day BA 2019
sebamontini
0
60
Giraffe: our journey to support 1 million metrics per second
sebamontini
0
170
Nomad-PyCon2017
sebamontini
0
75
Atlas, a PaaS with batteries included
sebamontini
0
69
Nomad: The sequel
sebamontini
1
150
Nomad, a love story
sebamontini
0
150
Aurora: 5 Tb later ...
sebamontini
0
73
Ansible 202 - Sysarmy Meetup
sebamontini
0
88
Cloud Computing: All that glitters is not AWS - Nerdear.la 2016
sebamontini
0
57
Other Decks in Technology
See All in Technology
2024.02.19 W&B AIエージェントLT会 / AIエージェントが業務を代行するための計画と実行 / Algomatic 宮脇
smiyawaki0820
14
3.6k
Active Directory攻防
cryptopeg
PRO
1
320
あれは良かった、あれは苦労したB2B2C型SaaSの新規開発におけるCloud Spanner
hirohito1108
2
640
利用終了したドメイン名の最強終活〜観測環境を育てて、分析・供養している件〜 / The Ultimate End-of-Life Preparation for Discontinued Domain Names
nttcom
2
200
ホワイトボードチャレンジ 説明&実行資料
ichimichi
0
130
開発スピードは上がっている…品質はどうする? スピードと品質を両立させるためのプロダクト開発の進め方とは #DevSumi #DevSumiB / Agile And Quality
nihonbuson
2
3.1k
Swiftの “private” を テストする / Testing Swift "private"
yutailang0119
0
130
技術的負債解消の取り組みと専門チームのお話 #技術的負債_Findy
bengo4com
1
1.3k
開発組織のための セキュアコーディング研修の始め方
flatt_security
3
2.5k
自動テストの世界に、この5年間で起きたこと
autifyhq
10
8.6k
クラウドサービス事業者におけるOSS
tagomoris
2
870
SA Night #2 FinatextのSA思想/SA Night #2 Finatext session
satoshiimai
1
140
Featured
See All Featured
Side Projects
sachag
452
42k
Fashionably flexible responsive web design (full day workshop)
malarkey
406
66k
Building a Scalable Design System with Sketch
lauravandoore
461
33k
Facilitating Awesome Meetings
lara
52
6.2k
The Web Performance Landscape in 2024 [PerfNow 2024]
tammyeverts
4
410
Visualization
eitanlees
146
15k
ReactJS: Keep Simple. Everything can be a component!
pedronauck
666
120k
Typedesign – Prime Four
hannesfritz
40
2.5k
The Cult of Friendly URLs
andyhume
78
6.2k
Practical Orchestrator
shlominoach
186
10k
Testing 201, or: Great Expectations
jmmastey
42
7.2k
Build The Right Thing And Hit Your Dates
maggiecrowley
34
2.5k
Transcript
@sebamontini MEDALLIA Monitoring is Dead And why you’re (probably) doing
it wrong
@sebamontini MEDALLIA
@sebamontini MEDALLIA Why?
@sebamontini MEDALLIA The big 5
@sebamontini MEDALLIA The big 5 ✓ CPU → uptime |
mailx -s “cpu” root ✓ MEM → free | mailx -s “mem” root ✓ DISK → (df -h; du -sh /home/*) | mailx -s “disk” root ✓ PROC → (ps -ef | grep important) | mailx -s root ✓ SYS → ping -c 4 google.com | mailx -s root
@sebamontini MEDALLIA OK: x < something
@sebamontini MEDALLIA WARN: something < x < something
@sebamontini MEDALLIA CRITICAL: x > something
@sebamontini MEDALLIA
@sebamontini MEDALLIA Observability
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA A system is a set of connected components.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA The manner in which a system acts is
it’s behavior.
@sebamontini MEDALLIA A system is observable if you can determine
the behavior of the system based on it’s outputs.
@sebamontini MEDALLIA The outputs of a system are the concrete
results of it’s behaviors.
@sebamontini MEDALLIA Monitoring is the action of observing and checking
the behavior and outputs of a system and it’s components over time.
@sebamontini MEDALLIA The (real) big 5
@sebamontini MEDALLIA Instrumentation Collection Storage Alerting Visualization
@sebamontini MEDALLIA Instrumentation
@sebamontini MEDALLIA Gauges Counters Histogram Timers
@sebamontini MEDALLIA Gauges A gauge is an instantaneous measurement of
a value. For example, we may want to measure the number of pending jobs in a queue
@sebamontini MEDALLIA Counters A counter is just a gauge that
you can increment or decrement its value. For example, we may want a more efficient way of measuring the pending job in a queue
@sebamontini MEDALLIA Histogram A histogram measures the statistical distribution of
values in a stream of data like median or percentiles
@sebamontini MEDALLIA Timers A timer measures both the rate that
a particular piece of code is called and the distribution of its duration.
@sebamontini MEDALLIA Collection
@sebamontini MEDALLIA
@sebamontini MEDALLIA Storage
@sebamontini MEDALLIA Storage
@sebamontini MEDALLIA Alerting
@sebamontini MEDALLIA Thresholds Dead man Delta Anomaly detection
@sebamontini MEDALLIA Visualization
@sebamontini MEDALLIA
@sebamontini MEDALLIA The big 5 ✓ Instrumentation → gauges, histograms,
timers, counters ✓ Collection → pull vs push ✓ Storage → Time Series DB ✓ Alerting → threshold, flatline, delta, anomaly ✓ Visualization → dashboards
@sebamontini MEDALLIA
@sebamontini MEDALLIA The Four Golden Signals
@sebamontini MEDALLIA Latency Traffic Errors Saturation
@sebamontini MEDALLIA Thanks