Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Berlin 2013 - Session - Jeff Weinstein
Search
Sponsored
·
Your Podcast. Everywhere. Effortlessly.
Share. Educate. Inspire. Entertain. You do you. We'll handle the rest.
→
Monitorama
September 20, 2013
670
2
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
Berlin 2013 - Session - Jeff Weinstein
Monitorama
September 20, 2013
More Decks by Monitorama
See All by Monitorama
Monitorama PDX 2017 - Ian Bennett
monitorama
1
630
PDX 2017 - Pedro Andrade
monitorama
0
830
PDX 2017 - Roy Rapoport
monitorama
4
990
PDX 2017 - Julia Evans
monitorama
0
520
Berlin 2013 - Session - Brad Lhotsky
monitorama
5
770
Berlin 2013 - Session - Alex Petrov
monitorama
6
730
Berlin 2013 - Session - Oliver Hankeln
monitorama
1
590
Berlin 2013 - Session - David Goodlad
monitorama
0
510
Berlin 2013 - Session - Radu Gheorghe
monitorama
0
670
Featured
See All Featured
Game over? The fight for quality and originality in the time of robots
wayneb77
1
200
Sam Torres - BigQuery for SEOs
techseoconnect
PRO
0
290
Facilitating Awesome Meetings
lara
57
7k
Discover your Explorer Soul
emna__ayadi
2
1.1k
The Spectacular Lies of Maps
axbom
PRO
1
810
Improving Core Web Vitals using Speculation Rules API
sergeychernyshev
21
1.5k
Design and Strategy: How to Deal with People Who Don’t "Get" Design
morganepeng
133
19k
Pawsitive SEO: Lessons from My Dog (and Many Mistakes) on Thriving as a Consultant in the Age of AI
davidcarrasco
0
160
So, you think you're a good person
axbom
PRO
2
2.1k
How To Stay Up To Date on Web Technology
chriscoyier
790
250k
CoffeeScript is Beautiful & I Never Want to Write Plain JavaScript Again
sstephenson
162
16k
Intergalactic Javascript Robots from Outer Space
tanoku
273
27k
Transcript
How monitoring can improve the rest of the company
Monitorama EU 2013 @jeff_weinstein
I real-time and batch data analytics
Monitoring can wildly improve the whole company by
sharing data and sharing techniques.
Monitoring Folks Developers Business Analysts
ExecuIves & Product Data ScienIsts Data
Apps & Services & Systems Users
Data Code & Config Monitoring
Some problems…
Data Processing Apps Systems Logs /
Events Metrics Graphs & Alerts Apps 3rd Party Reports & Queries ETL AnalyIc Systems Monitoring: Streaming BI: Batch
Data Needs Logs Metrics Logs Metrics
Streaming Batch Data Monitoring BI
Data Tools Stack Monitoring • Ad hoc
– sed, grep, awk – ES, LogStash, Splunk, … • Storage – Hosts, Ganglia, OTSDB – Central syslog server • VisualizaIon/ReporIng – Graphite, RRDTool, 3rd party – Homegrown • AlerIng/EscalaIon – Nagios, Sensu, PagerDuty, … Rest of company • Ad hoc – Excel, SQL, Hive – MapReduce, … • Storage – Lots o’ databases, Excel – Hadoop, RDBMS… • VisualizaIon/ReporIng – Excel, R, Tableau ... – Dinosaur apps, … • AlerIng/EscalaIon – nada
Metrics
Views Unintelligible generated views Too granular for long
term trends Lack of historical Intolerant to anomalies
Team and incenIves • What team? • Change
vs. reliability • Planning • Budget • Churn
Good or bad? • Specific Tools • Decentralized
• Focus • Ownership • Lost context • Siloed work • Data dark • Misunderstanding
Some fixes
End to End Data Pipeline ü Structured logs ü (Config)
ü Measure once ü AutomaIc metrics ü API ü Graph tools ü Glossary ü AnnotaIons and tags ü Pipeline
Structured events • JSON (or whatever) • (opIonal)
config • Tags per key – Type – Tag: latency, funnel,… – DescripIon – Storage
Auto: Graphs, Glossary, & Storage • Graphs and dashboards
• * templates • Views and stats • Glossary • Batch analyIcs • Long term storage
build learn communicate inspire
Developers • Logging toolkit • Data pipeline
• Pain points • Outage causes • Deployment pracIces • EscalaIon playbook • Measurement as TDD • Monitor staging env
Business Analysts • Structured logs • Config
for ETL • Metrics definiIons • Slices and visualizaIons • Data size and cardinality • Outages and delays • Flexibility • VisualizaIon and tools
Data ScienIsts • Access to (meta)data • Query
monitoring • StaIsIcs and models • New data streams • Context of data issues • What’s in the logs • Validate algorithms • Teach stats and models!
Product & ExecuIves • Curated dashboards • Graph/alert
tools • Learn the business • PrioriIze alerts by $ • Incident post mortems • Metrics granularity • Data driven decisions • Recognize and celebrate
Monitoring can become the data plahorm and improve all
teams with its techniques.
Icons from The Noun Project: Dmitry Baranovskiy, Benjamin Orlovski, Luis
Prado, MikaDo Nguyen, Yarden Gilboa, Javier Cabezas, Icons Pusher, Jeremy Bristol, Blake Thomas, RiIka Khasgiwale, Mayene de Leon, Yorlmar Campos, Sergey Shmid @jeff_weinstein Thanks! hiring ;)