Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Beyond top: Command-Line Monitoring on the JVM ...
Search
Sponsored
·
Ship Features Fearlessly
Turn features on and off without deploys. Used by thousands of Ruby developers.
→
Colin Jones
October 29, 2015
Programming
690
1
Share
Beyond top: Command-Line Monitoring on the JVM (JavaOne 2015)
A session from JavaOne 2015
Colin Jones
October 29, 2015
More Decks by Colin Jones
See All by Colin Jones
A Bug's Life: What if `select` is Broken After All?
trptcolin
0
180
Underestimated costs of microservice architectures
trptcolin
3
1.6k
FP vs. OOP: Beyond the Bikeshed
trptcolin
0
450
Diving into the Details with DTrace! (RubyConf 2016 edition)
trptcolin
2
530
Diving into the Details with DTrace
trptcolin
3
540
Adopting FP: the good, the familiar, and the unknown
trptcolin
0
270
Finding out what's *really* going on, with DTrace!
trptcolin
1
390
Beyond top: Command-Line Monitoring on the JVM (ClojureRemote)
trptcolin
0
160
ZooKeeper: Wait-free coordination for Internet-scale systems
trptcolin
2
210
Other Decks in Programming
See All in Programming
Geminiをパートナーに神社DXシステムを個人開発した話(いなめぐDX 開発振り返り)
fujiba
0
130
Kubernetesでセルフホストが簡単なNewSQLを求めて / Seeking a NewSQL Database That's Simple to Self-Host on Kubernetes
nnaka2992
0
190
最初からAWS CDKで技術検証してもいいんじゃない?
akihisaikeda
4
180
ファインチューニングせずメインコンペを解く方法
pokutuna
0
220
AIコードレビューの導入・運用と AI駆動開発における「AI4QA」の取り組みについて
hagevvashi
0
580
夢の無限スパゲッティ製造機 -実装篇- #phpstudy
o0h
PRO
0
180
実践ハーネスエンジニアリング #MOSHTech
kajitack
7
5.1k
「接続」—パフォーマンスチューニングの最後の一手 〜点と点を結ぶ、その一瞬のために〜
kentaroutakeda
4
2.2k
CS教育のDX AIによる育成の効率化
niftycorp
PRO
0
170
What Spring Developers Should Know About Jakarta EE
ivargrimstad
0
830
メッセージングを利用して時間的結合を分離しよう #phperkaigi
kajitack
3
520
車輪の再発明をしよう!PHP で実装して学ぶ、Web サーバーの仕組みと HTTP の正体
h1r0
2
470
Featured
See All Featured
16th Malabo Montpellier Forum Presentation
akademiya2063
PRO
0
87
Building Experiences: Design Systems, User Experience, and Full Site Editing
marktimemedia
0
460
How to audit for AI Accessibility on your Front & Back End
davetheseo
0
230
Designing Experiences People Love
moore
143
24k
Automating Front-end Workflow
addyosmani
1370
200k
Getting science done with accelerated Python computing platforms
jacobtomlinson
2
160
Become a Pro
speakerdeck
PRO
31
5.9k
Agile Leadership in an Agile Organization
kimpetersen
PRO
0
120
The Illustrated Guide to Node.js - THAT Conference 2024
reverentgeek
1
320
AI Search: Where Are We & What Can We Do About It?
aleyda
0
7.2k
From π to Pie charts
rasagy
0
160
What the history of the web can teach us about the future of AI
inesmontani
PRO
1
500
Transcript
Beyond top(1) Command-Line Monitoring on the JVM Colin Jones @trptcolin
8th Light
None
What to expect
command-line tooling
on the JVM
introspection & serviceability
--all-flags=false
war stories
real-life usage (well, re-enacted anyway)
A long time ago in a software shop far, far
away…
Things are going pretty well
What does this thing look like? app-architecture Postgres Web /
API Application Server Load Balancer Periodic Job Application Server 3rd-party Service A 3rd-party Service B Monitored email account End users: native mobile app Admin users: desktop browsers
But strange things are afoot
the server sometimes gets really slow
the team has to manually restart the application server
incident response time is ~5 minutes
Yes, strange things are afoot
Pain, frustration, anger
Just the facts
sometimes, things get slow
all requests seem to be affected
the JVM stays up
restart the JVM and everything is fine
What could it be?
Demo
More facts, please!
constant full GCs
what’s in the heap
what application code was running
The right tools for the job
vmstat system-level: CPU, memory, disk, context switching
top per-process: CPU & memory
jps what’s our PID?
jstack status of all threads (right now-ish!)
jcmd what can’t it do?! jcmd [PID] help (sorry, JVM
6 users: see jinfo/jmap/jstack)
jstat GC classloader compiler
Mystery solved!
Now “just” fix it
idea 1: eliminate the leak
idea 2: eliminate the cache altogether?
idea 3: delete the feature
idea 4: full-text search engine
So we’re good now… until the next incident
Lessons
“it’s slow” could mean lots of things
“high CPU” could mean lots of things
collecting data is crucial in a crisis
reproducing the issue helps me sleep at night
Other “right tools for the job”
Heap analyzers
Profilers
Constant monitoring & alerting
Dynamic tracing
Learning more
Books
operators are standing by! man jstat man jstack jcmd [PID]
help [COMMAND] etc.
Thank you! Colin Jones @trptcolin 8th Light