$30 off During Our Annual Pro Sale. View Details »
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
Keeping Wikipedia fast [WeLoveSpeed]
Search
Peter Hedenskog
September 20, 2019
Technology
1
490
Keeping Wikipedia fast [WeLoveSpeed]
Peter Hedenskog
September 20, 2019
Tweet
Share
More Decks by Peter Hedenskog
See All by Peter Hedenskog
Measuring Web Performance for Wikipedia using synthetic testing tools
soulislove
0
450
Measuring Web Performance Using Selenium
soulislove
2
890
Monitoring Web Performance using Open Source tools (Stockholm)
soulislove
2
240
Monitoring web performance using Open Source tools (San Francisco & Silicon Valley Web Performance Group)
soulislove
1
360
Monitoring web performance using Open Source tools (South Bay JavaScript Meetup)
soulislove
0
250
Optimise your home page (fast as lightning)
soulislove
1
58
Integrating performance tools into continuous delivery
soulislove
0
280
How to make your boss speed-curious and other webperf tricks - coldfront2014
soulislove
0
180
Sitespeed.io Lightning demo @ Velocity Santa Clara 2014
soulislove
0
120
Other Decks in Technology
See All in Technology
松尾研LLM講座2025 応用編Day3「軽量化」 講義資料
aratako
3
2.5k
Strands AgentsとNova 2 SonicでS2Sを実践してみた
yama3133
1
1.8k
20251218_AIを活用した開発生産性向上の全社的な取り組みの進め方について / How to proceed with company-wide initiatives to improve development productivity using AI
yayoi_dd
0
650
20251203_AIxIoTビジネス共創ラボ_第4回勉強会_BP山崎.pdf
iotcomjpadmin
0
130
Lookerで実現するセキュアな外部データ提供
zozotech
PRO
0
200
Snowflake導入から1年、LayerXのデータ活用の現在 / One Year into Snowflake: How LayerX Uses Data Today
civitaspo
0
2.3k
通勤手当申請チェックエージェント開発のリアル
whisaiyo
3
440
JEDAI認定プログラム JEDAI Order 2026 エントリーのご案内 / JEDAI Order 2026 Entry
databricksjapan
0
180
Amazon Quick Suite で始める手軽な AI エージェント
shimy
1
1.8k
Bedrock AgentCore Memoryの新機能 (Episode) を試してみた / try Bedrock AgentCore Memory Episodic functionarity
hoshi7_n
2
1.8k
Introduce marp-ai-slide-generator
itarutomy
0
100
オープンソースKeycloakのMCP認可サーバの仕様の対応状況 / 20251219 OpenID BizDay #18 LT Keycloak
oidfj
0
160
Featured
See All Featured
Practical Orchestrator
shlominoach
190
11k
Bioeconomy Workshop: Dr. Julius Ecuru, Opportunities for a Bioeconomy in West Africa
akademiya2063
PRO
0
31
Ruling the World: When Life Gets Gamed
codingconduct
0
100
RailsConf 2023
tenderlove
30
1.3k
A Modern Web Designer's Workflow
chriscoyier
698
190k
GitHub's CSS Performance
jonrohan
1032
470k
Taking LLMs out of the black box: A practical guide to human-in-the-loop distillation
inesmontani
PRO
3
1.9k
AI Search: Implications for SEO and How to Move Forward - #ShenzhenSEOConference
aleyda
1
1k
[Rails World 2023 - Day 1 Closing Keynote] - The Magic of Rails
eileencodes
37
2.7k
Mobile First: as difficult as doing things right
swwweet
225
10k
"I'm Feeling Lucky" - Building Great Search Experiences for Today's Users (#IAC19)
danielanewman
231
22k
Designing Dashboards & Data Visualisations in Web Apps
destraynor
231
54k
Transcript
Keeping Wikipedia fast Peter Hedenskog - @soulislove
Keeping Wikipedia fast Peter Hedenskog - @soulislove
@soulislove
@soulislove Sweden France
@soulislove
@soulislove
What to do? @soulislove
@soulislove
@soulislove
@soulislove NO!!!!!
@soulislove Jean Bernadotte?
@soulislove
@soulislove Sweden France
@soulislove
@soulislove
@soulislove
Lets talk about performance @soulislove
@soulislove Today Our setup RUM & Synthetic Learnings I’ve got
the last four years Case study one regression
@soulislove https://news.ycombinator.com/item?id=20903868
@soulislove https://phabricator.wikimedia.org
@soulislove https://grafana.wikimedia.org
Why is performance important? @soulislove
We want to bring free knowledge to the world independently
of where you live and your economic status. @soulislove
@soulislove
Engineers/dev cares too! @soulislove
Why is performance hard (for us)? @soulislove
Keeping Wikipedia fast @soulislove https://news.ycombinator.com/item?id=20903868 Keeping Wikipedia fast is easy
right?
@soulislove
The Wikipedia Performance Team Challenge All Wikis are different (JS/CSS)
All pages are different (JS/CSS) All users are different (JS/CSS) @soulislove
Our history of performance testing @soulislove
PHP -> RUM-> Synthetic @soulislove
RUM @soulislove
@soulislove Metrics from real users Sampled (1/100) Buckets: platform, browser,
location https://github.com/wikimedia/mediawiki-extensions-NavigationTiming https://grafana.wikimedia.org/d/000000143/navigation-timing?refresh=5m&orgId=1
@soulislove
@soulislove
@soulislove
How we use RUM @soulislove Metrics from “all” users/scenarios Median,
75, 95, 99 - percentiles Alert on regressions First Paint / LoadEventEnd BFF with synthetic
Synthetic testing @soulislove
@soulislove
@soulislove Browsertime + WebPageReplay
That flat line @soulislove
Deviation @soulislove
How we use synthetic @soulislove Fixing the chaos (or creating
more?) Wayback machine Three URLs per alert First Visual Change BFF with RUM
Learnings: Synthetics @soulislove
Validate metrics! @soulislove
@soulislove
https://phabricator.wikimedia.org/T187981 @soulislove
@soulislove
page1 != page2 @soulislove
@soulislove Deviation
@soulislove User journey: second view
@soulislove User journey: second view
Server matters! @soulislove
1. AWS vs GCS vs other cloud providers 2. Servers
change over time (what runs on the same physical server?) 3. C4.xlarge != C4.xlarge @soulislove
@soulislove https://phabricator.wikimedia.org/T192138 https://phabricator.wikimedia.org/T192138 https://phabricator.wikimedia.org/T192138 https://youtu.be/pYbgcDfM2Ts?t=1575
Testing multiple steps are hard @soulislove
How long time do your user stay on each page?
How long do browsers keep HTTP connections open? @soulislove
Browser versions matter @soulislove
None
Know when browsers are updated!!! @soulislove
Learnings: RUM @soulislove
RUM can be good for finding regressions @soulislove
None
User Timing API != what shows on screen @soulislove
@soulislove
New element timings are better! @soulislove
@soulislove
Browser versions are important @soulislove
@soulislove
@soulislove
The hidden tabs incident @soulislove
@soulislove The idea: async all the things … use setTimeout
to run things later.
@soulislove https://phabricator.wikimedia.org/T146510 setTimeout
@soulislove https://phabricator.wikimedia.org/T146510
@soulislove 10% of the traffics opens in another tab!
What are we missing? @soulislove
@soulislove
@soulislove RUM Higher sample rate Buckets per page type Which
metrics are important?
@soulislove Synthetic Real mobile phones (T197847) Easier for devs to
add tests (T225416)
Case study: 3/9-2019 incident @soulislove https://phabricator.wikimedia.org/T231929
@soulislove Firefox
@soulislove Firefox
@soulislove Chrome
@soulislove Firefox
@soulislove WebPageTest
@soulislove
@soulislove TTFB? No: because visible with WebPageReplay
@soulislove ttfb?
@soulislove ttfb?
@soulislove ttfb?
@soulislove Screenshots/video Diff the HAR using https://compare.sitespeed.io or size per
content type in Graphite
@soulislove ttfb?
@soulislove We got 311 span class=“cs1-visible-error”!!! Citation errors: not shown
to readers https://en.wikipedia.org/wiki/Help:CS1_errors
@soulislove Credits Pippi and father - SVT Quick et Flupke
- Hergé The king shouting - Expressen The scream - Edward Munch Napoleon - Horace Vernet Engineers India Space Shuttle - Expressen Various pictures of Carl Gustaf - Swedish tax payers through the apanage
@soulislove Questions?? @soulislove
[email protected]