Szymon Sobczak - Hadoop + Storm
Base Lab
May 07, 2015
Technology
Combo for realtime big data systems
Transcript
Szymon Sobczak
Hadoop + Storm: Combo for realtime big data systems
Plan
• Hadoop & Storm
• Our setup
• What projects are we running
• Decisions we had to make
Hadoop
Storm
“Ala ma kota Artura”
“ala”, “ma”, “kota”, “artura”
a: 2, k: 1, m: 1
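The slide above appears to trace one sentence through a split step and then a counting step. A minimal Python sketch of that dataflow follows. It is not Storm code: the function names `split_sentence` and `count_first_letters` are hypothetical, and it assumes the counts on the slide are keyed by each word's first letter.

```python
from collections import Counter


def split_sentence(sentence):
    """Split a sentence into lowercase words (the 'split' stage)."""
    return sentence.lower().split()


def count_first_letters(words):
    """Count words by their first letter (the 'count' stage)."""
    return Counter(word[0] for word in words)


words = split_sentence("Ala ma kota Artura")
counts = count_first_letters(words)

print(words)                         # ['ala', 'ma', 'kota', 'artura']
print(dict(sorted(counts.items())))  # {'a': 2, 'k': 1, 'm': 1}
```

In a streaming system each stage would run continuously and receive one record at a time; here both stages run once over a single sentence to keep the example small.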
Storm
“Ala ma kota Artura”
“ala”, “ma”, “kota”, “artura”
“ma” → m: 1
“ala”, “artura” → a: 2
“kota” → k: 1
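This slide seems to show the same count spread over several parallel workers, with words that share a first letter ending up at the same worker. Storm's fields grouping gives that guarantee. The sketch below imitates the idea in plain Python under stated assumptions: `NUM_TASKS`, `task_for`, and `run_parallel_count` are hypothetical names, and the routing rule (character code modulo the task count) is an illustration, not how Storm hashes keys.

```python
from collections import Counter

NUM_TASKS = 3  # hypothetical number of parallel counting tasks


def task_for(key, num_tasks=NUM_TASKS):
    """Pick a task from the key alone, so equal keys always share a task."""
    return ord(key) % num_tasks


def run_parallel_count(words, num_tasks=NUM_TASKS):
    """Route each word to a task by its first letter and count per task."""
    tasks = [Counter() for _ in range(num_tasks)]
    for word in words:
        key = word[0]
        tasks[task_for(key, num_tasks)][key] += 1
    return tasks


tasks = run_parallel_count(["ala", "ma", "kota", "artura"])

# Merging the per-task counters gives the same totals as a single counter.
merged = sum(tasks, Counter())
print(dict(sorted(merged.items())))  # {'a': 2, 'k': 1, 'm': 1}
```

Because the task is chosen from the key alone, no letter is ever split across two counters, so each task's partial result is already final for the keys it owns.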
Common traits
Base infrastructure services
Understand how the entire Base system works [diagram: services]
Big Data [diagram: S3 uploader]
Four example projects
• Debugging
• Reporting
• Email intelligence
• Forecasting
Debugging [diagram: S3 uploader]
Reporting
Email analysis [diagram: S3 uploader]
Forecasting [diagram: S3 uploader]
Decisions we made
☑ Collect *all* data
☑ Put them in one place
☐ Build platform for engineers
☐ Same code on Hadoop and Storm
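One way to read the last item, "Same code on Hadoop and Storm", is that the business logic lives in a plain function that both a batch job and a streaming job call. The sketch below shows that shape in plain Python, without either framework. All names (`first_letter`, `batch_count`, `StreamingCounter`) are hypothetical, and the slides do not say how, or whether, the team implemented this.

```python
from collections import Counter


def first_letter(word):
    """Shared logic: the key a word is counted under."""
    return word[0].lower()


def batch_count(words):
    """Batch-style driver: process a complete dataset in one pass."""
    return Counter(first_letter(w) for w in words)


class StreamingCounter:
    """Stream-style driver: update running counts one record at a time."""

    def __init__(self):
        self.counts = Counter()

    def on_record(self, word):
        self.counts[first_letter(word)] += 1
        return self.counts


words = ["Ala", "ma", "kota", "Artura"]

stream = StreamingCounter()
for w in words:
    stream.on_record(w)

# Both drivers reuse first_letter, so they agree on the result.
print(batch_count(words) == stream.counts)  # True
```

Keeping the shared function free of framework types is what would let the two drivers differ only in how records arrive.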
Summary [diagram: S3 uploader]
Questions?
Thank you
[email protected]
bigdata.getbase.com