Upgrade to Pro
— share decks privately, control downloads, hide ads and more …
Speaker Deck
Features
Speaker Deck
PRO
Sign in
Sign up for free
Search
Search
[DevNexus-2018] Apache Kafka A Streaming Data P...
Search
Viktor Gamov
February 22, 2018
Technology
330
2
Share
Embed
Copy iframe code
Copy JS code
Copy link
Start on current slide
[DevNexus-2018] Apache Kafka A Streaming Data Platform
Viktor Gamov
February 22, 2018
More Decks by Viktor Gamov
See All by Viktor Gamov
Processing Streaming Data with KSQL
vikgamov
4
450
[VirtualJUG] Apache Kafka — A Streaming Data Platform
vikgamov
3
440
[SF JUG] Apache Kafka — A Streaming Data Platform
vikgamov
4
110
[OracleCode NYC-2018] Apache Kafka A Streaming Data Platform
vikgamov
1
190
[OracleCode NYC-2018] Rethinking Stream Processing with KStreams and KSQL
vikgamov
2
260
[JBreak-2018] Это кто там твитить про #jbreak?
vikgamov
0
240
[DataSciCon] Divide, Distribute and Conquer: Stream v. Batch
vikgamov
0
120
[Philly JUG] Divide, Distribute and Conquer: Stream v. Batch
vikgamov
0
510
[Atlanta JUG] Testing containers with TestContainers
vikgamov
0
1.3k
Other Decks in Technology
See All in Technology
不要なレビューをAIにまかせて AIコーディングの環境改善を加速した
shoota
1
230
AIはどのように 組織のアジリティを変えるのか?
junki
4
1.1k
GitHub Copilot app最速の発信の裏側
tomokusaba
1
200
螺旋型キャリアの生存戦略 / kinoko-conf2026
rakus_dev
0
150
Oracle Cloud Infrastructure:2026年6月度サービス・アップデート
oracle4engineer
PRO
0
150
サイバーエージェントにおけるAI推進戦略と変革への取り組み
shotatsuge
0
250
コミュニティの有益性 ~JAWS Days 2026 での体験を通して~ / The Benefits of a Community ~Through My Experience at JAWS Days 2026~
seike460
PRO
0
200
Bucharest Tech Week 2026 - Reinventing testing practices in the AI era
edeandrea
PRO
1
170
GitHub Copilot 最新アップデート – 「一歩先」の実践活用術
moulongzhang
5
1.5k
IaC コードを資産へ:AWS CDK 社内ライブラリと横断展開 / aws-summit-japan-2026
gotok365
9
1.3k
iAEONの段階的リアーキテクト戦略 / iAEON's_Gradual_Re-architecture_Strategy
aeonpeople
0
230
Kiroで書いた 設計書 が AI レビューの 採点基準 になる
ezaki
0
140
Featured
See All Featured
ピンチをチャンスに:未来をつくるプロダクトロードマップ #pmconf2020
aki_iinuma
128
56k
Marketing to machines
jonoalderson
1
5.5k
Rails Girls Zürich Keynote
gr2m
96
14k
Faster Mobile Websites
deanohume
310
32k
Navigating the Design Leadership Dip - Product Design Week Design Leaders+ Conference 2024
apolaine
1
350
How to Talk to Developers About Accessibility
jct
2
240
Design in an AI World
tapps
1
250
Why Mistakes Are the Best Teachers: Turning Failure into a Pathway for Growth
auna
0
160
The AI Revolution Will Not Be Monopolized: How open-source beats economies of scale, even for LLMs
inesmontani
PRO
3
3.5k
Mobile First: as difficult as doing things right
swwweet
225
10k
Tips & Tricks on How to Get Your First Job In Tech
honzajavorek
1
540
The Myth of the Modular Monolith - Day 2 Keynote - Rails World 2024
eileencodes
28
3.5k
Transcript
@ Apache Kafka A Streaming Data Platform
@ @gamussa @confluentinc Who am I?
@ @gamussa @confluentinc Solutions Architect Who am I?
@ @gamussa @confluentinc Solutions Architect Developer Advocate Who am I?
@ @gamussa @confluentinc Solutions Architect Developer Advocate @gamussa in internetz
Who am I?
@ @gamussa @confluentinc Solutions Architect Developer Advocate @gamussa in internetz
Hey you, yes, you, go follow me in twitter © Who am I?
@ @gamussa @confluentinc
@ @gamussa @confluentinc A company is build on
@ @gamussa @confluentinc A company is build on DATA FLOWS
but All we have is DATA STORES
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc Streaming Platform 1. Pub/Sub 2. Store 3.
Process
@ @gamussa @confluentinc Streaming Platform 1. Pub/Sub 2. Store 3.
Process
@ @gamussa @confluentinc Core abstraction
@ @gamussa @confluentinc Core abstraction DB - table
@ @gamussa @confluentinc Core abstraction DB - table Hadoop -
file
@ @gamussa @confluentinc Core abstraction DB - table Hadoop -
file Messaging -?
@ @gamussa @confluentinc LOGS
@ @gamussa @confluentinc Producing to Kafka Time
@ @gamussa @confluentinc Producing to Kafka Time C C C
@ @gamussa @confluentinc Producing to Kafka - With Key Time
A B C D hash(key) % numPartitions = N
@ @gamussa @confluentinc Producing to Kafka - No Key Time
Messages will be produced in a round robin fashion
@ @gamussa @confluentinc Producing to Kafka - No Key Time
Messages will be produced in a round robin fashion
@ @gamussa @confluentinc Producing to Kafka - No Key Time
Messages will be produced in a round robin fashion
@ @gamussa @confluentinc Producing to Kafka - No Key Time
Messages will be produced in a round robin fashion
@ @gamussa @confluentinc Consuming From Kafka - Single Consumer C
@ @gamussa @confluentinc Consuming From Kafka - Grouped Consumers C
C C1 C C C2
@ @gamussa @confluentinc Consuming From Kafka - Grouped Consumers C
C C C
@ @gamussa @confluentinc Consuming From Kafka - Grouped Consumers 0
1 2 3
@ @gamussa @confluentinc Consuming From Kafka - Grouped Consumers 0
1 2 3
@ @gamussa @confluentinc Consuming From Kafka - Grouped Consumers 0,
3 1 2 3
@ @gamussa @confluentinc Producers Consumers
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc Kafka Connect does hard work so you
don’t
@ @gamussa @confluentinc Kafka Connect does hard work so you
don’t 1. Scale out
@ @gamussa @confluentinc Kafka Connect does hard work so you
don’t 1. Scale out
@ @gamussa @confluentinc Kafka Connect does hard work so you
don’t 1. Scale out
@ @gamussa @confluentinc Kafka Connect does hard work so you
don’t 1. Scale out
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc Streaming Platform 1. Pub/Sub 2. Store 3.
Process
@ @gamussa @confluentinc Why Store?
@ @gamussa @confluentinc Scalability of a filesystem
@ @gamussa @confluentinc Scalability of a filesystem Throughput 100s mb/s
@ @gamussa @confluentinc Scalability of a filesystem Throughput 100s mb/s
TBs per server
@ @gamussa @confluentinc Scalability of a filesystem Throughput 100s mb/s
TBs per server Commodity Hardware
@ @gamussa @confluentinc Scalability of a filesystem Throughput 100s mb/s
TBs per server Commodity Hardware O(1) writes
@ @gamussa @confluentinc Guarantees of a database
@ @gamussa @confluentinc Guarantees of a database Persistence
@ @gamussa @confluentinc Guarantees of a database Persistence Strict ordering
@ @gamussa @confluentinc Distributed by Design
@ @gamussa @confluentinc Replication Distributed by Design
@ @gamussa @confluentinc Replication Fault Tolerance Distributed by Design
@ @gamussa @confluentinc Replication Fault Tolerance Partitioning Distributed by Design
@ @gamussa @confluentinc Replication Fault Tolerance Partitioning Scale Distributed by
Design
@ @gamussa @confluentinc
@ @gamussa @confluentinc Partition Leadership and Replication Broker 1 Topic1
partition1 Broker 2 Broker 3 Broker 4 Topic1 partition1 Topic1 partition1 Leader Follower Topic1 partition2 Topic1 partition2 Topic1 partition2 Topic1 partition3 Topic1 partition4 Topic1 partition3 Topic1 partition3 Topic1 partition4 Topic1 partition4
@ @gamussa @confluentinc Partition Leadership and Replication - node failure
Broker 1 Topic1 partition1 Broker 2 Broker 3 Broker 4 Topic1 partition1 Topic1 partition1 Leader Follower Topic1 partition2 Topic1 partition2 Topic1 partition2 Topic1 partition3 Topic1 partition4 Topic1 partition3 Topic1 partition3 Topic1 partition4 Topic1 partition4
@ @gamussa @confluentinc Streaming Platform 1. Pub/Sub 2. Store 3.
Process
@ @gamussa @confluentinc What is Stream Processing? A machine for
combining streams of events
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc https://www.confluent.io/download/
@ @gamussa @confluentinc We are hiring! https://www.confluent.io/careers/
@ @gamussa @confluentinc One more thing…
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc
@ @gamussa @confluentinc A Major New Paradigm
@ @gamussa @confluentinc Thanks! questions? @gamussa
[email protected]
We are hiring!
https://www.confluent.io/careers/