S3 NYC Iceberg meetup 2025-07-10
sullis
July 10, 2025
S3 object storage
NYC Iceberg meetup
2025-07-10
Transcript
S3 object storage. Sean Sullivan, NYC Iceberg meetup, July 10, 2025
🔵 Amazon S3
🔵 Iceberg and S3
🔵 Kafka and S3
S3 (“Simple Storage Service”) launched on March 14, 2006
S3 is an object storage service with an HTTP REST API
S3
Objects are stored in buckets
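To make the bucket/object model and the REST API concrete, here is a minimal sketch of how a bucket name and an object key map onto a virtual-hosted-style URL for a general purpose bucket. The bucket name, region, and key are hypothetical, and a real request to a private bucket would also need AWS Signature Version 4 authentication, which is omitted here.

```python
from urllib.parse import quote


def object_url(bucket: str, region: str, key: str) -> str:
    """Build the virtual-hosted-style HTTPS URL for an object.

    The bucket becomes part of the host name and the key becomes the path.
    """
    # Percent-encode characters that are not URL-safe, but keep "/" so the
    # key's prefix structure stays readable in the path.
    return f"https://{bucket}.s3.{region}.amazonaws.com/{quote(key, safe='/')}"


# Hypothetical bucket and key, for illustration only.
url = object_url("example-bucket", "us-east-1", "warehouse/db/table/data 01.parquet")
print(url)
```

A GET on that URL reads the object, a PUT writes it, and a DELETE removes it, which is the sense in which S3 is a REST API.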
🔵 General purpose buckets
🔵 Directory buckets
🔵 Table buckets
Directory buckets: with S3 Express One Zone, data is stored in a different bucket type, an S3 directory bucket, which can support up to 2 million requests per second
S3 Express One Zone: select a specific Availability Zone within an AWS Region to store your data
S3 Express One Zone: co-locate your storage and compute resources in the same Availability Zone to further optimize performance
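The Availability Zone choice shows up in the directory bucket's name. The sketch below assumes the naming format described in the AWS documentation, `base-name--zone-id--x-s3`; the base name and zone ID are hypothetical, and the helper functions are illustrative rather than part of any AWS SDK.

```python
def directory_bucket_name(base_name: str, zone_id: str) -> str:
    """Compose a directory bucket name from a base name and an Availability Zone ID."""
    return f"{base_name}--{zone_id}--x-s3"


def zone_id_of(bucket_name: str) -> str:
    """Recover the Availability Zone ID from a directory bucket name."""
    base, zone_id, suffix = bucket_name.rsplit("--", 2)
    if suffix != "x-s3" or not base:
        raise ValueError(f"not a directory bucket name: {bucket_name!r}")
    return zone_id


# Hypothetical values, for illustration only.
name = directory_bucket_name("analytics-scratch", "use1-az4")
print(name)
print(zone_id_of(name))
```

Compute that should sit next to this data would then be launched in the Availability Zone with the same zone ID.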
Table buckets: “Table buckets” were announced at AWS re:Invent 2024
S3 Tables: fully managed Iceberg tables. Iceberg tables as a first-class AWS resource
S3 Analytics Accelerator
📍 integrates with AWS CRT
📍 sequential prefetching
📍 small object prefetching
📍 Parquet footer caching
📍 predictive column prefetching
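Footer caching matters because of how Parquet files are laid out: the file metadata sits at the end of the file, followed by a 4-byte little-endian footer length and the magic bytes `PAR1`. A reader on S3 therefore issues ranged GETs against the tail of the object before it can read any column data. The sketch below is not the accelerator's implementation; it only illustrates, with hypothetical sizes, which byte range a reader has to fetch and hence what a footer cache can avoid re-fetching.

```python
import struct

PARQUET_MAGIC = b"PAR1"


def footer_byte_range(object_size: int, tail: bytes) -> tuple[int, int]:
    """Return the inclusive (start, end) byte range of the Parquet footer metadata.

    `tail` is the last 8 bytes of the object: a 4-byte little-endian footer
    length followed by the 4-byte magic.
    """
    if len(tail) != 8 or tail[4:] != PARQUET_MAGIC:
        raise ValueError("tail does not look like the end of a Parquet file")
    (footer_len,) = struct.unpack("<I", tail[:4])
    end = object_size - 8 - 1          # last metadata byte, just before the tail
    start = end - footer_len + 1
    if start < 0:
        raise ValueError("footer length exceeds object size")
    return start, end


def range_header(start: int, end: int) -> str:
    """Format an HTTP Range header value for an inclusive byte range."""
    return f"bytes={start}-{end}"


# Hypothetical object: 1,000,000 bytes with a 2,048-byte footer.
tail = struct.pack("<I", 2048) + PARQUET_MAGIC
start, end = footer_byte_range(1_000_000, tail)
print(range_header(start, end))
```

Without a cache, every reader that opens the same file repeats these tail requests; with one, only the first open pays for them.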
S3 Analytics Accelerator with Iceberg: set the Iceberg S3 property s3.analytics-accelerator.enabled=true (see S3FileIOProperties)
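One way to pass that property is through Spark, where Iceberg catalog properties are set as `spark.sql.catalog.<catalog-name>.<property>`. The sketch below only assembles the configuration keys. The catalog name `demo` is hypothetical, the catalog type and warehouse settings a real deployment needs are omitted, and it assumes an Iceberg release whose S3FileIO includes the accelerator integration.

```python
def spark_catalog_conf(catalog: str, properties: dict[str, str]) -> dict[str, str]:
    """Expand Iceberg catalog properties into Spark configuration keys."""
    prefix = f"spark.sql.catalog.{catalog}"
    # The bare prefix key names the Spark catalog implementation.
    conf = {prefix: "org.apache.iceberg.spark.SparkCatalog"}
    for key, value in properties.items():
        conf[f"{prefix}.{key}"] = value
    return conf


# "demo" is a hypothetical catalog name.
conf = spark_catalog_conf(
    "demo",
    {
        "io-impl": "org.apache.iceberg.aws.s3.S3FileIO",
        "s3.analytics-accelerator.enabled": "true",  # property named on the slide
    },
)
for key, value in conf.items():
    print(f"--conf {key}={value}")
```

The printed lines can be appended to a `spark-submit` or `spark-sql` command line alongside the rest of the catalog configuration.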
July 2023: WarpStream. WarpStream is an Apache Kafka protocol-compatible data streaming platform built directly on top of S3
June 2024: WarpStream
May 2025: KIP-1150
“Apache Kafka is often operated in cloud hyperscaler environments where high-reliability object storage is available and more cost-effective than block storage for equivalent workloads”
https://github.com/aiven/inkless
The End
Bonus content
RisingWave: S3 as primary storage
S3 Analytics Accelerator