Agenda
- Pitfalls of Data Lake using Append-Only Distributed File System
- CDC-based UPSERT in Data Lake
- Using Views to UPSERT
- Using Open Table Formats – Apache Iceberg, Hudi, Delta Lake
- Modern Transactional Data Lake Architecture
- Streaming Migrations for Analyitcs on AWS