Upgrade to Pro — share decks privately, control downloads, hide ads and more …

Ins and Outs of the Outbox Pattern

Ins and Outs of the Outbox Pattern

The outbox pattern is a common solution for implementing data flows between microservices. By channeling messages through an outbox table, it enables services to update their own local datastore and at the same time send out notifications to other services via data streaming platforms such as Apache Kafka, in a reliable and consistent way.

However, as with everything in IT, there’s no free lunch. How to handle backfills of outbox events, how to ensure idempotency for event consumers? Doesn’t the pattern cause the database to become a bottleneck? And what about alternatives such as “Listen-to-Yourself”, or the upcoming Kafka support for 2-phase commit transactions (KIP-939)?

It’s time to take another look at the outbox pattern! In this session I’ll start by bringing you up to speed on what the outbox pattern *is*, and then go on to discuss more details such as:

- Implementing the pattern safely and efficiently
- Its semantics, pros and cons
- Dealing with backfills
- Potential alternatives to the outbox pattern and the trade-offs they make

Gunnar Morling

March 19, 2025
Tweet

More Decks by Gunnar Morling

Other Decks in Programming

Transcript

  1. Gunnar Morling @gunnarmorling.dev Ins and Outs of the Outbox Pattern

    oatsy40 https://flic.kr/p/FvD3mP (CC BY 2.0) 🦋
  2. #OutboxPattern · @gunnarmorling.dev • Technologist at Confluent • Former project

    lead of Debezium • kcctl 🧸, JfrUnit, ModiTect, MapStruct • One Billion Row Challenge 1⃣🐝🏎 • Java Champion Gunnar Morling
  3. #OutboxPattern · @gunnarmorling.dev The Problem How to… Update a service’s

    database… …AND notify other services? Consistently.
  4. #OutboxPattern · @gunnarmorling.dev Debezium New and Lesser Known Features •

    Incremental snapshots • JDBC sink connector • Official support for MariaDB • Re-Select post processor • WASM-based SMT
  5. #OutboxPattern · @gunnarmorling.dev Consumer Idempotency How to Identify Duplicates? •

    A unique value • Not a database sequence • An increasing value, e.g. PG LSN