Kafka without ZooKeeper: My Strimzi HA Playbook on K8s

Running Strimzi Kafka 4.1 in production on Kubernetes: KRaft, multi-zone HA, Cruise Control rebalancing, and the JVM and scheduling details that matter.

lunedì 1 giugno 2026 New tab

3,188 words~14 min read

I've been running Strimzi Kafka in production at scale for the past few years - multi-cloud, multi-zone, mixed broker sizes, the usual. The first year I spent more time firefighting Kafka than building anything on top of it. The next two I spent slowly stripping operational pain out of the setup until it stopped paging me.

This is the configuration I landed on, why each part is shaped the way it is, and the production failure modes that drove each decision. No theory, no marketing, no Hello-World defaults - only the cluster I actually run.

The problem

A production Kafka cluster on Kubernetes has to survive at least four things at once:

A single broker dying mid-write.

Kafka without ZooKeeper: My Strimzi HA Playbook on K8s

Kafka without ZooKeeper: My Strimzi HA Playbook on K8s

Related reading

Kafka for Data Engineers: Core Concepts, KRaft, and the Patterns That Actually…

Why Your Kafka Stack Is Holding You Back (And How to Fix It)

I Built an Interactive Kafka Playground (Partitions, Keys, Consumer Groups,…

Kafka Streams 101: A Developer’s Guide to Real-Time Application Logic

Kafka Partitioning Strategies: How to Get It Right Before It Costs You

Kafka is not a queue — and treating it like one will wreck your system

Related reading

Kafka for Data Engineers: Core Concepts, KRaft, and the Patterns That Actually…

Why Your Kafka Stack Is Holding You Back (And How to Fix It)

I Built an Interactive Kafka Playground (Partitions, Keys, Consumer Groups,…

Kafka Streams 101: A Developer’s Guide to Real-Time Application Logic

Kafka Partitioning Strategies: How to Get It Right Before It Costs You

Kafka is not a queue — and treating it like one will wreck your system