Six months ago, my team needed to deploy DeepSeek-R1 for internal use. We have a Kubernetes cluster — like everyone does in 2026 — so I started looking for tools.

The problem

There are basically three options for running LLMs on Kubernetes:

KAITO (Microsoft) — CNCF Sandbox, 1600+ stars, but Azure-only. We are on AWS.

KServe — CNCF Incubating, solid project, but requires Knative + Istio + 5+ other components.