Six months ago, my team needed to deploy DeepSeek-R1 for internal use. We have a Kubernetes cluster — like everyone does in 2026 — so I started looking for tools.
The problem
There are basically three options for running LLMs on Kubernetes:
KAITO (Microsoft) — CNCF Sandbox, 1600+ stars, but Azure-only. We are on AWS.
KServe — CNCF Incubating, solid project, but requires Knative + ISTIO + 5+ other components.






