Kubernetes HPA Scale to Zero Without KEDA: Native Autoscaling for Idle Workloads

If you run queue processors, batch workers, or event-driven workloads that sit idle for hours between...

mercoledì 27 maggio 2026 New tab

2,148 words~10 min read

If you run queue processors, batch workers, or event-driven workloads that sit idle for hours between bursts, you're paying for compute you don't need. Kubernetes HPA can scale these deployments to zero replicas — no KEDA, no Knative, no external controllers required. You need one feature gate, an external metrics source, and about twenty minutes of setup. When the queue is empty your pods disappear, and if you pair this with cluster autoscaler, the nodes disappear too. Real scale-to-zero, using nothing but native Kubernetes primitives.

Quick Reference

Requirement

Detail

Feature gate

Kubernetes HPA Scale to Zero Without KEDA: Native Autoscaling for Idle Workloads

Kubernetes HPA Scale to Zero Without KEDA: Native Autoscaling for Idle Workloads

Other newsrooms on this story

Related reading

Deploy Datadog Kubernetes Autoscaling at scale | Datadog

GPU autoscaling on Kubernetes with KEDA: building an external scaler with NVML

Kubernetes Is Eating Your Budget: How to Fix EKS Over-Provisioning

Strategies for running AI workloads on GKE without committed quota

Scaling Kubernetes workloads on custom metrics | Datadog

Kubernetes Pod Autoscaling: A Key to Efficient Resource Utilization

Other newsrooms on this story

Related reading

Deploy Datadog Kubernetes Autoscaling at scale | Datadog

GPU autoscaling on Kubernetes with KEDA: building an external scaler with NVML

Kubernetes Is Eating Your Budget: How to Fix EKS Over-Provisioning

Strategies for running AI workloads on GKE without committed quota

Scaling Kubernetes workloads on custom metrics | Datadog

Kubernetes Pod Autoscaling: A Key to Efficient Resource Utilization