Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production

Running AI agents in a local script is straightforward. Running them reliably in production across teams, across restarts, with isolated environments per context is a different problem entirely. BerriAI, the company behind the LiteLLM AI Gateway, is now open-sourcing a purpose-built answer to that problem: the LiteLLM Agent Platform. The platform is described as a simple, self-hosted infrastructure platform for running multiple agents in production.

What Problem Does it Solve?

It helps to understand what happens when you try to scale agents beyond a single process. Agents are stateful: they carry session history, tool call results, and intermediate reasoning across turns. If the container running your agent crashes, restarts, or gets replaced during a deployment, that session state is gone unless something is explicitly managing it. At the same time, different teams often need different runtime environments, different tools, different secrets, different access scopes which means you cannot throw all agents into one shared container.

The platform manages two things: per-team and per-context sandboxes, and session continuity across pod restarts and upgrades. These two capabilities are the core infrastructure primitives the platform provides.

Meet LiteLLM Agent Platform: A Kubernetes-Based, Self-Hosted Infrastructure Layer for Isolated Agent Sandboxes and Persistent Session Management in Production

Other newsrooms on this story

Related reading

The Missing Operational Layer Between Agent Prototypes and Production

Other newsrooms on this story

Related reading

The Missing Operational Layer Between Agent Prototypes and Production

LiteLLM Is Moving to Rust. Here's What the Benchmarks Look Like.

The Brain/Sandbox Pattern: Why Your Production Agent Needs This Architecture

When to Move Beyond LiteLLM (And When Not To)

How to Build a Self-Hosted AI Gateway With LiteLLM and Open WebUI

Agent Substrate: The Agentic AI Isolation Layer On K8s