Proxy OpenAI Through Kong AI Gateway on Kubernetes

The Problem With Talking Directly to LLMs

Most teams start by wiring their app straight to the OpenAI API. It works — until you need to add auth, rate limiting, observability, or swap out the model provider. Now you're rewriting application code instead of config.

An AI Gateway solves this. One entry point, one place to govern traffic, providers become swappable. Kong Gateway is a mature choice here — it's been doing this for APIs for years, and the AI Proxy plugin extends that to LLMs.

This post walks through the key ideas. For the full step-by-step guide, head over to the tutorial on Hashnode.

What We're Building

Proxy OpenAI Through Kong AI Gateway on Kubernetes

Related reading

Transforming Kong into an AI Gateway on GCP: Managing LLM Tokens, MCP, and…

How to Build a Self-Hosted AI Gateway With LiteLLM and Open WebUI

Enterprise LLM Gateway: Route, govern, and secure your AI traffic

OpenAI-Compatible Gateway Control Plane Checklist

How to test an OpenAI-compatible AI API gateway without rewriting your app

OpenRouter vs LiteLLM vs Portkey vs a Managed OpenAI-Compatible Gateway