
Kong AI Gateway is a set of plugins for Kong Gateway that turn Kong into an LLM-aware API proxy. It adds multi-provider routing, semantic routing, prompt guardrails, token-based rate limiting, and MCP exposure to Kong's established gateway runtime built on OpenResty and Nginx.
The AI Proxy plugin routes requests to OpenAI, Anthropic, Azure, Cohere, Mistral, Gemini, Llama, Bedrock, and OpenAI-compatible upstreams, translating request and response formats between them. The AI Prompt Guard and AI Prompt Decorator plugins enforce prompt-level policies: blocking prohibited content, injecting system prompts, or redacting inputs. AI Rate Limiting Advanced counts tokens rather than requests for more accurate budget enforcement. AI Semantic Routing dispatches traffic to different upstream models based on intent classification. The Kong MCP plugin (3.12+, late 2025) exposes Kong-managed services as MCP servers for agent clients.
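As a rough illustration, the AI Proxy plugin attaches to a service or route in Kong's declarative config. The sketch below follows the plugin's published schema, but field names and defaults vary by Kong version, and the route name and API-key reference are placeholders — verify against the AI Proxy documentation for your release:

```yaml
# Sketch: AI Proxy attached to a route in DB-less declarative config.
# "chat-route" and the key reference are illustrative, not from the source.
plugins:
  - name: ai-proxy
    route: chat-route            # assumed route name
    config:
      route_type: llm/v1/chat    # normalize to the provider's chat format
      auth:
        header_name: Authorization
        header_value: Bearer ${OPENAI_API_KEY}   # resolve via decK env substitution or a vault
      model:
        provider: openai
        name: gpt-4o
```

Swapping `provider` and `model.name` (e.g. to `anthropic` or a Bedrock model) is how the same route is repointed at a different upstream without changing clients.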
Kong AI Gateway suits organizations that already run Kong for API traffic management and want to apply the same governance layer to LLM and agent traffic without introducing a parallel gateway. Kong's declarative configuration model also fits GitOps-style operations.
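The GitOps fit comes from keeping the whole gateway state in a versioned declarative file. A minimal sketch of such a file, assuming the hypothetical service and route names below (applied with a tool like decK or loaded directly in DB-less mode):

```yaml
# Sketch: kong.yml kept in Git; service and route names are illustrative.
_format_version: "3.0"
services:
  - name: llm-service
    url: https://api.openai.com    # nominal upstream; AI plugins can redirect per provider
    routes:
      - name: chat-route
        paths:
          - /chat
```

Changes to routes, plugins, and rate limits then flow through the same pull-request review process as application code.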
Kong Gateway core is Apache 2.0. Several AI-focused plugins and supporting features are Kong Enterprise (commercial): AI Rate Limiting Advanced, AI Semantic Routing, AI Proxy Advanced, the OIDC authentication plugin, and advanced observability. The open-source AI Proxy and Prompt Guard plugins are usable on their own; the open-source JWT plugin handles token auth without automatic JWKS refresh.
Kong runs as a Docker container on any container platform, including Docker Swarm and Kubernetes, in either database mode (Postgres or Cassandra) or DB-less mode (declarative YAML). Konga or the open-source admin UI handles configuration, though Enterprise-only plugins do not render in the open-source admin.
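A minimal sketch of a DB-less deployment with Docker Compose, assuming a `kong.yml` declarative file alongside the compose file (the image tag and port mappings are illustrative; `KONG_DATABASE=off` and `KONG_DECLARATIVE_CONFIG` are the standard DB-less settings):

```yaml
# Sketch: DB-less Kong under Docker Compose; declarative config mounted read-only.
services:
  kong:
    image: kong:3.9                # version illustrative
    environment:
      KONG_DATABASE: "off"                      # DB-less mode
      KONG_DECLARATIVE_CONFIG: /kong/kong.yml   # config loaded at startup
      KONG_PROXY_LISTEN: 0.0.0.0:8000
      KONG_ADMIN_LISTEN: 0.0.0.0:8001
    volumes:
      - ./kong.yml:/kong/kong.yml:ro
    ports:
      - "8000:8000"   # proxy
      - "8001:8001"   # admin API (read-only in DB-less mode)
```

In DB-less mode the Admin API is effectively read-only; configuration changes go through the mounted file, which pairs naturally with the GitOps workflow above.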