O Open Source Orchestration medium

Bifrost

by Community

Fastest enterprise AI gateway (50x faster than LiteLLM) with adaptive load balancer, cluster mode, guardrails, 1000+ models support & <100 µs overhead at 5k RPS.

Visit Community View repo Submit your build →

OSS

Bifrost

Added 1 June 2026

#ai-gateway #gateway #gateway-services #generative-ai #guardrails #llm #llm-cost #llm-gateway

Overview

Bifrost is a high-performance AI gateway written in Go. It adaptively load balances requests across 1000+ models, enforces guardrails, and operates in cluster mode with under 100 µs overhead at 5,000 requests per second.

Best for

Best for
Engineering teams needing ultra-fast, scalable model routing with built-in safety controls

Use cases

Route inference requests to the optimal model based on real-time performance
Apply safety guardrails and policies across multiple AI model endpoints
Scale model serving horizontally with cluster mode for high throughput

Notes

5,406 stars on GitHub. Last updated 2026-06-01. Licensed Apache-2.0.

Use cases

Route inference requests to the optimal model based on real-time performance
Apply safety guardrails and policies across multiple AI model endpoints
Scale model serving horizontally with cluster mode for high throughput

Pros

Extremely low latency overhead even at high request rates
Supports over 1000 models, reducing vendor lock-in
Built-in guardrails for safety without external services

Cons

Community vendor may offer limited enterprise support
Go language can be a barrier for teams without Go experience
Advanced clustering adds operational complexity for small deployments

Indexed from awesome-langchain and enriched against its public facts.

Pros

Extremely low latency overhead even at high request rates
Supports over 1000 models, reducing vendor lock-in
Built-in guardrails for safety without external services

Cons

Community vendor may offer limited enterprise support
Go language can be a barrier for teams without Go experience
Advanced clustering adds operational complexity for small deployments

Pairs with

Other entries in the index that connect to this one. Click through to see the chain.

Pairs with3entries

P Apps Productivity low

Open WebUI

Various

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

★ 139,558 updated 2d ago

O OSS Framework medium

ollama

Community

Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.

★ 172,846 updated 2d ago

O OSS Framework medium

vLLM

Community

A high-throughput and memory-efficient inference and serving engine for LLMs

★ 81,619 updated 2d ago

← Back to Open Source Submit your own entry →