tool · [5 sources] · 2025-10-08 14:28

Nexus Labs tests AI gateways for failover reliability

By PulseAugur Editorial · Summary by gemini-2.5-flash-lite from 5 sources

Nexus Labs ran a 30-day production test of three AI gateways (Bifrost, LiteLLM, and Portkey) to evaluate their failover capabilities and latency overhead. Bifrost added 11ms of p99 latency with automatic provider fallback enabled and successfully rerouted traffic during an OpenAI outage. LiteLLM offered valuable custom cost-tracking callbacks and Portkey showed promise, but Bifrost's synchronous fallback evaluation was noted as a key advantage for reliable production traffic management.

IMPACT Provides insights into optimizing LLM request routing and failover, crucial for maintaining service reliability and managing costs in production AI systems.

RANK_REASON The article details a technical evaluation and comparison of AI gateway products based on production data, fitting the research category.

COVERAGE [5]

HN — MCP stories TIER_1 · elfenleid · 2025-10-08 14:28

Show HN: Recall: Give Claude memory with Redis-backed persistent context

dev.to — LLM tag TIER_1 · Marco Rinaldi · 2026-05-21 16:52

Routing Event-Camera Pipelines Through an LLM Gateway: A Field Report

TL;DR: We added a vision-language stage to an event-camera pipeline at Prophesee and the LLM provider routing became the messiest part. Bifrost handled the failover and the OpenAI-compatible surface without forcing us to rewrite the C++ side. Honest comparison vs LiteL…
dev.to — LLM tag TIER_1 · Marcus Chen · 2026-05-21 16:02

Measuring AI Gateway Failover: 30 Days of Production Data

TL;DR: We measured failover latency across three AI gateways (Bifrost, LiteLLM, Portkey) during 30 days of production traffic at Nexus Labs. Bifrost added 11ms p99 overhead with automatic provider fallback. The model is the easy part. Routing it reliably is not.…
dev.to — LLM tag TIER_1 · Elise Moreau · 2026-05-21 14:52

Routing diffusion inference traffic across three providers

TL;DR: We route a mix of diffusion and LLM traffic across three providers from a single Go-based gateway called Bifrost. The 11 microsecond overhead is real, the failover works, and the part I care about most (weighted routing for cost vs latency tradeoffs) finally sto…
dev.to — LLM tag TIER_1 · claire nguyen · 2026-05-19 04:22

Putting an LLM Gateway in Front of Our Build Agents: Why We Picked Bifrost

TL;DR: We bolted an LLM gateway in front of the AI features in our build pipeline tooling and ended up running Bifrost instead of LiteLLM or Kong. The deciding factor wasn't features, it was the 11 microsecond overhead and the fact it didn't fall over when one provider…
