English(EN) Auto-labelling 1.2M robotics frames with VLMs: a failover story

Bifrost网关提升机器人和智能体的LLM成本和数据质量

作者 PulseAugur 编辑部 · [2 个来源] · 2026-05-25 16:03

Nexus Labs和Prophesee的两个独立团队采用了Bifrost（一个开源网关）来管理与多个大型语言模型的交互。Prophesee使用Bifrost为120万个机器人帧添加字幕，通过智能地在GPT-4o、Claude 3.7 Sonnet和Gemini 2.5 Pro之间路由请求，节省了22%的成本。Nexus Labs实施了Bifrost来提高其智能体训练数据的质量，发现由于模型行为不一致和提供商隐藏的故障，近一半的生产跟踪数据无法使用。通过使用Bifrost的高级故障转移和日志记录功能，他们能够将损坏的跟踪数据从17%降低到3%以下，从而实现更可靠的微调。 AI

影响 Bifrost被多个团队采用，凸显了对强大基础设施日益增长的需求，以管理LLM API成本并确保智能体开发的 डेटा质量。

排序理由该集群描述了用于管理LLM API交互的开源网关工具的采用和优势，而不是核心AI模型发布或研究。

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

dev.to — LLM tag TIER_1 English(EN) · Marco Rinaldi · 2026-05-25 16:53

使用VLMs自动标注120万个机器人帧：一个故障转移的故事

<p><strong>TL;DR: We needed to caption 1.2M reconstructed event-camera frames using vision-language models for auxiliary supervision. The first run died at 340K from Anthropic rate limits. Putting Bifrost in front of three VLM providers cut the rerun cost by 22% and finished in 9…
dev.to — LLM tag TIER_1 English(EN) · Marcus Chen · 2026-05-25 16:03

我们审计了我们的Agent工具调用跟踪。一半的评估数据是垃圾。

<p><strong>TL;DR: We pulled 41,000 production agent traces at Nexus Labs to build a fine-tuning dataset. After a manual audit of 1,200 of them, ~48% were unusable: tool calls that "succeeded" but returned wrong data, retries masking provider failures, and silent fallbacks that ch…

报道来源 [2]

使用VLMs自动标注120万个机器人帧：一个故障转移的故事

我们审计了我们的Agent工具调用跟踪。一半的评估数据是垃圾。

相关实体

相关话题