Nexus Labs
PulseAugur coverage of Nexus Labs: every cluster mentioning Nexus Labs across labs, papers, and developer communities, ranked by signal.
2 days with sentiment data
-
Bifrost gateway improves LLM cost, data quality for robotics and agents
Two separate teams at Nexus Labs and Prophesee have adopted Bifrost, an open-source gateway, to manage their interactions with multiple large language models. Prophesee used Bifrost to caption 1.2 million robotics frame…
-
LLM evaluation harness updated with production data and adversarial testing
A new approach to evaluating Large Language Models (LLMs) has been proposed to address the issue of static evaluation harnesses failing to detect model regressions. This method involves refreshing evaluation datasets we…
-
Measuring AI Gateway Failover: 30 Days of Production Data
Anthropic has released an update on Claude's sycophancy, noting that Opus 4.7 shows a 50% reduction in sycophantic responses compared to Opus 4.6, particularly in relationship guidance conversations. The company also de…