Can a 0.6B model coordinate frontier LLMs better than any one alone? TRINITY (ICLR 2026) trains a tiny coordinator to route each turn to one of seven larger LLM

Tiny coordinator model TRINITY steers frontier LLMs to a new benchmark SOTA

By the PulseAugur editorial team · [1 source] · 2026-06-29 22:07

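Researchers have developed TRINITY, a novel method that uses a small 0.6-billion-parameter model to coordinate multiple large frontier LLMs. Because the reward is sparse, the coordinator is trained with an evolution strategy rather than gradient descent. On each turn it routes the task to a specialized LLM acting as a Thinker, Worker, or Verifier. TRINITY reaches a new SOTA score of 86.2% on the LiveCodeBench benchmark, demonstrating that it can coordinate complex LLM tasks effectively without a significant increase in the coordinator's own capability. The system is now integrated into Sakana's Fugu.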
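The routing-plus-evolution idea in the summary above can be illustrated with a toy sketch. Everything here is an assumption made for illustration: the three one-hot task features, the linear scoring policy, the hard-coded "correct" routes, and the 0/1 reward are hypothetical and are not taken from the paper. The optimizer is a simplified diagonal-Gaussian evolution strategy (closer to the cross-entropy method), standing in for the sep-CMA-ES that the source post names, not reproducing it.

```python
import random

N_MODELS = 7                                  # seven larger LLMs, as in the source post
ROLES = ("Thinker", "Worker", "Verifier")
ACTIONS = [(m, r) for m in range(N_MODELS) for r in ROLES]  # 21 (model, role) pairs
N_FEATURES = 3                                # hypothetical one-hot task-type features

# Hypothetical toy tasks: (features, index of the only action that earns reward).
TASKS = [
    ((1.0, 0.0, 0.0), ACTIONS.index((2, "Thinker"))),
    ((0.0, 1.0, 0.0), ACTIONS.index((5, "Worker"))),
    ((0.0, 0.0, 1.0), ACTIONS.index((0, "Verifier"))),
]


def route(theta, features):
    """Return the (model, role) pair with the highest linear score."""
    def score(a):
        row = theta[a * N_FEATURES:(a + 1) * N_FEATURES]
        return sum(w * x for w, x in zip(row, features))
    return ACTIONS[max(range(len(ACTIONS)), key=score)]


def fitness(theta):
    """Mean binary reward: 1 for a correct route, 0 otherwise (no gradient signal)."""
    hits = sum(1 for feats, target in TASKS if route(theta, feats) == ACTIONS[target])
    return hits / len(TASKS)


def train(generations=60, pop_size=40, n_elite=10, seed=0):
    """Simplified diagonal-Gaussian ES; returns the best parameters seen and their fitness."""
    rng = random.Random(seed)
    dim = len(ACTIONS) * N_FEATURES
    mean, sigma = [0.0] * dim, [1.0] * dim
    best_theta, best_fit = list(mean), fitness(mean)
    for _ in range(generations):
        # Sample a population around the current mean, one variance per coordinate.
        pop = [[rng.gauss(m, s) for m, s in zip(mean, sigma)] for _ in range(pop_size)]
        ranked = sorted(pop, key=fitness, reverse=True)
        elite = ranked[:n_elite]
        top_fit = fitness(ranked[0])
        if top_fit > best_fit:
            best_theta, best_fit = ranked[0], top_fit
        # Refit the mean and per-coordinate spread to the elite samples.
        mean = [sum(ind[i] for ind in elite) / n_elite for i in range(dim)]
        sigma = [
            max(0.05, (sum((ind[i] - mean[i]) ** 2 for ind in elite) / n_elite) ** 0.5)
            for i in range(dim)
        ]
    return best_theta, best_fit


if __name__ == "__main__":
    theta, fit = train()
    print("best fitness:", fit)
    for feats, _ in TASKS:
        print(feats, "->", route(theta, feats))
```

The point of the sketch is only that a 0/1 reward gives a gradient-based optimizer nothing to follow, while a population-based search needs just a ranking of candidates. A real coordinator would score conversation state with a language model rather than a three-feature linear map.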

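Impact: This approach could lead to more efficient and more capable multi-LLM systems, potentially improving performance on complex tasks through specialized routing.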

Ranking rationale: This cluster describes a research paper detailing a new model architecture and its benchmark performance.

AI-generated summary · Google Gemini · from 1 source.

Sources [1]

Mastodon — mastodon.social TIER_1 English(EN) · [email protected] · 2026-06-29 22:07

Can a 0.6B model coordinate frontier LLMs better than any one alone? TRINITY (ICLR 2026) trains a tiny coordinator to route each turn to one of seven larger LLMs as Thinker, Worker, or Verifier. The optimizer is an evolution strategy (sep-CMA-ES), not gradient descent: the binary…

Link: benjaminhan.net/…/20260629-trinity-llm-co…
