研究人员 catalog 63 起 LLM-agent 预算超支事件，并提出 Rust 缓解方案

作者 PulseAugur 编辑部 · [2 个来源] · 2026-06-02 00:00

研究人员 catalog 了 2023 年至 2026 年间 21 个框架中的 63 起 LLM-agent 预算超支事件，详细说明了财务损失并对失败类型进行了分类。为了缓解这些问题，他们开发了一个名为 `token-budgets` 的 Rust crate，该 crate 使用仿射（affine）类型来防止常见的错误，例如在编译时双重花费或在委托后使用预算。虽然更简单的 Python 实现可以处理单智能体任务，但 Rust crate 在多智能体场景中表现出更优越的安全性，可以防止导致超支的委托竞争。 AI

影响提供了一个 LLM-agent 成本故障的分类法以及一种防止这些故障的新型类型系统方法，有可能降低运营成本并提高可靠性。

排序理由学术论文，详细介绍了实证研究结果和提出的缓解策略。

在 Hugging Face Daily Papers 阅读 →

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.MA (Multiagent) TIER_1 English(EN) · Sajjad Khan · 2026-06-02 10:46

Token Budgets: An Empirical Catalog of 63 LLM-Agent Budget-Overrun Incidents, with an Affine-Typed Rust Mitigation as a Case Study

LLM-agent budget overruns are a documented production failure class: a single retry loop can spend thousands of dollars before an operator notices, and the in-process integrity properties that would prevent it (no aliasing, no double-spend, no use-after-delegation of a cost-beari…
Hugging Face Daily Papers TIER_1 English(EN) · 2026-06-02 00:00

Token Budgets: An Empirical Catalog of 63 LLM-Agent Budget-Overrun Incidents, with an Affine-Typed Rust Mitigation as a Case Study

LLM-agent budget overruns are a documented production failure class: a single retry loop can spend thousands of dollars before an operator notices, and the in-process integrity properties that would prevent it (no aliasing, no double-spend, no use-after-delegation of a cost-beari…

报道来源 [2]

Token Budgets: An Empirical Catalog of 63 LLM-Agent Budget-Overrun Incidents, with an Affine-Typed Rust Mitigation as a Case Study

Token Budgets: An Empirical Catalog of 63 LLM-Agent Budget-Overrun Incidents, with an Affine-Typed Rust Mitigation as a Case Study

相关实体

相关话题