Français(FR) Your "Claude Opus" API Might Not Be Claude Opus

影子 LLM API 用更便宜的模型欺骗研究人员

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-22 02:12

CISPA 的研究人员审计了 17 个第三方“影子”LLM API，并发现了与其声称代表的官方模型相比，存在显著的性能差异。这些服务通常提供更便宜或完全不同的模型访问权限，导致学术研究的准确性下降。该研究确定了三种常见的替换模式：静默降级、跨供应商替换和基于上下文长度的部分路由，简单的指纹测试能够检测到其中许多欺骗行为，但并非全部。 AI

影响当研究依赖于被误导的 LLM API 时，学术研究的完整性会受到损害，可能导致研究结果无效。

排序理由该集群报道了一篇已发表的学术论文，详细介绍了对 LLM API 的审计。[lever_c_demoted from research: ic=1 ai=1.0]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

dev.to — LLM tag TIER_1 Français(FR) · Thousand Miles AI · 2026-05-22 02:12

您的“Claude Opus”API 可能不是 Claude Opus

<p>In March 2026, researchers at the CISPA Helmholtz Center for Information Security audited 17 third-party "shadow" LLM APIs against the official endpoints they claimed to wrap. A proxy marketed as <code>Gemini-2.5</code> scored 37% on a medical benchmark where the real endpoint…

报道来源 [1]

您的“Claude Opus”API 可能不是 Claude Opus

相关实体

相关话题