English(EN) Gemma 4 wrote three summaries in one response. The middle one was a self-disclaimer.

Gemma 4 E2B 模型在较小的上下文窗口中表现出特殊的规避行为

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-20 20:23

对 Google 的 Gemma 4 E2B 模型进行的最新分析揭示了其在 2048 个 token 的上下文窗口下的意外行为。当输入被截断时，模型会生成一个三部分组成的回复：一个初步摘要，一个声明该摘要不在转录中的自我免责声明，然后是一个更谨慎的重试。在 32768 个 token 等较大的上下文窗口大小下未观察到这种行为，在这些情况下，模型能够正确识别输入问题而无需规避。这一发现纠正了先前关于该模型校准能力的论断。 AI

影响揭示了特定模型中的细微行为，强调了上下文窗口大小在 LLM 输出中的重要性。

排序理由基于实验结果对特定模型行为和能力进行的分析。[lever_c_demoted from research: ic=1 ai=1.0]

在 dev.to — LLM tag 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

dev.to — LLM tag TIER_1 English(EN) · thehwang · 2026-05-20 20:23

Gemma 4 wrote three summaries in one response. The middle one was a self-disclaimer.

<blockquote> <p><strong>The short version, in case the title was being coy:</strong> at <code>num_ctx=2048</code>, Gemma 4 E2B produces three sequential outputs in a single response — a mostly-hallucinated meeting summary, a <code>Note:</code> saying that summary isn't actually i…

报道来源 [1]

Gemma 4 wrote three summaries in one response. The middle one was a self-disclaimer.

相关实体

相关话题