English(EN) Opus 4.8 Now Flagging Bizarre Conversations as Security Risks

Anthropic 的 Claude Opus 4.8 将正常对话标记为安全风险

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-24 00:14

用户报告称 Anthropic 的 Claude Opus 4.8 模型表现出异常行为，将看似无害的对话标记为安全风险并拒绝响应。一位用户分享了一个例子，其中关于用于干旱地区捕获水分的假设性织物的查询被标记。这种行为让人想起之前在较低级别模型中出现的问题，导致用户对模型的可靠性和安全协议表示担忧。 AI

影响由于意外的安全标记，可能导致用户沮丧感增加，并降低对模型可靠性的信任。

排序理由用户报告意外的模型行为和拒绝，并非官方发布或基准测试。

在 r/ClaudeAI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

Anthropic 的 Claude Opus 4.8 将正常对话标记为安全风险

报道来源 [1]

r/ClaudeAI TIER_2 English(EN) · /u/Pndapetzim · 2026-06-24 00:14

Opus 4.8 现在将奇怪的对话标记为安全风险

<div class="md"><p>Recently asked it the following question: </p> <p>"Here's another idea, in a region where water is scarce, I'm contemplating a fine weave fabric that air can pass through to trap moisture. My idea would be treating the fabric with a hydropho…

报道来源 [1]

Opus 4.8 现在将奇怪的对话标记为安全风险

相关实体

相关话题