English(EN) Five AI Systems. Same Prompts, Twice. Wildly Different Responses.

领先的AI模型在相同提示上表现出显著分歧

作者 PulseAugur 编辑部 · [1 个来源] · 2026-06-30 03:44

对五个领先的AI系统——GPT-4、Claude 3 Opus、Claude 3 Sonnet、Gemini 1.5 Pro和Llama 3——的最新分析显示，它们对相同提示的回应存在显著的不一致性。当两次被问及相同的伦理和安全问题时，这些系统在自身和彼此之间频繁出现分歧，分歧率从34%到66%不等。这种变异性甚至发生在公认的伦理原则上，表明当前AI模型缺乏稳定的推理能力或存在根本性的架构问题。 AI

影响凸显了AI推理中潜在的不可靠性，影响了在关键应用中的信任和部署。

排序理由对AI模型行为的分析，而非直接发布或产品公告。

在 Towards AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

Towards AI TIER_1 English(EN) · Thomas D. Holt · 2026-06-30 03:44

Five AI Systems. Same Prompts, Twice. Wildly Different Responses.

<h4><em>The systems didn’t just disagree with each other. They disagreed with themselves.</em></h4><figure><img alt="" src="https://cdn-images-1.medium.com/max/1024/1*G07h1HPiOi04ATt0KYHAaA.png" /><figcaption>Rate of disagreement across five leading AI systems on identical ethics…

报道来源 [1]

Five AI Systems. Same Prompts, Twice. Wildly Different Responses.

相关实体

相关话题