Claude Sonnet 4.6

PulseAugur coverage of Claude Sonnet 4.6 — every cluster mentioning Claude Sonnet 4.6 across labs, papers, and developer communities, ranked by signal.

Total · 30 days

Within 90 days: 46

Releases · 30 days

Within 90 days: 0

Papers · 30 days

Within 90 days: 21

Tier distribution · 90 days

frontier release 1
significant 1
research 10
tool 28
commentary 4
meme 2

Relationships

developed by Anthropic 100%
instance of Opus 4.7 90%
competes with Opus 4.7 70%
competes with Hacker News 70%
competes with Opus-4.6 70%
competes with ChatGPT Plus 70%
used by DeepSeek V4-Pro 70%
competes with DeepSeek V4-Pro 70%
uses Kimi K2.5 60%
other Claude Sonnet 4.5 60%
used by Hacker News 50%

Timeline

2026-05-15 product_launch Users report overactive refusal issues with Claude Sonnet 4.6.
2026-05-14 research_milestone A user observed a safety regression in Claude Sonnet 4.6 compared to version 4.5.
2026-04-15 product_launch Anthropic released Claude Sonnet 4.6, replacing the previous version.

Sentiment · 30 days

13 days with sentiment data

Recent · Page 3 of 3 · 46 items

TOOL · CL_04555 · Apr 26 · 22:18

AI tools show mixed results on personal life-strategy advice

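An experiment evaluated eight AI tools, including commercial life-coaching platforms and large language models such as GPT-5.3 and Claude Sonnet 4.6, on their ability to give life-strategy advice. Users were looking for wisdom and virtue-centered guidance rather than purely practical effectiveness. Custom-prompted versions of Claude, particularly Sonnet 4.6, outperformed both the commercial tools and general-purpose LLMs at offering insightful reframings of life goals. Commercial tools such as Auren and Sybil were criticized for making unsubstantiated psychological diagnoses or for giving bland, generic advice.

RESEARCH · CL_05048 · Apr 23 · 20:42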

LLMs show instability in psychiatric risk scores with irrelevant data

A new study evaluated the reliability of large language models (LLMs) in predicting psychiatric hospitalization risk. Researchers found that including medically insignificant details in patient profiles significantly in…

TOOL · CL_17455 · Apr 15 · 06:24

Anthropic's Sonnet 4.6 upgrade frustrates users with reduced capability

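Anthropic forced users to upgrade from Claude Sonnet 4.5 to Sonnet 4.6, but users report that Sonnet 4.6 is less capable and harder to steer. Developers are frustrated that they cannot pin to a specific model version, which makes application behavior unpredictable. Users also note that, compared with its predecessor, Sonnet 4.6 formats output more rigidly and is less able to imitate different writing styles.

RESEARCH · CL_47566 · Apr 9 · 13:05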

Anthropic's 'Mythos' AI deemed too dangerous for public release

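Anthropic has developed a new AI model called Claude Mythos, which shows significant gains in benchmark performance, particularly in identifying software vulnerabilities. Because of its advanced ability to find and exploit security flaws, Anthropic chose not to release Mythos publicly. Instead, the company is giving selected organizations limited access through "Project Glasswing" to assist with cybersecurity research and vulnerability discovery, and is strongly supporting open-source security initiatives.

FRONTIER RELEASE · CL_11191 · Apr 8 · 16:00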

RT Artificial Analysis: Meta is back! Muse Spark scores 52 on the Artificial Analysis Intelligence Index, behind only Gemini 3.1 Pro, GPT-5.4, and Cla...

Meta AI has released Muse Spark, a new frontier-class multimodal model developed by Meta Superintelligence Labs. This marks Meta's return to the frontier AI race after a period of relative quiet and is their first model…

TOOL · CL_19489 · Mar 19 · 16:01

Canary launches AI QA tool that outperforms GPT-5.4 and Claude Code at code verification

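Canary is a newly launched AI-powered QA tool that automates pull-request testing by understanding the codebase and generating end-to-end tests for user workflows. It aims to catch regressions before code is merged, filling a gap left by current AI coding assistants. Canary also introduced QA-Bench v0, a benchmark for code verification, on which its dedicated QA agent outperformed models such as GPT-5.4 and Claude Code.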
