Entity
HealthBench Professional
PulseAugur coverage of HealthBench Professional — every cluster mentioning HealthBench Professional across labs, papers, and developer communities, ranked by signal.
Total · 30 days: 2 (2 in 90 days)
Releases · 30 days: 0 (0 in 90 days)
Papers · 30 days: 1 (1 in 90 days)
Tier distribution · 90 days
Sentiment · 30 days: 1 day with sentiment data
Recent · page 1 of 1 · 2 items
-
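MDIA agent achieves a high score on the HealthBench Professional benchmark
Researchers developed MDIA, a multi-agent diagnostic agent that uses a 7-node clinical reasoning graph and performs strongly on the HealthBench Professional benchmark. When evaluated using OpenAI's GPT-5.4-2026-03-05, MDIA scored 0.6272, which is 3.72 percentage points higher than ChatGPT for Clinicians. The study indicates that architectural design, including specialty routing and context preservation, rather than prompt engineering alone, has a significant effect on agent performance.…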
-
OpenAI's ChatGPT for Clinicians outperforms doctors in tests, speeds up medical AI strategy
OpenAI has launched ChatGPT for Clinicians, a specialized AI platform designed to assist medical professionals. In clinical trials, the platform achieved a score of 59.0, significantly outperforming human doctors who sc…
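
The two items above report scores in different notations (0.6272 for MDIA, 59.0 for ChatGPT for Clinicians). A minimal sketch of how the quoted 3.72-point gap can be cross-checked, assuming that 59.0 is the same HealthBench Professional metric expressed on a 0-100 scale (an assumption; the feed does not state this). The function name is hypothetical and used only for illustration.

```python
# Illustrative cross-check only. The helper name and the 0-100 scale
# assumption for the baseline are NOT from the source.
def percentage_point_gap(score_fraction: float, baseline_percent: float) -> float:
    """Gap in percentage points between a 0-1 score and a 0-100 baseline."""
    return round(score_fraction * 100 - baseline_percent, 2)


mdia_score = 0.6272        # MDIA, as reported in the first item (0-1 scale)
clinicians_score = 59.0    # ChatGPT for Clinicians, assumed 0-100 scale

gap = percentage_point_gap(mdia_score, clinicians_score)
print(gap)
```

Under that assumption the computed gap is 3.72, which agrees with the difference stated in the MDIA summary.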