PulseAugur
实时 16:23:57
实体 HealthBench Professional

HealthBench Professional

PulseAugur coverage of HealthBench Professional — every cluster mentioning HealthBench Professional across labs, papers, and developer communities, ranked by signal.

Show in brief
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
1
90 天内 1
层级分布 · 90 天
情绪 · 30 天

1 天有情绪数据

最近 · 第 1/1 页 · 共 2 条
  1. TOOL · CL_50854 ·

    MDIA 智能体在 HealthBench Professional 基准测试中取得高分

    研究人员开发了 MDIA(多智能体诊断智能体),它利用一个 7 节点临床推理图在 HealthBench Professional 基准测试中取得了优异的性能。当使用 OpenAI 的 GPT-5.4-2026-03-05 进行评估时,MDIA 得分为 0.6272,比 ChatGPT for Clinicians 高出 3.72 个百分点。研究表明,包括专科路由和上下文保留在内的架构设计,而非仅仅提示工程,对智能体的性能有显著影响。…

  2. SIGNIFICANT · CL_03755 ·

    OpenAI's ChatGPT for Clinicians outperforms doctors in tests, speeds up medical AI strategy

    OpenAI has launched ChatGPT for Clinicians, a specialized AI platform designed to assist medical professionals. In clinical trials, the platform achieved a score of 59.0, significantly outperforming human doctors who sc…