Agentic AI系统在多发性骨髓瘤患者的临床推理方面与专家共识相符

作者 PulseAugur 编辑部 · [2 个来源] · 2026-04-27 13:41

一项新研究评估了一个agentic推理系统在多发性骨髓瘤管理中综合纵向临床记录的能力。该系统在与专家共识的一致性方面达到了79.6%，优于标准的检索增强生成（RAG）方法。对于复杂问题和广泛的患者病史，性能提升最为显著，但系统错误比专家分歧具有更大的临床意义。 AI

影响展示了AI在改进复杂患者数据综合方面的潜力，但由于错误严重性，强调了仔细验证的必要性。

排序理由学术论文，详细介绍了对AI系统临床推理能力的追溯性评估。

AI 生成摘要 · Google Gemini · 来自 2 个来源。我们如何撰写摘要 →

报道来源 [2]

arXiv cs.CL TIER_1 English(EN) · Johannes Moll, Jannik L\"ubberstedt, Christoph Nuernbergk, Jacob Stroh, Luisa Mertens, Anna Purcarea, Christopher Zirn, Zeineb Benchaaben, Fabian Drexel, Hartmut H\"antze, Anirudh Narayanan, Friedrich Puttkammer, Andrei Zhukov, Jacqueline Lammert, Sebasti · 2026-04-28 04:00

Agentic clinical reasoning over longitudinal myeloma records: a retrospective evaluation against expert consensus

arXiv:2604.24473v1 Announce Type: cross Abstract: Multiple myeloma is managed through sequential lines of therapy over years to decades, with each decision depending on cumulative disease history distributed across dozens to hundreds of heterogeneous clinical documents. Whether L…
arXiv cs.CL TIER_1 English(EN) · Keno K. Bressem · 2026-04-27 13:41

Agentic clinical reasoning over longitudinal myeloma records: a retrospective evaluation against expert consensus

Multiple myeloma is managed through sequential lines of therapy over years to decades, with each decision depending on cumulative disease history distributed across dozens to hundreds of heterogeneous clinical documents. Whether LLM-based systems can synthesise this evidence at a…