新的AssayBench基准测试LLM预测细胞表型能力

作者 PulseAugur 编辑部 · [1 个来源] · 2026-05-11 17:27

研究人员推出了AssayBench，这是一个新的基准，旨在评估大型语言模型（LLM）和智能体在预测细胞表型方面的能力。该基准建立在1920个CRISPR筛选的基础上，专注于预测细胞扰动的效果，这是一项对药物发现至关重要的任务。评估显示，当前的LLM，特别是通用模型，在性能上显著优于生物学特定模型和可训练基线模型，并且通过优化技术还有进一步改进的空间。 AI

影响为评估AI在生物学发现和药物开发中的潜力提供了一种标准化方法。

排序理由该集群包含一篇介绍用于评估AI模型基准的新学术论文。[lever_c_demoted from research: ic=1 ai=1.0]

在 arXiv cs.AI 阅读 →

AI 生成摘要 · Google Gemini · 来自 1 个来源。我们如何撰写摘要 →

报道来源 [1]

arXiv cs.AI TIER_1 English(EN) · Gabriele Scalia · 2026-05-11 17:27

AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

Recent advances in machine learning and large-scale biological data collections have revived the prospect of building a virtual cell, a computational model of cellular behavior that could accelerate biological discovery. One of the most compelling promises of this vision is the a…

报道来源 [1]

AssayBench: An Assay-Level Virtual Cell Benchmark for LLMs and Agents

相关实体

相关话题