实体
S^2-Bench
S^2-Bench
PulseAugur coverage of S^2-Bench — every cluster mentioning S^2-Bench across labs, papers, and developer communities, ranked by signal.
总计 · 30天
2
90 天内 2
发布 · 30天
0
90 天内 0
论文 · 30天
2
90 天内 2
层级分布 · 90 天
情绪 · 30 天
1 天有情绪数据
最近 · 第 1/1 页 · 共 2 条
-
LLMs evaluated for advanced chemistry tasks with new benchmarks
Researchers have developed new benchmarks and methods to evaluate and enhance Large Language Models (LLMs) for chemistry-related tasks. One approach, Speak-to-Structure (S^2-Bench), focuses on open-domain molecule gener…
-
MolViBench benchmark evaluates LLMs on molecular coding tasks for drug discovery
Researchers have introduced MolViBench, a novel benchmark designed to evaluate the capabilities of large language models (LLMs) in molecular coding tasks. This benchmark addresses the gap left by existing evaluations, w…