ENTITY
ScienceAgentBench
ScienceAgentBench
PulseAugur coverage of ScienceAgentBench — every cluster mentioning ScienceAgentBench across labs, papers, and developer communities, ranked by signal.
Total · 30d
2
2 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
2
2 over 90d
TIER MIX · 90D
RECENT · PAGE 1/1 · 2 TOTAL
-
D3-Gym dataset offers verifiable environments for AI scientific discovery
Researchers have introduced D3-Gym, a novel dataset designed to create verifiable environments for scientific data-driven discovery tasks. This dataset includes 565 tasks from real scientific repositories, each with ins…
-
DataPRM enhances LLM data analysis by rewarding scientific process
Researchers have developed DataPRM, a new process reward model designed to improve the performance of AI agents in dynamic data analysis tasks. Unlike previous models that struggled with silent errors and exploratory ac…