PulseAugur

ScienceAgentBench

PulseAugur coverage of ScienceAgentBench — every cluster mentioning ScienceAgentBench across labs, papers, and developer communities, ranked by signal.

Total · 30d: 2 (2 over 90d)
Releases · 30d: 0 (0 over 90d)
Papers · 30d: 2 (2 over 90d)
RECENT · PAGE 1/1 · 2 TOTAL
  1. RESEARCH · CL_11486

    D3-Gym dataset offers verifiable environments for AI scientific discovery

    Researchers have introduced D3-Gym, a novel dataset designed to create verifiable environments for scientific data-driven discovery tasks. This dataset includes 565 tasks from real scientific repositories, each with ins…

  2. RESEARCH · CL_06279

    DataPRM enhances LLM data analysis by rewarding scientific process

    Researchers have developed DataPRM, a new process reward model designed to improve the performance of AI agents in dynamic data analysis tasks. Unlike previous models that struggled with silent errors and exploratory ac…