PulseAugur
EN
LIVE 08:53:29
ENTITY GTBench

GTBench

PulseAugur coverage of GTBench — every cluster mentioning GTBench across labs, papers, and developer communities, ranked by signal.

Show in brief
Total · 30d
1
1 over 90d
Releases · 30d
0
0 over 90d
Papers · 30d
1
1 over 90d
TIER MIX · 90D
TOPICS
SENTIMENT · 30D

1 day(s) with sentiment data

RECENT · PAGE 1/1 · 1 TOTAL
  1. TOOL · CL_68274 ·

    New GTBench benchmark tests LLMs as math research assistants

    A new benchmark called GTBench has been developed to evaluate the capabilities of large language models as mathematical research assistants, specifically in the field of graph theory. The benchmark features 63 problems …