Brief · PulseAugur

RESEARCH · arXiv cs.CL English(EN) · 1d · [2 sources]

ForecastBench-Sim: A Simulated-World Forecasting Benchmark

Researchers have developed ForecastBench-Sim, a new benchmark for evaluating AI forecasting capabilities. This benchmark utilizes rollouts from the strategy game Freeciv to create a simulated environment, overcoming limitations of real-world forecasting such as slow outcome resolution and rarity of tail events. ForecastBench-Sim allows for continuous or binary forecasting questions, conditional queries, and the study of rare outcomes in a controlled setting. AI

IMPACT Provides a controlled environment for studying AI probabilistic reasoning and dynamic world states, complementing real-world forecasting benchmarks.

ForecastBench-Sim
Freeciv
DagsHub
alphaXiv
CORE Recommender
ScienceCast
Hugging Face
Civilization
CatalyzeX Code Finder for Papers
Influence Flower
Gotit.pub
arXiv