ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules

By PulseAugur Editorial · [3 sources] · 2026-04-28 02:56

Two new research papers introduce methods for better evaluating and cleaning tabular foundation models. ScoringBench offers a comprehensive benchmark using proper scoring rules to assess model performance beyond simple point estimates, revealing how different metrics can lead to varied model rankings. Prior-Aligned Data Cleaning, on the other hand, proposes a deep reinforcement learning framework to clean real-world tabular data, addressing issues like missing values and outliers to improve model accuracy and confidence calibration. AI

IMPACT New evaluation and data cleaning techniques could improve the reliability and deployment of tabular foundation models in high-stakes applications.

RANK_REASON The cluster contains two academic papers introducing new benchmarks and methodologies for tabular foundation models.

Read on arXiv cs.AI →

AI-generated summary · Google Gemini · from 3 sources. How we write summaries →

ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules

COVERAGE [3]

arXiv cs.LG TIER_1 English(EN) · Laure Berti-Equille · 2026-04-29 04:00

Prior-Aligned Data Cleaning for Tabular Foundation Models

arXiv:2604.25154v1 Announce Type: new Abstract: Tabular Foundation Models (TFMs) achieve state-of-the-art zero-shot accuracy on small tabular datasets by meta-learning over synthetic data-generating processes -- making them highly attractive for practitioners who cannot afford la…
arXiv cs.AI TIER_1 English(EN) · Jonas Landsgesell, Pascal Knoll, Tizian Wenzel · 2026-04-28 04:00

ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules

arXiv:2603.29928v2 Announce Type: replace Abstract: Tabular foundation models such as TabPFN and TabICL already produce full predictive distributions, yet prevailing regression benchmarks evaluate them almost exclusively via point-estimate metrics (RMSE, $R^2$). This discards pre…
arXiv cs.LG TIER_1 English(EN) · Laure Berti-Equille · 2026-04-28 02:56

Prior-Aligned Data Cleaning for Tabular Foundation Models

Tabular Foundation Models (TFMs) achieve state-of-the-art zero-shot accuracy on small tabular datasets by meta-learning over synthetic data-generating processes -- making them highly attractive for practitioners who cannot afford large annotated corpora. However, their in-context…

COVERAGE [3]

Prior-Aligned Data Cleaning for Tabular Foundation Models

ScoringBench: A Benchmark for Evaluating Tabular Foundation Models with Proper Scoring Rules

Prior-Aligned Data Cleaning for Tabular Foundation Models

RELATED ENTITIES

RELATED TOPICS