PulseAugur
LIVE 15:27:45
research · [1 source] ·
0
research

MacrOData benchmark suite offers thousands of datasets for tabular outlier detection

Researchers have introduced MacrOData, a new benchmark suite designed to improve the evaluation of outlier detection methods for tabular data. This suite significantly expands upon existing benchmarks like AdBench by including over 2,400 datasets, categorized into real-world semantic anomalies, statistical outliers, and synthetic data. MacrOData aims to provide a more comprehensive and statistically robust platform for assessing various outlier detection techniques, including classical, deep, and foundation models. The benchmark suite and an associated online leaderboard have been made publicly available to support future research. AI

Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →

IMPACT Provides a more robust and diverse evaluation framework for tabular outlier detection models, potentially accelerating progress in the field.

RANK_REASON Introduction of a new, large-scale benchmark suite for tabular outlier detection published on arXiv.

Read on arXiv cs.LG →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 · Xueying Ding, Simon Kl\"uttermann, Haomin Wen, Yilong Chen, Leman Akoglu ·

    MacrOData: New Benchmarks of Thousands of Datasets for Tabular Outlier Detection

    arXiv:2602.09329v2 Announce Type: replace Abstract: Quality benchmarks are essential for fairly and accurately tracking scientific progress and enabling practitioners to make informed methodological choices. Outlier detection (OD) on tabular data underpins numerous real-world app…