PulseAugur
EN
LIVE 10:08:26

New RIDE dataset standardizes train delay prediction benchmark

Researchers have introduced RIDE, a comprehensive dataset and benchmark designed to standardize train delay prediction. This nationwide dataset, covering the Belgian railway network from 2023 to 2025, includes 94.5 million train events and 35.7 million weather records. The benchmark facilitates direct comparison of various prediction models, revealing that graph neural networks currently achieve the best performance, outperforming traditional statistical and non-learning methods. AI

IMPACT Standardizes evaluation for train delay prediction models, potentially accelerating adoption of advanced machine learning techniques.

RANK_REASON The cluster contains an academic paper introducing a new dataset and benchmark for a specific problem. [lever_c_demoted from research: ic=1 ai=0.7]

Read on arXiv cs.LG →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. arXiv cs.LG TIER_1 English(EN) · Cl\'ement Elliker, Mathis Le Bail, Cl\'ement Mantoux, Jesse Read, Sonia Vanier ·

    RIDE: An Open Dataset and Benchmark for Train Delay Prediction

    arXiv:2606.05070v1 Announce Type: new Abstract: Train delay prediction is an important problem for both passengers and railway operators, yet progress in the field remains difficult to assess due to the lack of standardized datasets, prediction targets, and evaluation protocols. …