OncoTraj: a public benchmark for longitudinal resistance prediction in EGFR-mutant non-small-cell lung cancer on osimertinib
Researchers have introduced OncoTraj, a new public benchmark dataset designed to advance the prediction of drug resistance in EGFR-mutant non-small-cell lung cancer. The dataset comprises longitudinal patient trajectories from 813 individuals treated with osimertinib, harmonized from three clinical-genomic sources. OncoTraj includes three defined tasks for model evaluation: predicting progression at 12 months, estimating time to progression, and classifying resistance mechanisms. Initial evaluations with various baseline models, including transformers, indicate that current single-timepoint snapshot features limit predictive accuracy, suggesting a need for serial ctDNA data in future iterations.
AI IMPACT: Establishes a standardized benchmark for AI models in predicting cancer drug resistance, potentially accelerating clinical applications.
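To make the three tasks concrete, here is a minimal sketch of how each one might be scored. The summary does not say which metrics OncoTraj uses, so the choices below (AUROC for 12-month progression, Harrell's concordance index for time to progression, macro-F1 for resistance mechanism) are assumptions, as are all function names, the class labels, and the toy data.

```python
from itertools import combinations


def auroc(labels, scores):
    """Chance that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def concordance_index(times, events, risks):
    """Harrell's C: share of comparable pairs where higher risk progresses earlier.

    Simplification (assumption): pairs with tied times are skipped.
    """
    concordant, comparable = 0.0, 0
    for i, j in combinations(range(len(times)), 2):
        # Let `a` be the patient with the shorter follow-up time.
        a, b = (i, j) if times[i] < times[j] else (j, i)
        if times[a] == times[b] or not events[a]:
            continue  # tied times, or the earlier patient was censored
        comparable += 1
        if risks[a] > risks[b]:
            concordant += 1.0
        elif risks[a] == risks[b]:
            concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 scores."""
    classes = sorted(set(y_true) | set(y_pred))
    f1_scores = []
    for c in classes:
        tp = sum(t == c and p == c for t, p in zip(y_true, y_pred))
        fp = sum(t != c and p == c for t, p in zip(y_true, y_pred))
        fn = sum(t == c and p != c for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        f1_scores.append(2 * tp / denom if denom else 0.0)
    return sum(f1_scores) / len(f1_scores)


# Toy data only; none of these values come from OncoTraj.
task1 = auroc([1, 0, 1, 0], [0.9, 0.2, 0.4, 0.5])
task2 = concordance_index([5, 10, 15, 20], [1, 1, 0, 1], [0.9, 0.3, 0.5, 0.1])
task3 = macro_f1(
    ["mech_a", "mech_b", "mech_a", "mech_c"],  # hypothetical class labels
    ["mech_a", "mech_b", "mech_c", "mech_c"],
)
print(f"AUROC={task1:.2f}  C-index={task2:.2f}  macro-F1={task3:.2f}")
```

Pure Python is used here so the sketch has no dependencies; a real evaluation would more likely rely on an established metrics library and on the benchmark's own official splits and scoring rules.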