A new 7-billion parameter language model called DataComp-LM has been released, which is notable for being trained on exclusively open-source data. This model also comes with a new benchmark and dataset designed to facilitate further research and development in the field of open-access AI. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
RANK_REASON Release of a new open-source model, benchmark, and dataset.