Hugging Face has released StarCoder2, a new family of large language models for code generation, trained on a massive dataset called The Stack v2. This dataset comprises over 600 programming languages and includes a significant amount of permissively licensed code. The StarCoder2 models are available in three sizes, with the largest boasting 15 billion parameters, and are designed to advance research and development in AI-powered coding tools. AI
Summary written by gemini-2.5-flash-lite from 1 source. How we write summaries →
RANK_REASON Release of new code-generation models and associated dataset by a prominent AI community platform.