PulseAugur
EN
LIVE 14:18:41

JetBrains releases Mellum 2 coding-focused MoE model

JetBrains has released Mellum 2, a new Mixture-of-Experts model designed for coding tasks. The model claims to match Qwen 3.5 9B in coding performance, though its general reasoning capabilities are reportedly weaker than Qwen 3.5 4B. The models are available on Hugging Face, with a technical report detailing their performance. AI

IMPACT Provides a new open-source model option for coding tasks, potentially improving developer productivity.

RANK_REASON This is a release of a new model from a company, accompanied by a technical report and benchmark claims, fitting the research bucket. [lever_c_demoted from research: ic=1 ai=1.0]

Read on r/LocalLLaMA →

AI-generated summary · Google Gemini · from 1 sources. How we write summaries →

COVERAGE [1]

  1. r/LocalLLaMA TIER_1 Deutsch(DE) · /u/Middle_Bullfrog_6173 ·

    Mellum 2 12B A2.5B

    <!-- SC_OFF --><div class="md"><p>Coding focused small MoE from JetBrains. They claim coding performance around Qwen 3.5 9B for the reasoning model. Worse than Qwen 3.5 4B in in everything else.</p> <p>Models: <a href="https://huggingface.co/collections/JetBrains/mellum-2">https:…