PulseAugur
EN
LIVE 15:04:09

JetBrains releases Mellum2 reasoning model with 131K context

JetBrains has released its Mellum2 model family, including the Mellum2-12B-A2.5B-Thinking variant, which is designed for complex reasoning tasks. This model utilizes a Mixture-of-Experts architecture with a large context window of 131,072 tokens. The release provides detailed instructions for integrating the model with various libraries and tools, such as Transformers, vLLM, and SGLang. AI

IMPACT Enables developers to integrate advanced reasoning capabilities into applications via an open-source model.

RANK_REASON The cluster describes the release of an open-source model from a company, with detailed usage instructions, fitting the research category.

Read on Hugging Face Trending Models →

AI-generated summary · Google Gemini · from 2 sources. How we write summaries →

JetBrains releases Mellum2 reasoning model with 131K context

COVERAGE [2]

  1. Hugging Face Trending Models TIER_1 English(EN) · JetBrains ·

    JetBrains/Mellum2-12B-A2.5B-Thinking

    text-generation · 80 downloads · 54 likes

  2. r/LocalLLaMA TIER_1 English(EN) · /u/DeltaSqueezer ·

    JetBrains open-sources Mellum2 - anyone tried these?

    <table> <tr><td> <a href="https://www.reddit.com/r/LocalLLaMA/comments/1tukilx/jetbrains_opensources_mellum2_anyone_tried_these/"> <img alt="JetBrains open-sources Mellum2 - anyone tried these?" src="https://external-preview.redd.it/mCzuTq8n7xvCy4rmMbMCcp0ElWqSR8knfaLcaG2VOdU.jpe…