PulseAugur
EN
LIVE 05:02:34

Hugging Face Transformers library adds new models and fixes bugs

Hugging Face's `transformers` library has seen a series of releases and patches, introducing new models and fixing various bugs. Notably, version 5.9.0 added Cohere's Command A+ (Cohere2Moe) and HRM-Text, while also improving audio support and generation capabilities. Earlier releases, such as v5.8.0, integrated models like DeepSeek-V4, Gemma 4 Assistant, GraniteSpeechPlus, Granite4Vision, EXAONE 4.5, and PP-FormulaNet. Several patch releases have addressed specific issues, including problems with DeepSeek V4 integration, flash attention, Qwen MoE models with FP8, and Gemma4 device map support. AI

IMPACT New model integrations and bug fixes in a widely used library accelerate research and development across the AI ecosystem.

RANK_REASON The cluster consists of multiple release notes for the Hugging Face transformers library, detailing new model additions and bug fixes.

Read on Transformers — Releases →

AI-generated summary · Google Gemini · from 10 sources. How we write summaries →

Hugging Face Transformers library adds new models and fixes bugs

COVERAGE [10]

  1. Transformers — Releases TIER_1 (SL) · Cyrilvallez ·

    Release v5.9.0

    <h1>Release v5.9.0</h1> <h2>New Model additions</h2> <h3>Cohere2Moe</h3> <p>Command A+ is a Mixture-of-Experts (MoE) language model from Cohere that features a hybrid attention pattern combining sliding window and full attention layers. The model incorporates both shared and rout…

  2. Transformers — Releases TIER_1 English(EN) · ArthurZucker ·

    Patch release v5.8.1

    <h1>Patch release v5.8.1</h1> <p>This release is mainly to fix the Deepseek V4 integration!!!</p> <a href="https://private-user-images.githubusercontent.com/48595927/591488772-0d85e891-a0ff-436e-a9d4-b6633096f2b5.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY2…

  3. Transformers — Releases TIER_1 English(EN) · vasqu ·

    Release 5.8.0

    <h1>Release v5.8.0</h1> <h2>New Model additions</h2> <h3>DeepSeek-V4</h3> <a href="https://private-user-images.githubusercontent.com/73884904/587809551-4c0fdb29-f770-463c-a97b-d24438896a4c.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdG…

  4. Transformers — Releases TIER_1 (SL) · vasqu ·

    Release v5.7.0

    <h1>Release v5.7.0</h1> <h2>New Model additions</h2> <h3>Laguna</h3> <a href="https://private-user-images.githubusercontent.com/73884904/584970471-d3bae269-bea7-4ddf-a53f-d4718befdb17.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnV…

  5. Transformers — Releases TIER_1 English(EN) · vasqu ·

    Patch release v5.6.2

    <h1>Patch release v5.6.2</h1> <p>Qwen 3.5 and 3.6 MoE (text-only) were broken when using with FP8. It should now work again with this 🫡</p> <ul> <li>Fix configuration reading and error handling for kernels (<a class="issue-link js-issue-link" href="https://github.com/huggingface/…

  6. Transformers — Releases TIER_1 English(EN) · ArthurZucker ·

    Patch release v5.6.1

    <h1>Patch release v5.6.1</h1> <p>Flash attention path was broken! Sorry everyone for this one 🤗</p> <ul> <li>Fix AttributeError on s_aux=None in flash_attention_forward (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/pull/45589">#45589</a>) …

  7. Transformers — Releases TIER_1 (SL) · vasqu ·

    Release v5.6.0

    <h1>Release v5.6.0</h1> <h2>New Model additions</h2> <h3>OpenAI Privacy Filter</h3> <p>OpenAI Privacy Filter is a bidirectional token-classification model for personally identifiable information (PII) detection and masking in text. It is intended for high-throughput data sanitiza…

  8. Transformers — Releases TIER_1 English(EN) · ArthurZucker ·

    Patch release v5.5.4

    <h1>Patch release v5.5.4</h1> <p>This is mostly some fixes that are good to have asap, mostly for tokenizers;<br /> ** Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/iss…

  9. Transformers — Releases TIER_1 English(EN) · Cyrilvallez ·

    Patch release: v5.5.3

    <p>Small patch release to fix <code>device_map</code> support for Gemma4! It contains the following commit:</p> <ul> <li>[gemma4] Fix device map auto (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/pull/45347">#45347</a>) by <a class="user-m…

  10. Transformers — Releases TIER_1 English(EN) · Cyrilvallez ·

    Patch release: v5.5.2

    <p>Small patch dedicated to optimizing gemma4, fixing inference with <code>use_cache=False</code> due to k/v states sharing between layers, as well as conversion mappings for some models that would inconsistently serialize their weight names. It contains the following PRs:</p> <u…