English(EN) Patch release: v5.5.2

Hugging Face Transformers 库新增模型并修复错误

作者 PulseAugur 编辑部 · [10 个来源] · 2026-04-09 14:05

Hugging Face 的 `transformers` 库发布了一系列版本和补丁，引入了新模型并修复了各种错误。值得注意的是，5.9.0 版本添加了 Cohere 的 Command A+ (Cohere2Moe) 和 HRM-Text，同时改进了音频支持和生成能力。早期版本，如 v5.8.0，集成了 DeepSeek-V4、Gemma 4 Assistant、GraniteSpeechPlus、Granite4Vision、EXAONE 4.5 和 PP-FormulaNet 等模型。几个补丁版本解决了特定问题，包括 DeepSeek V4 集成、flash attention、FP8 的 Qwen MoE 模型以及 Gemma4 设备映射支持等方面的问题。 AI

影响广泛使用的库中新增模型集成和错误修复，加速了整个 AI 生态系统的研究和开发。

排序理由该集群包含 Hugging Face transformers 库的多个发布说明，详细介绍了新增模型和错误修复。

在 Transformers — Releases 阅读 →

AI 生成摘要 · Google Gemini · 来自 10 个来源。我们如何撰写摘要 →

报道来源 [10]

Transformers — Releases TIER_1 (SL) · Cyrilvallez · 2026-05-20 14:12

发布 v5.9.0

<h1>Release v5.9.0</h1> <h2>New Model additions</h2> <h3>Cohere2Moe</h3> <p>Command A+ is a Mixture-of-Experts (MoE) language model from Cohere that features a hybrid attention pattern combining sliding window and full attention layers. The model incorporates both shared and rout…
Transformers — Releases TIER_1 English(EN) · ArthurZucker · 2026-05-13 03:21

补丁发布 v5.8.1

<h1>Patch release v5.8.1</h1> <p>This release is mainly to fix the Deepseek V4 integration!!!</p> <a href="https://private-user-images.githubusercontent.com/48595927/591488772-0d85e891-a0ff-436e-a9d4-b6633096f2b5.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY2…
Transformers — Releases TIER_1 English(EN) · vasqu · 2026-05-05 16:52

发布 5.8.0

<h1>Release v5.8.0</h1> <h2>New Model additions</h2> <h3>DeepSeek-V4</h3> <a href="https://private-user-images.githubusercontent.com/73884904/587809551-4c0fdb29-f770-463c-a97b-d24438896a4c.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdG…
Transformers — Releases TIER_1 (SL) · vasqu · 2026-04-28 18:32

发布 v5.7.0

<h1>Release v5.7.0</h1> <h2>New Model additions</h2> <h3>Laguna</h3> <a href="https://private-user-images.githubusercontent.com/73884904/584970471-d3bae269-bea7-4ddf-a53f-d4718befdb17.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnV…
Transformers — Releases TIER_1 English(EN) · vasqu · 2026-04-23 18:39

补丁发布 v5.6.2

<h1>Patch release v5.6.2</h1> <p>Qwen 3.5 and 3.6 MoE (text-only) were broken when using with FP8. It should now work again with this 🫡</p> <ul> <li>Fix configuration reading and error handling for kernels (<a class="issue-link js-issue-link" href="https://github.com/huggingface/…
Transformers — Releases TIER_1 English(EN) · ArthurZucker · 2026-04-23 08:20

补丁发布 v5.6.1

<h1>Patch release v5.6.1</h1> <p>Flash attention path was broken! Sorry everyone for this one 🤗</p> <ul> <li>Fix AttributeError on s_aux=None in flash_attention_forward (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/pull/45589">#45589</a>) …
Transformers — Releases TIER_1 (SL) · vasqu · 2026-04-22 15:52

发布 v5.6.0

<h1>Release v5.6.0</h1> <h2>New Model additions</h2> <h3>OpenAI Privacy Filter</h3> <p>OpenAI Privacy Filter is a bidirectional token-classification model for personally identifiable information (PII) detection and masking in text. It is intended for high-throughput data sanitiza…
Transformers — Releases TIER_1 English(EN) · ArthurZucker · 2026-04-13 16:58

补丁发布 v5.5.4

<h1>Patch release v5.5.4</h1> <p>This is mostly some fixes that are good to have asap, mostly for tokenizers;<br /> ** Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/iss…
Transformers — Releases TIER_1 English(EN) · Cyrilvallez · 2026-04-09 15:53

补丁发布：v5.5.3

<p>Small patch release to fix <code>device_map</code> support for Gemma4! It contains the following commit:</p> <ul> <li>[gemma4] Fix device map auto (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/pull/45347">#45347</a>) by <a class="user-m…
Transformers — Releases TIER_1 English(EN) · Cyrilvallez · 2026-04-09 14:05

补丁发布：v5.5.2

<p>Small patch dedicated to optimizing gemma4, fixing inference with <code>use_cache=False</code> due to k/v states sharing between layers, as well as conversion mappings for some models that would inconsistently serialize their weight names. It contains the following PRs:</p> <u…

报道来源 [10]

相关实体

相关话题