PulseAugur
实时 06:17:35
English(EN) Patch release: v5.5.2

Hugging Face Transformers 库新增模型并修复错误

Hugging Face 的 `transformers` 库发布了一系列版本和补丁,引入了新模型并修复了各种错误。值得注意的是,5.9.0 版本添加了 Cohere 的 Command A+ (Cohere2Moe) 和 HRM-Text,同时改进了音频支持和生成能力。早期版本,如 v5.8.0,集成了 DeepSeek-V4Gemma 4 AssistantGraniteSpeechPlusGranite4VisionEXAONE 4.5PP-FormulaNet 等模型。几个补丁版本解决了特定问题,包括 DeepSeek V4 集成、flash attention、FP8 的 Qwen MoE 模型以及 Gemma4 设备映射支持等方面的问题。 AI

影响 广泛使用的库中新增模型集成和错误修复,加速了整个 AI 生态系统的研究和开发。

排序理由 该集群包含 Hugging Face transformers 库的多个发布说明,详细介绍了新增模型和错误修复。

在 Transformers — Releases 阅读 →

AI 生成摘要 · Google Gemini · 来自 10 个来源。 我们如何撰写摘要 →

Hugging Face Transformers 库新增模型并修复错误

报道来源 [10]

  1. Transformers — Releases TIER_1 (SL) · Cyrilvallez ·

    Release v5.9.0

    <h1>Release v5.9.0</h1> <h2>New Model additions</h2> <h3>Cohere2Moe</h3> <p>Command A+ is a Mixture-of-Experts (MoE) language model from Cohere that features a hybrid attention pattern combining sliding window and full attention layers. The model incorporates both shared and rout…

  2. Transformers — Releases TIER_1 English(EN) · ArthurZucker ·

    Patch release v5.8.1

    <h1>Patch release v5.8.1</h1> <p>This release is mainly to fix the Deepseek V4 integration!!!</p> <a href="https://private-user-images.githubusercontent.com/48595927/591488772-0d85e891-a0ff-436e-a9d4-b6633096f2b5.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY2…

  3. Transformers — Releases TIER_1 English(EN) · vasqu ·

    Release 5.8.0

    <h1>Release v5.8.0</h1> <h2>New Model additions</h2> <h3>DeepSeek-V4</h3> <a href="https://private-user-images.githubusercontent.com/73884904/587809551-4c0fdb29-f770-463c-a97b-d24438896a4c.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdG…

  4. Transformers — Releases TIER_1 (SL) · vasqu ·

    Release v5.7.0

    <h1>Release v5.7.0</h1> <h2>New Model additions</h2> <h3>Laguna</h3> <a href="https://private-user-images.githubusercontent.com/73884904/584970471-d3bae269-bea7-4ddf-a53f-d4718befdb17.png?jwt=eyJ0eXAiOiJKV1QiLCJhbGciOiJIUzI1NiJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnV…

  5. Transformers — Releases TIER_1 English(EN) · vasqu ·

    Patch release v5.6.2

    <h1>Patch release v5.6.2</h1> <p>Qwen 3.5 and 3.6 MoE (text-only) were broken when using with FP8. It should now work again with this 🫡</p> <ul> <li>Fix configuration reading and error handling for kernels (<a class="issue-link js-issue-link" href="https://github.com/huggingface/…

  6. Transformers — Releases TIER_1 English(EN) · ArthurZucker ·

    Patch release v5.6.1

    <h1>Patch release v5.6.1</h1> <p>Flash attention path was broken! Sorry everyone for this one 🤗</p> <ul> <li>Fix AttributeError on s_aux=None in flash_attention_forward (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/pull/45589">#45589</a>) …

  7. Transformers — Releases TIER_1 (SL) · vasqu ·

    Release v5.6.0

    <h1>Release v5.6.0</h1> <h2>New Model additions</h2> <h3>OpenAI Privacy Filter</h3> <p>OpenAI Privacy Filter is a bidirectional token-classification model for personally identifiable information (PII) detection and masking in text. It is intended for high-throughput data sanitiza…

  8. Transformers — Releases TIER_1 English(EN) · ArthurZucker ·

    Patch release v5.5.4

    <h1>Patch release v5.5.4</h1> <p>This is mostly some fixes that are good to have asap, mostly for tokenizers;<br /> ** Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/iss…

  9. Transformers — Releases TIER_1 English(EN) · Cyrilvallez ·

    Patch release: v5.5.3

    <p>Small patch release to fix <code>device_map</code> support for Gemma4! It contains the following commit:</p> <ul> <li>[gemma4] Fix device map auto (<a class="issue-link js-issue-link" href="https://github.com/huggingface/transformers/pull/45347">#45347</a>) by <a class="user-m…

  10. Transformers — Releases TIER_1 English(EN) · Cyrilvallez ·

    Patch release: v5.5.2

    <p>Small patch dedicated to optimizing gemma4, fixing inference with <code>use_cache=False</code> due to k/v states sharing between layers, as well as conversion mappings for some models that would inconsistently serialize their weight names. It contains the following PRs:</p> <u…