Brief · PulseAugur

RESEARCH · Transformers — Releases (SL) · 2d · [2 sources]

Release v5.11.0

The Hugging Face Transformers library has released version 5.12.0, introducing new models like MiniMax-M3-VL, a vision-language model with a CLIP-style vision tower and a sparse Mixture-of-Experts decoder. This update also includes improvements to PP-OCRv6, an efficient OCR system, and Parakeet-RNNT, a fast conformer encoder with an RNN-T decoder. Additionally, version 5.11.0 added DiffusionGemma, an encoder-decoder model for faster text generation, and DeepSeek-V3.2-Exp, which features a novel sparse attention mechanism for long-context efficiency. AI

v5.11.0
NemotronH
FalconMamba
DeepSeek Sparse Attention
Qwen3-VL
Hugging Face
Qwen2-VL
Qwen2.5-VL
DeepSeek-V3.2
Transformers
DiffusionGemma
Zamba2
Parakeet-RNNT
Hugging Face Transformers
DeepSeek-V3.2-Exp
MiniMax-M3
PP-OCRv6
MiniMax-M3-VL
DeepSeek-V3.1-Terminus