Hugging Face/Transformers

Transformers

Sun

Mon

Tue

Wed

Thu

Fri

Sat

JulAugSepOctNovDecJanFebMarAprMayJunJul

Less

Releases19Avg6/moVersionsv5.5.1v5.13.1

Highlights All Releases

Jul 11, 2026

Custom models no longer break on new linear layer type names

↗

v5.13.1

Fixes custom model compatibility with the latest vllm release by being more defensive with remap_legacy_layer_types and handling cases where custom code doesn't know about the new linear layer type names. Also fixed a key type assertion in _LazyAutoMapping.register.

Jul 3, 2026

Seven new model architectures: Kimi 2.5–2.7, MiMo-V2-Flash, Nemotron ASR, Qwen3 ASR, ZAYA, VideoPrism, RADIO

↗

v5.13.0

This release adds support for seven new model architectures: Kimi 2.5–2.7 (multimodal agentic coding), MiMo-V2-Flash (256K context MoE model), Nemotron 3.5 ASR and Nemotron ASR Streaming (multilingual speech recognition with configurable latency-accuracy tradeoffs), Qwen3 ASR with forced aligner, ZAYA1 (MoE language model), VideoPrism (video understanding encoder), and RADIO (vision foundation model family).

Jun 15, 2026

Mistral tokenizer resolution fixed

↗

v5.12.1

Fixed mistral tokenizer resolution when mistral-common is installed and updated the lower bound for PEFT. This is similar to v5.10.3 minus fixes already in the main release.

Several regressions and processing issues fixed

↗

v5.10.3

This patch release fixes several regressions introduced by previous changes, including issues with {image/video/audio}_token_ids in ProcessorMixin, InternVL models, and offsets in processing. It also addresses a regression in the Mistral common backend and updates the peft lower bound.

Jun 12, 2026

New models: MiniMax-M3-VL, PP-OCRv6, Parakeet-RNNT

↗

v5.12.0

This release introduces the MiniMax-M3-VL vision-language model, the PP-OCRv6 OCR system, and the Parakeet-RNNT model for speech processing. Several bug fixes and improvements were also made, including changes to CI, stop string matching, and model documentation.

Jun 10, 2026

DiffusionGemma and DeepSeek-V3.2 models added

↗

v5.11.0

New models DiffusionGemma and DeepSeek-V3.2 have been added, featuring optimizations for inference speed and efficient long-context handling. The Kernels API was extended for module fusion and parameter transformation, with added support for fp8/fp4 Triton kernels. Model parallel beam search bugs in Qwen2-VL model families were fixed.

Jun 4, 2026

CLIP model conversion bug fixed

↗

v5.10.2

Fixed a conversion bug for CLIP models that affected downstream models like SAM3.

Jun 3, 2026

Gemma4 Unified; Sapiens2; model parallelism hardened

↗

v5.10.1

Added Gemma4 12B Unified, an encoder-free multimodal model that projects raw vision and audio inputs directly into language model space; Sapiens2, a vision transformer family for human-centric tasks; DeepSeek-OCR-2 for document understanding; and Mellum, a code-focused mixture-of-experts model. Fixed numerous model parallelism bugs across tensor and expert parallelism, beam search under parallel settings, and loss over-counting; also fixed encoder-decoder cache initialization regression and BitsAndBytes quantization tensor-dropping bug.

May 20, 2026

Three new models added; SAM3 text embeddings API changes; generation bugs fixed

↗

v5.9.0

Added support for Cohere2Moe (a Mixture-of-Experts model with sliding window and full attention), HRM-Text (hierarchical reasoning model with two transformer stacks), and Parakeet tdt speech model. SAM3, EdgeTAM, and SAM3-Lite-Text now expect full text embeddings instead of pooler outputs, requiring input updates. Fixed generation issues including inputs_embeds handling for Gemma4, an AttributeError in RAG's generate() caused by missing config fields, memory leaks from lru decorators in vision models, and improved audio/vision encoder compilability.

May 13, 2026

Deepseek V4 integration fixed

↗

v5.8.1

Fixed Deepseek V4 integration issues including CSA mask collapse and WeightConverter regex incorrectly matching shared_experts as experts. Also added fatal_error to ContinuousBatchingManager for serving operations.

May 5, 2026

Release 5.8.0

↗

v5.8.0

Release v5.8.0

New Model additions

DeepSeek-V4

DeepSeek-V4 is the next-generation MoE (Mixture of Experts) language model…

Apr 28, 2026

Release v5.7.0

↗

v5.7.0

New Model additions

Laguna

Laguna is Poolside's mixture-of-experts language model family that extends standard SwiGLU MoE…

Apr 23, 2026

Patch release v5.6.2

↗

v5.6.2

Qwen 3.5 and 3.6 MoE (text-only) were broken when using with FP8. It should now work again with this 🫡

Fix configuration reading and error handling for kernels (https://github.com/huggingface/transformers/pull/45610) by @hmellor

Full Changelog:…

Patch release v5.6.1

↗

v5.6.1

Flash attention path was broken! Sorry everyone for this one 🤗

Fix AttributeError on s_aux=None in flash_attention_forward (https://github.com/huggingface/transformers/pull/45589) by @jamesbraza

Apr 22, 2026

Release v5.6.0

↗

v5.6.0

New Model additions

OpenAI Privacy Filter

OpenAI Privacy Filter is a bidirectional token-classification model for personally identifiable information (PII) detection and masking in text. It is intended for high-throughput data sanitization workflows where teams need…

Apr 13, 2026

Patch release v5.5.4

↗

v5.5.4

This is mostly some fixes that are good to have asap, mostly for tokenizers; ** Fix Kimi-K2.5 tokenizer regression and _patch_mistral_regex Attribute… (#45305) by ArthurZucker

For training: ** Fix #45305 + add regression test GAS (#45349) by florian6973, SunMarc ** Fix…

Apr 9, 2026

Patch release: v5.5.3

↗

v5.5.3

Small patch release to fix device_map support for Gemma4! It contains the following commit:

[gemma4] Fix device map auto (#45347) by @Cyrilvallez

Patch release: v5.5.2

↗

v5.5.2

Small patch dedicated to optimizing gemma4, fixing inference with use_cache=False due to k/v states sharing between layers, as well as conversion mappings for some models that would inconsistently serialize their weight names. It contains the following PRs:

Add MoE to…

Patch release v5.5.1

↗

v5.5.1

This patch is very small and focuses on vLLM and Gemma4!

** Fix export for gemma4 and add Integration tests (#45285) by @Cyrilvallez ** Fix vllm cis (#45139) by @ArthurZucker

Apr 2, 2026

Release v5.5.0

↗

v5.5.0

New Model additions

Gemma4

Gemma 4 is a multimodal model with pretrained and instruction-tuned variants,…

From other products

Last Checked

2h ago

Latest

v5.13.1

Source

@huggingface/transformers

Tracking since Apr 23, 2024

.json·.md·.atom

Transformers

Custom models no longer break on new linear layer type names

Seven new model architectures: Kimi 2.5–2.7, MiMo-V2-Flash, Nemotron ASR, Qwen3 ASR, ZAYA, VideoPrism, RADIO

Mistral tokenizer resolution fixed

Several regressions and processing issues fixed

New models: MiniMax-M3-VL, PP-OCRv6, Parakeet-RNNT

DiffusionGemma and DeepSeek-V3.2 models added

CLIP model conversion bug fixed

Gemma4 Unified; Sapiens2; model parallelism hardened

Three new models added; SAM3 text embeddings API changes; generation bugs fixed

Deepseek V4 integration fixed

Release 5.8.0

Release v5.8.0

New Model additions

DeepSeek-V4

Release v5.7.0

New Model additions

Laguna

Patch release v5.6.2

Patch release v5.6.1

Release v5.6.0

New Model additions

OpenAI Privacy Filter

Patch release v5.5.4

Patch release: v5.5.3

Patch release: v5.5.2

Patch release v5.5.1

Release v5.5.0

New Model additions

Gemma4

More from Hugging Face

From other products

More from Hugging Face

From other products