Modular Diffusers introduces a new way to build diffusion pipelines by composing reusable blocks. Instead of writing entire pipelines from scratch, you can now mix and match building blocks to create custom workflows tailored to your specific needs! This complements the existing DiffusionPipeline class, providing a more flexible way to create custom diffusion pipelines.
Find more details on how to get started with Modular Diffusers here, and also check out the announcement post.
- @apply_lora_scale decorator for simplifying model definitions (#12994)
- device_map (#12811)

A lot of the above features/improvements came as part of the MVP program we have been running. Immense thanks to the contributors!
- T5Tokenizer for Transformers v5.0+ compatibility (#12877)
- num_videos_per_prompt > 1 and CFG (#13121)
- txt_seq_lens handling (#12702)
- prefix_token_len bug (#12845)
- is_fsdp determination (#12960)
- get_image_features API (#13052)
- aiter availability check (#13059)
- prompt and prior_token_ids simultaneously in GlmImagePipeline (#13092)
- OvisImagePipeline in AUTO_TEXT2IMAGE_PIPELINES_MAPPING by @alvarobartt in #12876
- T5Tokenizer instead of MT5Tokenizer (removed in Transformers v5.0+) by @alvarobartt in #12877
- AutoencoderMixin by @sayakpaul in #12873
- enable_auto_cpu_offload by @sayakpaul in #12578
- is_fsdp is determined by @sayakpaul in #12960
- PeftLoraLoaderMixinTests to Enable/Disable Text Encoder LoRA Tests by @dg845 in #12962
- disable_mmap in pipeline from_pretrained by @hlky in #12854
- ChromaInpaintPipeline by @hameerabbasi in #12848
- *pooled_* mentions from Chroma inpaint by @hameerabbasi in #13026
- from_single_file method for WanAnimateTransformer3DModel by @samadwar in #12691
- get_image_features API by @JaredforReal in #13052
- ModularPipeline.blocks attribute by @yiyixuxu in #13014
- encode_video by Accepting More Input Types by @dg845 in #13057
- prompt and prior_token_ids to be provided simultaneously in GlmImagePipeline by @JaredforReal in #13092
- num_videos_per_prompt > 1 and CFG is Enabled by @dg845 in #13121
- setuptools pkg_resources Errors by @dg845 in #13129
- setuptools pkg_resources Bug for PR GPU Tests by @dg845 in #13132
- typing exports where possible by @sayakpaul in #12524
- ModularPipeline by @yiyixuxu in #13100
- setuptools CI Fix as the Failing Pipelines are Deprecated by @dg845 in #13149
- ftfy import for PRX Pipeline by @dg845 in #13154
- from_config with custom code by @DN6 in #13123
- typing Import Error by @dg845 in #13178
- transformers v5 by @sayakpaul in #12976
- do_classifier_free_guidance threshold in ZImagePipeline by @kirillsst in #13183
- auto_map to model config by @DN6 in #13186
- do_classifier_free_guidance thresholds by @asomoza in #13212

The following contributors have made significant changes to the library over the last release:
- ModularPipeline.blocks attribute (#13014)
- ModularPipeline (#13100)
- AutoencoderMixin (#12873)
- enable_auto_cpu_offload (#12578)
- is_fsdp is determined (#12960)
- typing exports where possible (#12524)
- transformers v5 (#12976)
- from_config with custom code (#13123)
- auto_map to model config (#13186)
- disable_mmap in pipeline from_pretrained (#12854)
- PeftLoraLoaderMixinTests to Enable/Disable Text Encoder LoRA Tests (#12962)
- encode_video by Accepting More Input Types (#13057)
- num_videos_per_prompt > 1 and CFG is Enabled (#13121)
- setuptools pkg_resources Errors (#13129)
- setuptools pkg_resources Bug for PR GPU Tests (#13132)
- setuptools CI Fix as the Failing Pipelines are Deprecated (#13149)
- ftfy import for PRX Pipeline (#13154)
- typing Import Error (#13178)
- ChromaInpaintPipeline (#12848)
- *pooled_* mentions from Chroma inpaint (#13026)
- get_image_features API (#13052)
- prompt and prior_token_ids to be provided simultaneously in GlmImagePipeline (#13092)

The release features a number of new image and video pipelines, a new caching method, a new training script, new kernels-powered attention backends, and more. It is quite packed with a lot of new stuff, so make sure you read the release notes fully 🚀
kernels-powered attention backends

The kernels library helps you save a lot of time by providing pre-built kernel interfaces for various environments and accelerators. This release features three new kernels-powered attention backends:
If any of the above backends is supported by your development environment, you can skip the manual process of building the corresponding kernels and just use:
# Make sure you have `kernels` installed: `pip install kernels`.
# You can choose `flash_hub` or `sage_hub`, too.
pipe.transformer.set_attention_backend("_flash_3_hub")
For more details, check out the documentation.
TaylorSeer is now supported in Diffusers, delivering up to 3x speedups with little to no quality compromise. Thanks to @toilaluan for contributing this in https://github.com/huggingface/diffusers/pull/12648. Check out the documentation here.
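At its core, TaylorSeer treats a module's output as a smooth function of the timestep and extrapolates it with a truncated Taylor series instead of recomputing it at every step. A minimal first-order sketch of that idea (conceptual only, not the actual Diffusers implementation):

```python
def taylor_extrapolate(value, derivative, dt):
    """First-order Taylor prediction: f(t + dt) ≈ f(t) + f'(t) * dt."""
    return value + derivative * dt

# If a block's output was 1.0 and has been changing by 0.1 per step,
# predict the next step's output instead of recomputing the block.
predicted = taylor_extrapolate(1.0, 0.1, 1.0)
```

Higher-order terms tighten the approximation, and cached values are refreshed periodically, which is where the speed/quality trade-off comes from.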
Our Flux.2 integration features a LoRA fine-tuning script that you can check out here. We provide a number of optimizations to help make it run on consumer GPUs.
AttentionMixin: Making certain compatible models subclass from the AttentionMixin class helped us get rid of 2K LoC. Going forward, users can expect more such refactorings that will help make the library leaner and simpler. Check out https://github.com/huggingface/diffusers/pull/12463 for more details.

- VAETesterMixin to consolidate tests for slicing and tiling by @sayakpaul in #12374
- AutoencoderMixin to abstract common methods by @sayakpaul in #12473
- upper() by @sayakpaul in #12479
- lodestones/Chroma1-HD by @josephrocca in #12508
- local_dir by @DN6 in #12381
- testing_utils.py by @DN6 in #12621
- test_save_load_float16 by @kaixuanliu in #12500
- SanaImageToVideoPipeline support by @lawrence-cj in #12634
- AutoencoderKLWan's dim_mult default value back to list by @dg845 in #12640
- kernels by @sayakpaul in #12439
- record_stream in group offloading is not working properly by @KimbingNg in #12721
- AttentionMixin for compatible classes by @sayakpaul in #12463
- upcast_vae in SDXL based pipelines by @DN6 in #12619
- from_single_file by @hlky in #12756

The following contributors have made significant changes to the library over the last release:
- AutoencoderKLWan's dim_mult default value back to list (#12640)
- local_dir (#12381)
- testing_utils.py (#12621)
- upcast_vae in SDXL based pipelines (#12619)
- SanaImageToVideoPipeline support (#12634)

Thanks to @naykun for the following PRs that improve Qwen-Image Edit:
This release comes packed with new image generation and editing pipelines, a new video pipeline, new training scripts, quality-of-life improvements, and much more. Read the rest of the release notes fully to not miss out on the fun stuff.
We welcomed new pipelines in this release:
This update to Wan provides significant improvements in video fidelity, prompt adherence, and style. Please check out the official doc to learn more.
Flux-Kontext is a 12-billion-parameter rectified flow transformer capable of editing images based on text instructions. Please check out the official doc to learn more about it.
After a successful run of delivering language models and vision-language models, the Qwen team is back with an image generation model, which is Apache-2.0 licensed! It achieves significant advances in complex text rendering and precise image editing. To learn more about this powerful model, refer to our docs.
Thanks to @naykun for contributing both Qwen-Image and Qwen-Image-Edit via this PR and this PR.
Make these newly added models your own with our training scripts:
Following the 🤗 Transformers’ philosophy of single-file modeling implementations, we have started implementing modeling code in single and self-contained files. The Flux Transformer code is one example of this.
We have massively refactored how we do attention in the models. This allows us to provide support for different attention backends (such as PyTorch native scaled_dot_product_attention, Flash Attention 3, SAGE attention, etc.) in the library seamlessly.
Having attention supported this way also allows us to integrate different parallelization mechanisms, which we’re actively working on. Follow this PR if you’re interested.
Users shouldn’t be affected at all by these changes. Please open an issue if you face any problems.
Regional compilation trims cold-start latency by only compiling the small and frequently-repeated block(s) of a model - typically a transformer layer - and enables reusing compiled artifacts for every subsequent occurrence. For many diffusion architectures, this delivers the same runtime speedups as full-graph compilation and reduces compile time by 8–10x. Refer to this doc to learn more.
Thanks to @anijain2305 for contributing this feature in this PR.
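The reason this works: a diffusion transformer is mostly one block type repeated many times, so compiling that block once and reusing the compiled artifact for every occurrence avoids almost all of the redundant work. A torch-free sketch of the caching idea (illustrative only; the real machinery lives inside torch.compile):

```python
compile_cache = {}
compile_count = 0

def compile_block(block_type):
    """Pretend to compile a block, reusing the artifact for repeated block types."""
    global compile_count
    if block_type not in compile_cache:
        compile_count += 1  # the expensive step runs once per unique block type
        compile_cache[block_type] = f"compiled<{block_type}>"
    return compile_cache[block_type]

# A model with 28 identical transformer layers triggers a single compilation.
artifacts = [compile_block("TransformerBlock") for _ in range(28)]
```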
We have also authored a number of posts that center around the use of torch.compile. You can check them out at the links below:
Users can now load pipelines directly on an accelerator device, leading to significantly faster load times. This is particularly evident when loading large pipelines like Wan and Qwen-Image.
from diffusers import DiffusionPipeline
import torch
ckpt_id = "Qwen/Qwen-Image"
pipe = DiffusionPipeline.from_pretrained(
- ckpt_id, torch_dtype=torch.bfloat16
- ).to("cuda")
+ ckpt_id, torch_dtype=torch.bfloat16, device_map="cuda"
+ )
You can speed up loading even more by enabling parallelized loading of state dict shards. This is particularly helpful when you’re working with large models like Wan and Qwen-Image, where the model state dicts are typically sharded across multiple files.
import os
os.environ["HF_ENABLE_PARALLEL_LOADING"] = "yes"
# rest of the loading code
....
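What the flag enables, conceptually, is reading the state-dict shards concurrently instead of one after another. A plain-Python sketch of that pattern (illustrative only; the actual shard loading happens inside from_pretrained):

```python
from concurrent.futures import ThreadPoolExecutor

def load_shard(shard_name):
    # Stand-in for reading one safetensors shard from disk.
    return {f"{shard_name}.weight": 0}

shard_names = ["model-00001-of-00003", "model-00002-of-00003", "model-00003-of-00003"]

# With parallel loading, the I/O-bound shard reads overlap in time.
state_dict = {}
with ThreadPoolExecutor() as pool:
    for shard in pool.map(load_shard, shard_names):
        state_dict.update(shard)
```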
@Isotr0py contributed support for native GGUF CUDA kernels in this PR. This should provide an approximately 10% improvement in inference speed.
We have also worked on a tool for converting regular checkpoints to GGUF, letting the community easily share their GGUF checkpoints. Learn more here.
We now support loading of Diffusers format GGUF checkpoints.
You can learn more about all of this in our GGUF official docs.
Modular Diffusers is a system for building diffusion pipelines from individual pipeline blocks. It is highly customisable, with blocks that can be mixed and matched to adapt or create a pipeline for a specific workflow or multiple workflows.
The API is currently in active development and is being released as an experimental feature. Learn more in our docs.
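The block-composition idea can be illustrated with a plain-Python sketch (conceptual only; this is not the actual ModularPipeline API, see the docs for the real interface):

```python
class EncodePrompt:
    """A block that reads and updates a shared pipeline state."""
    def __call__(self, state):
        state["embeds"] = f"embeds({state['prompt']})"
        return state

class Denoise:
    """A block that consumes the previous block's output."""
    def __call__(self, state):
        state["latents"] = f"denoised({state['embeds']})"
        return state

def run_blocks(blocks, state):
    # Blocks can be re-ordered, swapped, or reused across workflows.
    for block in blocks:
        state = block(state)
    return state

out = run_blocks([EncodePrompt(), Denoise()], {"prompt": "a cat"})
```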
- test_float16_inference in unit test by @kaixuanliu in #11809
- from_single_file method for WanVACE3DTransformer by @J4BEZ in #11807
- exclude_modules with Wan VACE by @sayakpaul in #11843
- _keep_in_fp32_modules by @a-r-r-o-w in #11851
- Transformer2DModel and finegrained variants by @sayakpaul in #11947
- guidance_scale docstring for guidance_distilled models by @sayakpaul in #11935
- prompt_2 optional in Flux Pipelines by @DN6 in #12073
- transformer. in key by @Beinsezii in #12101
- lightx2v/Qwen-Image-Lightning by @sayakpaul in #12119
- local_files_only=True when using sharded checkpoints by @sayakpaul in #12005
- hf_quantizer in cache warmup by @sayakpaul in #12043

The following contributors have made significant changes to the library over the last release:
Wan VACE supports various generation techniques that achieve controllable video generation. It comes in two variants: a 1.3B model for fast iteration and prototyping, and a 14B model for high-quality generation. Some of the capabilities include:
The code snippets available in this pull request demonstrate some examples of how videos can be generated with controllability signals.
Check out the docs to learn more.
Cosmos-Predict2 is a key branch of the Cosmos World Foundation Models (WFMs) ecosystem for Physical AI, specializing in future state prediction through advanced world modeling. It offers two powerful capabilities: text-to-image generation for creating high-quality images from text descriptions, and video-to-world generation for producing visual simulations from video inputs.
The Video2World model comes in a 2B and 14B variant. Check out the docs to learn more.
LTX 0.9.7 and its distilled variants are the latest in the family of models released by Lightricks.
Check out the docs to learn more.
Framepack is a novel method for enabling long video generation. There are two released variants of Hunyuan Video trained using this technique. Check out the docs to learn more.
The FusionX family of models and LoRAs, built on top of Wan2.1-14B, should already be supported. To load the model, use from_single_file():
import torch
from diffusers import WanTransformer3DModel

transformer = WanTransformer3DModel.from_single_file(
    "https://huggingface.co/vrgamedevgirl84/Wan14BT2VFusioniX/blob/main/Wan14Bi2vFusioniX_fp16.safetensors",
    torch_dtype=torch.bfloat16
)
To load the LoRAs, use load_lora_weights():
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "Wan-AI/Wan2.1-T2V-14B-Diffusers",
    torch_dtype=torch.bfloat16
).to("cuda")
pipe.load_lora_weights(
"vrgamedevgirl84/Wan14BT2VFusioniX", weight_name="FusionX_LoRa/Wan2.1_T2V_14B_FusionX_LoRA.safetensors"
)
AccVideo and CausVid are two novel distillation techniques that speed up the generation time of video diffusion models while preserving quality. Diffusers supports loading their extracted LoRAs with their respective models.
Text-to-image models from the Cosmos-Predict2 release. The models come in 2B and 14B variants. Check out the docs to learn more.
Chroma is an 8.9B parameter model based on FLUX.1-schnell. It’s fully Apache 2.0 licensed, ensuring that anyone can use, modify, and build on top of it. Check out the docs to learn more.
Thanks to @Ednaordinary for contributing it in this PR!
VisualCloze: A Universal Image Generation Framework via Visual In-Context Learning is a universal image generation framework based on visual in-context learning that offers key capabilities:
Check out the docs to learn more. Thanks to @lzyhha for contributing this in this PR!
torch.compile support

We have worked with the PyTorch team to improve how we provide torch.compile() compatibility throughout the library. More specifically, we now test widely used models like Flux for recompilation and graph-break issues, which can get in the way of fully realizing the benefits of torch.compile(). Refer to the following links to learn more:
Additionally, users can combine offloading with compilation to get a better speed-memory trade-off. Below is an example:
<details> <summary>Code</summary>

import torch
from diffusers import DiffusionPipeline
torch._dynamo.config.cache_size_limit = 10000
pipeline = DiffusionPipeline.from_pretrained(
"black-forest-labs/FLUX.1-schnell", torch_dtype=torch.bfloat16
)
pipeline.enable_model_cpu_offload()
# Compile.
pipeline.transformer.compile()
image = pipeline(
prompt="An astronaut riding a horse on Mars",
guidance_scale=0.,
height=768,
width=1360,
num_inference_steps=4,
max_sequence_length=256,
).images[0]
print(f"Max memory allocated: {torch.cuda.max_memory_allocated() / 1024**3:.2f} GB")
</details>
This is compatible with group offloading, too. Interested readers can check out the concerned PRs below:
You can substantially reduce memory requirements by combining quantization with offloading and then improving speed with torch.compile(). Below is an example:
from diffusers import BitsAndBytesConfig as DiffusersBitsAndBytesConfig
from transformers import BitsAndBytesConfig as TransformersBitsAndBytesConfig
from diffusers import AutoModel, FluxPipeline
from transformers import T5EncoderModel
import torch
torch._dynamo.config.recompile_limit = 1000
torch_dtype = torch.bfloat16
quant_kwargs = {"load_in_4bit": True, "bnb_4bit_compute_dtype": torch_dtype, "bnb_4bit_quant_type": "nf4"}
text_encoder_2_quant_config = TransformersBitsAndBytesConfig(**quant_kwargs)
dit_quant_config = DiffusersBitsAndBytesConfig(**quant_kwargs)
ckpt_id = "black-forest-labs/FLUX.1-dev"
text_encoder_2 = T5EncoderModel.from_pretrained(
ckpt_id,
subfolder="text_encoder_2",
quantization_config=text_encoder_2_quant_config,
torch_dtype=torch_dtype,
)
transformer = AutoModel.from_pretrained(
ckpt_id,
subfolder="transformer",
quantization_config=dit_quant_config,
torch_dtype=torch_dtype,
)
pipe = FluxPipeline.from_pretrained(
ckpt_id,
transformer=transformer,
text_encoder_2=text_encoder_2,
torch_dtype=torch_dtype,
)
pipe.enable_model_cpu_offload()
pipe.transformer.compile()
image = pipe(
prompt="An astronaut riding a horse on Mars",
guidance_scale=3.5,
height=768,
width=1360,
num_inference_steps=28,
max_sequence_length=512,
).images[0]
Starting from bitsandbytes==0.46.0 onwards, bnb-quantized models should be fully compatible with torch.compile() without graph-breaks. This means that when compiling a bnb-quantized model, users can do: model.compile(fullgraph=True). This can significantly improve speed while still providing memory benefits. The figure below provides a comparison with Flux.1-Dev. Refer to this benchmarking script to learn more.
Note that for 4-bit bnb models, you currently need to install a PyTorch nightly if fullgraph=True is specified during compilation.
Huge shoutout to @anijain2305 and @StrongerXi from the PyTorch team for the incredible support.
Users can now provide a quantization config while initializing a pipeline:
import torch
from diffusers import DiffusionPipeline
from diffusers.quantizers import PipelineQuantizationConfig
pipeline_quant_config = PipelineQuantizationConfig(
quant_backend="bitsandbytes_4bit",
quant_kwargs={"load_in_4bit": True, "bnb_4bit_quant_type": "nf4", "bnb_4bit_compute_dtype": torch.bfloat16},
components_to_quantize=["transformer", "text_encoder_2"],
)
pipe = DiffusionPipeline.from_pretrained(
"black-forest-labs/FLUX.1-dev",
quantization_config=pipeline_quant_config,
torch_dtype=torch.bfloat16,
).to("cuda")
image = pipe("photo of a cute dog").images[0]
This lowers the barrier to entry for users who want to use quantization without having to write much code. Refer to the documentation to learn more about the different configurations allowed through PipelineQuantizationConfig.
In the previous release, we shipped “group offloading” which lets you offload blocks/nodes within a model, optimizing its memory consumption. It also lets you overlap this offloading with computation, providing a good speed-memory trade-off, especially in low VRAM environments.
However, you still need a considerable amount of system RAM to make offloading work effectively. So, low VRAM and low RAM environments would still not work.
Starting with this release, users additionally have the option to offload to disk instead of RAM, further lowering memory consumption. Set offload_to_disk_path to enable this feature.
pipeline.transformer.enable_group_offload(
onload_device="cuda",
offload_device="cpu",
offload_type="leaf_level",
offload_to_disk_path="path/to/disk"
)
Refer to these two tables to compare the speed and memory trade-offs.
It is beneficial to include, in a LoRA state dict, the LoraConfig that was used to train the LoRA. In its absence, users were restricted to using the same LoRA alpha as the LoRA rank. We have modified the most popular training scripts to allow passing a custom lora_alpha through the CLI. Refer to this thread for more updates and to this comment for some extended clarifications.
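For context, LoRA applies its learned update scaled by alpha divided by rank, i.e. W' = W + (alpha / r) * B @ A, so when the alpha is missing from a checkpoint, loaders had to assume alpha == rank (a scale of 1.0). A quick illustration of the arithmetic:

```python
def lora_scale(lora_alpha: float, rank: int) -> float:
    """Scaling factor applied to the low-rank update B @ A."""
    return lora_alpha / rank

# The old implicit assumption, alpha == rank, leaves the update unscaled:
default_scale = lora_scale(16, 16)   # 1.0
# A custom alpha (now settable in the training scripts) changes the
# strength of the learned update:
stronger = lora_scale(32, 16)        # 2.0
```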
We have worked on a two-part series discussing the support of quantization in Diffusers. Check them out:
- DIFFUSERS_REQUEST_TIMEOUT for notification bot by @sayakpaul in #11273
- KolorsPipelineFastTests::test_inference_batch_single_identical pass on XPU by @faaany in #11313
- skrample section to community_projects.md by @Beinsezii in #11319
- AutoModel usage by @sayakpaul in #11300
- transformers>4.47.1 by @DN6 in #11293
- hotswap better by @sayakpaul in #11333
- StableDiffusionXLControlNetAdapterInpaintPipeline incorrectly inherited StableDiffusionLoraLoaderMixin by @Kazuki-Yoda in #11357
- torch.compile fullgraph compatibility for Hunyuan Video by @a-r-r-o-w in #11457
- peft by @sayakpaul in #11502
- removeprefix to preserve sanity by @sayakpaul in #11493
- torch.compile() with LoRA hotswapping by @sayakpaul in #11322
- torch_dtype="auto" option from docstrings by @johannaSommer in #11513
- load_lora_weights() for Flux and a test by @sayakpaul in #11595
- variant and safetensor file does not match by @kaixuanliu in #11587
- torch.compile() CI and tests by @sayakpaul in #11508
- [Wan] Fix VAE sampling mode in WanVideoToVideoPipeline by @tolgacangoz in #11639
- device_map clarifications by @sayakpaul in #11681
- AutoencoderKLWan.clear_cache by 886% by @misrasaurabh1 in #11665
- is_compileable property to quantizers by @sayakpaul in #11736
- PipelineQuantizationConfig by @sayakpaul in #11750
- apply_rotary_emb functions' comments by @tolgacangoz in #11717
- return by @sayakpaul in #11771

The following contributors have made significant changes to the library over the last release:
- transformers>4.47.1 (#11293)

Wan2.1 is a comprehensive and open suite of video foundation models that pushes the boundaries of video generation. The release includes four model variants and three pipelines for Text-to-Video, Image-to-Video, and Video-to-Video.
- Wan-AI/Wan2.1-T2V-1.3B-Diffusers
- Wan-AI/Wan2.1-T2V-14B-Diffusers
- Wan-AI/Wan2.1-I2V-14B-480P-Diffusers
- Wan-AI/Wan2.1-I2V-14B-720P-Diffusers

Check out the docs here to learn more.
LTX Video 0.9.5 is the updated version of the super-fast LTX Video model series. The latest model introduces additional conditioning options, such as keyframe-based animation and video extension (both forward and backward).
To support these additional conditioning inputs, we’ve introduced the LTXConditionPipeline and LTXVideoCondition object.
To learn more about the usage, check out the docs here.
Hunyuan utilizes a pre-trained Multimodal Large Language Model (MLLM) with a Decoder-Only architecture as the text encoder. The input image is processed by the MLLM to generate semantic image tokens. These tokens are then concatenated with the video latent tokens, enabling comprehensive full-attention computation across the combined data and seamlessly integrating information from both the image and its associated caption.
To learn more, check out the docs here.
SANA-Sprint is an efficient diffusion model for ultra-fast text-to-image generation. SANA-Sprint is built on a pre-trained foundation model and augmented with hybrid distillation, dramatically reducing inference steps from 20 to 1-4, rivaling the quality of models like Flux.
Shoutout to @lawrence-cj for their help and guidance on this PR.
Check out the pipeline docs of SANA-Sprint to learn more.
Lumina-Image-2.0 is a 2B parameter flow-based diffusion transformer for text-to-image generation released under the Apache 2.0 license.
Check out the docs to learn more. Thanks to @zhuole1025 for contributing this through this PR.
One can also LoRA fine-tune Lumina2, taking advantage of its Apache 2.0 licensing. Check out the guide for more details.
OmniGen is a unified image generation model that can handle multiple tasks including text-to-image, image editing, subject-driven generation, and various computer vision tasks within a single framework. The model consists of a VAE, and a single transformer based on Phi-3 that handles text and image encoding as well as the diffusion process.
Check out the docs to learn more about OmniGen. Thanks to @staoxiao for contributing OmniGen in this PR.
PyTorch supports torch.float8_e4m3fn and torch.float8_e5m2 as weight storage dtypes, but they can’t be used for computation on many devices due to unimplemented kernel support.
However, you can still use these dtypes to store model weights in FP8 precision and upcast them to a widely supported dtype such as torch.float16 or torch.bfloat16 on-the-fly when the layers are used in the forward pass. This is known as layerwise weight-casting. This can potentially cut down the VRAM requirements of a model by 50%.
import torch
from diffusers import CogVideoXPipeline, CogVideoXTransformer3DModel
from diffusers.utils import export_to_video
model_id = "THUDM/CogVideoX-5b"
# Load the model in bfloat16 and enable layerwise casting
transformer = CogVideoXTransformer3DModel.from_pretrained(model_id, subfolder="transformer", torch_dtype=torch.bfloat16)
transformer.enable_layerwise_casting(storage_dtype=torch.float8_e4m3fn, compute_dtype=torch.bfloat16)
# Load the pipeline
pipe = CogVideoXPipeline.from_pretrained(model_id, transformer=transformer, torch_dtype=torch.bfloat16)
pipe.to("cuda")
prompt = (
"A panda, dressed in a small, red jacket and a tiny hat, sits on a wooden stool in a serene bamboo forest. "
"The panda's fluffy paws strum a miniature acoustic guitar, producing soft, melodic tunes. Nearby, a few other "
"pandas gather, watching curiously and some clapping in rhythm. Sunlight filters through the tall bamboo, "
"casting a gentle glow on the scene. The panda's face is expressive, showing concentration and joy as it plays. "
"The background includes a small, flowing stream and vibrant green foliage, enhancing the peaceful and magical "
"atmosphere of this unique musical performance."
)
video = pipe(prompt=prompt, guidance_scale=6, num_inference_steps=50).frames[0]
export_to_video(video, "output.mp4", fps=8)
Group offloading is the middle ground between sequential and model offloading. It works by offloading groups of internal layers (either torch.nn.ModuleList or torch.nn.Sequential), which uses less memory than model-level offloading. It is also faster than sequential-level offloading because the number of device synchronizations is reduced.
On CUDA devices, we also have the option to enable using layer prefetching with CUDA Streams. The next layer to be executed is loaded onto the accelerator device while the current layer is being executed which makes inference substantially faster while still keeping VRAM requirements very low. With this, we introduce the idea of overlapping computation with data transfer.
One thing to note is that using CUDA streams can cause a considerable spike in CPU RAM usage. Please ensure that the available CPU RAM is 2 times the size of the model if you choose to set use_stream=True. You can reduce CPU RAM usage by setting low_cpu_mem_usage=True. This should limit the CPU RAM used to be roughly the same as the size of the model, but will introduce slight latency in the inference process.
You can also use record_stream=True when using use_stream=True to obtain more speedups at the expense of slightly increased memory usage.
import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video
# Load the pipeline
onload_device = torch.device("cuda")
offload_device = torch.device("cpu")
pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
# We can utilize the enable_group_offload method for Diffusers model implementations
pipe.transformer.enable_group_offload(
onload_device=onload_device,
offload_device=offload_device,
offload_type="leaf_level",
use_stream=True
)
prompt = (
"A panda, dressed in a small, red jacket and a tiny hat, sits on a wooden stool in a serene bamboo forest. "
"The panda's fluffy paws strum a miniature acoustic guitar, producing soft, melodic tunes. Nearby, a few other "
"pandas gather, watching curiously and some clapping in rhythm. Sunlight filters through the tall bamboo, "
"casting a gentle glow on the scene. The panda's face is expressive, showing concentration and joy as it plays. "
"The background includes a small, flowing stream and vibrant green foliage, enhancing the peaceful and magical "
"atmosphere of this unique musical performance."
)
video = pipe(prompt=prompt, guidance_scale=6, num_inference_steps=50).frames[0]
# This utilized about 14.79 GB. It can be further reduced by using tiling and leaf_level offloading throughout the pipeline.
print(f"Max memory allocated: {torch.cuda.max_memory_allocated() / 1024**3:.2f} GB")
export_to_video(video, "output.mp4", fps=8)
Group offloading can also be applied to non-Diffusers models such as text encoders from the transformers library.
import torch
from diffusers import CogVideoXPipeline
from diffusers.hooks import apply_group_offloading
from diffusers.utils import export_to_video
# Load the pipeline
onload_device = torch.device("cuda")
offload_device = torch.device("cpu")
pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
# For any other model implementations, the apply_group_offloading function can be used
apply_group_offloading(pipe.text_encoder, onload_device=onload_device, offload_type="block_level", num_blocks_per_group=2)
Remote components are an experimental feature designed to offload memory-intensive steps of the inference pipeline to remote endpoints. The initial implementation focuses primarily on VAE decoding operations. Below are the currently supported model endpoints:
| Pipeline | Endpoint | VAE |
|---|---|---|
| Stable Diffusion v1 | https://q1bj3bpq6kzilnsu.us-east-1.aws.endpoints.huggingface.cloud | stabilityai/sd-vae-ft-mse |
| Stable Diffusion XL | https://x2dmsqunjd6k9prw.us-east-1.aws.endpoints.huggingface.cloud | madebyollin/sdxl-vae-fp16-fix |
| Flux | https://whhx50ex1aryqvw6.us-east-1.aws.endpoints.huggingface.cloud | black-forest-labs/FLUX.1-schnell |
| HunyuanVideo | https://o7ywnmrahorts457.us-east-1.aws.endpoints.huggingface.cloud | hunyuanvideo-community/HunyuanVideo |
This is an example of using remote decoding with the Hunyuan Video pipeline:
<details> <summary>Code</summary>

import torch
from diffusers import HunyuanVideoPipeline, HunyuanVideoTransformer3DModel
from diffusers.utils.remote_utils import remote_decode

model_id = "hunyuanvideo-community/HunyuanVideo"
transformer = HunyuanVideoTransformer3DModel.from_pretrained(
model_id, subfolder="transformer", torch_dtype=torch.bfloat16
)
pipe = HunyuanVideoPipeline.from_pretrained(
model_id, transformer=transformer, vae=None, torch_dtype=torch.float16
).to("cuda")
latent = pipe(
prompt="A cat walks on the grass, realistic",
height=320,
width=512,
num_frames=61,
num_inference_steps=30,
output_type="latent",
).frames
video = remote_decode(
endpoint="https://o7ywnmrahorts457.us-east-1.aws.endpoints.huggingface.cloud/",
tensor=latent,
output_type="mp4",
)
if isinstance(video, bytes):
with open("video.mp4", "wb") as f:
f.write(video)
</details>
Check out the docs to learn more.
Cached Inference for Diffusion Transformer models is a performance optimization that significantly accelerates the denoising process by caching intermediate values. This technique reduces redundant computations across timesteps, resulting in faster generation with a slight dip in output quality.
Check out the docs to learn more about the available caching methods.
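The common mechanism behind these caching methods can be sketched simply: fully recompute an expensive block only every few timesteps and reuse the cached output in between (a conceptual sketch, not the actual implementation; the hypothetical skip_range parameter mirrors the *_skip_range options in the configs below):

```python
class CachedBlock:
    """Recompute only every `skip_range` steps; otherwise reuse the cache."""
    def __init__(self, skip_range):
        self.skip_range = skip_range
        self.cache = None
        self.compute_count = 0

    def __call__(self, step, x):
        if self.cache is None or step % self.skip_range == 0:
            self.compute_count += 1
            self.cache = x * 2  # stand-in for the real attention computation
        return self.cache

block = CachedBlock(skip_range=2)
outputs = [block(step, step) for step in range(8)]  # half the compute is skipped
```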
Pyramid Attention Broadcast
<details> <summary>Code</summary>

import torch
from diffusers import CogVideoXPipeline, PyramidAttentionBroadcastConfig
pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
pipe.to("cuda")
config = PyramidAttentionBroadcastConfig(
spatial_attention_block_skip_range=2,
spatial_attention_timestep_skip_range=(100, 800),
current_timestep_callback=lambda: pipe.current_timestep,
)
pipe.transformer.enable_cache(config)
</details>
FasterCache
<details> <summary>Code</summary>

import torch
from diffusers import CogVideoXPipeline, FasterCacheConfig
pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
pipe.to("cuda")
config = FasterCacheConfig(
spatial_attention_block_skip_range=2,
spatial_attention_timestep_skip_range=(-1, 901),
unconditional_batch_skip_range=2,
attention_weight_callback=lambda _: 0.5,
is_guidance_distilled=True,
)
pipe.transformer.enable_cache(config)
</details>
Diffusers now supports the Quanto quantization backend, which provides float8, int8, int4, and int2 quantization dtypes.
import torch
from diffusers import FluxTransformer2DModel, QuantoConfig
model_id = "black-forest-labs/FLUX.1-dev"
quantization_config = QuantoConfig(weights_dtype="float8")
transformer = FluxTransformer2DModel.from_pretrained(
    model_id,
    subfolder="transformer",
    quantization_config=quantization_config,
    torch_dtype=torch.bfloat16,
)
Quanto int8 models are also compatible with torch.compile:
import torch
from diffusers import FluxTransformer2DModel, QuantoConfig
model_id = "black-forest-labs/FLUX.1-dev"
quantization_config = QuantoConfig(weights_dtype="int8")
transformer = FluxTransformer2DModel.from_pretrained(
    model_id,
    subfolder="transformer",
    quantization_config=quantization_config,
    torch_dtype=torch.bfloat16,
)
transformer.compile()
uintx TorchAO checkpoints with torch>=2.6
TorchAO checkpoints currently have to be serialized using pickle. For some quantization dtypes using the uintx format, such as uint4wo, this involves saving subclassed TorchAO Tensor objects in the model file. This made loading the models directly with Diffusers tricky, since we do not allow deserializing arbitrary Python objects from pickle files.
Torch 2.6 allows adding expected Tensors to torch safe globals, which lets us directly load TorchAO checkpoints with these objects.
- state_dict = torch.load("/path/to/flux_uint4wo/diffusion_pytorch_model.bin", weights_only=False, map_location="cpu")
- with init_empty_weights():
- transformer = FluxTransformer2DModel.from_config("/path/to/flux_uint4wo/config.json")
- transformer.load_state_dict(state_dict, strict=True, assign=True)
+ transformer = FluxTransformer2DModel.from_pretrained("/path/to/flux_uint4wo/")
We have shipped a couple of improvements on the LoRA front in this release.
🚨 Improved coverage for loading non-diffusers LoRA checkpoints for Flux
Take note of the breaking change introduced in this PR 🚨 We suggest upgrading your peft installation to the latest version (pip install -U peft), especially when dealing with Flux LoRAs.
torch.compile() support when hotswapping LoRAs without triggering recompilation
A common use case when serving multiple adapters is to load one adapter first, generate images, load another adapter, generate more images, load another adapter, etc. This workflow normally requires calling load_lora_weights(), set_adapters(), and possibly delete_adapters() to save memory. Moreover, if the model is compiled using torch.compile, performing these steps requires recompilation, which takes time.
To better support this common workflow, you can "hotswap" a LoRA adapter to avoid accumulating memory and, in some cases, recompilation. It requires an adapter to already be loaded; the new adapter's weights are then swapped in-place for the existing adapter's.
Check out the docs to learn more about this feature.
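The mechanics can be pictured with a toy sketch in plain Python (this is not the Diffusers or PEFT API): hotswapping writes the new adapter's weights into the existing buffers in place, so nothing the compiler specialized on, such as object identity or shape, changes.

```python
# Toy illustration of in-place adapter swapping (names and structure invented).
adapter_a = {"lora_down": [1.0, 2.0], "lora_up": [3.0, 4.0]}
adapter_b = {"lora_down": [9.0, 8.0], "lora_up": [7.0, 6.0]}

# "Load" adapter A into the model's buffers.
active = {name: list(vals) for name, vals in adapter_a.items()}
ids_before = {name: id(buf) for name, buf in active.items()}

def hotswap(active, new_weights):
    for name, buf in active.items():
        buf[:] = new_weights[name]  # overwrite contents without rebinding the buffer

hotswap(active, adapter_b)
# Buffer identities are unchanged, but the weights are adapter B's.
assert {name: id(buf) for name, buf in active.items()} == ids_before
assert active["lora_down"] == [9.0, 8.0]
```

Because the buffers themselves are reused, memory does not grow with each swapped adapter, and a compiled graph keyed on those buffers need not be rebuilt.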
The other major change is support for dtype maps for pipelines. Since various pipelines require their components to run in different compute dtypes, we now support passing a dtype map when initializing a pipeline:
from diffusers import HunyuanVideoPipeline
import torch
pipe = HunyuanVideoPipeline.from_pretrained(
"hunyuanvideo-community/HunyuanVideo",
torch_dtype={"transformer": torch.bfloat16, "default": torch.float16},
)
print(pipe.transformer.dtype, pipe.vae.dtype) # (torch.bfloat16, torch.float16)
This release includes an AutoModel object similar to the one found in transformers that automatically fetches the appropriate model class for the provided repo.
from diffusers import AutoModel
unet = AutoModel.from_pretrained("runwayml/stable-diffusion-v1-5", subfolder="unet")
- StableDiffusion3Img2ImgPipeline by @guiyrt in #10589
- requires_grad False. by @sayakpaul in #10607
- scheduling_ddpm.py docs by @JacobHelwig in #10648
- fp8_e4m3_bf16_max_memory < fp8_e4m3_fp32_max_memory by @sayakpaul in #10669
- Model Search by @suzukimain in #10417
- Self type hint to ModelMixin's from_pretrained by @hlky in #10742
- use_lu_lambdas and use_karras_sigmas with beta_schedule=squaredcos_cap_v2 in DPMSolverMultistepScheduler by @hlky in #10740
- MultiControlNetUnionModel on SDXL by @guiyrt in #10747
- to+FromOriginalModelMixin/FromSingleFileMixin from_single_file type hint by @hlky in #10811
- set_adapters() robust on silent failures. by @sayakpaul in #9618
- isinstance for arg checks in GGUFParameter by @AstraliteHeart in #10834
- encode_prompt() in isolation by @sayakpaul in #10438
- generation_config in pipeline by @JeffersonQin in #10779
- main by @sayakpaul in #10289
- device_map in load_model_dict_into_meta by @hlky in #10851
- from_pretrained kwargs by @guiyrt in #10758
- torch_dtype in Kolors text encoder with transformers v4.49 by @hlky in #10816
- remote_decode to remote_utils by @hlky in #10898
- huggingface_hub by @hanouticelina in #10970
- num_train_epochs is passed in a distributed training env by @flyxiv in #10973
- [Research Project] Add AnyText: Multilingual Visual Text Generation And Editing by @tolgacangoz in #8998
- output_size in repeat_interleave by @hlky in #11030
- formatted_images initialization compact by @YanivDorGalron in #10801
- export_to_video by @hlky in #11090
- torch_dtype and don't use when quantization_config is set by @hlky in #11039
- load_lora_adapter in PeftAdapterMixin class by @kentdan3msu in #11155
- installation.md by @remarkablemark in #11179
- latents_mean and latents_std to SDXLLongPromptWeightingPipeline by @hlky in #11034
- F.pad by @bm-synth in #10620
- save_model in ModelMixin save_pretrained and use safe_serialization=False in test by @hlky in #11196
- torch_dtype map by @hlky in #11194
- LTXConditionPipeline for text-only conditioning by @tolgacangoz in #11174
- record_stream when using CUDA streams during group offloading by @sayakpaul in #11081

The following contributors have made significant changes to the library over the last release:
- StableDiffusion3Img2ImgPipeline (#10589)
- MultiControlNetUnionModel on SDXL (#10747)
- from_pretrained kwargs (#10758)
- Model Search (#10417)
- [Research Project] Add AnyText: Multilingual Visual Text Generation And Editing (#8998)
- LTXConditionPipeline for text-only conditioning (#11174)

This patch release:
- unload_lora_weights for Flux Control
- load_lora_into_text_encoder() and fuse_lora() copied from by @sayakpaul in #10495
- unload_lora_weights() for Flux Control. by @sayakpaul in #10206

This patch release fixes a few bugs related to the TorchAO Quantizer introduced in v0.32.0.
Refer to our documentation to learn more about how to use different quantization backends.
https://github.com/user-attachments/assets/34d5f7ca-8e33-4401-8109-5c245ce7595f
This release took a while, but it has many exciting updates. It contains several new pipelines for image and video generation, new quantization backends, and more.
Going forward, to provide more transparency to the community about ongoing developments and releases in Diffusers, we will be making use of a roadmap tracker.
Open video generation models are on the rise, and we’re pleased to provide comprehensive integration support for all of them. The following video pipelines are bundled in this release:
Check out this section to learn more about the fine-tuning options available for these new video models.
Important Note about the new Flux Models
We can combine the regular Flux.1 Dev LoRAs with Flux Control LoRAs, Flux Control, and Flux Fill. For example, you can enable few-steps inference with Flux Fill using:
from diffusers import FluxFillPipeline
from diffusers.utils import load_image
import torch
pipe = FluxFillPipeline.from_pretrained(
"black-forest-labs/FLUX.1-Fill-dev", torch_dtype=torch.bfloat16
).to("cuda")
adapter_id = "alimama-creative/FLUX.1-Turbo-Alpha"
pipe.load_lora_weights(adapter_id)
image = load_image("https://huggingface.co/datasets/diffusers/diffusers-images-docs/resolve/main/cup.png")
mask = load_image("https://huggingface.co/datasets/diffusers/diffusers-images-docs/resolve/main/cup_mask.png")
image = pipe(
prompt="a white paper cup",
image=image,
mask_image=mask,
height=1632,
width=1232,
guidance_scale=30,
num_inference_steps=8,
max_sequence_length=512,
generator=torch.Generator("cpu").manual_seed(0)
).images[0]
image.save("flux-fill-dev.png")
To learn more, check out the documentation.
[!NOTE]
SANA is a small model compared to others like Flux; Sana-0.6B can be deployed on a 16GB laptop GPU and takes less than 1 second to generate a 1024×1024 image. We support LoRA fine-tuning of SANA. Check out this section for more details.
Please be aware of the following caveats:
safetensors currently. This may change in the future.

This release features many new training scripts for the community to play with:
- require_accelerate_version_greater by @faaany in #9746
- load_lora_adapter() for compatible models by @sayakpaul in #9712
- controlnet module by @sayakpaul in #8768
- AttentionProcessor type by @Prgckwb in #9909
- save_lora_adapter() by @sayakpaul in #9862
- pipelines tests device-agnostic (part1) by @faaany in #9399
- beta, exponential and karras sigmas to FlowMatchEulerDiscreteScheduler by @hlky in #10001
- pipelines tests device-agnostic (part2) by @faaany in #9400
- sigmas to Flux pipelines by @hlky in #10081
- num_images_per_prompt>1 with Skip Guidance Layers in StableDiffusion3Pipeline by @hlky in #10086
- sigmas to np.array in FlowMatch set_timesteps by @hlky in #10088
- skip_guidance_layers in SD3 pipeline by @hlky in #10102
- pipeline_stable_audio formating by @hlky in #10114
- sigmas to pipelines using FlowMatch by @hlky in #10116
- bnb by @ariG23498 in #10012
- Civitai into Existing Pipelines by @suzukimain in #9986
- revision argument when loading single file config by @a-r-r-o-w in #10168
- torch in get_3d_rotary_pos_embed/_allegro by @hlky in #10161
- set_adapters() and attn kwargs outs match by @sayakpaul in #10110
- negative_* from SDXL callback by @hlky in #10203
- torch in get_2d_sincos_pos_embed and get_3d_sincos_pos_embed by @hlky in #10156
- SanaPipeline, SanaPAGPipeline, LinearAttentionProcessor, Flow-based DPM-sovler and so on. by @lawrence-cj in #9982
- t instead of timestep in _apply_perturbed_attention_guidance by @hlky in #10243
- dynamic_shifting to SD3 by @hlky in #10236
- use_flow_sigmas by @hlky in #10242
- set_shift to FlowMatchEulerDiscreteScheduler by @hlky in #10269
- torch in get_2d_rotary_pos_embed by @hlky in #10155
- time_embed_dim of UNet2DModel changeable by @Bichidian in #10262
- from_pretrained by @hlky in #10189
- local_files_only for checkpoints with shards by @hlky in #10294
- sample_size attribute is to accept tuple(h, w) in StableDiffusionPipeline by @Foundsheep in #10181
- from_pretrained() by @sayakpaul in #10316
- get_parameter_dtype by @yiyixuxu in #10342
- .from_single_file() - Add missing .shape by @gau-nernst in #10332

The following contributors have made significant changes to the library over the last release:
- require_accelerate_version_greater (#9746)
- pipelines tests device-agnostic (part1) (#9399)
- pipelines tests device-agnostic (part2) (#9400)
- get_parameter_dtype (#10342)
- beta, exponential and karras sigmas to FlowMatchEulerDiscreteScheduler (#10001)
- sigmas to Flux pipelines (#10081)
- num_images_per_prompt>1 with Skip Guidance Layers in StableDiffusion3Pipeline (#10086)
- sigmas to np.array in FlowMatch set_timesteps (#10088)
- skip_guidance_layers in SD3 pipeline (#10102)
- pipeline_stable_audio formating (#10114)
- sigmas to pipelines using FlowMatch (#10116)
- torch in get_3d_rotary_pos_embed/_allegro (#10161)
- negative_* from SDXL callback (#10203)
- torch in get_2d_sincos_pos_embed and get_3d_sincos_pos_embed (#10156)
- t instead of timestep in _apply_perturbed_attention_guidance (#10243)
- dynamic_shifting to SD3 (#10236)
- use_flow_sigmas (#10242)
- set_shift to FlowMatchEulerDiscreteScheduler (#10269)
- torch in get_2d_rotary_pos_embed (#10155)
- from_pretrained (#10189)
- local_files_only for checkpoints with shards (#10294)
- Civitai into Existing Pipelines (#9986)
- SanaPipeline, SanaPAGPipeline, LinearAttentionProcessor, Flow-based DPM-sovler and so on. (#9982)

Stability AI's latest text-to-image generation model is Stable Diffusion 3.5 Large. SD3.5 Large is the next iteration of Stable Diffusion 3. It comes with two checkpoints (both of which have 8B params):
Make sure to fill out the form on the model page, then run huggingface-cli login before running the code below.
# make sure to update diffusers
# pip install -U diffusers
import torch
from diffusers import StableDiffusion3Pipeline
pipe = StableDiffusion3Pipeline.from_pretrained(
"stabilityai/stable-diffusion-3.5-large", torch_dtype=torch.bfloat16
).to("cuda")
image = pipe(
prompt="a photo of a cat holding a sign that says hello world",
negative_prompt="",
num_inference_steps=40,
height=1024,
width=1024,
guidance_scale=4.5,
).images[0]
image.save("sd3_hello_world.png")
Follow the documentation to learn more.
We added a new text-to-image model, Cogview3-plus, from the THUDM team! The model is DiT-based and supports image generation from 512 to 2048px. Thanks to @zRzRzRzRzRzRzR for contributing it!
from diffusers import CogView3PlusPipeline
import torch
pipe = CogView3PlusPipeline.from_pretrained("THUDM/CogView3-Plus-3B", torch_dtype=torch.float16).to("cuda")
# Enable it to reduce GPU memory usage
pipe.enable_model_cpu_offload()
pipe.vae.enable_slicing()
pipe.vae.enable_tiling()
prompt = "A vibrant cherry red sports car sits proudly under the gleaming sun, its polished exterior smooth and flawless, casting a mirror-like reflection. The car features a low, aerodynamic body, angular headlights that gaze forward like predatory eyes, and a set of black, high-gloss racing rims that contrast starkly with the red. A subtle hint of chrome embellishes the grille and exhaust, while the tinted windows suggest a luxurious and private interior. The scene conveys a sense of speed and elegance, the car appearing as if it's about to burst into a sprint along a coastal road, with the ocean's azure waves crashing in the background."
image = pipe(
    prompt=prompt,
    guidance_scale=7.0,
    num_images_per_prompt=1,
    num_inference_steps=50,
    width=1024,
    height=1024,
).images[0]
image.save("cogview3.png")
Refer to the documentation to learn more.
We have landed native quantization support in Diffusers, starting with bitsandbytes as its first quantization backend. With this, we hope to see large diffusion models becoming much more accessible to run on consumer hardware.
The example below shows how to run Flux.1 Dev with the NF4 data-type. Make sure you install the libraries:
pip install -Uq git+https://github.com/huggingface/transformers@main
pip install -Uq bitsandbytes
pip install -Uq diffusers
from diffusers import BitsAndBytesConfig, FluxTransformer2DModel
import torch
ckpt_id = "black-forest-labs/FLUX.1-dev"
nf4_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16
)
model_nf4 = FluxTransformer2DModel.from_pretrained(
    ckpt_id,
    subfolder="transformer",
    quantization_config=nf4_config,
    torch_dtype=torch.bfloat16
)
Then, we use model_nf4 to instantiate the FluxPipeline:
from diffusers import FluxPipeline
pipeline = FluxPipeline.from_pretrained(
    ckpt_id,
    transformer=model_nf4,
    torch_dtype=torch.bfloat16
)
pipeline.enable_model_cpu_offload()
prompt = "A whimsical and creative image depicting a hybrid creature that is a mix of a waffle and a hippopotamus, basking in a river of melted butter amidst a breakfast-themed landscape. It features the distinctive, bulky body shape of a hippo. However, instead of the usual grey skin, the creature's body resembles a golden-brown, crispy waffle fresh off the griddle. The skin is textured with the familiar grid pattern of a waffle, each square filled with a glistening sheen of syrup. The environment combines the natural habitat of a hippo with elements of a breakfast table setting, a river of warm, melted butter, with oversized utensils or plates peeking out from the lush, pancake-like foliage in the background, a towering pepper mill standing in for a tree. As the sun rises in this fantastical world, it casts a warm, buttery glow over the scene. The creature, content in its butter river, lets out a yawn. Nearby, a flock of birds take flight"
image = pipeline(
    prompt=prompt,
    negative_prompt="",
    num_inference_steps=50,
    guidance_scale=4.5,
    max_sequence_length=512,
).images[0]
image.save("whimsical.png")
Follow the documentation here to learn more. Additionally, check out this Colab Notebook that runs Flux.1 Dev in an end-to-end manner with NF4 quantization.
We have a fresh bucket of training scripts with this release:
Video model fine-tuning can be quite expensive. So, we have worked on a repository, cogvideox-factory, which provides memory-optimized scripts to fine-tune the Cog family of models.
- num_train_epochs is passed in a distributed training env by @AnandK27 in #9316
- 16 by @yiyixuxu in #9573
- transformer.device_map by @sayakpaul in #9553
- joint_attention_kwargs is not passed to the FLUX's transformer attention processors by @HorizonWind2004 in #9517
- test_low_cpu_mem_usage_with_loading by @sayakpaul in #9662
- [Community Pipeline] Add 🪆Matryoshka Diffusion Models by @tolgacangoz in #9157
- if not return_dict path by @hlky in #9649
- SD3ControlNetModel to SD3MultiControlNetModel by @hlky in #9652
- HunyuanDiT2DControlNetModel to HunyuanDiT2DMultiControlNetModel by @hlky in #9651
- DPMSolverSDE, Heun, KDPM2Ancestral and KDPM2 by @hlky in #9650
- Euler, EDMEuler, FlowMatchHeun, KDPM2Ancestral by @hlky in #9616
- src/diffusers/training_utils.py by @mreraser in #9606
- community/hd_painter.py by @Jwaminju in #9593
- models/embeddings_flax.py by @Jwaminju in #9592
- make deps_table_update to fix CI tests by @a-r-r-o-w in #9720
- bitsandbytes by @sayakpaul in #9213
- pipline_stable_diffusion.py by @jeongiin in #9590
- schedule_shifted_power usage in 🪆Matryoshka Diffusion Models by @tolgacangoz in #9723

The following contributors have made significant changes to the library over the last release:
- if not return_dict path (#9649)
- SD3ControlNetModel to SD3MultiControlNetModel (#9652)
- HunyuanDiT2DControlNetModel to HunyuanDiT2DMultiControlNetModel (#9651)
- DPMSolverSDE, Heun, KDPM2Ancestral and KDPM2 (#9650)
- Euler, EDMEuler, FlowMatchHeun, KDPM2Ancestral (#9616)
- 16 (#9573)
- [Community Pipeline] Add 🪆Matryoshka Diffusion Models (#9157)
- schedule_shifted_power usage in 🪆Matryoshka Diffusion Models (#9723)

This patch release adds Diffusers support for the upcoming CogVideoX-5B-I2V release (an Image-to-Video generation model)! The model weights will be available by the end of the week on the HF Hub at THUDM/CogVideoX-5b-I2V (Link). Stay tuned for the release!
This release features two new pipelines:
Additionally, we now have support for tiled encoding in the CogVideoX VAE. This can be enabled by calling the vae.enable_tiling() method, and it is used in the new Video-to-Video pipeline to encode sample videos to latents in a memory-efficient manner.
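The core trick behind tiled encoding can be sketched in a few lines (illustrative only; the helper and the tile/overlap sizes below are made up, not the values CogVideoX actually uses): each frame is split into overlapping tiles whose start offsets cover the full extent, so only one tile needs to be resident in memory at a time.

```python
def tile_starts(length, tile, overlap):
    # Start offsets of overlapping tiles along one spatial dimension,
    # ensuring the final tile reaches the end of the frame.
    stride = tile - overlap
    starts = list(range(0, max(length - tile, 0) + 1, stride))
    if starts[-1] + tile < length:
        starts.append(length - tile)
    return starts

# e.g. a 10-pixel dimension covered by 4-pixel tiles overlapping by 1
print(tile_starts(10, 4, 1))  # [0, 3, 6]
```

The overlap regions are later blended so that tile boundaries do not produce visible seams in the decoded latents.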
The code below demonstrates how to use the new image-to-video pipeline:
import torch
from diffusers import CogVideoXImageToVideoPipeline
from diffusers.utils import export_to_video, load_image
pipe = CogVideoXImageToVideoPipeline.from_pretrained("THUDM/CogVideoX-5b-I2V", torch_dtype=torch.bfloat16)
pipe.to("cuda")
# Optionally, enable memory optimizations.
# If enabling CPU offloading, remember to remove `pipe.to("cuda")` above
pipe.enable_model_cpu_offload()
pipe.vae.enable_tiling()
prompt = "An astronaut hatching from an egg, on the surface of the moon, the darkness and depth of space realised in the background. High quality, ultrarealistic detail and breath-taking movie-like camera shot."
image = load_image(
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/astronaut.jpg"
)
video = pipe(image, prompt, use_dynamic_cfg=True)
export_to_video(video.frames[0], "output.mp4", fps=8)
<table align=center>
<tr>
<td align=center colspan=1><img src="https://github.com/user-attachments/assets/1c7c1d86-f97e-44dd-9b17-4fec2bbc2b1a" /></td>
<td align=center colspan=1><video src="https://github.com/user-attachments/assets/a115372e-c539-4ca0-b0d0-770d62862257"> Your browser does not support the video tag. </video></td>
</tr>
</table>
The code below demonstrates how to use the new video-to-video pipeline:
import torch
from diffusers import CogVideoXDPMScheduler, CogVideoXVideoToVideoPipeline
from diffusers.utils import export_to_video, load_video
# Models: "THUDM/CogVideoX-2b" or "THUDM/CogVideoX-5b"
pipe = CogVideoXVideoToVideoPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=torch.bfloat16)
pipe.scheduler = CogVideoXDPMScheduler.from_config(pipe.scheduler.config)
pipe.to("cuda")
input_video = load_video(
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/hiker.mp4"
)
prompt = (
    "An astronaut stands triumphantly at the peak of a towering mountain. Panorama of rugged peaks and "
    "valleys. Very futuristic vibe and animated aesthetic. Highlights of purple and golden colors in "
    "the scene. The sky looks like an animated/cartoonish dream of galaxies, nebulae, stars, planets, "
    "moons, but the remainder of the scene is mostly realistic."
)
video = pipe(
    video=input_video, prompt=prompt, strength=0.8, guidance_scale=6, num_inference_steps=50
).frames[0]
export_to_video(video, "output.mp4", fps=8)
<table align=center>
<tr>
<td align=center><video src="https://github.com/user-attachments/assets/bc9273ff-e459-42f9-af1e-c9b084b28f4d"> Your browser does not support the video tag. </video></td>
</tr>
</table>
Shoutout to @tin2tin for the awesome demonstration!
Refer to our documentation to learn more about it.
This patch release adds Diffusers support for the upcoming CogVideoX-5B release! The model weights will be available next week on the Hugging Face Hub at THUDM/CogVideoX-5b. Stay tuned for the release!
Additionally, we have implemented a VAE tiling feature, which reduces the memory requirement for CogVideoX models. With this update, the total memory requirement is now 12GB for CogVideoX-2B and 21GB for CogVideoX-5B (with CPU offloading). To enable this feature, simply call enable_tiling() on the VAE.
The code below shows how to generate a video with CogVideoX-5B
import torch
from diffusers import CogVideoXPipeline
from diffusers.utils import export_to_video
prompt = "Tracking shot,late afternoon light casting long shadows,a cyclist in athletic gear pedaling down a scenic mountain road,winding path with trees and a lake in the background,invigorating and adventurous atmosphere."
pipe = CogVideoXPipeline.from_pretrained(
"THUDM/CogVideoX-5b",
torch_dtype=torch.bfloat16
)
pipe.enable_model_cpu_offload()
pipe.vae.enable_tiling()
video = pipe(
    prompt=prompt,
    num_videos_per_prompt=1,
    num_inference_steps=50,
    num_frames=49,
    guidance_scale=6,
).frames[0]
export_to_video(video, "output.mp4", fps=8)
https://github.com/user-attachments/assets/c2d4f7e8-ef86-4da6-8085-cb9f83f47f34
Refer to our documentation to learn more about it.
- imageio by @DN6 in #9094

Image taken from Lumina's GitHub.
This release features many new pipelines. Below, we provide a list:
Audio pipelines 🎼
Video pipelines 📹
Image pipelines 🎇
Be sure to check out the respective docs to know more about these pipelines. Some additional pointers are below for curious minds:
optimum.quanto. Check it out here.

| Without PAG | With PAG |
|---|---|
We already had community pipelines for PAG, but given its usefulness, we decided to make it a first-class citizen of the library. We have a central usage guide for PAG here, which should be the entry point for a user interested in understanding and using PAG for their use cases. We currently support the following pipelines with PAG:
- StableDiffusionPAGPipeline
- StableDiffusion3PAGPipeline
- StableDiffusionControlNetPAGPipeline
- StableDiffusionXLPAGPipeline
- StableDiffusionXLPAGImg2ImgPipeline
- StableDiffusionXLPAGInpaintPipeline
- StableDiffusionXLControlNetPAGPipeline
- PixArtSigmaPAGPipeline
- HunyuanDiTPAGPipeline
- AnimateDiffPAGPipeline
- KolorsPAGPipeline

If you're interested in helping us extend our PAG support for other pipelines, please check out this thread. Special thanks to Ahn Donghoon (@sunovivid), the author of PAG, for helping us with the integration and adding PAG support to SD3.
SparseCtrl introduces methods of controllability into text-to-video diffusion models leveraging signals such as line/edge sketches, depth maps, and RGB images by incorporating an additional condition encoder, inspired by ControlNet, to process these signals in the AnimateDiff framework. It can be applied to a diverse set of applications such as interpolation or video prediction (filling in the gaps between sequence of images for animation), personalized image animation, sketch-to-video, depth-to-video, and more. It was introduced in SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models.
There are two SparseCtrl-specific checkpoints and a Motion LoRA made available by the authors namely:
Scribble Interpolation Example:
<table> <tr> <td><img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-scribble-1.png" alt="Image 1"></td> <td><img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-scribble-2.png" alt="Image 2"></td> <td><img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-scribble-3.png" alt="Image 3"></td> </tr> <tr> <td colspan="3" style="text-align: center; vertical-align: middle;"><img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-sparsectrl-scribble-results.gif" alt="Image 4"></td> </tr> </table>

import torch
from diffusers import AnimateDiffSparseControlNetPipeline, AutoencoderKL, MotionAdapter, SparseControlNetModel
from diffusers.schedulers import DPMSolverMultistepScheduler
from diffusers.utils import export_to_gif, load_image
device = "cuda"
motion_adapter = MotionAdapter.from_pretrained("guoyww/animatediff-motion-adapter-v1-5-3", torch_dtype=torch.float16).to(device)
controlnet = SparseControlNetModel.from_pretrained("guoyww/animatediff-sparsectrl-scribble", torch_dtype=torch.float16).to(device)
vae = AutoencoderKL.from_pretrained("stabilityai/sd-vae-ft-mse", torch_dtype=torch.float16).to(device)
pipe = AnimateDiffSparseControlNetPipeline.from_pretrained(
    "SG161222/Realistic_Vision_V5.1_noVAE",
    motion_adapter=motion_adapter,
    controlnet=controlnet,
    vae=vae,
    torch_dtype=torch.float16,
).to(device)
pipe.scheduler = DPMSolverMultistepScheduler.from_config(pipe.scheduler.config, beta_schedule="linear", algorithm_type="dpmsolver++", use_karras_sigmas=True)
pipe.load_lora_weights("guoyww/animatediff-motion-lora-v1-5-3", adapter_name="motion_lora")
pipe.fuse_lora(lora_scale=1.0)
prompt = "an aerial view of a cyberpunk city, night time, neon lights, masterpiece, high quality"
negative_prompt = "low quality, worst quality, letterboxed"
image_files = [
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-scribble-1.png",
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-scribble-2.png",
"https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/diffusers/animatediff-scribble-3.png"
]
condition_frame_indices = [0, 8, 15]
conditioning_frames = [load_image(img_file) for img_file in image_files]
video = pipe(
    prompt=prompt,
    negative_prompt=negative_prompt,
    num_inference_steps=25,
    conditioning_frames=conditioning_frames,
    controlnet_conditioning_scale=1.0,
    controlnet_frame_indices=condition_frame_indices,
    generator=torch.Generator().manual_seed(1337),
).frames[0]
export_to_gif(video, "output.gif")
📜 Check out the docs here.
FreeNoise is a training-free method that allows extending the generative capabilities of pretrained video diffusion models beyond their existing context/frame limits.
Instead of initializing noises for all frames, FreeNoise reschedules a sequence of noises for long-range correlation and performs temporal attention over them using a window-based function. We have added FreeNoise to the AnimateDiff family of models in Diffusers, allowing them to generate videos beyond their default 32 frame limit.
import torch
from diffusers import AnimateDiffPipeline, MotionAdapter, EulerAncestralDiscreteScheduler
from diffusers.utils import export_to_gif
adapter = MotionAdapter.from_pretrained("guoyww/animatediff-motion-adapter-v1-5-2", torch_dtype=torch.float16)
pipe = AnimateDiffPipeline.from_pretrained("SG161222/Realistic_Vision_V6.0_B1_noVAE", motion_adapter=adapter, torch_dtype=torch.float16)
pipe.scheduler = EulerAncestralDiscreteScheduler(
beta_schedule="linear",
beta_start=0.00085,
beta_end=0.012,
)
pipe.enable_free_noise()
pipe.vae.enable_slicing()
pipe.enable_model_cpu_offload()
frames = pipe(
"An astronaut riding a horse on Mars.",
num_frames=64,
num_inference_steps=20,
guidance_scale=7.0,
decode_chunk_size=2,
).frames[0]
export_to_gif(frames, "freenoise-64.gif")
We have significantly refactored the loader classes associated with LoRA. Going forward, this will help in adding LoRA support for new pipelines and models. We now have a LoraBaseMixin class which is subclassed by the different pipeline-level LoRA loading classes such as StableDiffusionXLLoraLoaderMixin. This document provides an overview of the available classes.
Additionally, we have increased the coverage of methods within the PeftAdapterMixin class. This refactoring allows all the supported models to share common LoRA functionalities such as set_adapter(), add_adapter(), and so on.
To learn more details, please follow this PR. If you see any LoRA-related issues stemming from these refactors, please open an issue.
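The shape of the refactor can be pictured with a toy mock-up (method bodies and signatures here are invented for illustration; only the class names mirror the ones mentioned above): shared behavior lives in the base mixin, and pipeline-specific classes only add what differs.

```python
class LoraBaseMixin:
    # Shared, pipeline-agnostic LoRA bookkeeping (illustrative, not the real API).
    def __init__(self):
        self.adapters = {}
        self.active = []

    def set_adapters(self, names):
        missing = [n for n in names if n not in self.adapters]
        if missing:
            raise ValueError(f"unknown adapters: {missing}")
        self.active = list(names)

class StableDiffusionXLLoraLoaderMixin(LoraBaseMixin):
    # A pipeline-specific subclass only adds its loading logic;
    # activation/bookkeeping is inherited from the base mixin.
    def load_lora_weights(self, name, weights):
        self.adapters[name] = weights

loader = StableDiffusionXLLoraLoaderMixin()
loader.load_lora_weights("style", {"rank": 4})
loader.set_adapters(["style"])
```

The payoff is that adding LoRA support for a new pipeline only requires implementing the loading-specific parts, not re-implementing adapter management.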
We discovered that the implementation of fuse_qkv_projections() was broken. This was fixed in this PR. Additionally, this PR added the fusion support to AuraFlow and PixArt Sigma. A reasoning as to where this kind of fusion might be useful is available here.
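Why fusing the QKV projections helps can be seen in a tiny numeric sketch (plain Python, not the Diffusers code): the Q, K, and V projections all consume the same input, so their weight matrices can be stacked and executed as a single larger matmul, trading three small kernel launches for one.

```python
def matvec(w, x):
    # naive matrix-vector product
    return [sum(wi * xi for wi, xi in zip(row, x)) for row in w]

# Small made-up projection weights for illustration.
wq = [[1.0, 0.0], [0.0, 1.0]]
wk = [[2.0, 0.0], [0.0, 2.0]]
wv = [[0.5, 0.5], [0.5, -0.5]]
x = [3.0, 4.0]

# One fused projection: stack the rows of Wq, Wk, Wv into a single matrix.
fused = matvec(wq + wk + wv, x)
q, k, v = fused[:2], fused[2:4], fused[4:]
assert q == matvec(wq, x) and k == matvec(wk, x) and v == matvec(wv, x)
```

The fused result is sliced back into q, k, and v, so the attention math downstream is unchanged.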
- dreambooth_lora by @WenheLI in #8510
- HunyuanCombinedTimestepTextSizeStyleEmbedding by @yiyixuxu in #8761
- philosophy.md that were not reflected on #8294 by @mreraser in #8690
- attention_head_dim for JointTransformerBlock by @yiyixuxu in #8608
- LoraBaseMixin to promote reusability. by @sayakpaul in #8670
- LoraBaseMixin to promote reusability." by @sayakpaul in #8773
- push_to_hub trainers by @apolinario in #8697
- get_timestep_embedding by @alanhdu in #8811
- [Cont'd] Add the SDE variant of train_cm_ct_unconditional.py by @tolgacangoz in #8653
- resume_download from Hub related stuff by @sayakpaul in #8648
- ethical_guidelines.md that were not reflected on #8294 by @mreraser in #8914
- diffusers-cli env by @tolgacangoz in #8408
- LoraLoaderMixin to the inits by @sayakpaul in #8981
- accelerator based gradient accumulation for basic_example by @RandomGamingDev in #8966
- \s+$ by @tolgacangoz in #9008
- get_attention_scores as optional in get_attention_scores by @psychedelicious in #9075
- CLIPFeatureExtractor to CLIPImageProcessor and DPTFeatureExtractor to DPTImageProcessor by @tolgacangoz in #9002
- vae_batch_size to decode_chunk_size by @DN6 in #9110

The following contributors have made significant changes to the library over the last release:
- vae_batch_size to decode_chunk_size (#9110)
- HunyuanCombinedTimestepTextSizeStyleEmbedding (#8761)
- attention_head_dim for JointTransformerBlock (#8608)

import torch
from diffusers import StableDiffusion3ControlNetPipeline
from diffusers.models import SD3ControlNetModel, SD3MultiControlNetModel
from diffusers.utils import load_image
controlnet = SD3ControlNetModel.from_pretrained("InstantX/SD3-Controlnet-Canny", torch_dtype=torch.float16)
pipe = StableDiffusion3ControlNetPipeline.from_pretrained(
"stabilityai/stable-diffusion-3-medium-diffusers", controlnet=controlnet, torch_dtype=torch.float16
)
pipe.to("cuda")
control_image = load_image("https://huggingface.co/InstantX/SD3-Controlnet-Canny/resolve/main/canny.jpg")
prompt = "A girl holding a sign that says InstantX"
image = pipe(prompt, control_image=control_image, controlnet_conditioning_scale=0.7).images[0]
image.save("sd3.png")
📜 Refer to the official docs here to learn more about it.
Thanks to @haofanwang @wangqixun from the @ResearcherXman team for contributing this pipeline!
We now support all available single-file checkpoints for SD3 in Diffusers! To load the single-file checkpoint with T5:
```python
import torch
from diffusers import StableDiffusion3Pipeline

pipe = StableDiffusion3Pipeline.from_single_file(
    "https://huggingface.co/stabilityai/stable-diffusion-3-medium/blob/main/sd3_medium_incl_clips_t5xxlfp8.safetensors",
    torch_dtype=torch.float16,
)
pipe.enable_model_cpu_offload()

image = pipe("a picture of a cat holding a sign that says hello world").images[0]
image.save("sd3-single-file-t5-fp8.png")
```
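Checkpoints that ship without the T5 encoder can reportedly be loaded by dropping the third text encoder. This is a sketch rather than a verified recipe: it assumes `from_single_file` accepts `text_encoder_3=None`, and the caller supplies whichever non-T5 checkpoint URL they are using:

```python
def load_sd3_single_file_no_t5(checkpoint_url):
    """Sketch: load an SD3 single-file checkpoint without the T5 encoder.

    Skipping text_encoder_3 trades some prompt fidelity for a much
    smaller memory footprint.
    """
    import torch
    from diffusers import StableDiffusion3Pipeline

    pipe = StableDiffusion3Pipeline.from_single_file(
        checkpoint_url,
        torch_dtype=torch.float16,
        text_encoder_3=None,
    )
    pipe.enable_model_cpu_offload()
    return pipe
```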
We increased the default sequence length for the T5 text encoder from a maximum of 77 tokens to 256! It can be adjusted to accept fewer or more tokens by setting `max_sequence_length`, up to a maximum of 512. Keep in mind that longer sequences require additional resources and result in longer generation times; this effect is particularly noticeable during batch inference.
```python
prompt = "A whimsical and creative image depicting a hybrid creature that is a mix of a waffle and a hippopotamus. This imaginative creature features the distinctive, bulky body of a hippo, but with a texture and appearance resembling a golden-brown, crispy waffle. The creature might have elements like waffle squares across its skin and a syrup-like sheen. It's set in a surreal environment that playfully combines a natural water habitat of a hippo with elements of a breakfast table setting, possibly including oversized utensils or plates in the background. The image should evoke a sense of playful absurdity and culinary fantasy."

image = pipe(
    prompt=prompt,
    negative_prompt="",
    num_inference_steps=28,
    guidance_scale=4.5,
    max_sequence_length=512,
).images[0]
```
(Image comparison omitted: default vs. `max_sequence_length=256` vs. `max_sequence_length=512`.)
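One hypothetical way to reproduce such a comparison is to sweep `max_sequence_length` while holding every other argument fixed. The helper below only assembles the call kwargs; the actual `pipe(**kwargs)` calls are left to the reader:

```python
def sweep_kwargs(prompt, lengths=(77, 256, 512)):
    """Build one generation-kwargs dict per max_sequence_length setting.

    512 is the documented cap for SD3, so larger values are rejected.
    """
    assert all(n <= 512 for n in lengths), "SD3 caps max_sequence_length at 512"
    return [
        {
            "prompt": prompt,
            "negative_prompt": "",
            "num_inference_steps": 28,
            "guidance_scale": 4.5,
            "max_sequence_length": n,
        }
        for n in lengths
    ]
```

Running `pipe(**kw)` for each entry then yields one image per setting for a side-by-side comparison.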
- accelerate isn't installed. by @DN6 in #8462
This release emphasizes Stable Diffusion 3, Stability AI’s latest iteration of the Stable Diffusion family of models. It was introduced in Scaling Rectified Flow Transformers for High-Resolution Image Synthesis by Patrick Esser, Sumith Kulal, Andreas Blattmann, Rahim Entezari, Jonas Müller, Harry Saini, Yam Levi, Dominik Lorenz, Axel Sauer, Frederic Boesel, Dustin Podell, Tim Dockhorn, Zion English, Kyle Lacey, Alex Goodwin, Yannik Marek, and Robin Rombach.
As the model is gated, before using it with diffusers, you first need to go to the Stable Diffusion 3 Medium Hugging Face page, fill in the form and accept the gate. Once you are in, you need to log in so that your system knows you’ve accepted the gate.
```shell
huggingface-cli login
```
The code below shows how to perform text-to-image generation with SD3:
```python
import torch
from diffusers import StableDiffusion3Pipeline

pipe = StableDiffusion3Pipeline.from_pretrained("stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16)
pipe = pipe.to("cuda")

image = pipe(
    "A cat holding a sign that says hello world",
    negative_prompt="",
    num_inference_steps=28,
    guidance_scale=7.0,
).images[0]
image
```
Refer to our documentation to learn about all the optimizations you can apply to SD3, as well as the image-to-image pipeline.
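For the image-to-image pipeline, a minimal sketch could look like the following. It assumes the `StableDiffusion3Img2ImgPipeline` class and a `strength` parameter that behaves as in earlier Stable Diffusion image-to-image pipelines; model loading is kept inside a function so nothing is downloaded at import time:

```python
def sd3_img2img(prompt, init_image_url, strength=0.6):
    """Sketch: SD3 image-to-image.

    Lower strength preserves more of the init image; higher strength
    gives the prompt more influence.
    """
    import torch
    from diffusers import StableDiffusion3Img2ImgPipeline
    from diffusers.utils import load_image

    pipe = StableDiffusion3Img2ImgPipeline.from_pretrained(
        "stabilityai/stable-diffusion-3-medium-diffusers", torch_dtype=torch.float16
    ).to("cuda")
    init_image = load_image(init_image_url)
    return pipe(prompt, image=init_image, strength=strength).images[0]
```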
Additionally, we support DreamBooth + LoRA fine-tuning of Stable Diffusion 3 through rectified flow. Check out this directory for more details.
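Once DreamBooth + LoRA fine-tuning has finished, the resulting weights can be attached to a pipeline with `load_lora_weights`. The path below is a placeholder for wherever your training run saved its weights (a local directory or a Hub repo id):

```python
def load_finetuned_lora(pipe, lora_path):
    """Sketch: attach DreamBooth-LoRA weights to an SD3 pipeline.

    `lora_path` is a placeholder; any loaded StableDiffusion3Pipeline
    that supports LoRA loading can be passed as `pipe`.
    """
    pipe.load_lora_weights(lora_path)
    return pipe
```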