Hugging Face
Browser-based OAuth login; multi-commit folder uploads revamped
↗This release6 featuresNew capabilities4 enhancementsImprovements to existing features1 fixBug fixesAI-tallied from the release noteshuggingface_hub · v1.20.0
Browser-based OAuth Device Code flow is now the default login method, and hf upload uses a streamed multi-commit pipeline for faster, resumable large folder uploads. Also adds job wait commands, SSH access to running Jobs, and custom-container deployment for Inference Endpoints, with breaking changes around multi-commit uploads and non-detached job exit codes.
New models: MiniMax-M3-VL, PP-OCRv6, Parakeet-RNNT
↗This release3 featuresNew capabilities7 enhancementsImprovements to existing features13 fixesBug fixesAI-tallied from the release notesTransformers · v5.12.0
This release introduces the MiniMax-M3-VL vision-language model, the PP-OCRv6 OCR system, and the Parakeet-RNNT model for speech processing. Several bug fixes and improvements were also made, including changes to CI, stop string matching, and model documentation.
Enterprise organizations can now create service accounts: dedicated, organization-owned identities for programmatic access such as CI/CD pipelines...
A2PO trainer debuts; VLM KTO support; Async GRPO spawns process
↗This release7 featuresNew capabilities5 enhancementsImprovements to existing features2 fixesBug fixesAI-tallied from the release notesTRL · v1.6.0
The release introduces a new experimental A2POTrainer for optimal advantage regression and grants KTO trainer support for vision-language models. The AsyncRolloutWorker now runs in a separate process to avoid GIL contention and potential NCCL watchdog timeouts, along with fixes for aiohttp retries and all-NaN reward columns. Gold distillation trainer now aligns tokens via byte offsets, and SDFT/SDPO leverage the vLLM server for live teacher logprobs. Other features include bidirectional masked importance sampling for IcePop, support for NemotronH and Nemotron 3 Ultra, additional training chat templates, and decoupled self-distillation trainers.
AMD ROCm support added; FSDP2 hardened
↗This release3 featuresNew capabilities16 enhancementsImprovements to existing features24 fixesBug fixesAI-tallied from the release notesAccelerate · v1.14.0
Accelerate now works end-to-end on AMD ROCm devices. This release also includes numerous FSDP2 fixes and quality-of-life improvements, such as correct dtype handling on load, sharding of embeddings/norms, and QLoRA crash prevention.
Trusted Publishers enable keyless CI/CD auth
↗This release3 featuresNew capabilities4 enhancementsImprovements to existing features3 fixesBug fixesAI-tallied from the release noteshuggingface_hub · v1.19.0
Keyless CI/CD authentication is now supported via Trusted Publishers, allowing workflows to authenticate without storing an HF_TOKEN secret. The CLI now accepts hf:// URIs for uploading and downloading, and jobs can expose container ports through the public jobs proxy.
DiffusionGemma and DeepSeek-V3.2 models added
↗This release2 featuresNew capabilities1 enhancementImprovements to existing features30 fixesBug fixesAI-tallied from the release notesTransformers · v5.11.0
New models DiffusionGemma and DeepSeek-V3.2 have been added, featuring optimizations for inference speed and efficient long-context handling. The Kernels API was extended for module fusion and parameter transformation, with added support for fp8/fp4 Triton kernels. Model parallel beam search bugs in Qwen2-VL model families were fixed.
You can now publish to Hugging Face repositories from GitHub, GitLab, or custom CI systems without adding secrets, using workflow identity federation....
Agent traces now load for SFT; streaming shuffle uses multiple shards
↗This release7 featuresNew capabilities1 enhancementImprovements to existing features9 fixesBug fixesAI-tallied from the release notesDatasets · v5.0.0
Agent traces from Claude Code, Pi, Codex and others can now be loaded with load_dataset and parsed to messages for training with TRL. Streaming shuffle now uses multiple input shards by default for better randomization, though this is a breaking change—use max_buffer_input_shards=1 to restore the prior behavior. Also added batch(by_column=...) for robotics datasets, support for Apache Iceberg, TsFile (IoTDB), 3D mesh, and CoNLL formats, plus fixes for Parquet streaming hangs, Lance file handling, and JSON null encoding.
Unified hf cp command; concurrent downloads now safe on broken file locks
↗This release3 featuresNew capabilities1 enhancementImprovements to existing features4 fixesBug fixesAI-tallied from the release noteshuggingface_hub · v1.18.0
A new unified hf cp command handles all file-copy workflows (upload, download, remote-to-remote) with consistent hf:// URI syntax and rsync-style semantics. parse_hf_uri now accepts Hugging Face web URLs directly. Fixed a data corruption bug where concurrent hf_hub_download calls could append to the same .incomplete file on Lustre, GPFS, and some NFS mounts where flock(2) provides no mutual exclusion; downloads now use fresh temporary files instead of resuming incomplete ones, making the download path safe even when file locking is broken. This is a breaking change: resuming previously failed partial downloads is no longer possible.
Transformers · v5.10.2
Fixed a conversion bug for CLIP models that affected downstream models like SAM3.
Gemma4 Unified; Sapiens2; model parallelism hardened
↗This release4 featuresNew capabilities3 enhancementsImprovements to existing features15 fixesBug fixesAI-tallied from the release notesTransformers · v5.10.1
Added Gemma4 12B Unified, an encoder-free multimodal model that projects raw vision and audio inputs directly into language model space; Sapiens2, a vision transformer family for human-centric tasks; DeepSeek-OCR-2 for document understanding; and Mellum, a code-focused mixture-of-experts model. Fixed numerous model parallelism bugs across tensor and expert parallelism, beam search under parallel settings, and loss over-counting; also fixed encoder-decoder cache initialization regression and BitsAndBytes quantization tensor-dropping bug.
Remote WasmExecutor removed; Exa search engine added
↗This release1 featureNew capabilities3 enhancementsImprovements to existing features1 fixBug fixesAI-tallied from the release notessmolagents · v1.26.0
Remote WasmExecutor has been removed. Exa is now available as a search engine option in WebSearchTool. Also includes documentation and typo cleanups, and a docstring update for LocalPythonExecutor.
Copy files across repos; SSH into Spaces; CLI tables auto-fit terminal width
↗This release5 featuresNew capabilities3 enhancementsImprovements to existing features3 fixesBug fixesAI-tallied from the release noteshuggingface_hub · v1.17.0
You can now copy files or entire folders between different repositories without downloading or re-uploading, with LFS blobs deduplicated server-side. Added hf spaces ssh to open SSH sessions directly into a Space's Dev Mode container, and hf repos ls to list all your repositories with storage size and usage sorting. CLI tables now auto-fit to terminal width with per-column width caps and automatically right-align numeric columns.
A new Base only toggle on the Models page hides every finetune, adapter, merge, and quantization, leaving just the original base models. The **Oth...
Trainer telemetry now allowlisted
↗This release1 enhancementImprovements to existing featuresAI-tallied from the release notesTRL · v1.5.1
Trainer telemetry is now gated on an explicit class-name allowlist, restricting which trainer classes can send telemetry.