releases.shpreview
Home/Hugging Face
Hugging Face

Hugging Face

Browser-based OAuth login; multi-commit folder uploads revamped

This release6 featuresNew capabilities4 enhancementsImprovements to existing features1 fixBug fixesAI-tallied from the release notes
huggingface_hub · v1.20.0

Browser-based OAuth Device Code flow is now the default login method, and hf upload uses a streamed multi-commit pipeline for faster, resumable large folder uploads. Also adds job wait commands, SSH access to running Jobs, and custom-container deployment for Inference Endpoints, with breaking changes around multi-commit uploads and non-detached job exit codes.

Read more →

New models: MiniMax-M3-VL, PP-OCRv6, Parakeet-RNNT

This release3 featuresNew capabilities7 enhancementsImprovements to existing features13 fixesBug fixesAI-tallied from the release notes
Transformers · v5.12.0

This release introduces the MiniMax-M3-VL vision-language model, the PP-OCRv6 OCR system, and the Parakeet-RNNT model for speech processing. Several bug fixes and improvements were also made, including changes to CI, stop string matching, and model documentation.

Read more →

A2PO trainer debuts; VLM KTO support; Async GRPO spawns process

This release7 featuresNew capabilities5 enhancementsImprovements to existing features2 fixesBug fixesAI-tallied from the release notes
TRL · v1.6.0

The release introduces a new experimental A2POTrainer for optimal advantage regression and grants KTO trainer support for vision-language models. The AsyncRolloutWorker now runs in a separate process to avoid GIL contention and potential NCCL watchdog timeouts, along with fixes for aiohttp retries and all-NaN reward columns. Gold distillation trainer now aligns tokens via byte offsets, and SDFT/SDPO leverage the vLLM server for live teacher logprobs. Other features include bidirectional masked importance sampling for IcePop, support for NemotronH and Nemotron 3 Ultra, additional training chat templates, and decoupled self-distillation trainers.

Read more →

AMD ROCm support added; FSDP2 hardened

This release3 featuresNew capabilities16 enhancementsImprovements to existing features24 fixesBug fixesAI-tallied from the release notes
Accelerate · v1.14.0

Accelerate now works end-to-end on AMD ROCm devices. This release also includes numerous FSDP2 fixes and quality-of-life improvements, such as correct dtype handling on load, sharding of embeddings/norms, and QLoRA crash prevention.

Read more →

Trusted Publishers enable keyless CI/CD auth

This release3 featuresNew capabilities4 enhancementsImprovements to existing features3 fixesBug fixesAI-tallied from the release notes
huggingface_hub · v1.19.0

Keyless CI/CD authentication is now supported via Trusted Publishers, allowing workflows to authenticate without storing an HF_TOKEN secret. The CLI now accepts hf:// URIs for uploading and downloading, and jobs can expose container ports through the public jobs proxy.

Read more →

DiffusionGemma and DeepSeek-V3.2 models added

This release2 featuresNew capabilities1 enhancementImprovements to existing features30 fixesBug fixesAI-tallied from the release notes
Transformers · v5.11.0

New models DiffusionGemma and DeepSeek-V3.2 have been added, featuring optimizations for inference speed and efficient long-context handling. The Kernels API was extended for module fusion and parameter transformation, with added support for fp8/fp4 Triton kernels. Model parallel beam search bugs in Qwen2-VL model families were fixed.

Read more →

Agent traces now load for SFT; streaming shuffle uses multiple shards

This release7 featuresNew capabilities1 enhancementImprovements to existing features9 fixesBug fixesAI-tallied from the release notes
Datasets · v5.0.0

Agent traces from Claude Code, Pi, Codex and others can now be loaded with load_dataset and parsed to messages for training with TRL. Streaming shuffle now uses multiple input shards by default for better randomization, though this is a breaking change—use max_buffer_input_shards=1 to restore the prior behavior. Also added batch(by_column=...) for robotics datasets, support for Apache Iceberg, TsFile (IoTDB), 3D mesh, and CoNLL formats, plus fixes for Parquet streaming hangs, Lance file handling, and JSON null encoding.

Read more →

Unified hf cp command; concurrent downloads now safe on broken file locks

This release3 featuresNew capabilities1 enhancementImprovements to existing features4 fixesBug fixesAI-tallied from the release notes
huggingface_hub · v1.18.0

A new unified hf cp command handles all file-copy workflows (upload, download, remote-to-remote) with consistent hf:// URI syntax and rsync-style semantics. parse_hf_uri now accepts Hugging Face web URLs directly. Fixed a data corruption bug where concurrent hf_hub_download calls could append to the same .incomplete file on Lustre, GPFS, and some NFS mounts where flock(2) provides no mutual exclusion; downloads now use fresh temporary files instead of resuming incomplete ones, making the download path safe even when file locking is broken. This is a breaking change: resuming previously failed partial downloads is no longer possible.

Read more →

Gemma4 Unified; Sapiens2; model parallelism hardened

This release4 featuresNew capabilities3 enhancementsImprovements to existing features15 fixesBug fixesAI-tallied from the release notes
Transformers · v5.10.1

Added Gemma4 12B Unified, an encoder-free multimodal model that projects raw vision and audio inputs directly into language model space; Sapiens2, a vision transformer family for human-centric tasks; DeepSeek-OCR-2 for document understanding; and Mellum, a code-focused mixture-of-experts model. Fixed numerous model parallelism bugs across tensor and expert parallelism, beam search under parallel settings, and loss over-counting; also fixed encoder-decoder cache initialization regression and BitsAndBytes quantization tensor-dropping bug.

Read more →

Remote WasmExecutor removed; Exa search engine added

This release1 featureNew capabilities3 enhancementsImprovements to existing features1 fixBug fixesAI-tallied from the release notes
smolagents · v1.26.0

Remote WasmExecutor has been removed. Exa is now available as a search engine option in WebSearchTool. Also includes documentation and typo cleanups, and a docstring update for LocalPythonExecutor.

Read more →

Copy files across repos; SSH into Spaces; CLI tables auto-fit terminal width

This release5 featuresNew capabilities3 enhancementsImprovements to existing features3 fixesBug fixesAI-tallied from the release notes
huggingface_hub · v1.17.0

You can now copy files or entire folders between different repositories without downloading or re-uploading, with LFS blobs deduplicated server-side. Added hf spaces ssh to open SSH sessions directly into a Space's Dev Mode container, and hf repos ls to list all your repositories with storage size and usage sorting. CLI tables now auto-fit to terminal width with per-column width caps and automatically right-align numeric columns.

Read more →

Trainer telemetry now allowlisted

This release1 enhancementImprovements to existing featuresAI-tallied from the release notes
TRL · v1.5.1

Trainer telemetry is now gated on an explicit class-name allowlist, restricting which trainer classes can send telemetry.

Read more →
Last Checked
1h ago
Tracking since May 31, 2019