releases.shpreview

v1.4.0

v1.4.0: `torchao` FP8, TP & dataLoader support, fix memory leak

$npx -y @buildinternet/releases show rel_dvyJ6i8gyFTWgLPHHQ-1g

torchao FP8, initial Tensor Parallel support, and memory leak fixes

torchao FP8

This release introduces a new FP8 API and brings in a new backend: torchao. To use, pass in AORecipeKwargs to the Accelerator while setting mixed_precision="fp8". This is initial support, as it matures we will incorporate more into it (such as accelerate config/yaml) in future releases. See our benchmark examples here

TensorParallel

We have intial support for an in-house solution to TP when working with accelerate dataloaders. check out the PR here

Bug fixes

What's Changed

New Contributors

Full Changelog: https://github.com/huggingface/accelerate/compare/v1.3.0...v1.4.0

Fetched April 7, 2026