releases.shpreview

2.10.0

February 22, 2023DatasetsView original ↗
$npx -y @buildinternet/releases show rel_Yx0ZHZ8SQoDgJU_EKLKv_

Important

  • Avoid saving sparse ChunkedArrays in pyarrow tables by @marioga in https://github.com/huggingface/datasets/pull/5542
    • Big improvements on the speed of .flatten_indices() (x2) + save/load_from_disk (x100) on selected/shuffled datasets
  • Skip dataset verifications by default by @mariosasko in https://github.com/huggingface/datasets/pull/5303
    • introduces multiple verification_mode you can pass to `load_dataset()):
    • the new default verification steps are much faster (no need to compute expensive checksums)

Datasets features

Documentation

General improvements and bug fixes

New Contributors

Full Changelog: https://github.com/huggingface/datasets/compare/2.9.0...ef

Fetched April 7, 2026