concatenate_datasets for iterable datasets by @lhoestq in https://github.com/huggingface/datasets/pull/4500metadata.jsonl from parent directories in imagefolder @mariosasko in https://github.com/huggingface/datasets/pull/4576ArrowWriter.write_batch when batch is empty by @alvarobartt in https://github.com/huggingface/datasets/pull/4510batch_size parameter when calling add_faiss_index and add_faiss_index_from_external_arrays by @alvarobartt in https://github.com/huggingface/datasets/pull/4535load_dataset by @mariosasko in https://github.com/huggingface/datasets/pull/4577_arrow_to_datasets_dtype conversion by @mariosasko in https://github.com/huggingface/datasets/pull/4628assertEqual with assertTupleEqual in unit tests for verbosity by @alvarobartt in https://github.com/huggingface/datasets/pull/4496embed_storage on features inside lists/sequences by @mariosasko in https://github.com/huggingface/datasets/pull/4615from_pandas more robust by @mariosasko in https://github.com/huggingface/datasets/pull/4703DatasetInfo/Features by @mariosasko in https://github.com/huggingface/datasets/pull/4741Full Changelog: https://github.com/huggingface/datasets/compare/2.3.2...2.4.0
Fetched April 7, 2026