load_dataset("csv", data_files={"train": [url_to_one_csv_file, url_to_another_csv_file...]})
This works for all these dataset loaders:
streaming=True. Main contributions:
filter with multiprocessing in case all samples are discarded #2601 (@mxschmdt)Fetched April 7, 2026