Update dependency datasets to v2.16.1 - autoclosed #286
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
2.15.0
->2.16.1
Release Notes
huggingface/datasets (datasets)
v2.16.1
Compare Source
Bug fixes
cache_dir
toload_dataset
load_dataset("ted_talks_iwslt", language_pair=("ja", "en"), year="2015")
Full Changelog: huggingface/datasets@2.16.0...2.16.1
v2.16.0
Compare Source
Security features
https://hf.co/datasets/<repo_id>
. A warning is shown to let the user know about the custom code, and they can avoid this message in future by passing the argumenttrust_remote_code=True
.trust_remote_code=True
will be mandatory to load these datasets from the next major release ofdatasets
.HF_DATASETS_TRUST_REMOTE_CODE=0
you can already disable custom code by default without waiting for the next release ofdatasets
https://hf.co/datasets/<repo_id>/tree/refs%2Fconvert%2Fparquet
Features
load_dataset
step that lists the data files of big repositories (up to x100) but requireshuggingface_hub
0.20 or newerload_dataset
that used to reload data from cache even if the dataset was updated on Hugging Face~/.cache/huggingface/datasets/username___dataset_name/config_name/version/commit_sha
datasets
2.15 (using the old scheme) are still reloaded from cacheGeneral improvements and bug fixes
_get_data_files_patterns
by @lhoestq in https://github.com/huggingface/datasets/pull/6343usedforsecurity=False
in hashlib methods (FIPS compliance) by @Wauplin in https://github.com/huggingface/datasets/pull/6414ruff
for formatting by @mariosasko in https://github.com/huggingface/datasets/pull/6434tqdm
wrapper by @mariosasko in https://github.com/huggingface/datasets/pull/6433Table.__getstate__
andTable.__setstate__
by @LZHgrla in https://github.com/huggingface/datasets/pull/6444filelock
package for file locking by @mariosasko in https://github.com/huggingface/datasets/pull/6445**
by @mariosasko in https://github.com/huggingface/datasets/pull/6449dill
logic by @mariosasko in https://github.com/huggingface/datasets/pull/6454push_to_hub
by @mariosasko in https://github.com/huggingface/datasets/pull/6461__repr__
by @lhoestq in https://github.com/huggingface/datasets/pull/6480torch.Generator
objects by @mariosasko in https://github.com/huggingface/datasets/pull/6502list_files_info
withlist_repo_tree
inpush_to_hub
by @mariosasko in https://github.com/huggingface/datasets/pull/6510New Contributors
Full Changelog: huggingface/datasets@2.15.0...2.16.0
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR has been generated by Mend Renovate. View repository job log here.