Add blocksize to DocumentDataset.read_*
that uses dask_cudf.read_*
#285
Merged
sarahyurick merged 28 commits intoNVIDIA:mainfrom praateekmahajan:praateek/try-dask-cudf-read-jsonDec 17, 2024
+814-56
Commits
Commits on Oct 8, 2024
- committed
Commits on Nov 15, 2024
- committed
- committed
Commits on Nov 18, 2024
Commits on Nov 19, 2024
- committed
- committed
- committed
Commits on Nov 20, 2024
Commits on Nov 22, 2024
- committed
- committed
- committed
- committed
Commits on Dec 6, 2024
Commits on Dec 13, 2024
Commits on Dec 15, 2024
Commits on Dec 16, 2024
Merge branch 'main' of github.com:praateekmahajan/NeMo-Curator into praateek/try-dask-cudf-read-json
committed- committed
- committed
- committed