Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,027 workflow runs
1,027 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

fix requeue handler
CI #127: Commit ce6b7df pushed by guipenedo
February 2, 2024 18:09 1m 44s main
February 2, 2024 18:09 1m 44s
Adds writer adapter
CI #126: Pull request #83 opened by guipenedo
February 2, 2024 16:44 1m 50s writer-adapter
February 2, 2024 16:44 1m 50s
Improve parallelization in MinhashDedupBuckets
CI #125: Pull request #82 opened by guipenedo
February 2, 2024 16:32 1m 55s parallel_minhash_buckets
February 2, 2024 16:32 1m 55s
Requeue job automatically when specific signals are caught (#81)
CI #124: Commit 321e068 pushed by guipenedo
February 2, 2024 16:29 1m 51s main
February 2, 2024 16:29 1m 51s
Requeue job automatically when specific signals are caught
CI #123: Pull request #81 opened by guipenedo
February 2, 2024 16:29 1m 48s requeue-signals
February 2, 2024 16:29 1m 48s
added upload_block_size parameter (#79)
CI #122: Commit 8b52ab0 pushed by guipenedo
February 2, 2024 16:12 1m 42s main
February 2, 2024 16:12 1m 42s
Update exact_substrings.py
CI #121: Pull request #73 synchronize by jordane95
February 2, 2024 04:41 1m 46s jordane95:jordane95-patch-1
February 2, 2024 04:41 1m 46s
catch codec error in jsonlreader (#78)
CI #119: Commit bdac443 pushed by guipenedo
February 1, 2024 15:49 1m 47s main
February 1, 2024 15:49 1m 47s
Added upload_block_size parameter
CI #118: Pull request #79 opened by guipenedo
February 1, 2024 15:27 1m 47s merger-upload-block-size
February 1, 2024 15:27 1m 47s
catch codec error in jsonlreader
CI #117: Pull request #78 opened by guipenedo
February 1, 2024 15:15 1m 47s jsonl-codec
February 1, 2024 15:15 1m 47s
fix huggingfacereader
CI #113: Commit 685fdc6 pushed by guipenedo
January 29, 2024 11:24 1m 52s main
January 29, 2024 11:24 1m 52s
fix tests
CI #112: Commit 8de4594 pushed by guipenedo
January 29, 2024 10:26 2m 19s main
January 29, 2024 10:26 2m 19s
Bump fsspec version (#68)
CI #111: Commit 190857a pushed by guipenedo
January 27, 2024 22:23 1m 28s main
January 27, 2024 22:23 1m 28s
Bump fsspec version
CI #110: Pull request #68 opened by 0xh3x
January 27, 2024 17:20 2m 8s 0xh3x:patch-1
January 27, 2024 17:20 2m 8s
Improved documentation
CI #109: Pull request #65 synchronize by guipenedo
January 25, 2024 16:20 2m 20s docs2
January 25, 2024 16:20 2m 20s
Improved documentation
CI #108: Pull request #65 opened by guipenedo
January 25, 2024 15:52 2m 22s docs2
January 25, 2024 15:52 2m 22s
added tokenize_from_hf_to_s3 (#63)
CI #107: Commit 463c2a3 pushed by guipenedo
January 24, 2024 14:32 2m 16s main
January 24, 2024 14:32 2m 16s
added tokenize_from_hf_to_s3
CI #106: Pull request #63 synchronize by guipenedo
January 24, 2024 14:30 2m 27s tokenize_from_hf_example
January 24, 2024 14:30 2m 27s
added tokenize_from_hf_to_s3
CI #105: Pull request #63 synchronize by guipenedo
January 24, 2024 13:11 2m 17s tokenize_from_hf_example
January 24, 2024 13:11 2m 17s
added tokenize_from_hf_to_s3
CI #104: Pull request #63 opened by guipenedo
January 24, 2024 13:09 2m 45s tokenize_from_hf_example
January 24, 2024 13:09 2m 45s
fix sentence dedup example
CI #103: Commit e471e2d pushed by guipenedo
January 24, 2024 09:22 2m 23s main
January 24, 2024 09:22 2m 23s
build: replace setup.py with pyproject.toml (#59)
CI #102: Commit d872e4f pushed by guipenedo
January 23, 2024 16:36 2m 22s main
January 23, 2024 16:36 2m 22s
bugfix setting metadata on empty document
CI #100: Commit 9bef798 pushed by guipenedo
January 23, 2024 16:19 2m 8s main
January 23, 2024 16:19 2m 8s
ProTip! You can narrow down the results and go further in time using created:<2024-01-23 or the other filters available.