Actions: huggingface/datatrove

Showing runs from all workflows
1,034 workflow runs

Add IPC/Feather readers
CI #13: Pull request #45 opened by mariosasko
December 18, 2023 20:01 1m 11s feather-reader
Merge pull request #43 from huggingface/fix-minimal-version
CI #12: Commit c1f3108 pushed by mariosasko
December 18, 2023 19:29 1m 34s main
Batched tokenization and c4 paragraph filters
CI #11: Pull request #44 synchronize by guipenedo
December 18, 2023 19:10 1m 19s batch_tokenize
fix style
CI #10: Commit 1df8e4a pushed by guipenedo
December 18, 2023 18:59 1m 22s main
Batched tokenization and c4 paragraph filters
CI #9: Pull request #44 synchronize by guipenedo
December 18, 2023 18:58 1m 11s batch_tokenize
Batched tokenization and c4 paragraph filters
CI #8: Pull request #44 opened by guipenedo
December 18, 2023 18:54 1m 6s batch_tokenize
small url filter bugfix/cleanup
CI #7: Commit 52d22e8 pushed by guipenedo
December 18, 2023 18:50 1m 22s main
Fix minimal supported Python version
CI #6: Pull request #43 opened by mariosasko
December 18, 2023 18:45 1m 22s fix-minimal-version
started work on formatters and ccnet perplexity
Lint #462: Commit 09c5245 pushed by guipenedo
December 18, 2023 18:44 26s pipeline_blocks_misc
Set minimal Python version to 3.10
Lint #461: Commit b9735fe pushed by mariosasko
December 18, 2023 18:40 24s fix-minimal-version
Misc improvements (#42)
CI #5: Commit 4eb6fcd pushed by guipenedo
December 18, 2023 18:40 1m 14s main
Misc improvements
CI #4: Pull request #42 synchronize by mariosasko
December 17, 2023 18:29 1m 20s misc
recursive was not taken into account in fsspec
Lint #460: Pull request #38 synchronize by thomwolf
December 17, 2023 09:16 21s fix-recursive
recursive was not taken into account in fsspec
Run tests #243: Pull request #38 synchronize by thomwolf
December 17, 2023 09:16 1m 35s fix-recursive
batched tokenization
Lint #459: Commit 4f5cd00 pushed by thomwolf
December 17, 2023 09:16 24s fix-recursive
Optimize ParquetReader (#40)
Lint #458: Commit 46750dd pushed by guipenedo
December 15, 2023 19:44 25s main
Optimize ParquetReader (#40)
Run tests #242: Commit 46750dd pushed by guipenedo
December 15, 2023 19:44 1m 24s main
Misc improvements
CI #3: Pull request #42 synchronize by mariosasko
December 15, 2023 19:04 1m 12s misc
Misc improvements
CI #2: Pull request #42 synchronize by mariosasko
December 15, 2023 19:02 1m 8s misc
Misc improvements
CI #1: Pull request #42 opened by mariosasko
December 15, 2023 19:00 1m 5s misc
Optimize ParquetReader
Lint #457: Pull request #40 synchronize by guipenedo
December 13, 2023 14:44 26s optimize-parquet-reader
Optimize ParquetReader
Run tests #241: Pull request #40 synchronize by guipenedo
December 13, 2023 14:44 1m 31s optimize-parquet-reader
correctly track time
Lint #456: Commit 0ebc066 pushed by guipenedo
December 13, 2023 14:44 22s optimize-parquet-reader
Optimize ParquetReader
Lint #455: Pull request #40 synchronize by mariosasko
December 13, 2023 12:57 26s optimize-parquet-reader
Optimize ParquetReader
Run tests #240: Pull request #40 synchronize by mariosasko
December 13, 2023 12:57 1m 28s optimize-parquet-reader