Skip to content

Actions: huggingface/datatrove

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
1,280 workflow runs
1,280 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

changed ftfy defaults (#319)
Test & Check Code Quality #370: Commit 0c891f6 pushed by guipenedo
January 8, 2025 20:13 2m 48s main
January 8, 2025 20:13 2m 48s
Changed FTFY defaults
Test & Check Code Quality #369: Pull request #319 opened by guipenedo
January 8, 2025 20:11 2m 49s ftfy
January 8, 2025 20:11 2m 49s
changed ftfy defaults
Secret Leaks #194: Commit eab6a2f pushed by guipenedo
January 7, 2025 19:04 17s ftfy
January 7, 2025 19:04 17s
fix(utils): Enhance the dependencies check to include pip distribution
Test & Check Code Quality #368: Pull request #317 opened by aiqwe
January 6, 2025 14:29 5m 7s aiqwe:main
January 6, 2025 14:29 5m 7s
manually handle process management
Test & Check Code Quality #367: Commit 2548cdf pushed by guipenedo
January 2, 2025 19:15 2m 44s main
January 2, 2025 19:15 2m 44s
manually handle process management
Secret Leaks #193: Commit 2548cdf pushed by guipenedo
January 2, 2025 19:15 22s main
January 2, 2025 19:15 22s
handle processpool breaking
Test & Check Code Quality #366: Commit e1c7cec pushed by guipenedo
January 2, 2025 01:28 2m 44s main
January 2, 2025 01:28 2m 44s
handle processpool breaking
Secret Leaks #192: Commit e1c7cec pushed by guipenedo
January 2, 2025 01:28 17s main
January 2, 2025 01:28 17s
switch to processpoolexecutor to be able to properly kill runaway docs
Secret Leaks #191: Commit 6e9af63 pushed by guipenedo
January 1, 2025 11:31 18s main
January 1, 2025 11:31 18s
switch to processpoolexecutor to be able to properly kill runaway docs
Test & Check Code Quality #365: Commit 6e9af63 pushed by guipenedo
January 1, 2025 11:31 2m 37s main
January 1, 2025 11:31 2m 37s
Add open-source text extraction libraries
Test & Check Code Quality #364: Pull request #293 synchronize by garrethlee
December 28, 2024 02:58 2m 37s feat/text-extraction
December 28, 2024 02:58 2m 37s
changed trafilatura default args
Secret Leaks #190: Commit f816913 pushed by garrethlee
December 28, 2024 02:58 18s feat/text-extraction
December 28, 2024 02:58 18s
fix default compression
Secret Leaks #189: Commit 47379fd pushed by guipenedo
December 26, 2024 11:07 21s main
December 26, 2024 11:07 21s
fix default compression
Test & Check Code Quality #363: Commit 47379fd pushed by guipenedo
December 26, 2024 11:07 2m 35s main
December 26, 2024 11:07 2m 35s
Add open-source text extraction libraries
Test & Check Code Quality #362: Pull request #293 synchronize by guipenedo
December 26, 2024 11:01 2m 46s feat/text-extraction
December 26, 2024 11:01 2m 46s
nit
Secret Leaks #188: Commit aae7e33 pushed by guipenedo
December 26, 2024 11:01 17s feat/text-extraction
December 26, 2024 11:01 17s
Add open-source text extraction libraries
Test & Check Code Quality #361: Pull request #293 synchronize by garrethlee
December 22, 2024 02:35 2m 52s feat/text-extraction
December 22, 2024 02:35 2m 52s
improved test case robustness
Secret Leaks #187: Commit cd18c59 pushed by garrethlee
December 22, 2024 02:35 15s feat/text-extraction
December 22, 2024 02:35 15s
Add open-source text extraction libraries
Test & Check Code Quality #360: Pull request #293 synchronize by garrethlee
December 22, 2024 00:01 3m 21s feat/text-extraction
December 22, 2024 00:01 3m 21s
Add open-source text extraction libraries
Test & Check Code Quality #359: Pull request #293 synchronize by garrethlee
December 21, 2024 23:51 2m 59s feat/text-extraction
December 21, 2024 23:51 2m 59s
remove additional brotlipy in pyproject
Secret Leaks #185: Commit 751ec13 pushed by garrethlee
December 21, 2024 23:50 21s feat/text-extraction
December 21, 2024 23:50 21s
Add open-source text extraction libraries
Test & Check Code Quality #358: Pull request #293 synchronize by garrethlee
December 21, 2024 23:37 2m 53s feat/text-extraction
December 21, 2024 23:37 2m 53s
style: fixed ruff format errors
Secret Leaks #184: Commit 2ac81d5 pushed by garrethlee
December 21, 2024 23:37 29s feat/text-extraction
December 21, 2024 23:37 29s
Add open-source text extraction libraries
Test & Check Code Quality #357: Pull request #293 synchronize by garrethlee
December 21, 2024 23:35 2m 45s feat/text-extraction
December 21, 2024 23:35 2m 45s