Skip to content

Actions: sarahyurick/NeMo-Curator

GPU CI/CD

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
42 workflow runs
42 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Minor updates to duplicate removal (#570)
GPU CI/CD #42: Commit a080400 pushed by sarahyurick
February 26, 2025 19:21 5s main
February 26, 2025 19:21 5s
Improvements for semantic deduplication and DAPT tutorial (#564)
GPU CI/CD #41: Commit 119edd4 pushed by sarahyurick
February 24, 2025 22:16 5s main
February 24, 2025 22:16 5s
February 20, 2025 20:21 5s
Fix issues with download and extract (#541)
GPU CI/CD #39: Commit 908e0f1 pushed by sarahyurick
February 18, 2025 17:46 6s main
February 18, 2025 17:46 6s
Add notebook to show Fineweb ensemble (#536)
GPU CI/CD #38: Commit 0f0cb31 pushed by sarahyurick
February 14, 2025 21:57 4s main
February 14, 2025 21:57 4s
chore: Version bump (#545)
GPU CI/CD #37: Commit a5d1a7b pushed by sarahyurick
February 12, 2025 23:48 4s main
February 12, 2025 23:48 4s
Add support for Nemotron-CC EDU classifiers (#518)
GPU CI/CD #36: Commit a7fde15 pushed by sarahyurick
February 12, 2025 22:59 5s main
February 12, 2025 22:59 5s
Pin Transformers version >= 4.48.0 (#528)
GPU CI/CD #35: Commit 334a331 pushed by sarahyurick
February 11, 2025 19:06 7s main
February 11, 2025 19:06 7s
Update model nomenclature (#497)
GPU CI/CD #34: Commit 34a1cc6 pushed by sarahyurick
February 7, 2025 17:57 5s main
February 7, 2025 17:57 5s
ci: Version bump to 0.7.0rc1.dev0 (#513)
GPU CI/CD #33: Commit c3fb61d pushed by sarahyurick
February 4, 2025 23:14 4s main
February 4, 2025 23:14 4s
Fix DAPT tutorial (#503)
GPU CI/CD #32: Commit 75234a9 pushed by sarahyurick
January 31, 2025 20:56 4s main
January 31, 2025 20:56 4s
January 30, 2025 22:56 5s
Create notebook tutorials for distributed data classifiers (#415)
GPU CI/CD #30: Commit cd38de0 pushed by sarahyurick
January 24, 2025 21:58 4s main
January 24, 2025 21:58 4s
Create check_dask_cwd function (#484)
GPU CI/CD #29: Commit 57f0e3c pushed by sarahyurick
January 23, 2025 00:00 4s main
January 23, 2025 00:00 4s
[REVIEW] Fix Sem Dedup (#478)
GPU CI/CD #28: Commit 7cfda44 pushed by sarahyurick
January 16, 2025 20:23 4s main
January 16, 2025 20:23 4s
docs: Update CHANGELOG.md (#475)
GPU CI/CD #27: Commit 9c8f185 pushed by sarahyurick
January 10, 2025 20:55 5s main
January 10, 2025 20:55 5s
Make add_filename str/bool (#465)
GPU CI/CD #26: Commit 2d7e857 pushed by sarahyurick
January 7, 2025 19:52 5s main
January 7, 2025 19:52 5s
Reorder import (#460)
GPU CI/CD #25: Commit db411b0 pushed by sarahyurick
January 2, 2025 22:23 5s main
January 2, 2025 22:23 5s
Add tests/test_classifiers.py PyTests (#421)
GPU CI/CD #24: Commit b8ff71e pushed by sarahyurick
December 23, 2024 21:09 5s main
December 23, 2024 21:09 5s
Bug fix in dockerfile ARG vs ENV var (#446)
GPU CI/CD #23: Commit 35b5993 pushed by sarahyurick
December 23, 2024 18:28 4s main
December 23, 2024 18:28 4s
update test params to account for new minhash algo (#442)
GPU CI/CD #22: Commit c929203 pushed by sarahyurick
December 20, 2024 19:00 6s main
December 20, 2024 19:00 6s
December 17, 2024 21:02 6s
Bump RAPIDS stable to 24.12 and RAPIDS nightly to 25.02 (#434)
GPU CI/CD #20: Commit c54826a pushed by sarahyurick
December 17, 2024 20:58 5s main
December 17, 2024 20:58 5s
Add documentation for Instruction-Data-Guard classifier (#398)
GPU CI/CD #19: Commit 86830ab pushed by sarahyurick
December 16, 2024 18:43 7s main
December 16, 2024 18:43 7s
Adding fuzzy and semantic dedupe (#428)
GPU CI/CD #18: Commit 3c3cc98 pushed by sarahyurick
December 13, 2024 22:42 4s main
December 13, 2024 22:42 4s