Commit

Add notebook to show Fineweb ensemble (#536)
* Add notebook to show fineweb ensemble

Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensebmle-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensebmle-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensebmle-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Fix .head() calls based on Sarah's feedback

Signed-off-by: Vibhu Jawa <[email protected]>

* Address Ryan's feedback and add notes

Signed-off-by: Vibhu Jawa <[email protected]>

* Fix minor typos

Signed-off-by: Vibhu Jawa <[email protected]>

* Fix Typo and add Quality Classifier Fast Text object

Signed-off-by: Vibhu Jawa <[email protected]>

* Fix type hint

Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensemble-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensemble-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensemble-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensemble-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

* Link the right classifiers

Signed-off-by: Vibhu Jawa <[email protected]>

* Update tutorials/distributed_data_classification/fineweb-edu-ensemble-classification.ipynb

Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: Vibhu Jawa <[email protected]>

---------

Signed-off-by: Vibhu Jawa <[email protected]>
Co-authored-by: Sarah Yurick <[email protected]>
VibhuJawa and sarahyurick authored Feb 14, 2025
1 parent a5d1a7b commit 0f0cb31
Showing 1 changed file with 1,311 additions and 0 deletions.