Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add Swedn and VG clustering datasets #96

Merged
merged 19 commits into from
Jan 25, 2024
Merged

Conversation

KennethEnevoldsen
Copy link
Owner

This pull request adds two new clustering datasets: SwednClustering and VGSummarizationClustering. It also includes performance tests for different example sizes.

Copy link
Owner Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

in its own file for now. @x-tabdeveloping I believe we can restructure this to a folder. E.g. mteb_tasks/clustering.py

src/seb/registered_tasks/mteb_tasks_clustering.py Outdated Show resolved Hide resolved
src/seb/registered_tasks/mteb_tasks_clustering.py Outdated Show resolved Hide resolved
@KennethEnevoldsen KennethEnevoldsen removed the request for review from Muennighoff January 25, 2024 14:47
@KennethEnevoldsen KennethEnevoldsen merged commit 8537e12 into main Jan 25, 2024
4 of 6 checks passed
@KennethEnevoldsen KennethEnevoldsen deleted the add-swedn-clustering branch January 25, 2024 15:25
This was referenced Jan 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants