docs: Update docs (#2)
* Added tokenlearn blogpost

* Update

* Update

* Added gitignore

* Removed site

---------

Co-authored-by: stephantul <[email protected]>
Pringled and stephantul authored Oct 29, 2024
1 parent d4d0cf3 commit bef738f
Showing 18 changed files with 3 additions and 3,612 deletions.
2 changes: 2 additions & 0 deletions .gitignore
@@ -0,0 +1,2 @@
+_site/*
+.sass-cache/*
4 changes: 1 addition & 3 deletions _posts/2024-10-29-tokenlearn_blogpost.md
@@ -1,14 +1,12 @@
---
layout: post
-title: Tokenlearn blogpost
+title: "POTION: bag of tricks leads to better models"
categories: [Model2Vec]
---

-This blogpost describes the [Tokenlearn](https://github.com/MinishLab/tokenlearn) method, which is a method to pre-train Model2Vec models.
-

# Bag of tricks leads to better models: POTION

We've been brewing, concocting, and distilling, and have come up with a new distillation technique that leads to much better models, which we are now releasing under the name POTION. We are open-sourcing all models, code, and data.

We're releasing three versions: a 64-dim (1.9M params), a 128-dim (3.8M params), and a 256-dim (7.6M params) model, all based on the same base model, which is in turn a bge-base distillation. All POTION models outperform all previous distillations in their size class and should be considered drop-in replacements for our M2V_base_output model. potion-base-8M, in particular, even improves over our largest model, M2V_base_glove. On every task, potion-base-8M outperforms every set of static embeddings we could find, including GloVe, fastText, and specialized word embeddings.
114 changes: 0 additions & 114 deletions _site/404.html

This file was deleted.

112 changes: 0 additions & 112 deletions _site/about/index.html

This file was deleted.

134 changes: 0 additions & 134 deletions _site/archive/index.html

This file was deleted.
