Skip to content

KennethEnevoldsen/scandinavian-embedding-benchmark

Repository files navigation

Scandinavian Embedding Benchmark

PyPI Python Version documentation Tests Ruff DOI

A benchmark for evaluating sentence/document embeddings of Scandinavian language models.

Installation

You can install the Scandinavian Embedding Benchmark (seb) via pip from PyPI:

pip install seb

To see more examples, see the documentation.

📖 Documentation

Documentation
🔧 Installation Installation instructions on how to install this package
👩‍💻 Usage Introduction on how to use the package
📖 Documentation A minimal and developing documentation

💬 Where to ask questions

Type
🚨 Bug Reports GitHub Issue Tracker
🎁 Feature Requests & Ideas GitHub Issue Tracker
👩‍💻 Usage Questions GitHub Discussions
🗯 General Discussion GitHub Discussions

Citation

To cite this work please refer to the following work accepted at neurips:

Enevoldsen, K., Kardos, M., Muennighoff, N., & Nielbo, K. (2024). The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding. In Advances in Neural Information Processing Systems

or use the following BibTeX:

@inproceedings{enevoldsen2024scandinavian,
  title={The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding},
  author={Enevoldsen, Kenneth and Kardos, M{\'a}rton and Muennighoff, Niklas and Nielbo, Kristoffer},
  booktitle={Advances in Neural Information Processing Systems},
  year={2024},
  url={https://nips.cc/virtual/2024/poster/97869}
}