
supports jinaai/jina-embeddings-v2-base-code #8

Open
wsxiaoys opened this issue Feb 5, 2024 · 1 comment

Comments


wsxiaoys commented Feb 5, 2024

I seem to have converted the jina embeddings successfully with

python bert_cpp/convert.py jinaai/jina-embeddings-v2-base-code models/jina-f16.gguf

It seems the only change relative to the original BERT architecture is ALiBi, as described in https://huggingface.co/jinaai/jina-embeddings-v2-base-code

It would be nice if we could adapt ggml_alibi into this repo to support the jina embeddings.
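For reference, the ALiBi bias is just a slope-scaled distance penalty added to the raw attention scores, which is why the change should be small. A minimal NumPy sketch of what that bias looks like (assuming the symmetric `|i - j|` form used by bidirectional encoders; the head count and sequence length here are purely illustrative):

```python
import numpy as np

def alibi_bias(n_heads: int, seq_len: int) -> np.ndarray:
    """Symmetric ALiBi bias tensor of shape (n_heads, seq_len, seq_len)."""
    # Head-specific slopes: geometric sequence 2^(-8*1/n), 2^(-8*2/n), ...
    # (the standard ALiBi recipe when n_heads is a power of two)
    slopes = 2.0 ** (-8.0 * np.arange(1, n_heads + 1) / n_heads)
    # Pairwise token distances |i - j|
    pos = np.arange(seq_len)
    dist = np.abs(pos[None, :] - pos[:, None])
    # Per-head bias: subtract the slope-scaled distance from the scores
    return -slopes[:, None, None] * dist[None, :, :]

bias = alibi_bias(n_heads=8, seq_len=4)
print(bias.shape)  # (8, 4, 4)
```

This tensor would be added to the query-key score matrix before the softmax, in place of (not in addition to) learned position embeddings.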

iamlemec (Owner) commented Feb 6, 2024

Yeah, this would be rad. It does seem like it comes down to adding one or two lines with ggml_alibi. The only concern right now is that we're not using a KV cache, so going up to L=8192 is probably going to be infeasible due to memory. But I do plan on adding in KV cache soon. Let me see if I can get it working in the meantime.
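For a rough sense of why L = 8192 is a memory concern, here is some back-of-the-envelope arithmetic, assuming the full attention score matrices are materialized in f32 and a BERT-base-like shape (12 heads; both figures are illustrative assumptions, not measurements of this repo):

```python
# Memory for one layer's attention score matrices at L = 8192
L = 8192          # sequence length
n_heads = 12      # BERT-base-like head count (assumption)
bytes_f32 = 4     # f32 scores

per_head = L * L * bytes_f32       # one L x L score matrix: 256 MiB
per_layer = per_head * n_heads     # all heads in one layer: 3 GiB
print(per_head / 2**20, "MiB per head")
print(per_layer / 2**30, "GiB per layer")
```

So even a single layer's scores at full context run into gigabytes, which is why chunking or a cache-style strategy would be needed before the full 8192 context is practical.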
