You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Yeah, this would be rad. It does seem like it comes down to adding one or two lines with ggml_alibi. The only concern right now is that we're not using a KV cache, so going up to L=8192 is probably going to be infeasible due to memory. But I do plan on adding in KV cache soon. Let me see if I can get it working in the meantime.
I seems to get jina embeddings convert successfully with
Seems the only change to original bert is
ALiBi
, as described in https://huggingface.co/jinaai/jina-embeddings-v2-base-codeIt'll be nice if we could adapt
ggml_alibi
into this repo for jina embedding supportThe text was updated successfully, but these errors were encountered: