DeepSeek-R1-Distill-Qwen-32B-GGUF needs the deepseek-r1-qwen tokenizer #1900
Comments
I guess it can be solved by simply changing a commit id? Check c9dfad4.
Then we will need to wait for updates. I guess it won't be too hard to match up the API; perhaps we can just pass the code to GPT and expect it to solve the issue. I'll give it a try if I get some time (busy week).
Thank you!
It is not just a commit synchronization issue. There are many LLAMA_API parts in llama_cpp.py that need to be updated to match the refactored llama.h on llama.cpp master.
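For illustration, the bindings in llama_cpp.py wrap the C API via ctypes, so every declared signature has to mirror llama.h exactly. A minimal sketch of what breaks when llama.h is refactored (names simplified, library loading is more involved in practice):

```python
import ctypes

# Illustrative only: the real module resolves the shared library path itself.
lib = ctypes.CDLL("libllama.so")

# Each binding pins argtypes/restype to the C declaration in llama.h.
# If upstream renames or re-types a function, this must be updated to match,
# which is why bumping the vendored llama.cpp commit alone is not enough.
lib.llama_max_devices.argtypes = []
lib.llama_max_devices.restype = ctypes.c_size_t

print(lib.llama_max_devices())
```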
I am eager to receive the update as well. The DeepSeek R1 model is truly remarkable. I hope this kind of API adaptation can be more automated through an AI coder in the future.
Does llama.cpp itself have this problem?
I guess using llama.cpp directly works fine, since bartowski did the quant with it; check the model URL.
"Deepseek R1 is one of the most amazing and impressive breakthroughs I’ve ever seen — and as open source, a profound gift to the world." - Marc Andreessen Is it a big ask to add support for this model? |
Current Behavior
Can't run DeepSeek-R1-Distill-Qwen-32B-GGUF, I get a tokenizer error:
"llama_model_load: error loading model: error loading model vocabulary: unknown pre-tokenizer type: 'deepseek-r1-qwen'"
Model URL: https://huggingface.co/bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF
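Minimal reproduction, assuming one of the GGUF quants from the URL above has been downloaded locally (the file name below is illustrative):

```python
from llama_cpp import Llama

# Any quant of the model from the URL above triggers the same failure,
# because the bundled llama.cpp does not know this pre-tokenizer type.
llm = Llama(model_path="./DeepSeek-R1-Distill-Qwen-32B-Q4_K_M.gguf")
# Fails during model load with:
#   llama_model_load: error loading model: error loading model vocabulary:
#   unknown pre-tokenizer type: 'deepseek-r1-qwen'
```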
Environment and Context
Hardware: VPS with 32 GB RAM
OS: Debian 12
Using the current version of llama-cpp-python
Suggestion
Upgrade to the required version of llama.cpp:
https://github.com/ggerganov/llama.cpp/releases/tag/b4514
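Once a release bundles llama.cpp b4514 or newer, a quick sanity check might look like this (a sketch; note that __version__ is the Python package version, not the vendored llama.cpp tag):

```python
import llama_cpp
from llama_cpp import Llama

# Confirm which release of the bindings is installed.
print(llama_cpp.__version__)

# With a new enough vendored llama.cpp, the model should load without
# the "unknown pre-tokenizer type" error. Path is illustrative.
llm = Llama(model_path="./DeepSeek-R1-Distill-Qwen-32B-Q4_K_M.gguf")
```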