Using the sglang framework combined with outlines to format the output using DeepSeek-R1-Distill-Qwen-1.5B. #1406
Closed
llmadd
started this conversation in
Weird model output
Replies: 1 comment 4 replies
-
Your dummy API key freaked me out for a second, thought it was a real key. 😌 Part of the issue is that the regular expression to work with DeepSeek is somewhat complicated. There's an example here for limiting the number of thinking tokens, but the compilation time can be quite complex. I have an example of how to do this here: https://gist.github.com/cpfiffer/e98fc71d4fcb35ba827fc9e679112895 Can you take a look at that and see if it helps? |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I’m not very good at English, but the issue is roughly as follows: When I perform regular expression formatting, I encounter a situation where the content keeps looping or stays in the 'think' process. Even with relatively simple regular expressions, like extracting numbers from the example above, there are multiple repeated outputs. However, JSON formatted output doesn't have this issue.
Here are some examples of unreasonable outputs:
Beta Was this translation helpful? Give feedback.
All reactions