-
-
Notifications
You must be signed in to change notification settings - Fork 5.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Usage]: How do I set default temperature for openai compatible server? #11861
Comments
You are meant to pass the JSON filepath itself (not the directory) to |
I already tried that and this is the error I got: ValueError: No supported config format found in myconfig.json This is my {
"temperature": 0.0,
} |
Maybe you need to remove the trailing comma for the file to be read as JSON correctly. |
Thank you @DarkLight1337, same error is gotten:
If I run the help command
As you can see it says |
Oh, sorry I misread the argument meaning. Looking into the code, it appears that the folder needs to be a valid target for |
Ok, I think I finally was able to set This is an example of how the {
"bos_token_id": 151643,
"pad_token_id": 151643,
"do_sample": true,
"eos_token_id": [
151645,
151643
],
"repetition_penalty": 1.05,
"temperature": 0.7,
"top_p": 0.8,
"top_k": 20,
"transformers_version": "4.37.0"
} And it must be exactly named This is what I get now:
|
Your current environment
How would you like to use vllm
Hello guys, I am trying to set the default temperature for openai compatible server and unfortunately I am unable to do it. I created a json like the following inside the folder
generation_config
:And I ran this command:
But I get an error with about incompatibility:
ERROR 01-08 09:10:14 engine.py:366] No supported config format found in generation_config
What's the correct way to set a default value for temperature and other parameters? By default, temperature is set to 1:
vllm/vllm/sampling_params.py
Line 219 in 571da8f
But I want to modify from the side of the server and not from the side of the client/user.
Any idea how to do it?
Before submitting a new issue...
The text was updated successfully, but these errors were encountered: