[fix] Sampling Parameters related improvements #80

oandreeva-nv · 2025-01-04T00:53:07Z

What does the PR do?

This PR removes dependencies on vllm.SamplingParams names, present in _get_sampling_params_dict in favor of dynamically checking sampling parameters passed with the request with vllm.SamplingParams.__annotations__

Previously, we needed to keep track about SamplingParams members and their types: e.g. float keys as defined here.
I suggest to infer the type from vllm.SamplingParams.__annotations__ and for simple ones (int, float, bool, str, Optional[int]) perform a conversion from string to the expected type. Suggested logic is implemented here.

Also added proper handling of "guided_generation" parameter for constrained decoding test. Limited test added in accuracy_tests

Misc:

Fixed some inconsistencies with error response for LoRA-based inference via generate endpoint with parameters passed through "parameters" vs through client script with "sampling_parameters" input. Enhanced tests.

Checklist

Commit Type:

Check the conventional commit type
box here and add the label to the github PR.

Related PRs:

Where should the reviewer start?

vllm_backend_utils.py -> main.py

Test plan:

CI Pipeline ID:

22089744

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

closes GitHub issue: #xxx

pskiran1 · 2025-01-08T17:46:38Z

LGTM!

kthui

Nice work! Only minor comments, otherwise LGTM!

kthui · 2025-01-09T19:56:45Z

ci/L0_multi_gpu_vllm/multi_lora/test.sh

@@ -197,6 +317,22 @@ else
        RET=1
    fi
 fi
+
+# Test generate endpoint + LoRA enabled (boolean flag)


# Test generate endpoint + LoRA enabled (boolean flag)

LoRA enabled -> disabled ?

kthui · 2025-01-09T19:57:41Z

ci/L0_multi_gpu_vllm/multi_lora/test.sh

@@ -243,6 +379,22 @@ else
        RET=1
    fi
 fi
+
+# Test generate endpoint + LoRA enabled (str flag)


# Test generate endpoint + LoRA enabled (str flag)

LoRA enabled -> disabled ?

ci/L0_multi_gpu_vllm/multi_lora/test.sh

oandreeva-nv added 7 commits December 30, 2024 15:21

ip

3a222db

refactor + clean up

f63c841

Added tests

4ad17c4

clean up

17f466c

Add accuracy test for guided decoding

419e6c5

Copyright

cc6dfc6

Clean up

07c5374

oandreeva-nv requested review from kthui and pskiran1 January 4, 2025 00:53

Test fix

cb81963

kthui previously approved these changes Jan 9, 2025

View reviewed changes

oandreeva-nv commented Jan 9, 2025

View reviewed changes

ci/L0_multi_gpu_vllm/multi_lora/test.sh Outdated Show resolved Hide resolved

oandreeva-nv commented Jan 9, 2025

View reviewed changes

ci/L0_multi_gpu_vllm/multi_lora/test.sh Outdated Show resolved Hide resolved

Apply suggestions from code review

8f9567c

oandreeva-nv dismissed kthui’s stale review via 8f9567c January 9, 2025 20:12

oandreeva-nv requested a review from kthui January 9, 2025 20:13

kthui approved these changes Jan 9, 2025

View reviewed changes

oandreeva-nv merged commit 80dd037 into main Jan 9, 2025
3 checks passed

oandreeva-nv deleted the oandreeva_sampling_parameters branch January 9, 2025 20:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[fix] Sampling Parameters related improvements #80

[fix] Sampling Parameters related improvements #80

oandreeva-nv commented Jan 4, 2025

pskiran1 commented Jan 8, 2025

kthui left a comment

kthui Jan 9, 2025

oandreeva-nv Jan 9, 2025

kthui Jan 9, 2025

[fix] Sampling Parameters related improvements #80

[fix] Sampling Parameters related improvements #80

Conversation

oandreeva-nv commented Jan 4, 2025

What does the PR do?

Checklist

Commit Type:

Related PRs:

Where should the reviewer start?

Test plan:

Caveats:

Background

Related Issues: (use one of the action keywords Closes / Fixes / Resolves / Relates to)

pskiran1 commented Jan 8, 2025

kthui left a comment

Choose a reason for hiding this comment

kthui Jan 9, 2025

Choose a reason for hiding this comment

oandreeva-nv Jan 9, 2025

Choose a reason for hiding this comment

kthui Jan 9, 2025

Choose a reason for hiding this comment