Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding autocomplete to vllm model.py #20

Merged
merged 11 commits into from
Nov 9, 2023
Merged

Conversation

oandreeva-nv
Copy link
Contributor

Moving default parameters from config to auto_complete function.

Also tests Python_backend's set_model_transaction_policy

src/model.py Outdated Show resolved Hide resolved
src/model.py Show resolved Hide resolved
Copy link
Contributor

@rmccorm4 rmccorm4 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM other than ask to make stream input optional

dyastremsky
dyastremsky previously approved these changes Nov 6, 2023
rmccorm4
rmccorm4 previously approved these changes Nov 6, 2023
krishung5
krishung5 previously approved these changes Nov 6, 2023
tanmayv25
tanmayv25 previously approved these changes Nov 8, 2023
@tanmayv25 tanmayv25 dismissed stale reviews from krishung5, rmccorm4, dyastremsky, and themself via 70b4547 November 8, 2023 04:45
@tanmayv25 tanmayv25 force-pushed the oandreeva_autocomplete_vllm branch from 11d5c96 to 70b4547 Compare November 8, 2023 04:45
@tanmayv25 tanmayv25 self-requested a review November 8, 2023 04:46
@rmccorm4
Copy link
Contributor

rmccorm4 commented Nov 8, 2023

@tanmayv25 should we merge this?

@tanmayv25
Copy link
Contributor

I was unable to get a successful run using this change with olga's pipeline. I am in middle of investigating what is not working.

@tanmayv25
Copy link
Contributor

Seems like there was some setup problem. Tested successfully with job id 73883232.

@tanmayv25 tanmayv25 merged commit 0e5b209 into main Nov 9, 2023
3 checks passed
tanmayv25 pushed a commit that referenced this pull request Nov 9, 2023
@tanmayv25 tanmayv25 deleted the oandreeva_autocomplete_vllm branch November 9, 2023 02:20
tanmayv25 added a commit that referenced this pull request Nov 9, 2023
@rguo123
Copy link

rguo123 commented Nov 13, 2023

Does autocomplete work? It doesn't seem to be called anywhere in the code besides the method being defined?

@tanmayv25
Copy link
Contributor

The auto-complete feature is currently available only on triton's main branch that tracks development to a future release. The auto-complete functionality will be available with 23.11 release which is scheduled to be released by end of the month.
Thanks for your patience.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants