Skip to content

[Bug] vLLM Engine KV Cache Memory Limitation When max_model_len=4096 #1921

[Bug] vLLM Engine KV Cache Memory Limitation When max_model_len=4096

[Bug] vLLM Engine KV Cache Memory Limitation When max_model_len=4096 #1921

Triggered via issue December 29, 2024 04:55
Status Success
Total duration 9s
Artifacts

label_issue.yml

on: issues
label_issue
2s
label_issue
Fit to window
Zoom out
Zoom in

Annotations

1 warning
label_issue
ubuntu-latest pipelines will use ubuntu-24.04 soon. For more details, see https://github.com/actions/runner-images/issues/10636