Removed explicit mode for multi-lora #45

oandreeva-nv · 2024-07-03T01:17:10Z

explicit model load was causing failures in tests with vllm==v0.5.0.post1

Tabrizian · 2024-07-03T14:55:32Z

ci/L0_multi_gpu/multi_lora/multi_lora_test.py

@@ -119,7 +119,6 @@ def _test_vllm_model(
        self.triton_client.stop_stream()

    def test_multi_lora_requests(self):
-        self.triton_client.load_model(self.vllm_model_name)


Do we know why it doesn't work with explicit mode?

Due to the way vllm evaluates free GPU memory. I'm not sure what exactly changed, but in explicit mode initialize fails at:

Error in memory profiling. This happens when the GPU memory was not properly cleaned up before initializing the vLLM instance.

Unfortunately, I don't have the capacity to investigate what exactly changed in the profiling

sample job failure: 98632142

@oandreeva-nv Have you created ticket for further investigation?

Removed explicit mode for multi-lora

496d799

oandreeva-nv mentioned this pull request Jul 3, 2024

[build]: vllm version update triton-inference-server/server#7405

Merged

20 tasks

oandreeva-nv requested review from tanmayv25 and pskiran1 July 3, 2024 01:20

Tabrizian reviewed Jul 3, 2024

View reviewed changes

oandreeva-nv requested a review from Tabrizian July 3, 2024 16:06

tanmayv25 approved these changes Jul 5, 2024

View reviewed changes

oandreeva-nv merged commit db3d794 into main Jul 5, 2024
3 checks passed

oandreeva-nv deleted the oandreeva_post_050_updates branch July 5, 2024 22:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Removed explicit mode for multi-lora #45

Removed explicit mode for multi-lora #45

oandreeva-nv commented Jul 3, 2024

Tabrizian Jul 3, 2024

oandreeva-nv Jul 3, 2024

oandreeva-nv Jul 3, 2024

tanmayv25 Jul 3, 2024

oandreeva-nv Jul 3, 2024

Removed explicit mode for multi-lora #45

Removed explicit mode for multi-lora #45

Conversation

oandreeva-nv commented Jul 3, 2024

Tabrizian Jul 3, 2024

Choose a reason for hiding this comment

oandreeva-nv Jul 3, 2024

Choose a reason for hiding this comment

oandreeva-nv Jul 3, 2024

Choose a reason for hiding this comment

tanmayv25 Jul 3, 2024

Choose a reason for hiding this comment

oandreeva-nv Jul 3, 2024

Choose a reason for hiding this comment