
Fixes for Llama 3.x support, adding image input tooling, tt-studio environment variable handling #83

Closed
wants to merge 4 commits

Conversation

tstescoTT
Contributor

change log

  • Add benchmark summary handling to the vLLM benchmark script, with a documentation example
  • Rename the client-side scripts' batch_size option to max_concurrent, to indicate that it limits client-side concurrent requests
  • Stop-gap handling of MESH_DEVICE for Llama 3.x models
  • Add image input support for image-text-to-text models in client scripts and tools
    • Added support for image input in trace capturing
    • Added new parameters for image width and height
    • Implemented handling of both text-only and image+text trace captures
  • Setup script improvements:
    • Improved environment variable handling and persistence storage integration
    • Added IMPL_ID field (set to "tt-metal" for all current models)
    • Introduced MODEL_VERSION and MODEL_ID variables for better versioning
    • Enhanced HF token validation with character count check
  • Fixed the vLLM model registration logic:
    • Added missing ModelRegistry.register_model call for TTLlamaForCausalLM for legacy implementation models
  • Updated benchmark path handling to use $HOME environment variable instead of hardcoded /home/user path
  • Added support for a new model "DeepSeek-R1-Distill-Llama-70B" in the model setup configurations
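The batch_size → max_concurrent rename amounts to a client-side cap on in-flight requests. A minimal sketch of that idea, assuming an asyncio-based client (the function and variable names here are illustrative, not the actual script's API):

```python
import asyncio

async def send_request(i: int, sem: asyncio.Semaphore,
                       in_flight: list, peak: list) -> int:
    # Acquire the semaphore so at most max_concurrent requests run at once.
    async with sem:
        in_flight[0] += 1
        peak[0] = max(peak[0], in_flight[0])
        await asyncio.sleep(0.01)  # stand-in for the actual HTTP call
        in_flight[0] -= 1
        return i

async def run_benchmark(num_prompts: int, max_concurrent: int) -> int:
    sem = asyncio.Semaphore(max_concurrent)
    in_flight, peak = [0], [0]
    await asyncio.gather(
        *(send_request(i, sem, in_flight, peak) for i in range(num_prompts)))
    return peak[0]  # highest number of simultaneously in-flight requests

peak = asyncio.run(run_benchmark(num_prompts=32, max_concurrent=4))
print(peak)  # never exceeds 4
```

Calling it batch_size suggested a server-side batching knob; max_concurrent makes clear the limit lives in the client.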

…vironment variable handling (#82)

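For the image input support in the client scripts, a hedged sketch of how an image might be attached to a request: vLLM serves an OpenAI-style chat completions API, which accepts images as base64 data URLs. The helper name and the model name below are illustrative assumptions, not the PR's actual code.

```python
import base64

def build_image_chat_payload(model: str, prompt: str, image_bytes: bytes,
                             mime: str = "image/png") -> dict:
    # Encode the image as a base64 data URL, the format OpenAI-style
    # chat completion endpoints accept for image-text-to-text models.
    b64 = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": model,
        "messages": [{
            "role": "user",
            "content": [
                {"type": "text", "text": prompt},
                {"type": "image_url",
                 "image_url": {"url": f"data:{mime};base64,{b64}"}},
            ],
        }],
    }

# Illustrative usage; the model name is an example, not from this PR.
payload = build_image_chat_payload(
    "Llama-3.2-11B-Vision-Instruct", "Describe this image.", b"\x89PNG")
```

The new image width/height parameters would feed whatever generates or resizes `image_bytes` before encoding.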
echo " llama-3-70b"
echo " llama-3-8b-instruct"
echo " llama-3-8b"
echo " DeepSeek-R1-Distill-Llama-70B"
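For the models listed above, the stop-gap MESH_DEVICE handling could look like the following sketch: a per-model default that an explicit environment variable overrides. The device names (N150, T3K) and the assignments here are assumptions for illustration, not the PR's actual values.

```python
import os

# Illustrative defaults only; real device assignments depend on the
# tt-metal hardware each model implementation targets.
DEFAULT_MESH_DEVICE = {
    "llama-3-70b": "T3K",
    "llama-3-8b": "N150",
    "llama-3-8b-instruct": "N150",
    "DeepSeek-R1-Distill-Llama-70B": "T3K",
}

def resolve_mesh_device(model_name: str) -> str:
    # Respect an explicit user override; otherwise fall back to the default.
    override = os.environ.get("MESH_DEVICE")
    if override:
        return override
    try:
        return DEFAULT_MESH_DEVICE[model_name]
    except KeyError:
        raise ValueError(f"no default MESH_DEVICE known for {model_name!r}")
```

As a stop-gap this keeps users from having to know the right mesh topology per model while still allowing manual control.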
Collaborator
I'm up for this, as it will align the id with metal's expectations! (model_config.py, L137)

But we need to change the names in setup_model_environment as well. I've done that locally and will push soon. You can update the usage chunk if you wish?

@@ -74,6 +75,7 @@ get_hf_env_vars() {
echo "HF_TOKEN environment variable is not set. Please set it before running the script."
read -r -s -p "Enter your HF_TOKEN: " input_hf_token
echo
echo "entered HF_TOKEN contains: ${#input_hf_token} characters, expected 37."
Collaborator

We can ping the HF API to check whether the token is valid.
I have that in my change, will follow up!
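One way to do the API ping the collaborator mentions, using Hugging Face's whoami-v2 endpoint (a real endpoint; the helper names and the 37-character expectation from the diff above are carried over for illustration, not taken from the collaborator's actual change):

```python
import urllib.error
import urllib.request

HF_WHOAMI_URL = "https://huggingface.co/api/whoami-v2"

def looks_like_hf_token(token: str) -> bool:
    # Cheap local check mirroring the setup script's expectation of
    # 37 characters; catches obvious paste errors before any network call.
    return len(token) == 37

def build_whoami_request(token: str) -> urllib.request.Request:
    # A valid token authenticates via a Bearer header; the API returns
    # 401 for an invalid token.
    return urllib.request.Request(
        HF_WHOAMI_URL, headers={"Authorization": f"Bearer {token}"})

def validate_hf_token(token: str) -> bool:
    # Network call, guarded: any HTTP or connection error means "invalid
    # or unverifiable", so the caller can reprompt.
    try:
        with urllib.request.urlopen(build_whoami_request(token),
                                    timeout=10) as resp:
            return resp.status == 200
    except urllib.error.URLError:
        return False
```

Doing the length check first keeps the common typo case fast and offline.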

Contributor (Author)

Nice, can you open a separate PR for that, or share the code you're using? I had a TODO to also check repo access for the target model and message the user on a 404.
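The repo-access TODO mentioned here could be sketched as follows, against Hugging Face's public model-info endpoint (`/api/models/{repo_id}` is a real endpoint; the function names and messages are hypothetical):

```python
import urllib.error
import urllib.request

def classify_repo_error(code: int, repo_id: str) -> str:
    # Turn an HTTP status into a user-facing message, per the TODO:
    # 404 means the repo does not exist under that name (or is hidden),
    # 401/403 mean the token lacks access to a gated/private repo.
    if code == 404:
        return f"repo '{repo_id}' not found: check the name and your access"
    if code in (401, 403):
        return f"access denied to '{repo_id}': check your HF_TOKEN permissions"
    return f"unexpected HTTP {code} for '{repo_id}'"

def check_repo_access(repo_id: str, token: str) -> str:
    # Query the model-info endpoint with the user's token.
    url = f"https://huggingface.co/api/models/{repo_id}"
    req = urllib.request.Request(
        url, headers={"Authorization": f"Bearer {token}"})
    try:
        with urllib.request.urlopen(req, timeout=10):
            return "ok"
    except urllib.error.HTTPError as e:
        return classify_repo_error(e.code, repo_id)
```

Separating the message logic from the network call keeps the 404 wording testable without hitting the API.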

* Fix casing
* Clean up HF setup venv before repacking
* Update refs to /home/container_app_user/ for container home usage
* Update documentation
@tstescoTT (Contributor, Author)

Moved this PR to #88 to use the proposed RC git workflow.

@tstescoTT closed this on Feb 1, 2025