
tstesco/use-env-vars #96

Merged 1 commit into dev on Feb 5, 2025

Conversation

@tstescoTT merged commit f29afd3 into dev on Feb 5, 2025
tstescoTT added a commit that referenced this pull request on Feb 5, 2025
# Change log

* Added default handling of MESH_DEVICE for Llama 3.x models (see the first sketch after this change log)
* Setup script improvements:
    * Improved environment variable handling and persistence storage integration
    * Added IMPL_ID field (set to "tt-metal" for all current models)
    * Introduced MODEL_VERSION and MODEL_ID variables for better versioning
* Added image input support for image-text-to-text models in client scripts and tools (request-construction sketch below):
    * Added support for image input in trace capturing
    * Added new parameters for image width and height
    * Implemented handling of both text-only and image+text trace captures
* Renamed the client-side scripts' batch_size option to max_concurrent to make clear it caps client-side concurrent requests, not the server batch size (sketch below)
* Fixed the vLLM model registration logic: added the missing ModelRegistry.register_model call for TTLlamaForCausalLM for legacy implementation models (sketch below)
* Updated benchmark path handling to use the $HOME environment variable instead of the hardcoded /home/user path (sketch below)
* Added benchmark summary handling for the vLLM benchmark script, with a documentation example
* Added support for the new DeepSeek-R1-Distill-Llama-70B model in the model setup configurations
* Used CACHE_ROOT and vllm_dir where possible; fixed mock.vllm.openai.dockerfile (#96)
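
A minimal sketch of the MESH_DEVICE fallback described above, assuming the setup script resolves a per-model default when the variable is unset; the model names and device identifiers here are illustrative, not taken from this PR:

```python
import os

# Hypothetical per-model defaults; the real mapping lives in the setup script.
DEFAULT_MESH_DEVICE = {
    "Llama-3.1-70B-Instruct": "T3K",
    "Llama-3.1-8B-Instruct": "N150",
}

def resolve_mesh_device(model_name: str) -> str:
    # An explicitly exported MESH_DEVICE always wins; otherwise fall back
    # to the default for the selected Llama 3.x model.
    return os.environ.get("MESH_DEVICE") or DEFAULT_MESH_DEVICE.get(model_name, "N150")
```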
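
For the image input support, a hedged sketch of how a client script might build both text-only and image+text requests against an OpenAI-compatible vLLM endpoint; the content field names follow the OpenAI chat-completions schema, while the helper itself is hypothetical:

```python
import base64
from typing import Optional

def build_messages(prompt: str, image_bytes: Optional[bytes] = None) -> list[dict]:
    # Text-only request: a single text part.
    content: list[dict] = [{"type": "text", "text": prompt}]
    # Image+text request: append the image as a base64 data URL part.
    if image_bytes is not None:
        b64 = base64.b64encode(image_bytes).decode()
        content.append({
            "type": "image_url",
            "image_url": {"url": f"data:image/png;base64,{b64}"},
        })
    return [{"role": "user", "content": content}]
```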
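
The max_concurrent rename signals that the option limits in-flight client requests rather than configuring server-side batching. A sketch of what that limit looks like in an async client, with only the argument name taken from the PR; send_request and the defaults are illustrative:

```python
import argparse
import asyncio

parser = argparse.ArgumentParser()
parser.add_argument(
    "--max_concurrent", type=int, default=32,
    help="Maximum number of concurrent requests the client keeps in flight.",
)

async def run(prompts, send_request, max_concurrent: int):
    # A semaphore caps client-side concurrency; the server is free to
    # batch the arriving requests however it likes.
    sem = asyncio.Semaphore(max_concurrent)

    async def bounded(prompt):
        async with sem:
            return await send_request(prompt)

    return await asyncio.gather(*(bounded(p) for p in prompts))
```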
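
The registration fix uses vLLM's model registry, which maps an architecture name to an implementation class. A sketch of the missing call; the import path for TTLlamaForCausalLM is an assumption, so adjust it to the actual legacy module:

```python
from vllm import ModelRegistry

# Import path is illustrative; point this at the real location of the
# legacy TT implementation.
from tt_llama_legacy import TTLlamaForCausalLM

# Without this call, vLLM cannot resolve the architecture name to the
# TT implementation when loading legacy models.
ModelRegistry.register_model("TTLlamaForCausalLM", TTLlamaForCausalLM)
```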
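
And the benchmark path change, sketched below; the results subdirectory name is illustrative:

```python
import os
from pathlib import Path

# Before: hardcoded, so it breaks for any user other than "user".
# benchmark_root = Path("/home/user/benchmark_results")

# After: follow the container's $HOME, falling back to the OS-level home.
benchmark_root = Path(os.environ.get("HOME", Path.home())) / "benchmark_results"
```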