Llama 3.x model support, setup.sh script multiple model support using HF download #67

tstescoTT · 2025-01-14T21:40:06Z

change log:

- add multiple model support using persistent_volume/model_envs/*.env
- setup using Hugging Face huggingface-cli to download models: llama model install script support for llama CLI and huggingface hub #14
- add model setup for llama 3.x
- address Initial vLLM setup fails due to missing HuggingFace permissions #37
- address Docker run support for HF_TOKEN authentication using env var pass in #23
- renamed vllm-tt-metal-llama3-70 to vllm-tt-metal-llama3 for all llama 3.x models
- updated documentation for v0 drop
- add Docker Ubuntu 22.04 option for vLLM llama 3.x
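
To illustrate the per-model env file layout from the first item, here is a minimal sketch of how one file under persistent_volume/model_envs/ might be loaded. The function name `load_model_env`, the `<model>.env` naming scheme, and the variable `HF_MODEL_REPO_ID` are assumptions for illustration; they are not taken from the actual setup.sh.

```shell
#!/usr/bin/env bash
# Hypothetical helper: load settings for one model from <env_dir>/<model>.env.
# File naming and variable names are assumptions, not the real setup.sh.
load_model_env() {
    local env_dir="$1"
    local model="$2"
    local env_file="${env_dir}/${model}.env"

    if [ ! -f "${env_file}" ]; then
        echo "no env file for model '${model}' in ${env_dir}" >&2
        return 1
    fi

    # Export every variable assigned while the file is sourced.
    set -a
    # shellcheck disable=SC1090
    . "${env_file}"
    set +a
}
```

With a file such as persistent_volume/model_envs/my-model.env containing `HF_MODEL_REPO_ID=...`, calling `load_model_env persistent_volume/model_envs my-model` would make that variable available to later steps, for example a `huggingface-cli download` call. Keeping one file per model means switching models only changes which file is sourced.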


milank94 · 2025-01-14T22:39:01Z

vllm-tt-metal-llama3/vllm.llama3.src.ubuntu-22.04-amd64.Dockerfile

Is there any difference other than FROM local/tt-metal/tt-metalium/ubuntu-22.04-amd64:$TT_METAL_DOCKERFILE_VERSION between this Dockerfile and the one for 20.04?

Wondering if that could be parameterized and this combined into just a single file?

tstescoTT · 2025-01-15

I tried doing this with a multistage build, but it doesn't work well. Multistage builds are best when the fork happens later in the process and all combinations need to be built. Unfortunately, because tt-metal isn't publishing release images for Ubuntu 22.04, we have to build them manually here, so it is easier to leave the Ubuntu 22.04 build out of the default path and keep the default to fewer steps.

One way to do this is to put as many of the RUN commands as possible into a setup.sh script, then copy and run it, but this won't cover everything (e.g. CMD, COPY, etc.) and would require extra testing.

The best way I found to do this is to use TT_METAL_DOCKERFILE_URL instead of TT_METAL_DOCKERFILE_VERSION so we can pass in a different base image entirely. This supports locally built Ubuntu 22.04 images, GHCR published Ubuntu 20.04 tt-metal images, and will support GHCR Ubuntu 22.04 tt-metal images once those are published.

added this in 11141de
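
As a rough sketch of the single-Dockerfile approach, the helper below prints a `docker build` command that passes the full base image reference as a build argument. It assumes the Dockerfile declares `ARG TT_METAL_DOCKERFILE_URL` before `FROM ${TT_METAL_DOCKERFILE_URL}`; the function name `build_cmd`, the Dockerfile path, and the image tag are hypothetical.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the docker build command for a given base image.
# Assumes the Dockerfile contains:
#   ARG TT_METAL_DOCKERFILE_URL
#   FROM ${TT_METAL_DOCKERFILE_URL}
build_cmd() {
    local base_image_url="$1"   # full image reference, local or from a registry
    local dockerfile="$2"       # path to the single shared Dockerfile
    local tag="$3"              # tag for the resulting image (placeholder)

    printf 'docker build --build-arg TT_METAL_DOCKERFILE_URL=%s -f %s -t %s .\n' \
        "${base_image_url}" "${dockerfile}" "${tag}"
}
```

For a locally built Ubuntu 22.04 base, the first argument could be `local/tt-metal/tt-metalium/ubuntu-22.04-amd64:${TT_METAL_DOCKERFILE_VERSION}`; for a published image it would be that image's registry reference instead. Printing the command rather than running it keeps the sketch safe to try without Docker installed.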


tstescoTT mentioned this pull request Jan 14, 2025

benchmark and evals changes for Llama 3.1 70B v0 drop testing #59

Merged

tstescoTT requested review from milank94 and rpavlovicTT January 14, 2025 21:45

milank94 reviewed Jan 14, 2025

milank94 approved these changes Jan 14, 2025

tstescoTT added 2 commits January 14, 2025 20:02

use vllm.llama3.src.shared.Dockerfile for shared build steps for ubuntu 22.04 and 20.04 Dockerfiles (d850b73)

use full url TT_METAL_DOCKERFILE_URL to allow for 1 Dockerfile for multiple base images (11141de)

tstescoTT merged commit 5e33583 into main Jan 15, 2025
1 check passed

This was referenced Jan 21, 2025

Initial vLLM setup fails due to missing HuggingFace permissions #37

Closed

Docker run support for HF_TOKEN authentication using env var pass in #23

Closed

llama model install script support for llama CLI and huggingface hub #14

Closed

tstescoTT deleted the tstesco/llama-3x-support branch January 31, 2025 21:19

tstescoTT mentioned this pull request Feb 1, 2025

Support for Llama 3.1 8B #11

Closed
