Skip to content

add deepseek v1, v3 and all the distills #398

add deepseek v1, v3 and all the distills

add deepseek v1, v3 and all the distills #398

two-m4-pro-cluster (llama-3.1-8b)  /  run-distributed-job (M4PRO_GPU16_24GB)

succeeded Jan 24, 2025 in 1m 25s