Skip to content

add deepseek v1, v3 and all the distills #398

add deepseek v1, v3 and all the distills

add deepseek v1, v3 and all the distills #398

three-m4-pro-cluster (llama-3.3-70b)  /  run-distributed-job (M4PRO_GPU16_24GB)

succeeded Jan 24, 2025 in 2m 49s