fix embed_tokens for last layer in qwen models #458

three-m4-pro-cluster (llama-3.2-1b)  /  generate-matrix

succeeded Jan 28, 2025 in 0s