
Enable Priority Configs (All Gather) (Master) #11734

Closed
2 tasks
Tracked by #10874 ...
SeanNijjar opened this issue Aug 21, 2024 · 1 comment
Assignees: SeanNijjar
Labels: feature, op_cat: ccl, Op Generalization (generalization and relaxation of requirements in ops), P1, perf (for issues tracking performance problems/improvements)

Comments


SeanNijjar commented Aug 21, 2024

Tasks

  • Enable Priority Configs (All Gather) (T3000)
  • Enable Priority Configs (All Gather) (TG)

Priority Configs:

Prior to sweeping, the priority configs should be added and tested.

From Llama 405B

  • line-all-gather: (4 chips, 3 links, input tensor (per chip) = [1,1,32,6.5*1024] (= [1,1,32,6656]), dim=1 => input_tensor for test_case = [1,4,32,6656])
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1,1,32,2304], dim=1 => input_tensor for test_case = [1,8,32,2304])
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1,1,32,4k], dim=1 => input_tensor for test_case = [1,8,32,4k])
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1, 1, 8[padded to 32], 4k], dim=2 => output shape (per chip) = [1, 1, 32, 4k] -> all-gather concatenates within tile
    • currently expected to fail as this feature is missing
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1, 1, 8[padded to 32], 16k], dim=2 => output shape (per chip) = [1, 1, 32, 16k] -> all-gather concatenates within tile
    • currently expected to fail as this feature is missing

From Llama 70B

  • line-all-gather: (4 chips, 3 links, input tensor (per chip) = [1,1,32,3.5*1024] (= [1,1,32,3584]), dim=1 => input_tensor for test_case = [1,4,32,3584])
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1,1,32,1280], dim=1 => input_tensor for test_case = [1,8,32,1280])
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1,1,32,2048], dim=1 => input_tensor for test_case = [1,8,32,2048])
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1, 1, 8[padded to 32], 2k], dim=2 => output shape (per chip) = [1, 1, 32, 2k] -> all-gather concatenates within tile
    • currently expected to fail as this feature is missing
  • line-all-gather: (8 chips, {3,4} links, input tensor (per chip) = [1, 1, 8[padded to 32], 4k], dim=2 => output shape (per chip) = [1, 1, 32, 4k] -> all-gather concatenates within tile
    • currently expected to fail as this feature is missing
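As a golden reference for the dim=1 cases, an all-gather is just a concatenation of per-chip shards along the gather dimension. A minimal NumPy sketch (function name is illustrative, not the ttnn API; the within-tile dim=2 concat cases above have no simple reference here since that feature is the missing piece):

```python
import numpy as np

# Hypothetical reference: each chip contributes its local shard; the
# gathered result concatenates the shards along the gather dimension.
def reference_all_gather(per_chip_shards, dim):
    return np.concatenate(per_chip_shards, axis=dim)

# Llama 70B case: 8 chips, per-chip shape [1, 1, 32, 1280], dim=1.
shards = [np.full((1, 1, 32, 1280), chip, dtype=np.float32) for chip in range(8)]
gathered = reference_all_gather(shards, dim=1)
assert gathered.shape == (1, 8, 32, 1280)
# Chip ordering is preserved along the gather dimension.
assert (gathered[0, 3] == 3).all()
```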
@SeanNijjar SeanNijjar self-assigned this Aug 21, 2024
@SeanNijjar SeanNijjar added P1 feature Op Generalization Generalization and relaxations of requirements in Ops op_cat: ccl perf for issues tracking performance problems/improvements labels Oct 10, 2024
SeanNijjar (author) commented:

These test cases are already running in TG post-commit (frequent).
