#5605: Only force-stall ethernet programs on earlier ethernet programs #16202

jbaumanTT · 2024-12-19T18:32:39Z

Ticket

Problem description

On llama, we're seeing high dispatch latencies on programs that execute on active ethernet cores. This is because we always stall writing program binaries and launch messages until the previous program completes, since we don't support having program binaries in a ring buffer on ethernet cores.

What's changed

Keep track of when the last program using active ethernet cores was dispatched, so we can wait on that program before sending out binaries. This is better than always waiting on the immediate previous program, since in most cases we don't run programs on the ethernet cores back-to-back.

Checklist

Post commit CI passes
Blackhole Post commit (if applicable)
Model regression CI testing passes (if applicable)
Device performance regression CI testing passes (if applicable)
(For models and ops writers) Full new models tests passes
New/Existing tests provide coverage for changes

Keep track of when the last program using active ethernet cores was dispatched, so we can wait on that program before sending out binaries. This is better than always waiting on the immediate previous program, since in most cases we don't run programs on the ethernet cores back-to-back.

jbaumanTT requested review from abhullar-tt, pgkeller, aliuTT, tt-aho, tt-dma, tt-asaigal and ubcheema as code owners December 19, 2024 18:32

jbaumanTT force-pushed the jbauman/improveethernet branch 3 times, most recently from 41e89cc to 26b75a4 Compare December 20, 2024 06:20

jbaumanTT changed the title ~~#0: Only force-stall ethernet programs on earlier ethernet programs~~ #5605: Only force-stall ethernet programs on earlier ethernet programs Dec 20, 2024

jbaumanTT force-pushed the jbauman/improveethernet branch from 26b75a4 to ef4a74e Compare December 20, 2024 17:02

tt-asaigal approved these changes Dec 20, 2024

View reviewed changes

jbaumanTT merged commit 5058b8f into main Dec 20, 2024
183 of 188 checks passed

jbaumanTT deleted the jbauman/improveethernet branch December 20, 2024 21:14

blozano-tt mentioned this pull request Dec 22, 2024

Revert "#5605: Only force-stall ethernet programs on earlier ethernet programs" #16257

Merged

blozano-tt restored the jbauman/improveethernet branch December 22, 2024 07:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#5605: Only force-stall ethernet programs on earlier ethernet programs #16202

#5605: Only force-stall ethernet programs on earlier ethernet programs #16202

jbaumanTT commented Dec 19, 2024

#5605: Only force-stall ethernet programs on earlier ethernet programs #16202

#5605: Only force-stall ethernet programs on earlier ethernet programs #16202

Conversation

jbaumanTT commented Dec 19, 2024

Ticket

Problem description

What's changed

Checklist