
Add asserts for each operation in RT and also add check for setting the output #1313

Open
mtopalovicTT opened this issue Nov 18, 2024 · 2 comments · May be fixed by #2338
Labels: MLIR Ops (Issues related to MLIR dialect ops and their implementations)

Comments

mtopalovicTT (Contributor) commented Nov 18, 2024

We don't validate input/output parameters in RT at all. We should add checks before each op execution that validate the input tensors against the expected inputs. We also need to do this when setting the output tensor in RT.
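A minimal sketch of what such a per-op check could look like, assuming a simplified stand-in for the tensor descriptor; the struct, its fields, and the `validateTensor` helper below are hypothetical, not the actual tt-mlir runtime API:

```cpp
#include <cstdint>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical, simplified stand-in for both the runtime tensor and the
// compiler-generated (expected) tensor description.
struct TensorDesc {
  std::vector<std::uint32_t> shape;
  int dataType;  // enum value in a real runtime
  int layout;    // e.g. tile vs row_major
};

// Compare the actual tensor against the expected descriptor before
// dispatching the op; report which property mismatched.
inline void validateTensor(const TensorDesc &actual,
                           const TensorDesc &expected,
                           const std::string &opName) {
  if (actual.shape != expected.shape) {
    throw std::runtime_error(opName + ": input shape mismatch");
  }
  if (actual.dataType != expected.dataType) {
    throw std::runtime_error(opName + ": input data type mismatch");
  }
  if (actual.layout != expected.layout) {
    throw std::runtime_error(opName + ": input layout mismatch");
  }
}
```

The same helper could be reused when binding the output tensor, so that a mismatch is reported at the op that produced it rather than further downstream.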

@mtopalovicTT added the MLIR Ops label Nov 18, 2024
@mtopalovicTT self-assigned this Nov 18, 2024
@mtopalovicTT changed the title from "Missing assert for output tensor shape in runtime" to "Add asserts for each operation in RT and also add check for setting the output" Nov 19, 2024
mtopalovicTT (Contributor, Author) commented

@jnie-TT do you have some thoughts on this?

jnie-TT (Contributor) commented Nov 19, 2024

@mtopalovicTT We didn't do this previously because the compiler-generated tensor descriptors were immature (they're much better now, but still have some issues). Some common issues that come to mind:

  • Wrong data type. The compiler assumes that ops can implicitly typecast a tensor's data type (some ops can, but some can't), so there will sometimes be data type mismatches on output tensors.
  • Invalid layout. Previously the tile shapes were not correctly updated in the tensors when we forced tile/row_major layout, and for the decomposed ops we still lack granular layout updates.
  • Some ops auto-shard or auto-move to host (I think we saw this in conv), and we sometimes have workarounds in runtime to pre-shard. The compiler cannot capture this, so all downstream layouts will mismatch.
  • These runtime checks have a performance cost. If we add them, they should be debug asserts (see the sketch after this list), and we should probably update CI to also run the runtime in a debug build.
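A minimal sketch of the debug-assert idea, assuming a hypothetical RT_DEBUG_ASSERT macro (not an existing tt-mlir symbol) that compiles out of release builds:

```cpp
#include <cassert>

// Hypothetical guard macro: in debug builds (NDEBUG not defined) it forwards
// to assert; in release builds it expands to nothing, so ops pay no cost.
#ifndef NDEBUG
#define RT_DEBUG_ASSERT(cond, msg) assert((cond) && (msg))
#else
#define RT_DEBUG_ASSERT(cond, msg) ((void)0)
#endif

// Example use before dispatching an op (names are illustrative):
// RT_DEBUG_ASSERT(input.dataType == expected.dataType,
//                 "op input data type mismatch");
```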

Overall though, I'm definitely on board with adding such checks to the runtime. Currently there's no way to ensure alignment between the runtime and the compiler-generated descriptors.
