ttnn.pad does not when support output_shape[-1] is odd #15603

swimdi · 2024-11-27T04:08:46Z

We convert aten.constant_pad_nd.default to ttnn.pad and it currently does not support cases wheb output_shape[-1] is odd

or it will TT_THROW pad_op.cpp:34: this->output_tensor_shape[3] % 2 == 0 RM padding requires output X dim to be a multiple of 2

These input varation are appeared in models and is related to this constraint

["Tensor<[1, 3, 224, 224]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 96, 112, 112]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 144, 56, 56]> self = ?", "List[int] pad = [1, 2, 1, 2]", "number value = 0.0"],
["Tensor<[1, 240, 28, 28]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 672, 14, 14]> self = ?", "List[int] pad = [1, 2, 1, 2]", "number value = 0.0"],
["Tensor<[1, 3, 240, 240]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 96, 120, 120]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 144, 60, 60]> self = ?", "List[int] pad = [1, 2, 1, 2]", "number value = 0.0"],
["Tensor<[1, 240, 30, 30]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 672, 15, 15]> self = ?", "List[int] pad = [2, 2, 2, 2]", "number value = 0.0"],
["Tensor<[1, 3, 260, 260]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 96, 130, 130]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 144, 65, 65]> self = ?", "List[int] pad = [2, 2, 2, 2]", "number value = 0.0"],
["Tensor<[1, 288, 33, 33]> self = ?", "List[int] pad = [1, 1, 1, 1]", "number value = 0.0"],
["Tensor<[1, 720, 17, 17]> self = ?", "List[int] pad = [2, 2, 2, 2]", "number value = 0.0"],
["Tensor<[1, 3, 300, 300]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 144, 150, 150]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 192, 75, 75]> self = ?", "List[int] pad = [2, 2, 2, 2]", "number value = 0.0"],
["Tensor<[1, 288, 38, 38]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 816, 19, 19]> self = ?", "List[int] pad = [2, 2, 2, 2]", "number value = 0.0"],
["Tensor<[1, 3, 380, 380]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 144, 190, 190]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 192, 95, 95]> self = ?", "List[int] pad = [2, 2, 2, 2]", "number value = 0.0"],
["Tensor<[1, 336, 48, 48]> self = ?", "List[int] pad = [0, 1, 0, 1]", "number value = 0.0"],
["Tensor<[1, 960, 24, 24]> self = ?", "List[int] pad = [1, 2, 1, 2]", "number value = 0.0"],

The text was updated successfully, but these errors were encountered:

jaykru-tt · 2024-12-10T01:50:47Z

I think I have a fix for this that should be landing soon with the fixes for #12896 and #15511. I'll give you a heads up when it's merged.

@yugaoTT

### Tickets - #15511 - #15603 (90% resolved with these changes and to be fully resolved in a future PR) - #12896 ### Problem description ttnn.pad's RM sharded implementation only has support for padding along the non-width dimensions. The row major implementation additionally is not fully general with respect to the width dimension, so until now there are no great options for padding along width. In a future PR coming tomorrow, I'll add input massaging code to convert to row-major and shard as needed for input configurations that aren't currently supported by pad. ### What's changed - Adds new kernels to support padding along the width dimension. - For pad operations requiring both NCH and width padding, we use a fused op using the original height-padding kernels and the new width kernels. - The previous point required extensive refactoring to the host code. I would like eyes on pad.cpp please @yugaoTT @sminakov-tt. - Also adds a bunch of common utility functions for working with sharded tensors: - A function for easily creating sharded memory configs from C++ (analogous to the Python `create_sharded_memory_config` utility function created by @ntarafdar) - A function for locating elements of a shard by their coordinates within the tensor. I've tested this one in the context of this PR, but it didn't end up being necessary in the final implementation. ### Checklist - [~] [Post commit CI passes](https://github.com/tenstorrent/tt-metal/actions/runs/12327681570) - [x] [Model regression CI testing passes](https://github.com/tenstorrent/tt-metal/actions/runs/12308045581) - [x] [Device performance regression CI testing passes](https://github.com/tenstorrent/tt-metal/actions/runs/12308046347) - [ ] Blackhole Post commit (if applicable) - [ ] **(For models and ops writers)** Full [new models](https://github.com/tenstorrent/tt-metal/actions/workflows/full-new-models-suite.yaml) tests passes - [ ] New/Existing tests provide coverage for changes --------- Co-authored-by: tarafdarTT <[email protected]>

jaykru-tt · 2024-12-14T22:22:16Z

I didn't manage to land a fix for this in my PR, but we'll be tackling this as an onboarding exercise for a new teammate in the New Year :)

### Ticket #15603 ### Problem description ttnn.pad call throws an error when user attempts to pad a tensor to an odd number of elements in width dimension ### What's changed Remove the check, it is outdated since we resolved the underlying constraint ### Checklist - [x] [Post commit CI](https://github.com/tenstorrent/tt-metal/actions/runs/12363708529)

swimdi assigned ayerofieiev-tt Nov 27, 2024

ayerofieiev-tt assigned ntarafdar and unassigned ayerofieiev-tt Dec 2, 2024

ayerofieiev-tt transferred this issue from tenstorrent/pytorch2.0_ttnn Dec 2, 2024

github-actions bot added the community label Dec 2, 2024

ayerofieiev-tt changed the title ~~aten.constant_pad_nd.default not support output_shape[-1] is odd~~ ttnn.pad does not when support output_shape[-1] is odd Dec 2, 2024

ayerofieiev-tt removed the community label Dec 2, 2024

ntarafdar assigned jaykru-tt Dec 2, 2024

jaykru-tt mentioned this issue Dec 13, 2024

Add support for padding along width dimension to ttnn.pad #15985

Merged

6 tasks

ayerofieiev-tt mentioned this issue Dec 17, 2024

#0: Allow ttnn.pad to pad Tensor to an odd width in row major #16079

Merged

1 task

ayerofieiev-tt mentioned this issue Dec 17, 2024

Allow ttnn.pad even for odd width tenstorrent/pytorch2.0_ttnn#618

Merged

ayerofieiev-tt assigned ayerofieiev-tt and unassigned ntarafdar and jaykru-tt Dec 17, 2024

swimdi closed this as completed Dec 20, 2024

github-project-automation bot moved this from In Progress to Done in PyTorch 2.0 TT-NN Compiler Dec 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ttnn.pad does not when support output_shape[-1] is odd #15603

ttnn.pad does not when support output_shape[-1] is odd #15603

swimdi commented Nov 27, 2024 •

edited by ayerofieiev-tt

Loading

jaykru-tt commented Dec 10, 2024

jaykru-tt commented Dec 14, 2024

ttnn.pad does not when support output_shape[-1] is odd #15603

ttnn.pad does not when support output_shape[-1] is odd #15603

Comments

swimdi commented Nov 27, 2024 • edited by ayerofieiev-tt Loading

jaykru-tt commented Dec 10, 2024

jaykru-tt commented Dec 14, 2024

swimdi commented Nov 27, 2024 •

edited by ayerofieiev-tt

Loading