
Some pointwise unary ops return 2-D results from 1-D inputs #12671

Open
Tracked by #13521 ...
jdh8 opened this issue Sep 13, 2024 · 8 comments
Labels: bug (Something isn't working), P1, pytorch-compiler

Comments

@jdh8
Contributor

jdh8 commented Sep 13, 2024

As we found out in tenstorrent/pytorch2.0_ttnn#198, several ops produce (1, N) tensors when (N,) tensors are expected.

Affected ops:

  • ceil
  • floor
  • gelu
  • rsqrt
  • sqrt

Spared ops:

  • cos
  • erf
  • exp

The lists above are not exhaustive (yet). I filed this ticket before trying more ops because this issue is probably widespread.
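For reference, pointwise unary ops in torch preserve the input shape exactly, so a (N,) input must produce a (N,) output. A minimal check in plain torch (no ttnn involved):

```python
import torch

x = torch.rand(1024)

# Pointwise unary ops preserve the input shape in torch,
# so every result here should be 1-D with shape (1024,):
for op in (torch.ceil, torch.floor, torch.rsqrt, torch.sqrt,
           torch.cos, torch.erf, torch.exp):
    assert op(x).shape == (1024,), op.__name__
```

(`gelu` lives in `torch.nn.functional` but behaves the same way with respect to shape.)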

@ayerofieiev-tt
Member

ayerofieiev-tt commented Sep 13, 2024

@jdh8 is the input tilized?

@jdh8
Contributor Author

jdh8 commented Sep 26, 2024

I’m trying to compare implementations of unary ops, for example ttnn::cos and ttnn::sqrt. However, searching the C++ codebase only turned up call sites, not definitions. Where can I find these implementations?

(python_env) jdh8@tt-metal-skymizer-n150-3:~/tt-metal/ttnn/cpp$ git grep -i 'cos *('
ttnn/operations/eltwise/complex_unary/device/complex_unary_op.cpp:    Tensor c = ttnn::cos(input_b,output_mem_config);
ttnn/operations/eltwise/unary_backward/device/unary_backward_op.cpp:// name: cos(Tensor self) -> Tensor
ttnn/operations/eltwise/unary_backward/device/unary_backward_op.cpp:// # - name: acos(Tensor self) -> Tensor
ttnn/operations/eltwise/unary_backward/device/unary_backward_op.cpp:// self: grad * self.cos()
ttnn/operations/eltwise/unary_backward/device/unary_backward_op.cpp:    Tensor grad_input = ttnn::multiply(grad, ttnn::cos(input_tensor, output_mem_config), std::nullopt, output_mem_config);

@jdh8
Contributor Author

jdh8 commented Oct 2, 2024

After #13339, cos is affected as well. I now believe that returning a 2-D tensor is intended, so squeezing the extra dimension back out in the compiler looks like the right fix.

@jdh8
Contributor Author

jdh8 commented Oct 13, 2024

I added a workaround in the compiler at tenstorrent/pytorch2.0_ttnn#198 because I couldn't find the root cause. That PR also enables conversion for all rounding ops: ceil, floor, round, trunc.
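The workaround boils down to squeezing the spurious leading dimension back out after the op. A sketch of the idea in plain torch; the helper name is hypothetical, not the actual pytorch2.0_ttnn code:

```python
import torch

def squeeze_to_input_rank(result: torch.Tensor, input_rank: int) -> torch.Tensor:
    # Hypothetical helper: drop leading size-1 dims the backend added
    # until the result rank matches the original input rank.
    while result.dim() > input_rank and result.shape[0] == 1:
        result = result.squeeze(0)
    return result

# A (1, N) result from a (N,) input is squeezed back to (N,):
out = torch.rand(1, 1024)
assert squeeze_to_input_rank(out, input_rank=1).shape == (1024,)
```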

@eyonland
Contributor

eyonland commented Oct 21, 2024

At the moment we believe all unary eltwise ops are impacted. It looks like our to_layout converts to [1[32], 1024] for tiled output instead of the expected shape [1024, 1[32]].
To align with torch we have to call ttnn.transpose(input_tensor, -2, -1). See the example below.

import ttnn
import torch

w = 1024
torch_input_tensor = torch.rand((w,), dtype=torch.bfloat16)
device_id = 0
device = ttnn.open_device(device_id=device_id)
input_tensor = ttnn.from_torch(torch_input_tensor, layout=ttnn.TILE_LAYOUT, device=device)

# required to align with torch
input_tensor = ttnn.transpose(input_tensor, -2, -1)
print(input_tensor.shape)
output_tensor = ttnn.erf(input_tensor)
print(output_tensor.shape)

output_tensor = ttnn.from_device(output_tensor)
output_tensor = ttnn.to_torch(output_tensor)
print(output_tensor.shape)

pytorch_output_tensor = torch.erf(torch_input_tensor)
print("what torch does")
print(pytorch_output_tensor.shape)

ttnn.close_device(device=device)

@jdh8 can you confirm?

@ayerofieiev-tt , if our understanding is correct, this sounds like it is not an eltwise issue.
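In plain torch terms, the transpose in the snippet above swaps the last two dimensions of the tile-padded tensor, turning a (1, 1024) layout into (1024, 1):

```python
import torch

# transpose(-2, -1) swaps the last two dims: (1, 1024) -> (1024, 1)
t = torch.rand(1, 1024)
assert t.transpose(-2, -1).shape == (1024, 1)
```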

@ayerofieiev-tt

@jdh8, I will keep track of this one. We might have to update ops later, but at the moment this is pretty much blocked by the Tensor Layout work.

jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Nov 27, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Nov 28, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Nov 28, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 10, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 10, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 12, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 12, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 19, 2024
jdh8 added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 19, 2024
ayerofieiev-tt added a commit to tenstorrent/pytorch2.0_ttnn that referenced this issue Dec 19, 2024
* Test conversion to `ttnn.floor` with known input variations

* Make import global to reduce overhead

* Convert `aten.ceil` to `ttnn.ceil`

* Try (1066,) with some other univariate functions

The results are inconclusive:
- `cos` passes
- `sqrt` and `floor` fail

* Test unary ops with fast and approximate mode

* Try squeezing out the extraneous dimension for 1-D tensors

* Restore conversion for `aten.remainder.Scalar`

* Fix the workaround to squeeze back to 1-D tensors

* Implement conversion to `ttnn.round`

* Implement conversion to `ttnn.trunc`

* Update conversion for `ttnn.round`

* Lessen PCC for `ttnn.round` kernel (tenstorrent/tt-metal#13851)

* Convert `torch.round` with various decimal places

* Update parameters of `ttnn.round`

* Restore code wrongly removed when rebasing 1355625

* Update list of pointwise unary ops

* Restore general handler for `aten.hardtanh`

* Properly test 1-D cases after working around tenstorrent/tt-metal#12671

* Simplify the workaround for tenstorrent/tt-metal#12671

---------

Co-authored-by: Artem Yerofieiev <[email protected]>
@prajaramanTT

@jdh8 @ayerofieiev-tt Is this still an open issue? If not, can you please close it? Thanks
