-
Notifications
You must be signed in to change notification settings - Fork 110
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ttnn.moreh_cumsum causes low accuracy on Bloom model #17594
Comments
@amalbasaTT can we create a simple TT-NN test that fails? |
Also, now that we know about this issue, can someone from the team start looking into this deeper? Is there an expertise in the team? |
We had some experience debugging some composite ops and sharding in cpp, however not as much with kernels per se. We most certainly can look deeper into it, but would appreciate it if you could provide us with some docs to figure out in more detail how some of the lower level functions which appear in kernel implementation work. |
Unit test:
|
@ayerofieiev-tt I have looked into it, and I found out that we've had a similar issue while testing "add" operation where second op was int scalar (#17019 (comment)). |
We looked into this a bit deeper. It turned out that Workaround for now can be that we change the lowering to include Considering moreh_cumsum , it's using @umadevimcw any insight? |
Ticket
Link to Github Issue
Describe the bug
ttnn.moreh_cumsum causes low PCC (0.8703882797784891) on Bloom model:
- input: [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 1, 1]
- shape: [1, 32, 1, 1]
- dtype: ttnn.uint32
- layout: Layout::TILE
Output tensor has 8388608 where input tensor has 1.
Issue happens at
ttnn_moreh_cumsum = ttnn.moreh_cumsum(ttnn_to_device, 1, )
To Reproduce
Steps to reproduce the behavior:
Bloom_code.py
andBloom_inputs.pickle
by running command:Bloom_code.py
modify 109. line of code (_tensor_constant0 = 0.7071067690849304
) to_tensor_constant0 = torch.tensor(0.7071067690849304)
(check issue Bloom_code.py forward method fails at aten.lift_fresh_copy.default(_tensor_constant0, ) because _tensor_constant0 is not a Tensor pytorch2.0_ttnn#743)Bloom_code.py
:Low accuracy error will be at
test_accuracy(cumsum, ttnn_reshape_1)
The text was updated successfully, but these errors were encountered: