lgc: add dialect GroupMemcpyOp #2802

xazhangAMD · 2023-11-07T21:18:48Z

No description provided.

amdvlk-admin · 2023-11-07T22:49:45Z

Test summary for commit `da88fdc`

CTS tests (Failed: 0/138184)

Built with version 1.3.5.2

Ubuntu navi3x, Srdcvk

Passed: 35162/69058 (50.9%)
Failed: 0/69058 (0.0%)
Not Supported: 33896/69058 (49.1%)
Warnings: 0/69058 (0.0%)

Ubuntu navi2x, Srdcvk

Passed: 35242/69126 (51.0%)
Failed: 0/69126 (0.0%)
Not Supported: 33884/69126 (49.0%)
Warnings: 0/69126 (0.0%)

lgc/patch/PatchEntryPointMutate.cpp

amdrexu · 2023-11-08T03:30:27Z

lgc/patch/PatchEntryPointMutate.cpp

+
+// =====================================================================================================================
+// Lower GroupMemcpyOp - Copy memory using threads in a workgroup (scope=2) or subgroup (scope=3).
+void PatchEntryPointMutate::lowerGroupMemcpy(GroupMemcpyOp &groupMemcpyOp) {


I don't think it is a good idea to place the handling of group memcpy here because the pass is aimed to handle entry-point mutation. Other responsibilities should be moved to other passes or even creating a new pass according to LLVM design philosophy.

The handling of task/mesh shader could be put in MeshTaskShader.cpp. For CS, many operations are straightforward, if possible, maybe we can place it on InOutBuilder since readCsBuiltIn() can read back any CS built-in so you can do anything you want. This is true for task shader as well and share the handling. If that is impossible, we can move the handling of CS to PatchInOutImportExport.

I had a similar thought in the internal review - split the op into 2: one for task/mesh and another for compute.

You can check shader stage when lowering this op so we can differentiate its usages in task/mesh shader or in compute shader. If you decide to make two dedicated ops, that is fine as well.

I discussed this with Ruiling, we both believe the lowering of this op is better to be moved to other appropriate passes other than this pass. Also, if you can share us with a LGC file (.lgc generated by frontend with the option --emit-lgc) showing the usage of this op we can better evaluate your future refactoring change in the review.

It is natural to handle task/mesh shader in MeshTaskShader.cpp but for CS, it is also weird to place the code in InOutBuilder or PatchInOutImportExport. Actually my only intent is to use this for task shader only. The llpcfe standalone tool doesn't seem to support -emit-lgc but the dump should have all the information you need.

I modified code in a commit here.

And a pipeline dump attached.
PipelineTaskMesh_0xB812ED624A368A8F.txt

Thank you. I see the usage. Your requirement is similar to the usage of PatchInitializeWorkgroupMemory::initializeWithZero. We add a new pass to handle this. Anyway, it is fine to keep CS handling in entry-point mutation as a temporary solution and move the handling of task/mesh shader to MeshTaskShader class. I plan to rework the pass PatchInitializeWorkgroupMemory and try to enable the usage for your case in the near future.

lgc: add dialect GroupMemcpyOp

da88fdc

xazhangAMD requested a review from a team as a code owner November 7, 2023 21:18

linqun approved these changes Nov 8, 2023

View reviewed changes

xazhangAMD merged commit 8e7a79b into dev Nov 8, 2023

amdrexu reviewed Nov 8, 2023

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lgc: add dialect GroupMemcpyOp #2802

lgc: add dialect GroupMemcpyOp #2802

xazhangAMD commented Nov 7, 2023

amdvlk-admin commented Nov 7, 2023

amdrexu Nov 8, 2023

xazhangAMD Nov 8, 2023

amdrexu Nov 9, 2023

xazhangAMD Nov 9, 2023

amdrexu Nov 10, 2023 •

edited

Loading

lgc: add dialect GroupMemcpyOp #2802

lgc: add dialect GroupMemcpyOp #2802

Conversation

xazhangAMD commented Nov 7, 2023

amdvlk-admin commented Nov 7, 2023

Test summary for commit da88fdc

amdrexu Nov 8, 2023

Choose a reason for hiding this comment

xazhangAMD Nov 8, 2023

Choose a reason for hiding this comment

amdrexu Nov 9, 2023

Choose a reason for hiding this comment

xazhangAMD Nov 9, 2023

Choose a reason for hiding this comment

amdrexu Nov 10, 2023 • edited Loading

Choose a reason for hiding this comment

Test summary for commit `da88fdc`

amdrexu Nov 10, 2023 •

edited

Loading