ds_ordered_count can now simultaneously operate on up to 4 dwords
in a single instruction, which are taken from (and returned to)
lanes 0..3 of a single VGPR.
Change-Id: I19b6e7b0732b617c10a779a7f9c0303eec7dd276
Paths
| Differential D63716
AMDGPU/GFX10: implement ds_ordered_count changes ClosedPublic Authored by nhaehnle on Jun 24 2019, 7:18 AM.
Details Summary ds_ordered_count can now simultaneously operate on up to 4 dwords Change-Id: I19b6e7b0732b617c10a779a7f9c0303eec7dd276
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Jun 24 2019, 8:30 AM
nhaehnle added inline comments.
Closed by commit rL364815: AMDGPU/GFX10: implement ds_ordered_count changes (authored by nha). · Explain WhyJul 1 2019, 10:18 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 207370 llvm/trunk/include/llvm/IR/IntrinsicsAMDGPU.td
llvm/trunk/lib/Target/AMDGPU/SIISelLowering.cpp
llvm/trunk/test/CodeGen/AMDGPU/llvm.amdgcn.ds.ordered.add.gfx10.ll
|