noduplicate prevents unrolling of small loops that happen to have
barriers in them. If a loop has a barrier in it, it is OK to duplicate
it for the unroll.
Details
Details
- Reviewers
• tstellarAMD
Diff Detail
Diff Detail
Paths
| Differential D15595
AMDGPU: Switch barrier intrinsics to using convergent ClosedPublic Authored by arsenm on Dec 16 2015, 3:55 PM.
Details
Summary noduplicate prevents unrolling of small loops that happen to have
Diff Detail Event Timelinearsenm updated this object. This revision is now accepted and ready to land.Dec 17 2015, 7:11 AM
Revision Contents
Diff 43079 lib/Target/AMDGPU/AMDGPUIntrinsics.td
test/CodeGen/AMDGPU/addrspacecast.ll
test/CodeGen/AMDGPU/array-ptr-calc-i32.ll
test/CodeGen/AMDGPU/drop-mem-operand-move-smrd.ll
test/CodeGen/AMDGPU/ds-negative-offset-addressing-mode-loop.ll
test/CodeGen/AMDGPU/ds-sub-offset.ll
test/CodeGen/AMDGPU/ds_read2.ll
test/CodeGen/AMDGPU/ds_read2_superreg.ll
test/CodeGen/AMDGPU/ds_read2st64.ll
test/CodeGen/AMDGPU/ds_write2.ll
test/CodeGen/AMDGPU/ds_write2st64.ll
test/CodeGen/AMDGPU/flat-address-space.ll
test/CodeGen/AMDGPU/indirect-private-64.ll
test/CodeGen/AMDGPU/llvm.AMDGPU.div_fmas.ll
test/CodeGen/AMDGPU/merge-stores.ll
test/CodeGen/AMDGPU/schedule-vs-if-nested-loop-failure.ll
test/CodeGen/AMDGPU/si-triv-disjoint-mem-access.ll
test/CodeGen/AMDGPU/store-barrier.ll
test/CodeGen/AMDGPU/wait.ll
test/Transforms/LoopUnroll/AMDGPU/lit.local.cfg
test/Transforms/LoopUnroll/AMDGPU/unroll-barrier.ll
|