HomePhabricator

AMDGPU: Add a fast path for icmp.i1(src, false, NE)

Description

AMDGPU: Add a fast path for icmp.i1(src, false, NE)

Summary:
This allows moving the condition from the intrinsic to the standard ICmp
opcode, so that LLVM can do simplifications on it. The icmp.i1 intrinsic
is an identity for retrieving the SGPR mask.

And we can also get the mask from and i1, or i1, xor i1.

Reviewers: arsenm, nhaehnle

Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits

Differential Revision: https://reviews.llvm.org/D52060

Details

Committed
marekoJan 14 2019, 6:13 PM
Differential Revision
D52060: AMDGPU: Add a fast path for icmp.i1(src, false, NE)
Parents
rL351149: [AArch64] Adjust the feature set for Exynos
Branches
Unknown
Tags
Unknown