This is an archive of the discontinued LLVM Phabricator instance.

[X86][SSE] Remove unnecessary bit-and in pshufb vector ctlz (PR39703)
ClosedPublic

Authored by RKSimon on Nov 19 2018, 9:11 AM.

Details

Summary

The SSE PSHUFB lowering of vector ctlz works at the i4 nibble level. As detailed in PR39703, we were masking the input to isolate the lower nibble, but the lower-nibble result is only used when the upper nibble is known to be zero. In that case the unmasked byte already equals its lower nibble, so it is safe to remove the mask and save an instruction.
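To illustrate why the mask is redundant, here is a minimal scalar sketch of a single byte lane. It is not the code from this patch: the names `kNibbleLUT`, `pshufb_lane`, `ctlz8`, `ctlz8_reference` and `check_all_bytes` are hypothetical, and the model assumes the documented PSHUFB behaviour that a set bit 7 in the index byte produces zero while otherwise only the low four index bits select a table entry.

```cpp
#include <cassert>
#include <cstdint>

// Per-nibble leading-zero counts: kNibbleLUT[n] is ctlz of the 4-bit value n.
static const uint8_t kNibbleLUT[16] = {4, 3, 2, 2, 1, 1, 1, 1,
                                       0, 0, 0, 0, 0, 0, 0, 0};

// Scalar model of one PSHUFB byte lane (assumption: bit 7 of the index
// zeroes the result, otherwise the low four bits index the table).
static uint8_t pshufb_lane(const uint8_t table[16], uint8_t index) {
  return (index & 0x80) ? 0 : table[index & 0x0F];
}

// ctlz of one byte via two nibble lookups. With mask_low_nibble == false the
// raw byte is used as the low lookup index, modelling the removed bit-and.
static uint8_t ctlz8(uint8_t x, bool mask_low_nibble) {
  uint8_t hi = static_cast<uint8_t>(x >> 4);
  uint8_t lo_index = mask_low_nibble ? static_cast<uint8_t>(x & 0x0F) : x;
  uint8_t hi_count = pshufb_lane(kNibbleLUT, hi);
  uint8_t lo_count = pshufb_lane(kNibbleLUT, lo_index);
  // The low count only contributes when the upper nibble is zero, so any
  // garbage it holds for a non-zero upper nibble is discarded here.
  uint8_t hi_is_zero = (hi == 0) ? 0xFF : 0x00;
  return static_cast<uint8_t>(hi_count + (lo_count & hi_is_zero));
}

// Bit-by-bit reference used only for checking.
static uint8_t ctlz8_reference(uint8_t x) {
  uint8_t n = 0;
  for (int bit = 7; bit >= 0 && !((x >> bit) & 1); --bit)
    ++n;
  return n;
}

// Exhaustively compare the masked and unmasked variants over all bytes.
static void check_all_bytes() {
  for (int v = 0; v < 256; ++v) {
    uint8_t x = static_cast<uint8_t>(v);
    assert(ctlz8(x, true) == ctlz8_reference(x));
    assert(ctlz8(x, false) == ctlz8_reference(x));
  }
}
```

Calling `check_all_bytes()` exercises every byte value. When the upper nibble is non-zero the unmasked lookup may return a different low count than the masked one, but that value is zeroed before the add, which is the property the summary relies on.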

Diff Detail

Repository
rL LLVM

Event Timeline

RKSimon created this revision. Nov 19 2018, 9:11 AM
This revision is now accepted and ready to land. Nov 19 2018, 10:14 AM
This revision was automatically updated to reflect the committed changes.