This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Improve regbankselect for 64-bit VGPR ctlz/cttz_zero_undef
ClosedPublic

Authored by foad on Aug 4 2021, 4:10 AM.

Details

Summary

We can improve on the generic splitting by using ffbh/ffbl, which have a
defined result when the input is zero.

Diff Detail

Event Timeline

foad created this revision.Aug 4 2021, 4:10 AM
foad requested review of this revision.Aug 4 2021, 4:10 AM
Herald added a project: Restricted Project. · View Herald TranscriptAug 4 2021, 4:10 AM
hliao added a comment.Aug 4 2021, 3:26 PM

Could you add SelectionDAG support as well?

arsenm accepted this revision.Aug 4 2021, 3:42 PM
This revision is now accepted and ready to land.Aug 4 2021, 3:42 PM
foad added a comment.Aug 5 2021, 2:40 AM
This comment was removed by foad.
foad updated this revision to Diff 364444.Aug 5 2021, 6:37 AM

Rebase.

arsenm accepted this revision.Aug 5 2021, 7:36 AM
foad added a comment.Aug 5 2021, 7:51 AM

@arsenm accepted this revision.

Thanks. Could you take a look at the prerequisite D107441 please?

This revision was landed with ongoing or failed builds.Aug 6 2021, 1:42 AM
This revision was automatically updated to reflect the committed changes.