This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Directly use rcp intrinsic in idiv expansions
ClosedPublic

Authored by arsenm on Feb 11 2020, 7:36 AM.

Details

Summary

Since natural fdiv lowering is now more conservative even with
denormals disabled, we get a slower expansion from just a plain
1.0/fdiv. Directly emit the rcp intrinsic when using it to implement
integer division to avoid a pointlessly complex sequence.

Diff Detail

Event Timeline

arsenm created this revision.Feb 11 2020, 7:36 AM
Herald added a project: Restricted Project. · View Herald TranscriptFeb 11 2020, 7:36 AM
This revision is now accepted and ready to land.Feb 11 2020, 10:54 AM