Old expansion was 20 VGPRs, 78 SGPRs and ~380 instructions.
This expansion is 11 VGPRs, 12 SGPRs and ~120 instructions.
Passes OpenCL conformance test_integer_ops quick_[u]long_math
Differential D38607
[AMDGPU] New 64 bit div/rem expansion
Closed, Public
Authored by rampitec on Oct 5 2017, 4:23 PM.
Event Timeline
Herald added subscribers: t-tye, Anastasia, tpr and 5 others. (Oct 5 2017, 4:23 PM)
javed.absar added inline comments.
This revision is now accepted and ready to land. (Oct 6 2017, 7:19 AM)
Yes, I have replaced control flow with cndmasks at the end of blocks. Hence the if/endif comments.
Closed by commit rL315081: [AMDGPU] New 64 bit div/rem expansion (authored by rampitec). (Oct 6 2017, 10:26 AM)
This revision was automatically updated to reflect the committed changes.
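The reply about replacing control flow with cndmasks can be illustrated with a small scalar model. The sketch below is not the code from this revision: the function names and the single quotient/remainder correction step are hypothetical, chosen only to show how an if/endif region becomes a pair of selects (the role V_CNDMASK_B32 plays on AMDGPU). Both candidate values are computed unconditionally and the condition only picks which one survives.

```cpp
#include <cassert>
#include <cstdint>
#include <utility>

// Hypothetical correction step: the quotient estimate q may be one too small,
// in which case the remainder r is still >= the divisor d.

// Branchy form: an if/endif region guards the update.
inline std::pair<uint64_t, uint64_t> correct_branchy(uint64_t q, uint64_t r,
                                                     uint64_t d) {
    if (r >= d) {  // if
        q += 1;
        r -= d;
    }              // endif
    return {q, r};
}

// Select form: compute both outcomes, then choose. Each ternary models one
// conditional-mask select; no branch is needed.
inline std::pair<uint64_t, uint64_t> correct_select(uint64_t q, uint64_t r,
                                                    uint64_t d) {
    const bool take = r >= d;
    const uint64_t q_inc = q + 1;
    const uint64_t r_sub = r - d;  // wraps when r < d; that value is discarded
    q = take ? q_inc : q;
    r = take ? r_sub : r;
    return {q, r};
}

// Small self-check that the two forms agree on one input.
inline bool corrections_agree(uint64_t q, uint64_t r, uint64_t d) {
    assert(d != 0);
    return correct_branchy(q, r, d) == correct_select(q, r, d);
}
```

On a GPU the select form avoids saving and restoring the execution mask around a short region, at the cost of always executing both arithmetic operations. That trade-off is a general observation, not a measurement taken from this revision.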
Revision Contents
Diff 118035
llvm/trunk/lib/Target/AMDGPU/AMDGPUISelLowering.cpp
llvm/trunk/test/CodeGen/AMDGPU/sdivrem64.ll
llvm/trunk/test/CodeGen/AMDGPU/udivrem64.ll