Avoid an unnecessary conversion operation when using the result of
ctpop.i32 or ctpop.i16 as an i32, as in both cases the PTX instruction
we run returns an i32.
(Previously, if we used the value as an i32, we'd do an unnecessary
zext+trunc.)
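The redundancy described above can be illustrated with a small software model. This is only a sketch, not the code from the patch: `popc32`, `ctpop16_as_i32_old`, and `ctpop16_as_i32_new` are hypothetical names, and the exact order of conversions in the old lowering is an assumption. It models the i16 case, where the 32-bit count is narrowed and then widened again before being used as an i32.

```cpp
#include <cstdint>

// Software stand-in for a 32-bit population count whose result is 32 bits
// wide, as the summary says the PTX instruction's result is.
std::uint32_t popc32(std::uint32_t x) {
    std::uint32_t count = 0;
    while (x != 0) {
        x &= x - 1;  // clear the lowest set bit
        ++count;
    }
    return count;
}

// Assumed model of the old path: the i32 count is truncated to i16 and
// then zero-extended back to i32 when the user wants an i32.
std::uint32_t ctpop16_as_i32_old(std::uint16_t x) {
    std::uint32_t wide = popc32(x);                            // i32 result
    std::uint16_t narrow = static_cast<std::uint16_t>(wide);   // trunc
    return static_cast<std::uint32_t>(narrow);                 // zext
}

// Assumed model of the improved path: use the i32 count directly.
std::uint32_t ctpop16_as_i32_new(std::uint16_t x) {
    return popc32(x);
}
```

Because a population count of a 16-bit value never exceeds 16, the narrowing and widening round trip cannot change the value, which is why dropping it is safe in this model.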
Differential D28721
[NVPTX] Improve lowering of llvm.ctpop.
Closed · Public
Authored by jlebar on Jan 13 2017, 7:18 PM.
Event Timeline
jlebar updated this object.
This revision is now accepted and ready to land. Jan 17 2017, 1:36 PM
Closed by commit rL292302: [NVPTX] Improve lowering of llvm.ctpop. (authored by jlebar). Jan 17 2017, 4:19 PM
This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 84423
llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
llvm/test/CodeGen/NVPTX/intrinsics.ll