This produces better SASS than right-shift + truncate and is fairly common for
CUDA code that operates on __half2 values represented as opaque integer.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Paths
| Differential D143448
[NVPTX] Lower extraction of upper half of i32/i64 as partial move. ClosedPublic Authored by tra on Feb 6 2023, 5:18 PM.
Details Summary This produces better SASS than right-shift + truncate and is fairly common for
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Feb 7 2023, 11:44 AM This revision was landed with ongoing or failed builds.Feb 7 2023, 2:21 PM Closed by commit rG7045966982f2: [NVPTX] Lower extraction of upper half of i32/i64 as partial move. (authored by tra). · Explain Why This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 495643 llvm/lib/Target/NVPTX/NVPTXInstrInfo.td
llvm/test/CodeGen/NVPTX/f16-instructions.ll
llvm/test/CodeGen/NVPTX/f16x2-instructions.ll
llvm/test/CodeGen/NVPTX/idioms.ll
|