If we have BWI, we can truncate in a much simpler way by using vpmovwb. This even works without VLX by using the wider zmm->ymm truncate with a subvector extract.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
Paths
| Differential D38375
[X86] Make use of vpmovwb when possible in LowerMULH ClosedPublic Authored by craig.topper on Sep 28 2017, 11:58 AM.
Details Summary If we have BWI, we can truncate in a much simpler way by using vpmovwb. This even works without VLX by using the wider zmm->ymm truncate with a subvector extract.
Diff Detail
Event TimelineThis revision is now accepted and ready to land.Sep 28 2017, 12:09 PM Closed by commit rL314457: [X86] Make use of vpmovwb when possible in LowerMULH (authored by ctopper). · Explain WhySep 28 2017, 1:12 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 117045 llvm/trunk/lib/Target/X86/X86ISelLowering.cpp
llvm/trunk/test/CodeGen/X86/vector-idiv-sdiv-128.ll
llvm/trunk/test/CodeGen/X86/vector-idiv-sdiv-256.ll
llvm/trunk/test/CodeGen/X86/vector-idiv-udiv-128.ll
llvm/trunk/test/CodeGen/X86/vector-idiv-udiv-256.ll
|