If we have BWI, we can truncate in a much simpler way by using vpmovwb. This even works without VLX by using the wider zmm->ymm truncate with a subvector extract.
Details
Details
Diff Detail
Diff Detail
Paths
| Differential D38375
[X86] Make use of vpmovwb when possible in LowerMULH ClosedPublic Authored by craig.topper on Sep 28 2017, 11:58 AM.
Details Summary If we have BWI, we can truncate in a much simpler way by using vpmovwb. This even works without VLX by using the wider zmm->ymm truncate with a subvector extract.
Diff Detail Event TimelineThis revision is now accepted and ready to land.Sep 28 2017, 12:09 PM Closed by commit rL314457: [X86] Make use of vpmovwb when possible in LowerMULH (authored by ctopper). · Explain WhySep 28 2017, 1:12 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 117033 lib/Target/X86/X86ISelLowering.cpp
test/CodeGen/X86/vector-idiv-sdiv-128.ll
test/CodeGen/X86/vector-idiv-sdiv-256.ll
test/CodeGen/X86/vector-idiv-udiv-128.ll
test/CodeGen/X86/vector-idiv-udiv-256.ll
|