This patch adds another variant class to identify zero-idiom VPERM2F128rr instructions.
On Jaguar, a VPERM wih bit 3 and 7 of the mask set, is a zero-idiom.
Please let me know if okay to commit.
Thanks,
Andrea
Paths
| Differential D52663
[X86][BtVer2] Teach how to identify zero-idio VPERM2F128rr instructions. ClosedPublic Authored by andreadb on Sep 28 2018, 10:57 AM.
Details Summary This patch adds another variant class to identify zero-idiom VPERM2F128rr instructions. On Jaguar, a VPERM wih bit 3 and 7 of the mask set, is a zero-idiom. Please let me know if okay to commit. Thanks,
Diff Detail Event Timelineandreadb retitled this revision from [X86][BtVer2] VPERM2F128rr instructions with Mask bits 3 and 7 are also zero-idioms. to [X86][BtVer2] Teach how to identify zero-idio VPERM2F128rr instructions..Sep 28 2018, 10:58 AM
Comment Actions Addressed review comments. Also, moved the predicate definition to X86SchedPredicates.td This revision is now accepted and ready to land.Sep 29 2018, 7:11 AM Closed by commit rL343452: [X86][BtVer2] Teach how to identify zero-idiom VPERM2F128rr instructions. (authored by adibiagio). · Explain WhyOct 1 2018, 3:36 AM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 167514 lib/Target/X86/X86ScheduleBtVer2.td
test/tools/llvm-mca/X86/BtVer2/zero-idioms-avx-256.s
|
If I have read things correctly, 136 represents a bit mask, it might be clearer to represent that value as 0x88 or 0b10001000