We shouldn't do the transformation if we either have bypass delay OR
the new opcode has worse performance. Previous code was incorrectly
using AND.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Paths
| Differential D147727
[X86] Fix perf bug in `permilps` -> `shufd` in X86FixupInstTuning. ClosedPublic Authored by goldstein.w.n on Apr 6 2023, 11:12 AM.
Details Summary We shouldn't do the transformation if we either have bypass delay OR
Diff Detail
Event Timelinegoldstein.w.n added a child revision: D147728: [X86] Add inst fixup for `unpckpd` -> `unpckqdq`..Apr 6 2023, 11:22 AM goldstein.w.n added a parent revision: D147726: [X86] Improve inst tuning tests for X86FixupInstTuning Pass; NFC. This revision is now accepted and ready to land.Apr 6 2023, 1:24 PM Closed by commit rG2ce1698a343c: [X86] Fix perf bug in `permilps` -> `shufd` in X86FixupInstTuning. (authored by goldstein.w.n). · Explain WhyApr 9 2023, 10:17 PM This revision was automatically updated to reflect the committed changes.
Revision Contents
Diff 512069 llvm/lib/Target/X86/X86FixupInstTuning.cpp
llvm/test/CodeGen/X86/tuning-shuffle-permilps-avx512.ll
llvm/test/CodeGen/X86/tuning-shuffle-permilps.ll
|