This adds some extra costs for reverse shuffles under AArch64, filling in the i16/f16/i8 gaps in the cost model.
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
LGTM
Out of interest, do you have any benchmark results to show how these new costs improves things?
Comment Actions
Thanks. I only noticed this from an issue in the cost of a reversed interleaving group from D124612. It stopped that patch making things worse.