This adds some extra costs for reverse shuffles under AArch64, filling in the i16/f16/i8 gaps in the cost model.
Details
Details
Diff Detail
Diff Detail
Unit Tests
Unit Tests
Time | Test | |
---|---|---|
60,140 ms | x64 debian > MLIR.Examples/standalone::test.toy |
Event Timeline
Comment Actions
LGTM
Out of interest, do you have any benchmark results to show how these new costs improves things?
Comment Actions
Thanks. I only noticed this from an issue in the cost of a reversed interleaving group from D124612. It stopped that patch making things worse.