This is a stopgap solution until something more along the lines of D123264 and D123265 gets accepted. It conservatively does the folding of the low part of LLA addresses into the offset of load instructions. In my benchmarks it worked better than I was expecting, so maybe this is worth considering if the other patches get stalled?
Details
Details
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Unit Tests
Unit Tests
Time | Test | |
---|---|---|
60,030 ms | x64 debian > MLIR.Examples/standalone::test.toy | |
60,050 ms | x64 debian > libFuzzer.libFuzzer::large.test |