Search from i64 reducing phis, as well as i32, to allow the generation of smlald instructions. I've changed the way loads are generated because we no longer always have an i32 Acc value, but we are still always interested in loading an i32.
Details
Details
Diff Detail
Diff Detail
- Repository
- rL LLVM
Event Timeline
Comment Actions
Hi Sjoerd,
On looking at some IR generated from C, the multiplications were performed at i32 and then sext to i64. So I've now added those patterns to MatchParallelMACSequences. This has required an interface change to AddMACCandidiate because of the need to look through the sext to get to the mul.