While scanning a block for sequential loads, also check for duplicate instructions and remove them if possible. To do this we have to ensure that for any memory reads, there aren't any writes between the two duplicates and also that memory operations are simple. I've also moved all the tests into a ParallelDSP directory.
Details
Details
Diff Detail
Diff Detail
Event Timeline
Comment Actions
When does this come up, realistically? EarlyCSE already has a stronger form of load CSE.
Comment Actions
This happens because of unroll and jam and I had tried to run cse after that... but doing the more obvious option of running it before parallel dsp solves the problem! I'll do that instead. Thanks.