This is an archive of the discontinued LLVM Phabricator instance.

[ARM][ParallelDSP] Handle squaring multiplies
ClosedPublic

Authored by samparker on Apr 20 2020, 7:21 AM.

Details

Summary

The logic in ARMParallelDSP is setup to merge two 16-bits loads into a 32-bit load and feed them into the smlads. This requires that four loads are combined for the four inputs, but there wasn't actually a check for this.

Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=45616

Diff Detail

Event Timeline

samparker created this revision.Apr 20 2020, 7:21 AM
SjoerdMeijer accepted this revision.Apr 20 2020, 8:48 AM

Looks like a good fix to me.

This revision is now accepted and ready to land.Apr 20 2020, 8:48 AM
This revision was automatically updated to reflect the committed changes.