This is an archive of the discontinued LLVM Phabricator instance.

[X86][SSE] Add selective commutation support for insertps (PR40340)
ClosedPublic

Authored by RKSimon on Jan 17 2019, 3:48 AM.

Details

Summary

When we are inserting 1 "inline" element, and zeroing 2 of the other elements then we can safely commute the insertps source inputs to improve memory folding.

Diff Detail

Repository
rL LLVM

Event Timeline

RKSimon created this revision.Jan 17 2019, 3:48 AM
This revision is now accepted and ready to land.Jan 21 2019, 12:01 PM
This revision was automatically updated to reflect the committed changes.