This is an archive of the discontinued LLVM Phabricator instance.

Optimize insertqi when we copy all the lower 64 bits.
ClosedPublic

Authored by filcab on Apr 11 2014, 9:08 AM.

Details

Summary

Since the upper 64 bits of the destination register are undefined when
performing this operation, we can substitute it and let the optimizer
figure out that only a copy is needed.

Diff Detail

Event Timeline

filcab updated this revision to Unknown Object (????).Apr 14 2014, 7:52 PM

Updated the patch to add range merging, generating fewer insertqi when possible.

This also allows us to find more places to do the first opt.

+Nadav

Hi Filipe,

I like the idea of this transform. Nadav will have a better idea than I about whether this is the right place to go about it.

-Jim

filcab accepted this revision.Apr 23 2014, 6:17 PM
filcab added a reviewer: filcab.
This revision is now accepted and ready to land.Apr 23 2014, 6:17 PM
filcab closed this revision.Apr 23 2014, 6:17 PM