Since the upper 64 bits of the destination register are undefined when
performing this operation, we can substitute it and let the optimizer
figure out that only a copy is needed.
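As a rough illustration of why a copy is enough, here is a minimal sketch of the low 64 bits of an insertqi-style bit-field insert. The function name `insertqi_low64` and this model are assumptions made for illustration, not code from the patch. It also assumes that an encoded length of 0 means 64 bits and that a field running past bit 63 gives an undefined result. Under those assumptions, inserting all 64 bits at index 0 leaves nothing of the destination's low half, and because the upper half is undefined anyway, the source value can stand in for the whole result.

```python
MASK64 = (1 << 64) - 1


def insertqi_low64(dest, src, length, index):
    """Hypothetical model: low 64 bits produced by an insertqi-style insert.

    Copies the low `length` bits of `src` into `dest` starting at bit `index`.
    The upper 64 bits of the real destination register are undefined, so they
    are not modelled here.
    """
    length &= 0x3F  # both fields are treated as 6 bits wide (assumption)
    index &= 0x3F
    if length == 0:
        length = 64  # assumption: an encoded length of 0 means all 64 bits
    if index + length > 64:
        raise ValueError("undefined result: field runs past bit 63")
    field_mask = ((1 << length) - 1) << index
    kept = dest & ~field_mask & MASK64       # destination bits outside the field
    inserted = (src << index) & field_mask   # source bits moved into the field
    return kept | inserted


# A full-width insert at index 0 leaves no destination bits behind.
full = insertqi_low64(0x1111222233334444, 0xAAAABBBBCCCCDDDD, 0, 0)
assert full == 0xAAAABBBBCCCCDDDD
```

A partial insert, such as 8 bits at index 4, still depends on the destination, so only the full-width case reduces to a copy in this model.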
Event Timeline
Comment Actions
Updated the patch to add range merging, which generates fewer insertqi instructions when possible.
This also exposes more places where the first optimization applies.
Comment Actions
+Nadav
Hi Filipe,
I like the idea of this transform. Nadav will have a better idea than I do about whether this is the right place to do it.
-Jim