This is the alternate solution discussed in D80131. We load a scalar constant with implicit zeroing of the upper elements using VZEXT_LOAD instead of loading a full vector constant.
Either way seems to be an improvement over materializing the constant in a GPR and then moving it to an XMM register.
I'll rebase that patch to be on top of this, so we can see the incremental test diffs and decide which, if any, is better.
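For readers less familiar with the three strategies being compared, here is a rough source-level sketch using SSE intrinsics. It is not taken from the patch: the constant `kScalar` and the function names are made up for illustration, and an optimizing compiler is free to lower all three to the same instructions. It only shows that the scalar load with implicit zeroing, the full vector constant load, and the GPR-to-XMM transfer produce the same register contents.

```cpp
#include <immintrin.h>  // SSE/SSE2 intrinsics; x86 targets only
#include <array>
#include <cstdint>
#include <cstring>

// Hypothetical constant, chosen only for illustration.
static const float kScalar = 1.5f;

// Copy the four float lanes of an XMM value into an array for inspection.
static std::array<float, 4> lanes(__m128 v) {
  std::array<float, 4> out;
  _mm_storeu_ps(out.data(), v);
  return out;
}

// (a) Scalar load with implicit zeroing: lane 0 comes from memory,
//     lanes 1..3 are set to zero by the load itself.
std::array<float, 4> load_scalar_zext() {
  return lanes(_mm_load_ss(&kScalar));
}

// (b) Full vector constant: all 16 bytes, zeros included, live in memory.
std::array<float, 4> load_full_vector() {
  alignas(16) static const float kVec[4] = {1.5f, 0.0f, 0.0f, 0.0f};
  return lanes(_mm_load_ps(kVec));
}

// (c) Materialize the bit pattern as an integer (a GPR at the machine
//     level), then transfer it to an XMM register; upper lanes are zeroed.
std::array<float, 4> via_gpr() {
  std::uint32_t bits;
  std::memcpy(&bits, &kScalar, sizeof bits);
  __m128i vi = _mm_cvtsi32_si128(static_cast<int>(bits));
  return lanes(_mm_castsi128_ps(vi));
}
```

Variant (a) needs only 4 bytes of constant data, while (b) needs 16; (c) avoids the memory operand but costs an extra cross-domain move.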
Details
- Reviewers
RKSimon craig.topper
Diff Detail
Event Timeline
Abandoning - we can use the vector constant load approach from D80131 without this intermediate step.