This is an archive of the discontinued LLVM Phabricator instance.

[SVE][CodeGen] Improve codegen for some FP insert_subvector cases
Closed · Public

Authored by david-arm on Nov 5 2021, 5:07 AM.

Details

Summary

When inserting an unpacked FP subvector into a packed vector, we
can simply cast the unpacked value to a packed value, since
both types are legal for SVE. The cast value can then be used as
the input to the UZP instruction, which avoids expanding the
operation through the stack.
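
The lane movement described above can be sketched with a small scalar model. This is an illustration only, not code from the patch. It assumes vscale = 1, an nxv2f32 subvector inserted at index 0 of an nxv4f32 vector, and that the preserved half of the destination is widened with a zero-extending unpack before the UZP step (the summary does not state that detail). All function names here are hypothetical.

```cpp
#include <array>
#include <cstdint>
#include <cstring>

// One SVE register modelled as four 32-bit lanes (assumption: vscale = 1).
using Lanes = std::array<std::uint32_t, 4>;

// Reinterpret a float as its raw 32-bit pattern and back.
static std::uint32_t bits(float f) {
  std::uint32_t u;
  std::memcpy(&u, &f, sizeof u);
  return u;
}
static float fromBits(std::uint32_t u) {
  float f;
  std::memcpy(&f, &u, sizeof f);
  return f;
}

// Unpacked two-element f32 vector: each element occupies the low 32 bits of
// a 64-bit container, i.e. lanes 0 and 2. The other lanes carry no defined
// value, so the model fills them with junk. "Casting" this to a packed type
// is a no-op on the register contents.
static Lanes unpacked(float a, float b) {
  return {bits(a), 0xDEADBEEFu, bits(b), 0xDEADBEEFu};
}

// Model of a zero-extending unpack of the high half: lanes 2 and 3 are each
// widened to a 64-bit container (assumed step, not named in the summary).
static Lanes unpackHigh(const Lanes &v) { return {v[2], 0u, v[3], 0u}; }

// Model of UZP1 on 32-bit lanes: even lanes of lhs, then even lanes of rhs.
static Lanes uzp1(const Lanes &lhs, const Lanes &rhs) {
  return {lhs[0], lhs[2], rhs[0], rhs[2]};
}

// Hypothetical helper: replace the low half of vec with the unpacked sub.
static Lanes insertLowHalf(const Lanes &vec, const Lanes &sub) {
  return uzp1(sub, unpackHigh(vec));
}
```

In this model the junk lanes of the unpacked value are discarded by the UZP step, which is why no store and reload through the stack is needed to build the result.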

Event Timeline

david-arm created this revision. Nov 5 2021, 5:07 AM
david-arm requested review of this revision. Nov 5 2021, 5:07 AM
Herald added a project: Restricted Project. Nov 5 2021, 5:07 AM
sdesmalen accepted this revision. Nov 8 2021, 3:15 AM

LGTM!

This revision is now accepted and ready to land. Nov 8 2021, 3:15 AM
This revision was landed with ongoing or failed builds. Nov 8 2021, 5:46 AM
This revision was automatically updated to reflect the committed changes.