This is an archive of the discontinued LLVM Phabricator instance.

[ARM] Optimize fp store of extract to integer store if already available.
ClosedPublic

Authored by dmgreen on Feb 5 2021, 10:11 AM.

Details

Summary

Given a floating-point store of an element extracted from a vector, where an integer VGETLANE of that element already exists, storing the existing VGETLANEu directly can be better for performance. As the value is known to already be in an integer register, this can help reduce fp register pressure, removes the need for the fp extract, and allows the use of more integer post-inc stores that are not available with vstr.

This can be a bit narrow in scope, but helps with certain biquad kernels that store shuffled vector elements.
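
To illustrate why the rewrite described in the summary is safe, here is a minimal portable C++ sketch, not the actual patch. It models a 4 x f32 vector as an array and shows that storing the lane's raw 32-bit pattern (standing in for the existing VGETLANEu result) writes the same bytes as storing the fp extract. All names here (Vec4f, extractLaneFP, extractLaneBits, storeViaFP, storeViaInt, storesMatch, checkExample) are hypothetical and do not appear in the LLVM sources.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical model of a 4 x f32 vector register.
using Vec4f = std::array<float, 4>;

// Models the fp lane extract: the value stays a float.
inline float extractLaneFP(const Vec4f &v, unsigned lane) { return v[lane]; }

// Models an integer lane extract (the role VGETLANEu plays in the summary):
// the same lane, read as its raw 32-bit pattern.
inline std::uint32_t extractLaneBits(const Vec4f &v, unsigned lane) {
  std::uint32_t bits;
  std::memcpy(&bits, &v[lane], sizeof bits);
  return bits;
}

// "Before": the store goes through the fp extract.
inline void storeViaFP(float *dst, const Vec4f &v, unsigned lane) {
  *dst = extractLaneFP(v, lane);
}

// "After": the store reuses integer bits that were already extracted.
inline void storeViaInt(float *dst, std::uint32_t bits) {
  std::memcpy(dst, &bits, sizeof bits);
}

// Returns true when both store paths leave identical bytes in memory.
inline bool storesMatch(const Vec4f &v, unsigned lane) {
  float viaFP = 0.0f, viaInt = 0.0f;
  const std::uint32_t bits = extractLaneBits(v, lane); // already available
  storeViaFP(&viaFP, v, lane);
  storeViaInt(&viaInt, bits);
  return std::memcmp(&viaFP, &viaInt, sizeof viaFP) == 0;
}

// Small usage example with ordinary (non-NaN) values.
inline void checkExample() {
  const Vec4f v{1.0f, 2.5f, -3.0f, 4.0f};
  assert(storesMatch(v, 2));
}
```

The sketch only demonstrates that the two stores are equivalent in memory. The register-pressure and post-inc addressing benefits mentioned in the summary are properties of the ARM code generator and are not modelled here.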

Event Timeline

dmgreen created this revision. Feb 5 2021, 10:11 AM
dmgreen requested review of this revision. Feb 5 2021, 10:11 AM
Herald added a project: Restricted Project. Feb 5 2021, 10:11 AM
This revision is now accepted and ready to land. Feb 12 2021, 7:20 AM