This is an archive of the discontinued LLVM Phabricator instance.

Removing sqrtd2 and sqrtf4 from list of vectorizable function with MASSV
ClosedPublic

Authored by masoud.ataei on Feb 25 2021, 10:14 AM.

Details

Summary

Under -O3 and -Ofast, the MASSV conversion prevents the sqrt call to be inlined.
Inline sqrt is faster than MASSV call on leppc.

Diff Detail

Event Timeline

masoud.ataei created this revision.Feb 25 2021, 10:14 AM
masoud.ataei requested review of this revision.Feb 25 2021, 10:14 AM
Herald added a project: Restricted Project. · View Herald TranscriptFeb 25 2021, 10:14 AM
This comment was removed by pjeeva01.
pjeeva01 added inline comments.Feb 25 2021, 1:34 PM
llvm/test/Transforms/LoopVectorize/PowerPC/massv-calls.ll
237

I think modifying this and the following tests to ensure following calls are not generated would be useful.

__sqrtd2_massv{{.*}}<2 x double>
__sqrtf4_massv{{.*}}<4 x float>

Checking sqrtd2_massv and sqrtf4_massv is not generated.

masoud.ataei marked an inline comment as done.Feb 26 2021, 2:12 PM

Addressed.

pjeeva01 accepted this revision.Mar 1 2021, 6:40 AM

LGTM

This revision is now accepted and ready to land.Mar 1 2021, 6:40 AM