This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/Vectorize/
-
Transforms/
-
Vectorize/
2/2
VectorCombine.cpp
-
test/Transforms/VectorCombine/X86/
-
Transforms/
-
VectorCombine/
-
X86/
2/2
extract-fneg-insert.ll

Differential D135278

[VectorCombine] convert scalar fneg with insert/extract to vector fneg
ClosedPublic

Authored by spatel on Oct 5 2022, 8:54 AM.

Download Raw Diff

Details

Reviewers

dmgreen
RKSimon
xbolva00

Commits

rGbaab4aa1ba5f: [VectorCombine] convert scalar fneg with insert/extract to vector fneg

Summary

insertelt DestVec, (fneg (extractelt SrcVec, Index)), Index --> shuffle DestVec, (fneg SrcVec), Mask

This is a specialized form of what could be a more general fold for a binop. It's also possible that fneg is overlooked by SLP in this kind of insert/extract pattern since it's a unary op.

This shows up in the motivating example from #issue 58139, but it won't solve it (that probably requires some x86-specific backend changes). There are also some small enhancements (see TODO comments) that can be done as follow-up patches.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

spatel created this revision.Oct 5 2022, 8:54 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 5 2022, 8:54 AM

Herald added subscribers: pengfei, hiraditya, mcrosier. · View Herald Transcript

spatel requested review of this revision.Oct 5 2022, 8:54 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 5 2022, 8:54 AM

Herald added subscribers: llvm-commits, • pcwang-thead. · View Herald Transcript

Harbormaster completed remote builds in B190507: Diff 465420.Oct 5 2022, 8:55 AM

spatel mentioned this in rG563545685f27: [VectorCombine] remove unused test prefixes; NFC.Oct 6 2022, 6:41 AM

tschuett added a subscriber: tschuett.Oct 9 2022, 7:21 AM

tschuett added inline comments.

llvm/lib/Transforms/Vectorize/VectorCombine.cpp
564	Tiniest nit: You could `Mask.reserve(VecTy->getNumElements());`

LGTM

llvm/lib/Transforms/Vectorize/VectorCombine.cpp
564	If you wanted to you could go even further (assuming we know that Index < VecTy->getNumElements()) SmallVector<int> Mask(VecTy->getNumElements()); std::iota(Mask.begin(), Mask.end(), 0); Mask[Index] = Index + e;
llvm/test/Transforms/VectorCombine/X86/extract-fneg-insert.ll
7	Are you going to look at this? If not please can you raise a bug.

This revision is now accepted and ready to land.Oct 10 2022, 5:16 AM

spatel marked an inline comment as done.Oct 10 2022, 5:56 AM

spatel added inline comments.

llvm/test/Transforms/VectorCombine/X86/extract-fneg-insert.ll
7	I'm not sure what the proper fix will be, so let's track it either way: https://github.com/llvm/llvm-project/issues/58261

Patch updated:
Replace shuffle mask loop with pre-allocated vector and std::iota().

Harbormaster completed remote builds in B191340: Diff 466578.Oct 10 2022, 11:54 AM

LGTM

This revision was landed with ongoing or failed builds.Oct 10 2022, 12:00 PM

Closed by commit rGbaab4aa1ba5f: [VectorCombine] convert scalar fneg with insert/extract to vector fneg (authored by spatel). · Explain Why

This revision was automatically updated to reflect the committed changes.

spatel mentioned this in rG6ace81db3ad9: [VectorCombine] add test with out-of-bounds insert/extract index; NFC.

spatel added a commit: rGbaab4aa1ba5f: [VectorCombine] convert scalar fneg with insert/extract to vector fneg.

Hello,

The following starts crashing with this patch:

opt -passes='vector-combine' bbi-74776.ll -S -o /dev/null

bbi-74776.ll233 BDownload

In D135278#3862060, @uabelho wrote:
Hello,

The following starts crashing with this patch:
opt -passes='vector-combine' bbi-74776.ll -S -o /dev/null
bbi-74776.ll233 BDownload

Thanks! Taking a look now - probably need to restrict the pattern match to true fneg.

In D135278#3862108, @spatel wrote:
In D135278#3862060, @uabelho wrote:
Hello,

The following starts crashing with this patch:
opt -passes='vector-combine' bbi-74776.ll -S -o /dev/null
bbi-74776.ll233 BDownload
Thanks! Taking a look now - probably need to restrict the pattern match to true fneg.

On 2nd thought, we can adjust the capture to handle this kind of pattern:
8d76fbb5f065

Thanks!

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Vectorize/

VectorCombine.cpp

64 lines

test/

Transforms/

VectorCombine/

X86/

extract-fneg-insert.ll