This is an archive of the discontinued LLVM Phabricator instance.

[DAG][TLI][X86][ARM][AArch64] Add `isExtractSubvectorFree` / use it in `foldExtractSubvectorFromShuffleVector()`
AbandonedPublic

Authored by lebedev.ri on Dec 13 2021, 9:37 AM.

Details

Summary

This generalizes the hardcoded profitability check that was just added in D104156.
To my surprize, this does not catch any new cases.

Diff Detail

Event Timeline

lebedev.ri created this revision.Dec 13 2021, 9:37 AM
lebedev.ri requested review of this revision.Dec 13 2021, 9:37 AM

Err, actually make use of the hooks by dropping original ad-hoc check.

ping
@RKSimon this removes the hardcoded assumptions that the free extraction is for 0'th subvector specifically, as we've discussed in D104156 originally.

Any luck with finding a x86 test case?

We do need an X86 test for this.

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
20795

The values here make it seem like replacing a non-cheap operation with two cheap ones yields the same performance (1+1 = 2). contrary to the comment. Is this intentional?

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
12899

So here, it seems like even if the extract is "free" (cost 0) it'll be counted as "cheap" (cost 1).

I wonder if it would be better to combine these functions into a single "getExtractSubvectorCost" target hook.

lebedev.ri marked 2 inline comments as done.Jan 10 2022, 9:21 AM

Any luck with finding a x86 test case?

We do need an X86 test for this.

Sorry, i do not have an x86 test for this..

llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
20795

Not only does it make seem so, that is exactly what the comment is saying.

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
12899

So here, it seems like even if the extract is "free" (cost 0) it'll be counted as "cheap" (cost 1).

Right. Because even free extractions can be done for cheap, if that makes sense.

I wonder if it would be better to combine these functions into a single "getExtractSubvectorCost" target hook.

We could do that, there's already *TTIImpl::getShuffleCost(), the problem being, what will be the "cheap budget",
i.e. how should the uses of isExtractSubvectorCheap() be adjusted?

greened added inline comments.Jan 11 2022, 1:35 PM
llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
20795

It's not what the comment is saying. The comment says "a win" which to my mind means, "performance improvement." If the cost is the same, where's the improvement? Maybe the comment is confusing me. This is used to calculate Budget which I guess limits how many extracts we can do (so two at most?). Perhaps the comment could be expanded to explain a bit more about what is going on.

llvm/lib/Target/AArch64/AArch64ISelLowering.cpp
12899

Right. Because even free extractions can be done for cheap, if that makes sense.

In some contexts I suppose. A "free" extract is "cheap" if all you care about is whether an extract is "cheap." A "free" extract is not "cheap" if you care about the relative cost of the two. I could see the above code being very confusing in some circumstances.

how should the uses of isExtractSubvectorCheap() be adjusted?

Not knowing all of the users of that function, I can't say. It's probably context-dependent.

lebedev.ri abandoned this revision.Jan 17 2022, 2:36 PM
lebedev.ri marked 2 inline comments as done.