This patch changes these functions: vectorizeLoadInsert isExtractExtractCheap foldExtractedCmps scalarizeBinopOrCmp getShuffleExtract foldBitcastShuf to use the class InstructionCost when calling TTI.get<something>Cost(). This patch is part of a series of patches to use InstructionCost instead of unsigned/int for the cost model functions. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html See this patch for the introduction of the type: https://reviews.llvm.org/D91174 Observation: This patch adds the test || !NewCost.isValid(), because we want to return false when: !NewCost.isValid && !OldCost.isValid()->the cost to transform it expensive and !NewCost.isValid() && OldCost.isValid() Therefore for simplication we only add test for !NewCost.isValid()
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
201 | the or is redundant. | |
252 | Because of this assert, the branches will never be taken. This will result in different behavior in release vs debug mode. Either remove the assert, or remove the two early returns. Regardless, invalid costs are guaranteed to compare higher than valid costs, so the early returns are redundant. | |
325 | can std::min be used here? InstructionCost has overloaded comparison operators and a total ordering. Assuming it can be, we should probably get rid of InstructionCost::min and InstructionCost::max. That can be a different patch. | |
368–369 | this is redundant. If OldCost is valid, and NewCost is invalid, then OldCost < NewCost returns true. | |
537 | the or is redundant | |
648 | the or is redundant | |
750 | the or is redundant |
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
252 | Hi @ctetreau, after discussion during the last SVE sync call @paulwalker-arm thought we shouldn't be relying upon the lexicographical ordering that defines invalid costs to be infinitely expensive. He suggested that doing so is actually a bug in the code. So the route we've been taking so far is to either check for validity explicitly or assert that it's valid. If you think this is the wrong approach here then we can perhaps discuss it and agree on a consistent approach? | |
368–369 | Again, @CarolineConcatto is just adding checks here as per discussion on the last SVE sync call, but we're happy to discuss the correct approach. |
I haven't followed the details enough to comment on the changes directly, but thanks for the cleanup! The mismatched signed/unsigned cost model APIs are/were a mess.
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
252 | I'm not sure how I missed that conversation, but as you may recall from the review thread for D91174, I fought hard for the total ordering to be added and documented so that it's guaranteed to be true. This is exactly the sort of case you'd want to this ordering for; the validity checks are guaranteed to be redundant with the greater-than checks. Additionally, adding the redundant validity checks is more error prone, because it's more operator-heavy lines of code you can mess up. |
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
252 | To be clear I have nothing against relying on the total ordering but I feel if the transformation is expecting instructions to have an actual cost then it should either assert or explicitly check for such. An example of this is LoopVectorize where there has already been extensive validation to ensure a loop is vectorisable and thus not being able to cost the loop is a sure sign there's either a bug in LoopVectorize's isLegal code or the cost functions themselves. |
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
252 | It seems that the algorithm requires at least one of the Costs to be valid (it has to choose either Ext0 or Ext1), so if the assert is changed to: assert((Cost0.isValid() || Cost1.isValid()) && "At least one of Cost0 and Cost1 should be valid"); the existing code below should be sufficient and the two early returns that were added can be removed like @ctetreau suggested. | |
368–369 | Similar to above, change the assert to check OldCost.isValid() || NewCost.isValid() and remove the early exit. |
Thank you, everyone, for the review.
I have removed the redundant invalid check.
Also, thank you for making clear that invalid, atm, means as well high cost.
I'll have that in mind for the next patches.
I would say "infinitely costly", not "high cost". Somebody may have "a lot" of LLVMBucks, nobody has infinity LLVMBucks.
Sorry to nitpick, but it's important to get these things right initially before everybody sees some flawed example and emulates it. If you need something to have a really high cost, you should just pick some really high valid cost. If you need something to never be within any cost budget, you should use invalid.
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
252 |
If it's an honest-to-gosh bug for some call to return invalid, then this is fine. I feel like this should never happen in any function that returns InstructionCost though. This would be akin to swallowing an exception and calling exit(). |
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
200 | Is this needed?
| |
246 | Reasonable to return nullptr? If neither cost is valid, then neither of the inputs should be replaced | |
364–365 | Would it be reasonable to return false here? If all costs involved are invalid, then I would say the transform is not cheap. | |
536 | reasonable to return false here? If neither cost is valid, then do not do the transform | |
647 | reasonable to return false? | |
749 | reasonable to return false? |
Hi @ctetreau,
Thank you for the review.
So I removed all the asserts and added the earlier return if both costs are invalid, because in this case it means that the transformation is not cheap.
If only OldCost is invalid I believe we should do nothing, for the same reason I removed the test for !NewCost.isValid().
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
246 | I think it is fine to add if the test if both are equal following the logic of if (Index0 == Index1) | |
364–365 | If I remove the assert and leave the test to do its job the return will be false, because OldCost would be equal to NewCost. | |
647 | If both are invalid, maybe. | |
749 | If both are invalid, maybe. |
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
536 | Does this assert need to be removed still? |
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
536 | Yes, sorry missed that! |
Added two nits, but LGTM otherwise.
llvm/lib/Transforms/Vectorize/VectorCombine.cpp | ||
---|---|---|
647–648 | nit: || !NewCost.isValid() should be sufficient (see the other comment). | |
750 | I don't think the or was redundant. If the new cost is invalid, then it shouldn't do the transform. With only testing OldCost < NewCost, we'd get: isValid? OldCost NewCost (OldCost < NewCost) result -------------------------------------------------------- 1. true true OldCost.Val < NewCost.Val ? 2. true false Valid < Invalid true 3. false true Invalid < Valid false 4. false false Invalid < Invalid false However, 4. should be 'true' in order to return early from the function. isValid? OldCost NewCost (OldCost < NewCost || !NewCost.isValid) result ----------------------------------------------------------------------- 1. true true OldCost.Val < NewCost.Val || false ? 2. true false Valid < Invalid || true true 3. false true Invalid < Valid || false false 4. false false Invalid < Invalid || true true Gives us the result we want. nit: based on that I believe !OldCost.isValid() && is now redundant. |
Is this needed?