Details
- Reviewers
• tstellarAMD jlebar
Diff Detail
Event Timeline
Well this one is interesting.
I think I see what you're getting at here; if we can ASC away all uses of a generic pointer except the icmp, then keeping around the generic pointers in icmp is just unnecessary register pressure.
But at least on NVPTX, I think this is only safe if we can transform *both* pointers in the icmp, and moreover only if both pointers are transformed to the same AS.
The IR type system requires both be the same type, so this is done here. Also I just noticed I forgot to update the TODO saying to do this
lib/Transforms/Scalar/InferAddressSpaces.cpp | ||
---|---|---|
747 | I don't see where we check that NewV and OtherNewV have the same AS. Shouldn't it be right here? Sorry if it's obvious; I'm breaking my rule against reviewing patches late at night, and may be embarrassed in the morning. |
lib/Transforms/Scalar/InferAddressSpaces.cpp | ||
---|---|---|
747 | I was confused about what you were saying, but the check turns out to have been added in the patch I forgot to squash which fixed that bug |
Just so I know how to read these tests: AS(4) is the flat AS?
lib/Transforms/Scalar/InferAddressSpaces.cpp | ||
---|---|---|
759 | Can we have a test for icmp x, undef and icmp undef, x? | |
762 | If KOtherSrc->AS() == NewAS, then isn't this ASC redundant? It seems we should only need it if KOtherSrc is undef, and in that case, can we just substitute a new undef value, instead of ASC'ing the old undef? |
Yes
lib/Transforms/Scalar/InferAddressSpaces.cpp | ||
---|---|---|
759 | I can add another test for undef, x, but there's no point in trying to handle these in the code since these are canonicalized to constants in the RHS | |
762 | I made this too conservative by adding the new check here, so this is backwards. The addrspace check is redundant with the cast. We need to cast the constants, otherwise it's making an assumption about the bit values of the pointer. For undef I think replacing the type directly would be OK, but the ConstantExpr cast should do the right thing (and I think this will be necessary in the future with freeze anyway) |
lib/Transforms/Scalar/InferAddressSpaces.cpp | ||
---|---|---|
759 | sgtm | |
772 | Since this is now generic, we probably shouldn't say "shared" here. | |
777 | Is this comment now out of date? I'm confused about "instead of currently". Would also prefer for the comment to start with prose. For example: // If we can infer that both pointers are in the same addrspace, transform e.g. // %cmp = icmp eq float*%p, %q // into // %cmp = icmp eq float addrspace(3)* %new_p, %new_q Is that all we need to say here? |
I don't see where we check that NewV and OtherNewV have the same AS. Shouldn't it be right here?
Sorry if it's obvious; I'm breaking my rule against reviewing patches late at night, and may be embarrassed in the morning.