This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/lib/Analysis/
-
lib/
-
Analysis/
-
ScalarEvolution.cpp

Differential D104322

[SCEV] PtrToInt on non-integral pointers is allowed
ClosedPublic

Authored by lebedev.ri on Jun 15 2021, 2:29 PM.

Download Raw Diff

Details

Reviewers

efriedma
reames
mkazantsev
nikic

Commits

rGa3113df21994: [SCEV] PtrToInt on non-integral pointers is allowed

Summary

As per (committed without review) @reames's rGac81cb7e6dde9b0890ee1780eae94ab96743569b change,
we are now allowed to produce ptrtoint for non-integral pointers.
This will unblock further unbreaking of SCEV regarding int-vs-pointer type confusion.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lebedev.ri created this revision.Jun 15 2021, 2:29 PM

Herald added subscribers: javed.absar, hiraditya. · View Herald TranscriptJun 15 2021, 2:29 PM

lebedev.ri requested review of this revision.Jun 15 2021, 2:29 PM

Harbormaster completed remote builds in B109379: Diff 352248.Jun 15 2021, 5:01 PM

mkazantsev accepted this revision.Jun 15 2021, 9:48 PM

This revision is now accepted and ready to land.Jun 15 2021, 9:48 PM

Closed by commit rGa3113df21994: [SCEV] PtrToInt on non-integral pointers is allowed (authored by lebedev.ri). · Explain WhyJun 16 2021, 12:25 AM

This revision was automatically updated to reflect the committed changes.

lebedev.ri added a commit: rGa3113df21994: [SCEV] PtrToInt on non-integral pointers is allowed.

JFYI, I don't object to this change, but I am hesitant about the direction you seem to be indicating. Please don't use the excuse of our handling of non-integral types being somewhat broken to further break them. I'll comment on future reviews if appropriate.

As a drive-by note, it would be great if you could expand LangRef on non-integral pointers a bit. It made sense to me when it specified that you can't use ptrtoint on a non-integral pointer, but without that limitation, it's not really clear to me what the actual difference between a non-integral pointer and a normal one is. What transforms are you not allowed to perform on a non-integral pointer that you can perform on a normal one?

Roughly, we can either end up with SCEV effectively being inoperable for non-integral pointers,
or the users of non-integral pointers having issues with lowering the ptrtoint's produced by SCEV.
I don't know what is worse, and i'm presently not sure i care, but i do believe that we can not
leave the current SCEV casual-ness of treating pointers as integers as-is.
I actually had some of the patches @efriedma posted locally, and i agree with them.

In D104322#2822083, @nikic wrote:

As a drive-by note, it would be great if you could expand LangRef on non-integral pointers a bit. It made sense to me when it specified that you can't use ptrtoint on a non-integral pointer, but without that limitation, it's not really clear to me what the actual difference between a non-integral pointer and a normal one is. What transforms are you not allowed to perform on a non-integral pointer that you can perform on a normal one?

Yes, i would also like to see such documentation, especially if non-integral pointers
are going to be used as an "arbitrary" roadblock for SCEV changes. I would have posted
that in the review for the commit mentioned, but there was none, which also highlights
the problem around non-integral pointer status in llvm :)

In D104322#2822104, @lebedev.ri wrote:

In D104322#2822083, @nikic wrote:

As a drive-by note, it would be great if you could expand LangRef on non-integral pointers a bit. It made sense to me when it specified that you can't use ptrtoint on a non-integral pointer, but without that limitation, it's not really clear to me what the actual difference between a non-integral pointer and a normal one is. What transforms are you not allowed to perform on a non-integral pointer that you can perform on a normal one?

Yes, i would also like to see such documentation, especially if non-integral pointers
are going to be used as an "arbitrary" roadblock for SCEV changes. I would have posted
that in the review for the commit mentioned, but there was none, which also highlights
the problem around non-integral pointer status in llvm :)

On the documentation side, I'd love to, but I'm honestly not sure *how* to. There's two inter-related problems here. The first is the semantics of an inttoptr is highly dependent on the target for non-integral pointers. At the moment, it can basically only be used to implement non-inlined built-in routines in any practical way. The second issue is that our definition of the integral pointer types themselves appear to be in flux, and are very vague about certain key details. The result is I'm left unsure how to formally specify them. This is why I used the implementation defined wording I did.

On the SCEV side, I understand the frustration, but I think you're also mischaracterizing slightly. SCEV has long had the notion of subtracting two pointers which has been "questionable" the whole time as the semantics of subtracting two unrelated pointers is unclear from the underlying IE. (IR doesn't have subtract, but it does have icmp which is more or less the same.) Eli's recent changes - which are making progress btw, even if slow - are the first I've seen to really raise the question if subtract is a primitive which should be representable for pointers in SCEV. (I'll also add that the confusion around whether we could still have sizeless pointers - which I admittedly contributed to - has only recently been cleared up.)

In terms of forward progress, I am willing to accept crippling SCEV for NI pointers provided that all the changes are otherwise well structured and make sense. I'm not thrilled, and I will be a skeptical reviewer, but I won't block changes which are well justified.

Hm, random vaguely OT thought, would it be worthwhile to add an explicit pointer difference node in SCEV? This would allow us to form subtracts of pointers without needing the nasty multiply by negative one trick, and we could optimize them to the lossless inttoptr form for integral pointers, and not for NI pointers. Might be worth a bit of further thought. I'll follow up if I still think this is a good idea in an hour. :)

In D104322#2822189, @reames wrote:

In D104322#2822104, @lebedev.ri wrote:

In D104322#2822083, @nikic wrote:

As a drive-by note, it would be great if you could expand LangRef on non-integral pointers a bit. It made sense to me when it specified that you can't use ptrtoint on a non-integral pointer, but without that limitation, it's not really clear to me what the actual difference between a non-integral pointer and a normal one is. What transforms are you not allowed to perform on a non-integral pointer that you can perform on a normal one?

Yes, i would also like to see such documentation, especially if non-integral pointers
are going to be used as an "arbitrary" roadblock for SCEV changes. I would have posted
that in the review for the commit mentioned, but there was none, which also highlights
the problem around non-integral pointer status in llvm :)

On the documentation side, I'd love to, but I'm honestly not sure *how* to. There's two inter-related problems here. The first is the semantics of an inttoptr is highly dependent on the target for non-integral pointers. At the moment, it can basically only be used to implement non-inlined built-in routines in any practical way. The second issue is that our definition of the integral pointer types themselves appear to be in flux, and are very vague about certain key details. The result is I'm left unsure how to formally specify them. This is why I used the implementation defined wording I did.

On the SCEV side, I understand the frustration, but I think you're also mischaracterizing slightly. SCEV has long had the notion of subtracting two pointers which has been "questionable" the whole time as the semantics of subtracting two unrelated pointers is unclear from the underlying IE. (IR doesn't have subtract, but it does have icmp which is more or less the same.) Eli's recent changes - which are making progress btw, even if slow - are the first I've seen to really raise the question if subtract is a primitive which should be representable for pointers in SCEV. (I'll also add that the confusion around whether we could still have sizeless pointers - which I admittedly contributed to - has only recently been cleared up.)

In terms of forward progress, I am willing to accept crippling SCEV for NI pointers provided that all the changes are otherwise well structured and make sense. I'm not thrilled, and I will be a skeptical reviewer, but I won't block changes which are well justified.

Glad that we have established this.

Hm, random vaguely OT thought, would it be worthwhile to add an explicit pointer difference node in SCEV? This would allow us to form subtracts of pointers without needing the nasty multiply by negative one trick, and we could optimize them to the lossless inttoptr form for integral pointers, and not for NI pointers. Might be worth a bit of further thought. I'll follow up if I still think this is a good idea in an hour. :)

Doesn't sound that thrilling to me.

reames mentioned this in D104403: [SCEV] Avoid pointer subtraction of non-integral pointers [WIP].Jun 16 2021, 11:00 AM

Took a shot at a minimal crippling of SCEV over in https://reviews.llvm.org/D104403. That isn't a patch ready to land, but it should let us collect some perf numbers on the impact of crippling SCEV's ability to reason about non-integral pointers.

Slightly OT, but I realized an increasing reliance on ptrtoint in code SCEV analyzes and expands might interact badly with the current discussion around pointer provenance and end up negatively impacting AA results. Just a thought at the moment. (This is not related to NI pointers at all.)

reames mentioned this in D104547: [langref] attempt to clarify semantics of inttoptr/ptrtoint for non-integral types.Jun 18 2021, 9:34 AM

reames mentioned this in rGf74bb95bbe4d: [langref] attempt to clarify semantics of inttoptr/ptrtoint for non-integral….Jul 12 2021, 8:49 AM

Revision Contents

Path

Size

llvm/

lib/

Analysis/

ScalarEvolution.cpp

3 lines

Diff 352248

llvm/lib/Analysis/ScalarEvolution.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,054 Lines • ▼ Show 20 Lines	const SCEV ScalarEvolution::getLosslessPtrToIntExpr(const SCEV Op,
assert(Depth <= 1 &&		assert(Depth <= 1 &&
"getLosslessPtrToIntExpr() should self-recurse at most once.");		"getLosslessPtrToIntExpr() should self-recurse at most once.");

// We could be called with an integer-typed operands during SCEV rewrites.		// We could be called with an integer-typed operands during SCEV rewrites.
// Since the operand is an integer already, just perform zext/trunc/self cast.		// Since the operand is an integer already, just perform zext/trunc/self cast.
if (!Op->getType()->isPointerTy())		if (!Op->getType()->isPointerTy())
return Op;		return Op;

assert(!getDataLayout().isNonIntegralPointerType(Op->getType()) &&
"Source pointer type must be integral for ptrtoint!");

// What would be an ID for such a SCEV cast expression?		// What would be an ID for such a SCEV cast expression?
FoldingSetNodeID ID;		FoldingSetNodeID ID;
ID.AddInteger(scPtrToInt);		ID.AddInteger(scPtrToInt);
ID.AddPointer(Op);		ID.AddPointer(Op);

void *IP = nullptr;		void *IP = nullptr;

// Is there already an expression for such a cast?		// Is there already an expression for such a cast?
▲ Show 20 Lines • Show All 12,601 Lines • Show Last 20 Lines