This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Transforms/IPO/
-
llvm/
-
Transforms/
-
IPO/
-
Attributor.h
-
lib/Transforms/IPO/
-
Transforms/
-
IPO/
6/15
Attributor.cpp
-
test/Transforms/Attributor/
-
Transforms/
-
Attributor/
-
IPConstantProp/
-
PR26044.ll
-
fp-bc-icmp-const-fold.ll
-
solve-after-each-resolving-undefs-for-function.ll
-
undefined_behavior.ll

Differential D71799

[Attributor] AAUndefinedBehavior: Check for branches on undef value.
ClosedPublic

Authored by baziotis on Dec 21 2019, 2:05 PM.

Download Raw Diff

Details

Reviewers

jdoerfert
sstefan1
uenoku

Commits

rGef4febd85b54: [Attributor] AAUndefinedBehavior: Check for branches on undef value.

Summary

A branch is considered UB if it depends on an undefined / uninitialized value.
At this point this handles simple UB branches in the form: br i1 undef, ...
We query AAValueSimplify to get a value for the branch condition, so the branch
can be more complicated than just: br i1 undef, ....

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

baziotis created this revision.Dec 21 2019, 2:05 PM

Herald added a reviewer: sstefan1. · View Herald TranscriptDec 21 2019, 2:05 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: llvm-commits, jfb, hiraditya. · View Herald Transcript

jdoerfert added inline comments.Dec 21 2019, 11:30 PM

llvm/lib/Transforms/IPO/Attributor.cpp
2049–2051	Split it in two calls since the pointer stuff and the control flow stuff (for branch, switch, ...) is conceptually different.

Note that there's somewhat relevant prior art,
e.g. llvm/lib/Transforms/Instrumentation/PoisonChecking.cpp (D64215)
and https://github.com/AliveToolkit/alive2.
Would be great to not have this much duplication, but a single all-powerful one, but not sure it's possible (yet?)

Separate branch instructions and memory accessing instructions.

Note that there's somewhat relevant prior art,
e.g. llvm/lib/Transforms/Instrumentation/PoisonChecking.cpp (D64215)
and https://github.com/AliveToolkit/alive2.
Would be great to not have this much duplication, but a single all-powerful one, but not sure it's possible (yet?)

Thanks for the reference.
I guess you mean for things like this: https://github.com/AliveToolkit/alive2/blob/master/tests/unit/undef.opt, i.e. propagating
the undef.
For the PoisonChecking that you referenced, from a quick glance I could not see how it relates :/ Could you be more specific?
It seems that poison checking adds runtime checks.

llvm/lib/Transforms/IPO/Attributor.cpp
2036–2039	Note that here it's possibly wrong and I forgot to comment yesterday. I didn't know exactly how to do it but here's the problem. If it has a value and it is not undef, then it's not UB -> OK If it has a value and it's undef, then it is UB -> OK But... If it doesn't have a value, we consider it not UB. Well, I'm not familiar with the internals of `AAValueSimplify`, but looking comments around, there were some like "No value _yet_". Which means that right now we may not have a value but we could in the future. And that value may be undef. This is no problem for this patch as it tries to handle cases where undef is caught in (hasValue && isa<Undef>). But eventually, `AAValueSimplify` could uncover things for us here and we may lose them because we put the instruction to `NoUBInsts`.

In the current patch, I think an instruction may be inserted to both UBInsts and NoUBInsts in some cases when AAValueSimplify changes its assumption.

In my understanding, the idea of this deduction is
" We assume br instruction causes UB. If you can prove that the instruction *doesn't* cause UB, we remove that assumption".

So I think we don't need to have both UBInsts and NoUBInsts because if an instruction is not in NoUBInsts then it is assumed to cause UB.
( You can find that AAISDeadFunction is similar. If BB is not in AssumedLiveBlocks, then it is assumed to be Dead`).

In D71799#1794216, @uenoku wrote:

In the current patch, I think an instruction might be inserted to both UBInsts and NoUBInsts in some cases when AAValueSimplify changes its assumption.

Yes, I forgot to add the check that guards this the previous patches (check the first if in InspectMemAccessInstForUB).

In my understanding, the idea of this deduction is
" We assume br instruction causes UB. If you can prove that the instruction *doesn't* cause UB, we remove that assumption".

Yes exactly. Well, my initial assumption was this:

SimplifiedV = ...;
if (!SimplifiedV.hasValue())
  ... // Do nothing for now, no value yet.
else {
  Val = SimplifiedV.getValue();
  if (Val is undef)
    UBInsts.insert;
  else
    NoUBInsts.insert;
}

Then, now that I see that again apparently I got confused somewhere so let me change this quickly. :P
Is this what you had in mind ?

So I think we don't need to have both UBInsts and NoUBInsts because if an instruction is not in NoUBInsts then it is assumed to cause UB.
( You can find that AAISDeadFunction is similar. If BB is not in AssumedLiveBlocks, then it is assumed to be Dead`).

As I noted above, these 2 sets should have no common elements. With that, having both of them just makes some things easier
(like stats, looping over UB instructions in manifest, checking if an instruction is UB).

In D71799#1794247, @baziotis wrote:
In D71799#1794216, @uenoku wrote:

In the current patch, I think an instruction might be inserted to both UBInsts and NoUBInsts in some cases when AAValueSimplify changes its assumption.

Yes, I forgot to add the check that guards this the previous patches (check the first if in InspectMemAccessInstForUB).

In my understanding, the idea of this deduction is
" We assume br instruction causes UB. If you can prove that the instruction *doesn't* cause UB, we remove that assumption".

Yes exactly. Well, my initial assumption was this:
SimplifiedV = ...;
if (!SimplifiedV.hasValue())
  ... // Do nothing for now, no value yet.
else {
  Val = SimplifiedV.getValue();
  if (Val is undef)
    UBInsts.insert;
  else
    NoUBInsts.insert;
}
Then, now that I see that again apparently I got confused somewhere so let me change this quickly. :P
Is this what you had in mind ?

So I think we don't need to have both UBInsts and NoUBInsts because if an instruction is not in NoUBInsts then it is assumed to cause UB.
( You can find that AAISDeadFunction is similar. If BB is not in AssumedLiveBlocks, then it is assumed to be Dead`).

As I noted above, these 2 sets should have no common elements. With that, having both of them just makes some things easier
(like stats, looping over UB instructions in manifest, checking if an instruction is UB).

bool isAssumedToCauseUB(I*){
   return !NoUBInsts.count(I);
}
...
SimplifiedV = ...;
if (!SimplifiedV.hasValue())
  // Do nothing. Assumption holds (Because the value might be simplified to `undef`)
else {
  Val = SimplifiedV.getValue();
  if (Val is undef)
    // Do nothing. Assumption holds.
  else
    NoUBInsts.insert;
}

I think this will work. It is no problem to have both 2 sets (but a bit redundant). If so,

SimplifiedV = ...;
if (!SimplifiedV.hasValue())
  // Assumption holds (Because the value might be simplified to `undef`)
  UBInsts.insert
else {
  Val = SimplifiedV.getValue();
  if (Val is undef)
    //Assumption holds.
    UBInsts.insert
  else
    NoUBInsts.insert;
    UBInsts.remove;
}

SimplifiedV = ...;
if (!SimplifiedV.hasValue())
  ... // Do nothing for now, no value yet.
else {
  Val = SimplifiedV.getValue();
  if (Val is undef)
    UBInsts.insert;
  else
    NoUBInsts.insert;
}

Note that while this is what makes sense to me, Johannes told me that if SimplifiedV gives None (i.e. it doesn't have value), then we can assume
that it is undef but I don't know why this is true.

Edit: Just saw your comment, which you seem to not assume that.
In your code, why insert it in UBInsts and then remove it? Since we query AAValueSimplify, isn't that going to call us again and thus at some point get the value (which in turn means we don't insert it and just wait for when we'll be called again).

In D71799#1794262, @baziotis wrote:

Note that while this is what makes sense to me, Johannes told me that if SimplifiedV gives None (i.e. it doesn't have value), then we can assume
that it is undef but I don't know why this is true.

A state of SimplifiedV is Optional<Value*> representing its simplified associated value V. Initially, set to None. If a candidate V1 is found, it is set to Some(V1). If another candidate V2 is found, trying to unify V1 and V2.
When the simplified value is None, we haven't found a candidate yet, or there is no candidate(like int f(x) { return f(x);}). So you can choose any arbitrary value(=undef) in that assumption.

Edit: Just saw your comment, which you seem to not assume that.
In your code, why insert it in UBInsts and then remove it? Since we query AAValueSimplify, isn't that going to call us again and thus at some point get the value (which in turn means we don't insert it and just wait for when we'll be called again).

The simplified value may change (None -> concrete value) so we need to track.

In D71799#1794326, @uenoku wrote:

In D71799#1794262, @baziotis wrote:

Note that while this is what makes sense to me, Johannes told me that if SimplifiedV gives None (i.e. it doesn't have value), then we can assume
that it is undef but I don't know why this is true.

A state of SimplifiedV is Optional<Value*> representing its simplified associated value V. Initially, set to None. If a candidate V1 is found, it is set to Some(V1). If another candidate V2 is found, trying to unify V1 and V2.
When the simplified value is None, we haven't found a candidate yet, or there is no candidate(like int f(x) { return f(x);}). So you can choose any arbitrary value(=undef) in that assumption.

I agree but I assumed wrongly apparently. You see, with "we can assume it is undef" I thought that I should also add it to the UBInsts set. Up to that point, if I were to do that, it would be wrong since
if something is added to UBInsts, it would never be checked again. But apparently Johannes did not mean this.

Edit: Just saw your comment, which you seem to not assume that.
In your code, why insert it in UBInsts and then remove it? Since we query AAValueSimplify, isn't that going to call us again and thus at some point get the value (which in turn means we don't insert it and just wait for when we'll be called again).

The simplified value may change (None -> concrete value) so we need to track.

I agree yes. My point was: At some point we _will_ get a concrete value (or, candidate), aren't we? If so, why add it and then remove it from the set instead of just waiting until we get a value? Hence my initial code.
But now I realize that we have to add it since e.g. isAssumedToCauseUB() and stats depend on it. So, I think your version with only NoUBInsts set is better with an additional UBInstsSize (for stats).

I agree but I assumed wrongly apparently. You see, with "we can assume it is undef" I thought that I should also add it to the UBInsts set. Up to that point, if I were to do that, it would be wrong since
if something is added to UBInsts, it would never be checked again. But apparently Johannes did not mean this.

I got the point! I missed that UB is intended to be used for liveness. Sorry about that. I'll rethink the problem. But it seems for me that current implementation regards *assumed* UBInsts as *known* UBInsts. Because once I is assumed to have UB, we never visit I. I guess this will cause invalid deduction.

I agree yes. My point was: At some point, we _will_ get a concrete value (or, candidate), aren't we? If so, why add it and then remove it from the set instead of just waiting until we get a value? Hence my initial code.

I think we won't get a concrete value for None in the iterations.

In D71799#1794378, @uenoku wrote:

I agree but I assumed wrongly apparently. You see, with "we can assume it is undef" I thought that I should also add it to the UBInsts set. Up to that point, if I were to do that, it would be wrong since
if something is added to UBInsts, it would never be checked again. But apparently Johannes did not mean this.

I got the point! I missed that UB is intended to be used for liveness. Sorry about that. I'll rethink the problem. But it seems for me that current implementation regards *assumed* UBInsts as *known* UBInsts. Because once I is assumed to have UB, we never visit I. I guess this will cause invalid deduction.

I agree yes. My point was: At some point, we _will_ get a concrete value (or, candidate), aren't we? If so, why add it and then remove it from the set instead of just waiting until we get a value? Hence my initial code.

I think we won't get a concrete value for None in the iterations.

Well, actually I am sorry for that. The reason I haven't uploaded a diff yet is that I came across what you just said. Basically, I changed the code to keep 2 sets, one for KnownNoUBInsts and another for KnownUBInsts. With that the code becomes quite better as we can do:

if (!SimplifiedV.hasValue()) {
  // No value yet, we can assume any value: assume this is undef BUT
  // this is not _known_ so we don't put in the known set.
} else {
  if (undef) {
    // insert in KnownUB
  } else {
    // insert in NoUB.
  }
}

That is better because:
a) We can use the KnownUB set for the stats
b) We can use KnownNoUB set for the isAssumedToCauseUB.

But I can't progress because as you said, for some reason we never get a concrete value in the iterations. So, in the manifest there are 2 cases:

make unreachable only those in KnownUB. The problem with that is that exactly because we don't get a concrete value, in something like this:

define i1 @ret_undef() {
  ret i1 undef
}

define void @test() {
  %cond = call i1 @ret_undef()
  br i1 %cond, ...

the branch never makes it to the KnownUB.

Make unreachable any instruction that isAssumedToCauseUB. Those are all the instructions that are not in KnownNoUB. That apart from the fact that it doesn't seem all that correct,

it also causes stack dumps. :P

Edit: Forgot to mention that of course we never re-process any instruction in either of the sets.

Well, scratch all that, this is wrong as well. For one, we can't assume that an instruction is UB just because it isn't in the KnownNoUBInsts. If we go with that implementation, we will consider UB instructions such as unconditional branches or really any instruction that it is never checked.

I think you can't use assumption (getAssumedSimplifiedValue) for the known information. You need to use only known information.

In D71799#1794391, @uenoku wrote:

I think you can't use assumption (getAssumedSimplifiedValue) for the known information. You need to use only known information.

In what part you're referring to? case 2) of the manifest cases?

Edit: i.e. that I end up making blocks unreachable on assumed info?

if (!SimplifiedV.hasValue()) {
  // No value yet, we can assume any value: assume this is undef BUT
  // this is not _known_ so we don't put in the known set.
} else {
  These are also assumption. You can't use these as known
  if (undef) {
    // insert in KnownUB 
  } else {
    // insert in NoUB.
  }
}

In D71799#1794405, @uenoku wrote:

if (!SimplifiedV.hasValue()) {
  // No value yet, we can assume any value: assume this is undef BUT
  // this is not _known_ so we don't put in the known set.
} else {
  These are also assumption. You can't use these as known
  if (undef) {
    // insert in KnownUB 
  } else {
    // insert in NoUB.
  }
}

Oh right, I have to call ValueSimplifyAA.isKnown().

I haven't read all of the discussion so it might as well be possible you converged on this already but I'll say it anyway:

Assumed information can used other assumed information, known information only known information.
You can make AAUndefBehavior track assumed information instead of known information but then we need to look at the not yet known facts in every updateImpl iteration again to make sure the assumed status is still justified.

In D71799#1794454, @jdoerfert wrote:

I haven't read all of the discussion so it might as well be possible you converged on this already but I'll say it anyway:

Assumed information can used other assumed information, known information only known information.
You can make AAUndefBehavior track assumed information instead of known information but then we need to look at the not yet known facts in every updateImpl iteration again to make sure the assumed status is still justified.

Alright, that makes sense!
So, one quick question that I hope will help solve some issues: If !SimplifiedV.hasValue() but ValueSimplifyAA.isKnown(), then it is known that the value is undef?

In D71799#1794464, @baziotis wrote:

In D71799#1794454, @jdoerfert wrote:

I haven't read all of the discussion so it might as well be possible you converged on this already but I'll say it anyway:

Assumed information can used other assumed information, known information only known information.
You can make AAUndefBehavior track assumed information instead of known information but then we need to look at the not yet known facts in every updateImpl iteration again to make sure the assumed status is still justified.

Alright, that makes sense!
So, one quick question that I hope will help solve some issues: If !SimplifiedV.hasValue() but ValueSimplifyAA.isKnown(), then it is known that the value is undef?

It can't happen but it is semantically correct.

uenoku added a reviewer: uenoku.Dec 22 2019, 10:44 PM

In D71799#1794602, @uenoku wrote:

In D71799#1794464, @baziotis wrote:

Alright, that makes sense!
So, one quick question that I hope will help solve some issues: If !SimplifiedV.hasValue() but ValueSimplifyAA.isKnown(), then it is known that the value is undef?

It can't happen but it is semantically correct.

It happens though :) Unless I messed up something. It happens with e.g. this code:

define i1 @ret_undef() {
  ret i1 undef
}

define void @cond_br() {
  %cond = call i1 @ret_undef()
  br i1 %cond, label %t, label %e
t:
  ret void
e:
  ret void
}

And in the Attributor:

if (!SimplifiedV.hasValue()) {
  if (ValueSimplifyAA.isKnown())
    dbgs() << "IS IT UNDEF?\n";
  ...
}

I see the message. Sorry btw that I don't know exactly how AAValueSimplify works. When I started this patch, I assumed it was in everyone's best interest
to not spend time in it right now, so I'm guessing from looking small pieces of its code.

In D71799#1794762, @baziotis wrote:
In D71799#1794602, @uenoku wrote:

In D71799#1794464, @baziotis wrote:

Alright, that makes sense!
So, one quick question that I hope will help solve some issues: If !SimplifiedV.hasValue() but ValueSimplifyAA.isKnown(), then it is known that the value is undef?

It can't happen but it is semantically correct.

It happens though :) Unless I messed up something. It happens with e.g. this code:
define i1 @ret_undef() {
  ret i1 undef
}

define void @cond_br() {
  %cond = call i1 @ret_undef()
  br i1 %cond, label %t, label %e
t:
  ret void
e:
  ret void
}
And in the Attributor:
if (!SimplifiedV.hasValue()) {
  if (ValueSimplifyAA.isKnown())
    dbgs() << "IS IT UNDEF?\n";
  ...
}
I see the message. Sorry btw that I don't know exactly how AAValueSimplify works. When I started this patch, I assumed it was in everyone's best interest
to not spend time in it right now, so I'm guessing from looking small pieces of its code.

Sorry for my lack of words. I thought you were talking about in updateImpl. I think it can't happen in updates but can happen once reaches to a fix point.

In D71799#1794768, @uenoku wrote:

Sorry for my lack of words. I thought you were talking about in updateImpl. It can't happen in updates but can happen once reaches to a fix point.

I am talking about updateImpl(), the code above is inside updateImpl(). It seems true though that if it happens, then a fixpoint has been reached.
Essentially was assumption is that AAValueSimplify seems to return behave weirdly when the value undef (i.e. it returns None while an undef value might be known). So with that,
if it happens, we can deduce that the instruction is UB in updateImpl() and that pretty much solves the previous problems.

In D71799#1794782, @baziotis wrote:

In D71799#1794768, @uenoku wrote:

Sorry for my lack of words. I thought you were talking about in updateImpl. It can't happen in updates but can happen once reaches to a fix point.

I am talking about updateImpl(), the code above is inside updateImpl(). It seems true though that if it happens, then a fixpoint has been reached.
Essentially was assumption is that AAValueSimplify seems to return behave weirdly when the value undef (i.e. it returns None while an undef value might be known). So with that,
if it happens, we can deduce that the instruction is UB in updateImpl() and that pretty much solves the previous problems.

Sorry for confusing you. I have missed that the shortcut was introduced(https://github.com/llvm/llvm-project/commit/2dad729f0c7b8665d362baecd8ff52449b26051d). I agree that None and isKnown() means undef.

SimplifiedV = ...;
  if (Simplified. isKnown() && (!SimplifiedV.hasValue() || (SimplifiedV.getValue() == undef))
    KnownUBInsts.insert;
}

Finally, I think this will work anyway! I should have suggested this to you first;) Really sorry!

In D71799#1794897, @uenoku wrote:
Sorry for confusing you. I have missed that the shortcut was introduced(https://github.com/llvm/llvm-project/commit/2dad729f0c7b8665d362baecd8ff52449b26051d). I agree that None and isKnown() means undef.
SimplifiedV = ...;
  if (Simplified. isKnown() && (!SimplifiedV.hasValue() || (SimplifiedV.getValue() == undef))
    KnownUBInsts.insert;
}
Finally, I think this will work anyway! I should have suggested this to you first;) Really sorry!

No worries, thanks for your time. :) Yes it will work, I've tested it yesterday but I was just waiting for an affirmation on the aforementioned question.

Use only known sets: One for known UB instructions and one for known _not_ UB instructions. The analysis correctness depends on the NoUB one. The 2 sets are used so that no same instruction is re-processed. The knownUB is also used for stats and also to change blocks to unreachable in manifest (we make unreachable the blocks that contain the instructions in this set). Note: These 2 sets are disjoint.
We use AAValueSimplify but only if the value is known. There are 2 caveats:
1. If the value is known but getAssumedSimplifiedValue() gives us None (no value), then we can assume it has reached a fixpoint and the value is undef.
2. For some reason, for branches where undef is actually written (i.e. br i1 undef, ...), the AAValueSimplify doesn't tell us it is known. Thus, before we even start querying AAValueSimplify, we check the standard LLVM Value if it is undef.

Ongoing:

Because of the point 2) above, I don't know whether we ever need the case (isKnown && hasValue && the value is undef), since that was supposed to handled the simple cases, but it doesn't because we don't get the isKnown part.
I think it needs more tests.

Disclaimer: I read only part of the conversation.

In D71799#1794768, @uenoku wrote:

In D71799#1794762, @baziotis wrote:

I see the message. Sorry btw that I don't know exactly how AAValueSimplify works. When I started this patch, I assumed it was in everyone's best interest
to not spend time in it right now, so I'm guessing from looking small pieces of its code.

Sorry for my lack of words. I thought you were talking about in updateImpl. I think it can't happen in updates but can happen once reaches to a fix point.

FWIW, I think @uenoku comment is correct here but it might help to elaborate:

The way AAValueSimplify is build ensures that only once a fixpoint is reached the simplified value is "known". One could also say, once a simplified value is known we know a fixpoint had to be reached.
Now the interesting part here is that this means a fixpoint for the AAValueSimplify object. The Attributor will determine fixpoints for attributes eagerly, thus even if others are not there yet and still iterating. It will even inform them (via indicateOptimisticFixpoint) that they reached a fixpoint and can use the assumed value as known from now on. That is why you can see the "odd behavior" in a different objects updateImpl.

Added tests that are FIXMEs.

Test for instruction that should propagate undef.
Test for uninitialized value.

In D71799#1794768, @uenoku wrote:

In D71799#1794762, @baziotis wrote:

I see the message. Sorry btw that I don't know exactly how AAValueSimplify works. When I started this patch, I assumed it was in everyone's best interest
to not spend time in it right now, so I'm guessing from looking small pieces of its code.

Sorry for my lack of words. I thought you were talking about in updateImpl. I think it can't happen in updates but can happen once reaches to a fix point.

FWIW, I think @uenoku comment is correct here but it might help to elaborate:

The way AAValueSimplify is build ensures that only once a fixpoint is reached the simplified value is "known". One could also say, once a simplified value is known we know a fixpoint had to be reached.
Now the interesting part here is that this means a fixpoint for the AAValueSimplify object. The Attributor will determine fixpoints for attributes eagerly, thus even if others are not there yet and still iterating. It will even inform them (via indicateOptimisticFixpoint) that they reached a fixpoint and can use the assumed value as known from now on. That is why you can see the "odd behavior" in a different objects updateImpl.

Yes, thanks, it helped (also @uenoku helped a lot). But I still don't get why the following happens:

For some reason, for branches where undef is actually written (i.e. br i1 undef, ...), the AAValueSimplify doesn't tell us it is known. Thus, before we even start querying AAValueSimplify, we check the standard LLVM Value if it is undef.

It seems to me that for constants or undef, this should be known. Looking at initialize() of AAValueSimplifyFloating, for constants and undef, it indicates a pessimistic fixpoint and I don't understand why.

In D71799#1795477, @baziotis wrote:

It seems to me that for constants or undef, this should be known. Looking at initialize() of AAValueSimplifyFloating, for constants and undef, it indicates a pessimistic fixpoint and I don't understand why.

At first concept of AAValueSimplify, I set the semantics of optimistic state as that the value is simplified actually. So if the state is pessimistic, getAssumedSimplifiedValue returns the original associated value.
(https://reviews.llvm.org/D66967#1659664)

Base on that, if the associated value is constant or undef, I fixed the state as pessimistic because we can't simplify the value more.
But now it seems it's more useful to reach an optimistic fixpoint when the value is constant or undef, so I can agree to change it(D71852).

uenoku mentioned this in D71852: [Attributor] Reach optimistic fixpoint in AAValueSimplify when the value is constant or undef.Dec 24 2019, 4:03 AM

In D71799#1795492, @uenoku wrote:

In D71799#1795477, @baziotis wrote:

It seems to me that for constants or undef, this should be known. Looking at initialize() of AAValueSimplifyFloating, for constants and undef, it indicates a pessimistic fixpoint and I don't understand why.

At first concept of AAValueSimplify, I set the semantics of optimistic state as that the value is simplified actually. So if the state is pessimistic, getAssumedSimplifiedValue returns the original associated value.
(https://reviews.llvm.org/D66967#1659664)

Base on that, if the associated value is constant or undef, I fixed the state as pessimistic because we can't simplify the value more.

Aha, ok I got the idea.

But now it seems it's more useful to reach an optimistic fixpoint when the value is constant or undef, so I can agree to change it(D71852).

Great, thank you for the quick action! After reading the above again, maybe then we want to not break the conceptual idea of AAValueSimplify. Actually let me comment on the other revision.

In D71799#1795512, @baziotis wrote:

In D71799#1795492, @uenoku wrote:

But now it seems it's more useful to reach an optimistic fixpoint when the value is constant or undef, so I can agree to change it(D71852).

Great, thank you for the quick action! After reading the above again, maybe then we want to not break the conceptual idea of AAValueSimplify. Actually let me comment on the other revision.

We can change things if it makes sense. What I would prefer to do wrt. AAValueSimplify is to get D68934 in. I'll put a new revision online as soon as I can, make the diff easier to read and test it some more. Though the version that is online should be very close to what it'll be.

I inlined remaining minor comments from my side. @uenoku you should finish the review and accept once your happy.

llvm/lib/Transforms/IPO/Attributor.cpp
2057	Just count, no need to check it against 0.
2086	`!XXX.size()` -> `XXX.empty()`

baziotis marked an inline comment as done.Dec 24 2019, 2:29 PM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
2057	Ok, I just did it because otherwise there's an implicit cast to `bool` and well.. whatever, let me change it. :P

Small changes.

@uenoku please update me if you want something changed. Also, after this diff I was planning to make another to use AAValueSimplify in the other instructions as well. If you can, update me whether it's better to do it here.

Oh, also whether I should wait for https://reviews.llvm.org/D71852 to be committed.

In D71799#1795734, @baziotis wrote:

Small changes.

@uenoku please update me if you want something changed. Also, after this diff I was planning to make another to use AAValueSimplify in the other instructions as well. If you can, update me whether it's better to do it here.

Oh, also whether I should wait for https://reviews.llvm.org/D71852 to be committed.

I'd wait wrt. AAValueSimplify.
I'd do the following two first:

UnreachableInst is UB (though no need to replace it in the manifest.
Given an instruction, determine if that instruction or one later that is known to be executed is known to cause UB. We then hook that up to the AAIsDead.

uenoku mentioned this in rG1d5d074aef2a: [Attributor] Reach optimistic fixpoint in AAValueSimplify when the value is….Dec 24 2019, 9:22 PM

uenoku added inline comments.Dec 24 2019, 9:27 PM

llvm/lib/Transforms/IPO/Attributor.cpp
2050	Why don't we need to check for `KnownUBInsts`?

In D71799#1795803, @jdoerfert wrote:

I'd wait wrt. AAValueSimplify.

I guess you meant "wait to be committed" (and not wait as to not use it in the memory accessing functions yet).

I'd do the following two first:

UnreachableInst is UB (though no need to replace it in the manifest.

Given an instruction, determine if that instruction or one later that is known to be executed is known to cause UB. We then hook that up to the AAIsDead.

Regarding 2), would it suffice to: Go through the next instructions and follow (alive) branches either by taking unconditional branches, or branches that have known value (e.g. using AAValueSimplify) true.
Note: Walk them in a DFS kind of way until we find a UB instruction (or none at all).

llvm/lib/Transforms/IPO/Attributor.cpp
2050	Oh yes, I've forgotten about that. I should have updated it when we ended up in using only known parts. So, the correctness of this procedure is described in the comment of `KnownNoUBInsts`. Since the size `KnownUBInsts` is also monotonically increasing and bounded, then the "sum" of these 2 "functions" is also monotonically increasing and bounded. Hence, we can (and should) include that (part of this reasoning was why inserting and removing from the set was going to give us problems). Probably you have thought of all that but just to be sure we're on the same page. :)

uenoku added inline comments.Dec 25 2019, 3:40 AM

llvm/lib/Transforms/IPO/Attributor.cpp
2050	Ok, thanks.
2125–2126	Could you add a comment here to say that instruction in `NoUBInst` might cause UB?

LGTM otherwise

This revision is now accepted and ready to land.Dec 25 2019, 3:43 AM

In D71799#1795979, @uenoku wrote:

LGTM otherwise

Great! Your change on AAValueSimplify was committed along with my previous revisions, so let me do an updated diff based on that (and also address the comments).
I think it's better to do Johannes' suggestions on a different revision.

llvm/lib/Transforms/IPO/Attributor.cpp
2125–2126	Of course.

In D71799#1795961, @baziotis wrote:

In D71799#1795803, @jdoerfert wrote:

I'd wait wrt. AAValueSimplify.

I guess you meant "wait to be committed" (and not wait as to not use it in the memory accessing functions yet).

Yes. Also wrt. modifying AAVAlueSimply further. Using it is not a problem.

I'd do the following two first:

UnreachableInst is UB (though no need to replace it in the manifest.

Given an instruction, determine if that instruction or one later that is known to be executed is known to cause UB. We then hook that up to the AAIsDead.

Regarding 2), would it suffice to: Go through the next instructions and follow (alive) branches either by taking unconditional branches, or branches that have known value (e.g. using AAValueSimplify) true.
Note: Walk them in a DFS kind of way until we find a UB instruction (or none at all).

You want to use MustBeExecutedContextExplorer. You get it like this:

MustBeExecutedContextExplorer &Explorer =
    A.getInfoCache().getMustBeExecutedContextExplorer();

You iterate over it like you would with a container like this:

for (MustBeExecutedIterator &It : Explorer.range(Instruction))

There was a patch by @uenoku to deal with conditional and the merging of states when we are exploring but I don't see it in-tree and I forgot which one it was.
Nevertheless, it should immediately work for straight line code and code that is in "some merge block" after a conditional.

In D71799#1796062, @baziotis wrote:

In D71799#1795979, @uenoku wrote:

LGTM otherwise

Great! Your change on AAValueSimplify was committed along with my previous revisions, so let me do an updated diff based on that (and also address the comments).

Assuming I understand you correctly, you should.

I think it's better to do Johannes' suggestions on a different revision.

Yes.

@uenoku Can you commit this for @baziotis? (https://llvm.org/docs/DeveloperPolicy.html#commit-messages)

In D71799#1796066, @jdoerfert wrote:

In D71799#1795961, @baziotis wrote:

In D71799#1795803, @jdoerfert wrote:

I'd wait wrt. AAValueSimplify.

I guess you meant "wait to be committed" (and not wait as to not use it in the memory accessing functions yet).

Yes. Also wrt. modifying AAVAlueSimply further. Using it is not a problem.

Ok, noted.

I'd do the following two first:

UnreachableInst is UB (though no need to replace it in the manifest.

Given an instruction, determine if that instruction or one later that is known to be executed is known to cause UB. We then hook that up to the AAIsDead.

Regarding 2), would it suffice to: Go through the next instructions and follow (alive) branches either by taking unconditional branches, or branches that have known value (e.g. using AAValueSimplify) true.
Note: Walk them in a DFS kind of way until we find a UB instruction (or none at all).

You want to use MustBeExecutedContextExplorer. You get it like this:
MustBeExecutedContextExplorer &Explorer =
    A.getInfoCache().getMustBeExecutedContextExplorer();
You iterate over it like you would with a container like this:
for (MustBeExecutedIterator &It : Explorer.range(Instruction))
There was a patch by @uenoku to deal with conditional and the merging of states when we are exploring but I don't see it in-tree and I forgot which one it was.
Nevertheless, it should immediately work for straight line code and code that is in "some merge block" after a conditional.

Aha, ok, thank you! I'll try it in the next revision.

@uenoku Can you commit this for @baziotis? (https://llvm.org/docs/DeveloperPolicy.html#commit-messages)

Sorry for being late, I would like to have already updated a diff so that @uenoku can commit it but I had some problems with compilation of LLVM when I pulled the last changes.

In D71799#1796066, @jdoerfert wrote:

There was a patch by @uenoku to deal with conditional and the merging of states when we are exploring but I don't see it in-tree and I forgot which one it was.

It was D65593. I had forgotten too:). It is a good opportunity to rebase. I'll work on.
The idea is that if we know there is UB in both branches, we can say there is UB regardless of a condition value.

@uenoku Can you commit this for @baziotis? (https://llvm.org/docs/DeveloperPolicy.html#commit-messages)

Sure.

baziotis marked an inline comment as done.Dec 25 2019, 10:10 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
2125–2126	I was thinking about it and it seems to me that having this set is kind of misleading (both the naming and the conceptual idea around it). That is, it doesn't seem that there's a way (at least for now) to know for sure that an instruction is not UB, unless we have a constant. So, I'm proposing to change the conceptual idea (and the naming) like this. An instruction can be in 3 categories: Known to cause UB (`AAUndefinedBehavior` could prove it) - make an actual set for it. Known to not cause UB (`AAUndefinedBehavior` could prove it because of a constant). Here would go only branch instructions, which could have a constant condition and not be UB. Memory accessing instructions can't AFAIK because if they have a constant, it will either be null (so UB) or undef (so, UB). Make another set for those (so that we don't re-process them in every update) Assumed to cause UB. Basically, every other instruction. What we have now is sort of this scheme, but the `KnownNoUBInsts` are not actually known to not be UB as you mentioned and hence I think this makes the understanding of the code difficult. (For reference, and you may as well skip that since this comment is already big, I think the current scheme is something like: An instruction can be: Known to cause UB (`AAUndefinedBehavior` could prove it). Assumed to not cause UB. `AAUndefinedBehavior` could _not_ prove it but still it optimistically assumes it doesn't cause UB (which is like "what??" since `AAUndefinedBehavior` is supposed to optimistically assume for UB). ...) Anyway, looking forward to your opinion and sorry for the big comment (on an already accepted revision).

jdoerfert added inline comments.Dec 25 2019, 4:00 PM

llvm/lib/Transforms/IPO/Attributor.cpp
2125–2126	The two interesting categories are: Known to cause UB (and we proved it) Assumed to cause UB (every updateImpl invocation we found a reason to assume it. This is probably not what this Attribute does but it should eventually). The third category which is tracked so we don't revisit instructions that do not fall into the first two is: Not assumed to cause UB. We failed to argue it causes UB in an updateImpl invocation. This includes things that cannot cause UB! (We track things that do cause UB not the other way around.)

baziotis marked an inline comment as done.Dec 25 2019, 4:20 PM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
2125–2126	Ok, got it, thanks! FWIW, I didn't propose the "known no UB" set because I think it is a good idea to track "no UB" instructions. Rather, that schemeis overoptimistic because for every instruction that we could not prove it is no UB, it assumes it is UB. Yours is clearly better. Meaning we should have a reason to assume an instruction is UB. I'll update it asap.

Clarification on the uses of the 2 sets.

I was trying to get AAValueSimplify to work on the memory accessing instructions as well but at some point along the way
I ran the whole Attributor suite. Important: Unfortunately, it seems that this patch breaks 3 other test cases. Specifically, the changeToUnreachable part. I couldn't
understand why, I'll come back tomorrow. Please feel free to propose ideas (it seems that AAIsDead has problems with it).

So, I tried to make a reduced test case that fails:

define void @fails() {
entry:
  br i1 undef, label %end, label %end

end:
  %phi = phi i32* [ null, %entry ], [ null, %entry ]
  %a = load i32, i32* %phi, align 4
  ret void
}

It's based on the test case IPConstantProp/PR26044.ll, which also fails. The interesting things are:

If we remove the load, it doesn't fail.
If we remove the phi it doesn't fail (that's also true for PR26044.ll).
As noted yesterday, if we remove the changeToUnreachable in AAUndefinedBehavior::manifest(), it doesn't fail (all the test cases basically fail because of this part).
Also, that AAIsDead seems to have a problem with it.

My guess is that the fact that we change a branch instruction to unreachable means that there's one less predecessor for the then and else blocks of the branch.
If one of these 2 hasn't been converted to unreachable and contains a phi, then we have problems as now the BB has one less predecessor than those listed in the phi.

In D71799#1796601, @baziotis wrote:
So, I tried to make a reduced test case that fails:
define void @fails() {
entry:
  br i1 undef, label %end, label %end

end:
  %phi = phi i32* [ null, %entry ], [ null, %entry ]
  %a = load i32, i32* %phi, align 4
  ret void
}
It's based on the test case IPConstantProp/PR26044.ll, which also fails. The interesting things are:

If we remove the load, it doesn't fail.

If we remove the phi it doesn't fail (that's also true for PR26044.ll).

As noted yesterday, if we remove the changeToUnreachable in AAUndefinedBehavior::manifest(), it doesn't fail (all the test cases basically fail because of this part).

Also, that AAIsDead seems to have a problem with it.

My guess is that the fact that we change a branch instruction to unreachable means that there's one less predecessor for the then and else blocks of the branch.
If one of these 2 hasn't been converted to unreachable and contains a phi, then we have problems as now the BB has one less predecessor than those listed in the phi.

I think the problem here is that you are calling changeToUnrechable in manifest. This might cause unpredictable errors.
So you should cache instructions to be changed to unreachable and call changeToUnrechable after manifest(see below comment).
I tested this way locally and the error has been removed.

llvm/lib/Transforms/IPO/Attributor.cpp
5569–5570	Here

I create a patch(D71910) for this problem then with that patch, you can use like A.changeToUnreachableAfterManifest(I).

I think the problem here is that you are calling changeToUnrechable in manifest. This might cause unpredictable errors.

Thanks, I hadn't seen how manifest() fits into the big picture.

So you should cache instructions to be changed to unreachable and call changeToUnrechable after manifest(see below comment).
I tested this way locally and the error has been removed.
I create a patch(D71910) for this problem then with that patch, you can use like A.changeToUnreachableAfterManifest(I).

Much appreciated, thank you. I hadn't noticed and I wrote similar code thinking I was doing something wrong because I had to change parts outside AAUndefinedBehavior (because all the similar code changes uses not instructions).
I'll wait for it to be committed. Now the yesterday's code that uses AAValueSimplify on the memory accessing instructions should work.

Added one test to check propagation of null - it's not behaving as we'd like.
Abstracted the AAValueSimplify usage in AAUndefinedBehavior.

A couple of notes:

Now the same cases as before fail plus a couple more. But, the 3 cases that were failing before were crashing. Now they just give different result

which is expected. It probably is easy to change them.
The other cases however crash but because the number of iterations is not the specified. I may not be the most appropriate person to look into that.

I still don't understand why getPointerOperand() returns null on volatile instructions (although I have guess it is to prevent further processing). Is it correct what I do?

In D71799#1796829, @baziotis wrote:

I still don't understand why getPointerOperand() returns null on volatile instructions (although I have guess it is to prevent further processing). Is it correct what I do?

getPointerOperand was added as a helper for dereferenceable(volatile store/load doesn't imply dereferenceable). And you can change it if you want.

I'd say I'm not sure whether volatile store/load for undef is UB.

In D71799#1797070, @uenoku wrote:

In D71799#1796829, @baziotis wrote:

I still don't understand why getPointerOperand() returns null on volatile instructions (although I have guess it is to prevent further processing). Is it correct what I do?

getPointerOperand was added as a helper for dereferenceable(volatile store/load doesn't imply dereferenceable). And you can change it if you want.

Aha ok, thanks.

I'd say I'm not sure whether volatile store/load for undef is UB.

Well, if we go by the book, which would be the LLVM IR ref manual, and optimize aggressively for UB, undef can be considered to have any bit pattern.
And we can choose it to have the null bit pattern, which is UB for both volatile and non-volatile.

As another note (and related to the diff update message), I realize that it's difficult for both of us to try and correct 16 test cases that currently fail in this revision.
I think it's better to remove the AAValueSimplify in the memory accessing instructions. That will make the failing test-cases only 3. I could then try to fix them
and it should be easier for you to review as well. What do you think?

Well, if we go by the book, which would be the LLVM IR ref manual, and optimize aggressively for UB, undef can be considered to have any bit pattern.
And we can choose it to have the null bit pattern, which is UB for both volatile and non-volatile.

Ok, thanks.

As another note (and related to the diff update message), I realize that it's difficult for both of us to try and correct 16 test cases that currently fail in this revision.
I think it's better to remove the AAValueSimplify in the memory accessing instructions. That will make the failing test-cases only 3. I could then try to fix them
and it should be easier for you to review as well.

Please split the patch.

Attributor::getPointerOperand() and getPointerOperandOfNonVolatile().
Removed AAValueSimplify for memory accessing instructions.
Updated test cases.

Notes:

As it seems, there are multiple instances of getPointerOperand() across LLVM. We should probably be careful and not name a function

like this, hence I put getPointerOperand() as a static method of Attributor. You may want to check this, although it's somewhat old and probably outdated.

@uenoku I updated the test cases according to what I thought they tried to test. Please verify that they're correct because I may very well have misinterpreted.

In D71799#1797725, @baziotis wrote:

Attributor::getPointerOperand() and getPointerOperandOfNonVolatile().

Removed AAValueSimplify for memory accessing instructions.

Updated test cases.

Notes:

As it seems, there are multiple instances of getPointerOperand() across LLVM. We should probably be careful and not name a function

like this, hence I put getPointerOperand() as a static method of Attributor. You may want to check this, although it's somewhat old and probably outdated.

@uenoku I updated the test cases according to what I thought they tried to test. Please verify that they're correct because I may very well have misinterpreted.

Regarding 1, please change to like getPointerOperand(Instruction *I, bool AllowVolatile) and merge getPointerOperandOfNonVolatile into it.

llvm/lib/Transforms/IPO/Attributor.cpp
1994	Please assert with string.

Addressed comments

LGTM again, thank you! I'll commit.

In D71799#1797898, @uenoku wrote:

LGTM again, thank you! I'll commit.

Thank you mate, I'll continue on a new patch.

baziotis edited the summary of this revision. (Show Details)Dec 28 2019, 7:13 AM

Closed by commit rGef4febd85b54: [Attributor] AAUndefinedBehavior: Check for branches on undef value. (authored by uenoku). · Explain WhyDec 29 2019, 12:51 AM

This revision was automatically updated to reflect the committed changes.

jdoerfert mentioned this in D71960: [Attributor] AAUndefinedBehavior: Use AAValueSimplify in memory accessing instructions..Dec 29 2019, 8:32 AM

Revision Contents

Path

Size

llvm/

include/

llvm/

Transforms/

IPO/

Attributor.h

35 lines

lib/

Transforms/

IPO/

Attributor.cpp

192 lines

test/

Transforms/

Attributor/

IPConstantProp/

PR26044.ll

14 lines

fp-bc-icmp-const-fold.ll

2 lines

solve-after-each-resolving-undefs-for-function.ll

2 lines

undefined_behavior.ll

153 lines

Diff 235503

llvm/include/llvm/Transforms/IPO/Attributor.h

Show First 20 Lines • Show All 817 Lines • ▼ Show 20 Lines	if (V && (V->stripPointerCasts() == NV.stripPointerCasts() \|\|
isa_and_nonnull<UndefValue>(V)))		isa_and_nonnull<UndefValue>(V)))
return false;		return false;
assert((!V \|\| V == &NV \|\| isa<UndefValue>(NV)) &&		assert((!V \|\| V == &NV \|\| isa<UndefValue>(NV)) &&
"Use was registered twice for replacement with different values!");		"Use was registered twice for replacement with different values!");
V = &NV;		V = &NV;
return true;		return true;
}		}

		/// Get pointer operand of memory accessing instruction. If \p I is
		/// not a memory accessing instruction, return nullptr. If \p AllowVolatile,
		/// is set to false and the instruction is volatile, return nullptr.
		static const Value getPointerOperand(const Instruction I,
		bool AllowVolatile) {
		if (auto *LI = dyn_cast<LoadInst>(I)) {
		if (!AllowVolatile && LI->isVolatile())
		return nullptr;
		return LI->getPointerOperand();
		}

		if (auto *SI = dyn_cast<StoreInst>(I)) {
		if (!AllowVolatile && SI->isVolatile())
		return nullptr;
		return SI->getPointerOperand();
		}

		if (auto *CXI = dyn_cast<AtomicCmpXchgInst>(I)) {
		if (!AllowVolatile && CXI->isVolatile())
		return nullptr;
		return CXI->getPointerOperand();
		}

		if (auto *RMWI = dyn_cast<AtomicRMWInst>(I)) {
		if (!AllowVolatile && RMWI->isVolatile())
		return nullptr;
		return RMWI->getPointerOperand();
		}

		return nullptr;
		}

/// Record that \p I is to be replaced with `unreachable` after information		/// Record that \p I is to be replaced with `unreachable` after information
/// was manifested.		/// was manifested.
void changeToUnreachableAfterManifest(Instruction *I) {		void changeToUnreachableAfterManifest(Instruction *I) {
ToBeChangedToUnreachableInsts.insert(I);		ToBeChangedToUnreachableInsts.insert(I);
}		}

/// Record that \p I is deleted after information was manifested. This also		/// Record that \p I is deleted after information was manifested. This also
/// triggers deletion of trivially dead istructions.		/// triggers deletion of trivially dead istructions.
▲ Show 20 Lines • Show All 871 Lines • ▼ Show 20 Lines	struct AAUndefinedBehavior
bool isAssumedToCauseUB() const { return getAssumed(); }		bool isAssumedToCauseUB() const { return getAssumed(); }

/// Return true if "undefined behavior" is assumed for a specific instruction.		/// Return true if "undefined behavior" is assumed for a specific instruction.
virtual bool isAssumedToCauseUB(Instruction *I) const = 0;		virtual bool isAssumedToCauseUB(Instruction *I) const = 0;

/// Return true if "undefined behavior" is known.		/// Return true if "undefined behavior" is known.
bool isKnownToCauseUB() const { return getKnown(); }		bool isKnownToCauseUB() const { return getKnown(); }

		/// Return true if "undefined behavior" is known for a specific instruction.
		virtual bool isKnownToCauseUB(Instruction *I) const = 0;

/// Return an IR position, see struct IRPosition.		/// Return an IR position, see struct IRPosition.
const IRPosition &getIRPosition() const override { return *this; }		const IRPosition &getIRPosition() const override { return *this; }

/// Create an abstract attribute view for the position \p IRP.		/// Create an abstract attribute view for the position \p IRP.
static AAUndefinedBehavior &createForPosition(const IRPosition &IRP,		static AAUndefinedBehavior &createForPosition(const IRPosition &IRP,
Attributor &A);		Attributor &A);

/// Unique ID (due to the unique address)		/// Unique ID (due to the unique address)
▲ Show 20 Lines • Show All 467 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/Attributor.cpp

Show First 20 Lines • Show All 326 Lines • ▼ Show 20 Lines	if (Attrs.hasAttribute(AttrIdx, Kind))
return false;		return false;
Attrs = Attrs.removeAttribute(Ctx, AttrIdx, Kind);		Attrs = Attrs.removeAttribute(Ctx, AttrIdx, Kind);
Attrs = Attrs.addAttribute(Ctx, AttrIdx, Attr);		Attrs = Attrs.addAttribute(Ctx, AttrIdx, Attr);
return true;		return true;
}		}

llvm_unreachable("Expected enum or string attribute!");		llvm_unreachable("Expected enum or string attribute!");
}		}
static const Value getPointerOperand(const Instruction I) {
if (auto *LI = dyn_cast<LoadInst>(I))
if (!LI->isVolatile())
return LI->getPointerOperand();

if (auto *SI = dyn_cast<StoreInst>(I))
if (!SI->isVolatile())
return SI->getPointerOperand();

if (auto *CXI = dyn_cast<AtomicCmpXchgInst>(I))
if (!CXI->isVolatile())
return CXI->getPointerOperand();

if (auto *RMWI = dyn_cast<AtomicRMWInst>(I))
if (!RMWI->isVolatile())
return RMWI->getPointerOperand();

return nullptr;
}
static const Value *		static const Value *
getBasePointerOfAccessPointerOperand(const Instruction *I, int64_t &BytesOffset,		getBasePointerOfAccessPointerOperand(const Instruction *I, int64_t &BytesOffset,
const DataLayout &DL,		const DataLayout &DL,
bool AllowNonInbounds = false) {		bool AllowNonInbounds = false) {
const Value *Ptr = getPointerOperand(I);		const Value *Ptr =
		Attributor::getPointerOperand(I, /* AllowVolatile */ false);
if (!Ptr)		if (!Ptr)
return nullptr;		return nullptr;

return GetPointerBaseWithConstantOffset(Ptr, BytesOffset, DL,		return GetPointerBaseWithConstantOffset(Ptr, BytesOffset, DL,
AllowNonInbounds);		AllowNonInbounds);
}		}

ChangeStatus AbstractAttribute::update(Attributor &A) {		ChangeStatus AbstractAttribute::update(Attributor &A) {
▲ Show 20 Lines • Show All 1,362 Lines • ▼ Show 20 Lines	static int64_t getKnownNonNullAndDerefBytesForUse(
if (auto *GEP = dyn_cast<GetElementPtrInst>(I))		if (auto *GEP = dyn_cast<GetElementPtrInst>(I))
if (GEP->hasAllConstantIndices()) {		if (GEP->hasAllConstantIndices()) {
TrackUse = true;		TrackUse = true;
return 0;		return 0;
}		}

int64_t Offset;		int64_t Offset;
if (const Value *Base = getBasePointerOfAccessPointerOperand(I, Offset, DL)) {		if (const Value *Base = getBasePointerOfAccessPointerOperand(I, Offset, DL)) {
if (Base == &AssociatedValue && getPointerOperand(I) == UseV) {		if (Base == &AssociatedValue &&
		Attributor::getPointerOperand(I, /* AllowVolatile */ false) == UseV) {
int64_t DerefBytes =		int64_t DerefBytes =
(int64_t)DL.getTypeStoreSize(PtrTy->getPointerElementType()) + Offset;		(int64_t)DL.getTypeStoreSize(PtrTy->getPointerElementType()) + Offset;

IsNonNull \|= !NullPointerIsDefined;		IsNonNull \|= !NullPointerIsDefined;
return std::max(int64_t(0), DerefBytes);		return std::max(int64_t(0), DerefBytes);
}		}
}		}

/// Corner case when an offset is 0.		/// Corner case when an offset is 0.
if (const Value *Base = getBasePointerOfAccessPointerOperand(		if (const Value *Base = getBasePointerOfAccessPointerOperand(
I, Offset, DL, /AllowNonInbounds/ true)) {		I, Offset, DL, /AllowNonInbounds/ true)) {
if (Offset == 0 && Base == &AssociatedValue &&		if (Offset == 0 && Base == &AssociatedValue &&
getPointerOperand(I) == UseV) {		Attributor::getPointerOperand(I, /* AllowVolatile */ false) == UseV) {
int64_t DerefBytes =		int64_t DerefBytes =
(int64_t)DL.getTypeStoreSize(PtrTy->getPointerElementType());		(int64_t)DL.getTypeStoreSize(PtrTy->getPointerElementType());
IsNonNull \|= !NullPointerIsDefined;		IsNonNull \|= !NullPointerIsDefined;
return std::max(int64_t(0), DerefBytes);		return std::max(int64_t(0), DerefBytes);
}		}
}		}
if (const Value *Base =		if (const Value *Base =
GetPointerBaseWithConstantOffset(UseV, Offset, DL,		GetPointerBaseWithConstantOffset(UseV, Offset, DL,
▲ Show 20 Lines • Show All 229 Lines • ▼ Show 20 Lines
};		};

/// -------------------- Undefined-Behavior Attributes ------------------------		/// -------------------- Undefined-Behavior Attributes ------------------------

struct AAUndefinedBehaviorImpl : public AAUndefinedBehavior {		struct AAUndefinedBehaviorImpl : public AAUndefinedBehavior {
AAUndefinedBehaviorImpl(const IRPosition &IRP) : AAUndefinedBehavior(IRP) {}		AAUndefinedBehaviorImpl(const IRPosition &IRP) : AAUndefinedBehavior(IRP) {}

/// See AbstractAttribute::updateImpl(...).		/// See AbstractAttribute::updateImpl(...).
// TODO: We should not only check instructions that access memory
// through a pointer (i.e. also branches etc.)		// through a pointer (i.e. also branches etc.)
ChangeStatus updateImpl(Attributor &A) override {		ChangeStatus updateImpl(Attributor &A) override {
const size_t PrevSize = NoUBMemAccessInsts.size();		const size_t UBPrevSize = KnownUBInsts.size();
		const size_t NoUBPrevSize = AssumedNoUBInsts.size();

auto InspectMemAccessInstForUB = [&](Instruction &I) {		auto InspectMemAccessInstForUB = [&](Instruction &I) {
// Skip instructions that are already saved.		// Skip instructions that are already saved.
if (NoUBMemAccessInsts.count(&I) \|\| UBMemAccessInsts.count(&I))		if (AssumedNoUBInsts.count(&I) \|\| KnownUBInsts.count(&I))
return true;		return true;

// `InspectMemAccessInstForUB` is only called on instructions		// If we reach here, we know we have an instruction
// for which getPointerOperand() should give us their		// that accesses memory through a pointer operand,
// pointer operand unless they're volatile.		// for which getPointerOperand() should give it to us.
const Value *PtrOp = getPointerOperand(&I);		const Value *PtrOp =
if (!PtrOp)		Attributor::getPointerOperand(&I, /* AllowVolatile */ true);
		uenokuUnsubmitted Not Done Reply Inline Actions Please assert with string. uenoku: Please assert with string.
return true;		assert(PtrOp &&
		"Expected pointer operand of memory accessing instruction");

// A memory access through a pointer is considered UB		// A memory access through a pointer is considered UB
// only if the pointer has constant null value.		// only if the pointer has constant null value.
// TODO: Expand it to not only check constant values.		// TODO: Expand it to not only check constant values.
if (!isa<ConstantPointerNull>(PtrOp)) {		if (!isa<ConstantPointerNull>(PtrOp)) {
NoUBMemAccessInsts.insert(&I);		AssumedNoUBInsts.insert(&I);
return true;		return true;
}		}
const Type *PtrTy = PtrOp->getType();		const Type *PtrTy = PtrOp->getType();

// Because we only consider instructions inside functions,		// Because we only consider instructions inside functions,
// assume that a parent function exists.		// assume that a parent function exists.
const Function *F = I.getFunction();		const Function *F = I.getFunction();

// A memory access using constant null pointer is only considered UB		// A memory access using constant null pointer is only considered UB
// if null pointer is _not_ defined for the target platform.		// if null pointer is _not_ defined for the target platform.
if (!llvm::NullPointerIsDefined(F, PtrTy->getPointerAddressSpace()))		if (llvm::NullPointerIsDefined(F, PtrTy->getPointerAddressSpace()))
UBMemAccessInsts.insert(&I);		AssumedNoUBInsts.insert(&I);
else		else
NoUBMemAccessInsts.insert(&I);		KnownUBInsts.insert(&I);
		return true;
		};

		auto InspectBrInstForUB = [&](Instruction &I) {
		// A conditional branch instruction is considered UB if it has `undef`
		// condition.

		// Skip instructions that are already saved.
		if (AssumedNoUBInsts.count(&I) \|\| KnownUBInsts.count(&I))
		return true;

		// We know we have a branch instruction.
		auto BrInst = cast<BranchInst>(&I);

		// Unconditional branches are never considered UB.
		if (BrInst->isUnconditional())
		return true;

		// Either we stopped and the appropriate action was taken,
		// or we got back a simplified value to continue.
		Optional<Value *> SimplifiedCond =
		stopOnUndefOrAssumed(A, BrInst->getCondition(), BrInst);
		if (!SimplifiedCond.hasValue())
		baziotisAuthorUnsubmitted Done Reply Inline Actions Note that here it's possibly wrong and I forgot to comment yesterday. I didn't know exactly how to do it but here's the problem. If it has a value and it is not undef, then it's not UB -> OK If it has a value and it's undef, then it is UB -> OK But... If it doesn't have a value, we consider it not UB. Well, I'm not familiar with the internals of `AAValueSimplify`, but looking comments around, there were some like "No value _yet_". Which means that right now we may not have a value but we could in the future. And that value may be undef. This is no problem for this patch as it tries to handle cases where undef is caught in (hasValue && isa<Undef>). But eventually, `AAValueSimplify` could uncover things for us here and we may lose them because we put the instruction to `NoUBInsts`. baziotis: Note that here it's possibly wrong and I forgot to comment yesterday. I didn't know exactly how…
		return true;
		AssumedNoUBInsts.insert(&I);
return true;		return true;
};		};

A.checkForAllInstructions(InspectMemAccessInstForUB, *this,		A.checkForAllInstructions(InspectMemAccessInstForUB, *this,
{Instruction::Load, Instruction::Store,		{Instruction::Load, Instruction::Store,
Instruction::AtomicCmpXchg,		Instruction::AtomicCmpXchg,
Instruction::AtomicRMW});		Instruction::AtomicRMW});
if (PrevSize != NoUBMemAccessInsts.size())		A.checkForAllInstructions(InspectBrInstForUB, *this, {Instruction::Br});
		if (NoUBPrevSize != AssumedNoUBInsts.size() \|\|
		uenokuUnsubmitted Not Done Reply Inline Actions Why don't we need to check for `KnownUBInsts`? uenoku: Why don't we need to check for `KnownUBInsts`?
		baziotisAuthorUnsubmitted Done Reply Inline Actions Oh yes, I've forgotten about that. I should have updated it when we ended up in using only known parts. So, the correctness of this procedure is described in the comment of `KnownNoUBInsts`. Since the size `KnownUBInsts` is also monotonically increasing and bounded, then the "sum" of these 2 "functions" is also monotonically increasing and bounded. Hence, we can (and should) include that (part of this reasoning was why inserting and removing from the set was going to give us problems). Probably you have thought of all that but just to be sure we're on the same page. :) baziotis: Oh yes, I've forgotten about that. I should have updated it when we ended up in using only…
		uenokuUnsubmitted Not Done Reply Inline Actions Ok, thanks. uenoku: Ok, thanks.
		UBPrevSize != KnownUBInsts.size())
		jdoerfertUnsubmitted Not Done Reply Inline Actions Split it in two calls since the pointer stuff and the control flow stuff (for branch, switch, ...) is conceptually different. jdoerfert: Split it in two calls since the pointer stuff and the control flow stuff (for branch, switch, ..
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
return ChangeStatus::UNCHANGED;		return ChangeStatus::UNCHANGED;
}		}

		bool isKnownToCauseUB(Instruction *I) const override {
		return KnownUBInsts.count(I);
		jdoerfertUnsubmitted Not Done Reply Inline Actions Just count, no need to check it against 0. jdoerfert: Just count, no need to check it against 0.
		baziotisAuthorUnsubmitted Done Reply Inline Actions Ok, I just did it because otherwise there's an implicit cast to `bool` and well.. whatever, let me change it. :P baziotis: Ok, I just did it because otherwise there's an implicit cast to `bool` and well.. whatever, let…
		}

bool isAssumedToCauseUB(Instruction *I) const override {		bool isAssumedToCauseUB(Instruction *I) const override {
return UBMemAccessInsts.count(I);		// In simple words, if an instruction is not in the assumed to _not_
		// cause UB, then it is assumed UB (that includes those
		// in the KnownUBInsts set). The rest is boilerplate
		// is to ensure that it is one of the instructions we test
		// for UB.

		switch (I->getOpcode()) {
		case Instruction::Load:
		case Instruction::Store:
		case Instruction::AtomicCmpXchg:
		case Instruction::AtomicRMW:
		return !AssumedNoUBInsts.count(I);
		case Instruction::Br: {
		auto BrInst = cast<BranchInst>(I);
		if (BrInst->isUnconditional())
		return false;
		return !AssumedNoUBInsts.count(I);
		} break;
		default:
		return false;
		}
		return false;
}		}

ChangeStatus manifest(Attributor &A) override {		ChangeStatus manifest(Attributor &A) override {
if (!UBMemAccessInsts.size())		if (KnownUBInsts.empty())
		jdoerfertUnsubmitted Not Done Reply Inline Actions `!XXX.size()` -> `XXX.empty()` jdoerfert: `!XXX.size()` -> `XXX.empty()`
return ChangeStatus::UNCHANGED;		return ChangeStatus::UNCHANGED;
for (Instruction *I : UBMemAccessInsts)		for (Instruction *I : KnownUBInsts)
A.changeToUnreachableAfterManifest(I);		A.changeToUnreachableAfterManifest(I);
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
}		}

/// See AbstractAttribute::getAsStr()		/// See AbstractAttribute::getAsStr()
const std::string getAsStr() const override {		const std::string getAsStr() const override {
return getAssumed() ? "undefined-behavior" : "no-ub";		return getAssumed() ? "undefined-behavior" : "no-ub";
}		}

		/// Note: The correctness of this analysis depends on the fact that the
		/// following 2 sets will stop changing after some point.
		/// "Change" here means that their size changes.
		/// The size of each set is monotonically increasing
		/// (we only add items to them) and it is upper bounded by the number of
		/// instructions in the processed function (we can never save more
		/// elements in either set than this number). Hence, at some point,
		/// they will stop increasing.
		/// Consequently, at some point, both sets will have stopped
		/// changing, effectively making the analysis reach a fixpoint.

		/// Note: These 2 sets are disjoint and an instruction can be considered
		/// one of 3 things:
		/// 1) Known to cause UB (AAUndefinedBehavior could prove it) and put it in
		/// the KnownUBInsts set.
		/// 2) Assumed to cause UB (in every updateImpl, AAUndefinedBehavior
		/// has a reason to assume it).
		/// 3) Assumed to not cause UB. very other instruction - AAUndefinedBehavior
		/// could not find a reason to assume or prove that it can cause UB,
		/// hence it assumes it doesn't. We have a set for these instructions
		/// so that we don't reprocess them in every update.
		/// Note however that instructions in this set may cause UB.

protected:		protected:
// A set of all the (live) memory accessing instructions that _are_ assumed to		/// A set of all live instructions _known_ to cause UB.
// cause UB.		SmallPtrSet<Instruction *, 8> KnownUBInsts;
SmallPtrSet<Instruction *, 8> UBMemAccessInsts;

private:		private:
// A set of all the (live) memory accessing instructions		/// A set of all the (live) instructions that are assumed to _not_ cause UB.
		uenokuUnsubmitted Not Done Reply Inline Actions Could you add a comment here to say that instruction in `NoUBInst` might cause UB? uenoku: Could you add a comment here to say that instruction in `NoUBInst` might cause UB?
		baziotisAuthorUnsubmitted Done Reply Inline Actions Of course. baziotis: Of course.
		baziotisAuthorUnsubmitted Done Reply Inline Actions I was thinking about it and it seems to me that having this set is kind of misleading (both the naming and the conceptual idea around it). That is, it doesn't seem that there's a way (at least for now) to know for sure that an instruction is not UB, unless we have a constant. So, I'm proposing to change the conceptual idea (and the naming) like this. An instruction can be in 3 categories: Known to cause UB (`AAUndefinedBehavior` could prove it) - make an actual set for it. Known to not cause UB (`AAUndefinedBehavior` could prove it because of a constant). Here would go only branch instructions, which could have a constant condition and not be UB. Memory accessing instructions can't AFAIK because if they have a constant, it will either be null (so UB) or undef (so, UB). Make another set for those (so that we don't re-process them in every update) Assumed to cause UB. Basically, every other instruction. What we have now is sort of this scheme, but the `KnownNoUBInsts` are not actually known to not be UB as you mentioned and hence I think this makes the understanding of the code difficult. (For reference, and you may as well skip that since this comment is already big, I think the current scheme is something like: An instruction can be: Known to cause UB (`AAUndefinedBehavior` could prove it). Assumed to not cause UB. `AAUndefinedBehavior` could _not_ prove it but still it optimistically assumes it doesn't cause UB (which is like "what??" since `AAUndefinedBehavior` is supposed to optimistically assume for UB). ...) Anyway, looking forward to your opinion and sorry for the big comment (on an already accepted revision). baziotis: I was thinking about it and it seems to me that having this set is kind of misleading (both the…
		jdoerfertUnsubmitted Not Done Reply Inline Actions The two interesting categories are: Known to cause UB (and we proved it) Assumed to cause UB (every updateImpl invocation we found a reason to assume it. This is probably not what this Attribute does but it should eventually). The third category which is tracked so we don't revisit instructions that do not fall into the first two is: Not assumed to cause UB. We failed to argue it causes UB in an updateImpl invocation. This includes things that cannot cause UB! (We track things that do cause UB not the other way around.) jdoerfert: The two interesting categories are: 1) Known to cause UB (and we proved it) 2) Assumed to…
		baziotisAuthorUnsubmitted Done Reply Inline Actions Ok, got it, thanks! FWIW, I didn't propose the "known no UB" set because I think it is a good idea to track "no UB" instructions. Rather, that schemeis overoptimistic because for every instruction that we could not prove it is no UB, it assumes it is UB. Yours is clearly better. Meaning we should have a reason to assume an instruction is UB. I'll update it asap. baziotis: Ok, got it, thanks! FWIW, I didn't propose the "known no UB" set because I think it is a good…
// that are _not_ assumed to cause UB.		SmallPtrSet<Instruction *, 8> AssumedNoUBInsts;
// Note: The correctness of the procedure depends on the fact that this
// set stops changing after some point. "Change" here means that the size		// Should be called on updates in which if we're processing an instruction
// of the set changes. The size of this set is monotonically increasing		// \p I that depends on a value \p V, one of the following has to happen:
// (we only add items to it) and is upper bounded by the number of memory		// - If the value is assumed, then stop.
// accessing instructions in the processed function (we can never save more		// - If the value is known but undef, then consider it UB.
// elements in this set than this number). Hence, the size of this set, at		// - Otherwise, do specific processing with the simplified value.
// some point, will stop increasing, effectively reaching a fixpoint.		// We return None in the first 2 cases to signify that an appropriate
SmallPtrSet<Instruction *, 8> NoUBMemAccessInsts;		// action was taken and the caller should stop.
		// Otherwise, we return the simplified value that the caller should
		// use for specific processing.
		Optional<Value > stopOnUndefOrAssumed(Attributor &A, const Value V,
		Instruction *I) {
		const auto &ValueSimplifyAA =
		A.getAAFor<AAValueSimplify>(this, IRPosition::value(V));
		Optional<Value *> SimplifiedV =
		ValueSimplifyAA.getAssumedSimplifiedValue(A);
		if (!ValueSimplifyAA.isKnown()) {
		// Don't depend on assumed values.
		return llvm::None;
		}
		if (!SimplifiedV.hasValue()) {
		// If it is known (which we tested above) but it doesn't have a value,
		// then we can assume `undef` and hence the instruction is UB.
		KnownUBInsts.insert(I);
		return llvm::None;
		}
		Value *Val = SimplifiedV.getValue();
		if (isa<UndefValue>(Val)) {
		KnownUBInsts.insert(I);
		return llvm::None;
		}
		return Val;
		}
};		};

struct AAUndefinedBehaviorFunction final : AAUndefinedBehaviorImpl {		struct AAUndefinedBehaviorFunction final : AAUndefinedBehaviorImpl {
AAUndefinedBehaviorFunction(const IRPosition &IRP)		AAUndefinedBehaviorFunction(const IRPosition &IRP)
: AAUndefinedBehaviorImpl(IRP) {}		: AAUndefinedBehaviorImpl(IRP) {}

/// See AbstractAttribute::trackStatistics()		/// See AbstractAttribute::trackStatistics()
void trackStatistics() const override {		void trackStatistics() const override {
STATS_DECL(UndefinedBehaviorInstruction, Instruction,		STATS_DECL(UndefinedBehaviorInstruction, Instruction,
"Number of instructions known to have UB");		"Number of instructions known to have UB");
BUILD_STAT_NAME(UndefinedBehaviorInstruction, Instruction) +=		BUILD_STAT_NAME(UndefinedBehaviorInstruction, Instruction) +=
UBMemAccessInsts.size();		KnownUBInsts.size();
}		}
};		};

/// ------------------------ Will-Return Attributes ----------------------------		/// ------------------------ Will-Return Attributes ----------------------------

// Helper function that checks whether a function has any cycle.		// Helper function that checks whether a function has any cycle.
// TODO: Replace with more efficent code		// TODO: Replace with more efficent code
static bool containsCycle(Function &F) {		static bool containsCycle(Function &F) {
▲ Show 20 Lines • Show All 999 Lines • ▼ Show 20 Lines	void addAccessedBytesForUse(Attributor &A, const Use *U,
if (!UseV->getType()->isPointerTy())		if (!UseV->getType()->isPointerTy())
return;		return;

Type *PtrTy = UseV->getType();		Type *PtrTy = UseV->getType();
const DataLayout &DL = A.getDataLayout();		const DataLayout &DL = A.getDataLayout();
int64_t Offset;		int64_t Offset;
if (const Value *Base = getBasePointerOfAccessPointerOperand(		if (const Value *Base = getBasePointerOfAccessPointerOperand(
I, Offset, DL, /AllowNonInbounds/ true)) {		I, Offset, DL, /AllowNonInbounds/ true)) {
if (Base == &getAssociatedValue() && getPointerOperand(I) == UseV) {		if (Base == &getAssociatedValue() &&
		Attributor::getPointerOperand(I, /* AllowVolatile */ false) == UseV) {
uint64_t Size = DL.getTypeStoreSize(PtrTy->getPointerElementType());		uint64_t Size = DL.getTypeStoreSize(PtrTy->getPointerElementType());
addAccessedBytes(Offset, Size);		addAccessedBytes(Offset, Size);
}		}
}		}
return;		return;
}		}

/// See AAFromMustBeExecutedContext		/// See AAFromMustBeExecutedContext
▲ Show 20 Lines • Show All 2,363 Lines • ▼ Show 20 Lines	for (auto &It : ToBeChangedUses) {
Instruction *UserI = cast<Instruction>(U->getUser());		Instruction *UserI = cast<Instruction>(U->getUser());
if (isa<UndefValue>(NewV)) {		if (isa<UndefValue>(NewV)) {
ToBeChangedToUnreachableInsts.insert(UserI);		ToBeChangedToUnreachableInsts.insert(UserI);
} else {		} else {
TerminatorsToFold.push_back(UserI);		TerminatorsToFold.push_back(UserI);
}		}
}		}
}		}
for (Instruction *I : ToBeChangedToUnreachableInsts)		for (Instruction *I : ToBeChangedToUnreachableInsts)
changeToUnreachable(I, /* UseLLVMTrap */ false);		changeToUnreachable(I, /* UseLLVMTrap */ false);
		uenokuUnsubmitted Not Done Reply Inline Actions Here uenoku: Here
for (Instruction *I : TerminatorsToFold)		for (Instruction *I : TerminatorsToFold)
ConstantFoldTerminator(I->getParent());		ConstantFoldTerminator(I->getParent());

for (Instruction *I : ToBeDeletedInsts) {		for (Instruction *I : ToBeDeletedInsts) {
I->replaceAllUsesWith(UndefValue::get(I->getType()));		I->replaceAllUsesWith(UndefValue::get(I->getType()));
if (!isa<PHINode>(I) && isInstructionTriviallyDead(I))		if (!isa<PHINode>(I) && isInstructionTriviallyDead(I))
DeadInsts.push_back(I);		DeadInsts.push_back(I);
else		else
▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	case Instruction::Store:
// The alignment of a pointer is interesting for stores.		// The alignment of a pointer is interesting for stores.
case Instruction::Call:		case Instruction::Call:
case Instruction::CallBr:		case Instruction::CallBr:
case Instruction::Invoke:		case Instruction::Invoke:
case Instruction::CleanupRet:		case Instruction::CleanupRet:
case Instruction::CatchSwitch:		case Instruction::CatchSwitch:
case Instruction::AtomicRMW:		case Instruction::AtomicRMW:
case Instruction::AtomicCmpXchg:		case Instruction::AtomicCmpXchg:
		case Instruction::Br:
case Instruction::Resume:		case Instruction::Resume:
case Instruction::Ret:		case Instruction::Ret:
IsInterestingOpcode = true;		IsInterestingOpcode = true;
}		}
if (IsInterestingOpcode)		if (IsInterestingOpcode)
InstOpcodeMap[I.getOpcode()].push_back(&I);		InstOpcodeMap[I.getOpcode()].push_back(&I);
if (I.mayReadOrWriteMemory())		if (I.mayReadOrWriteMemory())
ReadOrWriteInsts.push_back(&I);		ReadOrWriteInsts.push_back(&I);
▲ Show 20 Lines • Show All 481 Lines • Show Last 20 Lines

llvm/test/Transforms/Attributor/IPConstantProp/PR26044.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes
	; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s \| FileCheck %s			; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s \| FileCheck %s
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define void @fn2(i32* %P) {			define void @fn2(i32* %P) {
	; CHECK-LABEL: define {{[^@]+}}@fn2			; CHECK-LABEL: define {{[^@]+}}@fn2
	; CHECK-SAME: (i32* nocapture nofree writeonly [[P:%.*]])			; CHECK-SAME: (i32* nocapture nofree writeonly [[P:%.*]])
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[IF_END:%.*]]			; CHECK-NEXT: br label [[IF_END:%.*]]
	; CHECK: for.cond1:			; CHECK: for.cond1:
	; CHECK-NEXT: br i1 undef, label [[IF_END]], label [[IF_END]]			; CHECK-NEXT: unreachable
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: [[E_2:%.]] = phi i32 [ undef, [[ENTRY:%.]] ], [ null, [[FOR_COND1:%.]] ], [ null, [[FOR_COND1]] ]			; CHECK-NEXT: [[TMP0:%.]] = load i32, i32 undef, align 4
	; CHECK-NEXT: [[TMP0:%.]] = load i32, i32 [[E_2]], align 4
	; CHECK-NEXT: [[CALL:%.*]] = call i32 @fn1(i32 [[TMP0]])			; CHECK-NEXT: [[CALL:%.*]] = call i32 @fn1(i32 [[TMP0]])
	; CHECK-NEXT: store i32 [[CALL]], i32* [[P]]			; CHECK-NEXT: store i32 [[CALL]], i32* [[P]]
	; CHECK-NEXT: br label [[FOR_COND1]]			; CHECK-NEXT: br label %for.cond1
	;			;
	entry:			entry:
	br label %if.end			br label %if.end

	for.cond1: ; preds = %if.end, %for.end			for.cond1: ; preds = %if.end, %for.end
	br i1 undef, label %if.end, label %if.end			br i1 undef, label %if.end, label %if.end

	if.end: ; preds = %lbl, %for.cond1			if.end: ; preds = %lbl, %for.cond1
	Show All 19 Lines
	}			}

	define void @fn_no_null_opt(i32* %P) #0 {			define void @fn_no_null_opt(i32* %P) #0 {
	; CHECK-LABEL: define {{[^@]+}}@fn_no_null_opt			; CHECK-LABEL: define {{[^@]+}}@fn_no_null_opt
	; CHECK-SAME: (i32* nocapture nofree writeonly [[P:%.*]])			; CHECK-SAME: (i32* nocapture nofree writeonly [[P:%.*]])
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br label [[IF_END:%.*]]			; CHECK-NEXT: br label [[IF_END:%.*]]
	; CHECK: for.cond1:			; CHECK: for.cond1:
	; CHECK-NEXT: br i1 undef, label [[IF_END]], label [[IF_END]]			; CHECK-NEXT: unreachable
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: [[E_2:%.]] = phi i32 [ undef, [[ENTRY:%.]] ], [ null, [[FOR_COND1:%.]] ], [ null, [[FOR_COND1]] ]			; CHECK-NEXT: [[TMP0:%.]] = load i32, i32 undef, align 4
	; CHECK-NEXT: [[TMP0:%.]] = load i32, i32 [[E_2]], align 4
	; CHECK-NEXT: [[CALL:%.*]] = call i32 @fn0(i32 [[TMP0]])			; CHECK-NEXT: [[CALL:%.*]] = call i32 @fn0(i32 [[TMP0]])
	; CHECK-NEXT: store i32 [[CALL]], i32* [[P]]			; CHECK-NEXT: store i32 [[CALL]], i32* [[P]]
	; CHECK-NEXT: br label [[FOR_COND1]]			; CHECK-NEXT: br label %for.cond1
	;			;
	entry:			entry:
	br label %if.end			br label %if.end

	for.cond1: ; preds = %if.end, %for.end			for.cond1: ; preds = %if.end, %for.end
	br i1 undef, label %if.end, label %if.end			br i1 undef, label %if.end, label %if.end

	if.end: ; preds = %lbl, %for.cond1			if.end: ; preds = %lbl, %for.cond1
	Show All 22 Lines

llvm/test/Transforms/Attributor/IPConstantProp/fp-bc-icmp-const-fold.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes
	; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s \| FileCheck %s			; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=1 < %s \| FileCheck %s
	target datalayout = "E-m:e-i64:64-n32:64"			target datalayout = "E-m:e-i64:64-n32:64"
	target triple = "powerpc64-bgq-linux"			target triple = "powerpc64-bgq-linux"

	define void @test(i32 signext %n) {			define void @test(i32 signext %n) {
	; CHECK-LABEL: define {{[^@]+}}@test			; CHECK-LABEL: define {{[^@]+}}@test
	; CHECK-SAME: (i32 signext [[N:%.*]])			; CHECK-SAME: (i32 signext [[N:%.*]])
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br i1 undef, label [[IF_THEN:%.]], label [[IF_END:%.]]			; CHECK-NEXT: unreachable
	; CHECK: if.then:			; CHECK: if.then:
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	; CHECK: if.then2:			; CHECK: if.then2:
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	; CHECK: if.end4:			; CHECK: if.end4:
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	▲ Show 20 Lines • Show All 60 Lines • Show Last 20 Lines

llvm/test/Transforms/Attributor/IPConstantProp/solve-after-each-resolving-undefs-for-function.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py UTC_ARGS: --function-signature --scrub-attributes
	; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s \| FileCheck %s			; RUN: opt -S -passes=attributor -aa-pipeline='basic-aa' -attributor-disable=false -attributor-max-iterations-verify -attributor-max-iterations=2 < %s \| FileCheck %s

	define internal i32 @testf(i1 %c) {			define internal i32 @testf(i1 %c) {
	; CHECK-LABEL: define {{[^@]+}}@testf			; CHECK-LABEL: define {{[^@]+}}@testf
	; CHECK-SAME: (i1 [[C:%.*]])			; CHECK-SAME: (i1 [[C:%.*]])
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: br i1 [[C]], label [[IF_COND:%.]], label [[IF_END:%.]]			; CHECK-NEXT: br i1 [[C]], label [[IF_COND:%.]], label [[IF_END:%.]]
	; CHECK: if.cond:			; CHECK: if.cond:
	; CHECK-NEXT: br i1 undef, label [[IF_THEN:%.*]], label [[IF_END]]			; CHECK-NEXT: unreachable
	; CHECK: if.then:			; CHECK: if.then:
	; CHECK-NEXT: unreachable			; CHECK-NEXT: unreachable
	; CHECK: if.end:			; CHECK: if.end:
	; CHECK-NEXT: ret i32 10			; CHECK-NEXT: ret i32 10
	;			;
	entry:			entry:
	br i1 %c, label %if.cond, label %if.end			br i1 %c, label %if.cond, label %if.end

	▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

llvm/test/Transforms/Attributor/undefined_behavior.ll

	; RUN: opt --attributor --attributor-disable=false -S < %s \| FileCheck %s --check-prefix=ATTRIBUTOR			; RUN: opt --attributor --attributor-disable=false -S < %s \| FileCheck %s --check-prefix=ATTRIBUTOR

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"

	; Test cases specifically designed for the "undefined behavior" abstract function attribute.			; Test cases specifically designed for the "undefined behavior" abstract function attribute.
	; We want to verify that whenever undefined behavior is assumed, the code becomes unreachable.			; We want to verify that whenever undefined behavior is assumed, the code becomes unreachable.
	; We use FIXME's to indicate problems and missing attributes.			; We use FIXME's to indicate problems and missing attributes.

	; -- Load tests --			; -- Load tests --

	; ATTRIBUTOR-LABEL: define void @load_wholly_unreachable()
	define void @load_wholly_unreachable() {			define void @load_wholly_unreachable() {
				; ATTRIBUTOR-LABEL: @load_wholly_unreachable(
	; ATTRIBUTOR-NEXT: unreachable			; ATTRIBUTOR-NEXT: unreachable
				;
	%a = load i32, i32* null			%a = load i32, i32* null
	ret void			ret void
	}			}

	define void @load_single_bb_unreachable(i1 %cond) {			define void @load_single_bb_unreachable(i1 %cond) {
	; ATTRIBUTOR-LABEL: @load_single_bb_unreachable(			; ATTRIBUTOR-LABEL: @load_single_bb_unreachable(
	; ATTRIBUTOR-NEXT: br i1 [[COND:%.]], label [[T:%.]], label [[E:%.*]]			; ATTRIBUTOR-NEXT: br i1 [[COND:%.]], label [[T:%.]], label [[E:%.*]]
	; ATTRIBUTOR: t:			; ATTRIBUTOR: t:
	Show All 13 Lines
	; ATTRIBUTOR-LABEL: @load_null_pointer_is_defined(			; ATTRIBUTOR-LABEL: @load_null_pointer_is_defined(
	; ATTRIBUTOR-NEXT: [[A:%.]] = load i32, i32 null			; ATTRIBUTOR-NEXT: [[A:%.]] = load i32, i32 null
	; ATTRIBUTOR-NEXT: ret void			; ATTRIBUTOR-NEXT: ret void
	;			;
	%a = load i32, i32* null			%a = load i32, i32* null
	ret void			ret void
	}			}

				define internal i32* @ret_null() {
				ret i32* null
				}

				; FIXME: null is propagated but the instruction
				; is not changed to unreachable.
				define void @load_null_propagated() {
				; ATTRIBUTOR-LABEL: @load_null_propagated(
				; ATTRIBUTOR-NEXT: [[A:%.]] = load i32, i32 null
				; ATTRIBUTOR-NEXT: ret void
				;
				%ptr = call i32* @ret_null()
				%a = load i32, i32* %ptr
				ret void
				}

	; -- Store tests --			; -- Store tests --

	define void @store_wholly_unreachable() {			define void @store_wholly_unreachable() {
	; ATTRIBUTOR-LABEL: @store_wholly_unreachable(			; ATTRIBUTOR-LABEL: @store_wholly_unreachable(
	; ATTRIBUTOR-NEXT: unreachable			; ATTRIBUTOR-NEXT: unreachable
	;			;
	store i32 5, i32* null			store i32 5, i32* null
	ret void			ret void
	▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines
	define void @atomiccmpxchg_null_pointer_is_defined() "null-pointer-is-valid"="true" {			define void @atomiccmpxchg_null_pointer_is_defined() "null-pointer-is-valid"="true" {
	; ATTRIBUTOR-LABEL: @atomiccmpxchg_null_pointer_is_defined(			; ATTRIBUTOR-LABEL: @atomiccmpxchg_null_pointer_is_defined(
	; ATTRIBUTOR-NEXT: [[A:%.]] = cmpxchg i32 null, i32 2, i32 3 acq_rel monotonic			; ATTRIBUTOR-NEXT: [[A:%.]] = cmpxchg i32 null, i32 2, i32 3 acq_rel monotonic
	; ATTRIBUTOR-NEXT: ret void			; ATTRIBUTOR-NEXT: ret void
	;			;
	%a = cmpxchg i32* null, i32 2, i32 3 acq_rel monotonic			%a = cmpxchg i32* null, i32 2, i32 3 acq_rel monotonic
	ret void			ret void
	}			}

				; Note: The unreachable on %t and %e is _not_ from AAUndefinedBehavior

				define i32 @cond_br_on_undef() {
				; ATTRIBUTOR-LABEL: @cond_br_on_undef(
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: t:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: e:
				; ATTRIBUTOR-NEXT: unreachable
				;

				br i1 undef, label %t, label %e
				t:
				ret i32 1
				e:
				ret i32 2
				}

				; More complicated branching
				define void @cond_br_on_undef2(i1 %cond) {
				; ATTRIBUTOR-LABEL: @cond_br_on_undef2(
				; ATTRIBUTOR-NEXT: br i1 [[COND:%.]], label [[T1:%.]], label [[E1:%.*]]
				; ATTRIBUTOR: t1:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: t2:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: e2:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: e1:
				; ATTRIBUTOR-NEXT: ret void
				;

				; Valid branch - verify that this is not converted
				; to unreachable.
				br i1 %cond, label %t1, label %e1
				t1:
				br i1 undef, label %t2, label %e2
				t2:
				ret void
				e2:
				ret void
				e1:
				ret void
				}

				define i1 @ret_undef() {
				ret i1 undef
				}

				define void @cond_br_on_undef_interproc() {
				; ATTRIBUTOR-LABEL: @cond_br_on_undef_interproc(
				; ATTRIBUTOR-NEXT: %cond = call i1 @ret_undef()
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: t:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: e:
				; ATTRIBUTOR-NEXT: unreachable

				%cond = call i1 @ret_undef()
				br i1 %cond, label %t, label %e
				t:
				ret void
				e:
				ret void
				}

				define i1 @ret_undef2() {
				br i1 true, label %t, label %e
				t:
				ret i1 undef
				e:
				ret i1 undef
				}

				; More complicated interproc deduction of undef
				define void @cond_br_on_undef_interproc2() {
				; ATTRIBUTOR-LABEL: @cond_br_on_undef_interproc2(
				; ATTRIBUTOR-NEXT: %cond = call i1 @ret_undef2()
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: t:
				; ATTRIBUTOR-NEXT: unreachable
				; ATTRIBUTOR: e:
				; ATTRIBUTOR-NEXT: unreachable
				%cond = call i1 @ret_undef2()
				br i1 %cond, label %t, label %e
				t:
				ret void
				e:
				ret void
				}

				; Branch on undef that depends on propagation of
				; undef of a previous instruction.
				; FIXME: Currently it doesn't propagate the undef.
				define i32 @cond_br_on_undef3() {
				; ATTRIBUTOR-LABEL: @cond_br_on_undef3(
				; ATTRIBUTOR-NEXT: %cond = icmp ne i32 1, undef
				; ATTRIBUTOR-NEXT: br i1 %cond, label %t, label %e
				; ATTRIBUTOR: t:
				; ATTRIBUTOR-NEXT: ret i32 1
				; ATTRIBUTOR: e:
				; ATTRIBUTOR-NEXT: ret i32 2

				%cond = icmp ne i32 1, undef
				br i1 %cond, label %t, label %e
				t:
				ret i32 1
				e:
				ret i32 2
				}

				; Branch on undef because of uninitialized value.
				; FIXME: Currently it doesn't propagate the undef.
				define i32 @cond_br_on_undef_uninit() {
				; ATTRIBUTOR-LABEL: @cond_br_on_undef_uninit(
				; ATTRIBUTOR-NEXT: %alloc = alloca i1
				; ATTRIBUTOR-NEXT: %cond = load i1, i1* %alloc
				; ATTRIBUTOR-NEXT: br i1 %cond, label %t, label %e
				; ATTRIBUTOR: t:
				; ATTRIBUTOR-NEXT: ret i32 1
				; ATTRIBUTOR: e:
				; ATTRIBUTOR-NEXT: ret i32 2

				%alloc = alloca i1
				%cond = load i1, i1* %alloc
				br i1 %cond, label %t, label %e
				t:
				ret i32 1
				e:
				ret i32 2
				}

This is an archive of the discontinued LLVM Phabricator instance.

[Attributor] AAUndefinedBehavior: Check for branches on undef value.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 235503

llvm/include/llvm/Transforms/IPO/Attributor.h

llvm/lib/Transforms/IPO/Attributor.cpp

llvm/test/Transforms/Attributor/IPConstantProp/PR26044.ll

llvm/test/Transforms/Attributor/IPConstantProp/fp-bc-icmp-const-fold.ll

llvm/test/Transforms/Attributor/IPConstantProp/solve-after-each-resolving-undefs-for-function.ll

llvm/test/Transforms/Attributor/undefined_behavior.ll

[Attributor] AAUndefinedBehavior: Check for branches on undef value.
ClosedPublic