This is an archive of the discontinued LLVM Phabricator instance.

Differential D71974

[Attributor][WIP] Connect AAIsDead with AAUndefinedBehavior
Needs Review · Public

Authored by baziotis on Dec 29 2019, 11:27 AM.

Details

Reviewers

jdoerfert
uenoku
sstefan1

Summary

In AAIsDeadFunction, don't include AliveSuccessors that are UB.

Event Timeline

baziotis created this revision. Dec 29 2019, 11:27 AM

Herald added a project: Restricted Project. Dec 29 2019, 11:27 AM

Herald added subscribers: llvm-commits, hiraditya.

I think we need to query AAUB from AAIsDead in the updateImpl. See the TODO in AAIsDeadFunction::updateImpl(Attributor &A). Similarly, in AAUB we should go through the explorer context when we add something to the knownUB set. So if I is knownUB, make all instructions in the must-be-executed-context of I knownUB, thus insert all into the set. In AAIsDead we can then simply query isKnownUB and that will only need to look into the set (as before). In addition to the TODO mentioned before we need the same logic in the initialize from AAIsDeadFunction before we add the entry block as live.
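The set-propagation idea in the comment above can be sketched with a toy model. This is not the Attributor API: `Inst` is just an integer ID, and `ContextFn` is a hypothetical stand-in for the must-be-executed-context explorer. The point is only that once the context is folded into the set at insertion time, a later query is a plain lookup.

```cpp
#include <cassert>
#include <functional>
#include <set>
#include <vector>

// Toy stand-in for an instruction: an integer ID (an assumption for
// illustration, not the LLVM Instruction class).
using Inst = int;

// Hypothetical callback returning every instruction that is always executed
// together with the given one.
using ContextFn = std::function<std::vector<Inst>(Inst)>;

struct ToyUBState {
  std::set<Inst> KnownUB;

  // Record I as known UB and also record every instruction in its
  // must-be-executed context, so later queries only consult the set.
  void addKnownUB(Inst I, const ContextFn &Context) {
    KnownUB.insert(I);
    for (Inst C : Context(I))
      KnownUB.insert(C);
  }

  // Plain set lookup; no context walk is needed at query time.
  bool isKnownUB(Inst I) const { return KnownUB.count(I) != 0; }
};
```

In this sketch the cost of walking the context is paid once per discovered UB instruction instead of once per query.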

llvm/lib/Transforms/IPO/Attributor.cpp
2220	We should avoid caching stuff in the AAs. `isKnownToCauseUB` can take an Attributor reference.
3175	As mentioned above, you can add the Attributor as an argument to calls.

In D71974#1798289, @jdoerfert wrote:

I think we need to query AAUB from AAIsDead in the updateImpl. See the TODO in AAIsDeadFunction::updateImpl(Attributor &A).

I didn't see that at all, thanks! I'll give it a try.

Similarly, in AAUB we should go through the explorer context when we add something to the knownUB set. So if I is knownUB, make all instructions in the must-be-executed-context of I knownUB, thus insert all into the set. In AAIsDead we can then simply query isKnownUB and that will only need to look into the set (as before).

If I understand it correctly, this will mark as UB all the instructions from the UB instruction and forwards (i.e. the must be executed context goes to the successors). That will probably complicate things as to see if a BB is dead, with the current code, it makes sense to see its first instruction right?

In addition to the TODO mentioned before we need the same logic in the initialize from AAIsDeadFunction before we add the entry block as live.

Oh yes, I had added that and then removed it, basically because I think it can't easily work with assumed info. I'll add it with known info to make clear what I mean.

llvm/lib/Transforms/IPO/Attributor.cpp
3175	Ok, thanks, I'll do it that way.

Query AAUB in identifyAliveSuccessors() for BranchInst.

This is essentially a proposed PoC on how to use AAUB in AAIsDeadFunction::updateImpl(). The TODO there says "look for (assumed) UB to backwards propagate "deadness"".
In my understanding that seems kind of unorthodox since the whole process is moving forwards (with an "artificial" recursion so it's not easy to "go back" since we never actually return back). Because of that, I thought it may be better instead of that to just "not go forwards" on successors (and BBs) that have UB (not going forwards into them means they're not put in AssumedLiveBlocks, effectively keeping them dead).

Passed the attributor in isAssumedDead() etc. (that resulted in changes all over the place, I hope that's ok).

baziotis marked an inline comment as done. Dec 30 2019, 6:52 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3169–3172	With the known info it's easy because we don't need to remember that we were based on assumed info (in the `updateImpl()`). But now that I see that again, it probably is as simple as inserting `Front` in the `ToBeExploredFrom` (always) and only make the `EntryBlock` live if it is not assumed to cause UB.

In D71974#1798878, @baziotis wrote:

In D71974#1798289, @jdoerfert wrote:

Similarly, in AAUB we should go through the explorer context when we add something to the knownUB set. So if I is knownUB, make all instructions in the must-be-executed-context of I knownUB, thus insert all into the set. In AAIsDead we can then simply query isKnownUB and that will only need to look into the set (as before).

If I understand it correctly, this will mark as UB all the instructions from the UB instruction and forwards (i.e. the must be executed context goes to the successors). That will probably complicate things as to see if a BB is dead, with the current code, it makes sense to see its first instruction right?

The must-be-executed-context is not defined to only contain "successors". It might right now but that will change eventually.

I think we should stick to known information right now.

I still believe this is more complex and less general than the approach I tried to describe. Maybe the following is a good compromise in the direction I think we should be going, but one that already works right now:

Since the must-be-executed-context is not collecting predecessors yet, we need to change that. The code actually exists already, it just needs to be separated from some other improvements and put for review again. I'll look into that (or @uenoku you can if you want to).
Collecting instructions that are executed always with one which causes UB in the "knownUB" set seems logical to me. After all, their execution is "eventually" leading to UB. Having them in the set gives us a single consistent way to check instead of iterating over must-be-executed contexts all the time. We can keep the context loop in isKnownUB for now but we should eventually not do that (thus we should add a TODO).
During the liveness exploration we should check if the beginning of a block is known to cause UB, if so, we do not make it live. We can even check on the instruction level if we want to later. For now, we should be able to do all this in the assumeLive method by "not assuming it is live" if it leads to UB.
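The last point of the plan above, refusing to mark a block live when its beginning is known to cause UB, can be sketched as follows. This is a toy model under stated assumptions: `Block` is an integer ID and `KnownUBBlocks` is a hypothetical set standing in for a query to AAUB on the block's first instruction. Only the name `assumeLive` is taken from the discussion.

```cpp
#include <cassert>
#include <set>

// Toy stand-in for a basic block: an integer ID.
using Block = int;

struct ToyLiveness {
  std::set<Block> AssumedLiveBlocks;
  // Hypothetical: blocks whose first instruction is known to cause UB.
  std::set<Block> KnownUBBlocks;

  // Returns true only if B was newly marked live. A block that is known to
  // lead to UB is never inserted, so it simply stays dead.
  bool assumeLive(Block B) {
    if (KnownUBBlocks.count(B))
      return false;
    return AssumedLiveBlocks.insert(B).second;
  }
};
```

Because the set only ever grows, this variant keeps the liveness state monotone.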

Are no tests affected?

In D71974#1799138, @jdoerfert wrote:

In D71974#1798878, @baziotis wrote:

In D71974#1798289, @jdoerfert wrote:

Similarly, in AAUB we should go through the explorer context when we add something to the knownUB set. So if I is knownUB, make all instructions in the must-be-executed-context of I knownUB, thus insert all into the set. In AAIsDead we can then simply query isKnownUB and that will only need to look into the set (as before).

If I understand it correctly, this will mark as UB all the instructions from the UB instruction and forwards (i.e. the must be executed context goes to the successors). That will probably complicate things as to see if a BB is dead, with the current code, it makes sense to see its first instruction right?

The must-be-executed-context is not defined to only contain "successors". It might right now but that will change eventually.

I think we should stick to known information right now.

Ok, isAssumedToCauseUB() is overoptimistic right now.

I still believe this is more complex and less general than the approach I tried to describe. Maybe the following is a good compromise in the direction I think we should be going, but one that already works right now:

Since the must-be-executed-context is not collecting predecessors yet, we need to change that. The code actually exists already, it just needs to be separated from some other improvements and put for review again. I'll look into that (or @uenoku you can if you want to).

Great, I didn't know this was planned.

Collecting instructions that are executed always with one which causes UB in the "knownUB" set seems logical to me. After all, their execution is "eventually" leading to UB. Having them in the set gives us a single consistent way to check instead of iterating over must-be-executed contexts all the time. We can keep the context loop in isKnownUB for now but we should eventually not do that (thus we should add a TODO).

I surely agree. My comment was relative to the current MustBeExecutedContext. That is, since it currently only looks at successors (and I didn't know this was planned to change), it would be difficult to add the right predecessors to the set when we find a UB instruction.
That in turn would complicate the connection with AAIsDead, where I was looking at the start of the block (which could be a predecessor of a UB instruction). Now it's clear, thanks!

During the liveness exploration we should check if the beginning of a block is known to cause UB, if so, we do not make it live. We can even check on the instruction level if we want to later. For now, we should be able to do all this in the assumeLive method by "not assuming it is live" if it leads to UB.

This is sort of what I was going for (if you see the identifyAliveSuccessors for BranchInst, it marks alive only BBs whose first instruction is not UB, hence the whole forwards vs backwards etc.). But as you said, I made it too complicated: instead of modifying all identifyAliveSuccessors() etc., I could just put the code in assumeLive(), which I will now do. :)

Are no tests affected?

A lot, I'll upload in the next diff.

EDIT:

I think we should stick to known information right now.

There's one problem with that when I tried it in assumeLive(). It queries AAUB and blocks that are not yet known to cause UB are assumed live and are never re-tested.

In D71974#1799138, @jdoerfert wrote:

In D71974#1798878, @baziotis wrote:

In D71974#1798289, @jdoerfert wrote:

Similarly, in AAUB we should go through the explorer context when we add something to the knownUB set. So if I is knownUB, make all instructions in the must-be-executed-context of I knownUB, thus insert all into the set. In AAIsDead we can then simply query isKnownUB and that will only need to look into the set (as before).

If I understand it correctly, this will mark as UB all the instructions from the UB instruction and forwards (i.e. the must be executed context goes to the successors). That will probably complicate things as to see if a BB is dead, with the current code, it makes sense to see its first instruction right?

The must-be-executed-context is not defined to only contain "successors". It might right now but that will change eventually.
I think we should stick to known information right now.

I still believe this is more complex and less general than the approach I tried to describe. Maybe the following is a good compromise in the direction I think we should be going, but one that already works right now:

Since the must-be-executed-context is not collecting predecessors yet, we need to change that. The code actually exists already, it just needs to be separated from some other improvements and put for review again. I'll look into that (or @uenoku you can if you want to).

I'd like to work on it.

There's one problem with that when I tried it in assumeLive(). It queries AAUB and blocks that are not yet known to cause UB are assumed live and are never re-tested.

I see. Now this becomes a little more involved. What we can do is to call AAUB::isAssumedToCauseUB(BB.front()). Since we haven't visited BB yet, we will say true if there are instructions in there that might cause UB.
Now, that will require us to keep track of the instruction that transferred control to BB, as we have to ask AAUB again. Not in assumeLive but in the updateImpl, before the if(UsedAssumedInformation) check, we can filter the AliveSuccessors based on isAssumedToCauseUB(BB). If we filtered a block, we need to set UsedAssumedInformation to true so we will revisit that instruction in the future. If isAssumedToCauseUB(BB) is true and isKnownToCauseUB(BB) as well, we can even remove the block without setting UsedAssumedInformation.
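A minimal sketch of the filtering step described above, written against a toy model rather than the real Attributor classes. `ToyUBQuery`, `filterAliveSuccessors`, and the integer `Block` IDs are hypothetical names introduced for illustration; only the known/assumed distinction and the `UsedAssumedInformation` flag come from the comment.

```cpp
#include <cassert>
#include <set>
#include <vector>

// Toy stand-in for a basic block: an integer ID.
using Block = int;

// Hypothetical stand-in for AAUB. Known UB implies assumed UB.
struct ToyUBQuery {
  std::set<Block> Known;       // proven to cause UB
  std::set<Block> AssumedOnly; // optimistically assumed, not yet proven

  bool isKnownToCauseUB(Block B) const { return Known.count(B) != 0; }
  bool isAssumedToCauseUB(Block B) const {
    return isKnownToCauseUB(B) || AssumedOnly.count(B) != 0;
  }
};

// Drops every successor assumed to cause UB. The flag is set only when a
// removal relied on assumed (not known) information, which signals that the
// transferring instruction has to be revisited later.
std::vector<Block> filterAliveSuccessors(const std::vector<Block> &Succs,
                                         const ToyUBQuery &UB,
                                         bool &UsedAssumedInformation) {
  std::vector<Block> Kept;
  for (Block B : Succs) {
    if (!UB.isAssumedToCauseUB(B)) {
      Kept.push_back(B);
      continue;
    }
    if (!UB.isKnownToCauseUB(B))
      UsedAssumedInformation = true;
  }
  return Kept;
}
```

A successor removed on known information never needs a revisit, which is why the flag stays untouched in that case.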

In D71974#1799188, @uenoku wrote:

In D71974#1799138, @jdoerfert wrote:

Since the must-be-executed-context is not collecting predecessors yet, we need to change that. The code actually exists already, it just needs to be separated from some other improvements and put for review again. I'll look into that (or @uenoku you can if you want to).

I'd like to work on it.

Thx. There is a prototype patch out there that you can/should use as a starting point.

Thanks for your suggestion Johannes, fortunately that was an easy change. But it also impacts about 26 tests or so, which I won't be able to fix today.
However now the interaction between AAUB and AAIsDead becomes interesting. For example, for this code:

define internal i1 @ret_undef() {
  ret i1 undef
}

define void @cond_br_on_undef_interproc() {
  %cond = call i1 @ret_undef()
  br i1 %cond, label %t, label %e
t:
  ret void
e:
  ret void
}

the Attributor leaves... you guessed it, nothing. :)

jdoerfert added inline comments. Dec 30 2019, 9:27 PM

llvm/lib/Transforms/IPO/Attributor.cpp
3481	I think something like `make_filter_range` from `llvm/include/llvm/ADT/STLExtras.h` could make this nicer. Or maybe we just need to put it in a helper function, or both.

baziotis marked an inline comment as done. Dec 31 2019, 8:53 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3481	Yes, using a helper would be better. `make_filter_range` seems cool, I didn't know it. TBH though, I think it gives less control and predictability. It's not as clear to me as with the current code what happens under the hood (apart from the fact that we'll query the `AAUB` twice). Also, FWIW, the current code has a worst case of one allocation and one free and an average case of no allocation / free (assuming that most instructions have fewer than 8 successors). I'll put it in a helper function; let me know if you still think it's not that good and I'll change it to `make_filter_range`.

baziotis marked an inline comment as done. Dec 31 2019, 9:05 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3481	Btw, I forgot to explain the `AssumedLiveBlocks.erase(BB)` part. Basically, if an alive successor is UB and its parent BB has been put into `AssumedLiveBlocks`, then we should mark that BB dead by removing it. That however seems bad to me. Up to now, we only inserted into it, and it feels like something can go badly wrong if we start removing stuff (like ending up in an endless loop or no longer having a monotone procedure). I'll take a look again, it was a quick change, but feel free to let me know if we should worry about that.

jdoerfert added inline comments. Dec 31 2019, 9:27 AM

llvm/lib/Transforms/IPO/Attributor.cpp
3481	I haven't read the code in detail, some observations:

if (AAUB.isKnownToCauseUB(AliveSuccessor) || (Assumed = AAUB.isAssumedToCauseUB(AliveSuccessor))) {

Assumed is always true or uninitialized here. Reading it after results in true or UB. What you want is to check assumed and if true check known to determine if you used assumed information.

No need for `count` if you call `erase`. Just erase/insert stuff, the return value even tells you if it was in before.

I would like to understand the situation in which we think a block is UB only after it was put in `AssumedLiveBlocks`. We should not erase it but add an assertion so we can see if it triggers on the test. I hope it does not (and we can keep the assertion) as that would mean it works properly.
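The two observations above can be illustrated with a small sketch. It is a toy under stated assumptions: `classify` takes the two query results as plain booleans, whereas the patch would call AAUB for them, and `UBVerdict`, `classify`, and `eraseBlock` are hypothetical names. Every output is assigned on every path, so nothing is read uninitialized.

```cpp
#include <cassert>
#include <set>

// Toy stand-in for a basic block: an integer ID.
using Block = int;

struct UBVerdict {
  bool CausesUB;    // treat the successor as causing UB
  bool UsedAssumed; // the verdict relied on assumed-only information
};

// Check "assumed" first; "known" only decides whether assumed information
// was used. Known UB is expected to imply assumed UB.
UBVerdict classify(bool Known, bool Assumed) {
  if (!Assumed)
    return {false, false};
  return {true, !Known};
}

// std::set::erase returns the number of removed elements, so a separate
// count() call before erasing is unnecessary.
bool eraseBlock(std::set<Block> &Blocks, Block B) {
  return Blocks.erase(B) != 0;
}
```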

baziotis marked an inline comment as done. Dec 31 2019, 10:00 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3481	"Assumed is always true or uninitialized here. Reading it after results in true or UB. What you want is to check assumed and if true check known to determine if you used assumed information."

Yes, it should be initialized to `false`. Well, the reason the code has been written in this (maybe weird) way is that if `Known` returns true, we don't have to call assumed (which we would do if we always called assumed first). I think that if we just initialize it to `false` it will be OK.

"No need for count if you call erase. Just erase/insert stuff, the return value even tells you if it was in before."

Ty, I didn't know that.

"I would like to understand the situation in which we think a block is UB only after it was put in AssumedLiveBlocks. We should not erase it but add an assertion so we can see if it triggers on the test. I hope it does not (and we can keep the assertion) as that would mean it works properly."

A simple way this can happen is with this:

define void @cond_br_on_undef_interproc() {
  br i1 undef, label %t, label %e
t:
  ret void
e:
  ret void
}

Note that `br i1 undef, ...` is in the entry block, which is assumed live in `initialize()` based on known info (and of course, at this point, it's not yet known that it causes UB). So, this becomes:

define void @cond_br_on_undef_interproc() {
  unreachable
t:
  unreachable
e:
  unreachable
}

which is OK I guess, but we could have deleted the whole function (which we didn't, since the block is still alive). When other instructions go to the entry block, that gets more complicated. I'm not yet sure what happens for non-entry blocks.

baziotis marked an inline comment as done. Dec 31 2019, 10:07 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3481	Nit: Actually, we can remove `Assumed` completely and use `UsedAssumedInformation`, which will probably make the code too smart for our own good. :P

jdoerfert added inline comments. Dec 31 2019, 10:39 AM

llvm/lib/Transforms/IPO/Attributor.cpp
3481	"Note that br i1 undef, ... is in the entry block, which is assumed live in initialize() based on known info (and of course, at this point, it's not yet known it causes UB)."

We need to apply the same logic (via the helper) in the initialize.

"When other instructions go to the entry block, that gets more complicated."

That is forbidden.

"I'm not yet sure what happens for non-entry blocks."

I think the logic in the initialize will allow us to place an assertion here.

"Nit: Actually, we can remove Assumed completely and use UsedAssumedInformation which will probably make the code maybe too smart for our own good. :P"

Variables do not cost anything. Make the code readable first.

baziotis marked an inline comment as done. Dec 31 2019, 11:13 AM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3481	"When other instructions go to the entry block, that gets more complicated."
"That is forbidden."

Sorry, I didn't mean to say "go" as "branch to". I meant when we add other instructions to the entry block.

"We need to apply the same logic (via the helper) in the initialize."

Sorry, I didn't get that. Could you please specify what logic? I was thinking something like (in the initialize): if it is known to cause UB, just return. Otherwise, insert `Front` in `ToBeExplored` and `assumeLive()` the block only if `!AAUB.isAssumedToCauseUB(Front)`.

Unfortunately, I think that the fact that `isAssumedToCauseUB()` is over-optimistic will make all this more and more complicated. Take for example this:

define i32 @example(i1* %alloc) {
  %cond = load i1, i1* %alloc
  br i1 %cond, label %t, label %e
t:
  ret i32 1
e:
  ret i32 2
}

(and assuming we have the current thing in `initialize()`). The entry block will be marked live. `AAIsDeadFunction::updateImpl()` is called. The `br` is the successor of `%cond`. But, because `isAssumedToCauseUB()` is over-optimistic, it will assume the `br` is UB. This in turn will insert `%cond` into `KnownDeadEnds`.

Now we could say here that this is based on assumed info and another `updateImpl()` will correct it, but here's the catch (or you could call it something like a deadlock): `AAUB::updateImpl()` will be called. But this uses `checkForAllInstructions()`, which uses liveness, which in turn uses `isAssumedDead()`. The `br` will be assumed dead (because its predecessor, `%cond`, is in `KnownDeadEnds`) and won't make it to the predicate which would eventually add it to `AssumedNoUBInsts`, which would correct this whole situation. This means that the `br` never becomes live, this procedure continues, and we're left with:

define i32 @cond_br_on_undef_uninit(i1* nocapture nofree nonnull readonly dereferenceable(1) %alloc) #0 {
  %cond = load i1, i1* %alloc
  br i1 %cond, label %t, label %e
t:                                                ; preds = %0
  unreachable
e:                                                ; preds = %0
  unreachable
}
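The circular dependency described above can be reproduced in a toy fixpoint model. This is only a sketch under loud assumptions: instructions are indices in one straight-line block (0 is the load, 1 is the branch), no instruction really causes UB, and the two update orders are hypothetical schedules, not the Attributor's actual iteration order. `ToyFixpoint` and `branchEndsUpLive` are names invented for illustration.

```cpp
#include <cassert>
#include <set>

// Toy stand-in for an instruction in a straight-line block: its index.
using Inst = int;

struct ToyFixpoint {
  std::set<Inst> AssumedNoUB; // instructions the UB analysis has cleared
  std::set<Inst> DeadEnds;    // instructions after which nothing is live

  // Optimistic default: anything not yet cleared is assumed to cause UB.
  bool isAssumedUB(Inst I) const { return AssumedNoUB.count(I) == 0; }

  // I is live only if no earlier instruction is a dead end.
  bool isLive(Inst I) const {
    for (Inst D : DeadEnds)
      if (D < I)
        return false;
    return true;
  }

  // Liveness step: a live instruction becomes a dead end if its successor is
  // assumed to cause UB. Dead ends are only ever added, never removed.
  void updateLiveness(Inst Last) {
    for (Inst I = 0; I < Last; ++I)
      if (isLive(I) && isAssumedUB(I + 1))
        DeadEnds.insert(I);
  }

  // UB step: only live instructions are inspected. In this toy nothing
  // really causes UB, so every inspected instruction is cleared.
  void updateUB(Inst Last) {
    for (Inst I = 0; I <= Last; ++I)
      if (isLive(I))
        AssumedNoUB.insert(I);
  }
};

// Runs a few rounds in the given order and reports whether instruction 1
// (the branch) ends up live.
bool branchEndsUpLive(bool LivenessFirst) {
  ToyFixpoint F;
  for (int Round = 0; Round < 4; ++Round) {
    if (LivenessFirst) {
      F.updateLiveness(1);
      F.updateUB(1);
    } else {
      F.updateUB(1);
      F.updateLiveness(1);
    }
  }
  return F.isLive(1);
}
```

In this model, running liveness first makes the load a dead end before the branch is ever inspected, and the branch then stays dead in every later round. Running the UB step first clears the branch and no dead end is recorded.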

jdoerfert added inline comments. Dec 31 2019, 1:37 PM

llvm/lib/Transforms/IPO/Attributor.cpp
3481	"Sorry, I didn't get that. Could you please specify what logic?"

The same as you apply now to filter "alive successors". We should not mark entry live in the initialize if AAUB::isAssumedUB(EntryBB) is true. We probably need to redefine `KnownDeadEnds` such that we can add the entry block's first instruction during initialize and it will not be assumed live. That will require some changes but should be possible. Let's tackle that next year ;)

baziotis marked an inline comment as done. Dec 31 2019, 5:35 PM

baziotis added inline comments.

llvm/lib/Transforms/IPO/Attributor.cpp
3481	"Let's tackle that next year"

Yes, happy new year. :)

Sorry for being late, this is an exam period so time is very limited. I looked at the problem again; it has changed a bit but it's similar. To be sure we're on the same page, in this:

define i32 @example(i1* %alloc) {
  %cond = load i1, i1* %alloc
  br i1 %cond, label %t, label %e
t:
  ret i32 1
e:
  ret i32 2
}

The block of the br is initially assumed live, but then AAIsDeadFunction::updateImpl() is called, which, because the br is assumed UB, removes the block from AssumedLiveBlocks.
AAUB only looks at live instructions, so the br will now never be live (also maybe the fact that we're removing, and it's the only place where we do, messes with the monotonicity).

I think one solution is to only remove a block if it is known UB.

I forgot about this. Can we rebase it :) ?

In D71974#1877441, @jdoerfert wrote:

I forgot about this. Can we rebase it :) ?

No problem. I just rebased and at first glance, the problem now does not exist, meaning the @example is output as is.
But a lot of things have changed in the Attributor so certainly I don't understand what is happening. I'll check again tomorrow.

uenoku mentioned this in D74817: [MustExecute] Add backward exploration for must-be-executed-context. Feb 19 2020, 12:06 AM

uenoku mentioned this in rGe253cdda35eb: [MustExecute] Add backward exploration for must-be-executed-context. Feb 19 2020, 9:51 PM

I've been looking today again at this and it's the first time in the Attributor that I don't understand what is happening. This diff contains some prints and AAIsDeadFunction::manifest() is never called. I'll continue looking into this but any help is appreciated, I have lost episodes.

Rebased with AAUB not manifesting

In this diff, note that I have commented out the manifesting in AAUB so that, for now, we can test that UB blocks are deleted because of AAIsDead.
Btw, may I float the idea that this is how it should work in the end? That is, blocks would be deleted only because of AAIsDead.

Anyway, here are some examples:

define i32 @cond_br_on_undef_interproc(i1 %cond) {
  br i1 %cond, label %t, label %e
t:
  %a = load i32, i32* null
  ret i32 1
e:
  ret i32 2
}

Here, what we expect happens. The entry block is marked live initially and then, in AAIsDeadFunction::updateImpl(), the block with the UB is removed from the alive blocks.
When the manifest phase in Attributor::run() comes, AAIsDeadFunction has 2/3 live blocks, so it is considered live (the AA) and its manifest ultimately deletes the block.

define i32 @example(i1* %alloc) {
  %cond = load i1, i1* %alloc
  br i1 %cond, label %t, label %e
t:
  ret i32 1
e:
  ret i32 2
}

Here, isAssumedToCauseUB() is overoptimistic and that means that the branch is assumed UB and its block is removed in AAIsDeadFunction::updateImpl().
The catch is that although assumed information is used, AAUB never corrects that since it looks at live instructions for UB.
That's a problem in and of itself but that was happening before. The weird thing is that in the manifest stage of run, AAIsDeadFunction is considered dead because the entry point of the function is considered dead. Shouldn't that mean that the function has to be deleted?

Ping for when you have time. :)

As a note, I think that we shouldn't delete a block based on assumed info. Because then AAUB can never correct it.
This does not completely remove the problems but it should lead to a better path.

Tests?

llvm/lib/Transforms/IPO/Attributor.cpp
2332	What is happening here? Leftover?
2369	I would prefer if we pass the Explorer (or the Attributor) instead of caching it.
3475	I don't think retroactively removing blocks here is a good idea. Once assumed live it should stay that way.

Don't delete blocks. Still not correct though.

In D71974#1908675, @jdoerfert wrote:

Tests?

A lot of tests fail but I'm not sure if it makes sense to include any tests that test the new behavior yet. Do you want invalid tests?
For example, this now:

define i32 @example(i1* %alloc) {
  %cond = load i1, i1* %alloc
  br i1 %cond, label %t, label %e
t:
  ret i32 1
e:
  ret i32 2
}

results in:

define i32 @example(i1* nocapture nofree nonnull readnone dereferenceable(1) %alloc) #0 {
  br i1 undef, label %t, label %e

t:                                                ; preds = %0
  unreachable

e:                                                ; preds = %0
  unreachable
}

And again, as far as I can understand, removing something from alive successors means it isn't getting updated by the AAUB because AAUB never sees it (as it is dead).
Honestly, unfortunately I can't see how this connection fits into the whole picture given the current structure.

llvm/lib/Transforms/IPO/Attributor.cpp
2332	No, I think a previous commit states that I put that to test only what AAIsDead deletes. I removed it now.
2369	Yes sorry, that was a leftover.
3475	I agree, as I had stated in a previous comment: "(also maybe the fact that we're removing - and it's the only place - messes with the monotonicity)", which, as far as I can understand, still holds.

Let's postpone this until we have a chance to make the connection clear.

In D71974#1911845, @jdoerfert wrote:

Let's postpone this until we have a chance to make the connection clear.

Ok, good.

Revision Contents

Path	Size
llvm/include/llvm/Transforms/IPO/Attributor.h	7 lines
llvm/lib/Transforms/IPO/Attributor.cpp	44 lines

Diff 248798

llvm/include/llvm/Transforms/IPO/Attributor.h

[...]
 struct AAUndefinedBehavior
     : public StateWrapper<BooleanState, AbstractAttribute>,
       public IRPosition {
   AAUndefinedBehavior(const IRPosition &IRP) : IRPosition(IRP) {}

   /// Return true if "undefined behavior" is assumed.
   bool isAssumedToCauseUB() const { return getAssumed(); }

   /// Return true if "undefined behavior" is assumed for a specific instruction.
-  virtual bool isAssumedToCauseUB(Instruction *I) const = 0;
+  virtual bool isAssumedToCauseUB(const Instruction *I) const = 0;

   /// Return true if "undefined behavior" is known.
   bool isKnownToCauseUB() const { return getKnown(); }

   /// Return true if "undefined behavior" is known for a specific instruction.
-  virtual bool isKnownToCauseUB(Instruction *I) const = 0;
+  virtual bool isKnownToCauseUB(Attributor &A, const Instruction *I) const = 0;

   /// Return an IR position, see struct IRPosition.
   const IRPosition &getIRPosition() const override { return *this; }

   /// Create an abstract attribute view for the position \p IRP.
   static AAUndefinedBehavior &createForPosition(const IRPosition &IRP,
                                                 Attributor &A);

[...] (inside struct AAValueConstantRange : public IntegerRangeState,)
   /// If \p I is nullptr, simply return a known range.
   virtual ConstantRange
   getKnownConstantRange(Attributor &A,
                         const Instruction *CtxI = nullptr) const = 0;

   /// Return an assumed constant for the assocaited value a program point \p
   /// CtxI.
   Optional<ConstantInt *>
-  getAssumedConstantInt(Attributor &A, const Instruction *CtxI = nullptr) const {
+  getAssumedConstantInt(Attributor &A,
+                        const Instruction *CtxI = nullptr) const {
     ConstantRange RangeV = getAssumedConstantRange(A, CtxI);
     if (auto *C = RangeV.getSingleElement())
       return cast<ConstantInt>(
           ConstantInt::get(getAssociatedValue().getType(), *C));
     if (RangeV.isEmptySet())
       return llvm::None;
     return nullptr;
   }

   /// Unique ID (due to the unique address)
   static const char ID;
 };

 } // end namespace llvm

 #endif // LLVM_TRANSFORMS_IPO_FUNCTIONATTRS_H

llvm/lib/Transforms/IPO/Attributor.cpp

[...]
 };

 /// -------------------- Undefined-Behavior Attributes ------------------------

 struct AAUndefinedBehaviorImpl : public AAUndefinedBehavior {
   AAUndefinedBehaviorImpl(const IRPosition &IRP) : AAUndefinedBehavior(IRP) {}

   /// See AbstractAttribute::updateImpl(...).
-  // through a pointer (i.e. also branches etc.)
   ChangeStatus updateImpl(Attributor &A) override {
     const size_t UBPrevSize = KnownUBInsts.size();
     const size_t NoUBPrevSize = AssumedNoUBInsts.size();

     [Inline comment, jdoerfert: We should avoid caching stuff in the AAs. `isKnownToCauseUB` can take an Attributor reference.]

     auto InspectMemAccessInstForUB = [&](Instruction &I) {
       // Skip instructions that are already saved.
       if (AssumedNoUBInsts.count(&I) || KnownUBInsts.count(&I))
         return true;

       // If we reach here, we know we have an instruction
       // that accesses memory through a pointer operand,
▲ Show 20 Lines • Show All 57 Lines • ▼ Show 20 Lines	ChangeStatus updateImpl(Attributor &A) override {
A.checkForAllInstructions(InspectBrInstForUB, *this, {Instruction::Br},		A.checkForAllInstructions(InspectBrInstForUB, *this, {Instruction::Br},
/* CheckBBLivenessOnly */ true);		/* CheckBBLivenessOnly */ true);
if (NoUBPrevSize != AssumedNoUBInsts.size() ||		if (NoUBPrevSize != AssumedNoUBInsts.size() ||
UBPrevSize != KnownUBInsts.size())		UBPrevSize != KnownUBInsts.size())
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
return ChangeStatus::UNCHANGED;		return ChangeStatus::UNCHANGED;
}		}

bool isKnownToCauseUB(Instruction *I) const override {		bool isKnownToCauseUB(Attributor &A, const Instruction *I) const override {
return KnownUBInsts.count(I);		MustBeExecutedContextExplorer &Explorer =
		A.getInfoCache().getMustBeExecutedContextExplorer();
		for (auto It : Explorer.range(I))
		if (KnownUBInsts.count(It))
		return true;
		return false;
}		}
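The new `isKnownToCauseUB` above treats an instruction as known UB if anything in its must-be-executed context is in `KnownUBInsts`, and it receives the Attributor as a parameter instead of caching the explorer. A minimal sketch of that idea outside of LLVM, assuming a single straight-line block where the context of `I` is simply `I` and everything after it; `Inst`, `ToyExplorer` and the free function below are hypothetical stand-ins, not the LLVM API:

```cpp
#include <set>
#include <vector>

// Hypothetical stand-in for llvm::Instruction.
struct Inst {
  int Id;
};

// Hypothetical, heavily simplified explorer: the block is a straight line,
// so everything from I to the end of the block must execute once I does.
struct ToyExplorer {
  const std::vector<const Inst *> &Block;

  std::vector<const Inst *> range(const Inst *I) const {
    std::vector<const Inst *> Context;
    bool Reached = false;
    for (const Inst *X : Block) {
      if (X == I)
        Reached = true;
      if (Reached)
        Context.push_back(X);
    }
    return Context;
  }
};

// The explorer is passed in by reference on every query, not stored.
bool isKnownToCauseUB(const ToyExplorer &Explorer,
                      const std::set<const Inst *> &KnownUBInsts,
                      const Inst *I) {
  for (const Inst *X : Explorer.range(I))
    if (KnownUBInsts.count(X))
      return true;
  return false;
}
```

In this toy model an instruction that precedes a known-UB instruction is itself reported as known UB, while one that follows it is not.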

bool isAssumedToCauseUB(Instruction *I) const override {		bool isAssumedToCauseUB(const Instruction *I) const override {
// In simple words, if an instruction is not in the assumed to _not_		// In simple words, if an instruction is not in the assumed to _not_
// cause UB, then it is assumed UB (that includes those		// cause UB, then it is assumed UB (that includes those
// in the KnownUBInsts set). The rest is boilerplate		// in the KnownUBInsts set). The rest is boilerplate
// is to ensure that it is one of the instructions we test		// is to ensure that it is one of the instructions we test
// for UB.		// for UB.

switch (I->getOpcode()) {		switch (I->getOpcode()) {
case Instruction::Load:		case Instruction::Load:
[12 lines hidden; context: bool isAssumedToCauseUB(const Instruction *I) const override {]
}		}
return false;		return false;
}		}

ChangeStatus manifest(Attributor &A) override {		ChangeStatus manifest(Attributor &A) override {
if (KnownUBInsts.empty())		if (KnownUBInsts.empty())
return ChangeStatus::UNCHANGED;		return ChangeStatus::UNCHANGED;
for (Instruction *I : KnownUBInsts)		for (Instruction *I : KnownUBInsts)
A.changeToUnreachableAfterManifest(I);		A.changeToUnreachableAfterManifest(I);
		jdoerfert: What is happening here? Leftover?
		baziotis: No, I think a previous commit states that I put that to test only what AAIsDead deletes. I removed it now.
return ChangeStatus::CHANGED;		return ChangeStatus::CHANGED;
}		}

/// See AbstractAttribute::getAsStr()		/// See AbstractAttribute::getAsStr()
const std::string getAsStr() const override {		const std::string getAsStr() const override {
return getAssumed() ? "undefined-behavior" : "no-ub";		return getAssumed() ? "undefined-behavior" : "no-ub";
}		}

[20 lines hidden; context: struct AAUndefinedBehaviorImpl : public AAUndefinedBehavior {]
/// so that we don't reprocess them in every update.		/// so that we don't reprocess them in every update.
/// Note however that instructions in this set may cause UB.		/// Note however that instructions in this set may cause UB.

protected:		protected:
/// A set of all live instructions _known_ to cause UB.		/// A set of all live instructions _known_ to cause UB.
SmallPtrSet<Instruction *, 8> KnownUBInsts;		SmallPtrSet<Instruction *, 8> KnownUBInsts;

private:		private:
/// A set of all the (live) instructions that are assumed to _not_ cause UB.		/// A set of all the (live) instructions that are assumed to _not_ cause UB.
		jdoerfert: I would prefer if we pass the Explorer (or the Attributor) instead of caching it.
		baziotis: Yes sorry, that was a leftover.
SmallPtrSet<Instruction *, 8> AssumedNoUBInsts;		SmallPtrSet<Instruction *, 8> AssumedNoUBInsts;

// Should be called on updates in which if we're processing an instruction		// Should be called on updates in which if we're processing an instruction
// \p I that depends on a value \p V, one of the following has to happen:		// \p I that depends on a value \p V, one of the following has to happen:
// - If the value is assumed, then stop.		// - If the value is assumed, then stop.
// - If the value is known but undef, then consider it UB.		// - If the value is known but undef, then consider it UB.
// - Otherwise, do specific processing with the simplified value.		// - Otherwise, do specific processing with the simplified value.
// We return None in the first 2 cases to signify that an appropriate		// We return None in the first 2 cases to signify that an appropriate
[779 lines hidden]
};		};

struct AAIsDeadFunction : public AAIsDead {		struct AAIsDeadFunction : public AAIsDead {
AAIsDeadFunction(const IRPosition &IRP) : AAIsDead(IRP) {}		AAIsDeadFunction(const IRPosition &IRP) : AAIsDead(IRP) {}

/// See AbstractAttribute::initialize(...).		/// See AbstractAttribute::initialize(...).
void initialize(Attributor &A) override {		void initialize(Attributor &A) override {
const Function *F = getAssociatedFunction();		const Function *F = getAssociatedFunction();
if (F && !F->isDeclaration()) {		if (!F || F->isDeclaration())
ToBeExploredFrom.insert(&F->getEntryBlock().front());		return;
assumeLive(A, F->getEntryBlock());		const BasicBlock &EntryBlock = F->getEntryBlock();
		const Instruction &Front = EntryBlock.front();
		const auto &AAUB =
		A.getAAFor<AAUndefinedBehavior>(this, IRPosition::function(F));
		if (!AAUB.isKnownToCauseUB(A, &Front)) {
		ToBeExploredFrom.insert(&Front);
		baziotis: With the known info it's easy because we don't need to remember that we were based on assumed info (in the `updateImpl()`). But now that I see that again, it probably is as simple as inserting `Front` in the `ToBeExploredFrom` (always) and only make the `EntryBlock` live if it is not assumed to cause UB.
		assumeLive(A, EntryBlock);
}		}
}		}
		jdoerfert: As mentioned above, you can add the Attributor as a argument to calls.
		baziotis: Ok, thanks, I'll do it that way.
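The alternative that baziotis floats for `initialize()` (return early on known UB, otherwise always queue `Front` for exploration but only mark the entry block live when `Front` is not assumed to cause UB) could look roughly like the sketch below. This is an illustration under stated assumptions, not the patch: `Inst`, `ToyUBInfo` and `ToyLiveness` are hypothetical stand-ins, and the review leaves open whether this scheme is sufficient.

```cpp
#include <set>

// Hypothetical stand-in for llvm::Instruction.
struct Inst {
  int Id;
};

// Hypothetical stand-in for the two AAUndefinedBehavior queries.
struct ToyUBInfo {
  std::set<const Inst *> KnownUBInsts;     // proven to cause UB
  std::set<const Inst *> AssumedNoUBInsts; // assumed not to cause UB

  bool isKnownToCauseUB(const Inst *I) const {
    return KnownUBInsts.count(I) != 0;
  }
  // Optimistic definition: anything not assumed safe is assumed UB.
  bool isAssumedToCauseUB(const Inst *I) const {
    return AssumedNoUBInsts.count(I) == 0;
  }
};

// Hypothetical stand-in for the liveness state of one function.
struct ToyLiveness {
  std::set<const Inst *> ToBeExploredFrom;
  bool EntryAssumedLive = false;

  void initialize(const Inst &Front, const ToyUBInfo &UB) {
    // Known UB at the entry: nothing to explore.
    if (UB.isKnownToCauseUB(&Front))
      return;
    // Always revisit the entry in later updates.
    ToBeExploredFrom.insert(&Front);
    // Only commit to liveness if the entry is not assumed to cause UB.
    if (!UB.isAssumedToCauseUB(&Front))
      EntryAssumedLive = true;
  }
};
```

Note that with the optimistic definition of `isAssumedToCauseUB`, an instruction that has not been inspected yet counts as assumed UB, which is the complication raised later in the thread.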

/// See AbstractAttribute::getAsStr().		/// See AbstractAttribute::getAsStr().
const std::string getAsStr() const override {		const std::string getAsStr() const override {
return "Live[#BB " + std::to_string(AssumedLiveBlocks.size()) + "/" +		return "Live[#BB " + std::to_string(AssumedLiveBlocks.size()) + "/" +
std::to_string(getAssociatedFunction()->size()) + "][#TBEP " +		std::to_string(getAssociatedFunction()->size()) + "][#TBEP " +
std::to_string(ToBeExploredFrom.size()) + "][#KDE " +		std::to_string(ToBeExploredFrom.size()) + "][#KDE " +
std::to_string(KnownDeadEnds.size()) + "]";		std::to_string(KnownDeadEnds.size()) + "]";
}		}
[266 lines hidden; context: case Instruction::Br:]
*this, AliveSuccessors);		*this, AliveSuccessors);
break;		break;
case Instruction::Switch:		case Instruction::Switch:
UsedAssumedInformation = identifyAliveSuccessors(A, cast<SwitchInst>(*I),		UsedAssumedInformation = identifyAliveSuccessors(A, cast<SwitchInst>(*I),
*this, AliveSuccessors);		*this, AliveSuccessors);
break;		break;
}		}

		// Keep only successors that are not assumed to cause UB.

		// Note: Instead of removing the successors, it's cleaner
		// to make a copy in which we keep only the valid ones.
		SmallVector<const Instruction *, 8> AliveSuccessorsCopy;
		AliveSuccessorsCopy.reserve(AliveSuccessors.size());
		const Function *F = I->getFunction();
		const auto &AAUB =
		A.getAAFor<AAUndefinedBehavior>(this, IRPosition::function(F));
		for (const Instruction *AliveSuccessor : AliveSuccessors) {
		bool Assumed = false;
		if (AAUB.isKnownToCauseUB(A, AliveSuccessor) ||
		(Assumed = AAUB.isAssumedToCauseUB(AliveSuccessor))) {
		UsedAssumedInformation = Assumed;
		} else {
		AliveSuccessorsCopy.push_back(AliveSuccessor);
		}
		}
		jdoerfert: I don't think retroactively removing blocks here is a good idea. Once assumed live it should stay that way.
		baziotis: I agree, as I had stated in a previous comment:
		> (also maybe the fact that we're removing - and it's the only place - messes with the monotony).
		which as far as I can understand, still holds.
		AliveSuccessors = std::move(AliveSuccessorsCopy);

if (UsedAssumedInformation) {		if (UsedAssumedInformation) {
NewToBeExploredFrom.insert(I);		NewToBeExploredFrom.insert(I);
} else {		} else {
Change = ChangeStatus::CHANGED;		Change = ChangeStatus::CHANGED;
		jdoerfert: I think something like `make_filter_range` from `llvm/include/llvm/ADT/STLExtras.h` could make this nicer. Or maybe we just need to put it in a helper function, or both.
		baziotis: Yes, using a helper would be better. `make_filter_range` seems cool, I didn't know it. TBH though, I think it gives less control and predictability. It's not as clear to me as in the current what happens under the hood (apart from the fact that we'll query 2 times the `AAUB`). Also, FWIW, the current will have a worst case of one allocation and one free and an average case of no allocation / free (assuming that most instructions have less than 8 successors). I'll put it in a helper function and update me if you still think it's not that good and I'll change it to `make filter_range`.
		baziotis: Btw, I forgot to explain the `AssumedLiveBlocks.erase(BB)` part. Basically, if an alive successor is UB and its parent BB has been put into `AssumedLiveBlocks`, then we should mark that BB dead by removing it. That however seems bad to me. Up to now, we only inserted to it and it feels that something can go badly if we start removing stuff (like, go to an endless loop or the not having a monotone procedure). I'll take a look again, it was a quick change but feel free to update me if we should worry about that.
		jdoerfert: I haven't read the code in detail, some observations:
		    if (AAUB.isKnownToCauseUB(AliveSuccessor) ||
		        (Assumed = AAUB.isAssumedToCauseUB(AliveSuccessor))) {
		Assumed is always true or uninitialized here. Reading it after results in true or UB. What you want is to check assumed and if true check known to determine if you used assumed information.
		No need for `count` if you call `erase`. Just erase/insert stuff, the return value even tells you if it was in before.
		I would like to understand the situation in which we think a block is UB only after it was put in `AssumedLiveBlocks`. We should not erase it but add an assertion so we can see if it triggers on the test. I hope it does not (and we can keep the assertion) as that would mean it works properly.
		baziotis:
		> Assumed is always true or uninitialized here. Reading it after results in true or UB. What you want is to check assumed and if true check known to determine if you used assumed information.
		Yes, it should be initialized to `false`. Well, the reason the code has been written in this (maybe weird) way is because if `Known` returns true, we don't have to call assumed (which we will do if we always call assumed first). I think that if we just initialize to `false` it will be ok.
		> No need for count if you call erase. Just erase/insert stuff, the return value even tells you if it was in before.
		Ty, I didn't know that.
		> I would like to understand the situation in which we think a block is UB only after it was put in AssumedLiveBlocks. We should not erase it but add an assertion so we can see if it triggers on the test. I hope it does not (and we can keep the assertion) as that would mean it works properly.
		A simple way this can happen is with that:
		    define void @cond_br_on_undef_interproc() {
		      br i1 undef, label %t, label %e
		    t:
		      ret void
		    e:
		      ret void
		    }
		Note that `br i1 undef, ...` is in the entry block, which is assumed live in `initialize()` based on known info (and of course, at this point, it's not yet known it causes UB). So, this becomes:
		    define void @cond_br_on_undef_interproc() {
		      unreachable
		    t:
		      unreachable
		    e:
		      unreachable
		    }
		which is ok I guess, but we could have deleted the whole function (which we didn't since the block is still alive). When other instructions go to the entry block, that gets more complicated. I'm not yet sure what happens for non-entry blocks.
		baziotis: Nit: Actually, we can remove `Assumed` completely and use `UsedAssumedInformation` which will probably make the code maybe too smart for our own good. :P
		jdoerfert:
		> Note that br i1 undef, ... is in the entry block, which is assumed live in initialize() based on known info (and of course, at this point, it's not yet known it causes UB).
		We need to apply the same logic (via the helper) in the initialize.
		> When other instructions go to the entry block, that gets more complicated.
		That is forbidden.
		> I'm not yet sure what happens for non-entry blocks.
		I think the logic in the initialize will allow us to place an assertion here.
		> Nit: Actually, we can remove Assumed completely and use UsedAssumedInformation which will probably make the code maybe too smart for our own good. :P
		Variables do not cost anything. Make the core readable first.
		baziotis:
		>> When other instructions go to the entry block, that gets more complicated.
		> That is forbidden.
		Sorry, I didn't mean to say "go" as "branch to". I meant when we add other instructions to the entry block.
		> We need to apply the same logic (via the helper) in the initialize.
		Sorry, I didn't get that. Could you please specify what logic? I was thinking something like (in the initialize): If it is known to cause UB, just return. Otherwise, insert `Front` in `ToBeExplored` and `assumeLive()` the block only if `!AAUB.isAssumedToCauseUB(Front)`.
		Unfortunately, I think that the fact that `isAssumedToCauseUB()` is over-optimistic will make all this more and more complicated. Take for example this:
		    define i32 @example(i1* %alloc) {
		      %cond = load i1, i1* %alloc
		      br i1 %cond, label %t, label %e
		    t:
		      ret i32 1
		    e:
		      ret i32 2
		    }
		(and assuming we're having the current thing in `initialize()`). The entry block will be marked live. `AAIsDeadFunction::updateImpl()` is called. The `br` is successor of `%cond`. But, because `isAssumedToCauseUB()` is overoptimistic, it will assume this UB. This in turn will insert `%cond` to `KnownDeadEnds`.
		Now we could say here that this is based on assumed info and another `updateImpl()` will correct but here's the catch (or you could call it something like a dead-lock): `AAUB::updateImpl()` will be called. But, this uses `checkForAllInstructions()`, which uses liveness, which in turn, uses `isAssumedDead()`. The `br` will be assumed dead (because its predecessor, `%cond`, is in `KnownDeadEnds`) and won't make it to the predicate which would eventually add it to `AssumedNoUBInsts` which would correct all this situation.
		This means that the `br` never gets live and this procedure continues and we're left with:
		    define i32 @cond_br_on_undef_uninit(i1* nocapture nofree nonnull readonly dereferenceable(1) %alloc) #0 {
		      %cond = load i1, i1* %alloc
		      br i1 %cond, label %t, label %e
		    t:                                                ; preds = %0
		      unreachable
		    e:                                                ; preds = %0
		      unreachable
		    }
		jdoerfert:
		> Sorry, I didn't get that. Could you please specify what logic?
		The same as you apply now to filter "alive successors". We should not mark entry live in the initilize if AAUB::isAssumedUB(EntryBB) is true. We probably need to redefine `KnownDeadEnds` such that we can add the entry block first instruction during initilize and it will not be assumed live. That will require some changes but should be possible. Let's talke that next year ;)
		baziotis:
		> Let's talke that next year
		Yes, happy new year. :)
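The helper discussed in this thread, with the ordering jdoerfert describes (query the assumed state first, and only if it holds query the known state to decide whether assumed information was used), can be sketched in plain standard C++. This is a hedged illustration, not the patch: `Inst`, `ToyUBInfo` and `filterAliveSuccessors` are hypothetical names, `std::vector` stands in for `SmallVector`, and an explicit loop stands in for `make_filter_range`.

```cpp
#include <set>
#include <vector>

// Hypothetical stand-in for llvm::Instruction.
struct Inst {
  int Id;
};

// Hypothetical stand-in for the two AAUndefinedBehavior queries.
struct ToyUBInfo {
  std::set<const Inst *> KnownUBInsts;     // proven to cause UB
  std::set<const Inst *> AssumedNoUBInsts; // assumed not to cause UB

  bool isKnownToCauseUB(const Inst *I) const {
    return KnownUBInsts.count(I) != 0;
  }
  // Optimistic definition: anything not assumed safe is assumed UB
  // (this includes everything in KnownUBInsts).
  bool isAssumedToCauseUB(const Inst *I) const {
    return AssumedNoUBInsts.count(I) == 0;
  }
};

// Returns the successors that are not assumed to cause UB. Sets
// UsedAssumedInformation to true if any successor was dropped on assumed
// (not yet known) information; never resets it to false.
std::vector<const Inst *>
filterAliveSuccessors(const std::vector<const Inst *> &AliveSuccessors,
                      const ToyUBInfo &UB, bool &UsedAssumedInformation) {
  std::vector<const Inst *> Kept;
  Kept.reserve(AliveSuccessors.size());
  for (const Inst *Succ : AliveSuccessors) {
    if (!UB.isAssumedToCauseUB(Succ)) {
      Kept.push_back(Succ);
      continue;
    }
    // Dropped: if UB is not also known, the decision may still change.
    if (!UB.isKnownToCauseUB(Succ))
      UsedAssumedInformation = true;
  }
  return Kept;
}
```

Because the flag is only ever set, a `true` value produced earlier (for example by `identifyAliveSuccessors`) is preserved, unlike the `UsedAssumedInformation = Assumed;` assignment in the diff.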
if (AliveSuccessors.empty() ||		if (AliveSuccessors.empty() ||
(I->isTerminator() && AliveSuccessors.size() < I->getNumSuccessors()))		(I->isTerminator() && AliveSuccessors.size() < I->getNumSuccessors()))
KnownDeadEnds.insert(I);		KnownDeadEnds.insert(I);
}		}

LLVM_DEBUG(dbgs() << "[AAIsDead] #AliveSuccessors: "		LLVM_DEBUG(dbgs() << "[AAIsDead] #AliveSuccessors: "
<< AliveSuccessors.size() << " UsedAssumedInformation: "		<< AliveSuccessors.size() << " UsedAssumedInformation: "
<< UsedAssumedInformation << "\n");		<< UsedAssumedInformation << "\n");
[5,372 lines hidden]

Revision Contents

Diff 248798
