This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/IR/
-
IR/
12/47
SafepointIRVerifier.cpp
-
test/SafepointIRVerifier/
-
SafepointIRVerifier/
-
from-same-relocation-in-phi-nodes.ll
-
unrecorded-live-at-sp.ll
-
uses-in-phi-nodes.ll

Differential D41006

[SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned PHI nodes
ClosedPublic

Authored by DaniilSuchkov on Dec 8 2017, 6:27 AM.

Download Raw Diff

Details

Reviewers

anna
mkazantsev
reames

Commits

rGddb096853d00: [SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned…
rL321438: [SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned…

Summary

PHI that has at least one unrelocated input cannot cause any issues by itself,
though its uses should be carefully verified. With this patch PHIs are allowed
to have any inputs but when all inputs are unrelocated the PHI is marked as
unrelocated and if not all inputs are unrelocated then the PHI is marked as
poisoned. Poisoned pointers can be used only in three ways: to derive new
pointers, in PHIs or in comparisons against constants that are exclusively
derived from null.

Diff Detail

Event Timeline

DaniilSuchkov created this revision.Dec 8 2017, 6:27 AM

DaniilSuchkov added a parent revision: D40885: [NFC] Refactor SafepointIRVerifier.

mkazantsev added inline comments.Dec 9 2017, 1:36 AM

lib/IR/SafepointIRVerifier.cpp
246	Please clarify what is "a pointer derived from null". For example, is `select %cond, null, %some` derived from null? I think what you mean here is something like "... or against a constant pointer".
247	How about "poisoned value is a value which is derived from both relocated and unrelocated values, or from another poisoned values"?
250	You can always represent any constant pointer as `gep null, %some_int_constant`, so I think that this "exclusively derived from null" stuff is redundant.
255	You can merge that free into `P + Any = P` if it makes sense.
260	Maybe instead of using the term "merge pointers" stick to the term "derived pointer"? You use both, and I don't catch what is the difference between them.
261	Maybe "A pointer derived from X and constant has the same type as X"? You could also include it into the rules above.
301	as long as they are only used in safe instructions
538	How about `"Removing unrelocated" << I`... and below, `"Removing poisoned " << I`?
670	"exclusively derived null pointer" -> "constant pointer"?

DaniilSuchkov added inline comments.Dec 9 2017, 5:18 AM

lib/IR/SafepointIRVerifier.cpp
246	Actually I've missed one word. It should be "derived _exclusively_ from null".
247	It's about origination of poisoned values.
250	It refers to comment at line 639.
255	I'd like to keep it this way.
260	By "deriving" I mean f(x) -> x, and by merge f(x1, ..., xN) -> x.
301	The idea is "_This_ instructions don't need verification. But nothing is said about their uses."
538	It was intentional but I can't remember the reason so I'll fix it.
670	Not all constant pointers are derived from null but anyway you spotted a typo, thank you.

mkazantsev added inline comments.Dec 11 2017, 4:03 PM

lib/IR/SafepointIRVerifier.cpp
247	Even if it was like that, it has nothing to do with the rules below, since you don't explain where rules 2-4 come from. You only define how poison first appears from merging of U and R, but don't say how it's handled after that.
250	I'm not asking to do something about it in this patch since it was there before, but it is fishy. If I can imagine a VM in which 0xFF is some special magical pointer that cannot be simply compared against normal pointers, then I can also imagine a VM where `gep null, 0xFF` is also some special magical pointer with same properties. Actually, I can define all such special numbers as derivatives from null. From that perspective, how hard-coded constants are different from hard-coded offsets from null?
260	I guess "deriving" is actually f(x1) -> x. And what you call "deriving" is just a particular case of "merging". Again, why do we need both?
301	Ok, makes sense.

Comments slightly clarified.

lib/IR/SafepointIRVerifier.cpp
247	This part is supposed to be a brief description. How it's handled is described bellow.
250	I don't know either, but the idea is to keep this patch consistent with previous code. So I have to maintain the logic around "magic pointer constant". This patch is not about that issue, let's discuss it later.
260	Because for deriving there is only one rule: it changes nothing and for merge everything is a bit more complicated. Both "gep/bitcast is merge" and "phi/select is deriving" looks misleading. I agree that formally f(x1) -> x is a particular case of f(x1, ..., xN) -> x, but how to name it so that it won't be confusing?

anna added inline comments.Dec 15 2017, 7:18 AM

lib/IR/SafepointIRVerifier.cpp
250	I'll try to clarify this. The GC relocates the base pointer and this is why we record the base pointer for every 'derived pointer'. After the GC relocates the base pointer at runtime, we can rematerialize the derived pointer because we have stored this information in the IR. So, effectively it comes down to always identifying the "base" of a derived pointer. This is where the `getBaseType` comes in. When we generate "magic const pointers" in IR (for example, using inttoptr `magic const`), the base here is that magic const. The same idiom is `GEP(null, magic const)`, but here the base pointer is null. Relocating a null is still a null. So, this is why we have something like this: %ptr = unrelocated non constant pointer compare (%ptr, inttoptr(magic_const)) <-- can't be reordered before a safepoint but: compare(%ptr, GEP(null, magic_const)) <-- can be reordered before a safepoint Also, just as an aside, this is also why inttoptr of addrspace(1) is incorrect in the IRVerifier, but GEP(null, offset) in addrspace(1) is fine.

Comments inline.

lib/IR/SafepointIRVerifier.cpp
242	I don't think we want to introduce one more term 'poisoned' here. Specifically, poisoning has different meanings in the optimizer (and sometimes in the GC), and can be confusing. It looks like `poisoned` pointers are just derived pointers which will be lexically from multiple pointers. So something like: `gep, bitcasts` -> derived pointer from one base `phis, selects` -> derived pointers that are lexically from multiple base pointers (I say lexically, because we can have phis/selects that statically have derived from exactly one pointer). Do we really need to make this distinction? I think it's more confusing. Pls see comment below on naming to make clearer.
260	I tend to agree with Max here. We really cannot distinguish between both. How about we focus just on the fact that we have multiple sources here? IOW, don't worry about GEPs and bitcasts because they have single source base. These GEPs and bitcasts don't change the behaviour of unrelocated/relocated, so they shouldn;t affect this discusson.
262	As mentioned, I dont think you need to explicitly state out the distinction here. Just maybe a single line at the beginning when explaining `MultiSourceDerivedPtr` (I prefer that instead of `Poisoned` naming), because we can have multiple sources and still not be poisoned.
267	Nit: predecessor
279	This is incorrect IR. We cannot have multiple different incoming phi values from exactly one predecessor. With correct IR, I don't think we will have such false positives. If we do have, could you please add a test with FIXME?

This revision now requires changes to proceed.Dec 15 2017, 9:33 AM

mkazantsev added inline comments.Dec 15 2017, 1:12 PM

lib/IR/SafepointIRVerifier.cpp
242	Yes, this may be confusing with poisoned pointers that are also in GC. I don't have a better idea for its name on top of my head, though. I'm OK to go with whatever you agree with.
250	Ok, this makes sense to me.
260	I think that "deriving" is a good name for both, because formally you apply some function `f` (where `f` can be `gep, phi, bitcast` or whatever) to a number of pointer arguments and have a new pointer (derived one) as result. It is unimportant -how- exactly you derived your pointer from your argument(s). You never fail verification while deriving. You can only fail verification when you misuse the derived pointer. And this is the important part.
279	I don't get why it is incorrect. For example, void p = &a[10]; void p1 = &p[20]; void temp = p; if (cond) { temp = p1; some_call(); } void p2 = temp; Won't we have exactly this IR here?

anna added inline comments.Dec 15 2017, 7:11 PM

lib/IR/SafepointIRVerifier.cpp
279	yup, you're right. What we have is different incoming values from different incoming blocks, something like: `p2 = phi [p, def BB of p], [p1, safepoint block]` What's incorrect is, different incoming values from the same block. For example, p and p1 from their same def block: `p2 = phi [p, def BB of p] [p1, def BB of p1]`.

DaniilSuchkov added inline comments.Dec 19 2017, 3:26 AM

lib/IR/SafepointIRVerifier.cpp
262	We don't need `MultiSourceDerivedPtr` because we don't care at all from how many sources a value was derived, but we do care if all sources were (un)relocated or not. Value which was derived from multiple sources can be in any of three possible states (relocated, unrelocated, poisoned).
279	To avoid confusion I'll make this comment a bit more clear.

"merge" now replaced with "derive" in comments, example in FIXME become a bit less confusing, added new test (with XFAIL) for that FIXME.

Logic looks right, some comments inline.

lib/IR/SafepointIRVerifier.cpp

409

Pls add a comment here on why the poisoned defs are skipped. ValidUnrelocatedDefs are obvious.

485

Don't we need to handle selects and identify poisoned versus unrelocated?

495

Instead of the below code (logic is right, but too many conditionals), could we do something like this:

if (isNotExclusivelyConstantDerived(InValue)) {
  if (isValuePoisoned(InValue) || (HasRelocatedInputs &&  HasUnrelocatedInputs)) {
     PoisonedPointerDef = true;
     break;
   }
   if (BlockMap[InBB]->AvailableOut.count(InValue))
              HasRelocatedInputs = true;
    else
              HasUnrelocatedInputs = true;
}
if (!PoisonedPointerDef && HasUnrelocatedInputs) {
   assert(!HasRelocatedInputs && "Should be poisoned!");
   ValidUnrelocatedPointerDef = true;
}

This revision now requires changes to proceed.Dec 21 2017, 9:09 AM

DaniilSuchkov added inline comments.Dec 21 2017, 9:43 AM

lib/IR/SafepointIRVerifier.cpp
409	You think I should repeat myself here? From comments above (where 'poisoned' pointers are introduced) it's clear why this defs are skipped.
485	I'll do it in the next patch (in order to keep this one not too huge).
495	But we still have to handle case when some inputs are relocated and some are not: your code won't work if the _last_ input makes this phi poisoned (because on each iteration it checks flags that might have changed on previous one). Thus we cannot remove this part: if (HasUnrelocatedInputs) { if (HasRelocatedInputs) PoisonedPointerDef = true; else ValidUnrelocatedPointerDef = true; } So the only thing that might be changed is this part: if (isValuePoisoned(InValue)) { // If any of inputs is poisoned, output is always poisoned too. HasRelocatedInputs = true; HasUnrelocatedInputs = true; break; } But if we'll change it to if (isValuePoisoned(InValue)) { // If any of inputs is poisoned, output is always poisoned too. PoisonedPointerDef = true; break; } We'll have to somehow tell that `if` (mentioned before) not to touch this flag and it'll be even worse. And that `assert` shouldn't be moved to PHI's branch, it's about all instructions. Currently this part is pretty clear and straightforward: loop over phi's inputs sets two flags (that have clear meaning) and after that loop another two flags are changed accordingly.

lgtm w/ comment addressed.

lib/IR/SafepointIRVerifier.cpp
409	Are all poisoned defs removed? Only valid ones right.
485	pls add a TODO then.
495	ah yes, it wont work for the last case.

This revision is now accepted and ready to land.Dec 21 2017, 9:50 AM

anna added inline comments.Dec 21 2017, 9:53 AM

lib/IR/SafepointIRVerifier.cpp
495	also, pls point out as a comment where these rules come from (i.e. refer to header for reasoning of these rules).

Added some new comments.

Closed by commit rL321438: [SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned… (authored by mkazantsev). · Explain WhyDec 25 2017, 1:36 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

lib/

IR/

SafepointIRVerifier.cpp

156 lines

test/

SafepointIRVerifier/

from-same-relocation-in-phi-nodes.ll

26 lines

unrecorded-live-at-sp.ll

5 lines

uses-in-phi-nodes.ll

106 lines

Diff 127992

lib/IR/SafepointIRVerifier.cpp

Show First 20 Lines • Show All 231 Lines • ▼ Show 20 Lines
}		}

namespace {		namespace {
class InstructionVerifier;		class InstructionVerifier;

/// Builds BasicBlockState for each BB of the function.		/// Builds BasicBlockState for each BB of the function.
/// It can traverse function for verification and provides all required		/// It can traverse function for verification and provides all required
/// information.		/// information.
		///
		/// GC pointer may be in one of three states: relocated, unrelocated and
		/// poisoned.
		annaUnsubmitted Not Done Reply Inline Actions I don't think we want to introduce one more term 'poisoned' here. Specifically, poisoning has different meanings in the optimizer (and sometimes in the GC), and can be confusing. It looks like `poisoned` pointers are just derived pointers which will be lexically from multiple pointers. So something like: `gep, bitcasts` -> derived pointer from one base `phis, selects` -> derived pointers that are lexically from multiple base pointers (I say lexically, because we can have phis/selects that statically have derived from exactly one pointer). Do we really need to make this distinction? I think it's more confusing. Pls see comment below on naming to make clearer. anna: I don't think we want to introduce one more term 'poisoned' here. Specifically, poisoning has…
		mkazantsevUnsubmitted Not Done Reply Inline Actions Yes, this may be confusing with poisoned pointers that are also in GC. I don't have a better idea for its name on top of my head, though. I'm OK to go with whatever you agree with. mkazantsev: Yes, this may be confusing with poisoned pointers that are also in GC. I don't have a better…
		/// Relocated pointer may be used without any restrictions.
		/// Unrelocated pointer cannot be dereferenced, passed as argument to any call
		/// or returned. Unrelocated pointer may be safely compared against another
		/// unrelocated pointer or against a pointer exclusively derived from null.
		mkazantsevUnsubmitted Not Done Reply Inline Actions Please clarify what is "a pointer derived from null". For example, is `select %cond, null, %some` derived from null? I think what you mean here is something like "... or against a constant pointer". mkazantsev: Please clarify what is "a pointer derived from null". For example, is `select %cond, null…
		DaniilSuchkovAuthorUnsubmitted Done Reply Inline Actions Actually I've missed one word. It should be "derived _exclusively_ from null". DaniilSuchkov: Actually I've missed one word. It should be "derived _exclusively_ from null".
		/// Poisoned pointers are produced when we somehow derive pointer from relocated
		mkazantsevUnsubmitted Not Done Reply Inline Actions How about "poisoned value is a value which is derived from both relocated and unrelocated values, or from another poisoned values"? mkazantsev: How about "poisoned value is a value which is derived from both relocated and unrelocated…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions It's about origination of poisoned values. DaniilSuchkov: It's about origination of poisoned values.
		mkazantsevUnsubmitted Not Done Reply Inline Actions Even if it was like that, it has nothing to do with the rules below, since you don't explain where rules 2-4 come from. You only define how poison first appears from merging of U and R, but don't say how it's handled after that. mkazantsev: Even if it was like that, it has nothing to do with the rules below, since you don't explain…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions This part is supposed to be a brief description. How it's handled is described bellow. DaniilSuchkov: This part is supposed to be a brief description. How it's handled is described bellow.
		/// and unrelocated pointers (e.g. phi, select). This pointers may be safely
		/// used in a very limited number of situations. Currently the only way to use
		/// it is comparison against constant exclusively derived from null. All
		mkazantsevUnsubmitted Not Done Reply Inline Actions You can always represent any constant pointer as `gep null, %some_int_constant`, so I think that this "exclusively derived from null" stuff is redundant. mkazantsev: You can always represent any constant pointer as `gep null, %some_int_constant`, so I think…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions It refers to comment at line 639. DaniilSuchkov: It refers to comment at line 639.
		mkazantsevUnsubmitted Not Done Reply Inline Actions I'm not asking to do something about it in this patch since it was there before, but it is fishy. If I can imagine a VM in which 0xFF is some special magical pointer that cannot be simply compared against normal pointers, then I can also imagine a VM where `gep null, 0xFF` is also some special magical pointer with same properties. Actually, I can define all such special numbers as derivatives from null. From that perspective, how hard-coded constants are different from hard-coded offsets from null? mkazantsev: I'm not asking to do something about it in this patch since it was there before, but it is…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions I don't know either, but the idea is to keep this patch consistent with previous code. So I have to maintain the logic around "magic pointer constant". This patch is not about that issue, let's discuss it later. DaniilSuchkov: I don't know either, but the idea is to keep this patch consistent with previous code. So I…
		annaUnsubmitted Not Done Reply Inline Actions I'll try to clarify this. The GC relocates the base pointer and this is why we record the base pointer for every 'derived pointer'. After the GC relocates the base pointer at runtime, we can rematerialize the derived pointer because we have stored this information in the IR. So, effectively it comes down to always identifying the "base" of a derived pointer. This is where the `getBaseType` comes in. When we generate "magic const pointers" in IR (for example, using inttoptr `magic const`), the base here is that magic const. The same idiom is `GEP(null, magic const)`, but here the base pointer is null. Relocating a null is still a null. So, this is why we have something like this: %ptr = unrelocated non constant pointer compare (%ptr, inttoptr(magic_const)) <-- can't be reordered before a safepoint but: compare(%ptr, GEP(null, magic_const)) <-- can be reordered before a safepoint Also, just as an aside, this is also why inttoptr of addrspace(1) is incorrect in the IRVerifier, but GEP(null, offset) in addrspace(1) is fine. anna: I'll try to clarify this. The GC relocates the base pointer and this is why we record the…
		mkazantsevUnsubmitted Not Done Reply Inline Actions Ok, this makes sense to me. mkazantsev: Ok, this makes sense to me.
		/// limitations arise due to their undefined state: this pointers should be
		/// treated as relocated and unrelocated simultaneously.
		/// Rules of deriving:
		/// R + U = P - that's where the poisoned pointers come from
		/// P + X = P
		mkazantsevUnsubmitted Done Reply Inline Actions You can merge that free into `P + Any = P` if it makes sense. mkazantsev: You can merge that free into `P + Any = P` if it makes sense.
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions I'd like to keep it this way. DaniilSuchkov: I'd like to keep it this way.
		/// U + U = U
		/// R + R = R
		/// X + C = X
		/// Where "+" - any operation that somehow derive pointer, U - unrelocated,
		/// R - relocated and P - poisoned, C - constant, X - U or R or P or C or
		mkazantsevUnsubmitted Not Done Reply Inline Actions Maybe instead of using the term "merge pointers" stick to the term "derived pointer"? You use both, and I don't catch what is the difference between them. mkazantsev: Maybe instead of using the term "merge pointers" stick to the term "derived pointer"? You use…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions By "deriving" I mean f(x) -> x, and by merge f(x1, ..., xN) -> x. DaniilSuchkov: By "deriving" I mean f(x) -> x, and by merge f(x1, ..., xN) -> x.
		mkazantsevUnsubmitted Not Done Reply Inline Actions I guess "deriving" is actually f(x1) -> x. And what you call "deriving" is just a particular case of "merging". Again, why do we need both? mkazantsev: I guess "deriving" is actually f(x1) -> x. And what you call "deriving" is just a particular…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions Because for deriving there is only one rule: it changes nothing and for merge everything is a bit more complicated. Both "gep/bitcast is merge" and "phi/select is deriving" looks misleading. I agree that formally f(x1) -> x is a particular case of f(x1, ..., xN) -> x, but how to name it so that it won't be confusing? DaniilSuchkov: Because for deriving there is only one rule: it changes nothing and for merge everything is a…
		annaUnsubmitted Done Reply Inline Actions I tend to agree with Max here. We really cannot distinguish between both. How about we focus just on the fact that we have multiple sources here? IOW, don't worry about GEPs and bitcasts because they have single source base. These GEPs and bitcasts don't change the behaviour of unrelocated/relocated, so they shouldn;t affect this discusson. anna: I tend to agree with Max here. We really cannot distinguish between both. How about we focus…
		mkazantsevUnsubmitted Not Done Reply Inline Actions I think that "deriving" is a good name for both, because formally you apply some function `f` (where `f` can be `gep, phi, bitcast` or whatever) to a number of pointer arguments and have a new pointer (derived one) as result. It is unimportant -how- exactly you derived your pointer from your argument(s). You never fail verification while deriving. You can only fail verification when you misuse the derived pointer. And this is the important part. mkazantsev: I think that "deriving" is a good name for both, because formally you apply some function `f`…
		/// nothing (in case when "+" is unary operation).
		mkazantsevUnsubmitted Done Reply Inline Actions Maybe "A pointer derived from X and constant has the same type as X"? You could also include it into the rules above. mkazantsev: Maybe "A pointer derived from X and constant has the same type as X"? You could also include it…
		/// Deriving of pointers by itself is always safe.
		annaUnsubmitted Not Done Reply Inline Actions As mentioned, I dont think you need to explicitly state out the distinction here. Just maybe a single line at the beginning when explaining `MultiSourceDerivedPtr` (I prefer that instead of `Poisoned` naming), because we can have multiple sources and still not be poisoned. anna: As mentioned, I dont think you need to explicitly state out the distinction here. Just maybe a…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions We don't need `MultiSourceDerivedPtr` because we don't care at all from how many sources a value was derived, but we do care if all sources were (un)relocated or not. Value which was derived from multiple sources can be in any of three possible states (relocated, unrelocated, poisoned). DaniilSuchkov: We don't need `MultiSourceDerivedPtr` because we don't care at all from how many sources a…
		/// NOTE: when we are making decision on the status of instruction's result:
		/// a) for phi we need to check status of each input *at the end of
		/// corresponding predecessor BB*.
		/// b) for other instructions we need to check status of each input *at the
		/// current point*.
		annaUnsubmitted Done Reply Inline Actions Nit: predecessor anna: Nit: predecessor
		///
		/// FIXME: This works fairly well except one case
		/// bb1:
		/// p = some GC-ptr def
		/// p1 = gep p, offset
		/// / \|
		/// / \|
		/// bb2: \|
		/// safepoint \|
		/// \ \|
		/// \ \|
		/// bb3:
		annaUnsubmitted Done Reply Inline Actions This is incorrect IR. We cannot have multiple different incoming phi values from exactly one predecessor. With correct IR, I don't think we will have such false positives. If we do have, could you please add a test with FIXME? anna: This is incorrect IR. We cannot have multiple different incoming phi values from exactly one…
		mkazantsevUnsubmitted Not Done Reply Inline Actions I don't get why it is incorrect. For example, void p = &a[10]; void p1 = &p[20]; void temp = p; if (cond) { temp = p1; some_call(); } void p2 = temp; Won't we have exactly this IR here? mkazantsev: I don't get why it is incorrect. For example, void p = &a[10]; void p1 = &p[20]; void…
		annaUnsubmitted Not Done Reply Inline Actions yup, you're right. What we have is different incoming values from different incoming blocks, something like: `p2 = phi [p, def BB of p], [p1, safepoint block]` What's incorrect is, different incoming values from the same block. For example, p and p1 from their same def block: `p2 = phi [p, def BB of p] [p1, def BB of p1]`. anna: yup, you're right. What we have is different incoming values from different incoming blocks…
		DaniilSuchkovAuthorUnsubmitted Done Reply Inline Actions To avoid confusion I'll make this comment a bit more clear. DaniilSuchkov: To avoid confusion I'll make this comment a bit more clear.
		/// p2 = phi [p, bb2] [p1, bb1]
		/// p3 = phi [p, bb2] [p, bb1]
		/// here p and p1 is unrelocated
		/// p2 and p3 is poisoned (though they shouldn't be)
		///
		/// This leads to some weird results:
		/// cmp eq p, p2 - illegal instruction (false-positive)
		/// cmp eq p1, p2 - illegal instruction (false-positive)
		/// cmp eq p, p3 - illegal instruction (false-positive)
		/// cmp eq p, p1 - ok
		/// To fix this we need to introduce conception of generations and be able to
		/// check if two values belong to one generation or not. This way p2 will be
		/// considered to be unrelocated and no false alarm will happen.
class GCPtrTracker {		class GCPtrTracker {
const Function &F;		const Function &F;
SpecificBumpPtrAllocator<BasicBlockState> BSAllocator;		SpecificBumpPtrAllocator<BasicBlockState> BSAllocator;
DenseMap<const BasicBlock , BasicBlockState > BlockMap;		DenseMap<const BasicBlock , BasicBlockState > BlockMap;
// This set contains defs of unrelocated pointers that are proved to be legal		// This set contains defs of unrelocated pointers that are proved to be legal
// and don't need verification.		// and don't need verification.
DenseSet<const Instruction *> ValidUnrelocatedDefs;		DenseSet<const Instruction *> ValidUnrelocatedDefs;
		// This set contains poisoned defs. They can be safely ignored during
		// verification too.
		mkazantsevUnsubmitted Not Done Reply Inline Actions as long as they are only used in safe instructions mkazantsev: as long as they are only used in safe instructions
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions The idea is "_This_ instructions don't need verification. But nothing is said about their uses." DaniilSuchkov: The idea is "_This_ instructions don't need verification. But nothing is said about their uses."
		mkazantsevUnsubmitted Not Done Reply Inline Actions Ok, makes sense. mkazantsev: Ok, makes sense.
		DenseSet<const Value *> PoisonedDefs;

public:		public:
GCPtrTracker(const Function &F, const DominatorTree &DT);		GCPtrTracker(const Function &F, const DominatorTree &DT);

BasicBlockState getBasicBlockState(const BasicBlock BB);		BasicBlockState getBasicBlockState(const BasicBlock BB);
const BasicBlockState getBasicBlockState(const BasicBlock BB) const;		const BasicBlockState getBasicBlockState(const BasicBlock BB) const;

		bool isValuePoisoned(const Value *V) const { return PoisonedDefs.count(V); }

/// Traverse each BB of the function and call		/// Traverse each BB of the function and call
/// InstructionVerifier::verifyInstruction for each possibly invalid		/// InstructionVerifier::verifyInstruction for each possibly invalid
/// instruction.		/// instruction.
/// It destructively modifies GCPtrTracker so it's passed via rvalue reference		/// It destructively modifies GCPtrTracker so it's passed via rvalue reference
/// in order to prohibit further usages of GCPtrTracker as it'll be in		/// in order to prohibit further usages of GCPtrTracker as it'll be in
/// inconsistent state.		/// inconsistent state.
static void verifyFunction(GCPtrTracker &&Tracker,		static void verifyFunction(GCPtrTracker &&Tracker,
InstructionVerifier &Verifier);		InstructionVerifier &Verifier);
▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	BasicBlockState GCPtrTracker::getBasicBlockState(const BasicBlock BB) {
return it->second;		return it->second;
}		}

const BasicBlockState *GCPtrTracker::getBasicBlockState(		const BasicBlockState *GCPtrTracker::getBasicBlockState(
const BasicBlock *BB) const {		const BasicBlock *BB) const {
return const_cast<GCPtrTracker *>(this)->getBasicBlockState(BB);		return const_cast<GCPtrTracker *>(this)->getBasicBlockState(BB);
}		}

bool GCPtrTracker::instructionMayBeSkipped(const Instruction *I) const {		bool GCPtrTracker::instructionMayBeSkipped(const Instruction *I) const {
		annaUnsubmitted Done Reply Inline Actions Pls add a comment here on why the poisoned defs are skipped. `ValidUnrelocatedDefs` are obvious. anna: Pls add a comment here on why the poisoned defs are skipped. `ValidUnrelocatedDefs` are obvious.
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions You think I should repeat myself here? From comments above (where 'poisoned' pointers are introduced) it's clear why this defs are skipped. DaniilSuchkov: You think I should repeat myself here? From comments above (where 'poisoned' pointers are…
		annaUnsubmitted Not Done Reply Inline Actions Are all poisoned defs removed? Only valid ones right. anna: Are all poisoned defs removed? Only valid ones right.
return ValidUnrelocatedDefs.count(I);		// Poisoned defs are skipped since they are always safe by itself by
		// definition (for details see comment to this class).
		return ValidUnrelocatedDefs.count(I) \|\| PoisonedDefs.count(I);
}		}

void GCPtrTracker::verifyFunction(GCPtrTracker &&Tracker,		void GCPtrTracker::verifyFunction(GCPtrTracker &&Tracker,
InstructionVerifier &Verifier) {		InstructionVerifier &Verifier) {
// We need RPO here to a) report always the first error b) report errors in		// We need RPO here to a) report always the first error b) report errors in
// same order from run to run.		// same order from run to run.
ReversePostOrderTraversal<const Function *> RPOT(&Tracker.F);		ReversePostOrderTraversal<const Function *> RPOT(&Tracker.F);
for (const BasicBlock *BB : RPOT) {		for (const BasicBlock *BB : RPOT) {
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

bool GCPtrTracker::removeValidUnrelocatedDefs(const BasicBlock *BB,		bool GCPtrTracker::removeValidUnrelocatedDefs(const BasicBlock *BB,
const BasicBlockState *BBS,		const BasicBlockState *BBS,
AvailableValueSet &Contribution) {		AvailableValueSet &Contribution) {
assert(&BBS->Contribution == &Contribution &&		assert(&BBS->Contribution == &Contribution &&
"Passed Contribution should be from the passed BasicBlockState!");		"Passed Contribution should be from the passed BasicBlockState!");
AvailableValueSet AvailableSet = BBS->AvailableIn;		AvailableValueSet AvailableSet = BBS->AvailableIn;
bool ContributionChanged = false;		bool ContributionChanged = false;
		// For explanation why instructions are processed this way see
		// "Rules of deriving" in the comment to this class.
for (const Instruction &I : *BB) {		for (const Instruction &I : *BB) {
bool ProducesUnrelocatedPointer = false;		bool ValidUnrelocatedPointerDef = false;
if ((isa<GetElementPtrInst>(I) \|\| isa<BitCastInst>(I)) &&		bool PoisonedPointerDef = false;
		annaUnsubmitted Not Done Reply Inline Actions Don't we need to handle selects and identify poisoned versus unrelocated? anna: Don't we need to handle selects and identify poisoned versus unrelocated?
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions I'll do it in the next patch (in order to keep this one not too huge). DaniilSuchkov: I'll do it in the next patch (in order to keep this one not too huge).
		annaUnsubmitted Done Reply Inline Actions pls add a TODO then. anna: pls add a TODO then.
		// TODO: `select` instructions should be handled here too.
		if (const PHINode *PN = dyn_cast<PHINode>(&I)) {
		if (containsGCPtrType(PN->getType())) {
		// If both is true, output is poisoned.
		bool HasRelocatedInputs = false;
		bool HasUnrelocatedInputs = false;
		for (unsigned i = 0, e = PN->getNumIncomingValues(); i != e; ++i) {
		const BasicBlock *InBB = PN->getIncomingBlock(i);
		const Value *InValue = PN->getIncomingValue(i);

		annaUnsubmitted Not Done Reply Inline Actions Instead of the below code (logic is right, but too many conditionals), could we do something like this: if (isNotExclusivelyConstantDerived(InValue)) { if (isValuePoisoned(InValue) \|\| (HasRelocatedInputs && HasUnrelocatedInputs)) { PoisonedPointerDef = true; break; } if (BlockMap[InBB]->AvailableOut.count(InValue)) HasRelocatedInputs = true; else HasUnrelocatedInputs = true; } if (!PoisonedPointerDef && HasUnrelocatedInputs) { assert(!HasRelocatedInputs && "Should be poisoned!"); ValidUnrelocatedPointerDef = true; } anna: Instead of the below code (logic is right, but too many conditionals), could we do something…
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions But we still have to handle case when some inputs are relocated and some are not: your code won't work if the _last_ input makes this phi poisoned (because on each iteration it checks flags that might have changed on previous one). Thus we cannot remove this part: if (HasUnrelocatedInputs) { if (HasRelocatedInputs) PoisonedPointerDef = true; else ValidUnrelocatedPointerDef = true; } So the only thing that might be changed is this part: if (isValuePoisoned(InValue)) { // If any of inputs is poisoned, output is always poisoned too. HasRelocatedInputs = true; HasUnrelocatedInputs = true; break; } But if we'll change it to if (isValuePoisoned(InValue)) { // If any of inputs is poisoned, output is always poisoned too. PoisonedPointerDef = true; break; } We'll have to somehow tell that `if` (mentioned before) not to touch this flag and it'll be even worse. And that `assert` shouldn't be moved to PHI's branch, it's about all instructions. Currently this part is pretty clear and straightforward: loop over phi's inputs sets two flags (that have clear meaning) and after that loop another two flags are changed accordingly. DaniilSuchkov: But we still have to handle case when some inputs are relocated and some are not: your code…
		annaUnsubmitted Not Done Reply Inline Actions ah yes, it wont work for the last case. anna: ah yes, it wont work for the last case.
		annaUnsubmitted Done Reply Inline Actions also, pls point out as a comment where these rules come from (i.e. refer to header for reasoning of these rules). anna: also, pls point out as a comment where these rules come from (i.e. refer to header for…
		if (isNotExclusivelyConstantDerived(InValue)) {
		if (isValuePoisoned(InValue)) {
		// If any of inputs is poisoned, output is always poisoned too.
		HasRelocatedInputs = true;
		HasUnrelocatedInputs = true;
		break;
		}
		if (BlockMap[InBB]->AvailableOut.count(InValue))
		HasRelocatedInputs = true;
		else
		HasUnrelocatedInputs = true;
		}
		}
		if (HasUnrelocatedInputs) {
		if (HasRelocatedInputs)
		PoisonedPointerDef = true;
		else
		ValidUnrelocatedPointerDef = true;
		}
		}
		} else if ((isa<GetElementPtrInst>(I) \|\| isa<BitCastInst>(I)) &&
containsGCPtrType(I.getType())) {		containsGCPtrType(I.getType())) {
// GEP/bitcast of unrelocated pointer is legal by itself but this		// GEP/bitcast of unrelocated pointer is legal by itself but this def
// def shouldn't appear in any AvailableSet.		// shouldn't appear in any AvailableSet.
for (const Value *V : I.operands())		for (const Value *V : I.operands())
if (containsGCPtrType(V->getType()) &&		if (containsGCPtrType(V->getType()) &&
isNotExclusivelyConstantDerived(V) && !AvailableSet.count(V)) {		isNotExclusivelyConstantDerived(V) && !AvailableSet.count(V)) {
ProducesUnrelocatedPointer = true;		if (isValuePoisoned(V))
		PoisonedPointerDef = true;
		else
		ValidUnrelocatedPointerDef = true;
break;		break;
}		}
}		}
if (!ProducesUnrelocatedPointer) {		assert(!(ValidUnrelocatedPointerDef && PoisonedPointerDef) &&
bool Cleared = false;		"Value cannot be both unrelocated and poisoned!");
transferInstruction(I, Cleared, AvailableSet);		if (ValidUnrelocatedPointerDef) {
(void)Cleared;		// Remove def of unrelocated pointer from Contribution of this BB and
} else {		// trigger update of all its successors.
// Remove def of unrelocated pointer from Contribution of this BB
// and trigger update of all its successors.
Contribution.erase(&I);		Contribution.erase(&I);
		PoisonedDefs.erase(&I);
ValidUnrelocatedDefs.insert(&I);		ValidUnrelocatedDefs.insert(&I);
DEBUG(dbgs() << "Removing " << I << " from Contribution of "		DEBUG(dbgs() << "Removing urelocated " << I << " from Contribution of "
		mkazantsevUnsubmitted Done Reply Inline Actions How about `"Removing unrelocated" << I`... and below, `"Removing poisoned " << I`? mkazantsev: How about `"Removing unrelocated" << I`... and below, `"Removing poisoned " << I`?
		DaniilSuchkovAuthorUnsubmitted Not Done Reply Inline Actions It was intentional but I can't remember the reason so I'll fix it. DaniilSuchkov: It was intentional but I can't remember the reason so I'll fix it.
<< BB->getName() << "\n");		<< BB->getName() << "\n");
ContributionChanged = true;		ContributionChanged = true;
		} else if (PoisonedPointerDef) {
		// Mark pointer as poisoned, remove its def from Contribution and trigger
		// update of all successors.
		Contribution.erase(&I);
		PoisonedDefs.insert(&I);
		DEBUG(dbgs() << "Removing poisoned " << I << " from Contribution of "
		<< BB->getName() << "\n");
		ContributionChanged = true;
		} else {
		bool Cleared = false;
		transferInstruction(I, Cleared, AvailableSet);
		(void)Cleared;
}		}
}		}
return ContributionChanged;		return ContributionChanged;
}		}

void GCPtrTracker::gatherDominatingDefs(const BasicBlock *BB,		void GCPtrTracker::gatherDominatingDefs(const BasicBlock *BB,
AvailableValueSet &Result,		AvailableValueSet &Result,
const DominatorTree &DT) {		const DominatorTree &DT) {
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	void InstructionVerifier::verifyInstruction(
} else if (isa<CmpInst>(I) &&		} else if (isa<CmpInst>(I) &&
containsGCPtrType(I.getOperand(0)->getType())) {		containsGCPtrType(I.getOperand(0)->getType())) {
Value LHS = I.getOperand(0), RHS = I.getOperand(1);		Value LHS = I.getOperand(0), RHS = I.getOperand(1);
enum BaseType baseTyLHS = getBaseType(LHS),		enum BaseType baseTyLHS = getBaseType(LHS),
baseTyRHS = getBaseType(RHS);		baseTyRHS = getBaseType(RHS);

// Returns true if LHS and RHS are unrelocated pointers and they are		// Returns true if LHS and RHS are unrelocated pointers and they are
// valid unrelocated uses.		// valid unrelocated uses.
auto hasValidUnrelocatedUse = [&AvailableSet, baseTyLHS, baseTyRHS, &LHS,		auto hasValidUnrelocatedUse = [&AvailableSet, Tracker, baseTyLHS, baseTyRHS,
&RHS] () {		&LHS, &RHS] () {
// A cmp instruction has valid unrelocated pointer operands only if		// A cmp instruction has valid unrelocated pointer operands only if
// both operands are unrelocated pointers.		// both operands are unrelocated pointers.
// In the comparison between two pointers, if one is an unrelocated		// In the comparison between two pointers, if one is an unrelocated
// use, the other should be an unrelocated use, for this		// use, the other should be an unrelocated use, for this
// instruction to contain valid unrelocated uses. This unrelocated		// instruction to contain valid unrelocated uses. This unrelocated
// use can be a null constant as well, or another unrelocated		// use can be a null constant as well, or another unrelocated
// pointer.		// pointer.
if (AvailableSet.count(LHS) \|\| AvailableSet.count(RHS))		if (AvailableSet.count(LHS) \|\| AvailableSet.count(RHS))
return false;		return false;
// Constant pointers (that are not exclusively null) may have		// Constant pointers (that are not exclusively null) may have
// meaning in different VMs, so we cannot reorder the compare		// meaning in different VMs, so we cannot reorder the compare
// against constant pointers before the safepoint. In other words,		// against constant pointers before the safepoint. In other words,
// comparison of an unrelocated use against a non-null constant		// comparison of an unrelocated use against a non-null constant
// maybe invalid.		// maybe invalid.
if ((baseTyLHS == BaseType::ExclusivelySomeConstant &&		if ((baseTyLHS == BaseType::ExclusivelySomeConstant &&
baseTyRHS == BaseType::NonConstant) \|\|		baseTyRHS == BaseType::NonConstant) \|\|
(baseTyLHS == BaseType::NonConstant &&		(baseTyLHS == BaseType::NonConstant &&
baseTyRHS == BaseType::ExclusivelySomeConstant))		baseTyRHS == BaseType::ExclusivelySomeConstant))
return false;		return false;

		// If one of pointers is poisoned and other is not exclusively derived
		// from null it is an invalid expression: it produces poisoned result
		// and unless we want to track all defs (not only gc pointers) the only
		// option is to prohibit such instructions.
		if ((Tracker->isValuePoisoned(LHS) && baseTyRHS != ExclusivelyNull) \|\|
		(Tracker->isValuePoisoned(RHS) && baseTyLHS != ExclusivelyNull))
		return false;

// All other cases are valid cases enumerated below:		// All other cases are valid cases enumerated below:
// 1. Comparison between an exlusively derived null pointer and a		// 1. Comparison between an exclusively derived null pointer and a
// constant base pointer.		// constant base pointer.
// 2. Comparison between an exlusively derived null pointer and a		// 2. Comparison between an exclusively derived null pointer and a
// non-constant unrelocated base pointer.		// non-constant unrelocated base pointer.
// 3. Comparison between 2 unrelocated pointers.		// 3. Comparison between 2 unrelocated pointers.
		// 4. Comparison between a pointer exclusively derived from null and a
		mkazantsevUnsubmitted Not Done Reply Inline Actions "exclusively derived null pointer" -> "constant pointer"? mkazantsev: "exclusively derived null pointer" -> "constant pointer"?
		DaniilSuchkovAuthorUnsubmitted Done Reply Inline Actions Not all constant pointers are derived from null but anyway you spotted a typo, thank you. DaniilSuchkov: Not all constant pointers are derived from null but anyway you spotted a typo, thank you.
		// non-constant poisoned pointer.
return true;		return true;
};		};
if (!hasValidUnrelocatedUse()) {		if (!hasValidUnrelocatedUse()) {
// Print out all non-constant derived pointers that are unrelocated		// Print out all non-constant derived pointers that are unrelocated
// uses, which are invalid.		// uses, which are invalid.
if (baseTyLHS == BaseType::NonConstant && !AvailableSet.count(LHS))		if (baseTyLHS == BaseType::NonConstant && !AvailableSet.count(LHS))
reportInvalidUse(*LHS, I);		reportInvalidUse(*LHS, I);
if (baseTyRHS == BaseType::NonConstant && !AvailableSet.count(RHS))		if (baseTyRHS == BaseType::NonConstant && !AvailableSet.count(RHS))
Show All 38 Lines

test/SafepointIRVerifier/from-same-relocation-in-phi-nodes.ll

This file was added.

				; XFAIL: *
				; RUN: opt -safepoint-ir-verifier-print-only -verify-safepoint-ir -S %s 2>&1 \| FileCheck %s

				; In %merge %val.unrelocated, %ptr and %arg should be unrelocated.
				; FIXME: if this test fails it is a false-positive alarm. IR is correct.
				define void @test.unrelocated-phi.ok(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.unrelocated-phi.ok
				bci_0:
				%ptr = getelementptr i8, i8 addrspace(1)* %arg, i64 4
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
				br label %merge

				right:
				br label %merge

				merge:
				; CHECK: No illegal uses found by SafepointIRVerifier in: test.unrelocated-phi.ok
				%val.unrelocated = phi i8 addrspace(1)* [ %arg, %left ], [ %ptr, %right ]
				%c = icmp eq i8 addrspace(1)* %val.unrelocated, %arg
				ret void
				}

				declare token @llvm.experimental.gc.statepoint.p0f_isVoidf(i64, i32, void ()*, i32, i32, ...)

test/SafepointIRVerifier/unrecorded-live-at-sp.ll

	; RUN: opt %s -safepoint-ir-verifier-print-only -verify-safepoint-ir -S 2>&1 \| FileCheck %s			; RUN: opt %s -safepoint-ir-verifier-print-only -verify-safepoint-ir -S 2>&1 \| FileCheck %s

	; CHECK: Illegal use of unrelocated value found!			; CHECK: Illegal use of unrelocated value found!
	; CHECK-NEXT: Def: %base_phi3 = phi %jObject addrspace(1)* [ %obj609.relocated, %not_zero146 ], [ %base_phi2, %bci_37-aload ], !is_base_value !0			; CHECK-NEXT: Def: %base_phi4 = phi %jObject addrspace(1)* addrspace(1)* [ %addr98.relocated, %not_zero146 ], [ %cast6, %bci_37-aload ], !is_base_value !0
	; CHECK-NEXT: Use: %base_phi2 = phi %jObject addrspace(1)* [ %base_phi3, %not_zero179 ], [ %cast5, %bci_0 ], !is_base_value !0			; CHECK-NEXT: Use: %safepoint_token = tail call token (i64, i32, i32 (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_i32f(i64 0, i32 0, i32 () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 0, i32 0, i32 0, i32 0, %jObject addrspace(1)* %base_phi1, %jObject addrspace(1)* addrspace(1)* %base_phi4, %jObject addrspace(1)* addrspace(1)* %relocated4, %jObject addrspace(1)* %relocated7)


	%jObject = type { [8 x i8] }			%jObject = type { [8 x i8] }

	declare %jObject addrspace(1)* @generate_obj1() #1			declare %jObject addrspace(1)* @generate_obj1() #1

	declare %jObject addrspace(1)* addrspace(1)* @generate_obj2() #1			declare %jObject addrspace(1)* addrspace(1)* @generate_obj2() #1

	declare %jObject addrspace(1)* @generate_obj3() #1			declare %jObject addrspace(1)* @generate_obj3() #1
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

test/SafepointIRVerifier/uses-in-phi-nodes.ll

	; RUN: opt -safepoint-ir-verifier-print-only -verify-safepoint-ir -S %s 2>&1 \| FileCheck %s			; RUN: opt -safepoint-ir-verifier-print-only -verify-safepoint-ir -S %s 2>&1 \| FileCheck %s

	define i8 addrspace(1)* @test.not.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {			define i8 addrspace(1)* @test.not.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {
	; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.0			; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.0
	bci_0:			bci_0:
	br i1 undef, label %left, label %right			br i1 undef, label %left, label %right

	left:			left:
	%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)			%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
	br label %merge			br label %merge

	right:			right:
	br label %merge			br label %merge

	merge:			merge:
	; CHECK: Illegal use of unrelocated value found!			; CHECK: Illegal use of unrelocated value found!
	; CHECK-NEXT: Def: i8 addrspace(1)* %arg			; CHECK-NEXT: Def: %val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right ]
	; CHECK-NEXT: Use: %val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right ]			; CHECK-NEXT: Use: ret i8 addrspace(1)* %val
	%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right]			%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right ]
	ret i8 addrspace(1)* %val			ret i8 addrspace(1)* %val
	}			}

	define i8 addrspace(1)* @test.not.ok.1(i8 addrspace(1)* %arg) gc "statepoint-example" {			define i8 addrspace(1)* @test.not.ok.1(i8 addrspace(1)* %arg) gc "statepoint-example" {
	; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.1			; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.1
	bci_0:			bci_0:
	br i1 undef, label %left, label %right			br i1 undef, label %left, label %right

	left:			left:
	%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)			%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
	br label %merge			br label %merge

	right:			right:
	br label %merge			br label %merge

	merge:			merge:
	; CHECK: Illegal use of unrelocated value found!			; CHECK: Illegal use of unrelocated value found!
	; CHECK-NEXT: Def: i8 addrspace(1)* %arg			; CHECK-NEXT: Def: %val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]
	; CHECK-NEXT: Use: %val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]			; CHECK-NEXT: Use: ret i8 addrspace(1)* %val
	%val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right]			%val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]
	ret i8 addrspace(1)* %val			ret i8 addrspace(1)* %val
	}			}

	define i8 addrspace(1)* @test.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {			define i8 addrspace(1)* @test.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {
	; CHECK: No illegal uses found by SafepointIRVerifier in: test.ok.0			; CHECK: No illegal uses found by SafepointIRVerifier in: test.ok.0
	bci_0:			bci_0:
	br i1 undef, label %left, label %right			br i1 undef, label %left, label %right

	Show All 21 Lines
	right:			right:
	br label %merge			br label %merge

	merge:			merge:
	%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right]			%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right]
	ret i8 addrspace(1)* %val			ret i8 addrspace(1)* %val
	}			}

				; It should be allowed to compare poisoned ptr with null.
				define void @test.poisoned.cmp.ok(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.poisoned.cmp.ok
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token, i32 7, i32 7) ; arg, arg
				br label %merge

				right:
				%safepoint_token2 = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				br label %merge

				merge:
				; CHECK: No illegal uses found by SafepointIRVerifier in: test.poisoned.cmp.ok
				%val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				%c = icmp eq i8 addrspace(1)* %val.poisoned, null
				ret void
				}

				; It is illegal to compare poisoned ptr and relocated.
				define void @test.poisoned.cmp.fail.0(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.poisoned.cmp.fail.0
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token, i32 7, i32 7) ; arg, arg
				br label %merge

				right:
				%safepoint_token2 = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated2 = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token2, i32 7, i32 7) ; arg, arg
				br label %merge

				merge:
				; CHECK: Illegal use of unrelocated value found!
				; CHECK-NEXT: Def: %val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				; CHECK-NEXT: Use: %c = icmp eq i8 addrspace(1)* %val.poisoned, %val
				%val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				%val = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg.relocated2, %right ]
				%c = icmp eq i8 addrspace(1)* %val.poisoned, %val
				ret void
				}

				; It is illegal to compare poisoned ptr and unrelocated.
				define void @test.poisoned.cmp.fail.1(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.poisoned.cmp.fail.1
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token, i32 7, i32 7) ; arg, arg
				br label %merge

				right:
				%safepoint_token2 = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated2 = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token2, i32 7, i32 7) ; arg, arg
				br label %merge

				merge:
				; CHECK: Illegal use of unrelocated value found!
				; CHECK-NEXT: Def: %val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				; CHECK-NEXT: Use: %c = icmp eq i8 addrspace(1)* %val.poisoned, %arg
				%val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				%c = icmp eq i8 addrspace(1)* %val.poisoned, %arg
				ret void
				}

				; It should be allowed to compare unrelocated phi with unrelocated value.
				define void @test.unrelocated-phi.cmp.ok(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.unrelocated-phi.cmp.ok
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
				br label %merge

				right:
				br label %merge

				merge:
				; CHECK: No illegal uses found by SafepointIRVerifier in: test.unrelocated-phi.cmp.ok
				%val.unrelocated = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]
				%c = icmp eq i8 addrspace(1)* %val.unrelocated, %arg
				ret void
				}

	declare token @llvm.experimental.gc.statepoint.p0f_isVoidf(i64, i32, void ()*, i32, i32, ...)			declare token @llvm.experimental.gc.statepoint.p0f_isVoidf(i64, i32, void ()*, i32, i32, ...)
				declare i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token, i32, i32)
	declare void @not_statepoint()			declare void @not_statepoint()

This is an archive of the discontinued LLVM Phabricator instance.

[SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned PHI nodesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 127992

lib/IR/SafepointIRVerifier.cpp

test/SafepointIRVerifier/from-same-relocation-in-phi-nodes.ll

test/SafepointIRVerifier/unrecorded-live-at-sp.ll

test/SafepointIRVerifier/uses-in-phi-nodes.ll

[SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned PHI nodes
ClosedPublic