This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
lib/IR/
-
IR/
-
SafepointIRVerifier.cpp
-
test/SafepointIRVerifier/
-
SafepointIRVerifier/
-
from-same-relocation-in-phi-nodes.ll
-
unrecorded-live-at-sp.ll
-
uses-in-phi-nodes.ll

Differential D41006

[SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned PHI nodes
ClosedPublic

Authored by DaniilSuchkov on Dec 8 2017, 6:27 AM.

Download Raw Diff

Details

Reviewers

anna
mkazantsev
reames

Commits

rGddb096853d00: [SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned…
rL321438: [SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned…

Summary

PHI that has at least one unrelocated input cannot cause any issues by itself,
though its uses should be carefully verified. With this patch PHIs are allowed
to have any inputs but when all inputs are unrelocated the PHI is marked as
unrelocated and if not all inputs are unrelocated then the PHI is marked as
poisoned. Poisoned pointers can be used only in three ways: to derive new
pointers, in PHIs or in comparisons against constants that are exclusively
derived from null.

Diff Detail

Repository: rL LLVM

Event Timeline

DaniilSuchkov created this revision.Dec 8 2017, 6:27 AM

DaniilSuchkov added a parent revision: D40885: [NFC] Refactor SafepointIRVerifier.

mkazantsev added inline comments.Dec 9 2017, 1:36 AM

lib/IR/SafepointIRVerifier.cpp
246 ↗	(On Diff #126127)	Please clarify what is "a pointer derived from null". For example, is `select %cond, null, %some` derived from null? I think what you mean here is something like "... or against a constant pointer".
247 ↗	(On Diff #126127)	How about "poisoned value is a value which is derived from both relocated and unrelocated values, or from another poisoned values"?
250 ↗	(On Diff #126127)	You can always represent any constant pointer as `gep null, %some_int_constant`, so I think that this "exclusively derived from null" stuff is redundant.
255 ↗	(On Diff #126127)	You can merge that free into `P + Any = P` if it makes sense.
260 ↗	(On Diff #126127)	Maybe instead of using the term "merge pointers" stick to the term "derived pointer"? You use both, and I don't catch what is the difference between them.
261 ↗	(On Diff #126127)	Maybe "A pointer derived from X and constant has the same type as X"? You could also include it into the rules above.
300 ↗	(On Diff #126127)	as long as they are only used in safe instructions
532 ↗	(On Diff #126127)	How about `"Removing unrelocated" << I`... and below, `"Removing poisoned " << I`?
664 ↗	(On Diff #126127)	"exclusively derived null pointer" -> "constant pointer"?

DaniilSuchkov added inline comments.Dec 9 2017, 5:18 AM

lib/IR/SafepointIRVerifier.cpp
246 ↗	(On Diff #126127)	Actually I've missed one word. It should be "derived _exclusively_ from null".
247 ↗	(On Diff #126127)	It's about origination of poisoned values.
250 ↗	(On Diff #126127)	It refers to comment at line 639.
255 ↗	(On Diff #126127)	I'd like to keep it this way.
260 ↗	(On Diff #126127)	By "deriving" I mean f(x) -> x, and by merge f(x1, ..., xN) -> x.
300 ↗	(On Diff #126127)	The idea is "_This_ instructions don't need verification. But nothing is said about their uses."
532 ↗	(On Diff #126127)	It was intentional but I can't remember the reason so I'll fix it.
664 ↗	(On Diff #126127)	Not all constant pointers are derived from null but anyway you spotted a typo, thank you.

mkazantsev added inline comments.Dec 11 2017, 4:03 PM

lib/IR/SafepointIRVerifier.cpp
247 ↗	(On Diff #126127)	Even if it was like that, it has nothing to do with the rules below, since you don't explain where rules 2-4 come from. You only define how poison first appears from merging of U and R, but don't say how it's handled after that.
250 ↗	(On Diff #126127)	I'm not asking to do something about it in this patch since it was there before, but it is fishy. If I can imagine a VM in which 0xFF is some special magical pointer that cannot be simply compared against normal pointers, then I can also imagine a VM where `gep null, 0xFF` is also some special magical pointer with same properties. Actually, I can define all such special numbers as derivatives from null. From that perspective, how hard-coded constants are different from hard-coded offsets from null?
260 ↗	(On Diff #126127)	I guess "deriving" is actually f(x1) -> x. And what you call "deriving" is just a particular case of "merging". Again, why do we need both?
300 ↗	(On Diff #126127)	Ok, makes sense.

Comments slightly clarified.

lib/IR/SafepointIRVerifier.cpp
247 ↗	(On Diff #126127)	This part is supposed to be a brief description. How it's handled is described bellow.
250 ↗	(On Diff #126127)	I don't know either, but the idea is to keep this patch consistent with previous code. So I have to maintain the logic around "magic pointer constant". This patch is not about that issue, let's discuss it later.
260 ↗	(On Diff #126127)	Because for deriving there is only one rule: it changes nothing and for merge everything is a bit more complicated. Both "gep/bitcast is merge" and "phi/select is deriving" looks misleading. I agree that formally f(x1) -> x is a particular case of f(x1, ..., xN) -> x, but how to name it so that it won't be confusing?

anna added inline comments.Dec 15 2017, 7:18 AM

lib/IR/SafepointIRVerifier.cpp
250 ↗	(On Diff #126127)	I'll try to clarify this. The GC relocates the base pointer and this is why we record the base pointer for every 'derived pointer'. After the GC relocates the base pointer at runtime, we can rematerialize the derived pointer because we have stored this information in the IR. So, effectively it comes down to always identifying the "base" of a derived pointer. This is where the `getBaseType` comes in. When we generate "magic const pointers" in IR (for example, using inttoptr `magic const`), the base here is that magic const. The same idiom is `GEP(null, magic const)`, but here the base pointer is null. Relocating a null is still a null. So, this is why we have something like this: %ptr = unrelocated non constant pointer compare (%ptr, inttoptr(magic_const)) <-- can't be reordered before a safepoint but: compare(%ptr, GEP(null, magic_const)) <-- can be reordered before a safepoint Also, just as an aside, this is also why inttoptr of addrspace(1) is incorrect in the IRVerifier, but GEP(null, offset) in addrspace(1) is fine.

Comments inline.

lib/IR/SafepointIRVerifier.cpp
242 ↗	(On Diff #126501)	I don't think we want to introduce one more term 'poisoned' here. Specifically, poisoning has different meanings in the optimizer (and sometimes in the GC), and can be confusing. It looks like `poisoned` pointers are just derived pointers which will be lexically from multiple pointers. So something like: `gep, bitcasts` -> derived pointer from one base `phis, selects` -> derived pointers that are lexically from multiple base pointers (I say lexically, because we can have phis/selects that statically have derived from exactly one pointer). Do we really need to make this distinction? I think it's more confusing. Pls see comment below on naming to make clearer.
262 ↗	(On Diff #126501)	As mentioned, I dont think you need to explicitly state out the distinction here. Just maybe a single line at the beginning when explaining `MultiSourceDerivedPtr` (I prefer that instead of `Poisoned` naming), because we can have multiple sources and still not be poisoned.
267 ↗	(On Diff #126501)	Nit: predecessor
279 ↗	(On Diff #126501)	This is incorrect IR. We cannot have multiple different incoming phi values from exactly one predecessor. With correct IR, I don't think we will have such false positives. If we do have, could you please add a test with FIXME?
260 ↗	(On Diff #126127)	I tend to agree with Max here. We really cannot distinguish between both. How about we focus just on the fact that we have multiple sources here? IOW, don't worry about GEPs and bitcasts because they have single source base. These GEPs and bitcasts don't change the behaviour of unrelocated/relocated, so they shouldn;t affect this discusson.

This revision now requires changes to proceed.Dec 15 2017, 9:33 AM

mkazantsev added inline comments.Dec 15 2017, 1:12 PM

lib/IR/SafepointIRVerifier.cpp
242 ↗	(On Diff #126501)	Yes, this may be confusing with poisoned pointers that are also in GC. I don't have a better idea for its name on top of my head, though. I'm OK to go with whatever you agree with.
279 ↗	(On Diff #126501)	I don't get why it is incorrect. For example, void p = &a[10]; void p1 = &p[20]; void temp = p; if (cond) { temp = p1; some_call(); } void p2 = temp; Won't we have exactly this IR here?
250 ↗	(On Diff #126127)	Ok, this makes sense to me.
260 ↗	(On Diff #126127)	I think that "deriving" is a good name for both, because formally you apply some function `f` (where `f` can be `gep, phi, bitcast` or whatever) to a number of pointer arguments and have a new pointer (derived one) as result. It is unimportant -how- exactly you derived your pointer from your argument(s). You never fail verification while deriving. You can only fail verification when you misuse the derived pointer. And this is the important part.

anna added inline comments.Dec 15 2017, 7:11 PM

lib/IR/SafepointIRVerifier.cpp
279 ↗	(On Diff #126501)	yup, you're right. What we have is different incoming values from different incoming blocks, something like: `p2 = phi [p, def BB of p], [p1, safepoint block]` What's incorrect is, different incoming values from the same block. For example, p and p1 from their same def block: `p2 = phi [p, def BB of p] [p1, def BB of p1]`.

DaniilSuchkov added inline comments.Dec 19 2017, 3:26 AM

lib/IR/SafepointIRVerifier.cpp
262 ↗	(On Diff #126501)	We don't need `MultiSourceDerivedPtr` because we don't care at all from how many sources a value was derived, but we do care if all sources were (un)relocated or not. Value which was derived from multiple sources can be in any of three possible states (relocated, unrelocated, poisoned).
279 ↗	(On Diff #126501)	To avoid confusion I'll make this comment a bit more clear.

"merge" now replaced with "derive" in comments, example in FIXME become a bit less confusing, added new test (with XFAIL) for that FIXME.

Logic looks right, some comments inline.

lib/IR/SafepointIRVerifier.cpp

409 ↗

(On Diff #127496)

Pls add a comment here on why the poisoned defs are skipped. ValidUnrelocatedDefs are obvious.

481 ↗

(On Diff #127496)

Don't we need to handle selects and identify poisoned versus unrelocated?

491 ↗

(On Diff #127496)

Instead of the below code (logic is right, but too many conditionals), could we do something like this:

if (isNotExclusivelyConstantDerived(InValue)) {
  if (isValuePoisoned(InValue) || (HasRelocatedInputs &&  HasUnrelocatedInputs)) {
     PoisonedPointerDef = true;
     break;
   }
   if (BlockMap[InBB]->AvailableOut.count(InValue))
              HasRelocatedInputs = true;
    else
              HasUnrelocatedInputs = true;
}
if (!PoisonedPointerDef && HasUnrelocatedInputs) {
   assert(!HasRelocatedInputs && "Should be poisoned!");
   ValidUnrelocatedPointerDef = true;
}

This revision now requires changes to proceed.Dec 21 2017, 9:09 AM

DaniilSuchkov added inline comments.Dec 21 2017, 9:43 AM

lib/IR/SafepointIRVerifier.cpp
409 ↗	(On Diff #127496)	You think I should repeat myself here? From comments above (where 'poisoned' pointers are introduced) it's clear why this defs are skipped.
481 ↗	(On Diff #127496)	I'll do it in the next patch (in order to keep this one not too huge).
491 ↗	(On Diff #127496)	But we still have to handle case when some inputs are relocated and some are not: your code won't work if the _last_ input makes this phi poisoned (because on each iteration it checks flags that might have changed on previous one). Thus we cannot remove this part: if (HasUnrelocatedInputs) { if (HasRelocatedInputs) PoisonedPointerDef = true; else ValidUnrelocatedPointerDef = true; } So the only thing that might be changed is this part: if (isValuePoisoned(InValue)) { // If any of inputs is poisoned, output is always poisoned too. HasRelocatedInputs = true; HasUnrelocatedInputs = true; break; } But if we'll change it to if (isValuePoisoned(InValue)) { // If any of inputs is poisoned, output is always poisoned too. PoisonedPointerDef = true; break; } We'll have to somehow tell that `if` (mentioned before) not to touch this flag and it'll be even worse. And that `assert` shouldn't be moved to PHI's branch, it's about all instructions. Currently this part is pretty clear and straightforward: loop over phi's inputs sets two flags (that have clear meaning) and after that loop another two flags are changed accordingly.

lgtm w/ comment addressed.

lib/IR/SafepointIRVerifier.cpp
409 ↗	(On Diff #127496)	Are all poisoned defs removed? Only valid ones right.
481 ↗	(On Diff #127496)	pls add a TODO then.
491 ↗	(On Diff #127496)	ah yes, it wont work for the last case.

This revision is now accepted and ready to land.Dec 21 2017, 9:50 AM

anna added inline comments.Dec 21 2017, 9:53 AM

lib/IR/SafepointIRVerifier.cpp
491 ↗	(On Diff #127496)	also, pls point out as a comment where these rules come from (i.e. refer to header for reasoning of these rules).

Added some new comments.

Closed by commit rL321438: [SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned… (authored by mkazantsev). · Explain WhyDec 25 2017, 1:36 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

IR/

SafepointIRVerifier.cpp

156 lines

test/

SafepointIRVerifier/

from-same-relocation-in-phi-nodes.ll

26 lines

unrecorded-live-at-sp.ll

5 lines

uses-in-phi-nodes.ll

106 lines

Diff 128130

llvm/trunk/lib/IR/SafepointIRVerifier.cpp

Show First 20 Lines • Show All 231 Lines • ▼ Show 20 Lines
}		}

namespace {		namespace {
class InstructionVerifier;		class InstructionVerifier;

/// Builds BasicBlockState for each BB of the function.		/// Builds BasicBlockState for each BB of the function.
/// It can traverse function for verification and provides all required		/// It can traverse function for verification and provides all required
/// information.		/// information.
		///
		/// GC pointer may be in one of three states: relocated, unrelocated and
		/// poisoned.
		/// Relocated pointer may be used without any restrictions.
		/// Unrelocated pointer cannot be dereferenced, passed as argument to any call
		/// or returned. Unrelocated pointer may be safely compared against another
		/// unrelocated pointer or against a pointer exclusively derived from null.
		/// Poisoned pointers are produced when we somehow derive pointer from relocated
		/// and unrelocated pointers (e.g. phi, select). This pointers may be safely
		/// used in a very limited number of situations. Currently the only way to use
		/// it is comparison against constant exclusively derived from null. All
		/// limitations arise due to their undefined state: this pointers should be
		/// treated as relocated and unrelocated simultaneously.
		/// Rules of deriving:
		/// R + U = P - that's where the poisoned pointers come from
		/// P + X = P
		/// U + U = U
		/// R + R = R
		/// X + C = X
		/// Where "+" - any operation that somehow derive pointer, U - unrelocated,
		/// R - relocated and P - poisoned, C - constant, X - U or R or P or C or
		/// nothing (in case when "+" is unary operation).
		/// Deriving of pointers by itself is always safe.
		/// NOTE: when we are making decision on the status of instruction's result:
		/// a) for phi we need to check status of each input *at the end of
		/// corresponding predecessor BB*.
		/// b) for other instructions we need to check status of each input *at the
		/// current point*.
		///
		/// FIXME: This works fairly well except one case
		/// bb1:
		/// p = some GC-ptr def
		/// p1 = gep p, offset
		/// / \|
		/// / \|
		/// bb2: \|
		/// safepoint \|
		/// \ \|
		/// \ \|
		/// bb3:
		/// p2 = phi [p, bb2] [p1, bb1]
		/// p3 = phi [p, bb2] [p, bb1]
		/// here p and p1 is unrelocated
		/// p2 and p3 is poisoned (though they shouldn't be)
		///
		/// This leads to some weird results:
		/// cmp eq p, p2 - illegal instruction (false-positive)
		/// cmp eq p1, p2 - illegal instruction (false-positive)
		/// cmp eq p, p3 - illegal instruction (false-positive)
		/// cmp eq p, p1 - ok
		/// To fix this we need to introduce conception of generations and be able to
		/// check if two values belong to one generation or not. This way p2 will be
		/// considered to be unrelocated and no false alarm will happen.
class GCPtrTracker {		class GCPtrTracker {
const Function &F;		const Function &F;
SpecificBumpPtrAllocator<BasicBlockState> BSAllocator;		SpecificBumpPtrAllocator<BasicBlockState> BSAllocator;
DenseMap<const BasicBlock , BasicBlockState > BlockMap;		DenseMap<const BasicBlock , BasicBlockState > BlockMap;
// This set contains defs of unrelocated pointers that are proved to be legal		// This set contains defs of unrelocated pointers that are proved to be legal
// and don't need verification.		// and don't need verification.
DenseSet<const Instruction *> ValidUnrelocatedDefs;		DenseSet<const Instruction *> ValidUnrelocatedDefs;
		// This set contains poisoned defs. They can be safely ignored during
		// verification too.
		DenseSet<const Value *> PoisonedDefs;

public:		public:
GCPtrTracker(const Function &F, const DominatorTree &DT);		GCPtrTracker(const Function &F, const DominatorTree &DT);

BasicBlockState getBasicBlockState(const BasicBlock BB);		BasicBlockState getBasicBlockState(const BasicBlock BB);
const BasicBlockState getBasicBlockState(const BasicBlock BB) const;		const BasicBlockState getBasicBlockState(const BasicBlock BB) const;

		bool isValuePoisoned(const Value *V) const { return PoisonedDefs.count(V); }

/// Traverse each BB of the function and call		/// Traverse each BB of the function and call
/// InstructionVerifier::verifyInstruction for each possibly invalid		/// InstructionVerifier::verifyInstruction for each possibly invalid
/// instruction.		/// instruction.
/// It destructively modifies GCPtrTracker so it's passed via rvalue reference		/// It destructively modifies GCPtrTracker so it's passed via rvalue reference
/// in order to prohibit further usages of GCPtrTracker as it'll be in		/// in order to prohibit further usages of GCPtrTracker as it'll be in
/// inconsistent state.		/// inconsistent state.
static void verifyFunction(GCPtrTracker &&Tracker,		static void verifyFunction(GCPtrTracker &&Tracker,
InstructionVerifier &Verifier);		InstructionVerifier &Verifier);
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines
}		}

const BasicBlockState *GCPtrTracker::getBasicBlockState(		const BasicBlockState *GCPtrTracker::getBasicBlockState(
const BasicBlock *BB) const {		const BasicBlock *BB) const {
return const_cast<GCPtrTracker *>(this)->getBasicBlockState(BB);		return const_cast<GCPtrTracker *>(this)->getBasicBlockState(BB);
}		}

bool GCPtrTracker::instructionMayBeSkipped(const Instruction *I) const {		bool GCPtrTracker::instructionMayBeSkipped(const Instruction *I) const {
return ValidUnrelocatedDefs.count(I);		// Poisoned defs are skipped since they are always safe by itself by
		// definition (for details see comment to this class).
		return ValidUnrelocatedDefs.count(I) \|\| PoisonedDefs.count(I);
}		}

void GCPtrTracker::verifyFunction(GCPtrTracker &&Tracker,		void GCPtrTracker::verifyFunction(GCPtrTracker &&Tracker,
InstructionVerifier &Verifier) {		InstructionVerifier &Verifier) {
// We need RPO here to a) report always the first error b) report errors in		// We need RPO here to a) report always the first error b) report errors in
// same order from run to run.		// same order from run to run.
ReversePostOrderTraversal<const Function *> RPOT(&Tracker.F);		ReversePostOrderTraversal<const Function *> RPOT(&Tracker.F);
for (const BasicBlock *BB : RPOT) {		for (const BasicBlock *BB : RPOT) {
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines

bool GCPtrTracker::removeValidUnrelocatedDefs(const BasicBlock *BB,		bool GCPtrTracker::removeValidUnrelocatedDefs(const BasicBlock *BB,
const BasicBlockState *BBS,		const BasicBlockState *BBS,
AvailableValueSet &Contribution) {		AvailableValueSet &Contribution) {
assert(&BBS->Contribution == &Contribution &&		assert(&BBS->Contribution == &Contribution &&
"Passed Contribution should be from the passed BasicBlockState!");		"Passed Contribution should be from the passed BasicBlockState!");
AvailableValueSet AvailableSet = BBS->AvailableIn;		AvailableValueSet AvailableSet = BBS->AvailableIn;
bool ContributionChanged = false;		bool ContributionChanged = false;
		// For explanation why instructions are processed this way see
		// "Rules of deriving" in the comment to this class.
for (const Instruction &I : *BB) {		for (const Instruction &I : *BB) {
bool ProducesUnrelocatedPointer = false;		bool ValidUnrelocatedPointerDef = false;
if ((isa<GetElementPtrInst>(I) \|\| isa<BitCastInst>(I)) &&		bool PoisonedPointerDef = false;
		// TODO: `select` instructions should be handled here too.
		if (const PHINode *PN = dyn_cast<PHINode>(&I)) {
		if (containsGCPtrType(PN->getType())) {
		// If both is true, output is poisoned.
		bool HasRelocatedInputs = false;
		bool HasUnrelocatedInputs = false;
		for (unsigned i = 0, e = PN->getNumIncomingValues(); i != e; ++i) {
		const BasicBlock *InBB = PN->getIncomingBlock(i);
		const Value *InValue = PN->getIncomingValue(i);

		if (isNotExclusivelyConstantDerived(InValue)) {
		if (isValuePoisoned(InValue)) {
		// If any of inputs is poisoned, output is always poisoned too.
		HasRelocatedInputs = true;
		HasUnrelocatedInputs = true;
		break;
		}
		if (BlockMap[InBB]->AvailableOut.count(InValue))
		HasRelocatedInputs = true;
		else
		HasUnrelocatedInputs = true;
		}
		}
		if (HasUnrelocatedInputs) {
		if (HasRelocatedInputs)
		PoisonedPointerDef = true;
		else
		ValidUnrelocatedPointerDef = true;
		}
		}
		} else if ((isa<GetElementPtrInst>(I) \|\| isa<BitCastInst>(I)) &&
containsGCPtrType(I.getType())) {		containsGCPtrType(I.getType())) {
// GEP/bitcast of unrelocated pointer is legal by itself but this		// GEP/bitcast of unrelocated pointer is legal by itself but this def
// def shouldn't appear in any AvailableSet.		// shouldn't appear in any AvailableSet.
for (const Value *V : I.operands())		for (const Value *V : I.operands())
if (containsGCPtrType(V->getType()) &&		if (containsGCPtrType(V->getType()) &&
isNotExclusivelyConstantDerived(V) && !AvailableSet.count(V)) {		isNotExclusivelyConstantDerived(V) && !AvailableSet.count(V)) {
ProducesUnrelocatedPointer = true;		if (isValuePoisoned(V))
		PoisonedPointerDef = true;
		else
		ValidUnrelocatedPointerDef = true;
break;		break;
}		}
}		}
if (!ProducesUnrelocatedPointer) {		assert(!(ValidUnrelocatedPointerDef && PoisonedPointerDef) &&
bool Cleared = false;		"Value cannot be both unrelocated and poisoned!");
transferInstruction(I, Cleared, AvailableSet);		if (ValidUnrelocatedPointerDef) {
(void)Cleared;		// Remove def of unrelocated pointer from Contribution of this BB and
} else {		// trigger update of all its successors.
// Remove def of unrelocated pointer from Contribution of this BB
// and trigger update of all its successors.
Contribution.erase(&I);		Contribution.erase(&I);
		PoisonedDefs.erase(&I);
ValidUnrelocatedDefs.insert(&I);		ValidUnrelocatedDefs.insert(&I);
DEBUG(dbgs() << "Removing " << I << " from Contribution of "		DEBUG(dbgs() << "Removing urelocated " << I << " from Contribution of "
<< BB->getName() << "\n");		<< BB->getName() << "\n");
ContributionChanged = true;		ContributionChanged = true;
		} else if (PoisonedPointerDef) {
		// Mark pointer as poisoned, remove its def from Contribution and trigger
		// update of all successors.
		Contribution.erase(&I);
		PoisonedDefs.insert(&I);
		DEBUG(dbgs() << "Removing poisoned " << I << " from Contribution of "
		<< BB->getName() << "\n");
		ContributionChanged = true;
		} else {
		bool Cleared = false;
		transferInstruction(I, Cleared, AvailableSet);
		(void)Cleared;
}		}
}		}
return ContributionChanged;		return ContributionChanged;
}		}

void GCPtrTracker::gatherDominatingDefs(const BasicBlock *BB,		void GCPtrTracker::gatherDominatingDefs(const BasicBlock *BB,
AvailableValueSet &Result,		AvailableValueSet &Result,
const DominatorTree &DT) {		const DominatorTree &DT) {
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	void InstructionVerifier::verifyInstruction(
} else if (isa<CmpInst>(I) &&		} else if (isa<CmpInst>(I) &&
containsGCPtrType(I.getOperand(0)->getType())) {		containsGCPtrType(I.getOperand(0)->getType())) {
Value LHS = I.getOperand(0), RHS = I.getOperand(1);		Value LHS = I.getOperand(0), RHS = I.getOperand(1);
enum BaseType baseTyLHS = getBaseType(LHS),		enum BaseType baseTyLHS = getBaseType(LHS),
baseTyRHS = getBaseType(RHS);		baseTyRHS = getBaseType(RHS);

// Returns true if LHS and RHS are unrelocated pointers and they are		// Returns true if LHS and RHS are unrelocated pointers and they are
// valid unrelocated uses.		// valid unrelocated uses.
auto hasValidUnrelocatedUse = [&AvailableSet, baseTyLHS, baseTyRHS, &LHS,		auto hasValidUnrelocatedUse = [&AvailableSet, Tracker, baseTyLHS, baseTyRHS,
&RHS] () {		&LHS, &RHS] () {
// A cmp instruction has valid unrelocated pointer operands only if		// A cmp instruction has valid unrelocated pointer operands only if
// both operands are unrelocated pointers.		// both operands are unrelocated pointers.
// In the comparison between two pointers, if one is an unrelocated		// In the comparison between two pointers, if one is an unrelocated
// use, the other should be an unrelocated use, for this		// use, the other should be an unrelocated use, for this
// instruction to contain valid unrelocated uses. This unrelocated		// instruction to contain valid unrelocated uses. This unrelocated
// use can be a null constant as well, or another unrelocated		// use can be a null constant as well, or another unrelocated
// pointer.		// pointer.
if (AvailableSet.count(LHS) \|\| AvailableSet.count(RHS))		if (AvailableSet.count(LHS) \|\| AvailableSet.count(RHS))
return false;		return false;
// Constant pointers (that are not exclusively null) may have		// Constant pointers (that are not exclusively null) may have
// meaning in different VMs, so we cannot reorder the compare		// meaning in different VMs, so we cannot reorder the compare
// against constant pointers before the safepoint. In other words,		// against constant pointers before the safepoint. In other words,
// comparison of an unrelocated use against a non-null constant		// comparison of an unrelocated use against a non-null constant
// maybe invalid.		// maybe invalid.
if ((baseTyLHS == BaseType::ExclusivelySomeConstant &&		if ((baseTyLHS == BaseType::ExclusivelySomeConstant &&
baseTyRHS == BaseType::NonConstant) \|\|		baseTyRHS == BaseType::NonConstant) \|\|
(baseTyLHS == BaseType::NonConstant &&		(baseTyLHS == BaseType::NonConstant &&
baseTyRHS == BaseType::ExclusivelySomeConstant))		baseTyRHS == BaseType::ExclusivelySomeConstant))
return false;		return false;

		// If one of pointers is poisoned and other is not exclusively derived
		// from null it is an invalid expression: it produces poisoned result
		// and unless we want to track all defs (not only gc pointers) the only
		// option is to prohibit such instructions.
		if ((Tracker->isValuePoisoned(LHS) && baseTyRHS != ExclusivelyNull) \|\|
		(Tracker->isValuePoisoned(RHS) && baseTyLHS != ExclusivelyNull))
		return false;

// All other cases are valid cases enumerated below:		// All other cases are valid cases enumerated below:
// 1. Comparison between an exlusively derived null pointer and a		// 1. Comparison between an exclusively derived null pointer and a
// constant base pointer.		// constant base pointer.
// 2. Comparison between an exlusively derived null pointer and a		// 2. Comparison between an exclusively derived null pointer and a
// non-constant unrelocated base pointer.		// non-constant unrelocated base pointer.
// 3. Comparison between 2 unrelocated pointers.		// 3. Comparison between 2 unrelocated pointers.
		// 4. Comparison between a pointer exclusively derived from null and a
		// non-constant poisoned pointer.
return true;		return true;
};		};
if (!hasValidUnrelocatedUse()) {		if (!hasValidUnrelocatedUse()) {
// Print out all non-constant derived pointers that are unrelocated		// Print out all non-constant derived pointers that are unrelocated
// uses, which are invalid.		// uses, which are invalid.
if (baseTyLHS == BaseType::NonConstant && !AvailableSet.count(LHS))		if (baseTyLHS == BaseType::NonConstant && !AvailableSet.count(LHS))
reportInvalidUse(*LHS, I);		reportInvalidUse(*LHS, I);
if (baseTyRHS == BaseType::NonConstant && !AvailableSet.count(RHS))		if (baseTyRHS == BaseType::NonConstant && !AvailableSet.count(RHS))
Show All 38 Lines

llvm/trunk/test/SafepointIRVerifier/from-same-relocation-in-phi-nodes.ll

				; XFAIL: *
				; RUN: opt -safepoint-ir-verifier-print-only -verify-safepoint-ir -S %s 2>&1 \| FileCheck %s

				; In %merge %val.unrelocated, %ptr and %arg should be unrelocated.
				; FIXME: if this test fails it is a false-positive alarm. IR is correct.
				define void @test.unrelocated-phi.ok(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.unrelocated-phi.ok
				bci_0:
				%ptr = getelementptr i8, i8 addrspace(1)* %arg, i64 4
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
				br label %merge

				right:
				br label %merge

				merge:
				; CHECK: No illegal uses found by SafepointIRVerifier in: test.unrelocated-phi.ok
				%val.unrelocated = phi i8 addrspace(1)* [ %arg, %left ], [ %ptr, %right ]
				%c = icmp eq i8 addrspace(1)* %val.unrelocated, %arg
				ret void
				}

				declare token @llvm.experimental.gc.statepoint.p0f_isVoidf(i64, i32, void ()*, i32, i32, ...)

llvm/trunk/test/SafepointIRVerifier/unrecorded-live-at-sp.ll

	; RUN: opt %s -safepoint-ir-verifier-print-only -verify-safepoint-ir -S 2>&1 \| FileCheck %s			; RUN: opt %s -safepoint-ir-verifier-print-only -verify-safepoint-ir -S 2>&1 \| FileCheck %s

	; CHECK: Illegal use of unrelocated value found!			; CHECK: Illegal use of unrelocated value found!
	; CHECK-NEXT: Def: %base_phi3 = phi %jObject addrspace(1)* [ %obj609.relocated, %not_zero146 ], [ %base_phi2, %bci_37-aload ], !is_base_value !0			; CHECK-NEXT: Def: %base_phi4 = phi %jObject addrspace(1)* addrspace(1)* [ %addr98.relocated, %not_zero146 ], [ %cast6, %bci_37-aload ], !is_base_value !0
	; CHECK-NEXT: Use: %base_phi2 = phi %jObject addrspace(1)* [ %base_phi3, %not_zero179 ], [ %cast5, %bci_0 ], !is_base_value !0			; CHECK-NEXT: Use: %safepoint_token = tail call token (i64, i32, i32 (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_i32f(i64 0, i32 0, i32 () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 0, i32 0, i32 0, i32 0, %jObject addrspace(1)* %base_phi1, %jObject addrspace(1)* addrspace(1)* %base_phi4, %jObject addrspace(1)* addrspace(1)* %relocated4, %jObject addrspace(1)* %relocated7)


	%jObject = type { [8 x i8] }			%jObject = type { [8 x i8] }

	declare %jObject addrspace(1)* @generate_obj1() #1			declare %jObject addrspace(1)* @generate_obj1() #1

	declare %jObject addrspace(1)* addrspace(1)* @generate_obj2() #1			declare %jObject addrspace(1)* addrspace(1)* @generate_obj2() #1

	declare %jObject addrspace(1)* @generate_obj3() #1			declare %jObject addrspace(1)* @generate_obj3() #1
	▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/trunk/test/SafepointIRVerifier/uses-in-phi-nodes.ll

	; RUN: opt -safepoint-ir-verifier-print-only -verify-safepoint-ir -S %s 2>&1 \| FileCheck %s			; RUN: opt -safepoint-ir-verifier-print-only -verify-safepoint-ir -S %s 2>&1 \| FileCheck %s

	define i8 addrspace(1)* @test.not.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {			define i8 addrspace(1)* @test.not.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {
	; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.0			; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.0
	bci_0:			bci_0:
	br i1 undef, label %left, label %right			br i1 undef, label %left, label %right

	left:			left:
	%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)			%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
	br label %merge			br label %merge

	right:			right:
	br label %merge			br label %merge

	merge:			merge:
	; CHECK: Illegal use of unrelocated value found!			; CHECK: Illegal use of unrelocated value found!
	; CHECK-NEXT: Def: i8 addrspace(1)* %arg			; CHECK-NEXT: Def: %val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right ]
	; CHECK-NEXT: Use: %val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right ]			; CHECK-NEXT: Use: ret i8 addrspace(1)* %val
	%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right]			%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right ]
	ret i8 addrspace(1)* %val			ret i8 addrspace(1)* %val
	}			}

	define i8 addrspace(1)* @test.not.ok.1(i8 addrspace(1)* %arg) gc "statepoint-example" {			define i8 addrspace(1)* @test.not.ok.1(i8 addrspace(1)* %arg) gc "statepoint-example" {
	; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.1			; CHECK-LABEL: Verifying gc pointers in function: test.not.ok.1
	bci_0:			bci_0:
	br i1 undef, label %left, label %right			br i1 undef, label %left, label %right

	left:			left:
	%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)			%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
	br label %merge			br label %merge

	right:			right:
	br label %merge			br label %merge

	merge:			merge:
	; CHECK: Illegal use of unrelocated value found!			; CHECK: Illegal use of unrelocated value found!
	; CHECK-NEXT: Def: i8 addrspace(1)* %arg			; CHECK-NEXT: Def: %val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]
	; CHECK-NEXT: Use: %val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]			; CHECK-NEXT: Use: ret i8 addrspace(1)* %val
	%val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right]			%val = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]
	ret i8 addrspace(1)* %val			ret i8 addrspace(1)* %val
	}			}

	define i8 addrspace(1)* @test.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {			define i8 addrspace(1)* @test.ok.0(i8 addrspace(1)* %arg) gc "statepoint-example" {
	; CHECK: No illegal uses found by SafepointIRVerifier in: test.ok.0			; CHECK: No illegal uses found by SafepointIRVerifier in: test.ok.0
	bci_0:			bci_0:
	br i1 undef, label %left, label %right			br i1 undef, label %left, label %right

	Show All 21 Lines
	right:			right:
	br label %merge			br label %merge

	merge:			merge:
	%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right]			%val = phi i8 addrspace(1)* [ %arg, %left ], [ %arg, %right]
	ret i8 addrspace(1)* %val			ret i8 addrspace(1)* %val
	}			}

				; It should be allowed to compare poisoned ptr with null.
				define void @test.poisoned.cmp.ok(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.poisoned.cmp.ok
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token, i32 7, i32 7) ; arg, arg
				br label %merge

				right:
				%safepoint_token2 = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				br label %merge

				merge:
				; CHECK: No illegal uses found by SafepointIRVerifier in: test.poisoned.cmp.ok
				%val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				%c = icmp eq i8 addrspace(1)* %val.poisoned, null
				ret void
				}

				; It is illegal to compare poisoned ptr and relocated.
				define void @test.poisoned.cmp.fail.0(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.poisoned.cmp.fail.0
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token, i32 7, i32 7) ; arg, arg
				br label %merge

				right:
				%safepoint_token2 = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated2 = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token2, i32 7, i32 7) ; arg, arg
				br label %merge

				merge:
				; CHECK: Illegal use of unrelocated value found!
				; CHECK-NEXT: Def: %val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				; CHECK-NEXT: Use: %c = icmp eq i8 addrspace(1)* %val.poisoned, %val
				%val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				%val = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg.relocated2, %right ]
				%c = icmp eq i8 addrspace(1)* %val.poisoned, %val
				ret void
				}

				; It is illegal to compare poisoned ptr and unrelocated.
				define void @test.poisoned.cmp.fail.1(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.poisoned.cmp.fail.1
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token, i32 7, i32 7) ; arg, arg
				br label %merge

				right:
				%safepoint_token2 = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 0, i8 addrspace(1)* %arg , i32 -1, i32 0, i32 0, i32 0)
				%arg.relocated2 = call i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token %safepoint_token2, i32 7, i32 7) ; arg, arg
				br label %merge

				merge:
				; CHECK: Illegal use of unrelocated value found!
				; CHECK-NEXT: Def: %val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				; CHECK-NEXT: Use: %c = icmp eq i8 addrspace(1)* %val.poisoned, %arg
				%val.poisoned = phi i8 addrspace(1)* [ %arg.relocated, %left ], [ %arg, %right ]
				%c = icmp eq i8 addrspace(1)* %val.poisoned, %arg
				ret void
				}

				; It should be allowed to compare unrelocated phi with unrelocated value.
				define void @test.unrelocated-phi.cmp.ok(i8 addrspace(1)* %arg) gc "statepoint-example" {
				; CHECK-LABEL: Verifying gc pointers in function: test.unrelocated-phi.cmp.ok
				bci_0:
				br i1 undef, label %left, label %right

				left:
				%safepoint_token = call token (i64, i32, void (), i32, i32, ...) @llvm.experimental.gc.statepoint.p0f_isVoidf(i64 0, i32 0, void () undef, i32 0, i32 0, i32 0, i32 5, i32 0, i32 -1, i32 0, i32 0, i32 0)
				br label %merge

				right:
				br label %merge

				merge:
				; CHECK: No illegal uses found by SafepointIRVerifier in: test.unrelocated-phi.cmp.ok
				%val.unrelocated = phi i8 addrspace(1)* [ %arg, %left ], [ null, %right ]
				%c = icmp eq i8 addrspace(1)* %val.unrelocated, %arg
				ret void
				}

	declare token @llvm.experimental.gc.statepoint.p0f_isVoidf(i64, i32, void ()*, i32, i32, ...)			declare token @llvm.experimental.gc.statepoint.p0f_isVoidf(i64, i32, void ()*, i32, i32, ...)
				declare i8 addrspace(1)* @llvm.experimental.gc.relocate.p1i8(token, i32, i32)
	declare void @not_statepoint()			declare void @not_statepoint()

This is an archive of the discontinued LLVM Phabricator instance.

[SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned PHI nodesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 128130

llvm/trunk/lib/IR/SafepointIRVerifier.cpp

llvm/trunk/test/SafepointIRVerifier/from-same-relocation-in-phi-nodes.ll

llvm/trunk/test/SafepointIRVerifier/unrecorded-live-at-sp.ll

llvm/trunk/test/SafepointIRVerifier/uses-in-phi-nodes.ll

[SafepointIRVerifier] Allow non-dereferencing uses of unrelocated or poisoned PHI nodes
ClosedPublic