This is an archive of the discontinued LLVM Phabricator instance.

Fix for two constant propagation problems in GVN with assume intrinsic instruction
ClosedPublic

Authored by Ray on Jan 11 2016, 5:01 PM.

Details

Reviewers

Prazek
dberlin
nlewycky
DavidKreitzer

Commits

rG4d7257dfa111: Fix for two constant propagation problems in GVN with the assume intrinsic…
rL258435: Fix for two constant propagation problems in GVN with the assume intrinsic

Summary

This patch fixes two constant propagation problems in GVN involving the assume intrinsic. The first problem is filed as PR25285: when a basic block with a loop back-edge to itself contains multiple instances of the same assume instruction, the first assume is incorrectly constant-propagated with the value "TRUE" and later eliminated. The second problem is that the value "TRUE" is incorrectly constant-propagated into successor blocks that are not dominated by the basic block containing the assume instruction.

Diff Detail

Repository: rL LLVM

Event Timeline

Ray updated this revision to Diff 44578. Jan 11 2016, 5:01 PM

Ray retitled this revision to Fix for two constant propagation problems in GVN with assume intrinsic instruction.

Ray updated this object.

Ray added reviewers: DavidKreitzer, Prazek.

Ray added a subscriber: llvm-commits.

Something is wrong with the review, because I can't look at code that wasn't modified.
I think the diff was not generated with -U99999.

Prazek added reviewers: nlewycky, dberlin. Jan 11 2016, 10:56 PM

This is the new diff generated with --diff-cmd=diff -x -U999999

dberlin added inline comments. Jan 12 2016, 11:07 AM

include/llvm/Transforms/Utils/Local.h
320	I'm not sure this wording change makes sense. It doesn't care whether it's the end of the block that dominates or not. And by definition, uses in the same-block as their definitions must be dominated by those defs.
lib/Transforms/Scalar/GVN.cpp
2137	If there are multiple edges to the same BB, we should not do the replacement at all (we lack post-dominance info necessary to be safe in that case).
2201	If this problem only occurs with assume instructions, we shouldn't change the generic code. If it's not a problem that only occurs with assume instructions, we need A. more testcases. B. a more complete description of the problem that is happening and why this is the correct fix.
2277	Same comment as above :)
lib/Transforms/Utils/Local.cpp
1540	This is definitely not correct. Local to a block, all uses are dominated by their definition, so you don't need properlyDominates to replace them legally. It's up to the caller to determine whether this is safe semantically. (Even if this change was correct, you've also now made the replacement functions completely inconsistent with each other.)
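
To make the dominates / properlyDominates distinction in the comment on line 1540 concrete, here is a minimal C++ sketch over a toy dominator tree. It is not LLVM code: the integer block IDs, the `Idom` array, and the helper `countReplaced` are assumptions made for illustration. Only the reflexive-versus-strict behaviour is meant to mirror the two predicates discussed in the review.

```cpp
#include <vector>

// Toy dominator tree (an assumption for illustration, not LLVM's
// DominatorTree): Idom[B] is the immediate dominator of block B, and the
// entry block is its own immediate dominator.
bool dominates(const std::vector<int> &Idom, int A, int B) {
  // Walk from B up towards the entry. Dominance is reflexive, so A == B counts.
  for (int Cur = B;; Cur = Idom[Cur]) {
    if (Cur == A)
      return true;
    if (Idom[Cur] == Cur)
      return false; // reached the entry without meeting A
  }
}

// Strict dominance: same as dominates, except a block never properly
// dominates itself.
bool properlyDominates(const std::vector<int> &Idom, int A, int B) {
  return A != B && dominates(Idom, A, B);
}

// Count the uses (each given by the block it lives in) that a block-based
// replacement rooted at BB would rewrite under either predicate.
int countReplaced(const std::vector<int> &Idom, int BB,
                  const std::vector<int> &UseBlocks, bool Proper) {
  int Count = 0;
  for (int UseBlock : UseBlocks)
    if (Proper ? properlyDominates(Idom, BB, UseBlock)
               : dominates(Idom, BB, UseBlock))
      ++Count;
  return Count;
}
```

With a chain of three blocks (`Idom = {0, 0, 1}`) and uses in blocks 1, 1 and 2, a replacement rooted at block 1 rewrites all three uses under `dominates` but only the use in block 2 under `properlyDominates`. That is the behavioural difference being debated: the strict form skips every use inside the root block itself.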

Also note, it's possible to have a case where root.start dominates but it
is not safe to propagate due to multiple edges from that block to
root.getEnd.
The whole reason for having the edge is to avoid this case, AFAIK.

So I'm pretty sure using root.getStart() is just going to cause incorrect
transforms.

Note that all of propagateEquality is really somewhat of a hack to avoid
using proper control dependence, and at some point, we'll likely want to
just rip it out and use post-dominators to do this *right*.

I don't know whether the optimization capability we lose here by avoiding
assume in some cases makes us want to do that more (I leave that question
to data and chandler)

I don't have much time right now to do it because of exams, but if it won't
get accepted until Friday, I will take a look.

Ray added inline comments. Jan 12 2016, 1:02 PM

lib/Transforms/Scalar/GVN.cpp
2137	Note that there is a condition on "multiple edges to the same BB": the edges run from Root.Start to Root.End. That is when DominatesByEdge is set to false, and it is exactly the reason why DominatesByEdge was introduced as a parameter to this function. See the example added before @_Z1ii in test/Transforms/GVN/assume-equal.ll:

	; This test checks if constant propagation works for multiple node edges
	; CHECK-LABEL: define i32 @_Z1ii(i32 %p)
	define i32 @_Z1ii(i32 %p) {
	entry:
	  %cmp = icmp eq i32 %p, 42
	  call void @llvm.assume(i1 %cmp)

	  ; CHECK: br i1 true, label %bb2, label %bb2
	  br i1 %cmp, label %bb2, label %bb2
	bb2:
	  ; CHECK: br i1 true, label %bb2, label %bb2
	  br i1 %cmp, label %bb2, label %bb2

	  ; CHECK: ret i32 42
	  ret i32 %p
	}

	I was trying to make the comments more detailed. If they are confusing, we can improve them.
2201	This is a general problem in GVN::propagateEquality with the following code:

	unsigned NumReplacements =
	    DominatesByEdge
	        ? replaceDominatedUsesWith(LHS, RHS, DT, Root)
	        : replaceDominatedUsesWith(LHS, RHS, DT, Root.getStart());

	When DominatesByEdge is TRUE, it passes the edge "Root" to replaceDominatedUsesWith, and we are fine: if the edge "Root" dominates a USE, then Root.Start must dominate the USE. When DominatesByEdge is FALSE, it passes Root.End to replaceDominatedUsesWith, and we have a problem: if Root.End dominates a USE, it does not follow that Root.Start dominates the USE. The test case _Z1im below shows this.

	In this case, we would ideally use "RootDominatesEnd" (in GVN::propagateEquality) as a further test before calling replaceDominatedUsesWith(... Root.End), e.g.:

	unsigned NumReplacements =
	    DominatesByEdge
	        ? replaceDominatedUsesWith(LHS, RHS, DT, Root)
	        : RootDominatesEnd
	              ? replaceDominatedUsesWith(LHS, RHS, DT, Root.getEnd())
	              : 0;

	However, we cannot use the current RootDominatesEnd, defined as below, for this test:

	// For speed, compute a conservative fast approximation to
	// DT->dominates(Root, Root.getEnd());
	bool RootDominatesEnd = isOnlyReachableViaThisEdge(Root, DT);

	The reason is that it only works when DominatesByEdge=TRUE, not when DominatesByEdge=FALSE and there are multiple edges (e.g., 2 edges) from Root.Start to Root.End; see @_Z1ii in test/Transforms/GVN/assume-equal.ll again.

	We cannot use the following either:

	bool RootDominatesEnd = DT->dominates(Root, Root.getEnd());

	since it runs into the problem Prazek saw before, giving an assert when DominatesByEdge=FALSE; see @_Z1ii in test/Transforms/GVN/assume-equal.ll again.

	Now, if we modify RootDominatesEnd to

	bool RootDominatesEnd = DT.dominates(Root.Start(), Root.End());

	then it can be used:

	unsigned NumReplacements =
	    DominatesByEdge
	        ? replaceDominatedUsesWith(LHS, RHS, DT, Root)
	        : RootDominatesEnd
	              ? replaceDominatedUsesWith(LHS, RHS, DT, Root.getEnd())
	              : 0;

	I didn't post this change because 1) DT.dominates will slow things down, as the comments show, and 2) the name RootDominatesEnd is not appropriate.

	In fact, the simplest way is to pass Root.Start to replaceDominatedUsesWith, then change replaceDominatedUsesWith(... BasicBlock* BB) to replace the uses from the "END" of the BB. These two changes together give exactly the same effect as passing an edge, i.e., replaceDominatedUsesWith(... BasicBlockEdge Root). Note that GVN is currently the only caller of replaceDominatedUsesWith(..., BasicBlock* BB), so it is safe to change it to use "properlyDominates".

	Although it is a general problem with propagateEquality, I don't have any test cases other than those with assume instructions. The reason is that:

	The problem only exists when DominatesByEdge = FALSE.
	GVN::processAssumeIntrinsic is the only caller of propagateEquality with DominatesByEdge=FALSE, as below:

	for (BasicBlock *Successor : successors(IntrinsicI->getParent())) {
	  BasicBlockEdge Edge(IntrinsicI->getParent(), Successor);
	  Changed |= propagateEquality(V, True, Edge, false);
	}
lib/Transforms/Utils/Local.cpp
1540	Since propagateEquality in GVN is the only caller, we changed the comments to inform other users that this function replaces USES from the END of the BB. If the caller wants to replace the uses in the BB itself, the other overloaded interface, replaceDominatedUsesWith(... BasicBlockEdge Root), can be used.
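
The claim that "Root.End dominates a USE" does not imply "Root.Start dominates the USE" can be checked on the CFG shape of the @_z1im test. The sketch below is a self-contained toy, not LLVM code: it assumes block IDs 0 = entry, 1 = next (the block holding the assume), 2 = meh, and computes dominator sets with the textbook iterative intersection. `Preds` and `computeDominators` are hypothetical names.

```cpp
#include <vector>

// Preds[B] lists the predecessors of block B. Block 0 is the entry and every
// block is assumed to be reachable from it.
using Preds = std::vector<std::vector<int>>;

// Returns Dom, where Dom[B][A] is true when block A dominates block B.
// Classic iterative scheme: Dom(B) = {B} + intersection of Dom(P) over preds P.
std::vector<std::vector<bool>> computeDominators(const Preds &P) {
  const int N = static_cast<int>(P.size());
  std::vector<std::vector<bool>> Dom(N, std::vector<bool>(N, true));
  Dom[0].assign(N, false);
  Dom[0][0] = true; // the entry is dominated only by itself

  bool Changed = true;
  while (Changed) {
    Changed = false;
    for (int B = 1; B < N; ++B) {
      std::vector<bool> New(N, true);
      for (int Pred : P[B])
        for (int A = 0; A < N; ++A)
          New[A] = New[A] && Dom[Pred][A];
      New[B] = true; // every block dominates itself
      if (New != Dom[B]) {
        Dom[B] = New;
        Changed = true;
      }
    }
  }
  return Dom;
}
```

For the @_z1im shape (`Preds{{}, {0}, {0, 1}}`: entry branches to next and meh, next falls through to meh), the result has `Dom[2][2]` true but `Dom[2][1]` false. A replacement keyed on Root.End (meh) would therefore rewrite the `ret i1 %k` in meh even though the block containing the assume does not dominate it, which is the second bug described in the summary.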

I actually agree with Daniel's comments, especially the proposal to rewrite propagateEquality. I see my fix as a necessary companion to Piotr's earlier patch http://reviews.llvm.org/D12170, since my changes only affect code from that particular commit.

Perhaps we can submit this "necessary" fix together with the test cases (for "assume") first, and then think about restructuring propagateEquality.

I updated the patch to fix the incorrect comments that Daniel pointed out. Thanks for that :)

I like Ray's plan of first making the targeted stability fix and then deciding how best to restructure propagateEquality to make it clearer. The bug illustrated by the @_Z1im test case is pretty egregious and ought to be fixed promptly IMO.

Daniel, you suggested possibly splitting propagateEquality, but there is a lot of functionality in there that you'd like to share between the assume intrinsic caller and the other callers. For example, the code that infers A == 1 && B == 1 from A && B == 1 applies equally regardless of whether A && B == 1 came from "assume(A && B)" or "if (A && B)".
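
The shared inference mentioned here (from A && B == 1, conclude A == 1 and B == 1) is the kind of worklist step that does not care where the equality came from. A minimal sketch, assuming a toy expression type rather than LLVM's Value hierarchy; `Expr` and `inferLeaves` are hypothetical names:

```cpp
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical toy expression: either a named boolean leaf or an AND of two
// sub-expressions. This is not an LLVM type.
struct Expr {
  std::string Name;          // set for leaves
  const Expr *LHS = nullptr; // set for AND nodes
  const Expr *RHS = nullptr;
  bool isAnd() const { return LHS && RHS; }
};

// Given that Root is known to equal Val, collect the leaf values that follow.
std::map<std::string, bool> inferLeaves(const Expr &Root, bool Val) {
  std::map<std::string, bool> Known;
  std::vector<std::pair<const Expr *, bool>> Worklist;
  Worklist.push_back(std::make_pair(&Root, Val));

  while (!Worklist.empty()) {
    const Expr *E = Worklist.back().first;
    bool V = Worklist.back().second;
    Worklist.pop_back();

    if (!E->isAnd()) {
      Known[E->Name] = V;
      continue;
    }
    // (X && Y) == true implies X == true and Y == true.
    // (X && Y) == false says nothing about X or Y individually.
    if (V) {
      Worklist.push_back(std::make_pair(E->LHS, true));
      Worklist.push_back(std::make_pair(E->RHS, true));
    }
  }
  return Known;
}
```

Whether the root fact originates from an assume or from a taken branch edge only changes where the resulting equalities may be applied, not how they are derived, which is the argument for keeping this logic in one place.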

I think the fundamental difference between assume and the other callers of propagateEquality is that for assume, the equality property becomes known true immediately following the assume instruction while for the other callers, the equality property becomes known true along the outgoing edge of the block ending in the conditional branch or switch. It shouldn't be hard to make that distinction in the interface to propagateEquality. Perhaps something like this:

/// The given values are known to be equal at a certain point, P, in the program.
/// When DominatesByEdge is true, that point is the CFG edge Root.
/// When DominatesByEdge is false, that point is the Instruction Instr.
/// Exploit this, for example by replacing 'LHS' with 'RHS' at all uses dominated by P.
/// Returns whether a change was made.
bool GVN::propagateEquality(Value *LHS, Value *RHS, const BasicBlockEdge &Root,
                            Instruction *Instr, bool DominatesByEdge) {

When DominatesByEdge is true, Instr would be ignored. And when DominatesByEdge is false, Root would be ignored. Then the dominance check for the assume case becomes one of "does this instruction dominate" rather than "does this block dominate". (Note that this lets us get rid of the "for each successor" loop inside processAssumeIntrinsic, which is pretty artificial. I suspect it was written that way just to make it easier to reuse propagateEquality.)
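
A rough sketch of what "does this instruction dominate" could mean, as opposed to "does this block dominate". It is a toy model, not the interface proposed above and not LLVM's implementation: an instruction is represented by an assumed (block, index) pair, the dominator tree by an `Idom` array, and `Point`, `blockProperlyDominates` and `instrDominates` are hypothetical names. PHI nodes and other special cases are ignored.

```cpp
#include <vector>

// Position of an instruction: the block it lives in and its index inside
// that block (hypothetical representation for illustration).
struct Point {
  int Block;
  int Index;
};

// Idom[B] is the immediate dominator of block B; the entry is its own idom.
bool blockProperlyDominates(const std::vector<int> &Idom, int A, int B) {
  // Climb strictly above B; a block never properly dominates itself.
  while (Idom[B] != B) {
    B = Idom[B];
    if (B == A)
      return true;
  }
  return false;
}

// Def dominates Use if it comes earlier in the same block, or if its block
// properly dominates the block of the use.
bool instrDominates(const std::vector<int> &Idom, Point Def, Point Use) {
  if (Def.Block == Use.Block)
    return Def.Index < Use.Index;
  return blockProperlyDominates(Idom, Def.Block, Use.Block);
}
```

With blocks 0 = entry, 1 = next, 2 = meh and `Idom = {0, 0, 0}`, an assume at position (1, 0) dominates a second assume at (1, 1) but not the other way round, and it does not dominate a use at (2, 0). Under these assumptions both reported problems fall out of a single check: the first assume is never rewritten on behalf of the second, and nothing is propagated into meh.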

To start with, I'm fine with committing this patch if we agree to address
the other issues, *somehow*, in a followup (either by y'all or by Piotr).

Hi Piotr,

Did you get time to look at the fix I uploaded the second time? Does it look good to you?

Daniel’s suggestion to restructure propagateEquality is good and necessary, since many developers find this function confusing.

Dave and I tried the idea he mentioned, i.e., the new interface, but it does not do much to reduce the confusion in that function, and it is not simpler than the current fix.

I am wondering whether you have any plans to rewrite this function. Are you familiar with the code? If not, do you know who is familiar with it and could assist us with the rewrite? We can all help if needed, but we need an expert first.

Thank you very much!

Regards,
Ray
From: Daniel Berlin [mailto:dberlin@dberlin.org]
Sent: Thursday, January 14, 2016 3:42 PM
To: reviews+D16100+public+7f8d41cf7877574b@reviews.llvm.org
Cc: Zhang, Yuanrui <yuanrui.zhang@intel.com>; Nick Lewycky <nlewycky@google.com>; Piotr Padlewski <piotr.padlewski@gmail.com>; Kreitzer, David L <david.l.kreitzer@intel.com>; llvm-commits <llvm-commits@lists.llvm.org>
Subject: Re: [PATCH] D16100: Fix for two constant propagation problems in GVN with assume intrinsic instruction

To start with, I'm fine with committing this patch if we agree to address the other issues, *somehow*, in a followup (either by y'all or by Piotr).

LGTM

This revision is now accepted and ready to land. Jan 20 2016, 1:09 PM

Closed by commit rL258435: Fix for two constant propagation problems in GVN with the assume intrinsic (authored by dlkreitz). Jan 21 2016, 1:36 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
include/llvm/Transforms/Utils/Local.h	2 lines
lib/Transforms/Scalar/GVN.cpp	9 lines
lib/Transforms/Utils/Local.cpp	2 lines
test/Transforms/GVN/assume-equal.ll	40 lines

Diff 44578

include/llvm/Transforms/Utils/Local.h

Context not available.
 unsigned replaceDominatedUsesWith(Value *From, Value *To, DominatorTree &DT,
                                   const BasicBlockEdge &Edge);
 /// \brief Replace each use of 'From' with 'To' if that use is dominated by
-/// the given BasicBlock. Returns the number of replacements made.
+/// the end of the given BasicBlock. Returns the number of replacements made.
 unsigned replaceDominatedUsesWith(Value *From, Value *To, DominatorTree &DT,
                                   const BasicBlock *BB);

Context not available.

lib/Transforms/Scalar/GVN.cpp

Context not available.
 /// The given values are known to be equal in every block
 /// dominated by 'Root'. Exploit this, for example by replacing 'LHS' with
 /// 'RHS' everywhere in the scope. Returns whether a change was made.
-/// If DominatesByEdge is false, then it means that it is dominated by Root.End.
+/// If DominatesByEdge is false, it means that there might be multiple edges
+/// from Root.Start to Root.End, in which case we need to pass Root.Start and
+/// use a different implementation of replaceDominatedUsesWith to test.
+//
 bool GVN::propagateEquality(Value *LHS, Value *RHS, const BasicBlockEdge &Root,
                             bool DominatesByEdge) {
   SmallVector<std::pair<Value*, Value*>, 4> Worklist;
Context not available.
     unsigned NumReplacements =
         DominatesByEdge
             ? replaceDominatedUsesWith(LHS, RHS, *DT, Root)
-            : replaceDominatedUsesWith(LHS, RHS, *DT, Root.getEnd());
+            : replaceDominatedUsesWith(LHS, RHS, *DT, Root.getStart());

     Changed |= NumReplacements > 0;
     NumGVNEqProp += NumReplacements;
Context not available.
             DominatesByEdge
                 ? replaceDominatedUsesWith(NotCmp, NotVal, *DT, Root)
                 : replaceDominatedUsesWith(NotCmp, NotVal, *DT,
-                                           Root.getEnd());
+                                           Root.getStart());
         Changed |= NumReplacements > 0;
         NumGVNEqProp += NumReplacements;
       }
Context not available.

lib/Transforms/Utils/Local.cpp

Context not available.
        UI != UE;) {
     Use &U = *UI++;
     auto *I = cast<Instruction>(U.getUser());
-    if (DT.dominates(BB, I->getParent())) {
+    if (DT.properlyDominates(BB, I->getParent())) {
       U.set(To);
       DEBUG(dbgs() << "Replace dominated use of '" << From->getName() << "' as "
                    << *To << " in " << *U << "\n");
Context not available.

test/Transforms/GVN/assume-equal.ll

Context not available.
   ret i32 17
 }

+; This test checks if GVN can do the constant propagation correctly
+; when there are multiple uses of the same assume value in the
+; basic block that has a loop back-edge pointing to itself.
+;
+; CHECK-LABEL: define i32 @_Z1il(i32 %val, i1 %k)
+define i32 @_Z1il(i32 %val, i1 %k) {
+  br label %next
+
+next:
+; CHECK: tail call void @llvm.assume(i1 %k)
+; CHECK-NEXT: %cmp = icmp eq i32 %val, 50
+  tail call void @llvm.assume(i1 %k)
+  tail call void @llvm.assume(i1 %k)
+  %cmp = icmp eq i32 %val, 50
+  br i1 %cmp, label %next, label %meh
+
+meh:
+  ret i32 0
+}
+
+; This test checks if GVN can prevent the constant propagation correctly
+; in the successor blocks that are not dominated by the basic block
+; with the assume instruction.
+;
+; CHECK-LABEL: define i1 @_z1im(i32 %val, i1 %k, i1 %j)
+define i1 @_z1im(i32 %val, i1 %k, i1 %j) {
+  br i1 %j, label %next, label %meh
+
+next:
+; CHECK: tail call void @llvm.assume(i1 %k)
+; CHECK-NEXT: br label %meh
+  tail call void @llvm.assume(i1 %k)
+  tail call void @llvm.assume(i1 %k)
+  br label %meh
+
+meh:
+; CHECK: ret i1 %k
+  ret i1 %k
+}

 declare noalias i8* @_Znwm(i64)
 declare void @_ZN1AC1Ev(%struct.A*)
 declare void @llvm.assume(i1)
Context not available.
