This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
12/25
ADCE.cpp

Differential D23824

[ADCE] Add handling of PHI nodes when removing control flow
ClosedPublic

Authored by david2050 on Aug 23 2016, 6:54 PM.

Download Raw Diff

Details

Reviewers

nadav
majnemer
mehdi_amini

Summary

This is part of a series of patches to evolve ADCE.cpp to support
removing of unnecessary control flow.

This patch updates the propagation of liveness information to handle
special properties of PHI nodes.

We still force all terminators live for now until we add code to
handle removing control flow in a later patch.

No changes to effective behavior with this patch

Previous patches:

D23559 [ADCE] Add control dependence computation
D23225 [ADCE] Modify data structures to support removing control flow
D23065 [ADCE] Refactor anticipating new functionality (NFC)
D23102 [ADCE] Refactoring for new functionality (NFC)

Diff Detail

Event Timeline

david2050 updated this revision to Diff 69069.Aug 23 2016, 6:54 PM

david2050 retitled this revision from to [ADCE] Add handling of PHI nodes when removing control flow.

david2050 updated this object.

david2050 added reviewers: mehdi_amini, nadav, majnemer.

david2050 added subscribers: llvm-commits, freik, twoh.

majnemer added inline comments.Aug 23 2016, 7:56 PM

lib/Transforms/Scalar/ADCE.cpp
286	I'd use `auto *`.
329	Ditto.
443	Ditto.

Use auto keyword

mehdi_amini added inline comments.Aug 23 2016, 8:48 PM

lib/Transforms/Scalar/ADCE.cpp
116	Can you revisit the comment? I find it hard to figure what you want to do here. Also, as a side note, "verify" makes me think of a "check", which does not go well with the name of the method prefixed with "mark".
448	This is not clear to me, I have no idea what is going on in this code? I mean I can follow what your doing, but I don't know why.

Expand comments on actions taken by makeLivePhiNodeInputs

mehdi_amini added inline comments.Aug 25 2016, 11:12 AM

lib/Transforms/Scalar/ADCE.cpp
116	s/poeration/operation
121	s/hte/the
469	There is no `br label %cond.false` here.
470	Thanks for the extensive comment and the example, unfortunately this is not really the information I was looking for. Phi nodes are dependent on the control flow, and we need to mark predecessors live in general, I got that part, but what isn't just clear to me is why only some blocks are marked live. For instance: define i8 @foo(i1 %cond1, i1 %cond2, i8 %a, i8 %b, i8 %c, i8 %d) nounwind { entry: br i1 %cond1, label %first_true, label %first_false first_true: br i1 %cond2, label %second_true, label %second_false first_false: br label %end second_true: br label %end second_false: br label %end end: %result = phi i8 [ %a, %first_false ], [ %b, %second_true ], [ %c, %second_false ] ret i8 %result } Assuming none of the `br label %end` is live, and considering the only phi, as written, all predecessors will be marked live. If I replace the phi with `phi i8 [ %a, %first_false ], [ %a, %second_true ], [ %c, %second_false ]` , then only `%second_false` is marked live. Now, what I find annoying is that with the same edges but just changing the order of the operands, you would get a different results. For instance: `phi i8 [ %c, %second_false ], [ %a, %first_false ], [ %a, %second_true ]` now `%second_false` is not marked live and instead `%first_false` and `%second_true` are.
472	s/CommonReachngDefintion/CommonReachingDefinition/

Address comments.
Remove code that is functionally redundant.

Herald added a subscriber: david2050. · View Herald TranscriptAug 25 2016, 11:47 AM

mehdi_amini added inline comments.Aug 25 2016, 11:50 AM

lib/Transforms/Scalar/ADCE.cpp
465	Sorry missed this one: `S/tis/this`

david2050 added inline comments.Aug 25 2016, 12:34 PM

lib/Transforms/Scalar/ADCE.cpp
470	You are correct the it is sensitive to the order. Marking one is sufficient for correctness. I have considered marking the lexically first one as live which would provide more stability but it is a rare occurrence so it is not clear if it is worth fixing. I don't have any collected data on this.

fix typo

david2050 marked an inline comment as done.Sep 1 2016, 9:10 AM

david2050 added inline comments.

lib/Transforms/Scalar/ADCE.cpp
470	I ran an experiment by prioritize that we keep the lexically last defined value. This should bias toward forcing live earlier branch decisions which are likely to be forced live anyway if we mark a later branch live. This had minimal impact on the my set of C++ tests (two changes, in different directions).

• dberlin added a subscriber: • dberlin.Sep 1 2016, 11:52 AM

• dberlin added inline comments.

lib/Transforms/Scalar/ADCE.cpp
483	I'm curious why you bother with this detection? Pretty much every other pass we have can already do a better job of it :) That is, they can determine not just if they are dead or trivially equivalent, but produce equivalent values. You are basically detecting the block is not necessary to produce that value to the phi node. That's, IMHO, more of a redundancy elimination technique than a dead code one. The code isn't dead, it's just equivalent to some other code. Or am i missing something? Do you have examples where the standard pass pipeline doesn't remove the block producing the equivalent value?

david2050 added inline comments.Sep 5 2016, 10:32 AM

lib/Transforms/Scalar/ADCE.cpp
483	This step is necessary otherwise we delete a branch which determines a live value. It is an artifact of the semantics of PHI nodes. This is about preserving existing structure not translating to a new form. And Yes, outside of loops, the baseline code catches almost all cases. The primary motivation was dead, may-be-infinute loops.

• dberlin added inline comments.Sep 5 2016, 11:17 AM

lib/Transforms/Scalar/ADCE.cpp
483	I understand the semantics of phi nodes :) I'm asking why you don't just mark them all live. Traditionally, in DCE, what would happen is that if the phi is live, you mark all values that are incoming to the phi as live (and in a control dependence DCE, any control dependences of the necessary edges) Here, it seems like you are trying to figure out if the phi is useless by seeing if all branches have the same incoming value, and if so, you are arbitrarily choosing one so you can delete at least one branch. That seems .. pointless. It should only also matter in the case of either predecessors that only exist for control flow (IE empty basic blocks), or critical edges. The only interesting loop case i'm aware of where this matters is: if (b) { int i; for (i = 0; i<1000; ++i); j = 0; } return j; } But your current code won't allow deleting the loop in this case, AFAICT. (among other things, you would need to determine what the control dependence would look like if the critical edge was split without splitting it. This is what GCC does)

• dberlin added inline comments.Sep 5 2016, 7:20 PM

lib/Transforms/Scalar/ADCE.cpp
483	(and to be super clear, when i say "I'm asking why you don't just mark them all live.", i mean "all the arguments and edges of a given phi node", not "all phis").

david2050 added inline comments.Sep 6 2016, 9:12 AM

lib/Transforms/Scalar/ADCE.cpp
483	It is not all branches, it is all branches from dead predecessors since in the case that we should remove all such predecessors to be replaced with a single edge, we verify that there is a unique value for that edge. If not we select one predecessor to keep live. Consider a = 0 if (x) { if (y) a =1 else a = 2 } else a =3 phi(...) The phi node following this block has tree edges labeled with values 1,2,3 respectively. The guarded basic blocks are empty because the assignments are folded into the phi node. The branch decision at which tests 'y' is dead at this point because there are no live operations in any block it controls. Here we determine that we need to keep at least one of the empty blocks around to distinguish between 1 and 2, which models the behavior if the assignments to 'a' were instead explicitly in the original blocks. Given a more complex example we might have a = 0 if (x) { if (z) a=1 else { if (y) { } else { } } } else a =3 phi(...) Here there are 4 predecessors to the join, three are 2 but only two distinct values and we only need to have the branch on 'z' live. The branch on 'y' need not be. This is why the code does not mark all incoming edges live but does not do the extra work to try and pick and optimal choice (the path with a=1 in this case). The scenario is rare enough that it is not clear extra work is worth the effort. In your example, I believe the current code will allow the remove the loop since the assignment to J is not control dependent on the loop but only on the branch testing 'b'. The subset of the CFG corresponding to the loop would be all marked dead and removed and the edge leaving the block holding the test on 'b' redirected to the block holding 'j=0'. I don't understand your reference to splitting a critical edge.

• dberlin added inline comments.Sep 6 2016, 10:30 AM

lib/Transforms/Scalar/ADCE.cpp
483	(Sorry if this is short, i lost this comment once already). Okay, we are talking past each other. I understand what you are doing, and why you are trying to mark only certain values of a phi live. I'm saying "this is not worth it, and is complex" compared to just marking all of them. You produced an example showing it removes a single block. I agree it can remove that block. Instead of arguing by example, i produced data: I ran your patch on ~5000 C++ packages as is, and then ran it on ~5000 C++ packages with it changed to just mark all phi node inputs as live when the phi node is live. The number of cases i find where it makes a difference is ~0.01%. The average size of the binaries in which it does something are ~16 bytes smaller (which, for those binaries, amounts to an average size change of 0.00005%) That seems "not worth it" to me :) Now, i'm just trying to provide data not actually to say "we shouldn't do it this way", but more "if we are going to add complexity, let's do it where we have data that shows it's worth it". We both can clearly come up with contrived examples where it matters. So if you have data showing trying to only mark single branches of unique values live is worth it, great, add the data and let's stop arguing, and let's do it. "In your example, I believe the current code will allow the remove the loop since the assignment to J is not control dependent on the loop but only on the branch testing 'b'. The subset of the CFG corresponding to the loop would be all marked dead and removed and the edge leaving the block holding the test on 'b' redirected to the block holding 'j=0'. I don't understand your reference to splitting a critical edge." You should try it before you knock it :) You may have to change it to j=b or add a new parameter somewhere, depending on what passes you run before it (otherwise the phi ends up with the constant propagated into it). You will end up with a phi node in the end block that merges the else-condition value (IE from outside the condition entirely) of jwith the inside-condition value. Control-dependence will force you to keep the empty loop live due to the critical edge (again, whether the edge stays critical or not depends on where you end up in the pass pipeline) This case is more common, it affects at least ~1% of the packages. I did not fix it, so i can't say how much it saves or not saves. It is common enough that gcc special cases it (as i said) and has regression tests to make sure the loop is eliminated by DCE.

david2050 added inline comments.Sep 6 2016, 11:12 AM

lib/Transforms/Scalar/ADCE.cpp
483	Thanks for great due diligence for the patch. The experiment you ran seems doomed because the patches submitted so far are incomplete. If you look around line 305 there is loop which forces all branches live. This is temporary waiting for the next patch which includes the code to delete branches. Right now the patch reduces to prior behavior of not actually deleting in branches although it takes a long way to get there. Thus any changes you see based on your edit are curious since it should not have any impact on the generated code at all. I certainly agree that even the relatively simple pick-one-value here may have no value. Perhaps we can accept this change as is, with a TODO to evaluate this point after the next (and final) patch is up? (Assuming there is nothing else to change along the way)

• dberlin added inline comments.Sep 6 2016, 12:00 PM

lib/Transforms/Scalar/ADCE.cpp
483	I feel like you are implying something here. In this case, I simply hacked in the right parts of the rest of the old patch (and a little work) before it was broken up before running the experiment. (You could have simply asked for the code if you are concerned). You are talking about something that is < 50 lines of code to make work (control dependence DCE is not that hard, nor is this even the first implementation of it in LLVM!), so i'm not sure why you would think i didn't run the experiment properly? (I could also point out i don't actually care about correctness of the generated code, only the maximum optimization value) If you would like to actually run your own experiment show that this is worth it, you are, as i said, welcome. You may want to focus on that instead of trying to pick mine apart ;) Right now, like I said, i see nothing that makes it seem like this is worth it, and your response here doesn't help that. If you want to produce data that says it's worth it, great!. Otherwise, I'm loathe to start by adding complexity and then try to take it away later. That never works.

david2050 added inline comments.Sep 6 2016, 12:30 PM

lib/Transforms/Scalar/ADCE.cpp
483	No implications, and apologies for suggesting you were incomplete in your analysis. To be clear then, in your variant, you marked every basic block which is a predecessor of a live-phi as live?

I ran this test:

int foo(int a, int b, int N) {
  int j = 0;
  if (b) {
    int i;
    for (i = 0; i < N; i++)
      ;
    j = 1;
  }
  return j;
}

compiled to bc and then processed: opt -sroa -adce -adce-remove-loops and it did successfully remove the loop. I also tried this variant

int foo(int b, int N) {
  int j = 0;
  if (b) {
    int i;
    j = 1;
    for (i = 0; i < N; i++)
      ;
  }
  return j;
}

The final output for that one looked like:

define i32 @foo(i32 %b, i32 %N) #0 {
  %1 = icmp ne i32 %b, 0
  br i1 %1, label %2, label %3

; <label>:2:                                      ; preds = %0
  br label %3

; <label>:3:                                      ; preds = %2, %0
  %j.0 = phi i32 [ 0, %0 ], [ 1, %2 ]
  ret i32 %j.0
}

This snippet

int foo(int b, int j, int N) {
  if (b) {
    int i;
    for (i = 0; i < N; i++)
      ;
    j =	0;
  }
  return j;
}

via

clang -c -emit-llvm -O0 ...
opt -sroa -adce -adce-remove-loops -S ...

generates

define i32 @foo(i32 %b, i32 %j, i32 %N) #0 {
  %1 = icmp ne i32 %b, 0
  br i1 %1, label %2, label %3

; <label>:2:                                      ; preds = %0
  br label %3

; <label>:3:                                      ; preds = %2, %0
  %.0 = phi i32 [ %j, %0 ], [ 0, %2 ]
  ret i32 %.0
}

Change handling of PHI nodes to force predecessors live

Introduce CFLive field to mark blocks whose control dependence sources should be live; use this for PHI predecessors and blocks with live operations

ping

A lot simpler now!

LGTM, thanks.

This revision is now accepted and ready to land.Sep 19 2016, 8:20 AM

majnemer added inline comments.Sep 19 2016, 10:10 AM

lib/Transforms/Scalar/ADCE.cpp
391	I don't think you need llvm:: here.
394	Needs a space after 'if'

david2050 marked 2 inline comments as done.Sep 19 2016, 10:35 AM

david2050 added inline comments.

lib/Transforms/Scalar/ADCE.cpp
391	stupid xcode autocompletion :-)

Respond to David's comments

LGTM

david2050 closed this revision.Sep 19 2016, 4:25 PM

david2050 mentioned this in D24918: [ADCE] Add code to remove dead branches.Sep 26 2016, 8:02 AM

david2050 mentioned this in rL289548: [ADCE] Add code to remove dead branches.Dec 13 2016, 8:52 AM

Revision Contents

Path

Size

lib/

Transforms/

Scalar/

ADCE.cpp

92 lines

Diff 69069

lib/Transforms/Scalar/ADCE.cpp

Show First 20 Lines • Show All 103 Lines • ▼ Show 20 Lines	class AggressiveDeadCodeElimination {
bool isAlwaysLive(Instruction &I);		bool isAlwaysLive(Instruction &I);
/// Return true for instrumentation instructions for value profiling.		/// Return true for instrumentation instructions for value profiling.
bool isInstrumentsConstant(Instruction &I);		bool isInstrumentsConstant(Instruction &I);

/// Propagate liveness to reaching definitions.		/// Propagate liveness to reaching definitions.
void markLiveInstructions();		void markLiveInstructions();
/// Mark an instruction as live.		/// Mark an instruction as live.
void markLive(Instruction *I);		void markLive(Instruction *I);
		/// Mark reaching defintions and possibly reaching blocks live for a PHINode.
		void markPhiReachingDefs(PHINode *PN);

		/// Verify that a live phi node that has inputs in blocks with
		/// dead terminators has a unique reaching definition from all such blocks.
		mehdi_aminiUnsubmitted Done Reply Inline Actions Can you revisit the comment? I find it hard to figure what you want to do here. Also, as a side note, "verify" makes me think of a "check", which does not go well with the name of the method prefixed with "mark". mehdi_amini: Can you revisit the comment? I find it hard to figure what you want to do here. Also, as a…
		mehdi_aminiUnsubmitted Done Reply Inline Actions s/poeration/operation mehdi_amini: s/poeration/operation
		void markLivePhiNodeInputs();
		void markLivePhiNodeInputs(BasicBlock *BB);

/// Record the Debug Scopes which surround live debug information.		/// Record the Debug Scopes which surround live debug information.
void collectLiveScopes(const DILocalScope &LS);		void collectLiveScopes(const DILocalScope &LS);
		mehdi_aminiUnsubmitted Done Reply Inline Actions s/hte/the mehdi_amini: s/hte/the
void collectLiveScopes(const DILocation &DL);		void collectLiveScopes(const DILocation &DL);

/// Analyze dead branches to find those whose branches are the sources		/// Analyze dead branches to find those whose branches are the sources
/// of control dependences impacting a live block. Those branches are		/// of control dependences impacting a live block. Those branches are
/// marked live.		/// marked live.
void markLiveBranchesFromControlDependences();		void markLiveBranchesFromControlDependences();

/// Remove instructions not marked live, return if any any instruction		/// Remove instructions not marked live, return if any any instruction
▲ Show 20 Lines • Show All 136 Lines • ▼ Show 20 Lines	if (Function *Callee = CI->getCalledFunction())
if (isa<Constant>(CI->getArgOperand(0)))		if (isa<Constant>(CI->getArgOperand(0)))
return true;		return true;
return false;		return false;
}		}

void AggressiveDeadCodeElimination::markLiveInstructions() {		void AggressiveDeadCodeElimination::markLiveInstructions() {

// Propagate liveness backwards to operands.		// Propagate liveness backwards to operands.
		bool PhiNodeInputsChecked = false;
do {		do {
// Worklist holds newly discovered live instructions		// Worklist holds newly discovered live instructions
// where we need to mark the inputs as live.		// where we need to mark the inputs as live.
while (!Worklist.empty()) {		while (!Worklist.empty()) {
Instruction *LiveInst = Worklist.pop_back_val();		Instruction *LiveInst = Worklist.pop_back_val();
		DEBUG(dbgs() << "work live: "; LiveInst->dump(););

// Collect the live debug info scopes attached to this instruction.		// Collect the live debug info scopes attached to this instruction.
if (const DILocation *DL = LiveInst->getDebugLoc())		if (const DILocation *DL = LiveInst->getDebugLoc())
collectLiveScopes(*DL);		collectLiveScopes(*DL);

DEBUG(dbgs() << "work live: "; LiveInst->dump(););		if (PHINode *PN = dyn_cast<PHINode>(LiveInst)) {
		majnemerUnsubmitted Done Reply Inline Actions I'd use `auto `. majnemer:* I'd use `auto *`.
		markPhiReachingDefs(PN);
		continue;
		}

for (Use &OI : LiveInst->operands())		for (Use &OI : LiveInst->operands())
if (Instruction *Inst = dyn_cast<Instruction>(OI))		if (Instruction *Inst = dyn_cast<Instruction>(OI))
markLive(Inst);		markLive(Inst);
}		}
markLiveBranchesFromControlDependences();		markLiveBranchesFromControlDependences();

		// After we have incorporated all data and control flow effects, make
		// a scan of phi nodes to make sure there are no situations where the
		// effects of a branch are realized only in different reaching values
		// at a live phi node.
		if (Worklist.empty() && !PhiNodeInputsChecked) {
		PhiNodeInputsChecked = true;
		markLivePhiNodeInputs();
		}

if (Worklist.empty()) {		if (Worklist.empty()) {
// Temporary until we can actually delete branches.		// Temporary until we can actually delete branches.
SmallVector<TerminatorInst *, 16> DeadTerminators;		SmallVector<TerminatorInst *, 16> DeadTerminators;
for (auto *BB : BlocksWithDeadTerminators)		for (auto *BB : BlocksWithDeadTerminators)
DeadTerminators.push_back(BB->getTerminator());		DeadTerminators.push_back(BB->getTerminator());
for (auto *I : DeadTerminators)		for (auto *I : DeadTerminators)
markLive(I);		markLive(I);
assert(BlocksWithDeadTerminators.empty());		assert(BlocksWithDeadTerminators.empty());
// End temporary.		// End temporary.
}		}
} while (!Worklist.empty());		} while (!Worklist.empty());

assert(BlocksWithDeadTerminators.empty());		assert(BlocksWithDeadTerminators.empty());
}		}

		void AggressiveDeadCodeElimination::markPhiReachingDefs(PHINode *PN) {

		// For each reaching definition, if it is an instruction, mark it live.
		// Otherwise, mark the terminator of the associated block live so we preserve
		// the control flow associated with this value.
		auto NumIdx = PN->getNumIncomingValues();
		for (unsigned Idx = 0; Idx < NumIdx; ++Idx) {
		auto *Value = PN->getIncomingValue(Idx);
		if (Instruction *ReachingDef = dyn_cast<Instruction>(Value)) {
		majnemerUnsubmitted Done Reply Inline Actions Ditto. majnemer: Ditto.
		markLive(ReachingDef);
		continue;
		}
		auto *PredTerm = PN->getIncomingBlock(Idx)->getTerminator();
		DEBUG(dbgs() << "constant phi live: "; PredTerm->dump(););
		markLive(PredTerm);
		}
		}

void AggressiveDeadCodeElimination::markLive(Instruction *I) {		void AggressiveDeadCodeElimination::markLive(Instruction *I) {

auto &Info = InstInfo[I];		auto &Info = InstInfo[I];
if (Info.Live)		if (Info.Live)
return;		return;

DEBUG(dbgs() << "mark live: "; I->dump());		DEBUG(dbgs() << "mark live: "; I->dump());
Info.Live = true;		Info.Live = true;
Show All 36 Lines	void AggressiveDeadCodeElimination::collectLiveScopes(const DILocation &DL) {
// Collect live scopes from the scope chain.		// Collect live scopes from the scope chain.
collectLiveScopes(*DL.getScope());		collectLiveScopes(*DL.getScope());

// Tail-recurse through the inlined-at chain.		// Tail-recurse through the inlined-at chain.
if (const DILocation *IA = DL.getInlinedAt())		if (const DILocation *IA = DL.getInlinedAt())
collectLiveScopes(*IA);		collectLiveScopes(*IA);
}		}

void AggressiveDeadCodeElimination::markLiveBranchesFromControlDependences() {		void AggressiveDeadCodeElimination::markLiveBranchesFromControlDependences() {
		majnemerUnsubmitted Done Reply Inline Actions I don't think you need llvm:: here. majnemer: I don't think you need llvm:: here.
		david2050AuthorUnsubmitted Not Done Reply Inline Actions stupid xcode autocompletion :-) david2050: stupid xcode autocompletion :-)

if (BlocksWithDeadTerminators.empty())		if (BlocksWithDeadTerminators.empty())
return;		return;
		majnemerUnsubmitted Done Reply Inline Actions Needs a space after 'if' majnemer: Needs a space after 'if'

DEBUG({		DEBUG({
dbgs() << "new live blocks:\n";		dbgs() << "new live blocks:\n";
for (auto *BB : NewLiveBlocks)		for (auto *BB : NewLiveBlocks)
dbgs() << "\t" << BB->getName() << '\n';		dbgs() << "\t" << BB->getName() << '\n';
dbgs() << "dead terminator blocks:\n";		dbgs() << "dead terminator blocks:\n";
for (auto *BB : BlocksWithDeadTerminators)		for (auto *BB : BlocksWithDeadTerminators)
dbgs() << "\t" << BB->getName() << '\n';		dbgs() << "\t" << BB->getName() << '\n';
Show All 14 Lines	void AggressiveDeadCodeElimination::markLiveBranchesFromControlDependences() {

// Dead terminators which control live blocks are now marked live.		// Dead terminators which control live blocks are now marked live.
for (auto BB : IDFBlocks) {		for (auto BB : IDFBlocks) {
DEBUG(dbgs() << "live control in: " << BB->getName() << '\n');		DEBUG(dbgs() << "live control in: " << BB->getName() << '\n');
markLive(BB->getTerminator());		markLive(BB->getTerminator());
}		}
}		}

		void AggressiveDeadCodeElimination::markLivePhiNodeInputs() {
		SmallPtrSet<BasicBlock *, 16> LiveJoinBlocks;

		// Find all successors of blocks with dead terminators
		// and mark the live phi nodes in them.
		SmallVector<BasicBlock *, 16> Elements(BlocksWithDeadTerminators.begin(),
		BlocksWithDeadTerminators.end());
		for (auto *BB : Elements)
		for (auto *SuccBB : successors(BB))
		if (LiveJoinBlocks.insert(SuccBB).second)
		markLivePhiNodeInputs(SuccBB);
		}

		void AggressiveDeadCodeElimination::markLivePhiNodeInputs(BasicBlock *BB) {
		auto &Info = BlockInfo[BB];
		if (!Info.Live)
		return;
		// Iterate over all live PHINodes.
		for (auto it = BB->begin(); PHINode *PN = dyn_cast<PHINode>(it); ++it) {
		majnemerUnsubmitted Done Reply Inline Actions Ditto. majnemer: Ditto.
		if (!isLive(PN))
		continue;

		// Verify a common reaching definition from predecessors with
		// dead terminators, marking some live to enforce this.
		mehdi_aminiUnsubmitted Done Reply Inline Actions This is not clear to me, I have no idea what is going on in this code? I mean I can follow what your doing, but I don't know why. mehdi_amini: This is not clear to me, I have no idea what is going on in this code? I mean I can follow…
		Value *CommonReachngDefintion = nullptr;
		auto NumIdx = PN->getNumIncomingValues();
		for (unsigned Idx = 0; Idx < NumIdx; ++Idx) {
		auto PredTerm = PN->getIncomingBlock(Idx)->getTerminator();
		if (isLive(PredTerm))
		continue;
		auto *Value = PN->getIncomingValue(Idx);
		if (!CommonReachngDefintion) {
		CommonReachngDefintion = Value;
		continue;
		}
		if (CommonReachngDefintion != Value) {
		// When there are two definitions from "dead" predecessor paths we
		// preserve one of those paths so the two values can be distinguished.
		DEBUG(dbgs() << "Live due to phi conflict "; PredTerm->dump());
		markLive(PredTerm);
		}
		mehdi_aminiUnsubmitted Done Reply Inline Actions Sorry missed this one: `S/tis/this` mehdi_amini: Sorry missed this one: `S/tis/this`
		}
		}
		}

		mehdi_aminiUnsubmitted Done Reply Inline Actions There is no `br label %cond.false` here. mehdi_amini: There is no `br label %cond.false` here.
		//===----------------------------------------------------------------------===//
		mehdi_aminiUnsubmitted Not Done Reply Inline Actions Thanks for the extensive comment and the example, unfortunately this is not really the information I was looking for. Phi nodes are dependent on the control flow, and we need to mark predecessors live in general, I got that part, but what isn't just clear to me is why only some blocks are marked live. For instance: define i8 @foo(i1 %cond1, i1 %cond2, i8 %a, i8 %b, i8 %c, i8 %d) nounwind { entry: br i1 %cond1, label %first_true, label %first_false first_true: br i1 %cond2, label %second_true, label %second_false first_false: br label %end second_true: br label %end second_false: br label %end end: %result = phi i8 [ %a, %first_false ], [ %b, %second_true ], [ %c, %second_false ] ret i8 %result } Assuming none of the `br label %end` is live, and considering the only phi, as written, all predecessors will be marked live. If I replace the phi with `phi i8 [ %a, %first_false ], [ %a, %second_true ], [ %c, %second_false ]` , then only `%second_false` is marked live. Now, what I find annoying is that with the same edges but just changing the order of the operands, you would get a different results. For instance: `phi i8 [ %c, %second_false ], [ %a, %first_false ], [ %a, %second_true ]` now `%second_false` is not marked live and instead `%first_false` and `%second_true` are. mehdi_amini: Thanks for the extensive comment and the example, unfortunately this is not really the…
		david2050AuthorUnsubmitted Not Done Reply Inline Actions You are correct the it is sensitive to the order. Marking one is sufficient for correctness. I have considered marking the lexically first one as live which would provide more stability but it is a rare occurrence so it is not clear if it is worth fixing. I don't have any collected data on this. david2050: You are correct the it is sensitive to the order. Marking one is sufficient for correctness. I…
		david2050AuthorUnsubmitted Not Done Reply Inline Actions I ran an experiment by prioritize that we keep the lexically last defined value. This should bias toward forcing live earlier branch decisions which are likely to be forced live anyway if we mark a later branch live. This had minimal impact on the my set of C++ tests (two changes, in different directions). david2050: I ran an experiment by prioritize that we keep the lexically last defined value. This should…
		//
		// Routines to update the CFG and SSA information before removing dead code.
		mehdi_aminiUnsubmitted Done Reply Inline Actions s/CommonReachngDefintion/CommonReachingDefinition/ mehdi_amini: s/CommonReachngDefintion/CommonReachingDefinition/
		//
		//===----------------------------------------------------------------------===//
bool AggressiveDeadCodeElimination::removeDeadInstructions() {		bool AggressiveDeadCodeElimination::removeDeadInstructions() {

// The inverse of the live set is the dead set. These are those instructions		// The inverse of the live set is the dead set. These are those instructions
// which have no side effects and do not influence the control flow or return		// which have no side effects and do not influence the control flow or return
// value of the function, and may therefore be deleted safely.		// value of the function, and may therefore be deleted safely.
// NOTE: We reuse the Worklist vector here for memory efficiency.		// NOTE: We reuse the Worklist vector here for memory efficiency.
for (Instruction &I : instructions(F)) {		for (Instruction &I : instructions(F)) {
// Check if the instruction is alive.		// Check if the instruction is alive.
if (isLive(&I))		if (isLive(&I))
		dberlinUnsubmitted Not Done Reply Inline Actions I'm curious why you bother with this detection? Pretty much every other pass we have can already do a better job of it :) That is, they can determine not just if they are dead or trivially equivalent, but produce equivalent values. You are basically detecting the block is not necessary to produce that value to the phi node. That's, IMHO, more of a redundancy elimination technique than a dead code one. The code isn't dead, it's just equivalent to some other code. Or am i missing something? Do you have examples where the standard pass pipeline doesn't remove the block producing the equivalent value? dberlin: I'm curious why you bother with this detection? Pretty much every other pass we have can…
		david2050AuthorUnsubmitted Not Done Reply Inline Actions This step is necessary otherwise we delete a branch which determines a live value. It is an artifact of the semantics of PHI nodes. This is about preserving existing structure not translating to a new form. And Yes, outside of loops, the baseline code catches almost all cases. The primary motivation was dead, may-be-infinute loops. david2050: This step is necessary otherwise we delete a branch which determines a live value. It is an…
		dberlinUnsubmitted Not Done Reply Inline Actions I understand the semantics of phi nodes :) I'm asking why you don't just mark them all live. Traditionally, in DCE, what would happen is that if the phi is live, you mark all values that are incoming to the phi as live (and in a control dependence DCE, any control dependences of the necessary edges) Here, it seems like you are trying to figure out if the phi is useless by seeing if all branches have the same incoming value, and if so, you are arbitrarily choosing one so you can delete at least one branch. That seems .. pointless. It should only also matter in the case of either predecessors that only exist for control flow (IE empty basic blocks), or critical edges. The only interesting loop case i'm aware of where this matters is: if (b) { int i; for (i = 0; i<1000; ++i); j = 0; } return j; } But your current code won't allow deleting the loop in this case, AFAICT. (among other things, you would need to determine what the control dependence would look like if the critical edge was split without splitting it. This is what GCC does) dberlin: I understand the semantics of phi nodes :) I'm asking why you don't just mark them all live.
		dberlinUnsubmitted Not Done Reply Inline Actions (and to be super clear, when i say "I'm asking why you don't just mark them all live.", i mean "all the arguments and edges of a given phi node", not "all phis"). dberlin: (and to be super clear, when i say "I'm asking why you don't just mark them all live.", i…
		david2050AuthorUnsubmitted Not Done Reply Inline Actions It is not all branches, it is all branches from dead predecessors since in the case that we should remove all such predecessors to be replaced with a single edge, we verify that there is a unique value for that edge. If not we select one predecessor to keep live. Consider a = 0 if (x) { if (y) a =1 else a = 2 } else a =3 phi(...) The phi node following this block has tree edges labeled with values 1,2,3 respectively. The guarded basic blocks are empty because the assignments are folded into the phi node. The branch decision at which tests 'y' is dead at this point because there are no live operations in any block it controls. Here we determine that we need to keep at least one of the empty blocks around to distinguish between 1 and 2, which models the behavior if the assignments to 'a' were instead explicitly in the original blocks. Given a more complex example we might have a = 0 if (x) { if (z) a=1 else { if (y) { } else { } } } else a =3 phi(...) Here there are 4 predecessors to the join, three are 2 but only two distinct values and we only need to have the branch on 'z' live. The branch on 'y' need not be. This is why the code does not mark all incoming edges live but does not do the extra work to try and pick and optimal choice (the path with a=1 in this case). The scenario is rare enough that it is not clear extra work is worth the effort. In your example, I believe the current code will allow the remove the loop since the assignment to J is not control dependent on the loop but only on the branch testing 'b'. The subset of the CFG corresponding to the loop would be all marked dead and removed and the edge leaving the block holding the test on 'b' redirected to the block holding 'j=0'. I don't understand your reference to splitting a critical edge. david2050: It is not all branches, it is all branches from dead predecessors since in the case that we…
		dberlinUnsubmitted Not Done Reply Inline Actions (Sorry if this is short, i lost this comment once already). Okay, we are talking past each other. I understand what you are doing, and why you are trying to mark only certain values of a phi live. I'm saying "this is not worth it, and is complex" compared to just marking all of them. You produced an example showing it removes a single block. I agree it can remove that block. Instead of arguing by example, i produced data: I ran your patch on ~5000 C++ packages as is, and then ran it on ~5000 C++ packages with it changed to just mark all phi node inputs as live when the phi node is live. The number of cases i find where it makes a difference is ~0.01%. The average size of the binaries in which it does something are ~16 bytes smaller (which, for those binaries, amounts to an average size change of 0.00005%) That seems "not worth it" to me :) Now, i'm just trying to provide data not actually to say "we shouldn't do it this way", but more "if we are going to add complexity, let's do it where we have data that shows it's worth it". We both can clearly come up with contrived examples where it matters. So if you have data showing trying to only mark single branches of unique values live is worth it, great, add the data and let's stop arguing, and let's do it. "In your example, I believe the current code will allow the remove the loop since the assignment to J is not control dependent on the loop but only on the branch testing 'b'. The subset of the CFG corresponding to the loop would be all marked dead and removed and the edge leaving the block holding the test on 'b' redirected to the block holding 'j=0'. I don't understand your reference to splitting a critical edge." You should try it before you knock it :) You may have to change it to j=b or add a new parameter somewhere, depending on what passes you run before it (otherwise the phi ends up with the constant propagated into it). You will end up with a phi node in the end block that merges the else-condition value (IE from outside the condition entirely) of jwith the inside-condition value. Control-dependence will force you to keep the empty loop live due to the critical edge (again, whether the edge stays critical or not depends on where you end up in the pass pipeline) This case is more common, it affects at least ~1% of the packages. I did not fix it, so i can't say how much it saves or not saves. It is common enough that gcc special cases it (as i said) and has regression tests to make sure the loop is eliminated by DCE. dberlin: (Sorry if this is short, i lost this comment once already). Okay, we are talking past each…
		david2050AuthorUnsubmitted Not Done Reply Inline Actions Thanks for great due diligence for the patch. The experiment you ran seems doomed because the patches submitted so far are incomplete. If you look around line 305 there is loop which forces all branches live. This is temporary waiting for the next patch which includes the code to delete branches. Right now the patch reduces to prior behavior of not actually deleting in branches although it takes a long way to get there. Thus any changes you see based on your edit are curious since it should not have any impact on the generated code at all. I certainly agree that even the relatively simple pick-one-value here may have no value. Perhaps we can accept this change as is, with a TODO to evaluate this point after the next (and final) patch is up? (Assuming there is nothing else to change along the way) david2050: Thanks for great due diligence for the patch. The experiment you ran seems doomed because the…
		dberlinUnsubmitted Not Done Reply Inline Actions I feel like you are implying something here. In this case, I simply hacked in the right parts of the rest of the old patch (and a little work) before it was broken up before running the experiment. (You could have simply asked for the code if you are concerned). You are talking about something that is < 50 lines of code to make work (control dependence DCE is not that hard, nor is this even the first implementation of it in LLVM!), so i'm not sure why you would think i didn't run the experiment properly? (I could also point out i don't actually care about correctness of the generated code, only the maximum optimization value) If you would like to actually run your own experiment show that this is worth it, you are, as i said, welcome. You may want to focus on that instead of trying to pick mine apart ;) Right now, like I said, i see nothing that makes it seem like this is worth it, and your response here doesn't help that. If you want to produce data that says it's worth it, great!. Otherwise, I'm loathe to start by adding complexity and then try to take it away later. That never works. dberlin: I feel like you are implying something here. In this case, I simply hacked in the right…
		david2050AuthorUnsubmitted Not Done Reply Inline Actions No implications, and apologies for suggesting you were incomplete in your analysis. To be clear then, in your variant, you marked every basic block which is a predecessor of a live-phi as live? david2050: No implications, and apologies for suggesting you were incomplete in your analysis. To be…
continue;		continue;

assert(!I.isTerminator() && "NYI: Removing Control Flow");		assert(!I.isTerminator() && "NYI: Removing Control Flow");

if (auto *DII = dyn_cast<DbgInfoIntrinsic>(&I)) {		if (auto *DII = dyn_cast<DbgInfoIntrinsic>(&I)) {
// Check if the scope of this variable location is alive.		// Check if the scope of this variable location is alive.
if (AliveScopes.count(DII->getDebugLoc()->getScope()))		if (AliveScopes.count(DII->getDebugLoc()->getScope()))
continue;		continue;
▲ Show 20 Lines • Show All 72 Lines • Show Last 20 Lines