This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
include/llvm/Support/
-
llvm/
-
Support/
-
GenericDomTree.h
-
GenericDomTreeConstruction.h
-
lib/Transforms/Scalar/
-
Transforms/
-
Scalar/
-
ADCE.cpp
-
test/
-
Analysis/
-
PostDominators/
-
pr24415.ll
-
pr6047_a.ll
-
pr6047_b.ll
-
pr6047_c.ll
-
pr6047_d.ll
-
RegionInfo/
-
infinite_loop.ll
-
infinite_loop_2.ll
-
infinite_loop_3.ll
-
infinite_loop_4.ll
-
infinite_loop_5_a.ll
-
infinite_loop_5_b.ll
-
Transforms/StructurizeCFG/
-
StructurizeCFG/
-
branch-on-argument.ll
-
no-branch-to-entry.ll

Differential D29705

Fix PR 24415 (at least), by making our post-dominator tree behavior sane.
ClosedPublic

Authored by • dberlin on Feb 8 2017, 12:16 AM.

Download Raw Diff

Details

Reviewers

chandlerc
davide

Commits

rG03f6938edcaf: Fix PR 24415 (at least), by making our post-dominator tree behavior sane.
rL296535: Fix PR 24415 (at least), by making our post-dominator tree behavior sane.

Summary

Currently, our post-dom tree tries to ignore and remove the effects of
infinite loops. It fails miserably at this, because it tries to do it
ahead of time, and thus can only detect self-loops, and any other type
of infinite loop, it pretends doesn't exist at all.

This can, in a bunch of cases, lead to wrong answers and a completely
empty post-dom tree.

Wrong answer:

declare void foo()
define internal void @f() {
entry:
  br i1 undef, label %bb35, label %bb3.i

bb3.i:
  call void @foo()
  br label %bb3.i

bb35.loopexit3:
  br label %bb35

bb35:
  ret void
}

We get:

Inorder PostDominator Tree:
  [1]  <<exit node>> {0,7}
    [2] %bb35 {1,6}
      [3] %bb35.loopexit3 {2,3}
      [3] %entry {4,5}

This is a trivial modification of the testcase for PR 6047
Note that we pretend bb3.i doesn't exist.
We also pretend that bb35 post-dominates entry.

While it's true that it does not exit in a theoretical sense, it's not
really helpful to try to ignore the effect and pretend that bb35
post-dominates entry. Worse, we pretend the infinite loop does
nothing (it's usually considered a side-effect), and doesn't even
exist, even when it calls a function. Sadly, this makes it impossible
to use when you are trying to move code safely. All compilers also
create virtual or real single exit nodes (including us), and connect
infinite loops there (which this patch does). In fact, others have
worked around our behavior here, to the point of building their own
post-dom trees:
https://zneak.github.io/fcd/2016/02/17/structuring.html and pointing
out the region infrastructure is near-useless for them with postdom in
this state :(

Completely empty post-dom tree:

define void @spam() #0 {
bb:
  br label %bb1

bb1:                                              ; preds = %bb1, %bb
  br label %bb1

bb2:                                              ; No predecessors!
  ret void
}

Printing analysis 'Post-Dominator Tree Construction' for function 'foo':
=============================--------------------------------
Inorder PostDominator Tree:
  [1]  <<exit node>> {0,1}

(note that even if you ignore the effects of infinite loops, bb2
should be present as an exit node that post-dominates nothing).

This patch changes post-dom to properly handle infinite loops and does
root finding during calculation to prevent empty tress in such cases.

We match gcc's (and the canonical theoretical) behavior for infinite
loops (find the backedge, connect it to the exit block).

Testcases coming as soon as i finish running this on a ton of random graphs :)

Diff Detail

Repository: rL LLVM

Event Timeline

• dberlin created this revision.Feb 8 2017, 12:16 AM

• dberlin edited the summary of this revision. (Show Details)Feb 8 2017, 12:17 AM

• dberlin edited the summary of this revision. (Show Details)

• dberlin added inline comments.Feb 8 2017, 12:21 AM

include/llvm/Support/GenericDomTreeConstruction.h
46 ↗	(On Diff #87601)	Sorry, this was part of an alternate approach, will revert.
172 ↗	(On Diff #87601)	Keen observers will note that this ::size call was O(N) since it's an ilist. :( So the code i added really only adds O(N) time worst case, since we already wasted O(N) time just counting blocks above.
194 ↗	(On Diff #87601)	This comment needs updating a bit. It really ends up finding the infinite loop backedge block, which is what we want. (note that post-dom is not unique, so if there are multiple backedges, you can't win, there is no "best" one)
249 ↗	(On Diff #87601)	This got clang-formatted, i'll revert
285 ↗	(On Diff #87601)	and this is leftover from a dead approach too. i'll remove.

Minor things.

include/llvm/Support/GenericDomTreeConstruction.h
165 ↗	(On Diff #87601)	Ranged-for? `for (NodeType *BB: make_range(FuncGraphT::nodes_begin(&F), FuncGraphT::nodes_end(&F))) {`.
177 ↗	(On Diff #87601)	Typo? "make. block."
184 ↗	(On Diff #87601)	Ranged-for?
247 ↗	(On Diff #87601)	Ranged?

• dberlin added inline comments.Feb 8 2017, 6:37 AM

include/llvm/Support/GenericDomTreeConstruction.h
165 ↗	(On Diff #87601)	Honestly, i would rather us just improve graph traits to provide ranges here. I'm not sure i believe the ranged for version is more readable/easier to understand with make_range. i'm happy to do that as a followup to add "nodes" and "children" to graph traits.

The approach looks good to me, if you add a testcase and revert the unrelated changes I'll give it another look (possibly today)

I think a test for infinite loops (the one attached to the PR or/and slight modification of them) are fine (at least for me)

I have updated all tests.
The region ones were particularly nutso before (IHMO), and now look sane.

there is one failure in ADCE, because previously postdom gave it an empty tree, and now gives it a real tree.
It expects that all not-reachable-from-exit nodes will not have a dom tree node, which is not correct.
It also assumes it will be able to find a safe place to redirect an edge that still exits the function, which is also not correct.
Fixes for both coming.
Then i am going to look at the two structurize cfg failures.

adding the last two people to touch structurize CFG.

The code is mostly undocumented, and it seems ... interesting, in many ways .
It has an obsession with assuming regions are started and terminated with branchinsts, when they could, for example, be switches with one successor (it looks like it forces switch lowering instead of fixing the code).
It also gets successors of blocks by trying to grab terminators and walk over the terminator successors, instead of just walking the successors directly.

In any case, there is nothing to fix for these testcases.
There are two failures.
In branch-on-argument.ll, we fail invert_branch_on_arg_inf_loop.
This is because there are no SESE regions here once post-dom is correct.
The only SESE region is function entry to function exit, and structurizeCFG explicitly skips that region,

Ditto for no-branch-to-entry.ll
It's not a SESE region, so it no longer processes it.
I can't make a testcase where it wants to branch to entry anymore.

I'm XFAIL'ing it for someone to determine if it's really still needed. My guess is this was a side effect of broken SESE regions.

• dberlin added reviewers: jlebar, tstellar.Feb 19 2017, 5:28 PM

• dberlin removed reviewers: jlebar, tstellar.

• dberlin added subscribers: jlebar, tstellar.

Update with fixes for broken testcases, and review comments. Add test case for pr24415

Herald added a subscriber: david2050. · View Herald TranscriptFeb 19 2017, 5:33 PM

I'm fine with the structurize cfg changes.

Okay. At this point i'm pretty positive i can prove that any region that now it considers the entry to be the start of the loop will be a top-level region that structurizecfg will ignore, and we can delete the test (and associated code in structurizecfg/domtree to handle it). I'm going to do so unless someone objects.

We just gave a bogus region tree before:
Printing analysis 'Detect single entry single exit regions' for function 'no_branch_to_entry_true':

[0] entry => <Function Return>
{
  entry, for.end, for.body,
  [1] entry => for.end
  {
    entry, for.body,
  }
}

#1 is definitely not a sese region. It double definitely does not include for.body, which never exits and never goes to for.end :P :

Let's go from first principles, with a little help from wikipedia:
In graph theory, a single-entry single-exit (SESE) region in a given graph is an ordered edge pair (a, b) of distinct control flow edges a and b where:

a dominates b
b postdominates a
Every cycle containing a also contains b and vice versa.

In the old world, for.end post-dominates entry (and for.body is not in the postdomree), which is wrong:

=============================--------------------------------
Inorder PostDominator Tree:
  [1]  <<exit node>> {0,5}
    [2] %for.end {1,4}
      [3] %entry {2,3}

So it believes entry->for.end is a sese region.
Which would in fact, be true if there was no loop that postdom has ignored.
The region-former then uses the definition of dominance/post-dominance, and believes that such a region must also include the other preds of entry (IE for.body). This is even reasonable and correct given the regular definition of postdom.
But of course, in this case, postdom is wrong,and really,all of the blocks are siblings in the postdomtree.
In the new world,it properly forms the postdomtree:

[1]  <<exit node>> {0,7}
   [2] %for.end {1,2}
   [2] %entry {3,4}
   [2] %for.body {5,6}

There is no sese region to be formed here. The only way you could ever end up with a sese region loop that involves entry is a normal loop:

define void @no_branch_to_entry_undef(i32 addrspace(1)* %out) {
entry:
  br i1 undef, label %for.end, label %for.body
for.body:                                         ; preds = %entry, %for.body
  store i32 999, i32 addrspace(1)* %out, align 4
  br i1 undef, label %for.body, label %for.end
for.end:                                          ; preds = %Flow
  ret void
}

It does not try to modify the entry block for such a normal loop.

Thus, i believe all of this hackery is now dead.

(and obviously, i'll xfail in this patch, and then delete it wit associated hackery in a followup)

Ping. This is ready to go in and I have a new verifier that depends on it

The Domtree changes look good to me but I can't comment on StructurizeCFG

This revision is now accepted and ready to land.Feb 28 2017, 11:31 AM

LGTM, then

Closed by commit rL296535: Fix PR 24415 (at least), by making our post-dominator tree behavior sane. (authored by dannyb). · Explain WhyFeb 28 2017, 3:09 PM

This revision was automatically updated to reflect the committed changes.

kuhar mentioned this in D35851: [Dominators] Include infinite loops in PostDominatorTree.Jul 25 2017, 12:22 PM

kuhar mentioned this in rL310940: [Dominators] Include infinite loops in PostDominatorTree.Aug 15 2017, 11:17 AM

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

Support/

GenericDomTree.h

10 lines

GenericDomTreeConstruction.h

89 lines

lib/

Transforms/

Scalar/

ADCE.cpp

51 lines

test/

Analysis/

PostDominators/

18 lines

7 lines

8 lines

51 lines

10 lines

RegionInfo/

4 lines

11 lines

19 lines

14 lines

1 line

1 line

Transforms/

StructurizeCFG/

branch-on-argument.ll

9 lines

no-branch-to-entry.ll

1 line

Diff 90090

llvm/trunk/include/llvm/Support/GenericDomTree.h

Show First 20 Lines • Show All 764 Lines • ▼ Show 20 Lines	void updateDFSNumbers() const {
}		}

SlowQueries = 0;		SlowQueries = 0;
DFSInfoValid = true;		DFSInfoValid = true;
}		}

/// recalculate - compute a dominator tree for the given function		/// recalculate - compute a dominator tree for the given function
template <class FT> void recalculate(FT &F) {		template <class FT> void recalculate(FT &F) {
typedef GraphTraits<FT *> TraitsTy;
reset();		reset();
this->Vertex.push_back(nullptr);		this->Vertex.push_back(nullptr);

if (!this->IsPostDominators) {		if (!this->IsPostDominators) {
// Initialize root
NodeT *entry = TraitsTy::getEntryNode(&F);
addRoot(entry);

Calculate<FT, NodeT >(this, F);		Calculate<FT, NodeT >(this, F);
} else {		} else {
// Initialize the roots list
for (auto *Node : nodes(&F))
if (TraitsTy::child_begin(Node) == TraitsTy::child_end(Node))
addRoot(Node);

Calculate<FT, Inverse<NodeT >>(this, F);		Calculate<FT, Inverse<NodeT >>(this, F);
}		}
}		}
};		};

// These two functions are declared out of line as a workaround for building		// These two functions are declared out of line as a workaround for building
// with old (< r147295) versions of clang because of pr11642.		// with old (< r147295) versions of clang because of pr11642.
template <class NodeT>		template <class NodeT>
Show All 26 Lines

llvm/trunk/include/llvm/Support/GenericDomTreeConstruction.h

Show All 19 Lines
/// faster than the almost-linear O(n*alpha(n)) version, even for large CFGs.		/// faster than the almost-linear O(n*alpha(n)) version, even for large CFGs.
///		///
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_SUPPORT_GENERICDOMTREECONSTRUCTION_H		#ifndef LLVM_SUPPORT_GENERICDOMTREECONSTRUCTION_H
#define LLVM_SUPPORT_GENERICDOMTREECONSTRUCTION_H		#define LLVM_SUPPORT_GENERICDOMTREECONSTRUCTION_H

#include "llvm/ADT/DepthFirstIterator.h"		#include "llvm/ADT/DepthFirstIterator.h"
		#include "llvm/ADT/PostOrderIterator.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/Support/GenericDomTree.h"		#include "llvm/Support/GenericDomTree.h"

namespace llvm {		namespace llvm {

// External storage for depth first iterator that reuses the info lookup map		// External storage for depth first iterator that reuses the info lookup map
// domtree already has. We don't have a set, but a map instead, so we are		// domtree already has. We don't have a set, but a map instead, so we are
// converting the one argument insert calls.		// converting the one argument insert calls.
template <class NodeRef, class InfoType> struct df_iterator_dom_storage {		template <class NodeRef, class InfoType> struct df_iterator_dom_storage {
public:		public:
typedef DenseMap<NodeRef, InfoType> BaseSet;		typedef DenseMap<NodeRef, InfoType> BaseSet;
df_iterator_dom_storage(BaseSet &Storage) : Storage(Storage) {}		df_iterator_dom_storage(BaseSet &Storage) : Storage(Storage) {}

typedef typename BaseSet::iterator iterator;		typedef typename BaseSet::iterator iterator;
std::pair<iterator, bool> insert(NodeRef N) {		std::pair<iterator, bool> insert(NodeRef To) {
return Storage.insert({N, InfoType()});		auto Result = Storage.insert({To, InfoType()});

		return Result;
}		}
void completed(NodeRef) {}		void completed(NodeRef) {}

private:		private:
BaseSet &Storage;		BaseSet &Storage;
};		};

template <class GraphT>		template <class GraphT>
unsigned ReverseDFSPass(DominatorTreeBaseByGraphTraits<GraphT> &DT,		unsigned ReverseDFSPass(DominatorTreeBaseByGraphTraits<GraphT> &DT,
typename GraphT::NodeRef V, unsigned N) {		typename GraphT::NodeRef V, unsigned N) {
df_iterator_dom_storage<		df_iterator_dom_storage<
typename GraphT::NodeRef,		typename GraphT::NodeRef,
typename DominatorTreeBaseByGraphTraits<GraphT>::InfoRec>		typename DominatorTreeBaseByGraphTraits<GraphT>::InfoRec>
DFStorage(DT.Info);		DFStorage(DT.Info);
bool IsChildOfArtificialExit = (N != 0);
for (auto I = idf_ext_begin(V, DFStorage), E = idf_ext_end(V, DFStorage);		for (auto I = idf_ext_begin(V, DFStorage), E = idf_ext_end(V, DFStorage);
I != E; ++I) {		I != E; ++I) {
typename GraphT::NodeRef BB = *I;		typename GraphT::NodeRef BB = *I;
auto &BBInfo = DT.Info[BB];		auto &BBInfo = DT.Info[BB];
BBInfo.DFSNum = BBInfo.Semi = ++N;		BBInfo.DFSNum = BBInfo.Semi = ++N;
BBInfo.Label = BB;		BBInfo.Label = BB;
// Set the parent to the top of the visited stack. The stack includes us,		// Set the parent to the top of the visited stack. The stack includes us,
// and is 1 based, so we subtract to account for both of these.		// and is 1 based, so we subtract to account for both of these.
if (I.getPathLength() > 1)		if (I.getPathLength() > 1)
BBInfo.Parent = DT.Info[I.getPath(I.getPathLength() - 2)].DFSNum;		BBInfo.Parent = DT.Info[I.getPath(I.getPathLength() - 2)].DFSNum;
DT.Vertex.push_back(BB); // Vertex[n] = V;		DT.Vertex.push_back(BB); // Vertex[n] = V;

if (IsChildOfArtificialExit)
BBInfo.Parent = 1;

IsChildOfArtificialExit = false;
}		}
return N;		return N;
}		}
template <class GraphT>		template <class GraphT>
unsigned DFSPass(DominatorTreeBaseByGraphTraits<GraphT> &DT,		unsigned DFSPass(DominatorTreeBaseByGraphTraits<GraphT> &DT,
typename GraphT::NodeRef V, unsigned N) {		typename GraphT::NodeRef V, unsigned N) {
df_iterator_dom_storage<		df_iterator_dom_storage<
typename GraphT::NodeRef,		typename GraphT::NodeRef,
▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	typename GraphT::NodeRef Eval(DominatorTreeBaseByGraphTraits<GraphT> &DT,

return VInInfo.Label;		return VInInfo.Label;
}		}

template <class FuncT, class NodeT>		template <class FuncT, class NodeT>
void Calculate(DominatorTreeBaseByGraphTraits<GraphTraits<NodeT>> &DT,		void Calculate(DominatorTreeBaseByGraphTraits<GraphTraits<NodeT>> &DT,
FuncT &F) {		FuncT &F) {
typedef GraphTraits<NodeT> GraphT;		typedef GraphTraits<NodeT> GraphT;
		typedef GraphTraits<FuncT *> FuncGraphT;
static_assert(std::is_pointer<typename GraphT::NodeRef>::value,		static_assert(std::is_pointer<typename GraphT::NodeRef>::value,
"NodeRef should be pointer type");		"NodeRef should be pointer type");
typedef typename std::remove_pointer<typename GraphT::NodeRef>::type NodeType;		typedef typename std::remove_pointer<typename GraphT::NodeRef>::type NodeType;

unsigned N = 0;		unsigned N = 0;
bool MultipleRoots = (DT.Roots.size() > 1);		bool NeedFakeRoot = DT.isPostDominator();
if (MultipleRoots) {		// If this is post dominators, push a fake node to start
		if (NeedFakeRoot) {
auto &BBInfo = DT.Info[nullptr];		auto &BBInfo = DT.Info[nullptr];
BBInfo.DFSNum = BBInfo.Semi = ++N;		BBInfo.DFSNum = BBInfo.Semi = ++N;
BBInfo.Label = nullptr;		BBInfo.Label = nullptr;

DT.Vertex.push_back(nullptr); // Vertex[n] = V;		DT.Vertex.push_back(nullptr); // Vertex[n] = V;
		} else {
		// The root is the entry block of the CFG
		DT.addRoot(FuncGraphT::getEntryNode(&F));
}		}

// Step #1: Number blocks in depth-first order and initialize variables used		// Step #1: Number blocks in depth-first order and initialize variables used
// in later stages of the algorithm.		// in later stages of the algorithm.
if (DT.isPostDominator()){		if (DT.isPostDominator()) {
for (unsigned i = 0, e = static_cast<unsigned>(DT.Roots.size());		unsigned Total = 0;
i != e; ++i)		for (auto I : nodes(&F)) {
N = ReverseDFSPass<GraphT>(DT, DT.Roots[i], N);		++Total;
		// If it has no successors, it is definitely a root.
		if (FuncGraphT::child_begin(I) == FuncGraphT::child_end(I)) {
		N = ReverseDFSPass<GraphT>(DT, I, N);
		DT.Info[I].Parent = 1;
		DT.addRoot(I);
		}
		}
		// Accounting for the virtual exit, see if we had any unreachable nodes
		if (Total + 1 != N ) {
		// Make another DFS pass over all other nodes to find the unreachable
		// blocks, and find the furthest paths we'll be able to make.
		// Note that this looks N^2, but it's really 2N worst case, if every node
		// is unreachable. This is because we are still going to only visit each
		// unreachable node once, we may just visit it in two directions,
		// depending on how lucky we get.
		SmallPtrSet<NodeType *, 4> ConnectToExitBlock;
		for (auto I : nodes(&F))
		if (!DT.Info.count(I)) {
		// Find the furthest away we can get by following successors, then
		// follow them in reverse. This gives us some reasonable answer about
		// the post-dom tree inside any infinite loop. In particular, it
		// guarantees we get to the farthest away point along some
		// path. This also matches GCC behavior. If we really wanted a
		// totally complete picture of dominance inside this infinite loop, we
		// could do it with SCC-like algorithms to find the lowest and highest
		// points in the infinite loop. In theory, it would be nice to give
		// the canonical backedge for the loop, but it's expensive.
		auto FurthestAway = po_begin(I);
		ConnectToExitBlock.insert(FurthestAway);
		N = ReverseDFSPass<GraphT>(DT, FurthestAway, N);
		}
		// Finally, now everything should be visited, and anything with parent
		// ==
		// 0 should be connected to virtual exit.
		for (auto *Node : ConnectToExitBlock) {
		auto FindResult = DT.Info.find(Node);
		assert(FindResult != DT.Info.end() &&
		"Everything should have been visited by now");
		if (FindResult->second.Parent == 0) {
		FindResult->second.Parent = 1;
		DT.addRoot(Node);
		}
		}
		}
} else {		} else {
N = DFSPass<GraphT>(DT, DT.Roots[0], N);		N = DFSPass<GraphT>(DT, GraphTraits<FuncT *>::getEntryNode(&F), N);
}		}

// it might be that some blocks did not get a DFS number (e.g., blocks of
// infinite loops). In these cases an artificial exit node is required.
MultipleRoots \|= (DT.isPostDominator() && N != GraphTraits<FuncT*>::size(&F));

// When naively implemented, the Lengauer-Tarjan algorithm requires a separate		// When naively implemented, the Lengauer-Tarjan algorithm requires a separate
// bucket for each vertex. However, this is unnecessary, because each vertex		// bucket for each vertex. However, this is unnecessary, because each vertex
// is only placed into a single bucket (that of its semidominator), and each		// is only placed into a single bucket (that of its semidominator), and each
// vertex's bucket is processed before it is added to any bucket itself.		// vertex's bucket is processed before it is added to any bucket itself.
//		//
// Instead of using a bucket per vertex, we use a single array Buckets that		// Instead of using a bucket per vertex, we use a single array Buckets that
// has two purposes. Before the vertex V with preorder number i is processed,		// has two purposes. Before the vertex V with preorder number i is processed,
// Buckets[i] stores the index of the first element in V's bucket. After V's		// Buckets[i] stores the index of the first element in V's bucket. After V's
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	void Calculate(DominatorTreeBaseByGraphTraits<GraphTraits<NodeT>> &DT,
// Step #4: Explicitly define the immediate dominator of each vertex		// Step #4: Explicitly define the immediate dominator of each vertex
for (unsigned i = 2; i <= N; ++i) {		for (unsigned i = 2; i <= N; ++i) {
typename GraphT::NodeRef W = DT.Vertex[i];		typename GraphT::NodeRef W = DT.Vertex[i];
typename GraphT::NodeRef &WIDom = DT.IDoms[W];		typename GraphT::NodeRef &WIDom = DT.IDoms[W];
if (WIDom != DT.Vertex[DT.Info[W].Semi])		if (WIDom != DT.Vertex[DT.Info[W].Semi])
WIDom = DT.IDoms[WIDom];		WIDom = DT.IDoms[WIDom];
}		}

if (DT.Roots.empty()) return;

// Add a node for the root. This node might be the actual root, if there is		// Add a node for the root. This node might be the actual root, if there is
// one exit block, or it may be the virtual exit (denoted by (BasicBlock *)0)		// one exit block, or it may be the virtual exit (denoted by (BasicBlock *)0)
// which postdominates all real exits if there are multiple exit blocks, or		// which postdominates all real exits if there are multiple exit blocks, or
// an infinite loop.		// an infinite loop.
typename GraphT::NodeRef Root = !MultipleRoots ? DT.Roots[0] : nullptr;		typename GraphT::NodeRef Root = NeedFakeRoot ? nullptr : DT.Roots[0];

DT.RootNode =		DT.RootNode =
(DT.DomTreeNodes[Root] =		(DT.DomTreeNodes[Root] =
llvm::make_unique<DomTreeNodeBase<NodeType>>(Root, nullptr))		llvm::make_unique<DomTreeNodeBase<NodeType>>(Root, nullptr))
.get();		.get();

// Loop over all of the reachable blocks in the function...		// Loop over all of the reachable blocks in the function...
for (unsigned i = 2; i <= N; ++i) {		for (unsigned i = 2; i <= N; ++i) {
Show All 30 Lines

llvm/trunk/lib/Transforms/Scalar/ADCE.cpp

Show First 20 Lines • Show All 247 Lines • ▼ Show 20 Lines	for (auto *BB: depth_first_ext(&F.getEntryBlock(), State)) {
if (State.onStack(Succ)) {		if (State.onStack(Succ)) {
// back edge....		// back edge....
markLive(Term);		markLive(Term);
break;		break;
}		}
}		}
}		}

// Mark blocks live if there is no path from the block to the		// Mark blocks live if there is no path from the block to a
// return of the function or a successor for which this is true.		// return of the function.
// This protects IDFCalculator which cannot handle such blocks.		// We do this by seeing which of the postdomtree root children exit the
for (auto &BBInfoPair : BlockInfo) {		// program, and for all others, mark the subtree live.
auto &BBInfo = BBInfoPair.second;		for (auto &PDTChild : children<DomTreeNode *>(PDT.getRootNode())) {
if (BBInfo.terminatorIsLive())		auto *BB = PDTChild->getBlock();
continue;		auto &Info = BlockInfo[BB];
auto *BB = BBInfo.BB;		// Real function return
if (!PDT.getNode(BB)) {		if (isa<ReturnInst>(Info.Terminator)) {
markLive(BBInfo.Terminator);		DEBUG(dbgs() << "post-dom root child is not a return: " << BB->getName()
continue;
}
for (auto *Succ : successors(BB))
if (!PDT.getNode(Succ)) {
markLive(BBInfo.Terminator);
break;
}
}

// Mark blocks live if there is no path from the block to the
// return of the function or a successor for which this is true.
// This protects IDFCalculator which cannot handle such blocks.
for (auto &BBInfoPair : BlockInfo) {
auto &BBInfo = BBInfoPair.second;
if (BBInfo.terminatorIsLive())
continue;
auto *BB = BBInfo.BB;
if (!PDT.getNode(BB)) {
DEBUG(dbgs() << "Not post-dominated by return: " << BB->getName()
<< '\n';);		<< '\n';);
markLive(BBInfo.Terminator);
continue;		continue;
}		}
for (auto *Succ : successors(BB))
if (!PDT.getNode(Succ)) {		// This child is something else, like an infinite loop.
DEBUG(dbgs() << "Successor not post-dominated by return: "		for (auto DFNode : depth_first(PDTChild))
<< BB->getName() << '\n';);		markLive(BlockInfo[DFNode->getBlock()].Terminator);
markLive(BBInfo.Terminator);
break;
}
}		}

// Treat the entry block as always live		// Treat the entry block as always live
auto *BB = &F.getEntryBlock();		auto *BB = &F.getEntryBlock();
auto &EntryInfo = BlockInfo[BB];		auto &EntryInfo = BlockInfo[BB];
EntryInfo.Live = true;		EntryInfo.Live = true;
if (EntryInfo.UnconditionalBranch)		if (EntryInfo.UnconditionalBranch)
markLive(EntryInfo.Terminator);		markLive(EntryInfo.Terminator);
▲ Show 20 Lines • Show All 380 Lines • Show Last 20 Lines

llvm/trunk/test/Analysis/PostDominators/pr24415.ll

				; RUN: opt < %s -postdomtree -analyze \| FileCheck %s
				; RUN: opt < %s -passes='print<postdomtree>' 2>&1 \| FileCheck %s

				; Function Attrs: nounwind ssp uwtable
				define void @foo() {
				br label %1

				; <label>:1 ; preds = %0, %1
				br label %1
				; No predecessors!
				ret void
				}

				; CHECK: Inorder PostDominator Tree:
				; CHECK-NEXT: [1] <<exit node>> {0,7}
				; CHECK-NEXT: [2] %2 {1,2}
				; CHECK-NEXT: [2] %1 {3,6}
				; CHECK-NEXT: [3] %0 {4,5}

llvm/trunk/test/Analysis/PostDominators/pr6047_a.ll

	; RUN: opt < %s -postdomtree -analyze \| FileCheck %s			; RUN: opt < %s -postdomtree -analyze \| FileCheck %s
	define internal void @f() {			define internal void @f() {
	entry:			entry:
	br i1 undef, label %bb35, label %bb3.i			br i1 undef, label %bb35, label %bb3.i

	bb3.i:			bb3.i:
	br label %bb3.i			br label %bb3.i

	bb35.loopexit3:			bb35.loopexit3:
	br label %bb35			br label %bb35

	bb35:			bb35:
	ret void			ret void
	}			}
	; CHECK: [3] %entry			;CHECK:Inorder PostDominator Tree:
				;CHECK-NEXT: [1] <<exit node>> {0,9}
				;CHECK-NEXT: [2] %bb35 {1,4}
				;CHECK-NEXT: [3] %bb35.loopexit3 {2,3}
				;CHECK-NEXT: [2] %entry {5,6}
				;CHECK-NEXT: [2] %bb3.i {7,8}

llvm/trunk/test/Analysis/PostDominators/pr6047_b.ll

	Show All 10 Lines


	bb35.loopexit3:			bb35.loopexit3:
	br label %bb35			br label %bb35

	bb35:			bb35:
	ret void			ret void
	}			}
	; CHECK: [4] %entry			; CHECK: Inorder PostDominator Tree:
				; CHECK-NEXT: [1] <<exit node>> {0,11}
				; CHECK-NEXT: [2] %bb35 {1,4}
				; CHECK-NEXT: [3] %bb35.loopexit3 {2,3}
				; CHECK-NEXT: [2] %a {5,6}
				; CHECK-NEXT: [2] %entry {7,8}
				; CHECK-NEXT: [2] %bb3.i {9,10}

llvm/trunk/test/Analysis/PostDominators/pr6047_c.ll

Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines	bb35.loopexit:
br label %bb35		br label %bb35

bb35.loopexit3:		bb35.loopexit3:
br label %bb35		br label %bb35

bb35:		bb35:
ret void		ret void
}		}
; CHECK: [3] %entry		; CHECK: Inorder PostDominator Tree:
		; CHECK-NEXT: [1] <<exit node>> {0,97}
		; CHECK-NEXT: [2] %bb35 {1,92}
		; CHECK-NEXT: [3] %bb35.loopexit3 {2,3}
		; CHECK-NEXT: [3] %bb35.loopexit {4,5}
		; CHECK-NEXT: [3] %bb31 {6,7}
		; CHECK-NEXT: [3] %bb30 {8,9}
		; CHECK-NEXT: [3] %bb30.loopexit1 {10,11}
		; CHECK-NEXT: [3] %bb30.loopexit {12,13}
		; CHECK-NEXT: [3] %bb23 {14,15}
		; CHECK-NEXT: [3] %bb23.us {16,17}
		; CHECK-NEXT: [3] %bb23.preheader {18,19}
		; CHECK-NEXT: [3] %bb23.us.preheader {20,21}
		; CHECK-NEXT: [3] %bb.nph {22,23}
		; CHECK-NEXT: [3] %bb29.preheader {24,25}
		; CHECK-NEXT: [3] %bb20 {26,27}
		; CHECK-NEXT: [3] %bb19 {28,29}
		; CHECK-NEXT: [3] %bb.nph14 {30,31}
		; CHECK-NEXT: [3] %bb17.loopexit.split {32,33}
		; CHECK-NEXT: [3] %bb16 {34,35}
		; CHECK-NEXT: [3] %bb15 {36,37}
		; CHECK-NEXT: [3] %bb15.loopexit2 {38,39}
		; CHECK-NEXT: [3] %bb15.loopexit {40,41}
		; CHECK-NEXT: [3] %bb8 {42,43}
		; CHECK-NEXT: [3] %bb8.us {44,45}
		; CHECK-NEXT: [3] %bb8.preheader {46,47}
		; CHECK-NEXT: [3] %bb8.us.preheader {48,49}
		; CHECK-NEXT: [3] %bb.nph18 {50,51}
		; CHECK-NEXT: [3] %bb14.preheader {52,53}
		; CHECK-NEXT: [3] %bb5 {54,55}
		; CHECK-NEXT: [3] %bb4 {56,57}
		; CHECK-NEXT: [3] %bb.nph21 {58,59}
		; CHECK-NEXT: [3] %bb3.i.loopexit.us {60,61}
		; CHECK-NEXT: [3] %bb8.i.us {62,63}
		; CHECK-NEXT: [3] %bb4.i.us {64,65}
		; CHECK-NEXT: [3] %bb6.i.us {66,67}
		; CHECK-NEXT: [3] %bb1.i.us {68,69}
		; CHECK-NEXT: [3] %bb.i4.us.backedge {70,71}
		; CHECK-NEXT: [3] %bb7.i.us {72,73}
		; CHECK-NEXT: [3] %bb.i4.us {74,75}
		; CHECK-NEXT: [3] %bb3.split.us {76,77}
		; CHECK-NEXT: [3] %bb3 {78,79}
		; CHECK-NEXT: [3] %bb32.preheader {80,81}
		; CHECK-NEXT: [3] %_float32_unpack.exit8 {82,83}
		; CHECK-NEXT: [3] %bb.i5 {84,85}
		; CHECK-NEXT: [3] %_float32_unpack.exit {86,87}
		; CHECK-NEXT: [3] %bb.i {88,89}
		; CHECK-NEXT: [3] %bb {90,91}
		; CHECK-NEXT: [2] %entry {93,94}
		; CHECK-NEXT: [2] %bb3.i {95,96}

llvm/trunk/test/Analysis/PostDominators/pr6047_d.ll

Show All 15 Lines	bb3.i:
br label %bb3.i		br label %bb3.i

bb35.loopexit3:		bb35.loopexit3:
br label %bb35		br label %bb35

bb35:		bb35:
ret void		ret void
}		}
; CHECK: [4] %entry		; CHECK: Inorder PostDominator Tree:
		; CHECK-NEXT: [1] <<exit node>> {0,15}
		; CHECK-NEXT: [2] %bb35 {1,4}
		; CHECK-NEXT: [3] %bb35.loopexit3 {2,3}
		; CHECK-NEXT: [2] %c {5,12}
		; CHECK-NEXT: [3] %b {6,7}
		; CHECK-NEXT: [3] %entry {8,9}
		; CHECK-NEXT: [3] %a {10,11}
		; CHECK-NEXT: [2] %bb3.i {13,14}

llvm/trunk/test/Analysis/RegionInfo/infinite_loop.ll

Show All 10 Lines	2:
br label %"2"		br label %"2"
3:		3:
br label %"4"		br label %"4"
4:		4:
ret void		ret void
}		}
; CHECK-NOT: =>		; CHECK-NOT: =>
; CHECK: [0] 0 => <Function Return>		; CHECK: [0] 0 => <Function Return>
; CHECK: [1] 1 => 4		; STAT: 1 region - The # of regions
; STAT: 2 region - The # of regions
; STAT: 1 region - The # of simple regions

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_2.ll

Show All 20 Lines	6:
br label %"2"		br label %"2"
3:		3:
br label %"4"		br label %"4"
4:		4:
ret void		ret void
}		}
; CHECK-NOT: =>		; CHECK-NOT: =>
; CHECK: [0] 0 => <Function Return>		; CHECK: [0] 0 => <Function Return>
; CHECK: [1] 1 => 3		; CHECK: [1] 5 => 6
; STAT: 2 region - The # of regions		; STAT: 2 region - The # of regions
; STAT: 1 region - The # of simple regions

; BBIT: 0, 1, 2, 5, 11, 6, 12, 3, 4,		; BBIT: 0, 1, 2, 5, 11, 6, 12, 3, 4,
; BBIT: 1, 2, 5, 11, 6, 12,		; BBIT: 5, 11, 12,

; RNIT: 0, 1 => 3, 3, 4,		; RNIT: 0, 1, 2, 5 => 6, 6, 3, 4,
; RNIT: 1, 2, 5, 11, 6, 12,		; RNIT: 5, 11, 12,

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_3.ll

	Show All 32 Lines
	10:			10:
	br label %"8"			br label %"8"
	3:			3:
	br label %"4"			br label %"4"
	4:			4:
	ret void			ret void
	}			}
	; CHECK-NOT: =>			; CHECK-NOT: =>
	; CHECK: [0] 0 => <Function Return>			; CHECK:[0] 0 => <Function Return>
	; CHECK-NEXT: [1] 1 => 3			; CHECK-NEXT: [1] 5 => 6
	; CHECK-NEXT: [1] 7 => 1			; CHECK-NEXT: [1] 9 => 10
	; STAT: 3 region - The # of regions			; STAT: 3 region - The # of regions
	; STAT: 2 region - The # of simple regions

	; BBIT: 0, 7, 1, 2, 5, 11, 6, 12, 3, 4, 8, 9, 13, 10, 14,			; BBIT: 0, 7, 1, 2, 5, 11, 6, 12, 3, 4, 8, 9, 13, 10, 14,
	; BBIT: 7, 8, 9, 13, 10, 14,			; BBIT: 5, 11, 12,
	; BBIT: 1, 2, 5, 11, 6, 12,			; BBIT: 9, 13, 14,

	; RNIT: 0, 7 => 1, 1 => 3, 3, 4,			; RNIT: 0, 7, 1, 2, 5 => 6, 6, 3, 4, 8, 9 => 10, 10,
	; RNIT: 7, 8, 9, 13, 10, 14,			; RNIT: 5, 11, 12,
	; RNIT: 1, 2, 5, 11, 6, 12,			; RNIT: 9, 13, 14,

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_4.ll

Show All 32 Lines	10:
br label %"8"		br label %"8"
3:		3:
br label %"4"		br label %"4"
4:		4:
ret void		ret void
}		}
; CHECK-NOT: =>		; CHECK-NOT: =>
; CHECK: [0] 0 => <Function Return>		; CHECK: [0] 0 => <Function Return>
; CHECK-NEXT: [1] 7 => 3		; CHECK-NEXT: [1] 2 => 10
; STAT: 2 region - The # of regions		; CHECK_NEXT: [2] 5 => 6
		; STAT: 3 region - The # of regions
; STAT: 1 region - The # of simple regions		; STAT: 1 region - The # of simple regions

; BBIT: 0, 7, 1, 2, 5, 11, 6, 10, 8, 9, 13, 14, 12, 3, 4,		; BBIT: 0, 7, 1, 2, 5, 11, 6, 10, 8, 9, 13, 14, 12, 3, 4,
; BBIT: 7, 1, 2, 5, 11, 6, 10, 8, 9, 13, 14, 12,		; BBIT: 2, 5, 11, 6, 12,
		; BBIT: 5, 11, 12,
; RNIT: 0, 7 => 3, 3, 4,		; RNIT: 0, 7, 1, 2 => 10, 10, 8, 9, 13, 14, 3, 4,
; RNIT: 7, 1, 2, 5, 11, 6, 10, 8, 9, 13, 14, 12,		; RNIT: 2, 5 => 6, 6,
		; RNIT: 5, 11, 12,

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_5_a.ll

	Show All 13 Lines
	3:			3:
	br label %"4"			br label %"4"
	4:			4:
	ret void			ret void
	}			}

	; CHECK: Region tree:			; CHECK: Region tree:
	; CHECK-NEXT: [0] 0 => <Function Return>			; CHECK-NEXT: [0] 0 => <Function Return>
	; CHECK-NEXT: [1] 7 => 3
	; CHECK-NEXT: End region tree			; CHECK-NEXT: End region tree

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_5_b.ll

	Show All 15 Lines
	3:			3:
	br label %"4"			br label %"4"
	4:			4:
	ret void			ret void
	}			}

	; CHECK: Region tree:			; CHECK: Region tree:
	; CHECK-NEXT: [0] 0 => <Function Return>			; CHECK-NEXT: [0] 0 => <Function Return>
	; CHECK-NEXT: [1] 7 => 3
	; CHECK-NEXT: End region tree			; CHECK-NEXT: End region tree

llvm/trunk/test/Transforms/StructurizeCFG/branch-on-argument.ll

	; RUN: opt -S -o - -structurizecfg < %s \| FileCheck %s			; RUN: opt -S -o - -structurizecfg < %s \| FileCheck %s

	; CHECK-LABEL: @invert_branch_on_arg_inf_loop(			; CHECK-LABEL: @invert_branch_on_arg_inf_loop(
	; CHECK: entry:			; CHECK: entry:
	; CHECK: %arg.inv = xor i1 %arg, true			; CHECK: %arg.inv = xor i1 %arg, true
	; CHECK: phi i1 [ false, %Flow1 ], [ %arg.inv, %entry ]
	define void @invert_branch_on_arg_inf_loop(i32 addrspace(1)* %out, i1 %arg) {			define void @invert_branch_on_arg_inf_loop(i32 addrspace(1)* %out, i1 %arg) {
	entry:			entry:
	br i1 %arg, label %for.end, label %for.body			br i1 %arg, label %for.end, label %sesestart
				sesestart:
				br label %for.body

	for.body: ; preds = %entry, %for.body			for.body: ; preds = %entry, %for.body
	store i32 999, i32 addrspace(1)* %out, align 4			store i32 999, i32 addrspace(1)* %out, align 4
	br label %for.body			br i1 %arg, label %for.body, label %seseend
				seseend:
				ret void

	for.end: ; preds = %Flow			for.end: ; preds = %Flow
	ret void			ret void
	}			}


	; CHECK-LABEL: @invert_branch_on_arg_jump_into_loop(			; CHECK-LABEL: @invert_branch_on_arg_jump_into_loop(
	; CHECK: entry:			; CHECK: entry:
	Show All 26 Lines

llvm/trunk/test/Transforms/StructurizeCFG/no-branch-to-entry.ll

				; XFAIL: *
	; RUN: opt -S -o - -structurizecfg -verify-dom-info < %s \| FileCheck %s			; RUN: opt -S -o - -structurizecfg -verify-dom-info < %s \| FileCheck %s

	; CHECK-LABEL: @no_branch_to_entry_undef(			; CHECK-LABEL: @no_branch_to_entry_undef(
	; CHECK: entry:			; CHECK: entry:
	; CHECK-NEXT: br label %entry.orig			; CHECK-NEXT: br label %entry.orig
	define void @no_branch_to_entry_undef(i32 addrspace(1)* %out) {			define void @no_branch_to_entry_undef(i32 addrspace(1)* %out) {
	entry:			entry:
	br i1 undef, label %for.end, label %for.body			br i1 undef, label %for.end, label %for.body
	Show All 23 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Fix PR 24415 (at least), by making our post-dominator tree behavior sane.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 90090

llvm/trunk/include/llvm/Support/GenericDomTree.h

llvm/trunk/include/llvm/Support/GenericDomTreeConstruction.h

llvm/trunk/lib/Transforms/Scalar/ADCE.cpp

llvm/trunk/test/Analysis/PostDominators/pr24415.ll

llvm/trunk/test/Analysis/PostDominators/pr6047_a.ll

llvm/trunk/test/Analysis/PostDominators/pr6047_b.ll

llvm/trunk/test/Analysis/PostDominators/pr6047_c.ll

llvm/trunk/test/Analysis/PostDominators/pr6047_d.ll

llvm/trunk/test/Analysis/RegionInfo/infinite_loop.ll

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_2.ll

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_3.ll

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_4.ll

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_5_a.ll

llvm/trunk/test/Analysis/RegionInfo/infinite_loop_5_b.ll

llvm/trunk/test/Transforms/StructurizeCFG/branch-on-argument.ll

llvm/trunk/test/Transforms/StructurizeCFG/no-branch-to-entry.ll

Fix PR 24415 (at least), by making our post-dominator tree behavior sane.
ClosedPublic