Details

Reviewers

kuhar
sanjoy
asbirlea

Commits

rGd990c2a9e23f: [Dominators] Simplify and optimize path compression used in link-eval forest.
rL354433: [Dominators] Simplify and optimize path compression used in link-eval forest.

Summary

NodeToInfo[*] entries have already been allocated, so their addresses are stable. We can store pointers to them instead of NodePtr to save NumToNode lookups.
Nodes are traversed twice. Using Visited to tell which traversal a node is in is expensive and obscure, so the two traversals are split into two explicit loops.
The check VInInfo.DFSNum < LastLinked is redundant, as it is implied by VInInfo->Parent < LastLinked.
VLabelInfo and PLabelInfo are used to save a NodeToInfo lookup in the second traversal.

Also add some comments explaining eval().
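The two-loop shape described in the summary can be sketched on a simplified model. This is only an illustration, not the code from the patch: it assumes vertices are identified directly by DFS number, and the names Info and Infos are hypothetical stand-ins for InfoRec and the NodeToInfo/NumToNode pair.

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified per-vertex record. Vertices are identified by DFS
// number, so Parent and Label are plain indices into the Infos vector.
struct Info {
  unsigned Parent; // parent in the virtual (link-eval) forest
  unsigned Label;  // vertex with the smallest Semi on the compressed path
  unsigned Semi;   // semidominator number
};

// Returns the label of V after compressing the path from V to the root of its
// virtual tree. Vertices whose Parent is >= LastLinked are considered linked.
unsigned eval(std::vector<Info> &Infos, unsigned V, unsigned LastLinked,
              std::vector<Info *> &Stack) {
  Info *VInfo = &Infos[V];
  if (VInfo->Parent < LastLinked)
    return VInfo->Label;

  // First loop: push every vertex on the path except the virtual-tree root.
  assert(Stack.empty());
  do {
    Stack.push_back(VInfo);
    VInfo = &Infos[VInfo->Parent];
  } while (VInfo->Parent >= LastLinked);

  // Second loop: walk back down, re-pointing each Parent past the compressed
  // path and carrying along the label with the smallest Semi seen so far.
  const Info *PInfo = VInfo;
  const Info *PLabelInfo = &Infos[PInfo->Label];
  do {
    VInfo = Stack.back();
    Stack.pop_back();
    VInfo->Parent = PInfo->Parent;
    const Info *VLabelInfo = &Infos[VInfo->Label];
    if (PLabelInfo->Semi < VLabelInfo->Semi)
      VInfo->Label = PInfo->Label;
    else
      PLabelInfo = VLabelInfo;
    PInfo = VInfo;
  } while (!Stack.empty());
  return VInfo->Label;
}
```

Because the stack holds pointers to records rather than vertex handles, the second loop needs no further lookup to reach a record, which is the saving the summary refers to. Passing the stack in by reference lets a caller reuse one allocation across many eval() calls.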

This shows a ~4.5% improvement (9.8444s -> 9.3996s) on

perf stat -r 10 taskset -c 0 ~/llvm/Release/bin/opt -passes=$(printf '%.0srequire<domtree>,invalidate<domtree>,' {1..1000})'require<domtree>' -disable-output sqlite-autoconf-3270100/sqlite3.bc

Diff Detail

Repository

rL LLVM

Build Status

Buildable 28315
Build 28314: arc lint + arc unit

Event Timeline

MaskRay created this revision. Feb 17 2019, 5:49 AM

Herald added a project: Restricted Project. Feb 17 2019, 5:49 AM

Herald added subscribers: llvm-commits, jdoerfert, kristina.

Harbormaster completed remote builds in B28236: Diff 187157. Feb 17 2019, 5:49 AM

Hi @MaskRay,

This looks like a fantastic improvement! The only issue is that the code, both original and after your changes, lacks comments.

For example, I can see you put a comment that identified that the second part of the function does path compression, but it's not clear how exactly, and what the first part of the function is doing.

Next, what is the motivation behind this change? Have you identified some bottlenecks in the path compression part, have you suffered from the pathological quadratic complexity of SemiNCA? Perhaps it would make more sense to implement a hybrid algorithm that uses Lengauer-Tarjan for the initial full construction and SemiNCA for incremental updates? I have been thinking about it for quite some time now, but there weren't any complaints about the quadratic complexity yet, thus I figured the additional maintenance cost is not justified.

What was your evaluation for this patch? What benchmarks did you run? What do the numbers look like?

This revision now requires changes to proceed. Feb 17 2019, 10:58 AM

kuhar added a subscriber: NutshellySima. Feb 17 2019, 1:54 PM

kuhar added a subscriber: brzycki.

Add comments

Harbormaster completed remote builds in B28240: Diff 187187. Feb 17 2019, 7:29 PM

In D58327#1400612, @kuhar wrote:

> Hi @MaskRay,

Thanks for the quick feedback!

> This looks like a fantastic improvement! The only issue is that the code, both original and after your changes, lacks comments.

> For example, I can see you put a comment that identified that the second part of the function does path compression, but it's not clear how exactly, and what the first part of the function is doing.

I added some comments explaining the eval() function as used in the link-eval forest. The file header didn't say Semi-NCA is O(N^2), so I have changed it to mention that; the original text discussing eval() time complexity may give people the false impression that the algorithm is almost linear or O(N log N) (it did for me when I was reading the implementation).

> Next, what is the motivation behind this change? Have you identified some bottlenecks in the path compression part, have you suffered from the pathological quadratic complexity of SemiNCA? Perhaps it would make more sense to implement a hybrid algorithm that uses Lengauer-Tarjan for the initial full construction and SemiNCA for incremental updates? I have been thinking about it for quite some time now, but there weren't any complaints about the quadratic complexity yet, thus I figured the additional maintenance cost is not justified.

No particular motivation when I started doing this :) I just came across this file and learned a bit about dominators over the weekend. I haven't suffered from the worst-case O(N^2) complexity and I doubt it happens in practice. sncaworst in Linear-Time Algorithms for Dominators and Related Problems requires an O(N) dominator tree path with O(N) children reachable from the top of the tree path. Anyway, I've changed the file header to mention this may require attention.

> What was your evaluation for this patch? What benchmarks did you run? What do the numbers look like?

This patch should be an obvious improvement, but it may be small. I think there are several other aspects that can be improved and may have a larger impact. The biggest problem, in my view, is the representation. The NumToNode hash table lookup immediately followed by a NodeToInfo lookup is a common pattern in the implementation and this may be expensive. Label can be an unsigned instead of a NodePtr to save some indirect lookups (this is the optimization opportunity we can leverage after the transition from SLT to Semi-NCA). IDom can probably do the same, but that seems to require more effort. DFSNum may be a candidate to be removed, etc. I don't understand why numbering starts from 1 but it is only a small aesthetic issue.
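To make the representation point concrete, here is a minimal sketch contrasting a pointer-valued Label with a number-valued one. It is an assumption-laden toy, not the layout used in GenericDomTreeConstruction.h: NodeId, InfoByPtr, InfoByNum and both functions are hypothetical names, and std::unordered_map merely stands in for a hash table.

```cpp
#include <cassert>
#include <unordered_map>
#include <vector>

using NodeId = const void *; // hypothetical stand-in for NodePtr

struct InfoByPtr {
  unsigned Semi;
  NodeId Label; // label stored as a node handle
};

struct InfoByNum {
  unsigned Semi;
  unsigned Label; // label stored as a DFS number
};

// Handle-valued label: reaching the label's Semi needs two hash lookups.
unsigned labelSemiByPtr(
    const std::unordered_map<NodeId, InfoByPtr> &NodeToInfo, NodeId V) {
  const InfoByPtr &VInfo = NodeToInfo.at(V); // hash lookup for V
  return NodeToInfo.at(VInfo.Label).Semi;    // hash lookup for V's label
}

// Number-valued label: the same query is two array indexing operations.
unsigned labelSemiByNum(const std::vector<InfoByNum> &NumToInfo, unsigned V) {
  assert(V < NumToInfo.size() && NumToInfo[V].Label < NumToInfo.size());
  return NumToInfo[NumToInfo[V].Label].Semi;
}
```

Whether this pays off in the real implementation would have to be measured; the sketch only shows where the indirect lookups disappear.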

I'll have to read An Experimental Study of Dynamic Dominators and the other parts of the dominator implementation in llvm first to get a better idea of what can be improved :)

I'm looking for suggestions on testing. What workflow do you use when changing this file? I'm currently thinking of IRTests and check-llvm-analysis when experimenting and check-llvm or check-all for comprehensive testing.

Could you please share yours? Especially the statistics in the talk Dominator Trees and incremental updates that transcend time...

Thanks for the comments, it looks amazing now.

> No particular motivation when I started doing this :) I just came across this file and learned a bit about dominators over the weekend.

Cool.

> I haven't suffered from the worst-case O(N^2) complexity and I doubt it happens in practice. sncaworst in Linear-Time Algorithms for Dominators and Related Problems requires an O(N) dominator tree path with O(N) children reachable from the top of the tree path. Anyway, I've changed the file header to mention this may require attention.

Great. It was indirectly mentioned in the part that described various checks, but it seems much more explicit now.

> This patch should be an obvious improvement, but it may be small. I think there are several other aspects that can be improved and may have a larger impact. The biggest problem, in my view, is the representation. The NumToNode hash table lookup immediately followed by a NodeToInfo lookup is a common pattern in the implementation and this may be expensive. Label can be an unsigned instead of a NodePtr to save some indirect lookups (this is the optimization opportunity we can leverage after the transition from SLT to Semi-NCA).

The patch changes the stack size for eval, which may have an unexpected impact on caching and code layout. This is often very surprising and difficult to reason about.
As expected, the current version of SNCA spends most of its time on DenseMap lookups, and profiles are not very informative (at least to me). Reducing the use of hash tables could be quite an improvement.

> IDom can probably do the same, but that seems to require more effort. DFSNum may be a candidate to be removed, etc. I don't understand why numbering starts from 1 but it is only a small aesthetic issue.
Keep in mind that IDom is a part of the public interface of DomTreeNode.

> I'm looking for suggestions on testing. What workflow do you use when changing this file? I'm currently thinking of IRTests and check-llvm-analysis when experimenting and check-llvm or check-all for comprehensive testing.
> Could you please share yours? Especially the statistics in the talk Dominator Trees and incremental updates that transcend time...

For the incremental updater, I'd usually manually instrument some of the GenericDomTree functions and count the number of nodes visited, time spent doing insertions, deletions, SNCA, etc. You can find some code for it in D50303 and D36884. For incremental updates, I just run -O3 on some big bitcode files, like clang, opt, sqlite.

For the construction algorithm itself, I had an llvm tool that loaded an LLVM module and randomly ran construction on each function a couple of times. See D36897.

I usually run all tests in llvm (ninja check-all), run the llvm-test-suite with LNT, and run on bitcode linked into a single file (clang, opt, sqlite, and more recently webassembly and rippled) -- I use gllvm for it. In order to make it work with gllvm, I usually compile projects with something like this:

export LLVM_COMPILER_PATH=my_path/llvm-9.0/bin
cmake .. -GNinja -DCMAKE_CXX_COMPILER=/home/kuba/go/bin/gclang++ -DCMAKE_C_COMPILER=/home/kuba/go/bin/gclang -DCMAKE_BUILD_TYPE=Release -DCMAKE_CXX_FLAGS='-Xclang -disable-llvm-optzns' -DCMAKE_C_FLAGS='-Xclang -disable-llvm-optzns'

And then extract bitcode with get-bc and feed it into opt with -O3 -- aggressive inlining helps with exercising dominator construction due to larger CFGs.

Another idea for benchmarking is to count the number of iterations of the loops that you expect to be affected by your patches and make sure it decreases, as an indirect measure of running time improvement.
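A small self-contained sketch of the iteration-counting idea, using a generic parent-pointer forest rather than the dominator code itself. Everything here (LoopIterations, findRootNaive, findRootCompressing) is a hypothetical name chosen for the illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical instrumentation counter: bumped once per parent-pointer step.
static unsigned long LoopIterations = 0;

// Baseline: walk parent pointers up to the root on every query.
std::size_t findRootNaive(const std::vector<std::size_t> &Parent,
                          std::size_t V) {
  assert(V < Parent.size());
  while (Parent[V] != V) {
    ++LoopIterations;
    V = Parent[V];
  }
  return V;
}

// Candidate: same walk, then re-point every vertex on the path at the root so
// later queries take fewer steps.
std::size_t findRootCompressing(std::vector<std::size_t> &Parent,
                                std::size_t V) {
  assert(V < Parent.size());
  std::size_t Root = V;
  while (Parent[Root] != Root) {
    ++LoopIterations;
    Root = Parent[Root];
  }
  while (Parent[V] != Root) {
    std::size_t Next = Parent[V];
    Parent[V] = Root;
    V = Next;
  }
  return Root;
}
```

Running the same sequence of queries against both versions and comparing the counter gives a measurement that does not depend on CPU frequency or cache effects, though it says nothing about constant factors.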

typo

Harbormaster completed remote builds in B28241: Diff 187188. Feb 17 2019, 10:08 PM

The improvement is quite fantastic!

> For the incremental updater, I'd usually manually instrument some of the GenericDomTree functions and count the number of nodes visited, time spent doing insertions, deletions, SNCA, etc. You can find some code for it in D50303 and D36884. For incremental updates, I just run -O3 on some big bitcode files, like clang, opt, sqlite.
> ...
> And then extract bitcode with get-bc and feed it into opt with -O3 -- aggressive inlining helps with exercising dominator construction due to larger CFGs.

I'd also like to mention that, before testing the performance of the incremental updater, you need to remove this piece of code in GenericDomTreeConstruction.h, which makes the incremental updater fall back to DominatorTree reconstruction when the number of updates is large relative to the number of DomTree nodes. I believe that makes improvements to the incremental updater more observable.

> I'd also like to mention that, before testing the performance of the incremental updater, you need to remove this piece of code in GenericDomTreeConstruction.h, which makes the incremental updater fall back to DominatorTree reconstruction when the number of updates is large relative to the number of DomTree nodes. I believe that makes improvements to the incremental updater more observable.

Thanks for the note. I have a better understanding now. See my other 2 patches D58369 D58373 in the area :)

I am now testing with IRTests check-llvm-analysis during development and check-llvm check-all when about to send out a patch.

Comparing the results of opt -passes='default<O3>,function(print<domtree>)' -disable-output (llvm-link.bc,llvm-as.bc,sqlite3.bc,...) gives some more confidence.

I don't expect this patch to have a perceivable performance boost. Maybe there is a bit. I ran time opt -domtree -disable-output < ~/llvm/GLLVM/bin/clang-9.bc several times with and without the patch:

Before: 1:05.27 1:05.45 1:05.46 1:06.96 1:05.25 1:04.58
After: 1:05.71 1:04.49 1:03.50 1:05.14 1:05.04 1:03.77

In D58327#1401992, @MaskRay wrote:

> Comparing the results of opt -passes='default<O3>,function(print<domtree>)' -disable-output (llvm-link.bc,llvm-as.bc,sqlite3.bc,...) gives some more confidence.
>
> I don't expect this patch to have a perceivable performance boost. Maybe there is a bit. I ran time opt -domtree -disable-output < ~/llvm/GLLVM/bin/clang-9.bc several times with and without the patch:
>
> Before: 1:05.27 1:05.45 1:05.46 1:06.96 1:05.25 1:04.58
> After: 1:05.71 1:04.49 1:03.50 1:05.14 1:05.04 1:03.77

If you are building DT/PDT once per function, then your time is going to be dominated by reading bitcode, parsing it, building a module, verifying it, etc. I suggest visiting each function a number of times (say 1000), in randomized order, and measuring the time of a single pass / tool that does it.
In addition, make sure that your performance governor is set to performance and that you pin the process to a single cpu core with its sibling disabled. (like in https://llvm.org/docs/Benchmarking.html#linux). Benchmarking on x86 is hard.
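The "visit each item many times in randomized order and time the whole thing" suggestion can be sketched as a generic harness. This is an illustration under assumptions, not an LLVM tool: timeRepeatedShuffled is a hypothetical name, and Work stands in for whatever per-function construction you want to measure.

```cpp
#include <algorithm>
#include <cassert>
#include <chrono>
#include <cstddef>
#include <functional>
#include <numeric>
#include <random>
#include <vector>

// Calls Work(I) for every index I in [0, NumItems), Repeats times over,
// reshuffling the visiting order before each round. Returns the elapsed
// wall-clock seconds, which include the (small) cost of shuffling.
double timeRepeatedShuffled(std::size_t NumItems, unsigned Repeats,
                            const std::function<void(std::size_t)> &Work,
                            unsigned Seed = 0) {
  assert(Work && "Work must be callable");
  std::vector<std::size_t> Order(NumItems);
  std::iota(Order.begin(), Order.end(), std::size_t{0});
  std::mt19937 Rng(Seed); // fixed seed so runs are comparable

  auto Start = std::chrono::steady_clock::now();
  for (unsigned R = 0; R < Repeats; ++R) {
    std::shuffle(Order.begin(), Order.end(), Rng);
    for (std::size_t I : Order)
      Work(I);
  }
  std::chrono::duration<double> Elapsed =
      std::chrono::steady_clock::now() - Start;
  return Elapsed.count();
}
```

Timing only the repeated work keeps one-off costs such as reading and verifying the bitcode out of the measurement, which is the concern raised above.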

In D58327#1402240, @kuhar wrote:

> In addition, make sure that your performance governor is set to performance and that you pin the process to a single cpu core with its sibling disabled. (like in https://llvm.org/docs/Benchmarking.html#linux). Benchmarking on x86 is hard.

One test I like to use is the LLVM test-suite subdirectory CTMark with running the top-level lit process through taskset, as in:

cmake -G Ninja -D TEST_SUITE_SUBDIRS=CTMark -D TEST_SUITE_RUN_BENCHMARKS=0 ...

ninja

taskset -c $core lit -v -j 1 . -o results.json

> In addition, make sure that your performance governor is set to performance and that you pin the process to a single cpu core with its sibling disabled. (like in https://llvm.org/docs/Benchmarking.html#linux). Benchmarking on x86 is hard.

Thanks for the pointer! Here are the performance numbers for perf stat -r 10 taskset -c 0 ~/llvm/Release/bin/opt -passes=$(printf '%.0srequire<domtree>,invalidate<domtree>,' {1..1000})'require<domtree>' -disable-output /tmp/p/sqlite-autoconf-3270100/sqlite3.bc
I'm very new to the optimization part of llvm. Let me know if there is a better way to measure domtree construction.

Before: 9.8444 +- 0.0279 seconds time elapsed ( +- 0.28% )
After: 9.3996 +- 0.0170 seconds time elapsed ( +- 0.18% )

I've tried this thrice and can reproduce similar comparison results.

Awesome, so it seems like we have a ~4.5% improvement. Thanks for running the experiments.
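As a quick check of the ~4.5% figure, using only the two mean times reported above:

```latex
\[
\frac{9.8444 - 9.3996}{9.8444} = \frac{0.4448}{9.8444} \approx 0.0452
\quad (\approx 4.5\%)
\]
```

The 0.4448 s difference is more than ten times larger than either reported deviation (0.0279 s and 0.0170 s), so it is unlikely to be run-to-run noise.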

This revision is now accepted and ready to land. Feb 19 2019, 7:35 PM

Add performance number to the description

Harbormaster completed remote builds in B28314: Diff 187509. Feb 19 2019, 8:36 PM

Harbormaster completed remote builds in B28315: Diff 187510. Feb 19 2019, 8:36 PM

Closed by commit rL354433: [Dominators] Simplify and optimize path compression used in link-eval forest. (authored by MaskRay). Feb 19 2019, 8:39 PM

This revision was automatically updated to reflect the committed changes.

> One test I like to use is the LLVM test-suite subdirectory CTMark with running the top-level lit process through taskset, as in:

@brzycki Thanks for the tip! Do you know how to create testing .bc from LLVM test-suite for benchmarks?

And how can I benchmark the dynamic dominator tree? CalculateFromScratch is easy to benchmark, as I can just repeatedly run the function analysis domtree and discard the result, but I don't know how for the dynamic case...

In D58327#1403532, @MaskRay wrote:

> @brzycki Thanks for the tip! Do you know how to create testing .bc from LLVM test-suite for benchmarks?

Hello @MaskRay, test-suite has a subdirectory named LLVMSource that contains IR variants of tests. However, I see no reference to that sub-directory in the CMake build files and the last update to that directory was in 2012. I have never tried it and have no idea if lit can run tests in there. The LLVMSource/Makefile does pull in all files in that directory so if it worked you should be able to just add a *.ll file there for it to be picked up.

> And how can I benchmark the dynamic dominator tree? CalculateFromScratch is easy to benchmark, as I can just repeatedly run the function analysis domtree and discard the result, but I don't know how for the dynamic case...

The testing I've done on Dominators was as a customer of the external API, and all the benchmarking I performed was in conjunction with the pass. I don't know of a way to profile just the DTU.

There is a directory under test-suite's MicroBenchmarks that uses the Google benchmark framework to time small sections of code. I don't know if that could be easily harnessed to test LLVM internals though. I suspect given test-suite's standalone nature it won't be easy to create a benchmark that relies on LLVM as a library. But I don't know for sure.

In D58327#1403532, @MaskRay wrote:

> And how can I benchmark the dynamic dominator tree? CalculateFromScratch is easy to benchmark, as I can just repeatedly run the function analysis domtree and discard the result, but I don't know how for the dynamic case...

The best method I came up with is to instrument the updater primitives (applyUpdates, insertEdge, deleteEdge, recalculate, etc.) and measure time within the running optimizer. It is probably best to get some big bitcode file and run opt with O3 on it. clang is reasonably sized in this configuration; you should also have luck with webassembly and rippled. If you are with Google, you can try some bigger internal targets (there is a script to compile a given target to bitcode).
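One way to instrument a primitive is a scoped timer that accumulates per-name totals. The sketch below is a standalone illustration and does not reproduce the instrumentation in any of the referenced revisions; PrimitiveStats, statsTable, ScopedTimer and insertEdgeStandIn are all hypothetical names, and the table is not thread-safe.

```cpp
#include <cassert>
#include <chrono>
#include <map>
#include <string>
#include <utility>

// Hypothetical per-primitive totals; not an LLVM data structure.
struct PrimitiveStats {
  unsigned long Calls = 0;
  double Seconds = 0.0;
};

// One process-wide table keyed by primitive name (not thread-safe).
inline std::map<std::string, PrimitiveStats> &statsTable() {
  static std::map<std::string, PrimitiveStats> Table;
  return Table;
}

// RAII timer: construct it at the top of the function to be measured; the
// destructor adds the elapsed time to that function's entry in the table.
class ScopedTimer {
  std::string Name;
  std::chrono::steady_clock::time_point Start;

public:
  explicit ScopedTimer(std::string N)
      : Name(std::move(N)), Start(std::chrono::steady_clock::now()) {
    assert(!Name.empty() && "each timer needs a name");
  }
  ScopedTimer(const ScopedTimer &) = delete;
  ScopedTimer &operator=(const ScopedTimer &) = delete;
  ~ScopedTimer() {
    std::chrono::duration<double> Elapsed =
        std::chrono::steady_clock::now() - Start;
    PrimitiveStats &S = statsTable()[Name];
    ++S.Calls;
    S.Seconds += Elapsed.count();
  }
};

// Hypothetical stand-in for an updater primitive such as insertEdge.
inline void insertEdgeStandIn() {
  ScopedTimer T("insertEdge");
  // ... the real work would go here ...
}
```

Dumping statsTable() at the end of an opt run would then give call counts and accumulated time per primitive. Note that a timer like this counts nested calls twice if one instrumented primitive calls another.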

In D58327#1403532, @MaskRay wrote:

> ...
> And how can I benchmark the dynamic dominator tree? CalculateFromScratch is easy to benchmark, as I can just repeatedly run the function analysis domtree and discard the result, but I don't know how for the dynamic case...

You can reference the code in D50300 to time incremental updater primitives. I recommend finding some projects that have large functions, probably machine-generated code, for example the one mentioned in Bug 37929. Bitcode with large functions can make improvements more observable. I remember incremental DT updating happens intensively in some passes like JumpThreading, so you can run O3 and save a .bc before JumpThreading. It can save you some time when benchmarking.

Diff 187510

include/llvm/Support/GenericDomTreeConstruction.h

[... 9 lines not shown ...]
 /// Generic dominator tree construction - This file provides routines to
 /// construct immediate dominator information for a flow-graph based on the
 /// Semi-NCA algorithm described in this dissertation:
 ///
 /// Linear-Time Algorithms for Dominators and Related Problems
 /// Loukas Georgiadis, Princeton University, November 2005, pp. 21-23:
 /// ftp://ftp.cs.princeton.edu/reports/2005/737.pdf
 ///
-/// This implements the O(n*log(n)) versions of EVAL and LINK, because it turns
-/// out that the theoretically slower O(n*log(n)) implementation is actually
-/// faster than the almost-linear O(n*alpha(n)) version, even for large CFGs.
+/// Semi-NCA algorithm runs in O(n^2) worst-case time but usually slightly
+/// faster than Simple Lengauer-Tarjan in practice.
+///
+/// O(n^2) worst cases happen when the computation of nearest common ancestors
+/// requires O(n) average time, which is very unlikely in real world. If this
+/// ever turns out to be an issue, consider implementing a hybrid algorithm.
 ///
 /// The file uses the Depth Based Search algorithm to perform incremental
 /// updates (insertion and deletions). The implemented algorithm is based on
 /// this publication:
 ///
 /// An Experimental Study of Dynamic Dominators
 /// Loukas Georgiadis, et al., April 12 2016, pp. 5-7, 9-10:
 /// https://arxiv.org/pdf/1604.02711.pdf
[... 220 lines not shown; enclosing context: while (!WorkList.empty()) { ...]
         SuccInfo.Parent = LastNum;
         SuccInfo.ReverseChildren.push_back(BB);
       }
     }

     return LastNum;
   }

-  NodePtr eval(NodePtr VIn, unsigned LastLinked) {
-    auto &VInInfo = NodeToInfo[VIn];
-    if (VInInfo.DFSNum < LastLinked)
-      return VIn;
-
-    SmallVector<NodePtr, 32> Work;
-    SmallPtrSet<NodePtr, 32> Visited;
-
-    if (VInInfo.Parent >= LastLinked)
-      Work.push_back(VIn);
-
-    while (!Work.empty()) {
-      NodePtr V = Work.back();
-      auto &VInfo = NodeToInfo[V];
-      NodePtr VAncestor = NumToNode[VInfo.Parent];
-
-      // Process Ancestor first
-      if (Visited.insert(VAncestor).second && VInfo.Parent >= LastLinked) {
-        Work.push_back(VAncestor);
-        continue;
-      }
-      Work.pop_back();
-
-      // Update VInfo based on Ancestor info
-      if (VInfo.Parent < LastLinked)
-        continue;
-
-      auto &VAInfo = NodeToInfo[VAncestor];
-      NodePtr VAncestorLabel = VAInfo.Label;
-      NodePtr VLabel = VInfo.Label;
-      if (NodeToInfo[VAncestorLabel].Semi < NodeToInfo[VLabel].Semi)
-        VInfo.Label = VAncestorLabel;
-      VInfo.Parent = VAInfo.Parent;
-    }
-
-    return VInInfo.Label;
-  }
+  // V is a predecessor of W. eval() returns V if V < W, otherwise the minimum
+  // of sdom(U), where U > W and there is a virtual forest path from U to V. The
+  // virtual forest consists of linked edges of processed vertices.
+  //
+  // We can follow Parent pointers (virtual forest edges) to determine the
+  // ancestor U with minimum sdom(U). But it is slow and thus we employ the path
+  // compression technique to speed up to O(m*log(n)). Theoretically the virtual
+  // forest can be organized as balanced trees to achieve almost linear
+  // O(m*alpha(m,n)) running time. But it requires two auxiliary arrays (Size
+  // and Child) and is unlikely to be faster than the simple implementation.
+  //
+  // For each vertex V, its Label points to the vertex with the minimal sdom(U)
+  // (Semi) in its path from V (included) to NodeToInfo[V].Parent (excluded).
+  NodePtr eval(NodePtr V, unsigned LastLinked,
+               SmallVectorImpl<InfoRec *> &Stack) {
+    InfoRec *VInfo = &NodeToInfo[V];
+    if (VInfo->Parent < LastLinked)
+      return VInfo->Label;
+
+    // Store ancestors except the last (root of a virtual tree) into a stack.
+    assert(Stack.empty());
+    do {
+      Stack.push_back(VInfo);
+      VInfo = &NodeToInfo[NumToNode[VInfo->Parent]];
+    } while (VInfo->Parent >= LastLinked);
+
+    // Path compression. Point each vertex's Parent to the root and update its
+    // Label if any of its ancestors (PInfo->Label) has a smaller Semi.
+    const InfoRec *PInfo = VInfo;
+    const InfoRec *PLabelInfo = &NodeToInfo[PInfo->Label];
+    do {
+      VInfo = Stack.pop_back_val();
+      VInfo->Parent = PInfo->Parent;
+      const InfoRec *VLabelInfo = &NodeToInfo[VInfo->Label];
+      if (PLabelInfo->Semi < VLabelInfo->Semi)
+        VInfo->Label = PInfo->Label;
+      else
+        PLabelInfo = VLabelInfo;
+      PInfo = VInfo;
+    } while (!Stack.empty());
+    return VInfo->Label;
+  }

   // This function requires DFS to be run before calling it.
   void runSemiNCA(DomTreeT &DT, const unsigned MinLevel = 0) {
     const unsigned NextDFSNum(NumToNode.size());
     // Initialize IDoms to spanning tree parents.
     for (unsigned i = 1; i < NextDFSNum; ++i) {
       const NodePtr V = NumToNode[i];
       auto &VInfo = NodeToInfo[V];
       VInfo.IDom = NumToNode[VInfo.Parent];
     }

     // Step #1: Calculate the semidominators of all vertices.
+    SmallVector<InfoRec *, 32> EvalStack;
     for (unsigned i = NextDFSNum - 1; i >= 2; --i) {
       NodePtr W = NumToNode[i];
       auto &WInfo = NodeToInfo[W];

       // Initialize the semi dominator to point to the parent node.
       WInfo.Semi = WInfo.Parent;
       for (const auto &N : WInfo.ReverseChildren) {
         if (NodeToInfo.count(N) == 0) // Skip unreachable predecessors.
           continue;

         const TreeNodePtr TN = DT.getNode(N);
         // Skip predecessors whose level is above the subtree we are processing.
         if (TN && TN->getLevel() < MinLevel)
           continue;

-        unsigned SemiU = NodeToInfo[eval(N, i + 1)].Semi;
+        unsigned SemiU = NodeToInfo[eval(N, i + 1, EvalStack)].Semi;
         if (SemiU < WInfo.Semi) WInfo.Semi = SemiU;
       }
     }

     // Step #2: Explicitly define the immediate dominator of each vertex.
     // IDom[i] = NCA(SDom[i], SpanningTreeParent(i)).
     // Note that the parents were stored in IDoms and later got invalidated
     // during path compression in Eval.
[... 1,318 lines not shown ...]

This is an archive of the discontinued LLVM Phabricator instance.

[Dominators] Simplify and optimize path compression used in link-eval forest.
ClosedPublic
