This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
test/tools/llvm-xray/X86/
-
tools/
-
llvm-xray/
-
X86/
4
graph-deduce-tail-call.yaml
1/3
graph-simple-case.yaml
-
tools/llvm-xray/
-
llvm-xray/
2/2
CMakeLists.txt
20/22
xray-graph.h
35/36
xray-graph.cc

Differential D27243

Initial work on the XRay Graph tool.
ClosedPublic

Authored by varno on Nov 29 2016, 5:29 PM.

Download Raw Diff

Details

Reviewers

dberris
dblaikie

Commits

rG87299ad2e793: [XRay] Implement the `llvm-xray graph` subcommand
rL292156: [XRay] Implement the `llvm-xray graph` subcommand

Summary

[XRay] Implement the llvm-xray graph subcommand

This is an innitial change to implement a new subcommand for the llvm-xray tool.

Here we define the graph subcommand which generates a graph from the function
call information and uses it to present the call information graphically with
additional annotations. This tool was originally proposed by dberris.

Depends on D24377.

Diff Detail

Build Status

Buildable 2866
Build 2866: arc lint + arc unit

Event Timeline

varno updated this revision to Diff 79681.Nov 29 2016, 5:29 PM

varno retitled this revision from to Initial work on the XRay Graph tool..

varno updated this object.

varno added reviewers: dberris, dblaikie.

varno added a parent revision: D24377: [XRay] Implement the `llvm-xray account` subcommand.

varno added subscribers: dberris, mgorny, llvm-commits.

dblaikie added inline comments.Dec 1 2016, 1:34 PM

tools/llvm-xray/xray-graph.cc
85–89	could potentially use a conditional operator inside the [] since the expression is otherwise identical. Also, the outer () aren't necessary - but you can keep them if you find they enhance readability.
90–92	Consider dropping {} on single line blocks.
117–121	Inconsistent bracing (why does the inner loop have braces but the outer loop doesn't? - personally I'd probably drop them from both, but I could see an argument for adding them to both - just doesn't seem right to add them only to one and not the other)
121	Extra semicolon
129	Extra semi
133–134	Drop these (it's just a static variable in this scope anyway)
175–184	Should be some common utility for this, so every tool doesn't have to go through the same hoops (probably coalesce all the instrumentation map extractor stuff as well)
tools/llvm-xray/xray-graph.h
31	No need for the explicit 'private' as it's implied/the default here.
42	(Consider LLVM's data structures - can be more memory efficient than the allocation-per-node of standard containers. Also a map of maps might be more efficient as a map of pair -> value, if it's equivalent for your use case)
60	(similarly, consider other data structures - but also maybe consider multimap rather than map to vector)
69	Extra semicolon

Initial work on the XRay Graph tool.
Responses to Phabricator comments

Remove spurious files by basing patch on D24377

dberris added inline comments.Dec 1 2016, 5:56 PM

tools/llvm-xray/CMakeLists.txt
17	nit: we try to somewhat keep this in lexicographical order.
tools/llvm-xray/xray-graph.cc
175–184	+1 -- if you rebase again to the latest of D24377 you can use the utility function that determines the supported loader function here.
tools/llvm-xray/xray-graph.h
42	http://llvm.org/docs/ProgrammersManual.html is a good resource to look up available data structures -- for instance, for map-like containers: http://llvm.org/docs/ProgrammersManual.html#map-like-containers-std-map-densemap-etc You have a few choices you can work with.

varno marked 8 inline comments as done.Dec 1 2016, 5:57 PM

varno added inline comments.

tools/llvm-xray/xray-graph.cc
175–184	Yeah there probably should be. Not yet though AFAIK.
tools/llvm-xray/xray-graph.h
42	Thinking more on this before I make changes.
60	Thinking on this more before I make changes.

I am planning changes

Additional Changes in response to dblaikie's comments
Rebase to current version of D24377 and use getSupportedLoader

I have addressed the last open comments.

Cool -- now let's see some tests for this, to make sure that we don't regress it when it goes in. You can find examples of tests in the test/tools/llvm-xray/ directory.

dberris requested changes to this revision.Dec 5 2016, 7:46 PM

dberris edited edge metadata.

Needs tests.

This revision now requires changes to proceed.Dec 5 2016, 7:46 PM

Extra work making the xray-graph tool more powerful.
Further work on options in llvm-xray graph subcommand
Created Tests

Some style comments.

As for tests I think this is an OK set for first tests, but it'd be good to look at error conditions and making sure we're handling those appropriately.

tools/llvm-xray/xray-graph.cc
122	Can you use `.emplace_back(...)` instead here?
156–157	Please add an empty line between the end of function definitions, and the start of the next function.
tools/llvm-xray/xray-graph.h
28–29	Please add an empty line between last non-comment and first comment lines.
76	Can you use a `SmallVector<...>` instead of a `std::vector<...>`?
116–125	Can this not happen incrementally, as we're adding the records, or on demand when we export? i.e. why do they need to be public functions?
127–128	Does this need to be part of the class? Could this not be a function in the implementation?

varno added inline comments.Dec 6 2016, 8:37 PM

tools/llvm-xray/xray-graph.cc
122	No.
tools/llvm-xray/xray-graph.h
116–125	Not really, all the records need to be added before we calculate the statistics. And I was wanting the statistics calculated to do graph transformations later (before output).
127–128	I suppose it could be a friend function, it uses private types (the TimeStat type).

Reply to dberris comments.

The formatting changes I suspect can be automated away by using clang-format, so we don't have to spend too many cycles just getting those things "right". :)

tools/llvm-xray/xray-graph.cc
175–176	Empty line between these two lines?
188–197	The indentation here is a little weird. Fix?
tools/llvm-xray/xray-graph.h
116–125	Sure, but why aren't these just implementation details of the exportGraphAsDOT(...) function?
127–128	It doesn't have to use that type, right? I suspect you could just make this a template in the unnamed namespace in the implementation. Or that type could just be public and you could just use it (i.e for example you might want to be able to provide an iterator to the graph later, so that other commands can leverage the graph too).

Comments by Dberris

varno marked 5 inline comments as done.Dec 6 2016, 9:10 PM

varno added inline comments.

tools/llvm-xray/xray-graph.h
116–125	I suppose that right now they could be, but I have plans that require them to be run before exportGraphAsDOT

Did you run this through clang-format in LLVM mode?

tools/llvm-xray/xray-graph.cc
230–231	Missing empty line between these two.
345–348	I'm looking at these lines and thinking it seems they're unnecessarily exposing implementation details -- really these things are something the graph renderer can decide itself if it took the file header as an argument to the constructor. And these really ought to just happen when we're exporting the data as DOT and passing arguments to the kind of data we want to see.
tools/llvm-xray/xray-graph.h
116–125	I suspect we can break these out later if we really need to. For now, they're a distraction and from an API design perspective, really brittle -- if someone wants to use this class they have to know to call these functions before exporting. If this doesn't happen while we're accounting the records, then this means the state of the graph is not easily determined. If for example an external algorithm requires the graph, then we ought to be able to access the graph itself -- and maybe the accounting ought to happen externally instead, as an algorithm that passes through the graph we've built or as an extension to the graph traversal algorithm being employed, exposed as a method to this class.

Replying to comments

varno marked 2 inline comments as done.Dec 6 2016, 9:54 PM

varno added inline comments.

tools/llvm-xray/xray-graph.h
116–125	Ok, I'll break them out and we can add them in later if needed

dberris added inline comments.Dec 6 2016, 9:59 PM

tools/llvm-xray/xray-graph.h
44–45	Missing empty line between these two.
75–76	Missing line in between these two.

Patch comment reply

Now had a look closely at the implementation. Please pardon the piecemeal review.

tools/llvm-xray/xray-graph.cc
103	Potentially spurious empty line here.
118	Also potentially spurious empty line here.
143–149	So I think in a future change which we talked about offline, we ought to make this and the `account` tool work with each other in some form of refactored implementation -- where the account tool can depend on the graph tool instead of duplicating a lot of this logic. If you rebase against the `account` change (D24377) then the version used there now has less calls to .pop_back() on the vector, and also has a simpler loop as a result (i.e. you don't need the extra spillover logic, and the version there has less nesting and more straight-line code anyway. The actionable comment right now I think would be a `//FIXME: Refactor this and the account subcommand to reduce code duplication`
151	Maybe use a name that isn't overloaded in this context -- `D` is a perfectly fine letter for something that indicates the delta between two numbers.
205–206	Empty line in between, and don't need void in the function arguments.
215	No need for void in the function arguments.
216	Consider also using a `SmallVector` here, with potentially the same initial size as with the adjacency list we maintain.
222	It's thoroughly confusing too that you're using Timings here, but there's a data member named Timings as well. :)
222–229	Is there no way to do this in a single pass, in the same loop above? I could imagine building a map that contains a vertex as the key, then accumulating statistics as you go traversing the graph (or even just storing the data anyway like you do here for the timings), and another pass to collapse that data.
tools/llvm-xray/xray-graph.h
89	Why are the iterators not the same type?
93	No need for `void` in the function arguments.

Comments

varno marked 5 inline comments as done.Dec 7 2016, 2:50 PM

varno added inline comments.

tools/llvm-xray/xray-graph.cc
222–229	Really not, the way here has linear space and uses the required memory for each section. If we did not care about memory I could store the accumulation data for both edges and vertices. but otherwise ???

Reply to dberris comments.
Comments
comments

From a style perspective and implementation I think this is mostly OK -- waiting on @dblaikie to have a look.

tools/llvm-xray/xray-graph.cc
222–229	Right, so the alternate body of the function seems to be more straight-forward, and the amount of memory being used isn't that bad. You can even be smart about this and have an upper bound on the elements in the vector, and as you insert you keep a relative order and track median, and other percentiles incrementally. That's not as critical as the functionality though, and we can tune this as we go along late anyway if we encounter really big graphs in practice that would cause this to be a huge problem (either as implemented or even in the alternative version). I'm happy either way if you choose to stick with the current implementation, but if you do consider just removing the alternate implementation you have here `#ifdef 0`'d out.

This revision is now accepted and ready to land.Dec 7 2016, 6:47 PM

varno added a child revision: D28225: Implemented color coding and Vertex labels in XRay Graph.Jan 2 2017, 10:35 PM

dblaikie accepted this revision.Jan 9 2017, 1:14 PM

dblaikie edited edge metadata.

dblaikie added inline comments.

test/tools/llvm-xray/X86/graph-deduce-tail-call.yaml
9–18	This seems to be passing a variety of values for '-e' but tests that the behavior is the same in all of them. I'm assuming the flag does something - do you have tests that confirm that it does the right thing? (somewhat similar for the top two test cases too)
40–41	Is deduce-sibling-calls tested?
tools/llvm-xray/xray-graph.cc
212–213	Is this optimization worthwhile? Or could we put this as a local variable in the outer loop below - no need to clear it, etc.
223	Remove dead code
288–289	Probably skip the extra language like "does what the name suggests" and "does this in the expected way". If it's pretty obvious/self explanatory, then a brief comment is OK.
341–342	Should this be silently handled? If a user specifies a file, seems like we should error if it's not there (@dberris - goes for existing tools too, I imagine, guess I didn't notice this in the others)
359	Move this to where it's used (then you could even use 'const auto *' if you like

varno marked 4 inline comments as done.Jan 9 2017, 2:43 PM

varno added inline comments.

test/tools/llvm-xray/X86/graph-deduce-tail-call.yaml
9–18	I need to add additional test cases for these, however the test case size for -e 99p must be quite long in order for it to work. Will add in next revision of patch.
40–41	Yes. Otherwise the graph would only have two nodes with timing information
tools/llvm-xray/xray-graph.cc
212–213	I think it is worth while, however I have no testing results. At this time, as I am working on a patch which changes how this code works completely due to a new data structure, I don't know.

dberris added inline comments.Jan 12 2017, 12:59 AM

test/tools/llvm-xray/X86/graph-simple-case.yaml
35–36	Did you need to write "DAG" here somewhere too?
tools/llvm-xray/CMakeLists.txt
14–16	Rebasing this to tip of trunk now that 'llvm-xray account' has landed might mean this dependency goes away now.
tools/llvm-xray/xray-graph.cc
212–213	I'd urge you to not change this patch, but instead stack one on top of it instead for any further changes you'd need to do. I'd much rather do that review of the refactoring separately than doing this review over with new data structures.
341–342	I think we at least should print something to the effect of "well, we can't symbolise properly". I'm happy with making this an explicit error, to not give users potentially misleading results.
359	Given the changes that have landed now, there's a better way of doing this (if we look at what's happening in `llvm-xray account` at least).

Rebase and addess comment.

varno marked 15 inline comments as done.Jan 12 2017, 3:32 PM

varno added inline comments.

test/tools/llvm-xray/X86/graph-simple-case.yaml
35–36	I don't need DAG here as there is only one edge in this graph.

dberris added inline comments.Jan 12 2017, 7:01 PM

test/tools/llvm-xray/X86/graph-simple-case.yaml
35–36	So as written, this will mean that if these lines are not arranged in exactly this order, the test will pass. If you meant to preserve order of the output lines being checked, you pick either `-DAG:` or `-NEXT:`. This means, instead of: #EMPTY: #EMPTY: #EMPTY: It ought to be: #EMPTY: #EMPTY-NEXT: #EMPTY-NEXT: #EMPTY-NEXT: to preserve the order of lines and matching.

Initial work on the XRay Graph tool.
clang-format

fix tests

Hi dblakie, I was wondering if you could land this patch for me?

Closed by commit rL292156: [XRay] Implement the `llvm-xray graph` subcommand (authored by dblaikie). · Explain WhyJan 16 2017, 1:06 PM

This revision was automatically updated to reflect the committed changes.

Please try to avoid adding pid_t references. This type is not available on windows. llvm::sys::ProcessInfo::ProcessId is a possible replacement (i've fixed the usage in this commit in r292206).

Revision Contents

Path

Size

test/

tools/

llvm-xray/

X86/

graph-deduce-tail-call.yaml

75 lines

graph-simple-case.yaml

46 lines

tools/

llvm-xray/

CMakeLists.txt

1 line

xray-graph.h

129 lines

xray-graph.cc

365 lines

Diff 84216

test/tools/llvm-xray/X86/graph-deduce-tail-call.yaml

This file was added.

				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d \
				#RUN: \| FileCheck %s -check-prefix=COUNT
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e count \
				#RUN: \| FileCheck %s -check-prefix=COUNT
				#
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e min \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e med \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e 90p \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e 99p \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e max \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -d -e sum \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#
				dblaikieUnsubmitted Not Done Reply Inline Actions This seems to be passing a variety of values for '-e' but tests that the behavior is the same in all of them. I'm assuming the flag does something - do you have tests that confirm that it does the right thing? (somewhat similar for the top two test cases too) dblaikie: This seems to be passing a variety of values for '-e' but tests that the behavior is the same…
				varnoAuthorUnsubmitted Not Done Reply Inline Actions I need to add additional test cases for these, however the test case size for -e 99p must be quite long in order for it to work. Will add in next revision of patch. varno: I need to add additional test cases for these, however the test case size for -e 99p must be…
				---
				header:
				version: 1
				type: 0
				constant-tsc: true
				nonstop-tsc: true
				cycle-frequency: 0
				records:
				# Here we reconstruct the following call trace:
				#
				# f1()
				# f2()
				# f3()
				#
				# But we find that we're missing an exit record for f2() because it's
				# tail-called f3(). We make sure that if we see a trace like this that we can
				# deduce tail calls, and account the time (potentially wrongly) to f2() when
				# f1() exits. That is because we don't go back to f3()'s entry record to
				# properly do the math on the timing of f2().
				#
				# Note that by default, tail/sibling call deduction is disabled, and is enabled
				# with a flag "-d" or "-deduce-sibling-calls".
				#
				dblaikieUnsubmitted Not Done Reply Inline Actions Is deduce-sibling-calls tested? dblaikie: Is deduce-sibling-calls tested?
				varnoAuthorUnsubmitted Not Done Reply Inline Actions Yes. Otherwise the graph would only have two nodes with timing information varno: Yes. Otherwise the graph would only have two nodes with timing information
				- { type: 0, func-id: 1, cpu: 1, thread: 111, kind: function-enter, tsc: 10000 }
				- { type: 0, func-id: 2, cpu: 1, thread: 111, kind: function-enter, tsc: 10001 }
				- { type: 0, func-id: 3, cpu: 1, thread: 111, kind: function-enter, tsc: 10002 }
				- { type: 0, func-id: 3, cpu: 1, thread: 111, kind: function-exit, tsc: 10003 }
				- { type: 0, func-id: 1, cpu: 1, thread: 111, kind: function-exit, tsc: 10004 }
				...

				#EMPTY: digraph xray {
				#EMPTY-DAG: F0 -> F1 [label=""];
				#EMPTY-DAG: F1 -> F2 [label=""];
				#EMPTY-DAG: F2 -> F3 [label=""];
				#EMPTY-DAG: F1 [label="@(1)"];
				#EMPTY-DAG: F2 [label="@(2)"];
				#EMPTY-DAG: F3 [label="@(3)"];
				#EMPTY-NEXT: }

				#COUNT: digraph xray {
				#COUNT-DAG: F0 -> F1 [label="1"];
				#COUNT-DAG: F1 -> F2 [label="1"];
				#COUNT-DAG: F2 -> F3 [label="1"];
				#COUNT-DAG: F1 [label="@(1)"];
				#COUNT-DAG: F2 [label="@(2)"];
				#COUNT-DAG: F3 [label="@(3)"];
				#COUNT-NEXT: }


				#TIME: digraph xray {
				#TIME-DAG: F0 -> F1 [label="4.{{.*}}"];
				#TIME-DAG: F1 -> F2 [label="3.{{.*}}"];
				#TIME-DAG: F2 -> F3 [label="1.{{.*}}"];
				#TIME-DAG: F1 [label="@(1)"];
				#TIME-DAG: F2 [label="@(2)"];
				#TIME-DAG: F3 [label="@(3)"];
				#TIME-NEXT: }

test/tools/llvm-xray/X86/graph-simple-case.yaml

This file was added.

				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml \
				#RUN: \| FileCheck %s -check-prefix=COUNT
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e count \
				#RUN: \| FileCheck %s -check-prefix=COUNT
				#
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e min \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e med \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e 90p \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e 99p \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e max \
				#RUN: \| FileCheck %s -check-prefix=TIME
				#RUN: llvm-xray graph %s -o - -m %S/Inputs/simple-instrmap.yaml -t yaml -e sum \
				#RUN: \| FileCheck %s -check-prefix=TIME
				---
				header:
				version: 1
				type: 0
				constant-tsc: true
				nonstop-tsc: true
				cycle-frequency: 2601000000
				records:
				- { type: 0, func-id: 1, cpu: 1, thread: 111, kind: function-enter,
				tsc: 10001 }
				- { type: 0, func-id: 1, cpu: 1, thread: 111, kind: function-exit,
				tsc: 10100 }
				...


				#EMPTY: digraph xray {
				#EMPTY-NEXT: F0 -> F1 [label=""];
				#EMPTY-NEXT: F1 [label="@(1)"];
				#EMPTY-NEXT: }
				dberrisUnsubmitted Done Reply Inline Actions Did you need to write "DAG" here somewhere too? dberris: Did you need to write "DAG" here somewhere too?
				varnoAuthorUnsubmitted Not Done Reply Inline Actions I don't need DAG here as there is only one edge in this graph. varno: I don't need DAG here as there is only one edge in this graph.
				dberrisUnsubmitted Not Done Reply Inline Actions So as written, this will mean that if these lines are not arranged in exactly this order, the test will pass. If you meant to preserve order of the output lines being checked, you pick either `-DAG:` or `-NEXT:`. This means, instead of: #EMPTY: #EMPTY: #EMPTY: It ought to be: #EMPTY: #EMPTY-NEXT: #EMPTY-NEXT: #EMPTY-NEXT: to preserve the order of lines and matching. dberris: So as written, this will mean that if these lines are not arranged in exactly this order, the…

				#COUNT: digraph xray {
				#COUNT-NEXT: F0 -> F1 [label="1"];
				#COUNT-NEXT: F1 [label="@(1)"];
				#COUNT-NEXT: }

				#TIME: digraph xray {
				#TIME-NEXT: F0 -> F1 [label="3.8{{.*}}e-08"];
				#TIME-NEXT: F1 [label="@(1)"];
				#TIME-NEXT: }

tools/llvm-xray/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS			set(LLVM_LINK_COMPONENTS
	${LLVM_TARGETS_TO_BUILD}			${LLVM_TARGETS_TO_BUILD}
	DebugInfoDWARF			DebugInfoDWARF
	Object			Object
	Support			Support
	Symbolize			Symbolize
	XRay)			XRay)

	set(LLVM_XRAY_TOOLS			set(LLVM_XRAY_TOOLS
	func-id-helper.cc			func-id-helper.cc
	xray-account.cc			xray-account.cc
	xray-converter.cc			xray-converter.cc
	xray-extract.cc			xray-extract.cc
	xray-extract.cc			xray-extract.cc
				xray-graph.cc
	xray-registry.cc)			xray-registry.cc)
				dberrisUnsubmitted Done Reply Inline Actions Rebasing this to tip of trunk now that 'llvm-xray account' has landed might mean this dependency goes away now. dberris: Rebasing this to tip of trunk now that 'llvm-xray account' has landed might mean this…

				dberrisUnsubmitted Done Reply Inline Actions nit: we try to somewhat keep this in lexicographical order. dberris: nit: we try to somewhat keep this in lexicographical order.
	add_llvm_tool(llvm-xray llvm-xray.cc ${LLVM_XRAY_TOOLS})			add_llvm_tool(llvm-xray llvm-xray.cc ${LLVM_XRAY_TOOLS})

tools/llvm-xray/xray-graph.h

This file was added.

				//===-- xray-graph.h - XRay Function Call Graph Renderer --------- C++ --===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Generate a DOT file to represent the function call graph encountered in
				// the trace.
				//
				//===----------------------------------------------------------------------===//

				#ifndef XRAY_GRAPH_H
				#define XRAY_GRAPH_H

				#include <vector>

				#include "func-id-helper.h"
				#include "llvm/ADT/DenseMap.h"
				#include "llvm/ADT/SmallVector.h"
				#include "llvm/Support/raw_ostream.h"
				#include "llvm/XRay/Trace.h"
				#include "llvm/XRay/XRayRecord.h"

				namespace llvm {
				namespace xray {

				dberrisUnsubmitted Done Reply Inline Actions Please add an empty line between last non-comment and first comment lines. dberris: Please add an empty line between last non-comment and first comment lines.
				/// A class encapsulating the logic related to analyzing XRay traces, producting
				/// Graphs from them and then exporting those graphs for review.
				dblaikieUnsubmitted Done Reply Inline Actions No need for the explicit 'private' as it's implied/the default here. dblaikie: No need for the explicit 'private' as it's implied/the default here.
				class GraphRenderer {
				public:
				/// An inner struct for common timing statistics information
				struct TimeStat {
				uint64_t Count;
				double Min;
				double Median;
				double Pct90;
				double Pct99;
				double Max;
				double Sum;
				dblaikieUnsubmitted Done Reply Inline Actions (Consider LLVM's data structures - can be more memory efficient than the allocation-per-node of standard containers. Also a map of maps might be more efficient as a map of pair -> value, if it's equivalent for your use case) dblaikie: (Consider LLVM's data structures - can be more memory efficient than the allocation-per-node of…
				dberrisUnsubmitted Done Reply Inline Actions http://llvm.org/docs/ProgrammersManual.html is a good resource to look up available data structures -- for instance, for map-like containers: http://llvm.org/docs/ProgrammersManual.html#map-like-containers-std-map-densemap-etc You have a few choices you can work with. dberris: http://llvm.org/docs/ProgrammersManual.html is a good resource to look up available data…
				varnoAuthorUnsubmitted Not Done Reply Inline Actions Thinking more on this before I make changes. varno: Thinking more on this before I make changes.
				};

				/// An inner struct for storing edge attributes for our graph. Here the
				dberrisUnsubmitted Done Reply Inline Actions Missing empty line between these two. dberris: Missing empty line between these two.
				/// attributes are mainly function call statistics.
				///
				/// FIXME: expand to contain more information eg call latencies.
				struct EdgeAttribute {
				TimeStat S;
				std::vector<uint64_t> Timings;
				};

				/// An Inner Struct for storing vertex attributes, at the moment just
				/// SymbolNames, however in future we could store bulk function statistics.
				///
				/// FIXME: Store more attributes based on instrumentation map.
				struct VertexAttribute {
				std::string SymbolName;
				TimeStat S;
				dblaikieUnsubmitted Done Reply Inline Actions (similarly, consider other data structures - but also maybe consider multimap rather than map to vector) dblaikie: (similarly, consider other data structures - but also maybe consider multimap rather than map…
				varnoAuthorUnsubmitted Not Done Reply Inline Actions Thinking on this more before I make changes. varno: Thinking on this more before I make changes.
				};

				private:
				/// The Graph stored in an edge-list like format, with the edges also having
				/// An attached set of attributes.
				DenseMap<int32_t, DenseMap<int32_t, EdgeAttribute>> Graph;

				/// Graph Vertex Attributes. These are presently stored seperate from the
				/// main graph.
				dblaikieUnsubmitted Done Reply Inline Actions Extra semicolon dblaikie: Extra semicolon
				DenseMap<int32_t, VertexAttribute> VertexAttrs;

				struct FunctionAttr {
				int32_t FuncId;
				uint64_t TSC;
				};

				dberrisUnsubmitted Done Reply Inline Actions Can you use a `SmallVector<...>` instead of a `std::vector<...>`? dberris: Can you use a `SmallVector<...>` instead of a `std::vector<...>`?
				dberrisUnsubmitted Done Reply Inline Actions Missing line in between these two. dberris: Missing line in between these two.
				/// Use a Map to store the Function stack for each thread whilst building the
				/// graph.
				///
				/// FIXME: Perhaps we can Build this into LatencyAccountant? or vise versa?
				DenseMap<pid_t, SmallVector<FunctionAttr, 4>> PerThreadFunctionStack;

				/// Usefull object for getting human readable Symbol Names.
				FuncIdConversionHelper &FuncIdHelper;
				bool DeduceSiblingCalls = false;
				uint64_t CurrentMaxTSC = 0;

				/// A private function to help implement the statistic generation functions;
				template <typename U>
				dberrisUnsubmitted Done Reply Inline Actions Why are the iterators not the same type? dberris: Why are the iterators not the same type?
				void getStats(U begin, U end, GraphRenderer::TimeStat &S);

				/// Calculates latency statistics for each edge and stores the data in the
				/// Graph
				dberrisUnsubmitted Done Reply Inline Actions No need for `void` in the function arguments. dberris: No need for `void` in the function arguments.
				void calculateEdgeStatistics();

				/// Calculates latency statistics for each vertex and stores the data in the
				/// Graph
				void calculateVertexStatistics();

				/// Normalises latency statistics for each edge and vertex by CycleFrequency;
				void normaliseStatistics(double CycleFrequency);

				public:
				/// Takes in a reference to a FuncIdHelper in order to have ready access to
				/// Symbol names.
				explicit GraphRenderer(FuncIdConversionHelper &FuncIdHelper, bool DSC)
				: FuncIdHelper(FuncIdHelper), DeduceSiblingCalls(DSC) {}

				/// Process an Xray record and expand the graph.
				///
				/// This Function will return true on success, or false if records are not
				/// presented in per-thread call-tree DFS order. (That is for each thread the
				/// Records should be in order runtime on an ideal system.)
				///
				/// FIXME: Make this more robust against small irregularities.
				bool accountRecord(const XRayRecord &Record);

				/// An enum for enumerating the various statistics gathered on latencies
				enum class StatType { COUNT, MIN, MED, PCT90, PCT99, MAX, SUM };

				/// Output the Embedded graph in DOT format on \p OS, labeling the edges by
				/// \p T
				void exportGraphAsDOT(raw_ostream &OS, const XRayFileHeader &H,
				StatType T = StatType::COUNT);
				};
				dberrisUnsubmitted Done Reply Inline Actions Can this not happen incrementally, as we're adding the records, or on demand when we export? i.e. why do they need to be public functions? dberris: Can this not happen incrementally, as we're adding the records, or on demand when we export? i.
				varnoAuthorUnsubmitted Done Reply Inline Actions Not really, all the records need to be added before we calculate the statistics. And I was wanting the statistics calculated to do graph transformations later (before output). varno: Not really, all the records need to be added before we calculate the statistics. And I was…
				dberrisUnsubmitted Done Reply Inline Actions Sure, but why aren't these just implementation details of the exportGraphAsDOT(...) function? dberris: Sure, but why aren't these just implementation details of the exportGraphAsDOT(...) function?
				varnoAuthorUnsubmitted Done Reply Inline Actions I suppose that right now they could be, but I have plans that require them to be run before exportGraphAsDOT varno: I suppose that right now they could be, but I have plans that require them to be run before…
				dberrisUnsubmitted Done Reply Inline Actions I suspect we can break these out later if we really need to. For now, they're a distraction and from an API design perspective, really brittle -- if someone wants to use this class they have to know to call these functions before exporting. If this doesn't happen while we're accounting the records, then this means the state of the graph is not easily determined. If for example an external algorithm requires the graph, then we ought to be able to access the graph itself -- and maybe the accounting ought to happen externally instead, as an algorithm that passes through the graph we've built or as an extension to the graph traversal algorithm being employed, exposed as a method to this class. dberris: I suspect we can break these out later if we really need to. For now, they're a distraction and…
				varnoAuthorUnsubmitted Done Reply Inline Actions Ok, I'll break them out and we can add them in later if needed varno: Ok, I'll break them out and we can add them in later if needed
				}
				}

				dberrisUnsubmitted Done Reply Inline Actions Does this need to be part of the class? Could this not be a function in the implementation? dberris: Does this need to be part of the class? Could this not be a function in the implementation?
				varnoAuthorUnsubmitted Done Reply Inline Actions I suppose it could be a friend function, it uses private types (the TimeStat type). varno: I suppose it could be a friend function, it uses private types (the TimeStat type).
				dberrisUnsubmitted Done Reply Inline Actions It doesn't have to use that type, right? I suspect you could just make this a template in the unnamed namespace in the implementation. Or that type could just be public and you could just use it (i.e for example you might want to be able to provide an iterator to the graph later, so that other commands can leverage the graph too). dberris: It doesn't have to use that type, right? I suspect you could just make this a template in the…
				#endif // XRAY_GRAPH_H

tools/llvm-xray/xray-graph.cc

This file was added.

				//===-- xray-graph.c - XRay Function Call Graph Renderer ------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Generate a DOT file to represent the function call graph encountered in
				// the trace.
				//
				//===----------------------------------------------------------------------===//
				#include <algorithm>
				#include <cassert>
				#include <system_error>
				#include <utility>

				#include "xray-extract.h"
				#include "xray-graph.h"
				#include "xray-registry.h"
				#include "llvm/Support/ErrorHandling.h"
				#include "llvm/Support/FormatVariadic.h"
				#include "llvm/XRay/Trace.h"
				#include "llvm/XRay/YAMLXRayRecord.h"

				using namespace llvm;
				using namespace xray;

				// Setup llvm-xray graph subcommand and its options.
				static cl::SubCommand Graph("graph", "Generate function-call graph");
				static cl::opt<std::string> GraphInput(cl::Positional,
				cl::desc("<xray log file>"),
				cl::Required, cl::sub(Graph));

				static cl::opt<std::string>
				GraphOutput("output", cl::value_desc("Output file"), cl::init("-"),
				cl::desc("output file; use '-' for stdout"), cl::sub(Graph));
				static cl::alias GraphOutput2("o", cl::aliasopt(GraphOutput),
				cl::desc("Alias for -output"), cl::sub(Graph));

				static cl::opt<std::string> GraphInstrMap(
				"instr_map", cl::desc("binary with the instrumrntation map, or "
				"a separate instrumentation map"),
				cl::value_desc("binary with xray_instr_map"), cl::sub(Graph), cl::init(""));
				static cl::alias GraphInstrMap2("m", cl::aliasopt(GraphInstrMap),
				cl::desc("alias for -instr_map"),
				cl::sub(Graph));

				static cl::opt<InstrumentationMapExtractor::InputFormats> InstrMapFormat(
				"instr-map-format", cl::desc("format of instrumentation map"),
				cl::values(clEnumValN(InstrumentationMapExtractor::InputFormats::ELF, "elf",
				"instrumentation map in an ELF header"),
				clEnumValN(InstrumentationMapExtractor::InputFormats::YAML,
				"yaml", "instrumentation map in YAML")),
				cl::sub(Graph), cl::init(InstrumentationMapExtractor::InputFormats::ELF));
				static cl::alias InstrMapFormat2("t", cl::aliasopt(InstrMapFormat),
				cl::desc("Alias for -instr-map-format"),
				cl::sub(Graph));

				static cl::opt<bool> GraphDeduceSiblingCalls(
				"deduce-sibling-calls",
				cl::desc("Deduce sibling calls when unrolling function call stacks"),
				cl::sub(Graph), cl::init(false));
				static cl::alias
				GraphDeduceSiblingCalls2("d", cl::aliasopt(GraphDeduceSiblingCalls),
				cl::desc("Alias for -deduce-sibling-calls"),
				cl::sub(Graph));

				static cl::opt<GraphRenderer::StatType>
				GraphEdgeLabel("edge-label",
				cl::desc("Output graphs with edges labeled with this field"),
				cl::value_desc("field"), cl::sub(Graph),
				cl::init(GraphRenderer::StatType::COUNT),
				cl::values(clEnumValN(GraphRenderer::StatType::COUNT,
				"count", "function call counts"),
				clEnumValN(GraphRenderer::StatType::MIN, "min",
				"minimum function durations"),
				clEnumValN(GraphRenderer::StatType::MED, "med",
				"median function durations"),
				clEnumValN(GraphRenderer::StatType::PCT90, "90p",
				"90th percentile durations"),
				clEnumValN(GraphRenderer::StatType::PCT99, "99p",
				"99th percentile durations"),
				clEnumValN(GraphRenderer::StatType::MAX, "max",
				"maximum function durations"),
				clEnumValN(GraphRenderer::StatType::SUM, "sum",
				"sum of call durations")));
				static cl::alias GraphEdgeLabel2("e", cl::aliasopt(GraphEdgeLabel),
				dblaikieUnsubmitted Done Reply Inline Actions could potentially use a conditional operator inside the [] since the expression is otherwise identical. Also, the outer () aren't necessary - but you can keep them if you find they enhance readability. dblaikie: could potentially use a conditional operator inside the [] since the expression is otherwise…
				cl::desc("Alias for -edge-label"),
				cl::sub(Graph));

				dblaikieUnsubmitted Done Reply Inline Actions Consider dropping {} on single line blocks. dblaikie: Consider dropping {} on single line blocks.
				namespace {
				template <class T> T diff(T L, T R) { return std::max(L, R) - std::min(L, R); }

				void updateStat(GraphRenderer::TimeStat &S, int64_t lat) {
				S.Count++;
				if (S.Min > lat \|\| S.Min == 0)
				S.Min = lat;
				if (S.Max < lat)
				S.Max = lat;
				S.Sum += lat;
				}
				dberrisUnsubmitted Done Reply Inline Actions Potentially spurious empty line here. dberris: Potentially spurious empty line here.
				}

				// Evaluates an XRay record and performs accounting on it, creating and
				// decorating a function call graph as it does so. It does this by maintaining
				// a call stack on a per-thread basis and adding edges and verticies to the
				// graph as they are seen for the first time.
				//
				// There is an immaginary root for functions at the top of their stack with
				// FuncId 0.
				//
				// FIXME: make more robust to errors and
				// Decorate Graph More Heavily.
				// FIXME: Refactor this and account subcommand to reduce code duplication.
				bool GraphRenderer::accountRecord(const XRayRecord &Record) {
				if (CurrentMaxTSC == 0)
				dberrisUnsubmitted Done Reply Inline Actions Also potentially spurious empty line here. dberris: Also potentially spurious empty line here.
				CurrentMaxTSC = Record.TSC;

				if (Record.TSC < CurrentMaxTSC)
				dblaikieUnsubmitted Done Reply Inline Actions Extra semicolon dblaikie: Extra semicolon
				dblaikieUnsubmitted Done Reply Inline Actions Inconsistent bracing (why does the inner loop have braces but the outer loop doesn't? - personally I'd probably drop them from both, but I could see an argument for adding them to both - just doesn't seem right to add them only to one and not the other) dblaikie: Inconsistent bracing (why does the inner loop have braces but the outer loop doesn't?
				return false;
				dberrisUnsubmitted Done Reply Inline Actions Can you use `.emplace_back(...)` instead here? dberris: Can you use `.emplace_back(...)` instead here?
				varnoAuthorUnsubmitted Done Reply Inline Actions No. varno: No.

				auto &ThreadStack = PerThreadFunctionStack[Record.TId];
				switch (Record.Type) {
				case RecordTypes::ENTER: {
				if (VertexAttrs.count(Record.FuncId) == 0)
				VertexAttrs[Record.FuncId].SymbolName =
				FuncIdHelper.SymbolOrNumber(Record.FuncId);
				dblaikieUnsubmitted Done Reply Inline Actions Extra semi dblaikie: Extra semi
				ThreadStack.push_back({Record.FuncId, Record.TSC});
				break;
				}
				case RecordTypes::EXIT: {
				// FIXME: Refactor this and the account subcommand to reducr code
				dblaikieUnsubmitted Done Reply Inline Actions Drop these (it's just a static variable in this scope anyway) dblaikie: Drop these (it's just a static variable in this scope anyway)
				// duplication
				if (ThreadStack.size() == 0 \|\| ThreadStack.back().FuncId != Record.FuncId) {
				if (!DeduceSiblingCalls)
				return false;
				auto Parent = std::find_if(
				ThreadStack.rbegin(), ThreadStack.rend(),
				[&](const FunctionAttr &A) { return A.FuncId == Record.FuncId; });
				if (Parent == ThreadStack.rend())
				return false; // There is no matching Function for this exit.
				while (ThreadStack.back().FuncId != Record.FuncId) {
				uint64_t D = diff(ThreadStack.back().TSC, Record.TSC);
				int32_t TopFuncId = ThreadStack.back().FuncId;
				ThreadStack.pop_back();
				assert(ThreadStack.size() != 0);
				auto &EA = Graph[ThreadStack.back().FuncId][TopFuncId];
				dberrisUnsubmitted Done Reply Inline Actions So I think in a future change which we talked about offline, we ought to make this and the `account` tool work with each other in some form of refactored implementation -- where the account tool can depend on the graph tool instead of duplicating a lot of this logic. If you rebase against the `account` change (D24377) then the version used there now has less calls to .pop_back() on the vector, and also has a simpler loop as a result (i.e. you don't need the extra spillover logic, and the version there has less nesting and more straight-line code anyway. The actionable comment right now I think would be a `//FIXME: Refactor this and the account subcommand to reduce code duplication` dberris: So I think in a future change which we talked about offline, we ought to make this and the…
				EA.Timings.push_back(D);
				updateStat(EA.S, D);
				dberrisUnsubmitted Done Reply Inline Actions Maybe use a name that isn't overloaded in this context -- `D` is a perfectly fine letter for something that indicates the delta between two numbers. dberris: Maybe use a name that isn't overloaded in this context -- `D` is a perfectly fine letter for…
				updateStat(VertexAttrs[TopFuncId].S, D);
				}
				}
				uint64_t D = diff(ThreadStack.back().TSC, Record.TSC);
				ThreadStack.pop_back();
				auto &V = Graph[ThreadStack.empty() ? 0 : ThreadStack.back().FuncId];
				dberrisUnsubmitted Done Reply Inline Actions Please add an empty line between the end of function definitions, and the start of the next function. dberris: Please add an empty line between the end of function definitions, and the start of the next…
				auto &EA = V[Record.FuncId];
				EA.Timings.push_back(D);
				updateStat(EA.S, D);
				updateStat(VertexAttrs[Record.FuncId].S, D);
				break;
				}
				}

				return true;
				}

				template <typename U>
				void GraphRenderer::getStats(U begin, U end, GraphRenderer::TimeStat &S) {
				assert(begin != end);
				std::ptrdiff_t MedianOff = S.Count / 2;
				std::nth_element(begin, begin + MedianOff, end);
				S.Median = *(begin + MedianOff);
				std::ptrdiff_t Pct90Off = (S.Count * 9) / 10;
				std::nth_element(begin, begin + Pct90Off, end);
				dberrisUnsubmitted Done Reply Inline Actions Empty line between these two lines? dberris: Empty line between these two lines?
				S.Pct90 = *(begin + Pct90Off);
				std::ptrdiff_t Pct99Off = (S.Count * 99) / 100;
				std::nth_element(begin, begin + Pct99Off, end);
				S.Pct99 = *(begin + Pct99Off);
				}

				void GraphRenderer::calculateEdgeStatistics() {
				for (auto &V : Graph) {
				dblaikieUnsubmitted Done Reply Inline Actions Should be some common utility for this, so every tool doesn't have to go through the same hoops (probably coalesce all the instrumentation map extractor stuff as well) dblaikie: Should be some common utility for this, so every tool doesn't have to go through the same hoops…
				dberrisUnsubmitted Done Reply Inline Actions +1 -- if you rebase again to the latest of D24377 you can use the utility function that determines the supported loader function here. dberris: +1 -- if you rebase again to the latest of D24377 you can use the utility function that…
				varnoAuthorUnsubmitted Not Done Reply Inline Actions Yeah there probably should be. Not yet though AFAIK. varno: Yeah there probably should be. Not yet though AFAIK.
				for (auto &E : V.second) {
				auto &A = E.second;
				getStats(A.Timings.begin(), A.Timings.end(), A.S);
				}
				}
				}

				void GraphRenderer::calculateVertexStatistics() {
				DenseMap<int32_t, std::pair<uint64_t, SmallVector<EdgeAttribute *, 4>>>
				IncommingEdges;
				uint64_t MaxCount = 0;
				for (auto &V : Graph) {
				for (auto &E : V.second) {
				dberrisUnsubmitted Done Reply Inline Actions The indentation here is a little weird. Fix? dberris: The indentation here is a little weird. Fix?
				auto &IEV = IncommingEdges[E.first];
				IEV.second.push_back(&E.second);
				IEV.first += E.second.S.Count;
				if (IEV.first > MaxCount)
				MaxCount = IEV.first;
				}
				}
				std::vector<uint64_t> TempTimings;
				TempTimings.reserve(MaxCount);
				dberrisUnsubmitted Done Reply Inline Actions Empty line in between, and don't need void in the function arguments. dberris: Empty line in between, and don't need void in the function arguments.
				for (auto &V : IncommingEdges) {
				for (auto &P : V.second.second) {
				TempTimings.insert(TempTimings.end(), P->Timings.begin(),
				P->Timings.end());
				}
				getStats(TempTimings.begin(), TempTimings.end(), VertexAttrs[V.first].S);
				TempTimings.clear();
				dblaikieUnsubmitted Done Reply Inline Actions Is this optimization worthwhile? Or could we put this as a local variable in the outer loop below - no need to clear it, etc. dblaikie: Is this optimization worthwhile? Or could we put this as a local variable in the outer loop…
				varnoAuthorUnsubmitted Done Reply Inline Actions I think it is worth while, however I have no testing results. At this time, as I am working on a patch which changes how this code works completely due to a new data structure, I don't know. varno: I think it is worth while, however I have no testing results. At this time, as I am working on…
				dberrisUnsubmitted Done Reply Inline Actions I'd urge you to not change this patch, but instead stack one on top of it instead for any further changes you'd need to do. I'd much rather do that review of the refactoring separately than doing this review over with new data structures. dberris: I'd urge you to not change this patch, but instead stack one on top of it instead for any…
				}
				}
				dberrisUnsubmitted Done Reply Inline Actions No need for void in the function arguments. dberris: No need for void in the function arguments.

				dberrisUnsubmitted Done Reply Inline Actions Consider also using a `SmallVector` here, with potentially the same initial size as with the adjacency list we maintain. dberris: Consider also using a `SmallVector` here, with potentially the same initial size as with the…
				void GraphRenderer::normaliseStatistics(double CycleFrequency) {
				for (auto &V : Graph) {
				for (auto &E : V.second) {
				auto &S = E.second.S;
				S.Min /= CycleFrequency;
				S.Median /= CycleFrequency;
				dberrisUnsubmitted Done Reply Inline Actions It's thoroughly confusing too that you're using Timings here, but there's a data member named Timings as well. :) dberris: It's thoroughly confusing too that you're using Timings here, but there's a data member named…
				S.Max /= CycleFrequency;
				dblaikieUnsubmitted Done Reply Inline Actions Remove dead code dblaikie: Remove dead code
				S.Sum /= CycleFrequency;
				S.Pct90 /= CycleFrequency;
				S.Pct99 /= CycleFrequency;
				}
				}
				for (auto &V : VertexAttrs) {
				dberrisUnsubmitted Done Reply Inline Actions Is there no way to do this in a single pass, in the same loop above? I could imagine building a map that contains a vertex as the key, then accumulating statistics as you go traversing the graph (or even just storing the data anyway like you do here for the timings), and another pass to collapse that data. dberris: Is there no way to do this in a single pass, in the same loop above? I could imagine building a…
				varnoAuthorUnsubmitted Done Reply Inline Actions Really not, the way here has linear space and uses the required memory for each section. If we did not care about memory I could store the accumulation data for both edges and vertices. but otherwise ??? varno: Really not, the way here has linear space and uses the required memory for each section. If we…
				dberrisUnsubmitted Done Reply Inline Actions Right, so the alternate body of the function seems to be more straight-forward, and the amount of memory being used isn't that bad. You can even be smart about this and have an upper bound on the elements in the vector, and as you insert you keep a relative order and track median, and other percentiles incrementally. That's not as critical as the functionality though, and we can tune this as we go along late anyway if we encounter really big graphs in practice that would cause this to be a huge problem (either as implemented or even in the alternative version). I'm happy either way if you choose to stick with the current implementation, but if you do consider just removing the alternate implementation you have here `#ifdef 0`'d out. dberris: Right, so the alternate body of the function seems to be more straight-forward, and the amount…
				auto &S = V.second.S;
				S.Min /= CycleFrequency;
				dberrisUnsubmitted Done Reply Inline Actions Missing empty line between these two. dberris: Missing empty line between these two.
				S.Median /= CycleFrequency;
				S.Max /= CycleFrequency;
				S.Sum /= CycleFrequency;
				S.Pct90 /= CycleFrequency;
				S.Pct99 /= CycleFrequency;
				}
				}

				namespace {
				void outputEdgeInfo(const GraphRenderer::TimeStat &S, GraphRenderer::StatType T,
				raw_ostream &OS) {
				switch (T) {
				case GraphRenderer::StatType::COUNT:
				OS << S.Count;
				break;
				case GraphRenderer::StatType::MIN:
				OS << S.Min;
				break;
				case GraphRenderer::StatType::MED:
				OS << S.Median;
				break;
				case GraphRenderer::StatType::PCT90:
				OS << S.Pct90;
				break;
				case GraphRenderer::StatType::PCT99:
				OS << S.Pct99;
				break;
				case GraphRenderer::StatType::MAX:
				OS << S.Max;
				break;
				case GraphRenderer::StatType::SUM:
				OS << S.Sum;
				break;
				}
				}
				}

				// Outputs a DOT format version of the Graph embedded in the GraphRenderer
				// object on OS. It does this in the expected way by itterating
				// through all edges then vertices and then outputting them and their
				// annotations.
				//
				// FIXME: output more information, better presented.
				void GraphRenderer::exportGraphAsDOT(raw_ostream &OS, const XRayFileHeader &H,
				StatType T) {
				calculateEdgeStatistics();
				calculateVertexStatistics();
				if (H.CycleFrequency)
				normaliseStatistics(H.CycleFrequency);

				OS << "digraph xray {\n";

				for (const auto &V : Graph)
				for (const auto &E : V.second) {
				OS << "F" << V.first << " -> "
				<< "F" << E.first << " [label=\"";
				outputEdgeInfo(E.second.S, T, OS);
				OS << "\"];\n";
				dblaikieUnsubmitted Done Reply Inline Actions Probably skip the extra language like "does what the name suggests" and "does this in the expected way". If it's pretty obvious/self explanatory, then a brief comment is OK. dblaikie: Probably skip the extra language like "does what the name suggests" and "does this in the…
				}

				for (const auto &V : VertexAttrs)
				OS << "F" << V.first << " [label=\""
				<< (V.second.SymbolName.size() > 40
				? V.second.SymbolName.substr(0, 40) + "..."
				: V.second.SymbolName)
				<< "\"];\n";

				OS << "}\n";
				}

				// Here we register and implement the llvm-xray graph subcommand.
				// The bulk of this code reads in the options, opens the required files, uses
				// those files to create a context for analysing the xray trace, then there is a
				// short loop which actually analyses the trace, generates the graph and then
				// outputs it as a DOT.
				//
				// FIXME: include additional filtering and annalysis passes to provide more
				// specific useful information.
				static CommandRegistration Unused(&Graph, []() -> Error {
				int Fd;
				auto EC = sys::fs::openFileForRead(GraphInput, Fd);
				if (EC)
				return make_error<StringError>(
				Twine("Cannot open file '") + GraphInput + "'", EC);

				Error Err = Error::success();
				xray::InstrumentationMapExtractor Extractor(GraphInstrMap, InstrMapFormat,
				Err);
				handleAllErrors(std::move(Err),
				[&](const ErrorInfoBase &E) { E.log(errs()); });

				const auto &FunctionAddresses = Extractor.getFunctionAddresses();

				symbolize::LLVMSymbolizer::Options Opts(
				symbolize::FunctionNameKind::LinkageName, true, true, false, "");

				symbolize::LLVMSymbolizer Symbolizer(Opts);

				llvm::xray::FuncIdConversionHelper FuncIdHelper(GraphInstrMap, Symbolizer,
				FunctionAddresses);

				xray::GraphRenderer GR(FuncIdHelper, GraphDeduceSiblingCalls);

				raw_fd_ostream OS(GraphOutput, EC, sys::fs::OpenFlags::F_Text);

				if (EC)
				return make_error<StringError>(
				Twine("Cannot open file '") + GraphOutput + "' for writing.", EC);

				auto TraceOrErr = loadTraceFile(GraphInput, true);

				dblaikieUnsubmitted Done Reply Inline Actions Should this be silently handled? If a user specifies a file, seems like we should error if it's not there (@dberris - goes for existing tools too, I imagine, guess I didn't notice this in the others) dblaikie: Should this be silently handled? If a user specifies a file, seems like we should error if it's…
				dberrisUnsubmitted Done Reply Inline Actions I think we at least should print something to the effect of "well, we can't symbolise properly". I'm happy with making this an explicit error, to not give users potentially misleading results. dberris: I think we at least should print something to the effect of "well, we can't symbolise properly".
				if (!TraceOrErr) {
				return joinErrors(
				make_error<StringError>(
				Twine("Failed loading input file '") + GraphInput + "'",
				std::make_error_code(std::errc::protocol_error)),
				std::move(Err));
				dberrisUnsubmitted Done Reply Inline Actions I'm looking at these lines and thinking it seems they're unnecessarily exposing implementation details -- really these things are something the graph renderer can decide itself if it took the file header as an argument to the constructor. And these really ought to just happen when we're exporting the data as DOT and passing arguments to the kind of data we want to see. dberris: I'm looking at these lines and thinking it seems they're unnecessarily exposing implementation…
				}

				auto &Trace = *TraceOrErr;
				const auto &Header = Trace.getFileHeader();
				for (const auto &Record : Trace) {
				// Generate graph, FIXME: better error recovery.
				if (!GR.accountRecord(Record)) {
				return make_error<StringError>(
				Twine("Failed accounting function calls in file '") + GraphInput +
				"'.",
				std::make_error_code(std::errc::bad_message));
				dblaikieUnsubmitted Done Reply Inline Actions Move this to where it's used (then you could even use 'const auto ' if you like dblaikie:* Move this to where it's used (then you could even use 'const auto *' if you like
				dberrisUnsubmitted Done Reply Inline Actions Given the changes that have landed now, there's a better way of doing this (if we look at what's happening in `llvm-xray account` at least). dberris: Given the changes that have landed now, there's a better way of doing this (if we look at…
				}
				}

				GR.exportGraphAsDOT(OS, Header, GraphEdgeLabel);
				return Error::success();
				});

This is an archive of the discontinued LLVM Phabricator instance.

Initial work on the XRay Graph tool.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 84216

test/tools/llvm-xray/X86/graph-deduce-tail-call.yaml

test/tools/llvm-xray/X86/graph-simple-case.yaml

tools/llvm-xray/CMakeLists.txt

tools/llvm-xray/xray-graph.h

tools/llvm-xray/xray-graph.cc

Initial work on the XRay Graph tool.
ClosedPublic