This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
test/tools/llvm-xray/X86/
-
tools/
-
llvm-xray/
-
X86/
-
Inputs/
-
elf64-objcopied-instrmap.bin
-
elf64-sample-o2.bin
-
naive-log-simple.xray
-
simple-xray-instrmap.yaml
-
bad-instrmap-sizes.bin
-
bad-instrmap-sizes.txt
-
convert-roundtrip.yaml
-
convert-to-yaml.txt
-
convert-with-debug-syms.txt
-
convert-with-standalone-instrmap.txt
-
convert-with-yaml-instrmap.txt
-
tools/llvm-xray/
-
llvm-xray/
-
CMakeLists.txt
-
func-id-helper.h
-
func-id-helper.cc
-
xray-converter.h
-
xray-converter.cc
-
xray-extract.cc
-
xray-log-reader.h
-
xray-log-reader.cc
-
xray-record-yaml.h
-
xray-record.h

Differential D24376

[XRay] Implement `llvm-xray convert` -- trace file conversion
ClosedPublic

Authored by dberris on Sep 8 2016, 10:59 PM.

Download Raw Diff

Details

Reviewers

dblaikie
echristo

Commits

rGf8f909f848e7: [XRay] Implement `llvm-xray convert` -- trace file conversion
rL291529: [XRay] Implement `llvm-xray convert` -- trace file conversion

Summary

This is the second part of a multi-part change to define additional
subcommands to the llvm-xray tool.

This change defines a conversion subcommand to take XRay log files, and
turns them from one format to another (binary or YAML). This currently
only supports the first version of the log file format, defined in the
compiler-rt runtime.

Depends on D21987.

Diff Detail

Repository: rL LLVM

Event Timeline

dberris updated this revision to Diff 70787.Sep 8 2016, 10:59 PM

dberris retitled this revision from to [XRay] Implement `llvm-xray convert` -- trace file conversion.

dberris updated this object.

dberris added reviewers: dblaikie, echristo.

dberris added a parent revision: D21987: [XRay] Implement `llvm-xray extract`, start of the llvm-xray tool.

dberris added a subscriber: llvm-commits.

Herald added subscribers: beanz, dberris, mehdi_amini. · View Herald TranscriptSep 8 2016, 10:59 PM

dberris added a child revision: D24377: [XRay] Implement the `llvm-xray account` subcommand.Sep 8 2016, 11:01 PM

Use the command registry instead of hard-coding into main

Herald added a subscriber: mgorny. · View Herald TranscriptOct 5 2016, 2:36 AM

Use the command registry instead of hard-coding into main
Rebase

Use llvm::Error

Rebase

Rebase

I'll be adding more tests and sample inputs for this tool, and probably use DataExtractor too for loading the records from the XRay log files. Please hold off on reviewing @dblaikie.

Add test for converting a simple raw log to YAML

PTAL @dblaikie -- do you think using the DataExtractor here would be a good idea?

Before I get into all the details - this seems similar to the extract tool, except it deals with the log rather than the instrumentation map, right?

So pretty much all the same feedback as there (& I don't necessarily remember the original answers). We ended up with a one way conversion last time (to yaml, but not back again), should this one be different? If we only convert one way, is 'convert' the right name? Should be be more explicit about the difference between these two things, or just detect based on file magic & use one command name for both kinds of input files? (check if it's an ELF file then assume the user is trying to extract an instrumentation map, and otherwise assume the file's a log?)

tools/llvm-xray/func-id-helper.cc
40–42 ↗	(On Diff #75988)	We don't usually bother putting {} on single-line if statements.
49–53 ↗	(On Diff #75988)	Probably drop the {} here too (maybe even use the conditional operator, if you like)
tools/llvm-xray/xray-record.h
33 ↗	(On Diff #73610)	Is the alignas attribute supported on all the platforms/compilers we care about?
53 ↗	(On Diff #73610)	There's LLVM_PACKED you may need to use to make this portable to MSVC, by the looks of it. (though I'm still intrinsically suspicious of splatting in/out of memory, and would be more comfortable seeing just 'obvious' code to read bytes and shift them into values, etc - but perhaps I'm in the minority/an outlier here and this sort of code is OK with everyone else)

A few of the inline comments from my last feedback are probably out of date/not relevant - I had them queued up from weeks/months ago.

That said - taking a guess at it: Yes, I'd use DataExtractor here over splatting/memcpying structs around for the same reasons.

In D24376#581421, @dblaikie wrote:

Before I get into all the details - this seems similar to the extract tool, except it deals with the log rather than the instrumentation map, right?

So pretty much all the same feedback as there (& I don't necessarily remember the original answers). We ended up with a one way conversion last time (to yaml, but not back again), should this one be different? If we only convert one way, is 'convert' the right name? Should be be more explicit about the difference between these two things, or just detect based on file magic & use one command name for both kinds of input files? (check if it's an ELF file then assume the user is trying to extract an instrumentation map, and otherwise assume the file's a log?)

The value of having a convert function is so that we can support other formats specifically for the xray log. Here are some future things we need to support:

Convert from an old version of the log into a newer version.
Convert from the XRay log format to another format. One of them is the Chrome Trace Viewer format, something that the Google Performance Tools (https://github.com/gperftools/gperftools) can consume, etc.

For now, the most convenient thing we can do is create YAML files. The important part here is the log file loading library, that we use in the accounting implementation (stacked on top of this change).

So the verb really is to "convert", as opposed to just "extract".

Does this make sense?

That said, I'll apply most of the changes in the 'extract' review here.

Cheers

Add more tests, fix some bugs
Use DataExtractor, refactor a bit to use Error properly, and use a simpler implementation of the log reader.

Herald added a subscriber: modocache. · View Herald TranscriptNov 1 2016, 1:21 AM

Ready for a look now @dblaikie, thanks in advance!

PS. We no longer support round-tripping in this implementation.

dblaikie added inline comments.Nov 1 2016, 9:54 AM

tools/llvm-xray/func-id-helper.cc
28–29 ↗	(On Diff #76528)	Fold ResOrErr into the if condition to reduce/match its scope to its use?
49–50 ↗	(On Diff #76528)	Fold variable into if condition. (alternatively - and this applies to the other case of the same - consider a short-exit: if (!ResOrErr) { handleAllErrors(...) return F.str(); } I know this means duplicating the "Return F.str()" code - but allows the main non-error code to be unindented, can make it easier to follow (by keeping the error handling close to the error, too).)
53–57 ↗	(On Diff #76528)	Drop {} on single line blocks, probably
tools/llvm-xray/func-id-helper.h
39 ↗	(On Diff #76528)	This should be implicit? Or did you want it to disable the other special members? Any particular reason? (might be worth a comment if that's the case)
tools/llvm-xray/xray-converter.cc
121–124 ↗	(On Diff #76528)	If we need to support these formats for both input and output - I don't so much mind whether we support the null conversion/roundtrip. The point in the previous review was that we didn't need to handle binary output nor yaml input - but it sounds like you need all the different input and output modes here? (at the very least the native format will be a valid input (for conversion to non-native formats) and output (for upgrading) format, for example. So, up to you - if supporting roundtripping is convenient/nice generalization, that's fine
126–129 ↗	(On Diff #76528)	Hmm - now I'm a bit confused given after my last comment. If we don't support roundtripping, and we don't support binary output - then the only thing we support is binary input and YAML output, so why the switch a few lines below that chooses the input format between binary and yaml? Isn't the yaml reading case in that switch unreachable and thus the YAMLogLoader unused/untested? Indeed, no test seems to exercise -i/-input-format?
tools/llvm-xray/xray-extract.cc
193–194 ↗	(On Diff #76528)	Roll variable into if condition
203–204 ↗	(On Diff #76528)	Guessing it's mapped rather than streamed because that's the API for DataExtractor (dealing with a buffer)? Or does it have other needs to be mapped? (might be nice if it didn't have to be - but I suppose there's no strong reason/benefit to it... )
210 ↗	(On Diff #76528)	Might make sense to pass MappedFile.size() rather than fileSize, even though they're the same (& even maybe just having MappedFile have a way to access a StringRef of the range)
tools/llvm-xray/xray-log-reader.cc
87 ↗	(On Diff #76528)	I probably wouldn't bother with explicit scope here (I usually only use them if I need the dtor behavior) - but I appreciate the benefit to scoping the variables and don't mind it if you prefer it this way.
112 ↗	(On Diff #76528)	unneeded extra scope here?
tools/llvm-xray/xray-record.h
23 ↗	(On Diff #76528)	Guessing this doesn't need an alignment attribute anymore?
38 ↗	(On Diff #76528)	Is there an enum that should be used for this member?

Address some comments
Add FIXME to support binary conversion

Thanks @dblaikie, PTAL.

tools/llvm-xray/func-id-helper.cc
49–50 ↗	(On Diff #76528)	I like just limiting the scope of the variable, so I stuck with that. Also, now using sys::path::filename instead of manually futzing with the string.
tools/llvm-xray/xray-converter.cc
126–129 ↗	(On Diff #76528)	Right, I think the better message here is "not yet supported". When we need the functionality then it seems better to change this then.
tools/llvm-xray/xray-extract.cc
203–204 ↗	(On Diff #76528)	Yeah, the API for the DataExtractor deals with a buffer. We could be reading things in chunks at a time explicitly, but that seems unnecessary (unless mmapping is undesirable for other reasons).
tools/llvm-xray/xray-log-reader.cc
87 ↗	(On Diff #76528)	No strong preference, this was a remnant of attempting to use the same variable name for the extractor -- but there's really no good reason for the explicit scope. :)

Just in case it wasn't clear, this one's ready for another look @dblaikie.

dblaikie added inline comments.Nov 21 2016, 1:52 PM

tools/llvm-xray/func-id-helper.cc
49–50 ↗	(On Diff #76528)	I'll say, on reflection, that I kind of prefer keeping the diagnostic handling closer - rather than "if (success) { next bit of code } else { error handling for the if condition }" but I'll leave that up to you - can get a feel for it and always change it later if it starts to get unwieldy
tools/llvm-xray/xray-converter.cc
126–129 ↗	(On Diff #76528)	Dead/untested code is a to be avoided - so if the YAMLLogLoader is unused/untested, it probably shouldn't be committed yet. It's difficult to keep track of test coverage if code is committed without coverage - then when the code becomes live it's not obvious that it was previously untested, etc.
152 ↗	(On Diff #76910)	Spurious semicolon?
tools/llvm-xray/xray-extract.cc
203–204 ↗	(On Diff #76528)	Can be nice to be able to read streaming input (general good for uniformity - lots of tools accept '-' as the filename to read from stdin), etc. Not a strong requirement by any means.
tools/llvm-xray/xray-log-reader.cc
113 ↗	(On Diff #76910)	Could use 'emplace_back()' if you reckon that's more readable (I don't think it's less readable than {} at least).
tools/llvm-xray/xray-record.h
38 ↗	(On Diff #76528)	The comment "Usually either ENTER = 0 or EXIT = 1" is no longer needed, as it's implied by the type. (perhaps could be reworded, etc - "Identifies this as an enter or exit record" or somesuch if that's particularly useful - though the enum is only a few lines away)

Implement conversion to v1 binary format

tools/llvm-xray/xray-converter.cc
126–129 ↗	(On Diff #76528)	Good point. I've re-implemented the binary output, and the round-tripping.
tools/llvm-xray/xray-extract.cc
203–204 ↗	(On Diff #76528)	Actually, now that I read this part of the code again, it's the YAML `Input` that requires a full document to be provided in its constructor. http://llvm.org/docs/YamlIO.html#input
tools/llvm-xray/xray-record.h
38 ↗	(On Diff #76528)	Reworded to be short and sweet.

Ready for another look now.

(I was thinking more going the other direction: removing the dead code, rather than adding a use of it)

What's the purpose of the binary/raw output format? (I think it was mentioned that the binary output format would be for testing - so we could write yaml, then generate raw files - then feed those into other tools for testing? But if the yaml input format is supported everywhere, what's the purpose of that? We'll want to have a couple of binary tests, but I imagine they would test checked in binary -> yaml output (if they test yaml -> binary -> yaml, then I'm not sure they achieve much because it's just roundtripping so we have as much code under test (binary -> yaml) as we do in the test harness (yaml -> binary), effectively)

That's my mental model at least.

(this commit seems to add several .bin test files - but I don't see where those are used?)

tools/llvm-xray/xray-converter.cc
175–176 ↗	(On Diff #78832)	This seems like an odd construct - should Extractor not error-out in the case where ConvertInstrMap is empty?

Address review comments

In D24376#602808, @dblaikie wrote:

(I was thinking more going the other direction: removing the dead code, rather than adding a use of it)

What's the purpose of the binary/raw output format? (I think it was mentioned that the binary output format would be for testing - so we could write yaml, then generate raw files - then feed those into other tools for testing? But if the yaml input format is supported everywhere, what's the purpose of that? We'll want to have a couple of binary tests, but I imagine they would test checked in binary -> yaml output (if they test yaml -> binary -> yaml, then I'm not sure they achieve much because it's just roundtripping so we have as much code under test (binary -> yaml) as we do in the test harness (yaml -> binary), effectively)

That's my mental model at least.

Other things going on concurrently (work on the FDR mode for the runtime library) is making me need to be able to turn versions of the binary log from one form to another. I've found that working with the binary versions is much more convenient from a tool perspective, and that this current simple format is more amenable to analysis than the condensed format I'm working on.

So effectively, version 2 of the log format is turning out to have both non-fixed-size records and more interleaving happening. This tool, that's able to take that complex format into something simpler/different doesn't make sense for generating just YAML files.

(this commit seems to add several .bin test files - but I don't see where those are used?)

Some of the tests are looking at symbolization of the function id's, and associating functions with debug info, etc. for coverage of the various modes by which the YAML output could be generated.

tools/llvm-xray/xray-converter.cc
175–176 ↗	(On Diff #78832)	Good point. Technically, that is an error in the use of the extractor's constructor, hence it signalling an error. Think about it as if the construction of the extractor had thrown an exception -- and in this case, the appropriate action is to ignore that exception. So this is saying, if the filename was empty, then the extractor couldn't be initialised and therefore we ignore the error. In the case that it wasn't empty, then we should emit the error, but still not cause the tool to fail in execution. Changed both here and the extract sub-command where we check the filename explicitly to not be empty.

It seems Phabricator had been swallowing some of my comments (and not sending out email messages).

This is ready for another look @dblaikie.

In D24376#603715, @dberris wrote:

In D24376#602808, @dblaikie wrote:

(I was thinking more going the other direction: removing the dead code, rather than adding a use of it)

What's the purpose of the binary/raw output format? (I think it was mentioned that the binary output format would be for testing - so we could write yaml, then generate raw files - then feed those into other tools for testing? But if the yaml input format is supported everywhere, what's the purpose of that? We'll want to have a couple of binary tests, but I imagine they would test checked in binary -> yaml output (if they test yaml -> binary -> yaml, then I'm not sure they achieve much because it's just roundtripping so we have as much code under test (binary -> yaml) as we do in the test harness (yaml -> binary), effectively)

That's my mental model at least.

Other things going on concurrently (work on the FDR mode for the runtime library) is making me need to be able to turn versions of the binary log from one form to another. I've found that working with the binary versions is much more convenient from a tool perspective, and that this current simple format is more amenable to analysis than the condensed format I'm working on.

I'm not sure I'm following here - could you describe this in more detail?

My understanding was that tools would likely use these LLVM APIs for data handling (or they'd use YAML input) - so there wouldn't be any benefit to any particular format (except YAML) since these LLVM APIs could handle all the formats. This would make the binary format(s) fairly niche - just convenient for runtime emission, but not desirable for any other use.

Where have I gone wrong here?

So effectively, version 2 of the log format is turning out to have both non-fixed-size records and more interleaving happening. This tool, that's able to take that complex format into something simpler/different doesn't make sense for generating just YAML files.

(this commit seems to add several .bin test files - but I don't see where those are used?)

Some of the tests are looking at symbolization of the function id's, and associating functions with debug info, etc. for coverage of the various modes by which the YAML output could be generated.

Not quite following, more practically: which tests are these files used in?

In D24376#604850, @dblaikie wrote:

In D24376#603715, @dberris wrote:

In D24376#602808, @dblaikie wrote:

(I was thinking more going the other direction: removing the dead code, rather than adding a use of it)

What's the purpose of the binary/raw output format? (I think it was mentioned that the binary output format would be for testing - so we could write yaml, then generate raw files - then feed those into other tools for testing? But if the yaml input format is supported everywhere, what's the purpose of that? We'll want to have a couple of binary tests, but I imagine they would test checked in binary -> yaml output (if they test yaml -> binary -> yaml, then I'm not sure they achieve much because it's just roundtripping so we have as much code under test (binary -> yaml) as we do in the test harness (yaml -> binary), effectively)

That's my mental model at least.

Other things going on concurrently (work on the FDR mode for the runtime library) is making me need to be able to turn versions of the binary log from one form to another. I've found that working with the binary versions is much more convenient from a tool perspective, and that this current simple format is more amenable to analysis than the condensed format I'm working on.

I'm not sure I'm following here - could you describe this in more detail?

Sure, there's a patch I've uploaded recently that has a lot more about this different format (see D27038).

My understanding was that tools would likely use these LLVM APIs for data handling (or they'd use YAML input) - so there wouldn't be any benefit to any particular format (except YAML) since these LLVM APIs could handle all the formats. This would make the binary format(s) fairly niche - just convenient for runtime emission, but not desirable for any other use.

Where have I gone wrong here?

That's one half of the story.

The other half is the new log format (still binary) is not conducive to this simple approach of loading up records with full information. The FDR mode log format is substantially different, in that per-thread buffers are chunked up into fixed-sized blocks and each record might be a metadata record (16 bytes) or a function record (8 bytes). That log has all sorts of information consolidated in very different ways (e.g. the thread ID is stored once, as a metadata record at the beginning of a buffer, CPU id's only show up when the CPU is first encountered, we store TSC deltas instead of full TSCs in function records), and is not conducive to just "load in memory, read records one by one".

So then, the conversion tool ought to be able to convert the FDR mode logs into this simpler log format that we already support in this tool -- then let all the other tools upcoming (accounting, timeline visualisation, etc.) just use the simpler log format.

The closest analogy I can make for existing tools that do something similar would be 'dot' and friends.

So effectively, version 2 of the log format is turning out to have both non-fixed-size records and more interleaving happening. This tool, that's able to take that complex format into something simpler/different doesn't make sense for generating just YAML files.

(this commit seems to add several .bin test files - but I don't see where those are used?)

Some of the tests are looking at symbolization of the function id's, and associating functions with debug info, etc. for coverage of the various modes by which the YAML output could be generated.

Not quite following, more practically: which tests are these files used in?

convert-with-debug-syms.txt
convert-with-standalone-instrmap.txt
convert-with-yaml-instrmap.txt

In D24376#604864, @dberris wrote:

In D24376#604850, @dblaikie wrote:

In D24376#603715, @dberris wrote:

In D24376#602808, @dblaikie wrote:

(I was thinking more going the other direction: removing the dead code, rather than adding a use of it)

What's the purpose of the binary/raw output format? (I think it was mentioned that the binary output format would be for testing - so we could write yaml, then generate raw files - then feed those into other tools for testing? But if the yaml input format is supported everywhere, what's the purpose of that? We'll want to have a couple of binary tests, but I imagine they would test checked in binary -> yaml output (if they test yaml -> binary -> yaml, then I'm not sure they achieve much because it's just roundtripping so we have as much code under test (binary -> yaml) as we do in the test harness (yaml -> binary), effectively)

That's my mental model at least.

Other things going on concurrently (work on the FDR mode for the runtime library) is making me need to be able to turn versions of the binary log from one form to another. I've found that working with the binary versions is much more convenient from a tool perspective, and that this current simple format is more amenable to analysis than the condensed format I'm working on.

I'm not sure I'm following here - could you describe this in more detail?

Sure, there's a patch I've uploaded recently that has a lot more about this different format (see D27038).

My understanding was that tools would likely use these LLVM APIs for data handling (or they'd use YAML input) - so there wouldn't be any benefit to any particular format (except YAML) since these LLVM APIs could handle all the formats. This would make the binary format(s) fairly niche - just convenient for runtime emission, but not desirable for any other use.

Where have I gone wrong here?

That's one half of the story.

The other half is the new log format (still binary) is not conducive to this simple approach of loading up records with full information. The FDR mode log format is substantially different, in that per-thread buffers are chunked up into fixed-sized blocks and each record might be a metadata record (16 bytes) or a function record (8 bytes). That log has all sorts of information consolidated in very different ways (e.g. the thread ID is stored once, as a metadata record at the beginning of a buffer, CPU id's only show up when the CPU is first encountered, we store TSC deltas instead of full TSCs in function records), and is not conducive to just "load in memory, read records one by one".

So then, the conversion tool ought to be able to convert the FDR mode logs into this simpler log format that we already support in this tool -- then let all the other tools upcoming (accounting, timeline visualisation, etc.) just use the simpler log format.

I'm still a bit confused though - why not just convert to the yaml format at that point? (& why convert at all - if we're talking about other tools built on top of LLVM's library (tools built outside of that would presumably only rely on the YAML?) would use a common parser/reader API that could handle all formats, right? Rather than having the user run a conversion tool, then the tool they want to use)

The closest analogy I can make for existing tools that do something similar would be 'dot' and friends.

So effectively, version 2 of the log format is turning out to have both non-fixed-size records and more interleaving happening. This tool, that's able to take that complex format into something simpler/different doesn't make sense for generating just YAML files.

(this commit seems to add several .bin test files - but I don't see where those are used?)

Some of the tests are looking at symbolization of the function id's, and associating functions with debug info, etc. for coverage of the various modes by which the YAML output could be generated.

Not quite following, more practically: which tests are these files used in?

convert-with-debug-syms.txt
convert-with-standalone-instrmap.txt
convert-with-yaml-instrmap.txt

ah, hmm - I saw the .bin files (some of them) added in nthe last update - so I guess maybe they just hadn't been added previously, but the uses were already there. No worries.

In D24376#604995, @dblaikie wrote:

In D24376#604864, @dberris wrote:

The other half is the new log format (still binary) is not conducive to this simple approach of loading up records with full information. The FDR mode log format is substantially different, in that per-thread buffers are chunked up into fixed-sized blocks and each record might be a metadata record (16 bytes) or a function record (8 bytes). That log has all sorts of information consolidated in very different ways (e.g. the thread ID is stored once, as a metadata record at the beginning of a buffer, CPU id's only show up when the CPU is first encountered, we store TSC deltas instead of full TSCs in function records), and is not conducive to just "load in memory, read records one by one".

So then, the conversion tool ought to be able to convert the FDR mode logs into this simpler log format that we already support in this tool -- then let all the other tools upcoming (accounting, timeline visualisation, etc.) just use the simpler log format.

I'm still a bit confused though - why not just convert to the yaml format at that point? (& why convert at all - if we're talking about other tools built on top of LLVM's library (tools built outside of that would presumably only rely on the YAML?) would use a common parser/reader API that could handle all formats, right? Rather than having the user run a conversion tool, then the tool they want to use)

YAML is a little big and verbose -- the blow-up from the binary format to YAML is a factor of at least 10. In practical terms, dealing with the binary format in terms of storage and transport is a good option.

YAML is convenient for human consumption, and if users ever build tools that use scripting languages or other things already deal with YAML (or JSON, or some other text format) then that's the convenient path to take.

Maybe the solution here is to move the log reading part into something that's part of the LLVM library, but that still has the problem that reconstruction of a consistent timeline representing events on multiple threads is much more complex than one that just takes a linear "simple" binary file. Then the XRay log file that has non-fixed-size records and more complex encoding ought to be converted to the simplified form.

So the compromise/trade-off being made here is that we localize the knowledge of dealing with the different XRay binary log formats to this tool, that can turn it into "the simplest XRay log format", that can be read by simpler libraries for consumption. Given that, and the plan of being able to convert these XRay log files into other non-XRay specific formats (Chrome trace viewer format which is JSON, or the perftools utilities, etc.) is a huge convenience. This convenience is not only for testing purposes but also for building and integrating with other existing tools that deal with all sorts of traces.

Does this make sense?

Rebase

Rebase again

Squash local commits in the hopes of producing a simpler patch.

Ping? @dblaikie

So then, the conversion tool ought to be able to convert the FDR mode logs into this simpler log format that we already support in this tool -- then let all the other tools upcoming (accounting, timeline visualisation, etc.) just use the simpler log format.

I'm still a bit confused though - why not just convert to the yaml format at that point? (& why convert at all - if we're talking about other tools built on top of LLVM's library (tools built outside of that would presumably only rely on the YAML?) would use a common parser/reader API that could handle all formats, right? Rather than having the user run a conversion tool, then the tool they want to use)

YAML is a little big and verbose -- the blow-up from the binary format to YAML is a factor of at least 10. In practical terms, dealing with the binary format in terms of storage and transport is a good option.

I can agree with this.

YAML is convenient for human consumption, and if users ever build tools that use scripting languages or other things already deal with YAML (or JSON, or some other text format) then that's the convenient path to take.

Well, not entirely convenient. It would be nice to have a "dumping" output that produces an easy to consume by normal humans format that isn't the full yaml format for specifying.

Maybe the solution here is to move the log reading part into something that's part of the LLVM library, but that still has the problem that reconstruction of a consistent timeline representing events on multiple threads is much more complex than one that just takes a linear "simple" binary file. Then the XRay log file that has non-fixed-size records and more complex encoding ought to be converted to the simplified form.

I think that the format reading should definitely be a part of the llvm library and not the tool in particular. The tool should just wrap the library.

So the compromise/trade-off being made here is that we localize the knowledge of dealing with the different XRay binary log formats to this tool, that can turn it into "the simplest XRay log format", that can be read by simpler libraries for consumption. Given that, and the plan of being able to convert these XRay log files into other non-XRay specific formats (Chrome trace viewer format which is JSON, or the perftools utilities, etc.) is a huge convenience. This convenience is not only for testing purposes but also for building and integrating with other existing tools that deal with all sorts of traces.

Does this make sense?

Being able to convert seems useful in general. You want to be able to write testcases in something human readable, but you want the default output to be binary for compactness.

-eric

dberris added a child revision: D28345: [XRay] Define the library for XRay trace logs.Jan 4 2017, 10:37 PM

In D24376#634790, @echristo wrote:

I think that the format reading should definitely be a part of the llvm library and not the tool in particular. The tool should just wrap the library.

Cool, thanks Eric -- D28345 does just the log reading library which depends on this change landing.

So the compromise/trade-off being made here is that we localize the knowledge of dealing with the different XRay binary log formats to this tool, that can turn it into "the simplest XRay log format", that can be read by simpler libraries for consumption. Given that, and the plan of being able to convert these XRay log files into other non-XRay specific formats (Chrome trace viewer format which is JSON, or the perftools utilities, etc.) is a huge convenience. This convenience is not only for testing purposes but also for building and integrating with other existing tools that deal with all sorts of traces.

Does this make sense?

Being able to convert seems useful in general. You want to be able to write testcases in something human readable, but you want the default output to be binary for compactness.

Good point. So I think "conversion" is a bit more loaded than just "dump", where "dump" really just makes the representation more human-readable (for debugging, or just turning machine-readable form to human-readable form).

Will renaming this from 'convert' to 'dump' make that better?

-eric

Good point. So I think "conversion" is a bit more loaded than just "dump", where "dump" really just makes the representation more human-readable (for debugging, or just turning machine-readable form to human-readable form).

Will renaming this from 'convert' to 'dump' make that better?

I don't think that'd be accurate - this is doing format conversions between semantic preserving representations intended for machine consumption. Our dumping tools are generally intended only for human consumption (or really hacky machine consumption in scripts for quick work sometimes (eg: some scripts I'm writing using objdump/dwarfdump for doing some object size analysis)).

tools/llvm-xray/xray-extract.cc
220 ↗	(On Diff #80031)	push_back? (since you're specifying the type anyway, etc)
tools/llvm-xray/xray-log-reader.cc
165 ↗	(On Diff #80031)	I'd skip the 'Tmp' temporary, and put the result directly into 'Records' (because it seems simpler - but arguably could have slightly improved performance if 'Records' already has a large capacity that can be reused)

This revision is now accepted and ready to land.Jan 9 2017, 11:25 AM

Address review comments

Closed by commit rL291529: [XRay] Implement `llvm-xray convert` -- trace file conversion (authored by dberris). · Explain WhyJan 9 2017, 6:49 PM

This revision was automatically updated to reflect the committed changes.

dberris mentioned this in rL291531: [XRay] Don't include <unistd.h> unnecessarily.Jan 9 2017, 7:02 PM

dberris mentioned this in rL291533: [XRay] Fixup includes for modules build.Jan 9 2017, 7:32 PM

dberris mentioned this in rL291538: [XRay] Use regular expression for finding symbols.Jan 9 2017, 8:43 PM

Revision Contents

Path

Size

llvm/

trunk/

test/

tools/

llvm-xray/

X86/

Inputs/

elf64-objcopied-instrmap.bin

elf64-sample-o2.bin

naive-log-simple.xray

simple-xray-instrmap.yaml

14 lines

bad-instrmap-sizes.bin

3 lines

bad-instrmap-sizes.txt

3 lines

convert-roundtrip.yaml

28 lines

convert-to-yaml.txt

23 lines

convert-with-debug-syms.txt

23 lines

convert-with-standalone-instrmap.txt

23 lines

convert-with-yaml-instrmap.txt

23 lines

tools/

llvm-xray/

8 lines

49 lines

60 lines

39 lines

222 lines

63 lines

57 lines

165 lines

102 lines

55 lines

Diff 83765

llvm/trunk/test/tools/llvm-xray/X86/Inputs/elf64-objcopied-instrmap.bin

This is a binary file.

Property	Old Value	New Value
svn:executable	null	*

llvm/trunk/test/tools/llvm-xray/X86/Inputs/elf64-sample-o2.bin

This is a binary file.

Property	Old Value	New Value
svn:executable	null	*

llvm/trunk/test/tools/llvm-xray/X86/Inputs/naive-log-simple.xray

This is a binary file.

llvm/trunk/test/tools/llvm-xray/X86/Inputs/simple-xray-instrmap.yaml

				---
				- { id: 1, address: 0x000000000041CA40, function: 0x000000000041CA40, kind: function-enter,
				always-instrument: true }
				- { id: 1, address: 0x000000000041CA50, function: 0x000000000041CA40, kind: tail-exit,
				always-instrument: true }
				- { id: 2, address: 0x000000000041CA70, function: 0x000000000041CA70, kind: function-enter,
				always-instrument: true }
				- { id: 2, address: 0x000000000041CA7C, function: 0x000000000041CA70, kind: tail-exit,
				always-instrument: true }
				- { id: 3, address: 0x000000000041CAA0, function: 0x000000000041CAA0, kind: function-enter,
				always-instrument: true }
				- { id: 3, address: 0x000000000041CAB4, function: 0x000000000041CAA0, kind: function-exit,
				always-instrument: true }
				...

llvm/trunk/test/tools/llvm-xray/X86/bad-instrmap-sizes.bin

	; RUN: not llvm-xray extract %S/Inputs/elf64-badentrysizes.bin 2>&1 \| FileCheck %s
	; CHECK: llvm-xray: Cannot extract instrumentation map from '{{.*}}elf64-badentrysizes.bin'.
	; CHECK-NEXT: Instrumentation map entries not evenly divisible by size of an XRay sled entry in ELF64.

llvm/trunk/test/tools/llvm-xray/X86/bad-instrmap-sizes.txt

				; RUN: not llvm-xray extract %S/Inputs/elf64-badentrysizes.bin 2>&1 \| FileCheck %s
				; CHECK: llvm-xray: Cannot extract instrumentation map from '{{.*}}elf64-badentrysizes.bin'.
				; CHECK-NEXT: Instrumentation map entries not evenly divisible by size of an XRay sled entry in ELF64.

llvm/trunk/test/tools/llvm-xray/X86/convert-roundtrip.yaml

				#RUN: llvm-xray convert %s -i=yaml -f=raw -o %t && llvm-xray convert %t -f=yaml -o - \| FileCheck %s
				---
				header:
				version: 1
				type: 0
				constant-tsc: true
				nonstop-tsc: true
				cycle-frequency: 2601000000
				records:
				- { type: 0, func-id: 1, cpu: 1, thread: 111, kind: function-enter,
				tsc: 10001 }
				- { type: 0, func-id: 1, cpu: 1, thread: 111, kind: function-exit,
				tsc: 10100 }
				...

				#CHECK: ---
				#CHECK-NEXT: header:
				#CHECK-NEXT: version: 1
				#CHECK-NEXT: type: 0
				#CHECK-NEXT: constant-tsc: true
				#CHECK-NEXT: nonstop-tsc: true
				#CHECK-NEXT: cycle-frequency: 2601000000
				#CHECK-NEXT: records:
				#CHECK-NEXT: - { type: 0, func-id: 1, function: '1', cpu: 1, thread: 111, kind: function-enter,
				#CHECK-NEXT: tsc: 10001 }
				#CHECK-NEXT: - { type: 0, func-id: 1, function: '1', cpu: 1, thread: 111, kind: function-exit,
				#CHECK-NEXT: tsc: 10100 }
				#CHECK-NEXT: ...

llvm/trunk/test/tools/llvm-xray/X86/convert-to-yaml.txt

				; RUN: llvm-xray convert %S/Inputs/naive-log-simple.xray -f=yaml -o - \| FileCheck %s

				; CHECK: ---
				; CHECK-NEXT: header:
				; CHECK-NEXT: version: 1
				; CHECK-NEXT: type: 0
				; CHECK-NEXT: constant-tsc: true
				; CHECK-NEXT: nonstop-tsc: true
				; CHECK-NEXT: cycle-frequency: 2601000000
				; CHECK-NEXT: records:
				; CHECK-NEXT: - { type: 0, func-id: 3, function: '3', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841453914 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: '2', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841454542 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: '2', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841454670 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: '1', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841454762 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: '1', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841454802 }
				; CHECK-NEXT: - { type: 0, func-id: 3, function: '3', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841494828 }
				; CHECK-NEXT: ...

llvm/trunk/test/tools/llvm-xray/X86/convert-with-debug-syms.txt

				; RUN: llvm-xray convert -m %S/Inputs/elf64-sample-o2.bin -y %S/Inputs/naive-log-simple.xray -f=yaml -o - 2>&1 \| FileCheck %s

				; CHECK: ---
				; CHECK-NEXT: header:
				; CHECK-NEXT: version: 1
				; CHECK-NEXT: type: 0
				; CHECK-NEXT: constant-tsc: true
				; CHECK-NEXT: nonstop-tsc: true
				; CHECK-NEXT: cycle-frequency: 2601000000
				; CHECK-NEXT: records:
				; CHECK-NEXT: - { type: 0, func-id: 3, function: main, cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841453914 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: 'foo()', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841454542 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: 'foo()', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841454670 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: 'bar()', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841454762 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: 'bar()', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841454802 }
				; CHECK-NEXT: - { type: 0, func-id: 3, function: main, cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841494828 }
				; CHECK-NEXT: ...

llvm/trunk/test/tools/llvm-xray/X86/convert-with-standalone-instrmap.txt

				; RUN: llvm-xray convert -m %S/Inputs/elf64-objcopied-instrmap.bin -y %S/Inputs/naive-log-simple.xray -f=yaml -o - 2>&1 \| FileCheck %s

				; CHECK: ---
				; CHECK-NEXT: header:
				; CHECK-NEXT: version: 1
				; CHECK-NEXT: type: 0
				; CHECK-NEXT: constant-tsc: true
				; CHECK-NEXT: nonstop-tsc: true
				; CHECK-NEXT: cycle-frequency: 2601000000
				; CHECK-NEXT: records:
				; CHECK-NEXT: - { type: 0, func-id: 3, function: '@(41caa0)', cpu: 37, thread: 84697,
				; CHECK-NEXT: kind: function-enter, tsc: 3315356841453914 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: '@(41ca70)', cpu: 37, thread: 84697,
				; CHECK-NEXT: kind: function-enter, tsc: 3315356841454542 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: '@(41ca70)', cpu: 37, thread: 84697,
				; CHECK-NEXT: kind: function-exit, tsc: 3315356841454670 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: '@(41ca40)', cpu: 37, thread: 84697,
				; CHECK-NEXT: kind: function-enter, tsc: 3315356841454762 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: '@(41ca40)', cpu: 37, thread: 84697,
				; CHECK-NEXT: kind: function-exit, tsc: 3315356841454802 }
				; CHECK-NEXT: - { type: 0, func-id: 3, function: '@(41caa0)', cpu: 37, thread: 84697,
				; CHECK-NEXT: kind: function-exit, tsc: 3315356841494828 }
				; CHECK-NEXT: ...

llvm/trunk/test/tools/llvm-xray/X86/convert-with-yaml-instrmap.txt

				; RUN: llvm-xray convert -m %S/Inputs/simple-xray-instrmap.yaml -t yaml %S/Inputs/naive-log-simple.xray -f=yaml -o - \| FileCheck %s

				; CHECK: ---
				; CHECK-NEXT: header:
				; CHECK-NEXT: version: 1
				; CHECK-NEXT: type: 0
				; CHECK-NEXT: constant-tsc: true
				; CHECK-NEXT: nonstop-tsc: true
				; CHECK-NEXT: cycle-frequency: 2601000000
				; CHECK-NEXT: records:
				; CHECK-NEXT: - { type: 0, func-id: 3, function: '3', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841453914 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: '2', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841454542 }
				; CHECK-NEXT: - { type: 0, func-id: 2, function: '2', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841454670 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: '1', cpu: 37, thread: 84697, kind: function-enter,
				; CHECK-NEXT: tsc: 3315356841454762 }
				; CHECK-NEXT: - { type: 0, func-id: 1, function: '1', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841454802 }
				; CHECK-NEXT: - { type: 0, func-id: 3, function: '3', cpu: 37, thread: 84697, kind: function-exit,
				; CHECK-NEXT: tsc: 3315356841494828 }
				; CHECK-NEXT: ...

llvm/trunk/tools/llvm-xray/CMakeLists.txt

	set(LLVM_LINK_COMPONENTS			set(LLVM_LINK_COMPONENTS
	${LLVM_TARGETS_TO_BUILD}			${LLVM_TARGETS_TO_BUILD}
				DebugInfoDWARF
				Object
	Support			Support
	Object)			Symbolize)

	set(LLVM_XRAY_TOOLS			set(LLVM_XRAY_TOOLS
				func-id-helper.cc
				xray-converter.cc
	xray-extract.cc			xray-extract.cc
				xray-extract.cc
				xray-log-reader.cc
	xray-registry.cc)			xray-registry.cc)

	add_llvm_tool(llvm-xray llvm-xray.cc ${LLVM_XRAY_TOOLS})			add_llvm_tool(llvm-xray llvm-xray.cc ${LLVM_XRAY_TOOLS})

llvm/trunk/tools/llvm-xray/func-id-helper.h

				//===- func-id-helper.h - XRay Function ID Conversion Helpers -------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Defines helper tools dealing with XRay-generated function ids.
				//
				//===----------------------------------------------------------------------===//
				#ifndef LLVM_TOOLS_LLVM_XRAY_FUNC_ID_HELPER_H
				#define LLVM_TOOLS_LLVM_XRAY_FUNC_ID_HELPER_H

				#include "llvm/DebugInfo/Symbolize/Symbolize.h"
				#include <unordered_map>

				namespace llvm {
				namespace xray {

				// This class consolidates common operations related to Function IDs.
				class FuncIdConversionHelper {
				public:
				using FunctionAddressMap = std::unordered_map<int32_t, uint64_t>;

				private:
				std::string BinaryInstrMap;
				symbolize::LLVMSymbolizer &Symbolizer;
				const FunctionAddressMap &FunctionAddresses;

				public:
				FuncIdConversionHelper(std::string BinaryInstrMap,
				symbolize::LLVMSymbolizer &Symbolizer,
				const FunctionAddressMap &FunctionAddresses)
				: BinaryInstrMap(std::move(BinaryInstrMap)), Symbolizer(Symbolizer),
				FunctionAddresses(FunctionAddresses) {}

				// Returns the symbol or a string representation of the function id.
				std::string SymbolOrNumber(int32_t FuncId) const;

				// Returns the file and column from debug info for the given function id.
				std::string FileLineAndColumn(int32_t FuncId) const;
				};

				} // namespace xray
				} // namespace llvm

				#endif // LLVM_TOOLS_LLVM_XRAY_FUNC_ID_HELPER_H

llvm/trunk/tools/llvm-xray/func-id-helper.cc

				//===- xray-fc-account.cc - XRay Function Call Accounting Tool ------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Implementation of the helper tools dealing with XRay-generated function ids.
				//
				//===----------------------------------------------------------------------===//

				#include "func-id-helper.h"
				#include "llvm/Support/Path.h"
				#include <sstream>

				using namespace llvm;
				using namespace xray;

				std::string FuncIdConversionHelper::SymbolOrNumber(int32_t FuncId) const {
				std::ostringstream F;
				auto It = FunctionAddresses.find(FuncId);
				if (It == FunctionAddresses.end()) {
				F << "#" << FuncId;
				return F.str();
				}

				if (auto ResOrErr = Symbolizer.symbolizeCode(BinaryInstrMap, It->second)) {
				auto &DI = *ResOrErr;
				if (DI.FunctionName == "<invalid>")
				F << "@(" << std::hex << It->second << ")";
				else
				F << DI.FunctionName;
				} else
				handleAllErrors(ResOrErr.takeError(), [&](const ErrorInfoBase &) {
				F << "@(" << std::hex << It->second << ")";
				});

				return F.str();
				}

				std::string FuncIdConversionHelper::FileLineAndColumn(int32_t FuncId) const {
				auto It = FunctionAddresses.find(FuncId);
				if (It == FunctionAddresses.end())
				return "(unknown)";

				std::ostringstream F;
				auto ResOrErr = Symbolizer.symbolizeCode(BinaryInstrMap, It->second);
				if (!ResOrErr) {
				consumeError(ResOrErr.takeError());
				return "(unknown)";
				}

				auto &DI = *ResOrErr;
				F << sys::path::filename(DI.FileName).str() << ":" << DI.Line << ":"
				<< DI.Column;

				return F.str();
				}

llvm/trunk/tools/llvm-xray/xray-converter.h

				//===- xray-converter.h - XRay Trace Conversion ---------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Defines the TraceConverter class for turning binary traces into
				// human-readable text and vice versa.
				//
				//===----------------------------------------------------------------------===//
				#ifndef LLVM_TOOLS_LLVM_XRAY_XRAY_CONVERTER_H
				#define LLVM_TOOLS_LLVM_XRAY_XRAY_CONVERTER_H

				#include "func-id-helper.h"
				#include "xray-log-reader.h"
				#include "xray-record.h"

				namespace llvm {
				namespace xray {

				class TraceConverter {
				FuncIdConversionHelper &FuncIdHelper;
				bool Symbolize;

				public:
				TraceConverter(FuncIdConversionHelper &FuncIdHelper, bool Symbolize = false)
				: FuncIdHelper(FuncIdHelper), Symbolize(Symbolize) {}

				void exportAsYAML(const LogReader &Records, raw_ostream &OS);
				void exportAsRAWv1(const LogReader &Records, raw_ostream &OS);
				};

				} // namespace xray
				} // namespace llvm

				#endif // LLVM_TOOLS_LLVM_XRAY_XRAY_CONVERTER_H

llvm/trunk/tools/llvm-xray/xray-converter.cc

				//===- xray-converter.cc - XRay Trace Conversion --------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Implements the trace conversion functions.
				//
				//===----------------------------------------------------------------------===//
				#include "xray-converter.h"

				#include "xray-extract.h"
				#include "xray-record-yaml.h"
				#include "xray-registry.h"
				#include "llvm/DebugInfo/Symbolize/Symbolize.h"
				#include "llvm/Support/EndianStream.h"
				#include "llvm/Support/FileSystem.h"
				#include "llvm/Support/YAMLTraits.h"
				#include "llvm/Support/raw_ostream.h"
				#include <unistd.h>

				using namespace llvm;
				using namespace xray;

				// llvm-xray convert
				// ----------------------------------------------------------------------------
				static cl::SubCommand Convert("convert", "Trace Format Conversion");
				static cl::opt<std::string> ConvertInput(cl::Positional,
				cl::desc("<xray log file>"),
				cl::Required, cl::sub(Convert));
				enum class ConvertFormats { BINARY, YAML };
				static cl::opt<ConvertFormats> ConvertInputFormat(
				"input-format", cl::desc("input format"),
				cl::values(clEnumValN(ConvertFormats::BINARY, "raw",
				"input is in raw binary"),
				clEnumValN(ConvertFormats::YAML, "yaml", "input is in yaml")),
				cl::sub(Convert));
				static cl::alias ConvertInputFormat2("i", cl::aliasopt(ConvertInputFormat),
				cl::desc("Alias for -input-format"),
				cl::sub(Convert));
				static cl::opt<ConvertFormats> ConvertOutputFormat(
				"output-format", cl::desc("output format"),
				cl::values(clEnumValN(ConvertFormats::BINARY, "raw", "output in binary"),
				clEnumValN(ConvertFormats::YAML, "yaml", "output in yaml")),
				cl::sub(Convert));
				static cl::alias ConvertOutputFormat2("f", cl::aliasopt(ConvertOutputFormat),
				cl::desc("Alias for -output-format"),
				cl::sub(Convert));
				static cl::opt<std::string>
				ConvertOutput("output", cl::value_desc("output file"), cl::init("-"),
				cl::desc("output file; use '-' for stdout"),
				cl::sub(Convert));
				static cl::alias ConvertOutput2("o", cl::aliasopt(ConvertOutput),
				cl::desc("Alias for -output"),
				cl::sub(Convert));

				static cl::opt<bool>
				ConvertSymbolize("symbolize",
				cl::desc("symbolize function ids from the input log"),
				cl::init(false), cl::sub(Convert));
				static cl::alias ConvertSymbolize2("y", cl::aliasopt(ConvertSymbolize),
				cl::desc("Alias for -symbolize"),
				cl::sub(Convert));

				static cl::opt<std::string>
				ConvertInstrMap("instr_map",
				cl::desc("binary with the instrumentation map, or "
				"a separate instrumentation map"),
				cl::value_desc("binary with xray_instr_map"),
				cl::sub(Convert), cl::init(""));
				static cl::alias ConvertInstrMap2("m", cl::aliasopt(ConvertInstrMap),
				cl::desc("Alias for -instr_map"),
				cl::sub(Convert));
				static cl::opt<bool> ConvertSortInput(
				"sort",
				cl::desc("determines whether to sort input log records by timestamp"),
				cl::sub(Convert), cl::init(true));
				static cl::alias ConvertSortInput2("s", cl::aliasopt(ConvertSortInput),
				cl::desc("Alias for -sort"),
				cl::sub(Convert));
				static cl::opt<InstrumentationMapExtractor::InputFormats> InstrMapFormat(
				"instr-map-format", cl::desc("format of instrumentation map"),
				cl::values(clEnumValN(InstrumentationMapExtractor::InputFormats::ELF, "elf",
				"instrumentation map in an ELF header"),
				clEnumValN(InstrumentationMapExtractor::InputFormats::YAML,
				"yaml", "instrumentation map in YAML")),
				cl::sub(Convert), cl::init(InstrumentationMapExtractor::InputFormats::ELF));
				static cl::alias InstrMapFormat2("t", cl::aliasopt(InstrMapFormat),
				cl::desc("Alias for -instr-map-format"),
				cl::sub(Convert));

				using llvm::yaml::MappingTraits;
				using llvm::yaml::ScalarEnumerationTraits;
				using llvm::yaml::IO;
				using llvm::yaml::Output;

				void TraceConverter::exportAsYAML(const LogReader &Records, raw_ostream &OS) {
				YAMLXRayTrace Trace;
				const auto &FH = Records.getFileHeader();
				Trace.Header = {FH.Version, FH.Type, FH.ConstantTSC, FH.NonstopTSC,
				FH.CycleFrequency};
				Trace.Records.reserve(Records.size());
				for (const auto &R : Records) {
				Trace.Records.push_back({R.RecordType, R.CPU, R.Type, R.FuncId,
				Symbolize ? FuncIdHelper.SymbolOrNumber(R.FuncId)
				: std::to_string(R.FuncId),
				R.TSC, R.TId});
				}
				Output Out(OS);
				Out << Trace;
				}

				void TraceConverter::exportAsRAWv1(const LogReader &Records, raw_ostream &OS) {
				// First write out the file header, in the correct endian-appropriate format
				// (XRay assumes currently little endian).
				support::endian::Writer<support::endianness::little> Writer(OS);
				const auto &FH = Records.getFileHeader();
				Writer.write(FH.Version);
				Writer.write(FH.Type);
				uint32_t Bitfield{0};
				if (FH.ConstantTSC)
				Bitfield \|= 1uL;
				if (FH.NonstopTSC)
				Bitfield \|= 1uL << 1;
				Writer.write(Bitfield);
				Writer.write(FH.CycleFrequency);

				// There's 16 bytes of padding at the end of the file header.
				static constexpr uint32_t Padding4B = 0;
				Writer.write(Padding4B);
				Writer.write(Padding4B);
				Writer.write(Padding4B);
				Writer.write(Padding4B);

				// Then write out the rest of the records, still in an endian-appropriate
				// format.
				for (const auto &R : Records) {
				Writer.write(R.RecordType);
				Writer.write(R.CPU);
				switch (R.Type) {
				case RecordTypes::ENTER:
				Writer.write(uint8_t{0});
				break;
				case RecordTypes::EXIT:
				Writer.write(uint8_t{1});
				break;
				}
				Writer.write(R.FuncId);
				Writer.write(R.TSC);
				Writer.write(R.TId);
				Writer.write(Padding4B);
				Writer.write(Padding4B);
				Writer.write(Padding4B);
				}
				}

				namespace llvm {
				namespace xray {

				static CommandRegistration Unused(&Convert, []() -> Error {
				// FIXME: Support conversion to BINARY when upgrading XRay trace versions.
				int Fd;
				auto EC = sys::fs::openFileForRead(ConvertInput, Fd);
				if (EC)
				return make_error<StringError>(
				Twine("Cannot open file '") + ConvertInput + "'", EC);

				Error Err = Error::success();
				xray::InstrumentationMapExtractor Extractor(ConvertInstrMap, InstrMapFormat,
				Err);
				handleAllErrors(std::move(Err),
				[&](const ErrorInfoBase &E) { E.log(errs()); });

				const auto &FunctionAddresses = Extractor.getFunctionAddresses();
				symbolize::LLVMSymbolizer::Options Opts(
				symbolize::FunctionNameKind::LinkageName, true, true, false, "");
				symbolize::LLVMSymbolizer Symbolizer(Opts);
				llvm::xray::FuncIdConversionHelper FuncIdHelper(ConvertInstrMap, Symbolizer,
				FunctionAddresses);
				llvm::xray::TraceConverter TC(FuncIdHelper, ConvertSymbolize);
				LogReader::LoaderFunction Loader;
				switch (ConvertInputFormat) {
				case ConvertFormats::BINARY:
				Loader = NaiveLogLoader;
				break;
				case ConvertFormats::YAML:
				Loader = YAMLLogLoader;
				break;
				}

				LogReader Reader(ConvertInput, Err, ConvertSortInput, Loader);
				if (Err)
				return joinErrors(
				make_error<StringError>(
				Twine("Failed loading input file '") + ConvertInput + "'.",
				std::make_error_code(std::errc::protocol_error)),
				std::move(Err));

				raw_fd_ostream OS(ConvertOutput, EC,
				ConvertOutputFormat == ConvertFormats::BINARY
				? sys::fs::OpenFlags::F_None
				: sys::fs::OpenFlags::F_Text);
				if (EC)
				return make_error<StringError>(
				Twine("Cannot open file '") + ConvertOutput + "' for writing.", EC);

				switch (ConvertOutputFormat) {
				case ConvertFormats::YAML:
				TC.exportAsYAML(Reader, OS);
				break;
				case ConvertFormats::BINARY:
				TC.exportAsRAWv1(Reader, OS);
				break;
				}
				return Error::success();
				});

				} // namespace xray
				} // namespace llvm

llvm/trunk/tools/llvm-xray/xray-extract.cc

Show First 20 Lines • Show All 156 Lines • ▼ Show 20 Lines	case 2: // TAIL
Entry.Kind = SledEntry::FunctionKinds::TAIL;		Entry.Kind = SledEntry::FunctionKinds::TAIL;
break;		break;
default:		default:
return make_error<StringError>(		return make_error<StringError>(
Twine("Encountered unknown sled type ") + "'" + Twine(int32_t{Kind}) +		Twine("Encountered unknown sled type ") + "'" + Twine(int32_t{Kind}) +
"'.",		"'.",
std::make_error_code(std::errc::executable_format_error));		std::make_error_code(std::errc::executable_format_error));
}		}
auto AlwaysInstrument = Extractor.getU8(&OffsetPtr);		Entry.AlwaysInstrument = Extractor.getU8(&OffsetPtr) != 0;
Entry.AlwaysInstrument = AlwaysInstrument != 0;

// We replicate the function id generation scheme implemented in the runtime		// We replicate the function id generation scheme implemented in the runtime
// here. Ideally we should be able to break it out, or output this map from		// here. Ideally we should be able to break it out, or output this map from
// the runtime, but that's a design point we can discuss later on. For now,		// the runtime, but that's a design point we can discuss later on. For now,
// we replicate the logic and move on.		// we replicate the logic and move on.
if (CurFn == 0) {		if (CurFn == 0) {
CurFn = Entry.Function;		CurFn = Entry.Function;
InstrMap[FuncId] = Entry.Function;		InstrMap[FuncId] = Entry.Function;
FunctionIds[Entry.Function] = FuncId;		FunctionIds[Entry.Function] = FuncId;
}		}
if (Entry.Function != CurFn) {		if (Entry.Function != CurFn) {
++FuncId;		++FuncId;
CurFn = Entry.Function;		CurFn = Entry.Function;
InstrMap[FuncId] = Entry.Function;		InstrMap[FuncId] = Entry.Function;
FunctionIds[Entry.Function] = FuncId;		FunctionIds[Entry.Function] = FuncId;
}		}
}		}
OutputSleds = std::move(Sleds);		OutputSleds = std::move(Sleds);
return llvm::Error::success();		return llvm::Error::success();
}		}

		Error LoadYAMLInstrMap(
		StringRef Filename, std::deque<SledEntry> &Sleds,
		InstrumentationMapExtractor::FunctionAddressMap &InstrMap,
		InstrumentationMapExtractor::FunctionAddressReverseMap &FunctionIds) {
		int Fd;
		if (auto EC = sys::fs::openFileForRead(Filename, Fd))
		return make_error<StringError>(
		Twine("Failed opening file '") + Filename + "' for reading.", EC);

		uint64_t FileSize;
		if (auto EC = sys::fs::file_size(Filename, FileSize))
		return make_error<StringError>(
		Twine("Failed getting size of file '") + Filename + "'.", EC);

		std::error_code EC;
		sys::fs::mapped_file_region MappedFile(
		Fd, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);
		if (EC)
		return make_error<StringError>(
		Twine("Failed memory-mapping file '") + Filename + "'.", EC);

		std::vector<YAMLXRaySledEntry> YAMLSleds;
		Input In(StringRef(MappedFile.data(), MappedFile.size()));
		In >> YAMLSleds;
		if (In.error())
		return make_error<StringError>(
		Twine("Failed loading YAML document from '") + Filename + "'.",
		In.error());

		for (const auto &Y : YAMLSleds) {
		InstrMap[Y.FuncId] = Y.Function;
		FunctionIds[Y.Function] = Y.FuncId;
		Sleds.push_back(
		SledEntry{Y.Address, Y.Function, Y.Kind, Y.AlwaysInstrument});
		}
		return Error::success();
		}

} // namespace		} // namespace

InstrumentationMapExtractor::InstrumentationMapExtractor(std::string Filename,		InstrumentationMapExtractor::InstrumentationMapExtractor(std::string Filename,
InputFormats Format,		InputFormats Format,
Error &EC) {		Error &EC) {
ErrorAsOutParameter ErrAsOutputParam(&EC);		ErrorAsOutParameter ErrAsOutputParam(&EC);
		if (Filename.empty()) {
		EC = Error::success();
		return;
		}
switch (Format) {		switch (Format) {
case InputFormats::ELF: {		case InputFormats::ELF: {
EC = handleErrors(		EC = handleErrors(
LoadBinaryInstrELF(Filename, Sleds, FunctionAddresses, FunctionIds),		LoadBinaryInstrELF(Filename, Sleds, FunctionAddresses, FunctionIds),
[](std::unique_ptr<ErrorInfoBase> E) {		[&](std::unique_ptr<ErrorInfoBase> E) {
return joinErrors(		return joinErrors(
make_error<StringError>(		make_error<StringError>(
Twine("Cannot extract instrumentation map from '") +		Twine("Cannot extract instrumentation map from '") +
ExtractInput + "'.",		Filename + "'.",
std::make_error_code(std::errc::executable_format_error)),		std::make_error_code(std::errc::executable_format_error)),
std::move(E));		std::move(E));
});		});
break;		break;
}		}
default:		case InputFormats::YAML: {
llvm_unreachable("Input format type not supported yet.");		EC = handleErrors(
		LoadYAMLInstrMap(Filename, Sleds, FunctionAddresses, FunctionIds),
		[&](std::unique_ptr<ErrorInfoBase> E) {
		return joinErrors(
		make_error<StringError>(
		Twine("Cannot load YAML instrumentation map from '") +
		Filename + "'.",
		std::make_error_code(std::errc::wrong_protocol_type)),
		std::move(E));
		});
break;		break;
}		}
}		}
		}

void InstrumentationMapExtractor::exportAsYAML(raw_ostream &OS) {		void InstrumentationMapExtractor::exportAsYAML(raw_ostream &OS) {
// First we translate the sleds into the YAMLXRaySledEntry objects in a deque.		// First we translate the sleds into the YAMLXRaySledEntry objects in a deque.
std::vector<YAMLXRaySledEntry> YAMLSleds;		std::vector<YAMLXRaySledEntry> YAMLSleds;
YAMLSleds.reserve(Sleds.size());		YAMLSleds.reserve(Sleds.size());
for (const auto &Sled : Sleds) {		for (const auto &Sled : Sleds) {
YAMLSleds.push_back({FunctionIds[Sled.Function], Sled.Address,		YAMLSleds.push_back({FunctionIds[Sled.Function], Sled.Address,
Sled.Function, Sled.Kind, Sled.AlwaysInstrument});		Sled.Function, Sled.Kind, Sled.AlwaysInstrument});
Show All 20 Lines

llvm/trunk/tools/llvm-xray/xray-log-reader.h

				//===- xray-log-reader.h - XRay Log Reader Interface ----------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Define the interface for an XRay log reader. Currently we only support one
				// version of the log (naive log) with fixed-sized records.
				//
				//===----------------------------------------------------------------------===//
				#ifndef LLVM_TOOLS_LLVM_XRAY_XRAY_LOG_READER_H
				#define LLVM_TOOLS_LLVM_XRAY_XRAY_LOG_READER_H

				#include <cstdint>
				#include <deque>
				#include <vector>

				#include "xray-record-yaml.h"
				#include "xray-record.h"
				#include "llvm/Support/Error.h"
				#include "llvm/Support/FileSystem.h"

				namespace llvm {
				namespace xray {

				class LogReader {
				XRayFileHeader FileHeader;
				std::vector<XRayRecord> Records;

				typedef std::vector<XRayRecord>::const_iterator citerator;

				public:
				typedef std::function<Error(StringRef, XRayFileHeader &,
				std::vector<XRayRecord> &)>
				LoaderFunction;

				LogReader(StringRef Filename, Error &Err, bool Sort, LoaderFunction Loader);

				const XRayFileHeader &getFileHeader() const { return FileHeader; }

				citerator begin() const { return Records.begin(); }
				citerator end() const { return Records.end(); }
				size_t size() const { return Records.size(); }
				};

				Error NaiveLogLoader(StringRef Data, XRayFileHeader &FileHeader,
				std::vector<XRayRecord> &Records);
				Error YAMLLogLoader(StringRef Data, XRayFileHeader &FileHeader,
				std::vector<XRayRecord> &Records);

				} // namespace xray
				} // namespace llvm

				#endif // LLVM_TOOLS_LLVM_XRAY_XRAY_LOG_READER_H

llvm/trunk/tools/llvm-xray/xray-log-reader.cc

				//===- xray-log-reader.cc - XRay Log Reader Implementation ----------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// XRay log reader implementation.
				//
				//===----------------------------------------------------------------------===//
				#include <sys/types.h>

				#include "xray-log-reader.h"
				#include "xray-record-yaml.h"
				#include "llvm/Support/DataExtractor.h"
				#include "llvm/Support/FileSystem.h"

				using namespace llvm;
				using namespace llvm::xray;
				using llvm::yaml::Input;

				LogReader::LogReader(
				StringRef Filename, Error &Err, bool Sort,
				std::function<Error(StringRef, XRayFileHeader &, std::vector<XRayRecord> &)>
				Loader) {
				ErrorAsOutParameter Guard(&Err);
				int Fd;
				if (auto EC = sys::fs::openFileForRead(Filename, Fd)) {
				Err = make_error<StringError>(
				Twine("Cannot read log from '") + Filename + "'", EC);
				return;
				}
				uint64_t FileSize;
				if (auto EC = sys::fs::file_size(Filename, FileSize)) {
				Err = make_error<StringError>(
				Twine("Cannot read log from '") + Filename + "'", EC);
				return;
				}

				std::error_code EC;
				sys::fs::mapped_file_region MappedFile(
				Fd, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);
				if (EC) {
				Err = make_error<StringError>(
				Twine("Cannot read log from '") + Filename + "'", EC);
				return;
				}

				if (auto E = Loader(StringRef(MappedFile.data(), MappedFile.size()),
				FileHeader, Records)) {
				Err = std::move(E);
				return;
				}

				if (Sort)
				std::sort(
				Records.begin(), Records.end(),
				[](const XRayRecord &L, const XRayRecord &R) { return L.TSC < R.TSC; });
				}

				Error llvm::xray::NaiveLogLoader(StringRef Data, XRayFileHeader &FileHeader,
				std::vector<XRayRecord> &Records) {
				// FIXME: Maybe deduce whether the data is little or big-endian using some
				// magic bytes in the beginning of the file?

				// First 32 bytes of the file will always be the header. We assume a certain
				// format here:
				//
				// (2) uint16 : version
				// (2) uint16 : type
				// (4) uint32 : bitfield
				// (8) uint64 : cycle frequency
				// (16) - : padding
				//
				if (Data.size() < 32)
				return make_error<StringError>(
				"Not enough bytes for an XRay log.",
				std::make_error_code(std::errc::invalid_argument));

				if (Data.size() - 32 == 0 \|\| Data.size() % 32 != 0)
				return make_error<StringError>(
				"Invalid-sized XRay data.",
				std::make_error_code(std::errc::invalid_argument));

				DataExtractor HeaderExtractor(Data, true, 8);
				uint32_t OffsetPtr = 0;
				FileHeader.Version = HeaderExtractor.getU16(&OffsetPtr);
				FileHeader.Type = HeaderExtractor.getU16(&OffsetPtr);
				uint32_t Bitfield = HeaderExtractor.getU32(&OffsetPtr);
				FileHeader.ConstantTSC = Bitfield & 1uL;
				FileHeader.NonstopTSC = Bitfield & 1uL << 1;
				FileHeader.CycleFrequency = HeaderExtractor.getU64(&OffsetPtr);

				if (FileHeader.Version != 1)
				return make_error<StringError>(
				Twine("Unsupported XRay file version: ") + Twine(FileHeader.Version),
				std::make_error_code(std::errc::invalid_argument));

				// Each record after the header will be 32 bytes, in the following format:
				//
				// (2) uint16 : record type
				// (1) uint8 : cpu id
				// (1) uint8 : type
				// (4) sint32 : function id
				// (8) uint64 : tsc
				// (4) uint32 : thread id
				// (12) - : padding
				for (auto S = Data.drop_front(32); !S.empty(); S = S.drop_front(32)) {
				DataExtractor RecordExtractor(S, true, 8);
				uint32_t OffsetPtr = 0;
				Records.emplace_back();
				auto &Record = Records.back();
				Record.RecordType = RecordExtractor.getU16(&OffsetPtr);
				Record.CPU = RecordExtractor.getU8(&OffsetPtr);
				auto Type = RecordExtractor.getU8(&OffsetPtr);
				switch (Type) {
				case 0:
				Record.Type = RecordTypes::ENTER;
				break;
				case 1:
				Record.Type = RecordTypes::EXIT;
				break;
				default:
				return make_error<StringError>(
				Twine("Unknown record type '") + Twine(int{Type}) + "'",
				std::make_error_code(std::errc::protocol_error));
				}
				Record.FuncId = RecordExtractor.getSigned(&OffsetPtr, sizeof(int32_t));
				Record.TSC = RecordExtractor.getU64(&OffsetPtr);
				Record.TId = RecordExtractor.getU32(&OffsetPtr);
				}
				return Error::success();
				}

				Error llvm::xray::YAMLLogLoader(StringRef Data, XRayFileHeader &FileHeader,
				std::vector<XRayRecord> &Records) {

				// Load the documents from the MappedFile.
				YAMLXRayTrace Trace;
				Input In(Data);
				In >> Trace;
				if (In.error())
				return make_error<StringError>("Failed loading YAML Data.", In.error());

				FileHeader.Version = Trace.Header.Version;
				FileHeader.Type = Trace.Header.Type;
				FileHeader.ConstantTSC = Trace.Header.ConstantTSC;
				FileHeader.NonstopTSC = Trace.Header.NonstopTSC;
				FileHeader.CycleFrequency = Trace.Header.CycleFrequency;

				if (FileHeader.Version != 1)
				return make_error<StringError>(
				Twine("Unsupported XRay file version: ") + Twine(FileHeader.Version),
				std::make_error_code(std::errc::invalid_argument));

				Records.clear();
				std::transform(Trace.Records.begin(), Trace.Records.end(),
				std::back_inserter(Records), [&](const YAMLXRayRecord &R) {
				return XRayRecord{R.RecordType, R.CPU, R.Type,
				R.FuncId, R.TSC, R.TId};
				});
				return Error::success();
				}

llvm/trunk/tools/llvm-xray/xray-record-yaml.h

				//===- xray-record-yaml.h - XRay Record YAML Support Definitions ----------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// Types and traits specialisations for YAML I/O of XRay log entries.
				//
				//===----------------------------------------------------------------------===//
				#ifndef LLVM_TOOLS_LLVM_XRAY_XRAY_RECORD_YAML_H
				#define LLVM_TOOLS_LLVM_XRAY_XRAY_RECORD_YAML_H

				#include <type_traits>

				#include "xray-record.h"
				#include "llvm/Support/YAMLTraits.h"

				namespace llvm {
				namespace xray {

				struct YAMLXRayFileHeader {
				uint16_t Version;
				uint16_t Type;
				bool ConstantTSC;
				bool NonstopTSC;
				uint64_t CycleFrequency;
				};

				struct YAMLXRayRecord {
				uint16_t RecordType;
				uint8_t CPU;
				RecordTypes Type;
				int32_t FuncId;
				std::string Function;
				uint64_t TSC;
				uint32_t TId;
				};

				struct YAMLXRayTrace {
				YAMLXRayFileHeader Header;
				std::vector<YAMLXRayRecord> Records;
				};

				using XRayRecordStorage =
				std::aligned_storage<sizeof(XRayRecord), alignof(XRayRecord)>::type;

				} // namespace xray

				namespace yaml {

				// YAML Traits
				// -----------
				template <> struct ScalarEnumerationTraits<xray::RecordTypes> {
				static void enumeration(IO &IO, xray::RecordTypes &Type) {
				IO.enumCase(Type, "function-enter", xray::RecordTypes::ENTER);
				IO.enumCase(Type, "function-exit", xray::RecordTypes::EXIT);
				}
				};

				template <> struct MappingTraits<xray::YAMLXRayFileHeader> {
				static void mapping(IO &IO, xray::YAMLXRayFileHeader &Header) {
				IO.mapRequired("version", Header.Version);
				IO.mapRequired("type", Header.Type);
				IO.mapRequired("constant-tsc", Header.ConstantTSC);
				IO.mapRequired("nonstop-tsc", Header.NonstopTSC);
				IO.mapRequired("cycle-frequency", Header.CycleFrequency);
				}
				};

				template <> struct MappingTraits<xray::YAMLXRayRecord> {
				static void mapping(IO &IO, xray::YAMLXRayRecord &Record) {
				// FIXME: Make this type actually be descriptive
				IO.mapRequired("type", Record.RecordType);
				IO.mapRequired("func-id", Record.FuncId);
				IO.mapOptional("function", Record.Function);
				IO.mapRequired("cpu", Record.CPU);
				IO.mapRequired("thread", Record.TId);
				IO.mapRequired("kind", Record.Type);
				IO.mapRequired("tsc", Record.TSC);
				}

				static constexpr bool flow = true;
				};

				template <> struct MappingTraits<xray::YAMLXRayTrace> {
				static void mapping(IO &IO, xray::YAMLXRayTrace &Trace) {
				// A trace file contains two parts, the header and the list of all the
				// trace records.
				IO.mapRequired("header", Trace.Header);
				IO.mapRequired("records", Trace.Records);
				}
				};

				} // namespace yaml
				} // namespace llvm

				LLVM_YAML_IS_SEQUENCE_VECTOR(xray::YAMLXRayRecord)

				#endif // LLVM_TOOLS_LLVM_XRAY_XRAY_RECORD_YAML_H

llvm/trunk/tools/llvm-xray/xray-record.h

				//===- xray-record.h - XRay Trace Record ----------------------------------===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file replicates the record definition for XRay log entries. This should
				// follow the evolution of the log record versions supported in the compiler-rt
				// xray project.
				//
				//===----------------------------------------------------------------------===//
				#ifndef LLVM_TOOLS_LLVM_XRAY_XRAY_RECORD_H
				#define LLVM_TOOLS_LLVM_XRAY_XRAY_RECORD_H

				#include <cstdint>

				namespace llvm {
				namespace xray {

				struct XRayFileHeader {
				uint16_t Version = 0;
				uint16_t Type = 0;
				bool ConstantTSC;
				bool NonstopTSC;
				uint64_t CycleFrequency = 0;
				};

				enum class RecordTypes { ENTER, EXIT };

				struct XRayRecord {
				uint16_t RecordType;

				// The CPU where the thread is running. We assume number of CPUs <= 256.
				uint8_t CPU;

				// Identifies the type of record.
				RecordTypes Type;

				// The function ID for the record.
				int32_t FuncId;

				// Get the full 8 bytes of the TSC when we get the log record.
				uint64_t TSC;

				// The thread ID for the currently running thread.
				uint32_t TId;
				};

				} // namespace xray
				} // namespace llvm

				#endif // LLVM_TOOLS_LLVM_XRAY_XRAY_RECORD_H

This is an archive of the discontinued LLVM Phabricator instance.

[XRay] Implement `llvm-xray convert` -- trace file conversionClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 83765

llvm/trunk/test/tools/llvm-xray/X86/Inputs/elf64-objcopied-instrmap.bin

llvm/trunk/test/tools/llvm-xray/X86/Inputs/elf64-sample-o2.bin

llvm/trunk/test/tools/llvm-xray/X86/Inputs/naive-log-simple.xray

llvm/trunk/test/tools/llvm-xray/X86/Inputs/simple-xray-instrmap.yaml

llvm/trunk/test/tools/llvm-xray/X86/bad-instrmap-sizes.bin

llvm/trunk/test/tools/llvm-xray/X86/bad-instrmap-sizes.txt

llvm/trunk/test/tools/llvm-xray/X86/convert-roundtrip.yaml

llvm/trunk/test/tools/llvm-xray/X86/convert-to-yaml.txt

llvm/trunk/test/tools/llvm-xray/X86/convert-with-debug-syms.txt

llvm/trunk/test/tools/llvm-xray/X86/convert-with-standalone-instrmap.txt

llvm/trunk/test/tools/llvm-xray/X86/convert-with-yaml-instrmap.txt

llvm/trunk/tools/llvm-xray/CMakeLists.txt

llvm/trunk/tools/llvm-xray/func-id-helper.h

llvm/trunk/tools/llvm-xray/func-id-helper.cc

llvm/trunk/tools/llvm-xray/xray-converter.h

llvm/trunk/tools/llvm-xray/xray-converter.cc

llvm/trunk/tools/llvm-xray/xray-extract.cc

llvm/trunk/tools/llvm-xray/xray-log-reader.h

llvm/trunk/tools/llvm-xray/xray-log-reader.cc

llvm/trunk/tools/llvm-xray/xray-record-yaml.h

llvm/trunk/tools/llvm-xray/xray-record.h

[XRay] Implement `llvm-xray convert` -- trace file conversion
ClosedPublic