This is an archive of the discontinued LLVM Phabricator instance.

Differential D23931

[XRay] ARM 32-bit no-Thumb support in LLVM
Closed · Public

Authored by rSerge on Aug 26 2016, 9:58 AM.


Details

Reviewers

dberris
rengolin
t.p.northover
zatrazz
asl

Commits

rG4640154446cb: [XRay] ARM 32-bit no-Thumb support in LLVM
rG17d94e279e43: [XRay] ARM 32-bit no-Thumb support in LLVM
rL281878: [XRay] ARM 32-bit no-Thumb support in LLVM
rL280888: [XRay] ARM 32-bit no-Thumb support in LLVM

Summary

This is a port of XRay to ARM 32-bit, without Thumb support yet. The XRay instrumentation support is moving up to AsmPrinter.
This is one of 3 commits to different repositories of XRay ARM port. The other 2 are:

https://reviews.llvm.org/D23932 (Clang test)
https://reviews.llvm.org/D23933 (compiler-rt)

Diff Detail

Repository: rL LLVM

Event Timeline

rSerge updated this revision to Diff 69390. Aug 26 2016, 9:58 AM

rSerge retitled this revision from to [XRay] ARM 32-bit no-Thumb support.

rSerge updated this object.

rSerge added reviewers: dberris, rengolin, asl, t.p.northover.

rSerge added a subscriber: llvm-commits.

Herald added subscribers: dberris, samparker, rengolin, aemerson. Aug 26 2016, 9:58 AM

rSerge added a parent revision: D19904: XRay: Add entry and exit sleds. Aug 26 2016, 9:59 AM

rSerge retitled this revision from [XRay] ARM 32-bit no-Thumb support to [XRay] ARM 32-bit no-Thumb support in LLVM.

rSerge added a child revision: D23932: [XRay] ARM 32-bit no-Thumb support in Clang. Aug 26 2016, 10:12 AM

rSerge added a child revision: D23933: [XRay] ARM 32-bit no-Thumb support in compiler-rt. Aug 26 2016, 10:21 AM

rSerge updated this object.

rengolin added a reviewer: zatrazz. Aug 26 2016, 11:17 AM

iid_iunknown added a subscriber: iid_iunknown. Aug 26 2016, 11:31 AM

dberris requested changes to this revision. Aug 28 2016, 5:51 PM

dberris edited edge metadata.

dberris added inline comments.

include/llvm/CodeGen/AsmPrinter.h
209 ↗	(On Diff #69390)	Do you need to spell out 'class' here? Wouldn't `const Function*` suffice?
include/llvm/Target/Target.td
969 ↗	(On Diff #69390)	AFAICT, yes, this is correct. The expectation is that this instruction should only ever show up in the assembler as a pseudo instruction (unless this is doing something else).
970 ↗	(On Diff #69390)	This one is a little harder. At least in x86, we weren't able to get this to work this way, because stack adjustments may happen later than the insertion of the marker instruction. Unless you can control exactly when this instruction is inserted and that the stack adjustment code doesn't ever move this (or add things after this instruction) then you might want to go do the same thing that we're doing in X86.
include/llvm/Target/TargetOpcodes.def
161 ↗	(On Diff #69390)	Is there any reason to do this instead of following the same convention used in x86 of having the nops be after the return instruction?
lib/CodeGen/XRayInstrumentation.cpp
46–53 ↗	(On Diff #69390)	This is a great explanation. Can you say something similar in the description just so it's clear why there's a difference in the approach?
lib/Target/X86/X86AsmPrinter.h
74–94 ↗	(On Diff #69390)	I think it's worth noting in the description that we're moving the XRay instrumentation support up to AsmPrinter too.

This revision now requires changes to proceed. Aug 28 2016, 5:51 PM

rSerge added inline comments. Aug 30 2016, 7:19 AM

include/llvm/CodeGen/AsmPrinter.h
209 ↗	(On Diff #69390)	I just moved (copy-pasted) this from X86AsmPrinter.h. Without `class` it does not compile because XRayFunctionEntry already has a member with the same name: `const MCSymbol *Function`.
include/llvm/Target/Target.td
970 ↗	(On Diff #69390)	The same thing as in x86_64 is not possible for ARM because it has multiple return instructions. Furthermore, CPU allows parametrized and even conditional return instructions. In the current ARM implementation we are making use of the fact that currently LLVM doesn't seem to generate conditional return instructions. On ARM, the same instruction can be used for popping multiple registers from the stack and returning (it just pops `pc` register too), and LLVM generates it sometimes. So we can't insert the sled between this stack adjustment and the return without splitting the original instruction into 2 instructions. So on ARM, rather than jumping into the exit trampoline, we call it, it does the tracing, preserves the stack and returns.
include/llvm/Target/TargetOpcodes.def
161 ↗	(On Diff #69390)	Yes, as I've explained above, the problem is that ARM has multiple return instructions, so we have to preserve the original return instruction and call the exit tracing trampoline instead of jumping into it. I'm adding a comment in the code too.
lib/CodeGen/XRayInstrumentation.cpp
46–53 ↗	(On Diff #69390)	I'm adding it to llvm\include\llvm\Target\TargetOpcodes.def .
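
The approach described in the comments above (keep the original return instruction and put the patchable exit sled in front of it) can be sketched outside of LLVM as follows. This is a toy model only: `ToyInstr`, `prependExitMarkers` and the marker string are hypothetical names for illustration, not the types or functions used by the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for a machine instruction; not an LLVM type.
struct ToyInstr {
  std::string Text;
  bool IsReturn; // true for any return form, e.g. "bx lr" or "pop {r4, pc}"
};

// Hypothetical marker text standing in for the exit-sled pseudo instruction.
static const char *const ExitMarker = "PATCHABLE_FUNCTION_EXIT";

// Insert the exit marker immediately before every return instruction and
// leave the return itself untouched, so a combined "pop and return" stays a
// single instruction. Returns the number of markers added.
inline std::size_t prependExitMarkers(std::vector<ToyInstr> &Block) {
  std::size_t Added = 0;
  for (std::size_t I = 0; I < Block.size(); ++I) {
    if (!Block[I].IsReturn)
      continue;
    Block.insert(Block.begin() + static_cast<std::ptrdiff_t>(I),
                 ToyInstr{ExitMarker, false});
    ++Added;
    ++I; // step over the return that was just shifted one slot to the right
  }
  return Added;
}
```

Because the marker sits before the return rather than replacing it, the sled has to call the exit trampoline and come back, which matches the explanation given for ARM above.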

Implemented the requested changes (more comments).

Hi, I see a number of problems with this patch. The most common one is the direct emission of binary patterns, which is not clear nor maintainable. Please, use the builders to emit instructions.

Also, I'm worried that the space you're reserving for the binary patch won't be enough for all cases. There are a number of PCS issues (hard vs soft, larger-than-32bit returns, arch and sub-arch support of return styles), none of which you're accounting for.

Furthermore, you need to make sure thumb-interworking works. You're outputting ARM code, but the user code can very well be Thumb, so you need to make sure it works. Not all architectures support BLX either (ex. v4T), and POP { lr } has been deprecated.

Finally, you need tests. A lot of them. To make sure you are covering the architectures you intend, in all the configurations you intend, and to actively fail if you don't intend, by adding checks in the code that error out when the arch / sub-arch is in a combination you don't expect.

rengolin added inline comments. Aug 30 2016, 8:54 AM

lib/CodeGen/XRayInstrumentation.cpp
46–53 ↗	(On Diff #69390)	Agreed. Probably move the separate comments to their implementations?
92 ↗	(On Diff #69390)	Good point. Probably not correct.
126 ↗	(On Diff #69390)	nit: this comment is better applied to the function "prependRetWithPatchableExit" after the case. People will know what to do in the future. You don't need a comment on the default case, too.
lib/Target/ARM/ARMAsmPrinter.cpp
1983 ↗	(On Diff #69390)	No need for braces if you're not declaring variables.
lib/Target/ARM/ARMMCInstLower.cpp
158 ↗	(On Diff #69390)	There isn't, as nop is currently only an alias, not an instruction. But take a look at: ARMInstrInfo::getNoopForMachoTarget() and do the same for ELF.
181 ↗	(On Diff #69390)	Why just save r0? AAPCS can use all four r0-r3 for return results.
187 ↗	(On Diff #69390)	BLX is unconditional, POP will never be executed. Is that intended?
198 ↗	(On Diff #69390)	Please, don't emit binary directly. Use the builders.

This revision now requires changes to proceed. Aug 30 2016, 8:54 AM

dberris added inline comments. Aug 30 2016, 7:18 PM

lib/CodeGen/XRayInstrumentation.cpp
92 ↗	(On Diff #69677)	Yes, this is definitely not correct. This is a remnant of some refactoring I've done and it stuck around. :( Let me add a test and fix, should be trivial.

dberris requested changes to this revision. Aug 30 2016, 10:30 PM

dberris edited edge metadata.

dberris added inline comments.

lib/CodeGen/XRayInstrumentation.cpp
92 ↗	(On Diff #69677)	This is now fixed in rL280192 -- please rebase to get the change (and tests).

In D23931#529004, @rengolin wrote:

Hi, I see a number of problems with this patch. The most common one is the direct emission of binary patterns, which is not clear nor maintainable. Please, use the builders to emit instructions.

Also, I'm worried that the space you're reserving for the binary patch won't be enough for all cases. There are a number of PCS issues (hard vs soft, larger-than-32bit returns, arch and sub-arch support of return styles), none of which you're accounting for.

Furthermore, you need to make sure thumb-interworking works. You're outputting ARM code, but the user code can very well be Thumb, so you need to make sure it works. Not all architectures support BLX either (ex. v4T), and POP { lr } has been deprecated.

Finally, you need tests. A lot of them. To make sure you are covering the architectures you intend, in all the configurations you intend, and to actively fail if you don't intend, by adding checks in the code that error out when the arch / sub-arch is in a combination you don't expect.

Hi,
Ok, I'll look if the same can be done with builders.
I'm not targeting all ARM architectures at once, at least not in the first commit. I think we should choose 1 ARM architecture for which XRay works, and assume the others not supported or experimental. Currently I am building and experimenting with armhf (32-bit).
Sled sizes do not have to fit all architectures either (this would result in waste of space for some, thus worse performance due to cache misses). Currently sleds are 11 bytes on x86_64 and 28 bytes on armhf.
What is PCS ?
Thumb is not supported yet.
Architectures which do not support BLX are not supported.
Any evidence that POP {lr} is deprecated? I could only find on the internet that "These instructions that include both PC and LR in the reglist are deprecated in ARMv6T2 and above.": http://infocenter.arm.com/help/index.jsp?topic=/com.arm.doc.dui0588b/Babefbce.html . I'm not using both pc and lr in PUSH or POP.
Any specific examples which more tests should be added for the single supported architecture armhf?

lib/CodeGen/XRayInstrumentation.cpp
126 ↗	(On Diff #69677)	Moving the comments towards the function calls.
lib/Target/ARM/ARMAsmPrinter.cpp
1983 ↗	(On Diff #69677)	Removing.
lib/Target/ARM/ARMMCInstLower.cpp
181 ↗	(On Diff #69677)	We save the other registers in the trampoline (`__xray_FunctionEntry` and `__xray_FunctionExit` assembly functions).
187 ↗	(On Diff #69677)	`POP` is intended to execute after return from the subroutine, which `BLX` calls.

Updated with the changes requested in the comments.

In D23931#530324, @rSerge wrote:

I'm not targeting all ARM architectures at once, at least not in the first commit. I think we should choose 1 ARM architecture for which XRay works, and assume the others not supported or experimental. Currently I am building and experimenting with armhf (32-bit).

Right, "armhf" is not one ARM architecture, but dozens. It can be anything from v6T2 to V8.2A, including all sub-architectures, features and variations. Though, from what I've seen so far, the code you use would work on any architecture of that range.

It would be safer, though, to document the *intended* target specifically, like "ARMv7A with VFPv3 support". So that people with "ARMv6T2 with VFPv2" support are not surprised when you assumed something "wrong" for them. Adding a "hasV6T2Ops()" check on the entry-point would help.

Sled sizes do not have to fit all architectures either (this would result in waste of space for some, thus worse performance due to cache misses). Currently sleds are 11 bytes on x86_64 and 28 bytes on armhf.

Check.

What is PCS ?

Procedure Call Standard. This is the part of the ABI that defines how functions are called to be compatible with the ABI. Mostly about how to serialise arguments and return values in registers, stack, etc.

Both C and C++, as well as any other language that wants to be compatible with ARM's EABI standard *have* to abide to those terms.

Thumb is not supported yet.

Do you mean not supported in the Sled code, or inserting ARM Sled code into Thumb functions?

If the former, then you have to check if the architecture/OS/ABI you're supporting allows ARM code. For instance, Windows doesn't.

If the latter, then you need to check if the architecture/OS/ABI you're supporting allows Thumb code. For instance, there could be libraries around, or even inline assembly with ".thumb" in it (yes, this does happen). I can't remember how, but there's a way to know what's the ISA for a specific function, this could help you. OTOH, this could be an assembler thing, can't remember.

Any way, you need to check if the architecture/OS/ISA/ABI you have is compatible with your assumptions before you emit code.

Architectures which do not support BLX are not supported.

Fair enough. But as I said earlier, this has to be clearly encoded (via error messages) on the entry-point of your code.

Any evidence that POP {lr} is deprecated?

Sorry, my bad. I was thinking about a different case. Ignore me.

Any specific examples which more tests should be added for the single supported architecture armhf?

As I said earlier, you need to make sure you only emit your stubs on architectures that you know work. Checking the target for architecture level, ISA support and ABI should be enough, at least on the entry-point.

Adding tests is, then, easily done by having two files: one where everything should fail, RUNning with a "not" before "llc", CHECKing for error messages, and one where everything should pass, CHECKing for the correct sequence of Nops, etc.

It should be fine to add all error messages to one file and all cases that should pass to another.

cheers,
--renato

lib/Target/ARM/ARMMCInstLower.cpp
181 ↗	(On Diff #69851)	Right, and these are guaranteed to only use one 32-bit argument. Check.
187 ↗	(On Diff #69851)	D'oh, Branch&Link, sorry, you're correct.
test/CodeGen/ARM/xray-attribute-instrumentation.ll
5 ↗	(On Diff #69851)	I was expecting Nops...

rSerge marked 6 inline comments as done. Sep 1 2016, 8:31 AM

rSerge added inline comments.

lib/Target/X86/X86AsmPrinter.h
74–94 ↗	(On Diff #69851)	Do you mean the description for the diff? Or a comment in the source code?
test/CodeGen/ARM/xray-attribute-instrumentation.ll
5 ↗	(On Diff #69851)	The first instruction is a jump over the NOPs. The other 6 instructions are NOPs.
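
The sled layout described here (one branch over six NOPs, 7 instructions of 4 bytes each, giving the 28 bytes quoted earlier) can be worked through as a small sketch. The branch and `mov r0, r0` encodings are the standard ARM-mode ones; that the patch uses `mov r0, r0` as its NOP, and the helper names below, are assumptions made for illustration.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

constexpr std::size_t SledWords = 7;           // 1 branch + 6 NOPs
constexpr std::uint32_t ArmNop = 0xE1A00000u;  // mov r0, r0 (assumed NOP form)

// Encode an unconditional ARM-mode B with a non-negative byte displacement
// relative to the value PC holds when the branch executes.
constexpr std::uint32_t encodeBranch(std::int32_t ByteOffsetFromPc) {
  return 0xEA000000u |
         (static_cast<std::uint32_t>(ByteOffsetFromPc >> 2) & 0x00FFFFFFu);
}

// Build the unpatched sled as 32-bit instruction words.
inline std::array<std::uint32_t, SledWords> makeUnpatchedSled() {
  std::array<std::uint32_t, SledWords> Sled{};
  // In ARM state PC reads as the branch address + 8, so landing just past
  // the sled (byte 28) needs a displacement of 28 - 8 = 20.
  Sled[0] = encodeBranch(static_cast<std::int32_t>(SledWords * 4) - 8);
  for (std::size_t I = 1; I < SledWords; ++I)
    Sled[I] = ArmNop;
  return Sled;
}

// Size of the sled in bytes.
constexpr std::size_t sledSizeInBytes() {
  return SledWords * sizeof(std::uint32_t);
}

// Byte offset, measured from the start of the sled, that a forward branch
// placed in word 0 jumps to.
constexpr std::uint32_t branchTargetFromSledStart(std::uint32_t Insn) {
  return ((Insn & 0x00FFFFFFu) << 2) + 8;
}
```

Keeping the sled as instruction words rather than a byte string leaves byte order to whatever finally emits the words, which is the concern with `.ascii` output on big-endian targets.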

rSerge updated this object. Sep 1 2016, 8:32 AM

rSerge edited edge metadata.

rengolin added inline comments. Sep 1 2016, 8:36 AM

test/CodeGen/ARM/xray-attribute-instrumentation.ll
5 ↗	(On Diff #69851)	Right, I was referring to the .ascii... When you use builders, this won't happen any more. It will also work in big-endian. :)

In D23931#530369, @rengolin wrote:

In D23931#530324, @rSerge wrote:

I'm not targeting all ARM architectures at once, at least not in the first commit. I think we should choose 1 ARM architecture for which XRay works, and assume the others not supported or experimental. Currently I am building and experimenting with armhf (32-bit).

Right, "armhf" is not one ARM architecture, but dozens. It can be anything from v6T2 to V8.2A, including all sub-architectures, features and variations. Though, from what I've seen so far, the code you use would work on any architecture of that range.

Thanks for explaining. I am still starting with ARM and LLVM.

It would be safer, though, to document the *intended* target specifically, like "ARMv7A with VFPv3 support". So that people with "ARMv6T2 with VFPv2" support are not surprised when you assumed something "wrong" for them. Adding a "hasV6T2Ops()" check on the entry-point would help.

Ok, I'll try to select something more specific than armhf.

Sled sizes do not have to fit all architectures either (this would result in waste of space for some, thus worse performance due to cache misses). Currently sleds are 11 bytes on x86_64 and 28 bytes on armhf.

Check.

What is PCS ?

Procedure Call Standard. This is the part of the ABI that defines how functions are called to be compatible with the ABI. Mostly about how to serialise arguments and return values in registers, stack, etc.

Both C and C++, as well as any other language that wants to be compatible with ARM's EABI standard *have* to abide to those terms.

Thumb is not supported yet.

Do you mean not supported in the Sled code, or inserting ARM Sled code into Thumb functions?

Neither is supported. I estimated that Thumb support requires substantial additional effort.

If the former, then you have to check if the architecture/OS/ABI you're supporting allows ARM code. For instance, Windows doesn't.

If the latter, then you need to check if the architecture/OS/ABI you're supporting allows Thumb code. For instance, there could be libraries around, or even inline assembly with ".thumb" in it (yes, this does happen). I can't remember how, but there's a way to know what's the ISA for a specific function, this could help you. OTOH, this could be an assembler thing, can't remember.

Yes, this looks like a lot of effort.

Any way, you need to check if the architecture/OS/ISA/ABI you have is compatible with your assumptions before you emit code.

Architectures which do not support BLX are not supported.

Fair enough. But as I said earlier, this has to be clearly encoded (via error messages) on the entry-point of your code.

Any evidence that POP {lr} is deprecated?

Sorry, my bad. I was thinking about a different case. Ignore me.

Any specific examples which more tests should be added for the single supported architecture armhf?

As I said earlier, you need to make sure you only emit your stubs on architectures that you know work. Checking the target for architecture level, ISA support and ABI should be enough, at least on the entry-point.

Adding tests is, then, easily done by having two files: one where everything should fail, RUNning with a "not" before "llc", CHECKing for error messages, and one where everything should pass, CHECKing for the correct sequence of Nops, etc.

It should be fine to add all error messages to one file and all cases that should pass to another.

cheers,
--renato

The amount of change requested in the code review seems too much for the first iteration. Can we limit the scope and plan incremental improvements?

Cheers,
Serge

Updated with the latest changes from mainline.

In D23931#531728, @rSerge wrote:

Do you mean not supported in the Sled code, or inserting ARM Sled code into Thumb functions?

Neither is supported. I estimated that Thumb support requires substantial additional effort.

My gut feeling is that this should mostly work already, since you're using BLX instructions.

But I agree, let's not get ahead of ourselves.

Limit support for ARMv7A, non-Windows (which forces Thumb2). Something like:

if (!SubTarget->hasV7Ops() || SubTarget->isWindows())
  return Forgerabarit.

cheers,
--renato

dberris added inline comments. Sep 1 2016, 11:14 PM

lib/Target/X86/X86AsmPrinter.h
74–94 ↗	(On Diff #70043)	Definitely a description in the diff.

In D23931#531853, @rengolin wrote:

In D23931#531728, @rSerge wrote:

Do you mean not supported in the Sled code, or inserting ARM Sled code into Thumb functions?

Neither is supported. I estimated that Thumb support requires substantial additional effort.

My gut feeling is that this should mostly work already, since you're using BLX instructions.

BLX r12 instruction has different machine code for ARM and Thumb. It is 4 byte long on ARM and 2 byte long on Thumb. Furthermore, the rest of machine code in a sled contains 32-bit ARM instructions. Thumb may need different machine code, or even sequence of instructions because not everything is available in Thumb. To avoid changing trampoline assembly code, the trampoline can be called with BLX indicating that the destination is in ARM assembly.

But I agree, let's not get ahead of ourselves.

Limit support for ARMv7A, non-Windows (which forces Thumb2). Something like:
if (!SubTarget->hasV7Ops() || SubTarget->isWindows())
  return Forgerabarit.

Ok.

Implemented the changes requested in the code review.

Limit support for ARMv7A, non-Windows (which forces Thumb2). Something like:
if (!SubTarget->hasV7Ops() || SubTarget->isWindows())
  return Forgerabarit.

It seems that ARMv6 is sufficient. Implemented mostly as suggested.
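
As a rough illustration of the kind of gate being discussed (ARMv6 or later, ARM mode rather than Thumb, not Windows), here is a self-contained sketch. `ToySubtarget` and its fields are hypothetical stand-ins, not the LLVM subtarget API, and the exact condition used by the patch may differ.

```cpp
#include <cassert>

// Toy description of a compilation target; not an LLVM class.
struct ToySubtarget {
  int ArchVersion; // 5 for ARMv5, 6 for ARMv6, 7 for ARMv7, ...
  bool IsThumb;    // true when the function is compiled as Thumb code
  bool IsWindows;  // Windows on ARM forces Thumb-2
};

// Returns true only for the combinations the discussion treats as supported:
// ARMv6 or newer, ARM (not Thumb) code, and a non-Windows OS.
inline bool isXRaySupported(const ToySubtarget &ST) {
  return ST.ArchVersion >= 6 && !ST.IsThumb && !ST.IsWindows;
}
```

A caller would check this once at the entry point and report an error (or skip emitting sleds) when it returns false.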

lib/Target/ARM/ARMMCInstLower.cpp
159 ↗	(On Diff #70255)	Changed.
199 ↗	(On Diff #70255)	Done.
test/CodeGen/ARM/xray-attribute-instrumentation.ll
6 ↗	(On Diff #70255)	Done.

Hi Serge,

The Nop emission is really simple, and the isXRaySupported() is really simple and accurate. Thanks for addressing all the comments, the code is looking really nice.

Now, two points:

There are ways to report warnings/errors back to the front-end, but it depends how this is interpreted.

Since the instrumentation is inserted by the front end, then this should be a back-end *error*, and front-ends should fail with a decent error message saying "XRay is not supported for target X".

If you want just a warning, you can avoid inserting the sleds and the run-time code won't do anything, as you're doing it now. But you then have to warn the users that they won't get what they requested. I strongly suggest to make it an error instead.

For error messages, it's best to use "getContext().reportError(Loc, ...)", as this would nicely roll back to the front-end without crashing. But if that doesn't work (it should, really), you can use "report_fatal_error", "llvm_unreachable" or even an "assert()", though these are just last-resort only.

About front-end duplicating the checks, it's up to you and @dberris. The error message in Clang and llc should be the same, though, and reportError() does that well.

Tests.

The current test is good, it checks the right number of NOPs and the overall structure. Excellent.

Now we need "negative tests", ie. those that *have* to fail. For that, you add a RUN line that starts with "not llc ..." and CHECK for the error messages. There are plenty of examples in there already.

Since you're restricting x86_64, you should have one for i386. Since you're restricting ARMv6/Unix, you should have one for ARMv5, and one for ARM Windows.

cheers,
--renato

In D23931#533606, @rengolin wrote:

Hi Serge,

The Nop emission is really simple, and the isXRaySupported() is really simple and accurate. Thanks for addressing all the comments, the code is looking really nice.

+1 -- thanks @rSerge!

Now, two points:

There are ways to report warnings/errors back to the front-end, but it depends how this is interpreted.

Since the instrumentation is inserted by the front end, then this should be a back-end *error*, and front-ends should fail with a decent error message saying "XRay is not supported for target X".

If you want just a warning, you can avoid inserting the sleds and the run-time code won't do anything, as you're doing it now. But you then have to warn the users that they won't get what they requested. I strongly suggest to make it an error instead.

For error messages, it's best to use "getContext().reportError(Loc, ...)", as this would nicely roll back to the front-end without crashing. But if that doesn't work (it should, really), you can use "report_fatal_error", "llvm_unreachable" or even an "assert()", though these are just last-resort only.

About front-end duplicating the checks, it's up to you and @dberris. The error message in Clang and llc should be the same, though, and reportError() does that well.

I'm happy with an error using the usual error reporting mechanisms here.

Tests.

The current test is good, it checks the right number of NOPs and the overall structure. Excellent.

Now we need "negative tests", ie. those that *have* to fail. For that, you add a RUN line that starts with "not llc ..." and CHECK for the error messages. There are plenty of examples in there already.

Since you're restricting x86_64, you should have one for i386. Since you're restricting ARMv6/Unix, you should have one for ARMv5, and one for ARM Windows.

I agree with this. FWIW, I'm happy with getting this in and getting it tested, then locking it down with more negative tests once it's upstream.

Thanks Renato!

lib/Target/ARM/ARMAsmPrinter.h
102–106 ↗	(On Diff #70255)	Do you already want to support tail call optimisation sleds now? Or did you plan to do something about that later?

LGTM (I think we should be fine with adding more tests later)

Thanks again @rSerge!

Thanks!

This revision is now accepted and ready to land. Sep 6 2016, 1:14 AM

rSerge marked 3 inline comments as done. Sep 6 2016, 12:17 PM

This comment was removed by rSerge.

rSerge added a comment. Sep 6 2016, 12:56 PM

This comment was removed by rSerge.

So something started to just remove the first instruction of the sled, whether the sled is emitted as binary or using instructions/builders. Clang -S generates the assembly file with correct sleds (all the instructions present), but then disassembly of the object or executable file shows only 6 last instructions, without the first instruction of the sled.
UPDATE: I just confused the compile options, so assembly files were new and object files were old. No problem with this in the code, tested.

Rebased to the latest revision. I don't have commit access rights. Could someone commit?

I'll do it for all three, thanks again @rSerge!

For some reason the standard arc patch DNNNNN workflow doesn't apply to this patch (I'm not sure if it's generated in a manner not using arcanist). I've had to massage this manually by doing:

curl https://reviews.llvm.org/file/data/spjqzhddatjrbozzbl4u/PHID-FILE-7s4h3zdshadln2e7cgbi/D23931.diff  | git apply - -p0 --ignore-whitespace --whitespace=fix

I may have to do something similar to the other patches, so all landing errors will be mine.

Closed by commit rL280888: [XRay] ARM 32-bit no-Thumb support in LLVM (authored by dberris). Sep 7 2016, 5:27 PM

This revision was automatically updated to reflect the committed changes.

Thanks all, especially @dberris .

So, unfortunately this got reverted in rL280967 because it fails on Thumb (the checks that would stop XRay sleds from being generated for Thumb targets hadn't been put in).

@rSerge -- are you able to put in the appropriate checks to warn when using XRay on thumb? @rengolin has offered to help with the testing on the build-bots to make this possible.

This revision is now accepted and ready to land. Sep 8 2016, 9:24 PM

I think @rengolin has more details as to how this caused failures and how else to debug on thumb.

This revision now requires changes to proceed. Sep 8 2016, 9:25 PM

dberris mentioned this in rL280889: [XRay] ARM 32-bit no-Thumb support in Clang. Sep 8 2016, 9:27 PM

dberris mentioned this in D23932: [XRay] ARM 32-bit no-Thumb support in Clang.

dberris mentioned this in D23933: [XRay] ARM 32-bit no-Thumb support in compiler-rt. Sep 8 2016, 9:31 PM

I don't yet understand how these commits could break build-bots. Did someone add -fxray-instrument clang option to bots which generate Thumb code?

In D23931#538093, @rSerge wrote:

I don't yet understand how these commits could break build-bots. Did someone add -fxray-instrument clang option to bots which generate Thumb code?

Nope. The error was when compiling xray_trampoline_arm.S.

Compiler-RT's patch enables XRay on ARM, which means it'll run all the existing XRay tests on ARM buildbots, which also mean Thumb ones, which also build XRay's sources.

This was the error message:

FAILED: /usr/lib/ccache/cc  -DXRAY_HAS_EXCEPTIONS=1 -D_DEBUG -D_GNU_SOURCE -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Iprojects/compiler-rt/lib/xray -I/home/linaro/devel/buildbot/clang-cmake-thumbv7-a15-full-sh/llvm/projects/compiler-rt/lib/xray -Iinclude -I/home/linaro/devel/buildbot/clang-cmake-thumbv7-a15-full-sh/llvm/include -I/home/linaro/devel/buildbot/clang-cmake-thumbv7-a15-full-sh/llvm/projects/compiler-rt/lib/xray/.. -I/home/linaro/devel/buildbot/clang-cmake-thumbv7-a15-full-sh/llvm/projects/compiler-rt/lib/xray/../../include -fPIC -O3 -DNDEBUG   -UNDEBUG  -march=armv7-a -mfloat-abi=hard -fPIC -fno-builtin -fno-exceptions -fomit-frame-pointer -funwind-tables -fno-stack-protector -fvisibility=hidden -fvisibility-inlines-hidden -fno-function-sections -fno-lto -O3 -g -Wno-variadic-macros -Wno-non-virtual-dtor -MMD -MT projects/compiler-rt/lib/xray/CMakeFiles/clang_rt.xray-armhf.dir/xray_trampoline_arm.S.o -MF projects/compiler-rt/lib/xray/CMakeFiles/clang_rt.xray-armhf.dir/xray_trampoline_arm.S.o.d -o projects/compiler-rt/lib/xray/CMakeFiles/clang_rt.xray-armhf.dir/xray_trampoline_arm.S.o -c /home/linaro/devel/buildbot/clang-cmake-thumbv7-a15-full-sh/llvm/projects/compiler-rt/lib/xray/xray_trampoline_arm.S

llvm/projects/compiler-rt/lib/xray/xray_trampoline_arm.S: Assembler messages:
llvm/projects/compiler-rt/lib/xray/xray_trampoline_arm.S:17: Error: attempt to use an ARM instruction on a Thumb-only processor -- `push {r1-r3,lr}'

My patch didn't work because this is using the system compiler, and not Clang, and GCC was picky about assembling ARM instructions (from xray_trampoline_arm.S) into an object that will be linked with other Thumb-only objects.

This will require some experimentation with a cross GCC/binutils, to make sure that -mthumb won't generate the NOPs as well as not try to link the code. An #ifndef __thumb__ in xray_trampoline_arm.S to omit everything could work, so if the Clang implementation is wrong, we get a compiler error instead of a run-time error.

cheers,
--renato
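
The `__thumb__` guard suggested above can be illustrated with a small compile-time check. `__thumb__` is predefined by GCC and Clang when compiling Thumb code; the `TOY_XRAY_REQUIRE_ARM_MODE` switch and the names below are hypothetical, and in the real trampoline the guard would live in the assembly file rather than in C++.

```cpp
#include <cassert>

// Record at compile time whether this translation unit targets Thumb.
#if defined(__thumb__)
constexpr bool BuiltForThumb = true;
#else
constexpr bool BuiltForThumb = false;
#endif

// Hypothetical opt-in switch: when defined, a Thumb build stops with a
// compiler error instead of producing code that fails at run time.
#if defined(__thumb__) && defined(TOY_XRAY_REQUIRE_ARM_MODE)
#error "This sketch assumes ARM-mode code; build without -mthumb"
#endif

// Report whether the ARM-mode-only code path may be used in this build.
inline bool canUseArmModeTrampoline() { return !BuiltForThumb; }
```

Failing at compile time is the point of the suggestion: a wrong configuration is caught by the build rather than by a crash in instrumented code.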

Now I understand, thanks, @rengolin . Thumb is on my list, though I thought it can be done later. Now I need to weigh whether all the work with conditional compilation and error reporting for Thumb is not too much w.r.t. the time to just implement the support for Thumb. I'm looking into this...

Fixed "Error: attempt to use an ARM instruction on a Thumb-only processor -- `push {r1-r3,lr}' ". The reason was the ".arch armv7" directive. For GCC this directive represents the intersection of the armv7-a and armv7-m instruction sets, implying Thumb-only instructions, and this conflicts with the ".code 32" directive. Then GCC, instead of articulating the conflict, complains about every instruction in the assembly file.
Tested on cross-compilation with GCC from x86_64-Ubuntu to ARM-Linux.
Tested on cross-compilation with Clang from x86_64-Windows to ARM-Linux.

Fixed patch file format.

In D23931#544741, @rSerge wrote:

Fixed "Error: attempt to use an ARM instruction on a Thumb-only processor -- `push {r1-r3,lr}' ". The reason was the ".arch armv7" directive. For GCC this directive represents the intersection of the armv7-a and armv7-m instruction sets, implying Thumb-only instructions, and this conflicts with the ".code 32" directive. Then GCC, instead of articulating the conflict, complains about every instruction in the assembly file.

Ah, yes! This makes sense.

Thanks @rSerge -- I'll land this and dependent patches again.

Cheers

This revision is now accepted and ready to land. Sep 18 2016, 5:03 PM

Closed by commit rL281878: [XRay] ARM 32-bit no-Thumb support in LLVM (authored by dberris). Sep 18 2016, 6:03 PM

This revision was automatically updated to reflect the committed changes.

rSerge added a child revision: D24799: [XRay] Check in Clang whether XRay supports the target when -fxray-instrument is passed. Sep 21 2016, 7:27 AM

rSerge added a child revision: D25030: [XRay] Support for for tail calls for ARM no-Thumb. Sep 28 2016, 8:49 AM

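Revision Contents

Path	Size
llvm/trunk/include/llvm/CodeGen/AsmPrinter.h	28 lines
llvm/trunk/include/llvm/Target/Target.td	10 lines
llvm/trunk/include/llvm/Target/TargetOpcodes.def	17 lines
llvm/trunk/include/llvm/Target/TargetSubtargetInfo.h	2 lines
llvm/trunk/lib/CodeGen/AsmPrinter/AsmPrinter.cpp	10 lines
llvm/trunk/lib/CodeGen/XRayInstrumentation.cpp	115 lines
llvm/trunk/lib/Target/ARM/ARMAsmPrinter.h	12 lines
llvm/trunk/lib/Target/ARM/ARMAsmPrinter.cpp	9 lines
llvm/trunk/lib/Target/ARM/ARMBaseInstrInfo.h	4 lines
llvm/trunk/lib/Target/ARM/ARMMCInstLower.cpp	94 lines
llvm/trunk/lib/Target/ARM/ARMSubtarget.h	2 lines
llvm/trunk/lib/Target/ARM/ARMSubtarget.cpp	5 lines
llvm/trunk/lib/Target/X86/X86AsmPrinter.h	23 lines
llvm/trunk/lib/Target/X86/X86MCInstLower.cpp	10 lines
llvm/trunk/lib/Target/X86/X86Subtarget.h	2 lines
llvm/trunk/test/CodeGen/ARM/xray-armv6-attribute-instrumentation.ll	24 lines
llvm/trunk/test/CodeGen/ARM/xray-armv7-attribute-instrumentation.ll	24 lines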

Diff 71763

llvm/trunk/include/llvm/CodeGen/AsmPrinter.h

Show First 20 Lines • Show All 178 Lines • ▼ Show 20 Lines	public:
const MCSection *getCurrentSection() const;		const MCSection *getCurrentSection() const;

void getNameWithPrefix(SmallVectorImpl<char> &Name,		void getNameWithPrefix(SmallVectorImpl<char> &Name,
const GlobalValue *GV) const;		const GlobalValue *GV) const;

MCSymbol *getSymbol(const GlobalValue *GV) const;		MCSymbol *getSymbol(const GlobalValue *GV) const;

//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//
		// XRay instrumentation implementation.
		//===------------------------------------------------------------------===//
		public:
		// This describes the kind of sled we're storing in the XRay table.
		enum class SledKind : uint8_t {
		FUNCTION_ENTER = 0,
		FUNCTION_EXIT = 1,
		TAIL_CALL = 2,
		};

		// The table will contain these structs that point to the sled, the function
		// containing the sled, and what kind of sled (and whether they should always
		// be instrumented).
		struct XRayFunctionEntry {
		const MCSymbol *Sled;
		const MCSymbol *Function;
		SledKind Kind;
		bool AlwaysInstrument;
		const class Function *Fn;
		};

		// All the sleds to be emitted.
		std::vector<XRayFunctionEntry> Sleds;

		// Helper function to record a given XRay sled.
		void recordSled(MCSymbol *Sled, const MachineInstr &MI, SledKind Kind);

		//===------------------------------------------------------------------===//
// MachineFunctionPass Implementation.		// MachineFunctionPass Implementation.
//===------------------------------------------------------------------===//		//===------------------------------------------------------------------===//

/// Record analysis usage.		/// Record analysis usage.
///		///
void getAnalysisUsage(AnalysisUsage &AU) const override;		void getAnalysisUsage(AnalysisUsage &AU) const override;

/// Set up the AsmPrinter when we are working on a new module. If your pass		/// Set up the AsmPrinter when we are working on a new module. If your pass
▲ Show 20 Lines • Show All 360 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/Target/Target.td

Show First 20 Lines • Show All 971 Lines • ▼ Show 20 Lines	def PATCHABLE_FUNCTION_ENTER : Instruction {
let InOperandList = (ins);		let InOperandList = (ins);
let AsmString = "# XRay Function Enter.";		let AsmString = "# XRay Function Enter.";
let usesCustomInserter = 1;		let usesCustomInserter = 1;
let hasSideEffects = 0;		let hasSideEffects = 0;
}		}
def PATCHABLE_RET : Instruction {		def PATCHABLE_RET : Instruction {
let OutOperandList = (outs unknown:$dst);		let OutOperandList = (outs unknown:$dst);
let InOperandList = (ins variable_ops);		let InOperandList = (ins variable_ops);
let AsmString = "# XRay Function Exit.";		let AsmString = "# XRay Function Patchable RET.";
let usesCustomInserter = 1;		let usesCustomInserter = 1;
let hasSideEffects = 1;		let hasSideEffects = 1;
let isReturn = 1;		let isReturn = 1;
}		}
		def PATCHABLE_FUNCTION_EXIT : Instruction {
		let OutOperandList = (outs);
		let InOperandList = (ins);
		let AsmString = "# XRay Function Exit.";
		let usesCustomInserter = 1;
		let hasSideEffects = 0; // FIXME: is this correct?
		let isReturn = 0; // Original return instruction will follow
		}
def PATCHABLE_TAIL_CALL : Instruction {		def PATCHABLE_TAIL_CALL : Instruction {
let OutOperandList = (outs unknown:$dst);		let OutOperandList = (outs unknown:$dst);
let InOperandList = (ins variable_ops);		let InOperandList = (ins variable_ops);
let AsmString = "# XRay Tail Call Exit.";		let AsmString = "# XRay Tail Call Exit.";
let usesCustomInserter = 1;		let usesCustomInserter = 1;
let hasSideEffects = 1;		let hasSideEffects = 1;
let isReturn = 1;		let isReturn = 1;
}		}
▲ Show 20 Lines • Show All 342 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/Target/TargetOpcodes.def

	Show First 20 Lines • Show All 147 Lines • ▼ Show 20 Lines

	/// This is a marker instruction which gets translated into a nop sled, useful			/// This is a marker instruction which gets translated into a nop sled, useful
	/// for inserting instrumentation instructions at runtime.			/// for inserting instrumentation instructions at runtime.
	HANDLE_TARGET_OPCODE(PATCHABLE_FUNCTION_ENTER)			HANDLE_TARGET_OPCODE(PATCHABLE_FUNCTION_ENTER)

	/// Wraps a return instruction and its operands to enable adding nop sleds			/// Wraps a return instruction and its operands to enable adding nop sleds
	/// either before or after the return. The nop sleds are useful for inserting			/// either before or after the return. The nop sleds are useful for inserting
	/// instrumentation instructions at runtime.			/// instrumentation instructions at runtime.
				/// The patch here replaces the return instruction.
	HANDLE_TARGET_OPCODE(PATCHABLE_RET)			HANDLE_TARGET_OPCODE(PATCHABLE_RET)

				/// This is a marker instruction which gets translated into a nop sled, useful
				/// for inserting instrumentation instructions at runtime.
				/// The patch here prepends the return instruction.
				/// The same thing as in x86_64 is not possible for ARM because it has multiple
				/// return instructions. Furthermore, CPU allows parametrized and even
				/// conditional return instructions. In the current ARM implementation we are
				/// making use of the fact that currently LLVM doesn't seem to generate
				/// conditional return instructions.
				/// On ARM, the same instruction can be used for popping multiple registers
				/// from the stack and returning (it just pops pc register too), and LLVM
				/// generates it sometimes. So we can't insert the sled between this stack
				/// adjustment and the return without splitting the original instruction into 2
				/// instructions. So on ARM, rather than jumping into the exit trampoline, we
				/// call it, it does the tracing, preserves the stack and returns.
				HANDLE_TARGET_OPCODE(PATCHABLE_FUNCTION_EXIT)

	/// Wraps a tail call instruction and its operands to enable adding nop sleds			/// Wraps a tail call instruction and its operands to enable adding nop sleds
	/// either before or after the tail exit. We use this as a disambiguation from			/// either before or after the tail exit. We use this as a disambiguation from
	/// PATCHABLE_RET which specifically only works for return instructions.			/// PATCHABLE_RET which specifically only works for return instructions.
	HANDLE_TARGET_OPCODE(PATCHABLE_TAIL_CALL)			HANDLE_TARGET_OPCODE(PATCHABLE_TAIL_CALL)

	/// The following generic opcodes are not supposed to appear after ISel.			/// The following generic opcodes are not supposed to appear after ISel.
	/// This is something we might want to relax, but for now, this is convenient			/// This is something we might want to relax, but for now, this is convenient
	/// to produce diagnostics.			/// to produce diagnostics.
	▲ Show 20 Lines • Show All 189 Lines • Show Last 20 Lines
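
The comment on PATCHABLE_FUNCTION_EXIT above contrasts two strategies: replacing the return (x86_64) and prepending a sled to the return (ARM). A minimal sketch of the difference follows, using a toy instruction list rather than LLVM's MachineInstr. The `Instr` struct and both function bodies are illustrative assumptions that only borrow names from the patch; they are not the committed implementation.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Toy instruction (hypothetical): an opcode string plus a "this returns" flag.
struct Instr {
  std::string Opcode;
  bool IsReturn;
};

// x86_64-style: the single known return opcode is wrapped by PATCHABLE_RET,
// so the original return instruction no longer appears on its own.
std::vector<Instr> replaceRetWithPatchableRet(const std::vector<Instr> &Body,
                                              const std::string &RetOpcode) {
  std::vector<Instr> Out;
  for (const Instr &I : Body) {
    if (I.IsReturn && I.Opcode == RetOpcode)
      Out.push_back(Instr{"PATCHABLE_RET " + I.Opcode, true});
    else
      Out.push_back(I);
  }
  return Out;
}

// ARM-style: any returning instruction (for example a pop that also loads pc)
// is kept as-is, and a non-returning exit marker is placed just before it.
std::vector<Instr> prependRetWithPatchableExit(const std::vector<Instr> &Body) {
  std::vector<Instr> Out;
  for (const Instr &I : Body) {
    if (I.IsReturn)
      Out.push_back(Instr{"PATCHABLE_FUNCTION_EXIT", false});
    Out.push_back(I);
  }
  return Out;
}
```

In this model the ARM variant never needs to know which opcode performs the return, which is the property the comment relies on.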

llvm/trunk/include/llvm/Target/TargetSubtargetInfo.h

	Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
	public:			public:
	// AntiDepBreakMode - Type of anti-dependence breaking that should			// AntiDepBreakMode - Type of anti-dependence breaking that should
	// be performed before post-RA scheduling.			// be performed before post-RA scheduling.
	typedef enum { ANTIDEP_NONE, ANTIDEP_CRITICAL, ANTIDEP_ALL } AntiDepBreakMode;			typedef enum { ANTIDEP_NONE, ANTIDEP_CRITICAL, ANTIDEP_ALL } AntiDepBreakMode;
	typedef SmallVectorImpl<const TargetRegisterClass *> RegClassVector;			typedef SmallVectorImpl<const TargetRegisterClass *> RegClassVector;

	virtual ~TargetSubtargetInfo();			virtual ~TargetSubtargetInfo();

				virtual bool isXRaySupported() const { return false; }

	// Interfaces to the major aspects of target machine information:			// Interfaces to the major aspects of target machine information:
	//			//
	// -- Instruction opcode and operand information			// -- Instruction opcode and operand information
	// -- Pipelines and scheduling information			// -- Pipelines and scheduling information
	// -- Stack frame information			// -- Stack frame information
	// -- Selection DAG lowering information			// -- Selection DAG lowering information
	// -- Call lowering information			// -- Call lowering information
	//			//
	▲ Show 20 Lines • Show All 147 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/AsmPrinter/AsmPrinter.cpp

Show First 20 Lines • Show All 2,594 Lines • ▼ Show 20 Lines	GCMetadataPrinter *AsmPrinter::GetOrCreateGCPrinter(GCStrategy &S) {

report_fatal_error("no GCMetadataPrinter registered for GC: " + Twine(Name));		report_fatal_error("no GCMetadataPrinter registered for GC: " + Twine(Name));
}		}

/// Pin vtable to this file.		/// Pin vtable to this file.
AsmPrinterHandler::~AsmPrinterHandler() {}		AsmPrinterHandler::~AsmPrinterHandler() {}

void AsmPrinterHandler::markFunctionEnd() {}		void AsmPrinterHandler::markFunctionEnd() {}

		void AsmPrinter::recordSled(MCSymbol *Sled, const MachineInstr &MI,
		SledKind Kind) {
		auto Fn = MI.getParent()->getParent()->getFunction();
		auto Attr = Fn->getFnAttribute("function-instrument");
		bool AlwaysInstrument =
		Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always";
		Sleds.emplace_back(
		XRayFunctionEntry{ Sled, CurrentFnSym, Kind, AlwaysInstrument, Fn });
		}
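
For readers following the move of recordSled into AsmPrinter, here is a rough standalone model of the bookkeeping. It assumes plain strings in place of MCSymbol pointers and passes the "function-instrument" attribute value in directly. `SledEntry` and `SledRecorder` are hypothetical names for this sketch only.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Same enumerator values as SledKind in the patch.
enum class SledKind : uint8_t {
  FUNCTION_ENTER = 0,
  FUNCTION_EXIT = 1,
  TAIL_CALL = 2,
};

// Simplified stand-in for XRayFunctionEntry (strings instead of MCSymbol *).
struct SledEntry {
  std::string Sled;
  std::string Function;
  SledKind Kind;
  bool AlwaysInstrument;
};

// Hypothetical stand-in for the sled list that AsmPrinter now owns.
class SledRecorder {
public:
  // FunctionInstrument models the value of the "function-instrument"
  // attribute; pass an empty string when the attribute is absent.
  void recordSled(const std::string &Sled, const std::string &Function,
                  const std::string &FunctionInstrument, SledKind Kind) {
    bool AlwaysInstrument = FunctionInstrument == "xray-always";
    Sleds.push_back(SledEntry{Sled, Function, Kind, AlwaysInstrument});
  }

  const std::vector<SledEntry> &sleds() const { return Sleds; }

  // The target's table emitter is expected to clear the list per function.
  void clear() { Sleds.clear(); }

private:
  std::vector<SledEntry> Sleds;
};
```

Keeping the list in the shared base class is what lets both the X86 and ARM printers record sleds through one helper.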

llvm/trunk/lib/CodeGen/XRayInstrumentation.cpp

Show All 28 Lines
struct XRayInstrumentation : public MachineFunctionPass {		struct XRayInstrumentation : public MachineFunctionPass {
static char ID;		static char ID;

XRayInstrumentation() : MachineFunctionPass(ID) {		XRayInstrumentation() : MachineFunctionPass(ID) {
initializeXRayInstrumentationPass(*PassRegistry::getPassRegistry());		initializeXRayInstrumentationPass(*PassRegistry::getPassRegistry());
}		}

bool runOnMachineFunction(MachineFunction &MF) override;		bool runOnMachineFunction(MachineFunction &MF) override;
};
}

bool XRayInstrumentation::runOnMachineFunction(MachineFunction &MF) {
auto &F = *MF.getFunction();
auto InstrAttr = F.getFnAttribute("function-instrument");
bool AlwaysInstrument = !InstrAttr.hasAttribute(Attribute::None) &&
InstrAttr.isStringAttribute() &&
InstrAttr.getValueAsString() == "xray-always";
Attribute Attr = F.getFnAttribute("xray-instruction-threshold");
unsigned XRayThreshold = 0;
if (!AlwaysInstrument) {
if (Attr.hasAttribute(Attribute::None) || !Attr.isStringAttribute())
return false; // XRay threshold attribute not found.
if (Attr.getValueAsString().getAsInteger(10, XRayThreshold))
return false; // Invalid value for threshold.
if (F.size() < XRayThreshold)
return false; // Function is too small.
}

// FIXME: Do the loop triviality analysis here or in an earlier pass.		private:
		// Replace the original RET instruction with the exit sled code ("patchable
// First, insert an PATCHABLE_FUNCTION_ENTER as the first instruction of the		// ret" pseudo-instruction), so that at runtime XRay can replace the sled
// MachineFunction.		// with a code jumping to XRay trampoline, which calls the tracing handler
auto &FirstMBB = *MF.begin();		// and, in the end, issues the RET instruction.
auto &FirstMI = *FirstMBB.begin();		// This is the approach to go on CPUs which have a single RET instruction,
auto *TII = MF.getSubtarget().getInstrInfo();		// like x86/x86_64.
BuildMI(FirstMBB, FirstMI, FirstMI.getDebugLoc(),		void replaceRetWithPatchableRet(MachineFunction &MF,
TII->get(TargetOpcode::PATCHABLE_FUNCTION_ENTER));		const TargetInstrInfo *TII);
		// Prepend the original return instruction with the exit sled code ("patchable
		// function exit" pseudo-instruction), preserving the original return
		// instruction just after the exit sled code.
		// This is the approach to go on CPUs which have multiple options for the
		// return instruction, like ARM. For such CPUs we can't just jump into the
		// XRay trampoline and issue a single return instruction there. We rather
		// have to call the trampoline and return from it to the original return
		// instruction of the function being instrumented.
		void prependRetWithPatchableExit(MachineFunction &MF,
		const TargetInstrInfo *TII);
		};
		} // anonymous namespace

// Then we look for all terminators and returns, then replace those with		void XRayInstrumentation::replaceRetWithPatchableRet(MachineFunction &MF,
		const TargetInstrInfo *TII)
		{
		// We look for all terminators and returns, then replace those with
// PATCHABLE_RET instructions.		// PATCHABLE_RET instructions.
SmallVector<MachineInstr *, 4> Terminators;		SmallVector<MachineInstr *, 4> Terminators;
for (auto &MBB : MF) {		for (auto &MBB : MF) {
for (auto &T : MBB.terminators()) {		for (auto &T : MBB.terminators()) {
unsigned Opc = 0;		unsigned Opc = 0;
if (T.isReturn() && T.getOpcode() == TII->getReturnOpcode()) {		if (T.isReturn() && T.getOpcode() == TII->getReturnOpcode()) {
// Replace return instructions with:		// Replace return instructions with:
// PATCHABLE_RET <Opcode>, <Operand>...		// PATCHABLE_RET <Opcode>, <Operand>...
Show All 11 Lines	for (auto &T : MBB.terminators()) {
MIB.addOperand(MO);		MIB.addOperand(MO);
Terminators.push_back(&T);		Terminators.push_back(&T);
}		}
}		}
}		}

for (auto &I : Terminators)		for (auto &I : Terminators)
I->eraseFromParent();		I->eraseFromParent();
		}

		void XRayInstrumentation::prependRetWithPatchableExit(MachineFunction &MF,
		const TargetInstrInfo *TII)
		{
		for (auto &MBB : MF) {
		for (auto &T : MBB.terminators()) {
		if (T.isReturn()) {
		// Prepend the return instruction with PATCHABLE_FUNCTION_EXIT
		BuildMI(MBB, T, T.getDebugLoc(),
		TII->get(TargetOpcode::PATCHABLE_FUNCTION_EXIT));
		}
		}
		}
		}

		bool XRayInstrumentation::runOnMachineFunction(MachineFunction &MF) {
		auto &F = *MF.getFunction();
		auto InstrAttr = F.getFnAttribute("function-instrument");
		bool AlwaysInstrument = !InstrAttr.hasAttribute(Attribute::None) &&
		InstrAttr.isStringAttribute() &&
		InstrAttr.getValueAsString() == "xray-always";
		Attribute Attr = F.getFnAttribute("xray-instruction-threshold");
		unsigned XRayThreshold = 0;
		if (!AlwaysInstrument) {
		if (Attr.hasAttribute(Attribute::None) || !Attr.isStringAttribute())
		return false; // XRay threshold attribute not found.
		if (Attr.getValueAsString().getAsInteger(10, XRayThreshold))
		return false; // Invalid value for threshold.
		if (F.size() < XRayThreshold)
		return false; // Function is too small.
		}

		auto &FirstMBB = *MF.begin();
		auto &FirstMI = *FirstMBB.begin();

		if (!MF.getSubtarget().isXRaySupported()) {
		FirstMI.emitError("An attempt to perform XRay instrumentation for an"
		" unsupported target.");
		return false;
		}

		// FIXME: Do the loop triviality analysis here or in an earlier pass.

		// First, insert an PATCHABLE_FUNCTION_ENTER as the first instruction of the
		// MachineFunction.
		auto *TII = MF.getSubtarget().getInstrInfo();
		BuildMI(FirstMBB, FirstMI, FirstMI.getDebugLoc(),
		TII->get(TargetOpcode::PATCHABLE_FUNCTION_ENTER));

		switch (MF.getTarget().getTargetTriple().getArch()) {
		case Triple::ArchType::arm:
		case Triple::ArchType::thumb:
		// For the architectures which don't have a single return instruction
		prependRetWithPatchableExit(MF, TII);
		break;
		default:
		// For the architectures that have a single return instruction (such as
		// RETQ on x86_64).
		replaceRetWithPatchableRet(MF, TII);
		break;
		}
return true;		return true;
}		}

char XRayInstrumentation::ID = 0;		char XRayInstrumentation::ID = 0;
char &llvm::XRayInstrumentationID = XRayInstrumentation::ID;		char &llvm::XRayInstrumentationID = XRayInstrumentation::ID;
INITIALIZE_PASS(XRayInstrumentation, "xray-instrumentation", "Insert XRay ops",		INITIALIZE_PASS(XRayInstrumentation, "xray-instrumentation", "Insert XRay ops",
false, false)		false, false)
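
The early-exit checks in runOnMachineFunction can be read as a small decision function. The sketch below restates them outside LLVM, under these assumptions: attributes are modeled by a hypothetical `Attr` struct, `parseThreshold` is a simple stand-in for the StringRef integer parsing, and `NumBlocks` plays the role of F.size(), which for an llvm::Function counts basic blocks as far as I can tell.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical model of a string function attribute.
struct Attr {
  bool Present;
  std::string Value;
};

// Parses a base-10 unsigned integer. Returns true on success. Note that
// StringRef::getAsInteger in the patch uses the opposite convention and
// returns true on failure.
bool parseThreshold(const std::string &Text, unsigned &Out) {
  if (Text.empty())
    return false;
  unsigned long long Value = 0;
  for (char C : Text) {
    if (C < '0' || C > '9')
      return false;
    Value = Value * 10 + static_cast<unsigned>(C - '0');
    if (Value > 0xFFFFFFFFull)
      return false; // Does not fit in 32 bits.
  }
  Out = static_cast<unsigned>(Value);
  return true;
}

// Mirrors the order of checks in the pass: "xray-always" wins outright,
// otherwise a valid threshold must exist and the function must reach it.
bool shouldInstrument(const Attr &FunctionInstrument, const Attr &Threshold,
                      std::size_t NumBlocks) {
  if (FunctionInstrument.Present && FunctionInstrument.Value == "xray-always")
    return true;
  if (!Threshold.Present)
    return false; // XRay threshold attribute not found.
  unsigned XRayThreshold = 0;
  if (!parseThreshold(Threshold.Value, XRayThreshold))
    return false; // Invalid value for threshold.
  return NumBlocks >= XRayThreshold; // Otherwise the function is too small.
}
```

In the pass itself, the isXRaySupported() check runs only after these conditions pass, so unsupported targets report an error only for functions that would actually be instrumented.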

llvm/trunk/lib/Target/ARM/ARMAsmPrinter.h

Show First 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	public:
void EmitStartOfAsmFile(Module &M) override;		void EmitStartOfAsmFile(Module &M) override;
void EmitEndOfAsmFile(Module &M) override;		void EmitEndOfAsmFile(Module &M) override;
void EmitXXStructor(const DataLayout &DL, const Constant *CV) override;		void EmitXXStructor(const DataLayout &DL, const Constant *CV) override;
void EmitGlobalVariable(const GlobalVariable *GV) override;		void EmitGlobalVariable(const GlobalVariable *GV) override;

// lowerOperand - Convert a MachineOperand into the equivalent MCOperand.		// lowerOperand - Convert a MachineOperand into the equivalent MCOperand.
bool lowerOperand(const MachineOperand &MO, MCOperand &MCOp);		bool lowerOperand(const MachineOperand &MO, MCOperand &MCOp);

		//===------------------------------------------------------------------===//
		// XRay implementation
		//===------------------------------------------------------------------===//
		public:
		// XRay-specific lowering for ARM.
		void LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI);
		void LowerPATCHABLE_FUNCTION_EXIT(const MachineInstr &MI);
		// Helper function that emits the XRay sleds we've collected for a particular
		// function.
		void EmitXRayTable();

private:		private:
		void EmitSled(const MachineInstr &MI, SledKind Kind);

// Helpers for EmitStartOfAsmFile() and EmitEndOfAsmFile()		// Helpers for EmitStartOfAsmFile() and EmitEndOfAsmFile()
void emitAttributes();		void emitAttributes();

// Generic helper used to emit e.g. ARMv5 mul pseudos		// Generic helper used to emit e.g. ARMv5 mul pseudos
void EmitPatchedInstruction(const MachineInstr *MI, unsigned TargetOpc);		void EmitPatchedInstruction(const MachineInstr *MI, unsigned TargetOpc);

void EmitUnwindingInstruction(const MachineInstr *MI);		void EmitUnwindingInstruction(const MachineInstr *MI);
Show All 32 Lines

llvm/trunk/lib/Target/ARM/ARMAsmPrinter.cpp

Show First 20 Lines • Show All 158 Lines • ▼ Show 20 Lines	if (Subtarget->isTargetCOFF()) {
OutStreamer->EmitCOFFSymbolStorageClass(Scl);		OutStreamer->EmitCOFFSymbolStorageClass(Scl);
OutStreamer->EmitCOFFSymbolType(Type);		OutStreamer->EmitCOFFSymbolType(Type);
OutStreamer->EndCOFFSymbolDef();		OutStreamer->EndCOFFSymbolDef();
}		}

// Emit the rest of the function body.		// Emit the rest of the function body.
EmitFunctionBody();		EmitFunctionBody();

		// Emit the XRay table for this function.
		EmitXRayTable();

// If we need V4T thumb mode Register Indirect Jump pads, emit them.		// If we need V4T thumb mode Register Indirect Jump pads, emit them.
// These are created per function, rather than per TU, since it's		// These are created per function, rather than per TU, since it's
// relatively easy to exceed the thumb branch range within a TU.		// relatively easy to exceed the thumb branch range within a TU.
if (! ThumbIndirectPads.empty()) {		if (! ThumbIndirectPads.empty()) {
OutStreamer->EmitAssemblerFlag(MCAF_Code16);		OutStreamer->EmitAssemblerFlag(MCAF_Code16);
EmitAlignment(1);		EmitAlignment(1);
for (unsigned i = 0, e = ThumbIndirectPads.size(); i < e; i++) {		for (unsigned i = 0, e = ThumbIndirectPads.size(); i < e; i++) {
OutStreamer->EmitLabel(ThumbIndirectPads[i].second);		OutStreamer->EmitLabel(ThumbIndirectPads[i].second);
▲ Show 20 Lines • Show All 1,839 Lines • ▼ Show 20 Lines	EmitToStreamer(*OutStreamer, MCInstBuilder(ARM::t2LDRi12)
.addReg(ARM::PC)		.addReg(ARM::PC)
.addReg(SrcReg)		.addReg(SrcReg)
.addImm(4)		.addImm(4)
// Predicate		// Predicate
.addImm(ARMCC::AL)		.addImm(ARMCC::AL)
.addReg(0));		.addReg(0));
return;		return;
}		}
		case ARM::PATCHABLE_FUNCTION_ENTER:
		LowerPATCHABLE_FUNCTION_ENTER(*MI);
		return;
		case ARM::PATCHABLE_FUNCTION_EXIT:
		LowerPATCHABLE_FUNCTION_EXIT(*MI);
		return;
}		}

MCInst TmpInst;		MCInst TmpInst;
LowerARMMachineInstrToMCInst(MI, TmpInst, *this);		LowerARMMachineInstrToMCInst(MI, TmpInst, *this);

EmitToStreamer(*OutStreamer, TmpInst);		EmitToStreamer(*OutStreamer, TmpInst);
}		}

Show All 11 Lines

llvm/trunk/lib/Target/ARM/ARMBaseInstrInfo.h

Show First 20 Lines • Show All 94 Lines • ▼ Show 20 Lines	protected:
MachineInstr *commuteInstructionImpl(MachineInstr &MI, bool NewMI,		MachineInstr *commuteInstructionImpl(MachineInstr &MI, bool NewMI,
unsigned OpIdx1,		unsigned OpIdx1,
unsigned OpIdx2) const override;		unsigned OpIdx2) const override;

public:		public:
// Return whether the target has an explicit NOP encoding.		// Return whether the target has an explicit NOP encoding.
bool hasNOP() const;		bool hasNOP() const;

		virtual void getNoopForElfTarget(MCInst &NopInst) const {
		getNoopForMachoTarget(NopInst);
		}

// Return the non-pre/post incrementing version of 'Opc'. Return 0		// Return the non-pre/post incrementing version of 'Opc'. Return 0
// if there is not such an opcode.		// if there is not such an opcode.
virtual unsigned getUnindexedOpcode(unsigned Opc) const =0;		virtual unsigned getUnindexedOpcode(unsigned Opc) const =0;

MachineInstr *convertToThreeAddress(MachineFunction::iterator &MFI,		MachineInstr *convertToThreeAddress(MachineFunction::iterator &MFI,
MachineInstr &MI,		MachineInstr &MI,
LiveVariables *LV) const override;		LiveVariables *LV) const override;

▲ Show 20 Lines • Show All 408 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMMCInstLower.cpp

Show All 15 Lines
#include "ARMAsmPrinter.h"		#include "ARMAsmPrinter.h"
#include "MCTargetDesc/ARMBaseInfo.h"		#include "MCTargetDesc/ARMBaseInfo.h"
#include "MCTargetDesc/ARMMCExpr.h"		#include "MCTargetDesc/ARMMCExpr.h"
#include "llvm/CodeGen/MachineBasicBlock.h"		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Mangler.h"		#include "llvm/IR/Mangler.h"
#include "llvm/MC/MCExpr.h"		#include "llvm/MC/MCExpr.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
		#include "llvm/MC/MCContext.h"
		#include "llvm/MC/MCSymbolELF.h"
		#include "llvm/MC/MCSectionELF.h"
		#include "llvm/MC/MCInstBuilder.h"
		#include "llvm/MC/MCStreamer.h"
using namespace llvm;		using namespace llvm;


MCOperand ARMAsmPrinter::GetSymbolRef(const MachineOperand &MO,		MCOperand ARMAsmPrinter::GetSymbolRef(const MachineOperand &MO,
const MCSymbol *Symbol) {		const MCSymbol *Symbol) {
const MCExpr *Expr =		const MCExpr *Expr =
MCSymbolRefExpr::create(Symbol, MCSymbolRefExpr::VK_None, OutContext);		MCSymbolRefExpr::create(Symbol, MCSymbolRefExpr::VK_None, OutContext);
switch (MO.getTargetFlags() & ARMII::MO_OPTION_MASK) {		switch (MO.getTargetFlags() & ARMII::MO_OPTION_MASK) {
▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines	if (AP.lowerOperand(MO, MCOp)) {
int32_t Enc = ARM_AM::getSOImmVal(MCOp.getImm());		int32_t Enc = ARM_AM::getSOImmVal(MCOp.getImm());
if (Enc != -1)		if (Enc != -1)
MCOp.setImm(Enc);		MCOp.setImm(Enc);
}		}
OutMI.addOperand(MCOp);		OutMI.addOperand(MCOp);
}		}
}		}
}		}

		void ARMAsmPrinter::EmitSled(const MachineInstr &MI, SledKind Kind)
		{
		if (MI.getParent()->getParent()->getInfo<ARMFunctionInfo>()
		->isThumbFunction())
		{
		MI.emitError("An attempt to perform XRay instrumentation for a"
		" Thumb function (not supported). Detected when emitting a sled.");
		return;
		}
		static const int8_t NoopsInSledCount = 6;
		// We want to emit the following pattern:
		//
		// .Lxray_sled_N:
		// ALIGN
		// B #20
		// ; 6 NOP instructions (24 bytes)
		// .tmpN
		//
		// We need the 24 bytes (6 instructions) because at runtime, we'd be patching
		// over the full 28 bytes (7 instructions) with the following pattern:
		//
		// PUSH{ r0, lr }
		// MOVW r0, #<lower 16 bits of function ID>
		// MOVT r0, #<higher 16 bits of function ID>
		// MOVW ip, #<lower 16 bits of address of __xray_FunctionEntry/Exit>
		// MOVT ip, #<higher 16 bits of address of __xray_FunctionEntry/Exit>
		// BLX ip
		// POP{ r0, lr }
		//
		OutStreamer->EmitCodeAlignment(4);
		auto CurSled = OutContext.createTempSymbol("xray_sled_", true);
		OutStreamer->EmitLabel(CurSled);
		auto Target = OutContext.createTempSymbol();

		// Emit "B #20" instruction, which jumps over the next 24 bytes (because
		// register pc is 8 bytes ahead of the jump instruction by the moment CPU
		// is executing it).
		// By analogy to ARMAsmPrinter::emitPseudoExpansionLowering() |case ARM::B|.
		// It is not clear why |addReg(0)| is needed (the last operand).
		EmitToStreamer(*OutStreamer, MCInstBuilder(ARM::Bcc).addImm(20)
		.addImm(ARMCC::AL).addReg(0));

		MCInst Noop;
		Subtarget->getInstrInfo()->getNoopForElfTarget(Noop);
		for (int8_t I = 0; I < NoopsInSledCount; I++)
		{
		OutStreamer->EmitInstruction(Noop, getSubtargetInfo());
		}

		OutStreamer->EmitLabel(Target);
		recordSled(CurSled, MI, Kind);
		}

		void ARMAsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI)
		{
		EmitSled(MI, SledKind::FUNCTION_ENTER);
		}

		void ARMAsmPrinter::LowerPATCHABLE_FUNCTION_EXIT(const MachineInstr &MI)
		{
		EmitSled(MI, SledKind::FUNCTION_EXIT);
		}

		void ARMAsmPrinter::EmitXRayTable()
		{
		if (Sleds.empty())
		return;
		if (Subtarget->isTargetELF()) {
		auto *Section = OutContext.getELFSection(
		"xray_instr_map", ELF::SHT_PROGBITS,
		ELF::SHF_ALLOC | ELF::SHF_GROUP | ELF::SHF_MERGE, 0,
		CurrentFnSym->getName());
		auto PrevSection = OutStreamer->getCurrentSectionOnly();
		OutStreamer->SwitchSection(Section);
		for (const auto &Sled : Sleds) {
		OutStreamer->EmitSymbolValue(Sled.Sled, 4);
		OutStreamer->EmitSymbolValue(CurrentFnSym, 4);
		auto Kind = static_cast<uint8_t>(Sled.Kind);
		OutStreamer->EmitBytes(
		StringRef(reinterpret_cast<const char *>(&Kind), 1));
		OutStreamer->EmitBytes(
		StringRef(reinterpret_cast<const char *>(&Sled.AlwaysInstrument), 1));
		OutStreamer->EmitZeros(6);
		}
		OutStreamer->SwitchSection(PrevSection);
		}
		Sleds.clear();
		}
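
The byte counts in EmitSled and EmitXRayTable can be checked with a little arithmetic. The sketch below assumes 4-byte ARM-mode instructions, the pc-reads-8-ahead rule mentioned in the comment, and a little-endian 32-bit target for the table entries. `branchTarget`, `sledBytes` and `encodeEntry` are hypothetical helpers written for this illustration and are not part of the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// ARM-mode instructions are 4 bytes wide.
constexpr uint32_t InstrBytes = 4;
// In ARM state, pc reads as the address of the current instruction plus 8.
constexpr uint32_t PcReadAhead = 8;
// Matches NoopsInSledCount in the patch.
constexpr uint32_t NoopsInSled = 6;

// Address reached by "B #Offset" placed at BranchAddr.
constexpr uint32_t branchTarget(uint32_t BranchAddr, uint32_t Offset) {
  return BranchAddr + PcReadAhead + Offset;
}

// Whole sled: one branch plus the NOPs, i.e. the 7 instructions (28 bytes)
// that the runtime later overwrites.
constexpr uint32_t sledBytes() { return InstrBytes * (1 + NoopsInSled); }

// One table entry laid out as in EmitXRayTable: sled address, function
// address, kind, always-instrument flag, then 6 bytes of zero padding.
// Little-endian byte order is an assumption of this sketch.
std::vector<uint8_t> encodeEntry(uint32_t SledAddr, uint32_t FuncAddr,
                                 uint8_t Kind, bool AlwaysInstrument) {
  std::vector<uint8_t> Bytes;
  auto PushWord = [&Bytes](uint32_t Word) {
    for (int Shift = 0; Shift < 32; Shift += 8)
      Bytes.push_back(static_cast<uint8_t>((Word >> Shift) & 0xFF));
  };
  PushWord(SledAddr);
  PushWord(FuncAddr);
  Bytes.push_back(Kind);
  Bytes.push_back(AlwaysInstrument ? 1 : 0);
  Bytes.resize(Bytes.size() + 6, 0);
  return Bytes;
}
```

With these definitions, branchTarget(A, 20) equals A + sledBytes(), so the unpatched branch lands exactly on the first instruction after the sled, and each table entry comes to 4 + 4 + 1 + 1 + 6 = 16 bytes.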

llvm/trunk/lib/Target/ARM/ARMSubtarget.h

Show First 20 Lines • Show All 534 Lines • ▼ Show 20 Lines	bool isTargetHardFloat() const {
// FIXME: this is invalid for WindowsCE		// FIXME: this is invalid for WindowsCE
return TargetTriple.getEnvironment() == Triple::GNUEABIHF ||		return TargetTriple.getEnvironment() == Triple::GNUEABIHF ||
TargetTriple.getEnvironment() == Triple::MuslEABIHF ||		TargetTriple.getEnvironment() == Triple::MuslEABIHF ||
TargetTriple.getEnvironment() == Triple::EABIHF ||		TargetTriple.getEnvironment() == Triple::EABIHF ||
isTargetWindows() || isAAPCS16_ABI();		isTargetWindows() || isAAPCS16_ABI();
}		}
bool isTargetAndroid() const { return TargetTriple.isAndroid(); }		bool isTargetAndroid() const { return TargetTriple.isAndroid(); }

		virtual bool isXRaySupported() const override;

bool isAPCS_ABI() const;		bool isAPCS_ABI() const;
bool isAAPCS_ABI() const;		bool isAAPCS_ABI() const;
bool isAAPCS16_ABI() const;		bool isAAPCS16_ABI() const;

bool isROPI() const;		bool isROPI() const;
bool isRWPI() const;		bool isRWPI() const;

bool useSoftFloat() const { return UseSoftFloat; }		bool useSoftFloat() const { return UseSoftFloat; }
▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMSubtarget.cpp

Show First 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	: ARMGenSubtargetInfo(TT, CPU, FS), UseMulOps(UseFusedMulOps),
// we can query directly.		// we can query directly.
InstrInfo(isThumb1Only()		InstrInfo(isThumb1Only()
? (ARMBaseInstrInfo *)new Thumb1InstrInfo(*this)		? (ARMBaseInstrInfo *)new Thumb1InstrInfo(*this)
: !isThumb()		: !isThumb()
? (ARMBaseInstrInfo *)new ARMInstrInfo(*this)		? (ARMBaseInstrInfo *)new ARMInstrInfo(*this)
: (ARMBaseInstrInfo *)new Thumb2InstrInfo(*this)),		: (ARMBaseInstrInfo *)new Thumb2InstrInfo(*this)),
TLInfo(TM, *this) {}		TLInfo(TM, *this) {}

		bool ARMSubtarget::isXRaySupported() const {
		// We don't currently suppport Thumb, but Windows requires Thumb.
		return hasV6Ops() && hasARMOps() && !isTargetWindows();
		}

void ARMSubtarget::initializeEnvironment() {		void ARMSubtarget::initializeEnvironment() {
// MCAsmInfo isn't always present (e.g. in opt) so we can't initialize this		// MCAsmInfo isn't always present (e.g. in opt) so we can't initialize this
// directly from it, but we can try to make sure they're consistent when both		// directly from it, but we can try to make sure they're consistent when both
// available.		// available.
UseSjLjEH = isTargetDarwin() && !isTargetWatchABI();		UseSjLjEH = isTargetDarwin() && !isTargetWatchABI();
assert((!TM.getMCAsmInfo() ||		assert((!TM.getMCAsmInfo() ||
(TM.getMCAsmInfo()->getExceptionHandlingType() ==		(TM.getMCAsmInfo()->getExceptionHandlingType() ==
ExceptionHandling::SjLj) == UseSjLjEH) &&		ExceptionHandling::SjLj) == UseSjLjEH) &&
▲ Show 20 Lines • Show All 238 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86AsmPrinter.h

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	private:
// CurrentShadowSize counts the number of bytes encoded since the most		// CurrentShadowSize counts the number of bytes encoded since the most
// recently encountered STACKMAP, stopping when that number is greater than		// recently encountered STACKMAP, stopping when that number is greater than
// or equal to RequiredShadowSize.		// or equal to RequiredShadowSize.
unsigned RequiredShadowSize = 0, CurrentShadowSize = 0;		unsigned RequiredShadowSize = 0, CurrentShadowSize = 0;
};		};

StackMapShadowTracker SMShadowTracker;		StackMapShadowTracker SMShadowTracker;

// This describes the kind of sled we're storing in the XRay table.
enum class SledKind : uint8_t {
FUNCTION_ENTER = 0,
FUNCTION_EXIT = 1,
TAIL_CALL = 2,
};

// The table will contain these structs that point to the sled, the function
// containing the sled, and what kind of sled (and whether they should always
// be instrumented).
struct XRayFunctionEntry {
const MCSymbol *Sled;
const MCSymbol *Function;
SledKind Kind;
bool AlwaysInstrument;
const class Function *Fn;
};

- // All the sleds to be emitted.
- std::vector<XRayFunctionEntry> Sleds;

  // All instructions emitted by the X86AsmPrinter should use this helper
  // method.
  //
  // This helper function invokes the SMShadowTracker on each instruction before
  // outputting it to the OutStream. This allows the shadow tracker to minimise
  // the number of NOPs used for stackmap padding.
  void EmitAndCountInstruction(MCInst &Inst);
  void LowerSTACKMAP(const MachineInstr &MI);
[9 unchanged lines hidden; context: void LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,]
  X86MCInstLower &MCIL);
  void LowerPATCHABLE_RET(const MachineInstr &MI, X86MCInstLower &MCIL);
  void LowerPATCHABLE_TAIL_CALL(const MachineInstr &MI, X86MCInstLower &MCIL);

  // Helper function that emits the XRay sleds we've collected for a particular
  // function.
  void EmitXRayTable();

- // Helper function to record a given XRay sled.
- void recordSled(MCSymbol *Sled, const MachineInstr &MI, SledKind Kind);
  public:
  explicit X86AsmPrinter(TargetMachine &TM,
  std::unique_ptr<MCStreamer> Streamer)
  : AsmPrinter(TM, std::move(Streamer)), SM(this), FM(this) {}

  const char *getPassName() const override {
  return "X86 Assembly / Object Emitter";
  }
[35 unchanged lines hidden]

llvm/trunk/lib/Target/X86/X86MCInstLower.cpp

[1,014 unchanged lines hidden; context: void X86AsmPrinter::LowerPATCHPOINT(const MachineInstr &MI,]
  unsigned NumBytes = opers.getNumPatchBytes();
  assert(NumBytes >= EncodedBytes &&
  "Patchpoint can't request size less than the length of a call.");

  EmitNops(*OutStreamer, NumBytes - EncodedBytes, Subtarget->is64Bit(),
  getSubtargetInfo());
  }

- void X86AsmPrinter::recordSled(MCSymbol *Sled, const MachineInstr &MI,
- SledKind Kind) {
- auto Fn = MI.getParent()->getParent()->getFunction();
- auto Attr = Fn->getFnAttribute("function-instrument");
- bool AlwaysInstrument =
- Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always";
- Sleds.emplace_back(
- XRayFunctionEntry{Sled, CurrentFnSym, Kind, AlwaysInstrument, Fn});
- }

  void X86AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,
  X86MCInstLower &MCIL) {
  // We want to emit the following pattern:
  //
  // .p2align 1, ...
  // .Lxray_sled_N:
  // jmp .tmpN
  // # 9 bytes worth of noops
[699 unchanged lines hidden]
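
The `recordSled` helper that this hunk removes from `X86AsmPrinter` (the summary says the XRay support moves up to `AsmPrinter`) decides whether a sled is always instrumented by looking at the `"function-instrument"` string attribute. A minimal standalone sketch of that decision follows. `FakeFunction`, `SledRecord`, `isAlwaysInstrumented` and the `SledKind` enumerator names are hypothetical stand-ins, not LLVM's real types, and the attribute lookup is modelled with a plain `std::map`.

```cpp
#include <cassert>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-ins for the LLVM types used by recordSled; these are
// illustrative only and are not LLVM's real classes.
enum class SledKind { FunctionEnter, FunctionExit, TailCall };

struct FakeFunction {
  std::string Name;
  // String attributes such as "function-instrument"="xray-always".
  std::map<std::string, std::string> StringAttrs;
};

struct SledRecord {
  std::string SledLabel;   // e.g. "Lxray_sled_0"
  std::string FunctionSym; // symbol of the enclosing function
  SledKind Kind;
  bool AlwaysInstrument;
};

// Mirrors the check in recordSled: a sled counts as "always instrumented"
// only when the attribute is present and equals "xray-always".
bool isAlwaysInstrumented(const FakeFunction &Fn) {
  auto It = Fn.StringAttrs.find("function-instrument");
  return It != Fn.StringAttrs.end() && It->second == "xray-always";
}

// Appends one record per sled, in emission order.
void recordSled(std::vector<SledRecord> &Sleds, const std::string &Label,
                const FakeFunction &Fn, SledKind Kind) {
  assert(!Label.empty() && "a sled needs a label so it can be found later");
  Sleds.push_back({Label, Fn.Name, Kind, isAlwaysInstrumented(Fn)});
}
```

In this model a function carrying `"function-instrument"="xray-always"` yields records with `AlwaysInstrument` set, while a function without the attribute yields records with it cleared.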

llvm/trunk/lib/Target/X86/X86Subtarget.h

[454 unchanged lines hidden; context: public:]
  bool hasPFI() const { return HasPFI; }
  bool hasERI() const { return HasERI; }
  bool hasDQI() const { return HasDQI; }
  bool hasBWI() const { return HasBWI; }
  bool hasVLX() const { return HasVLX; }
  bool hasPKU() const { return HasPKU; }
  bool hasMPX() const { return HasMPX; }

+ virtual bool isXRaySupported() const override { return is64Bit(); }

  bool isAtom() const { return X86ProcFamily == IntelAtom; }
  bool isSLM() const { return X86ProcFamily == IntelSLM; }
  bool useSoftFloat() const { return UseSoftFloat; }

  /// Use mfence if we have SSE2 or we're on x86-64 (even if we asked for
  /// no-sse2). There isn't any reason to disable it if the target processor
  /// supports it.
  bool hasMFence() const { return hasSSE2() || is64Bit(); }
[139 unchanged lines hidden]
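
The added `isXRaySupported()` override suggests a per-subtarget hook that target-independent code can query before inserting sleds. The following is a rough sketch of that pattern under stated assumptions: `SubtargetModel`, `X86SubtargetModel` and `shouldInsertSleds` are hypothetical names, and the base-class default of `false` is an assumption, not something shown in this hunk.

```cpp
#include <cassert>

// Hypothetical model of a subtarget hierarchy; not LLVM's real classes.
struct SubtargetModel {
  virtual ~SubtargetModel() = default;
  // Assumed default: a target has to opt in to XRay explicitly.
  virtual bool isXRaySupported() const { return false; }
};

struct X86SubtargetModel : SubtargetModel {
  explicit X86SubtargetModel(bool Is64Bit) : In64BitMode(Is64Bit) {}
  bool is64Bit() const { return In64BitMode; }
  // Same condition as the override in the hunk above: 64-bit only.
  bool isXRaySupported() const override { return is64Bit(); }

private:
  bool In64BitMode;
};

// A target-independent caller can ask the question without knowing the target.
bool shouldInsertSleds(const SubtargetModel &ST, bool FunctionRequestsXRay) {
  return FunctionRequestsXRay && ST.isXRaySupported();
}
```

With this shape, adding another target means overriding one method and leaves the caller unchanged.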

llvm/trunk/test/CodeGen/ARM/xray-armv6-attribute-instrumentation.ll

+ ; RUN: llc -filetype=asm -o - -mtriple=armv6-unknown-linux-gnu < %s | FileCheck %s
+
+ define i32 @foo() nounwind noinline uwtable "function-instrument"="xray-always" {
+ ; CHECK-LABEL: Lxray_sled_0:
+ ; CHECK-NEXT: b #20
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-LABEL: Ltmp0:
+ ret i32 0
+ ; CHECK-LABEL: Lxray_sled_1:
+ ; CHECK-NEXT: b #20
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-NEXT: mov r0, r0
+ ; CHECK-LABEL: Ltmp1:
+ ; CHECK-NEXT: bx lr
+ }
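
The `b #20` in these checks is followed by six 4-byte no-ops (24 bytes), which can look like an off-by-four. A short arithmetic sketch, assuming the usual ARM-state rule that the PC reads as the address of the current instruction plus 8, shows that the branch lands exactly on the label after the sled. The constant and function names are hypothetical, and `0x1000` is an arbitrary example address.

```cpp
#include <cassert>
#include <cstdint>

// Every ARM-mode (non-Thumb) instruction is 4 bytes long.
constexpr std::uint32_t kArmInsnBytes = 4;
// Assumption: in ARM state the PC reads as the current instruction + 8.
constexpr std::uint32_t kArmPcReadAhead = 8;

// Address that `b #Imm` transfers control to when the branch sits at BranchAddr.
constexpr std::uint32_t branchTarget(std::uint32_t BranchAddr, std::int32_t Imm) {
  return BranchAddr + kArmPcReadAhead + static_cast<std::uint32_t>(Imm);
}

// Address of the first instruction after a sled made of one branch
// followed by NumNops no-op instructions.
constexpr std::uint32_t sledEnd(std::uint32_t SledAddr, std::uint32_t NumNops) {
  return SledAddr + (1 + NumNops) * kArmInsnBytes;
}

// 0x1000 + 8 + 20 equals 0x1000 + 7 * 4, so the branch skips all six no-ops.
static_assert(branchTarget(0x1000, 20) == sledEnd(0x1000, 6),
              "b #20 skips exactly six no-ops");
```

The whole sled therefore occupies seven instruction slots (28 bytes), which is the region left available for later patching.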

llvm/trunk/test/CodeGen/ARM/xray-armv7-attribute-instrumentation.ll

+ ; RUN: llc -filetype=asm -o - -mtriple=armv7-unknown-linux-gnu < %s | FileCheck %s
+
+ define i32 @foo() nounwind noinline uwtable "function-instrument"="xray-always" {
+ ; CHECK-LABEL: Lxray_sled_0:
+ ; CHECK-NEXT: b #20
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-LABEL: Ltmp0:
+ ret i32 0
+ ; CHECK-LABEL: Lxray_sled_1:
+ ; CHECK-NEXT: b #20
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-NEXT: nop
+ ; CHECK-LABEL: Ltmp1:
+ ; CHECK-NEXT: bx lr
+ }
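
The two tests differ only in the padding instruction: `mov r0, r0` on armv6 and `nop` on armv7, presumably because the plain ARMv6 target lacks the `nop` hint instruction. The sketch below builds the 7-word sled image for either case. The encodings are the standard ARM-mode ones as I understand them (condition AL). `buildSled` and the `HasNopHint` flag are hypothetical names, not the functions this patch adds.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>

// ARM-mode encodings with condition AL (assumed from the ARM encoding tables).
constexpr std::uint32_t kBranchOver6 = 0xEA000005; // b #20 (imm24 = 20 / 4)
constexpr std::uint32_t kMovR0R0 = 0xE1A00000;     // mov r0, r0
constexpr std::uint32_t kNopHint = 0xE320F000;     // nop

constexpr std::size_t kSledWords = 7; // one branch plus six padding words

// Builds the sled image: a branch over the padding, then six no-op words.
// HasNopHint is a hypothetical flag for "this target has the nop hint".
std::array<std::uint32_t, kSledWords> buildSled(bool HasNopHint) {
  std::array<std::uint32_t, kSledWords> Sled{};
  Sled.fill(HasNopHint ? kNopHint : kMovR0R0);
  Sled[0] = kBranchOver6;
  assert(Sled.size() * sizeof(std::uint32_t) == 28 && "sled must be 28 bytes");
  return Sled;
}
```

Both variants have the same size and the same leading branch, so whatever patches the sled at run time need not care which padding instruction was emitted.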

Diff 71763
