This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
include/llvm/
-
llvm/
-
CodeGen/
-
Passes.h
-
InitializePasses.h
-
MC/
-
MCELFStreamer.h
-
MCObjectFileInfo.h
-
MCStreamer.h
-
Target/
1/1
Target.td
5/5
TargetInstrInfo.h
1/2
TargetOpcodes.def
-
lib/
-
CodeGen/
-
CMakeLists.txt
-
CodeGen.cpp
-
TargetPassConfig.cpp
8/8
XRayInstrumentation.cpp
-
MC/
-
MCELFStreamer.cpp
-
MCStreamer.cpp
-
Target/X86/
-
X86/
-
X86AsmPrinter.h
-
X86AsmPrinter.cpp
-
X86FrameLowering.cpp
-
X86InstrInfo.cpp
10/12
X86MCInstLower.cpp
-
test/
-
CodeGen/X86/
-
X86/
-
xray-attribute-instrumentation.ll
-
xray-selective-instrumentation-miss.ll
-
xray-selective-instrumentation.ll
-
TableGen/
-
trydecode-emission.td
-
trydecode-emission2.td
-
trydecode-emission3.td
-
utils/TableGen/
-
TableGen/
-
InstrInfoEmitter.cpp

Differential D19904

XRay: Add entry and exit sleds
ClosedPublic

Authored by dberris on May 4 2016, 12:01 AM.

Download Raw Diff

Details

Reviewers

kcc
eugenis
pcc
echristo
sanjoy
rnk

Commits

rG52735fc435b1: XRay: Add entry and exit sleds
rL275367: XRay: Add entry and exit sleds

Summary

In this patch we implement the following parts of XRay:

Supporting a function attribute named 'function-instrument' which currently only supports 'xray-always'. We should be able to use this attribute for other instrumentation approaches.
Supporting a function attribute named 'xray-instruction-threshold' used to determine whether a function is instrumented with a minimum number of instructions (IR instruction counts).
X86-specific nop sleds as described in the white paper.
A machine function pass that adds the different instrumentation marker instructions at a very late stage.
A way of identifying which return opcode is considered "normal" for each architecture.

There are some caveats here:

We don't handle PATCHABLE_RET in platforms other than x86_64 yet -- this means if IR used PATCHABLE_RET directly instead of a normal ret, instruction lowering for that platform might do the wrong thing. We think this should be handled at instruction selection time to by default be unpacked for platforms where XRay is not availble yet.

The generated section for X86 is different from what is described from the white paper for the sole reason that LLVM allows us to do this neatly. We're taking the opportunity to deviate from the white paper from this perspective to allow us to get richer information from the runtime library.

Diff Detail

Event Timeline

dberris updated this revision to Diff 56097.May 4 2016, 12:01 AM

dberris retitled this revision from to XRay: Add entry and exit sleds.

dberris updated this object.

dberris added reviewers: echristo, sanjoy.

dberris set the repository for this revision to rL LLVM.

dberris added a subscriber: llvm-commits.

Herald added a subscriber: mehdi_amini. · View Herald TranscriptMay 4 2016, 12:01 AM

BTW, It's worth mentioning that this is a prototype implementation and we're fine with making large changes for this patch.

mcrosier added a subscriber: mcrosier.May 4 2016, 6:09 AM

mcrosier added a subscriber: bmakam.May 4 2016, 6:09 AM

emaste added a subscriber: emaste.May 4 2016, 2:16 PM

Some minor comments inline. I'm not familiar with all of the intricacies of the backend, so I'll CC some other people who are likely to have more perspective on this.

include/llvm/Target/Target.td
964	Looks like this isn't used yet? If so, I'd suggest dropping it completely for now.
lib/CodeGen/XRayInstrumentation.cpp
81	Why do you care only about register operands here?
lib/Target/X86/X86MCInstLower.cpp
1056	Why not AlwaysInstrument = !Attr.hasAttribute(Attribute::None) && Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always"; ?
1057	I don't quite grok the Attributes API, but I think the `!Attr.hasAttribute(Attribute::None)` bit is redundant -- if `isStringAttribute` return `true`, then `Attr.hasAttribute(Attribute::None)` will be false.
1060	emplace_back ?
1088	Indent is off.

sanjoy edited edge metadata.May 5 2016, 3:40 PM

sanjoy added subscribers: rnk, atrick, majnemer.

majnemer added inline comments.May 5 2016, 4:27 PM

lib/CodeGen/XRayInstrumentation.cpp
73–78	I don't think you want to replace whacky constructs like `EH_RETURN` and `CLEANUPRET` with `PATCHABLE_RET`, do you? Just "normal" returns, right?

Address comments from sanjoy and majnemer

lib/CodeGen/XRayInstrumentation.cpp
73–78	Yes, that's right -- does this version exclude those appropriately?
81	Oops, I had assumed that the RET instructions only ever had register operands. That's a faulty assumption, good catch.

majnemer added inline comments.May 6 2016, 10:37 AM

lib/CodeGen/XRayInstrumentation.cpp
72–77	I would recommend white-listing the opcodes you know are safe to replace instead of blacklisting the ones you know which are unsafe. This would make the pass conservatively correct in the face of future changes to LLVM.

dberris added inline comments.May 8 2016, 9:19 PM

lib/CodeGen/XRayInstrumentation.cpp
72–77	That makes sense, thanks majnemer -- however it's not clear to me how I'd refer to the target-specific opcodes from here. Pardon the newbie question, but how do I say "I just want return instructions here"? The only options I can see are: Look at the original LLVM IR to see if it's a ReturnInst. Figure out a way for including the X86 opcodes here. Maybe I'm missing something simpler here?

Update operand unpacking properly.

clang-format

majnemer added inline comments.May 8 2016, 10:25 PM

lib/CodeGen/XRayInstrumentation.cpp
72–76	The typical way of doing this sort of thing, AFAICT, is to teach `TargetInstrInfo` about it. For example, it implements `getCatchReturnOpcode` and `getCallFrameSetupOpcode`.

Add a check in TargetInstrInfo for whether a return is a "normal" return.

We also implement this somewhat correctly in X86, but use the default
implementation for other platforms.

dberris marked an inline comment as done.May 8 2016, 11:06 PM

dberris added inline comments.

lib/CodeGen/XRayInstrumentation.cpp
72–74	Thanks! I've updated the patch to do this. PTAL?

Fix the uploaded diff.

majnemer added inline comments.May 9 2016, 7:45 AM

include/llvm/Target/TargetInstrInfo.h
158–169	I was thinking more of a whilelist-oriented solution, something like: `unsigned getNormalReturnOpcode() const { return NormalRetOpcode; }`

bmakam added inline comments.May 9 2016, 9:56 AM

lib/Target/X86/X86MCInstLower.cpp
1070	Could you please expand on why you need 9 bytes of noops here? I am not quite familiar with x86_64 but was under the impression that on x86_64 the jmp instruction is 1 byte for opcode and 4 bytes for signed relative displacement, so shouldn't 5 bytes worth of nops be sufficient?

dberris added inline comments.May 9 2016, 7:10 PM

include/llvm/Target/TargetInstrInfo.h
158–169	But how does this work on platforms that can spell 'return' many different normal ways (like in X86)? There's RETL, RETQ, and all other versions of RET which are considered "normal"?
lib/Target/X86/X86MCInstLower.cpp
1070	Good question, thanks. I have to check whether we're using the right version of JMP, but I'm specifically looking for the version that's one byte for the JMP instruction, and 8 bits (1 byte) for the relative offset. So far I haven't been able to spell `jmp +0x09` correctly and have it work, without having an additional symbol as a target for the jump instruction. If we get that right, we can then add the 9 byte nops we need to get exactly 11 bytes for the function entry. Is there a fool-proof way of spelling "JMP +0x09" with the builder interface? Or should I add another JMP instruction in X86 that supports the 8-bit displacement immediate operand? The reason why I can't use a JMP that isn't 2 bytes, is because it's really hard to write just 5 bytes atomically. I can probably do something with an 8-byte atomic write, but enforcing that 8-byte write doesn't span cache lines is also very tricky to make safe.

dberris added inline comments.May 9 2016, 9:49 PM

lib/Target/X86/X86MCInstLower.cpp

1070

Actually now that I've had a look at the generated object file, I can confirm that we're using the two-byte version of JMP with this construct. Quick test:

test.cc:

#include <cstdio>

[[clang::xray_always_instrument]] void foo() { std::printf("Hello, XRay!\n"); }

int main(int argc, char* argv[]) { foo(); }

Compiled with (modified clang to emit IR that has annotated functions for XRay):

./bin/clang -fxray-instrument -fxray-instruction-threshold=1 -std=c++11 -x c++ -S test.cc -emit-llvm

Creates this IR listing:

; ModuleID = 'test.cc'
source_filename = "test.cc"
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

@.str = private unnamed_addr constant [14 x i8] c"Hello, XRay!\0A\00", align 1

; Function Attrs: uwtable
define void @_Z3foov() #0 {
entry:
  %call = call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([14 x i8], [14 x i8]* @.str, i32 0, i32 0))
  ret void
}

declare i32 @printf(i8*, ...) #1

; Function Attrs: norecurse uwtable
define i32 @main(i32 %argc, i8** %argv) #2 {
entry:
  %argc.addr = alloca i32, align 4
  %argv.addr = alloca i8**, align 8
  store i32 %argc, i32* %argc.addr, align 4
  store i8** %argv, i8*** %argv.addr, align 8
  call void @_Z3foov()
  ret i32 0
}

attributes #0 = { uwtable "disable-tail-calls"="false" "function-instrument"="xray-always" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #2 = { norecurse uwtable "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" "xray-instruction-threshold"="1" }

!llvm.ident = !{!0}

!0 = !{!"clang version 3.9.0 (http://llvm.org/git/clang.git 2b9ed9227330789f24e56fe9e800e7be2111073b) (http://llvm.org/git/llvm.git 1f3e353113ff9e2f835955bf005a8a5e25f16ad1)"}

Then compiled+disassembled this way:

./bin/llc -filetype=obj -o - < test.ll | ./bin/llvm-objdump -disassemble -

Produces the following output:

<stdin>:        file format ELF64-x86-64

Disassembly of section .text:
_Z3foov:
       0:       eb 09   jmp     9 <_Z3foov+0xB>
       2:       66 0f 1f 84 00 00 02 00 00      nopw    512(%rax,%rax)
       b:       55      pushq   %rbp
       c:       48 89 e5        movq    %rsp, %rbp
       f:       bf 00 00 00 00  movl    $0, %edi
      14:       31 c0   xorl    %eax, %eax
      16:       e8 00 00 00 00  callq   0 <_Z3foov+0x1B>
      1b:       5d      popq    %rbp
      1c:       c3      retq
      1d:       2e 66 0f 1f 84 00 00 02 00 00   nopw    %cs:512(%rax,%rax)
      27:       66 0f 1f 84 00 00 00 00 00      nopw    (%rax,%rax)

main:
      30:       eb 09   jmp     9 <main+0xB>
      32:       66 0f 1f 84 00 00 02 00 00      nopw    512(%rax,%rax)
      3b:       55      pushq   %rbp
      3c:       48 89 e5        movq    %rsp, %rbp
      3f:       48 83 ec 10     subq    $16, %rsp
      43:       89 7d fc        movl    %edi, -4(%rbp)
      46:       48 89 75 f0     movq    %rsi, -16(%rbp)
      4a:       e8 b1 ff ff ff  callq   -79 <_Z3foov>
      4f:       31 c0   xorl    %eax, %eax
      51:       48 83 c4 10     addq    $16, %rsp
      55:       5d      popq    %rbp
      56:       c3      retq
      57:       2e 66 0f 1f 84 00 00 02 00 00   nopw    %cs:512(%rax,%rax)

Add documentation on why we need 9 bytes of nops

dberris marked an inline comment as done.May 10 2016, 9:02 PM

dberris added inline comments.

lib/Target/X86/X86MCInstLower.cpp
1070	Updated the comments now to make it clear and specific. Thanks!

majnemer added inline comments.May 10 2016, 9:03 PM

include/llvm/Target/TargetInstrInfo.h
158–169	The same way it is handled for `X86::ADJCALLSTACKDOWN64` vs `X86::ADJCALLSTACKDOWN32`: https://github.com/llvm-mirror/llvm/blob/master/lib/Target/X86/X86InstrInfo.cpp#L105

dberris added inline comments.May 10 2016, 9:08 PM

include/llvm/Target/TargetInstrInfo.h
158–169	That makes a lot of sense, thanks! I'll rework this a bit to make it more of a targeted specific OpCode. I suppose it's fine to have multiples of these functions, each for relevant flavours of fairly common instruction classes, yes? So I suppose, for `tail call` exits/returns, we would have something similar for that?

Make ReturnOpcode an argument to TargetInstrInfo constructor and whitelist one ReturnOpcode instead

dberris marked an inline comment as done.May 10 2016, 10:15 PM

dberris added inline comments.

include/llvm/Target/TargetInstrInfo.h
158–169	Updated now to white-list a specific return OpCode.

dberris updated this object.May 12 2016, 2:27 AM

dberris added reviewers: rnk, eugenis, kcc, pcc.

Ping -- is there something else missing for this change?

Marked more inline comments as "Done".

Drive by comment.

include/llvm/Target/TargetOpcodes.def
145	Remove the XRay specific stuff here and rewrite in a more generic fashion (and below).

Update documentation to remove XRay references

dberris marked an inline comment as done.May 17 2016, 7:50 PM

dberris added inline comments.

include/llvm/Target/TargetOpcodes.def
145	Done

dberris added a child revision: D20352: Add XRay flags to Clang.May 17 2016, 10:34 PM

Ping?

Sync to master again.

Fixes after merge
Emit a common symbol for the instrumentation map
Improve table creation and allow access to the table via xray_instr_map and xray_instr_map_end
Fix the sled type encoding

niravd added a subscriber: niravd.Jun 22 2016, 6:17 AM

dberris added a child revision: D21612: [compiler-rt] [XRay] Basic initialization and flag definition for XRay runtime.Jun 22 2016, 10:29 AM

Use an explicit byte sequence for the jmp

dberris added a parent revision: D21743: Convenience function for MCContext to get named sections when lowering..Jun 26 2016, 8:58 PM

Make XRay write out tables per function

Clean up local history, squashing to a single revision (rebased too)

Rebase
Add ELF::SHF_MERGE to flags for XRay section
Update tests to reflect adjustments in implementation

Un-break XRay section creation.

dberris mentioned this in D20352: Add XRay flags to Clang.Jul 5 2016, 10:36 PM

lgtm

lib/Target/X86/X86MCInstLower.cpp
1077	Does XRay support multiple DSOs in the process? If so, the XRay runtime may be more than 2GB away from the code being patched, and this offset will overflow.
1086–1087	I guess we can deal with this separately. IMO we should have a 'jmpb' instruction or something that forces a short jump or assembler error if the displacement is too large.

This revision is now accepted and ready to land.Jul 12 2016, 9:18 AM

Thanks Reid -- do you or echristo@ mind landing this for me? I don't think I have commit powers yet.

lib/Target/X86/X86MCInstLower.cpp
1077	That's true -- the runtime patching code will fail to patch certain sleds that are farther than 32-bits away. In case we need more, in the future we can tweak this so that we can try to force the XRay runtime trampoline in the first 2GB of the process' memory and use an "absolute" address (which may need more bytes in the sled), or have a mode to support larger binaries (a flag that changes the emitted sleds, and have the runtime support both kinds of sleds). I suspect if it comes up in practice that we can deal with this more effectively.
1086–1087	I'm happy to do this as a refactoring or a follow-up change. A new instruction sounds like the right action to take anyway.

Rebase

Closed by commit rL275367: XRay: Add entry and exit sleds (authored by dberris). · Explain WhyJul 13 2016, 9:13 PM

This revision was automatically updated to reflect the committed changes.

dberris mentioned this in D21612: [compiler-rt] [XRay] Basic initialization and flag definition for XRay runtime.Aug 2 2016, 4:56 PM

rSerge added a child revision: D23931: [XRay] ARM 32-bit no-Thumb support in LLVM.Aug 26 2016, 9:59 AM

Revision Contents

Path

Size

include/

llvm/

CodeGen/

Passes.h

4 lines

InitializePasses.h

1 line

MC/

MCELFStreamer.h

3 lines

MCObjectFileInfo.h

3 lines

MCStreamer.h

6 lines

Target/

Target.td

15 lines

TargetInstrInfo.h

7 lines

TargetOpcodes.def

13 lines

lib/

CodeGen/

CMakeLists.txt

1 line

CodeGen.cpp

1 line

TargetPassConfig.cpp

1 line

XRayInstrumentation.cpp

96 lines

MC/

MCELFStreamer.cpp

8 lines

MCStreamer.cpp

5 lines

Target/

X86/

41 lines

3 lines

1 line

3 lines

100 lines

test/

CodeGen/

X86/

xray-attribute-instrumentation.ll

13 lines

xray-selective-instrumentation-miss.ll

9 lines

xray-selective-instrumentation.ll

9 lines

TableGen/

trydecode-emission.td

4 lines

trydecode-emission2.td

4 lines

trydecode-emission3.td

4 lines

utils/

TableGen/

InstrInfoEmitter.cpp

6 lines

Commit	Tree	Parents	Author	Summary	Date
41cf930d1d32	783bc1f67ffd	8c4cfcd1166c	Dean Michael Berris	Make XRay write out tables per function	Jun 26 2016, 8:51 PM
8c4cfcd1166c	2ed135f4f1bd	ca341a880376	Dean Michael Berris	Convenience function for MCStreamer to get named sections when lowering. (Show More…)	Jun 26 2016, 8:47 PM
ca341a880376	111c9cd1cd8a	8fd5dd6aa8a6	Dean Michael Berris	Use an explicit byte sequence for the jmp	Jun 22 2016, 4:29 PM
8fd5dd6aa8a6	fb3621d1494b	976ca87ed5af	Dean Michael Berris	Fix the sled type encoding	Jun 21 2016, 7:48 PM
976ca87ed5af	9e1945bb41ea	5891768e6fcb	Dean Michael Berris	Improve table creation and allow access to the table via __xray_instr_map and… (Show More…)	Jun 21 2016, 11:46 AM
5891768e6fcb	3e22333b21ad	563aca1c16a4	Dean Michael Berris	Emit a common symbol for the instrumentation map	Jun 16 2016, 11:42 PM
563aca1c16a4	2dda013634b0	0d10245e7f6f	Dean Michael Berris	Fixes after merge	Jun 15 2016, 2:50 AM
0d10245e7f6f	d8515cf74c8e	e7a9c9699b2a 84645e054979	Dean Michael Berris	Merge branch 'master' of http://llvm.org/git/llvm into xray-no-intrinsic	Jun 13 2016, 11:09 PM
e7a9c9699b2a	db5600e439ae	3e218add0742	Dean Michael Berris	Set entry size and first symbol	Jun 5 2016, 9:56 PM
3e218add0742	bae984a2c923	09243f1bbc96	Dean Michael Berris	Update comment on the instrumentation map structure	May 30 2016, 9:21 PM
09243f1bbc96	154ef911c601	61a6db48ebec	Dean Michael Berris	Add symbol for __xray_instr_map in ELF	May 30 2016, 9:16 PM
61a6db48ebec	1447fe69d802	748f1420f5f5 acc7ed7b83f2	Dean Michael Berris	Merge branch 'master' of http://llvm.org/git/llvm into xray-no-intrinsic	May 30 2016, 8:54 PM
748f1420f5f5	2db1941dd4bf	95f605f6abbe f673dcd8f184	Dean Michael Berris	Merge branch 'master' into xray-no-intrinsic	May 17 2016, 9:14 PM
95f605f6abbe	8ef75f9855b2	1bd4d89206bf	Dean Michael Berris	Update documentation to remove XRay references	May 17 2016, 7:49 PM
1bd4d89206bf	b90d3f5973c5	be09ecd477c4	Dean Michael Berris	Make ReturnOpcode an argument to TargetInstrInfo constructor and whitelist one… (Show More…)	May 10 2016, 10:09 PM
be09ecd477c4	19a439cf1036	1f3e353113ff	Dean Michael Berris	Add documentation on why we need 9 bytes of nops	May 10 2016, 8:52 PM
1f3e353113ff	17bb95e1fbb8	9fbe26b73667	Dean Michael Berris	Add a test in TargetInstrInfo for normal return instructions	May 8 2016, 11:00 PM
9fbe26b73667	320dcfb06c24	cc3f04897ee9	Dean Michael Berris	clang-format	May 8 2016, 9:49 PM
cc3f04897ee9	8ca434587001	68d0fee9276a	Dean Michael Berris	Fix to unpack operands properly	May 8 2016, 9:36 PM
68d0fee9276a	d8f390b2ca01	5c7e000a2115	Dean Michael Berris	Address comments in public review (Show More…)	May 5 2016, 10:13 PM
5c7e000a2115	bf22c9a8cdb3	55126f7752db	Dean Michael Berris	Remove unnecessary changes	May 3 2016, 11:37 PM
55126f7752db	0292e1972d3b	e22a3cd7c362	Dean Michael Berris	Preserve only required changes	May 3 2016, 11:32 PM
e22a3cd7c362	b02b553c0f0c	53c03bdaed7f	Dean Michael Berris	Undo some unnecessary changes	May 3 2016, 11:23 PM
53c03bdaed7f	9783cbe4a9b3	9c84e5fe216b	Dean Michael Berris	Undo some unnecessary changes	May 3 2016, 11:20 PM
9c84e5fe216b	0c38fd96a67a	abf166ae5d74	Dean Michael Berris	Fix tab	May 3 2016, 11:14 PM
abf166ae5d74	a408e4b8ce7c	6da063068708	Dean Michael Berris	Undo unnecessary changes to AsmPrinter.cpp	May 3 2016, 11:13 PM
6da063068708	1b6327388880	82e21eea388a	Dean Michael Berris	Fix numbering	May 3 2016, 11:09 PM
82e21eea388a	a4d67cb90a8f	23f27f8fdc67	Dean Michael Berris	Fix tabs	May 3 2016, 11:08 PM
23f27f8fdc67	933d029b0a9e	e8484be22b23	Dean Michael Berris	Fix comment	May 3 2016, 11:07 PM
e8484be22b23	26e7d2d8443b	9a8aa6a3183c	Dean Michael Berris	Undo unnecessary changes to AsmPrinter.h	May 3 2016, 11:04 PM
9a8aa6a3183c	dfa0b1faf3b8	f1a4b1ec29d3	Dean Michael Berris	Simplify, based on echristo@ comments	May 3 2016, 9:46 PM
f1a4b1ec29d3	b9c9741f13cd	d8014916986a	Dean Michael Berris	Update attribute test to look for specific asm	May 3 2016, 2:52 AM
d8014916986a	0ed627164cfb	4bde4968eada	Dean Michael Berris	Rename always_instrument to function-instrument	May 3 2016, 2:04 AM
4bde4968eada	d0da45a4daba	903646a32ac9	Dean Michael Berris	Sanitize section names	May 3 2016, 1:58 AM
903646a32ac9	71a720823829	d9bdb4e198ea	Dean Michael Berris	Rename PATCHABLE_FUNCTION_EXIT to PATCHABLE_RET	May 3 2016, 1:48 AM
d9bdb4e198ea	5712acc56ee3	7138bce13008	Dean Michael Berris	Rename XRAY_TAIL to PATCHABLE_TAILg	May 3 2016, 1:41 AM
7138bce13008	8f14bcba9b37	1958d020dbfe	Dean Michael Berris	Rename XRAY_FUNCTION_* to PATCHABLE_FUNCTION_*	May 3 2016, 1:28 AM
1958d020dbfe	54cfc0a2a55b	f37f639de941 9de339f6ec5a	Dean Michael Berris	Merge branch 'master' into xray-no-intrinsic	May 3 2016, 1:14 AM
f37f639de941	8bd0059199ef	2450a8ff48c7	Dean Michael Berris	Fixup for preparation to do renames	May 3 2016, 1:13 AM
2450a8ff48c7	fc958db7fbec	2b575801cf53	Dean Michael Berris	Merge fixes	May 2 2016, 11:13 PM
2b575801cf53	a8092e767f73	5424eec283e5	Dean Michael Berris	Merge fixes	May 2 2016, 11:07 PM
5424eec283e5	d9ba72d4b9f4	16d86921aabb	Dean Michael Berris	Fix merge artefacts	May 2 2016, 10:52 PM
16d86921aabb	0f2c32661f8d	68b77f072da1	Dean Michael Berris	Fix X86ISelLowering.cpp from master	May 2 2016, 10:50 PM
68b77f072da1	3b57dc11b82e	d14554167f05 2d54216d9bfe	Dean Michael Berris	Merge branch 'master' into xray-no-intrinsic	May 2 2016, 10:48 PM
d14554167f05	1cac69ab4425	b9c4c382140d	Dean Michael Berris	Refactor XRay instrumentation into a MachinePass (Show More…)	Apr 25 2016, 11:07 PM
b9c4c382140d	fd72ecb37142	9a8a5f2b1ce7	Dean Michael Berris	Decide on XRay-instrumentation when emitting RET	Apr 21 2016, 2:55 AM
9a8a5f2b1ce7	37623caa9aa6	1bd66bea561c	Dean Michael Berris	Support lowering XRAY_FUNCTION_EXIT as if a return (Show More…)	Apr 21 2016, 2:35 AM
1bd66bea561c	2476bc735557	28d6406f6a22	Dean Michael Berris	Remove flags requirement for XRay enabling (Show More…)	Apr 14 2016, 12:03 AM
28d6406f6a22	799910391f8b	15fb58b8a7ed	Dean Berris	Revert some unnecessary XRay-related changes (Show More…)	Apr 12 2016, 5:47 AM
15fb58b8a7ed	d82c3fb21399	4e83ac214bc7	Dean Michael Berris	Heuristics for Instrumentation and ASM Cleanup (Show More…)	Apr 12 2016, 2:43 AM
4e83ac214bc7	88d73ef6f9d7	f10abbab88a7	Dean Michael Berris	Fix remaining failing test after mis-merge on rebase	Apr 10 2016, 11:49 PM
f10abbab88a7	f033388042b3	b15c35e3f725	Dean Michael Berris	Fix mis-initialisation of the XRay pointer	Apr 10 2016, 11:36 PM
b15c35e3f725	57673c15e914	bd4d943a253c	Dean Michael Berris	Generate XRay sections for MachO and ELF (Show More…)	Apr 7 2016, 5:44 AM
bd4d943a253c	2e1fb5627346	1aeb7298fe07	Dean Michael Berris	Fix the tests	Mar 28 2016, 5:51 PM
1aeb7298fe07	d64c89e2c13c	172a7d70dd5e	Dean Berris	Add a section just for the XRay instrumentation map	Mar 22 2016, 10:34 PM
172a7d70dd5e	f52854086eea	cca07bcfab9a	Dean Berris	Mark the XRay blocks with pseudo instructions	Mar 22 2016, 10:32 PM
cca07bcfab9a	fcb26fd74a14	1d6435fde8d1	Dean Berris	Add pseudo instructions for xray markers	Mar 22 2016, 10:30 PM
1d6435fde8d1	b61ef002fc99	6c9ac8fe17be	Dean Berris	Make XRay emission conditional on an attribute	Mar 20 2016, 7:10 PM
6c9ac8fe17be	12d28af55ac0	4ae81f103377	Dean Berris	Placeholder for future work	Mar 16 2016, 1:09 AM
4ae81f103377	486090231750	53855fe69f75	Dean Berris	Add nop sleds when emitting Prologue	Mar 10 2016, 11:30 PM
53855fe69f75	39175bad4b8a	315ba27c27f9	Dean Berris	Add a basic block with jump+nops just before emitting ret	Mar 1 2016, 8:10 PM
315ba27c27f9	24deaad49ac8	0a4c40265be0	Dean Berris	Attempt at just having changes in lowering	Feb 29 2016, 8:18 PM

Diff 61929

include/llvm/CodeGen/Passes.h

Show First 20 Lines • Show All 257 Lines • ▼ Show 20 Lines	/// MachineDominanaceFrontier - This pass is a machine dominators analysis pass.
extern char &OptimizePHIsID;		extern char &OptimizePHIsID;

/// StackSlotColoring - This pass performs stack slot coloring.		/// StackSlotColoring - This pass performs stack slot coloring.
extern char &StackSlotColoringID;		extern char &StackSlotColoringID;

/// \brief This pass lays out funclets contiguously.		/// \brief This pass lays out funclets contiguously.
extern char &FuncletLayoutID;		extern char &FuncletLayoutID;

		/// This pass inserts the XRay instrumentation sleds if they are supported by
		/// the target platform.
		extern char &XRayInstrumentationID;

/// \brief This pass implements the "patchable-function" attribute.		/// \brief This pass implements the "patchable-function" attribute.
extern char &PatchableFunctionID;		extern char &PatchableFunctionID;

/// createStackProtectorPass - This pass adds stack protectors to functions.		/// createStackProtectorPass - This pass adds stack protectors to functions.
///		///
FunctionPass createStackProtectorPass(const TargetMachine TM);		FunctionPass createStackProtectorPass(const TargetMachine TM);

/// createMachineVerifierPass - This pass verifies cenerated machine code		/// createMachineVerifierPass - This pass verifies cenerated machine code
▲ Show 20 Lines • Show All 131 Lines • Show Last 20 Lines

include/llvm/InitializePasses.h

	Show First 20 Lines • Show All 327 Lines • ▼ Show 20 Lines
	void initializeUnreachableBlockElimPass(PassRegistry&);			void initializeUnreachableBlockElimPass(PassRegistry&);
	void initializeUnreachableMachineBlockElimPass(PassRegistry&);			void initializeUnreachableMachineBlockElimPass(PassRegistry&);
	void initializeVerifierLegacyPassPass(PassRegistry&);			void initializeVerifierLegacyPassPass(PassRegistry&);
	void initializeVirtRegMapPass(PassRegistry&);			void initializeVirtRegMapPass(PassRegistry&);
	void initializeVirtRegRewriterPass(PassRegistry&);			void initializeVirtRegRewriterPass(PassRegistry&);
	void initializeWholeProgramDevirtPass(PassRegistry &);			void initializeWholeProgramDevirtPass(PassRegistry &);
	void initializeWinEHPreparePass(PassRegistry&);			void initializeWinEHPreparePass(PassRegistry&);
	void initializeWriteBitcodePassPass(PassRegistry &);			void initializeWriteBitcodePassPass(PassRegistry &);
				void initializeXRayInstrumentationPass(PassRegistry &);
	}			}

	#endif			#endif

include/llvm/MC/MCELFStreamer.h

Show First 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	public:
void EmitValueToAlignment(unsigned, int64_t, unsigned, unsigned) override;		void EmitValueToAlignment(unsigned, int64_t, unsigned, unsigned) override;

void FinishImpl() override;		void FinishImpl() override;

void EmitBundleAlignMode(unsigned AlignPow2) override;		void EmitBundleAlignMode(unsigned AlignPow2) override;
void EmitBundleLock(bool AlignToEnd) override;		void EmitBundleLock(bool AlignToEnd) override;
void EmitBundleUnlock() override;		void EmitBundleUnlock() override;

		MCSection *getNamedGroupSection(StringRef Section, StringRef Identifer,
		unsigned EntSize) override;

private:		private:
bool isBundleLocked() const;		bool isBundleLocked() const;
void EmitInstToFragment(const MCInst &Inst, const MCSubtargetInfo &) override;		void EmitInstToFragment(const MCInst &Inst, const MCSubtargetInfo &) override;
void EmitInstToData(const MCInst &Inst, const MCSubtargetInfo &) override;		void EmitInstToData(const MCInst &Inst, const MCSubtargetInfo &) override;

void fixSymbolsInTLSFixups(const MCExpr *expr);		void fixSymbolsInTLSFixups(const MCExpr *expr);

/// \brief Merge the content of the fragment \p EF into the fragment \p DF.		/// \brief Merge the content of the fragment \p EF into the fragment \p DF.
Show All 17 Lines

include/llvm/MC/MCObjectFileInfo.h

Show First 20 Lines • Show All 334 Lines • ▼ Show 20 Lines	public:
}		}

// COFF specific sections.		// COFF specific sections.
MCSection *getDrectveSection() const { return DrectveSection; }		MCSection *getDrectveSection() const { return DrectveSection; }
MCSection *getPDataSection() const { return PDataSection; }		MCSection *getPDataSection() const { return PDataSection; }
MCSection *getXDataSection() const { return XDataSection; }		MCSection *getXDataSection() const { return XDataSection; }
MCSection *getSXDataSection() const { return SXDataSection; }		MCSection *getSXDataSection() const { return SXDataSection; }

		// XRay specific sections.
		MCSection *getXRaySection() const { return XRaySection; }

MCSection *getEHFrameSection() {		MCSection *getEHFrameSection() {
return EHFrameSection;		return EHFrameSection;
}		}

enum Environment { IsMachO, IsELF, IsCOFF };		enum Environment { IsMachO, IsELF, IsCOFF };
Environment getObjectFileType() const { return Env; }		Environment getObjectFileType() const { return Env; }

bool isPositionIndependent() const { return PositionIndependent; }		bool isPositionIndependent() const { return PositionIndependent; }
Show All 19 Lines

include/llvm/MC/MCStreamer.h

Show First 20 Lines • Show All 754 Lines • ▼ Show 20 Lines	public:
/// Get the .pdata section used for the given section. Typically the given		/// Get the .pdata section used for the given section. Typically the given
/// section is either the main .text section or some other COMDAT .text		/// section is either the main .text section or some other COMDAT .text
/// section, but it may be any section containing code.		/// section, but it may be any section containing code.
MCSection getAssociatedPDataSection(const MCSection TextSec);		MCSection getAssociatedPDataSection(const MCSection TextSec);

/// Get the .xdata section used for the given section.		/// Get the .xdata section used for the given section.
MCSection getAssociatedXDataSection(const MCSection TextSec);		MCSection getAssociatedXDataSection(const MCSection TextSec);

		/// Get a group section with provided group identifier. This is in ELF, for
		/// instance, named by concatenating \p Section with '.' then \p Identifier.
		virtual MCSection *getNamedGroupSection(StringRef Section,
		StringRef Identifier,
		unsigned EntSize);

virtual void EmitSyntaxDirective();		virtual void EmitSyntaxDirective();

/// \brief Emit a .reloc directive.		/// \brief Emit a .reloc directive.
/// Returns true if the relocation could not be emitted because Name is not		/// Returns true if the relocation could not be emitted because Name is not
/// known.		/// known.
virtual bool EmitRelocDirective(const MCExpr &Offset, StringRef Name,		virtual bool EmitRelocDirective(const MCExpr &Offset, StringRef Name,
const MCExpr *Expr, SMLoc Loc) {		const MCExpr *Expr, SMLoc Loc) {
return true;		return true;
▲ Show 20 Lines • Show All 60 Lines • Show Last 20 Lines

include/llvm/Target/Target.td

	Show First 20 Lines • Show All 940 Lines • ▼ Show 20 Lines
	def PATCHABLE_OP : Instruction {			def PATCHABLE_OP : Instruction {
	let OutOperandList = (outs unknown:$dst);			let OutOperandList = (outs unknown:$dst);
	let InOperandList = (ins variable_ops);			let InOperandList = (ins variable_ops);
	let usesCustomInserter = 1;			let usesCustomInserter = 1;
	let mayLoad = 1;			let mayLoad = 1;
	let mayStore = 1;			let mayStore = 1;
	let hasSideEffects = 1;			let hasSideEffects = 1;
	}			}
				def PATCHABLE_FUNCTION_ENTER : Instruction {
				let OutOperandList = (outs);
				let InOperandList = (ins);
				let AsmString = "# XRay Function Enter.";
				let usesCustomInserter = 1;
				let hasSideEffects = 0;
				}
				def PATCHABLE_RET : Instruction {
				let OutOperandList = (outs unknown:$dst);
				let InOperandList = (ins variable_ops);
				let AsmString = "# XRay Function Exit.";
				let usesCustomInserter = 1;
				let hasSideEffects = 1;
				let isReturn = 1;
				}

				sanjoyUnsubmitted Done Reply Inline Actions Looks like this isn't used yet? If so, I'd suggest dropping it completely for now. sanjoy: Looks like this isn't used yet? If so, I'd suggest dropping it completely for now.
	// Generic opcodes used in GlobalISel.			// Generic opcodes used in GlobalISel.
	include "llvm/Target/GenericOpcodes.td"			include "llvm/Target/GenericOpcodes.td"

	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// AsmParser - This class can be implemented by targets that wish to implement			// AsmParser - This class can be implemented by targets that wish to implement
	// .s file parsing.			// .s file parsing.
	▲ Show 20 Lines • Show All 329 Lines • Show Last 20 Lines

include/llvm/Target/TargetInstrInfo.h

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
///		///
/// TargetInstrInfo - Interface to description of machine instruction set		/// TargetInstrInfo - Interface to description of machine instruction set
///		///
class TargetInstrInfo : public MCInstrInfo {		class TargetInstrInfo : public MCInstrInfo {
TargetInstrInfo(const TargetInstrInfo &) = delete;		TargetInstrInfo(const TargetInstrInfo &) = delete;
void operator=(const TargetInstrInfo &) = delete;		void operator=(const TargetInstrInfo &) = delete;
public:		public:
TargetInstrInfo(unsigned CFSetupOpcode = ~0u, unsigned CFDestroyOpcode = ~0u,		TargetInstrInfo(unsigned CFSetupOpcode = ~0u, unsigned CFDestroyOpcode = ~0u,
unsigned CatchRetOpcode = ~0u)		unsigned CatchRetOpcode = ~0u, unsigned ReturnOpcode = ~0u)
: CallFrameSetupOpcode(CFSetupOpcode),		: CallFrameSetupOpcode(CFSetupOpcode),
CallFrameDestroyOpcode(CFDestroyOpcode),		CallFrameDestroyOpcode(CFDestroyOpcode),
CatchRetOpcode(CatchRetOpcode) {}		CatchRetOpcode(CatchRetOpcode),
		ReturnOpcode(ReturnOpcode) {}

virtual ~TargetInstrInfo();		virtual ~TargetInstrInfo();

static bool isGenericOpcode(unsigned Opc) {		static bool isGenericOpcode(unsigned Opc) {
return Opc <= TargetOpcode::GENERIC_OP_END;		return Opc <= TargetOpcode::GENERIC_OP_END;
}		}

/// Given a machine instruction descriptor, returns the register		/// Given a machine instruction descriptor, returns the register
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	public:
/// if they exist (-1 otherwise). Some targets use pseudo instructions in		/// if they exist (-1 otherwise). Some targets use pseudo instructions in
/// order to abstract away the difference between operating with a frame		/// order to abstract away the difference between operating with a frame
/// pointer and operating without, through the use of these two instructions.		/// pointer and operating without, through the use of these two instructions.
///		///
unsigned getCallFrameSetupOpcode() const { return CallFrameSetupOpcode; }		unsigned getCallFrameSetupOpcode() const { return CallFrameSetupOpcode; }
unsigned getCallFrameDestroyOpcode() const { return CallFrameDestroyOpcode; }		unsigned getCallFrameDestroyOpcode() const { return CallFrameDestroyOpcode; }

unsigned getCatchReturnOpcode() const { return CatchRetOpcode; }		unsigned getCatchReturnOpcode() const { return CatchRetOpcode; }
		unsigned getReturnOpcode() const { return ReturnOpcode; }

/// Returns the actual stack pointer adjustment made by an instruction		/// Returns the actual stack pointer adjustment made by an instruction
/// as part of a call sequence. By default, only call frame setup/destroy		/// as part of a call sequence. By default, only call frame setup/destroy
/// instructions adjust the stack, but targets may want to override this		/// instructions adjust the stack, but targets may want to override this
/// to enable more fine-grained adjustment, or adjust by a different value.		/// to enable more fine-grained adjustment, or adjust by a different value.
virtual int getSPAdjust(const MachineInstr *MI) const;		virtual int getSPAdjust(const MachineInstr *MI) const;

/// Return true if the instruction is a "coalescable" extension instruction.		/// Return true if the instruction is a "coalescable" extension instruction.
/// That is, it's like a copy where it's legal for the source to overlap the		/// That is, it's like a copy where it's legal for the source to overlap the
/// destination. e.g. X86::MOVSX64rr32. If this returns true, then it's		/// destination. e.g. X86::MOVSX64rr32. If this returns true, then it's
/// expected the pre-extension value is available as a subreg of the result		/// expected the pre-extension value is available as a subreg of the result
/// register. This also returns the sub-register index in SubIdx.		/// register. This also returns the sub-register index in SubIdx.
virtual bool isCoalescableExtInstr(const MachineInstr &MI,		virtual bool isCoalescableExtInstr(const MachineInstr &MI,
		majnemerUnsubmitted Done Reply Inline Actions I was thinking more of a whilelist-oriented solution, something like: `unsigned getNormalReturnOpcode() const { return NormalRetOpcode; }` majnemer: I was thinking more of a whilelist-oriented solution, something like: `unsigned…
		dberrisAuthorUnsubmitted Done Reply Inline Actions But how does this work on platforms that can spell 'return' many different normal ways (like in X86)? There's RETL, RETQ, and all other versions of RET which are considered "normal"? dberris: But how does this work on platforms that can spell 'return' many different normal ways (like in…
		majnemerUnsubmitted Done Reply Inline Actions The same way it is handled for `X86::ADJCALLSTACKDOWN64` vs `X86::ADJCALLSTACKDOWN32`: https://github.com/llvm-mirror/llvm/blob/master/lib/Target/X86/X86InstrInfo.cpp#L105 majnemer: The same way it is handled for `X86::ADJCALLSTACKDOWN64` vs `X86::ADJCALLSTACKDOWN32`: https…
		dberrisAuthorUnsubmitted Done Reply Inline Actions That makes a lot of sense, thanks! I'll rework this a bit to make it more of a targeted specific OpCode. I suppose it's fine to have multiples of these functions, each for relevant flavours of fairly common instruction classes, yes? So I suppose, for `tail call` exits/returns, we would have something similar for that? dberris: That makes a lot of sense, thanks! I'll rework this a bit to make it more of a targeted…
		dberrisAuthorUnsubmitted Done Reply Inline Actions Updated now to white-list a specific return OpCode. dberris: Updated now to white-list a specific return OpCode.
unsigned &SrcReg, unsigned &DstReg,		unsigned &SrcReg, unsigned &DstReg,
unsigned &SubIdx) const {		unsigned &SubIdx) const {
return false;		return false;
}		}

/// If the specified machine instruction is a direct		/// If the specified machine instruction is a direct
/// load from a stack slot, return the virtual or physical register number of		/// load from a stack slot, return the virtual or physical register number of
/// the destination along with the FrameIndex of the loaded stack slot. If		/// the destination along with the FrameIndex of the loaded stack slot. If
▲ Show 20 Lines • Show All 1,257 Lines • ▼ Show 20 Lines	public:
virtual ArrayRef<std::pair<unsigned, const char *>>		virtual ArrayRef<std::pair<unsigned, const char *>>
getSerializableBitmaskMachineOperandTargetFlags() const {		getSerializableBitmaskMachineOperandTargetFlags() const {
return None;		return None;
}		}

private:		private:
unsigned CallFrameSetupOpcode, CallFrameDestroyOpcode;		unsigned CallFrameSetupOpcode, CallFrameDestroyOpcode;
unsigned CatchRetOpcode;		unsigned CatchRetOpcode;
		unsigned ReturnOpcode;
};		};

/// \brief Provide DenseMapInfo for TargetInstrInfo::RegSubRegPair.		/// \brief Provide DenseMapInfo for TargetInstrInfo::RegSubRegPair.
template<>		template<>
struct DenseMapInfo<TargetInstrInfo::RegSubRegPair> {		struct DenseMapInfo<TargetInstrInfo::RegSubRegPair> {
typedef DenseMapInfo<unsigned> RegInfo;		typedef DenseMapInfo<unsigned> RegInfo;

static inline TargetInstrInfo::RegSubRegPair getEmptyKey() {		static inline TargetInstrInfo::RegSubRegPair getEmptyKey() {
Show All 24 Lines

include/llvm/Target/TargetOpcodes.def

	Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines
	/// instruction wrapped in PATCHABLE_OP has to either have a minimum			/// instruction wrapped in PATCHABLE_OP has to either have a minimum
	/// size or be preceded with a nop of that size. The first operand is			/// size or be preceded with a nop of that size. The first operand is
	/// an immediate denoting the minimum size of the instruction, the			/// an immediate denoting the minimum size of the instruction, the
	/// second operand is an immediate denoting the opcode of the original			/// second operand is an immediate denoting the opcode of the original
	/// instruction. The rest of the operands are the operands of the			/// instruction. The rest of the operands are the operands of the
	/// original instruction.			/// original instruction.
	HANDLE_TARGET_OPCODE(PATCHABLE_OP, 23)			HANDLE_TARGET_OPCODE(PATCHABLE_OP, 23)

				/// This is a marker instruction which gets translated into a nop sled, useful
				echristoUnsubmitted Done Reply Inline Actions Remove the XRay specific stuff here and rewrite in a more generic fashion (and below). echristo: Remove the XRay specific stuff here and rewrite in a more generic fashion (and below).
				dberrisAuthorUnsubmitted Not Done Reply Inline Actions Done dberris: Done
				/// for inserting instrumentation instructions at runtime.
				HANDLE_TARGET_OPCODE(PATCHABLE_FUNCTION_ENTER, 24)

				/// Wraps a return instruction and its operands to enable adding nop sleds
				/// either before or after the return. The nop sleds are useful for inserting
				/// instrumentation instructions at runtime.
				HANDLE_TARGET_OPCODE(PATCHABLE_RET, 25)

	/// The following generic opcodes are not supposed to appear after ISel.			/// The following generic opcodes are not supposed to appear after ISel.
	/// This is something we might want to relax, but for now, this is convenient			/// This is something we might want to relax, but for now, this is convenient
	/// to produce diagnostics.			/// to produce diagnostics.

	/// Generic ADD instruction. This is an integer add.			/// Generic ADD instruction. This is an integer add.
	HANDLE_TARGET_OPCODE(G_ADD, 24)			HANDLE_TARGET_OPCODE(G_ADD, 26)
	HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_START, G_ADD)			HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_START, G_ADD)

	/// Generic Bitwise-OR instruction.			/// Generic Bitwise-OR instruction.
	HANDLE_TARGET_OPCODE(G_OR, 25)			HANDLE_TARGET_OPCODE(G_OR, 25)

	/// Generic BRANCH instruction. This is an unconditional branch.			/// Generic BRANCH instruction. This is an unconditional branch.
	HANDLE_TARGET_OPCODE(G_BR, 26)			HANDLE_TARGET_OPCODE(G_BR, 27)

	// TODO: Add more generic opcodes as we move along.			// TODO: Add more generic opcodes as we move along.

	/// Marker for the end of the generic opcode.			/// Marker for the end of the generic opcode.
	/// This is used to check if an opcode is in the range of the			/// This is used to check if an opcode is in the range of the
	/// generic opcodes.			/// generic opcodes.
	HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_END, G_BR)			HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_END, G_BR)

	/// BUILTIN_OP_END - This must be the last enum value in this list.			/// BUILTIN_OP_END - This must be the last enum value in this list.
	/// The target-specific post-isel opcode values start here.			/// The target-specific post-isel opcode values start here.
	HANDLE_TARGET_OPCODE_MARKER(GENERIC_OP_END, PRE_ISEL_GENERIC_OPCODE_END)			HANDLE_TARGET_OPCODE_MARKER(GENERIC_OP_END, PRE_ISEL_GENERIC_OPCODE_END)

lib/CodeGen/CMakeLists.txt

Show First 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	add_llvm_library(LLVMCodeGen
TargetOptionsImpl.cpp		TargetOptionsImpl.cpp
TargetPassConfig.cpp		TargetPassConfig.cpp
TargetRegisterInfo.cpp		TargetRegisterInfo.cpp
TargetSchedule.cpp		TargetSchedule.cpp
TwoAddressInstructionPass.cpp		TwoAddressInstructionPass.cpp
UnreachableBlockElim.cpp		UnreachableBlockElim.cpp
VirtRegMap.cpp		VirtRegMap.cpp
WinEHPrepare.cpp		WinEHPrepare.cpp
		XRayInstrumentation.cpp

ADDITIONAL_HEADER_DIRS		ADDITIONAL_HEADER_DIRS
${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen		${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen
${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen/PBQP		${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen/PBQP

LINK_LIBS ${PTHREAD_LIB}		LINK_LIBS ${PTHREAD_LIB}
)		)

add_dependencies(LLVMCodeGen intrinsics_gen)		add_dependencies(LLVMCodeGen intrinsics_gen)

add_subdirectory(SelectionDAG)		add_subdirectory(SelectionDAG)
add_subdirectory(AsmPrinter)		add_subdirectory(AsmPrinter)
add_subdirectory(MIRParser)		add_subdirectory(MIRParser)
add_subdirectory(GlobalISel)		add_subdirectory(GlobalISel)

lib/CodeGen/CodeGen.cpp

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	void llvm::initializeCodeGen(PassRegistry &Registry) {
initializeMachineFunctionPrinterPassPass(Registry);		initializeMachineFunctionPrinterPassPass(Registry);
initializeMachineLICMPass(Registry);		initializeMachineLICMPass(Registry);
initializeMachineLoopInfoPass(Registry);		initializeMachineLoopInfoPass(Registry);
initializeMachineModuleInfoPass(Registry);		initializeMachineModuleInfoPass(Registry);
initializeMachinePostDominatorTreePass(Registry);		initializeMachinePostDominatorTreePass(Registry);
initializeMachineSchedulerPass(Registry);		initializeMachineSchedulerPass(Registry);
initializeMachineSinkingPass(Registry);		initializeMachineSinkingPass(Registry);
initializeMachineVerifierPassPass(Registry);		initializeMachineVerifierPassPass(Registry);
		initializeXRayInstrumentationPass(Registry);
initializePatchableFunctionPass(Registry);		initializePatchableFunctionPass(Registry);
initializeOptimizePHIsPass(Registry);		initializeOptimizePHIsPass(Registry);
initializePEIPass(Registry);		initializePEIPass(Registry);
initializePHIEliminationPass(Registry);		initializePHIEliminationPass(Registry);
initializePeepholeOptimizerPass(Registry);		initializePeepholeOptimizerPass(Registry);
initializePostMachineSchedulerPass(Registry);		initializePostMachineSchedulerPass(Registry);
initializePostRAHazardRecognizerPass(Registry);		initializePostRAHazardRecognizerPass(Registry);
initializePostRASchedulerPass(Registry);		initializePostRASchedulerPass(Registry);
Show All 26 Lines

lib/CodeGen/TargetPassConfig.cpp

Show First 20 Lines • Show All 630 Lines • ▼ Show 20 Lines	if (UseIPRA)
// clobbered registers, to be used to optimize call sites.		// clobbered registers, to be used to optimize call sites.
addPass(createRegUsageInfoCollector());		addPass(createRegUsageInfoCollector());

addPass(&FuncletLayoutID, false);		addPass(&FuncletLayoutID, false);

addPass(&StackMapLivenessID, false);		addPass(&StackMapLivenessID, false);
addPass(&LiveDebugValuesID, false);		addPass(&LiveDebugValuesID, false);

		addPass(&XRayInstrumentationID, false);
addPass(&PatchableFunctionID, false);		addPass(&PatchableFunctionID, false);

AddingMachinePasses = false;		AddingMachinePasses = false;
}		}

/// Add passes that optimize machine instructions in SSA form.		/// Add passes that optimize machine instructions in SSA form.
void TargetPassConfig::addMachineSSAOptimization() {		void TargetPassConfig::addMachineSSAOptimization() {
// Pre-ra tail duplication.		// Pre-ra tail duplication.
▲ Show 20 Lines • Show All 214 Lines • Show Last 20 Lines

lib/CodeGen/XRayInstrumentation.cpp

This file was added.

				//===-- XRayInstrumentation.cpp - Adds XRay instrumentation to functions. -===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements a MachineFunctionPass that inserts the appropriate
				// XRay instrumentation instructions. We look for XRay-specific attributes
				// on the function to determine whether we should insert the replacement
				// operations.
				//
				//===---------------------------------------------------------------------===//

				#include "llvm/CodeGen/Analysis.h"
				#include "llvm/CodeGen/MachineFunction.h"
				#include "llvm/CodeGen/MachineFunctionPass.h"
				#include "llvm/CodeGen/MachineInstrBuilder.h"
				#include "llvm/CodeGen/Passes.h"
				#include "llvm/Support/TargetRegistry.h"
				#include "llvm/Target/TargetInstrInfo.h"
				#include "llvm/Target/TargetSubtargetInfo.h"

				using namespace llvm;

				namespace {
				struct XRayInstrumentation : public MachineFunctionPass {
				static char ID;

				XRayInstrumentation() : MachineFunctionPass(ID) {
				initializeXRayInstrumentationPass(*PassRegistry::getPassRegistry());
				}

				bool runOnMachineFunction(MachineFunction &MF) override;
				};
				}

				bool XRayInstrumentation::runOnMachineFunction(MachineFunction &MF) {
				auto &F = *MF.getFunction();
				auto InstrAttr = F.getFnAttribute("function-instrument");
				bool AlwaysInstrument = !InstrAttr.hasAttribute(Attribute::None) &&
				InstrAttr.isStringAttribute() &&
				InstrAttr.getValueAsString() == "xray-always";
				Attribute Attr = F.getFnAttribute("xray-instruction-threshold");
				unsigned XRayThreshold = 0;
				if (!AlwaysInstrument) {
				if (Attr.hasAttribute(Attribute::None) \|\| !Attr.isStringAttribute())
				return false; // XRay threshold attribute not found.
				if (Attr.getValueAsString().getAsInteger(10, XRayThreshold))
				return false; // Invalid value for threshold.
				if (F.size() < XRayThreshold)
				return false; // Function is too small.
				}

				// FIXME: Do the loop triviality analysis here or in an earlier pass.

				// First, insert an PATCHABLE_FUNCTION_ENTER as the first instruction of the
				// MachineFunction.
				auto &FirstMBB = *MF.begin();
				auto &FirstMI = *FirstMBB.begin();
				auto *TII = MF.getSubtarget().getInstrInfo();
				BuildMI(FirstMBB, FirstMI, FirstMI.getDebugLoc(),
				TII->get(TargetOpcode::PATCHABLE_FUNCTION_ENTER));

				// Then we look for all terminators and returns, then replace those with
				// PATCHABLE_RET instructions.
				SmallVector<MachineInstr *, 4> Terminators;
				for (auto &MBB : MF) {
				for (auto &T : MBB.terminators()) {
				// FIXME: Handle tail calls here too?
				if (T.isReturn() && T.getOpcode() == TII->getReturnOpcode()) {
				// Replace return instructions with:
				dberrisAuthorUnsubmitted Done Reply Inline Actions Thanks! I've updated the patch to do this. PTAL? dberris: Thanks! I've updated the patch to do this. PTAL?
				// PATCHABLE_RET <Opcode>, <Operand>...
				auto MIB = BuildMI(MBB, T, T.getDebugLoc(),
				majnemerUnsubmitted Done Reply Inline Actions The typical way of doing this sort of thing, AFAICT, is to teach `TargetInstrInfo` about it. For example, it implements `getCatchReturnOpcode` and `getCallFrameSetupOpcode`. majnemer: The typical way of doing this sort of thing, AFAICT, is to teach `TargetInstrInfo` about it.
				TII->get(TargetOpcode::PATCHABLE_RET))
				majnemerUnsubmitted Done Reply Inline Actions I would recommend white-listing the opcodes you know are safe to replace instead of blacklisting the ones you know which are unsafe. This would make the pass conservatively correct in the face of future changes to LLVM. majnemer: I would recommend white-listing the opcodes you know are safe to replace instead of…
				dberrisAuthorUnsubmitted Done Reply Inline Actions That makes sense, thanks majnemer -- however it's not clear to me how I'd refer to the target-specific opcodes from here. Pardon the newbie question, but how do I say "I just want return instructions here"? The only options I can see are: Look at the original LLVM IR to see if it's a ReturnInst. Figure out a way for including the X86 opcodes here. Maybe I'm missing something simpler here? dberris: That makes sense, thanks majnemer -- however it's not clear to me how I'd refer to the target…
				.addImm(T.getOpcode());
				majnemerUnsubmitted Done Reply Inline Actions I don't think you want to replace whacky constructs like `EH_RETURN` and `CLEANUPRET` with `PATCHABLE_RET`, do you? Just "normal" returns, right? majnemer: I don't think you want to replace whacky constructs like `EH_RETURN` and `CLEANUPRET` with…
				dberrisAuthorUnsubmitted Done Reply Inline Actions Yes, that's right -- does this version exclude those appropriately? dberris: Yes, that's right -- does this version exclude those appropriately?
				for (auto &MO : T.operands())
				MIB.addOperand(MO);
				Terminators.push_back(&T);
				sanjoyUnsubmitted Done Reply Inline Actions Why do you care only about register operands here? sanjoy: Why do you care only about register operands here?
				dberrisAuthorUnsubmitted Done Reply Inline Actions Oops, I had assumed that the RET instructions only ever had register operands. That's a faulty assumption, good catch. dberris: Oops, I had assumed that the RET instructions only ever had register operands. That's a faulty…
				break;
				}
				}
				}

				for (auto &I : Terminators)
				I->eraseFromParent();

				return true;
				}

				char XRayInstrumentation::ID = 0;
				char &llvm::XRayInstrumentationID = XRayInstrumentation::ID;
				INITIALIZE_PASS(XRayInstrumentation, "xray-instrumentation", "Insert XRay ops",
				false, false);

lib/MC/MCELFStreamer.cpp

Show First 20 Lines • Show All 665 Lines • ▼ Show 20 Lines	void MCELFStreamer::EmitZerofill(MCSection Section, MCSymbol Symbol,
uint64_t Size, unsigned ByteAlignment) {		uint64_t Size, unsigned ByteAlignment) {
llvm_unreachable("ELF doesn't support this directive");		llvm_unreachable("ELF doesn't support this directive");
}		}

void MCELFStreamer::EmitTBSSSymbol(MCSection Section, MCSymbol Symbol,		void MCELFStreamer::EmitTBSSSymbol(MCSection Section, MCSymbol Symbol,
uint64_t Size, unsigned ByteAlignment) {		uint64_t Size, unsigned ByteAlignment) {
llvm_unreachable("ELF doesn't support this directive");		llvm_unreachable("ELF doesn't support this directive");
}		}

		MCSection *MCELFStreamer::getNamedGroupSection(StringRef Section,
		StringRef Identifier,
		unsigned EntSize) {
		return getContext().getELFSection(Section, ELF::SHT_PROGBITS,
		ELF::SHF_ALLOC \| ELF::SHF_GROUP, EntSize,
		Identifier);
		}

lib/MC/MCStreamer.cpp

Show First 20 Lines • Show All 800 Lines • ▼ Show 20 Lines	MCSymbol MCStreamer::endSection(MCSection Section) {
MCSymbol *Sym = Section->getEndSymbol(Context);		MCSymbol *Sym = Section->getEndSymbol(Context);
if (Sym->isInSection())		if (Sym->isInSection())
return Sym;		return Sym;

SwitchSection(Section);		SwitchSection(Section);
EmitLabel(Sym);		EmitLabel(Sym);
return Sym;		return Sym;
}		}

		MCSection *MCStreamer::getNamedGroupSection(StringRef, StringRef, unsigned) {
		llvm_unreachable("Unimplemented.");
		return nullptr;
		}

lib/Target/X86/X86AsmPrinter.h

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	private:
// CurrentShadowSize counts the number of bytes encoded since the most		// CurrentShadowSize counts the number of bytes encoded since the most
// recently encountered STACKMAP, stopping when that number is greater than		// recently encountered STACKMAP, stopping when that number is greater than
// or equal to RequiredShadowSize.		// or equal to RequiredShadowSize.
unsigned RequiredShadowSize = 0, CurrentShadowSize = 0;		unsigned RequiredShadowSize = 0, CurrentShadowSize = 0;
};		};

StackMapShadowTracker SMShadowTracker;		StackMapShadowTracker SMShadowTracker;

		// This describes the kind of sled we're storing in the XRay table.
		enum class SledKind : uint8_t {
		FUNCTION_ENTER = 0,
		FUNCTION_EXIT = 1,
		TAIL_CALL = 2,
		};

		// The table will contain these structs that point to the sled, the function
		// containing the sled, and what kind of sled (and whether they should always
		// be instrumented).
		struct XRayFunctionEntry {
		const MCSymbol *Sled;
		const MCSymbol *Function;
		SledKind Kind;
		bool AlwaysInstrument;
		const class Function *Fn;
		};

		// All the sleds to be emitted.
		std::vector<XRayFunctionEntry> Sleds;

// All instructions emitted by the X86AsmPrinter should use this helper		// All instructions emitted by the X86AsmPrinter should use this helper
// method.		// method.
//		//
// This helper function invokes the SMShadowTracker on each instruction before		// This helper function invokes the SMShadowTracker on each instruction before
// outputting it to the OutStream. This allows the shadow tracker to minimise		// outputting it to the OutStream. This allows the shadow tracker to minimise
// the number of NOPs used for stackmap padding.		// the number of NOPs used for stackmap padding.
void EmitAndCountInstruction(MCInst &Inst);		void EmitAndCountInstruction(MCInst &Inst);
void LowerSTACKMAP(const MachineInstr &MI);		void LowerSTACKMAP(const MachineInstr &MI);
void LowerPATCHPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerPATCHPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);
void LowerSTATEPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerSTATEPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);
void LowerFAULTING_LOAD_OP(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerFAULTING_LOAD_OP(const MachineInstr &MI, X86MCInstLower &MCIL);
void LowerPATCHABLE_OP(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerPATCHABLE_OP(const MachineInstr &MI, X86MCInstLower &MCIL);

void LowerTlsAddr(X86MCInstLower &MCInstLowering, const MachineInstr &MI);		void LowerTlsAddr(X86MCInstLower &MCInstLowering, const MachineInstr &MI);

		// XRay-specific lowering for X86.
		void LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,
		X86MCInstLower &MCIL);
		void LowerPATCHABLE_RET(const MachineInstr &MI, X86MCInstLower &MCIL);
		void LowerPATCHABLE_TAIL_CALL(const MachineInstr &MI, X86MCInstLower &MCIL);

		// Helper function that emits the XRay sleds we've collected for a particular
		// function.
		void EmitXRayTable();

		// Helper function to record a given XRay sled.
		void recordSled(MCSymbol *Sled, const MachineInstr &MI, SledKind Kind);
public:		public:
explicit X86AsmPrinter(TargetMachine &TM,		explicit X86AsmPrinter(TargetMachine &TM,
std::unique_ptr<MCStreamer> Streamer)		std::unique_ptr<MCStreamer> Streamer)
: AsmPrinter(TM, std::move(Streamer)), SM(this), FM(this) {}		: AsmPrinter(TM, std::move(Streamer)), SM(this), FM(this) {}

const char *getPassName() const override {		const char *getPassName() const override {
return "X86 Assembly / Object Emitter";		return "X86 Assembly / Object Emitter";
}		}

const X86Subtarget &getSubtarget() const { return *Subtarget; }		const X86Subtarget &getSubtarget() const { return *Subtarget; }

void EmitStartOfAsmFile(Module &M) override;		void EmitStartOfAsmFile(Module &M) override;
Show All 31 Lines

lib/Target/X86/X86AsmPrinter.cpp

Show First 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	if (Subtarget->isTargetCOFF()) {
OutStreamer->EmitCOFFSymbolType(COFF::IMAGE_SYM_DTYPE_FUNCTION		OutStreamer->EmitCOFFSymbolType(COFF::IMAGE_SYM_DTYPE_FUNCTION
<< COFF::SCT_COMPLEX_TYPE_SHIFT);		<< COFF::SCT_COMPLEX_TYPE_SHIFT);
OutStreamer->EndCOFFSymbolDef();		OutStreamer->EndCOFFSymbolDef();
}		}

// Emit the rest of the function body.		// Emit the rest of the function body.
EmitFunctionBody();		EmitFunctionBody();

		// Emit the XRay table for this function.
		EmitXRayTable();

// We didn't modify anything.		// We didn't modify anything.
return false;		return false;
}		}

/// printSymbolOperand - Print a raw symbol reference operand. This handles		/// printSymbolOperand - Print a raw symbol reference operand. This handles
/// jump tables, constant pools, global address and external symbols, all of		/// jump tables, constant pools, global address and external symbols, all of
/// which print to a label with various suffixes for relocation types etc.		/// which print to a label with various suffixes for relocation types etc.
static void printSymbolOperand(X86AsmPrinter &P, const MachineOperand &MO,		static void printSymbolOperand(X86AsmPrinter &P, const MachineOperand &MO,
▲ Show 20 Lines • Show All 613 Lines • Show Last 20 Lines

lib/Target/X86/X86FrameLowering.cpp

Show First 20 Lines • Show All 153 Lines • ▼ Show 20 Lines	static unsigned findDeadCallerSavedReg(MachineBasicBlock &MBB,
if (!F \|\| MF->getMMI().callsEHReturn())		if (!F \|\| MF->getMMI().callsEHReturn())
return 0;		return 0;

const TargetRegisterClass &AvailableRegs = TRI->getGPRsForTailCall(MF);		const TargetRegisterClass &AvailableRegs = TRI->getGPRsForTailCall(MF);

unsigned Opc = MBBI->getOpcode();		unsigned Opc = MBBI->getOpcode();
switch (Opc) {		switch (Opc) {
default: return 0;		default: return 0;
		case TargetOpcode::PATCHABLE_RET:
case X86::RET:		case X86::RET:
case X86::RETL:		case X86::RETL:
case X86::RETQ:		case X86::RETQ:
case X86::RETIL:		case X86::RETIL:
case X86::RETIQ:		case X86::RETIQ:
case X86::TCRETURNdi:		case X86::TCRETURNdi:
case X86::TCRETURNri:		case X86::TCRETURNri:
case X86::TCRETURNmi:		case X86::TCRETURNmi:
▲ Show 20 Lines • Show All 2,763 Lines • Show Last 20 Lines

lib/Target/X86/X86InstrInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines
	// Pin the vtable to this file.			// Pin the vtable to this file.
	void X86InstrInfo::anchor() {}			void X86InstrInfo::anchor() {}

	X86InstrInfo::X86InstrInfo(X86Subtarget &STI)			X86InstrInfo::X86InstrInfo(X86Subtarget &STI)
	: X86GenInstrInfo((STI.isTarget64BitLP64() ? X86::ADJCALLSTACKDOWN64			: X86GenInstrInfo((STI.isTarget64BitLP64() ? X86::ADJCALLSTACKDOWN64
	: X86::ADJCALLSTACKDOWN32),			: X86::ADJCALLSTACKDOWN32),
	(STI.isTarget64BitLP64() ? X86::ADJCALLSTACKUP64			(STI.isTarget64BitLP64() ? X86::ADJCALLSTACKUP64
	: X86::ADJCALLSTACKUP32),			: X86::ADJCALLSTACKUP32),
	X86::CATCHRET),			X86::CATCHRET,
				(STI.is64Bit() ? X86::RETQ : X86::RETL)),
	Subtarget(STI), RI(STI.getTargetTriple()) {			Subtarget(STI), RI(STI.getTargetTriple()) {

	static const X86MemoryFoldTableEntry MemoryFoldTable2Addr[] = {			static const X86MemoryFoldTableEntry MemoryFoldTable2Addr[] = {
	{ X86::ADC32ri, X86::ADC32mi, 0 },			{ X86::ADC32ri, X86::ADC32mi, 0 },
	{ X86::ADC32ri8, X86::ADC32mi8, 0 },			{ X86::ADC32ri8, X86::ADC32mi8, 0 },
	{ X86::ADC32rr, X86::ADC32mr, 0 },			{ X86::ADC32rr, X86::ADC32mr, 0 },
	{ X86::ADC64ri32, X86::ADC64mi32, 0 },			{ X86::ADC64ri32, X86::ADC64mi32, 0 },
	{ X86::ADC64ri8, X86::ADC64mi8, 0 },			{ X86::ADC64ri8, X86::ADC64mi8, 0 },
	▲ Show 20 Lines • Show All 7,412 Lines • Show Last 20 Lines

lib/Target/X86/X86MCInstLower.cpp

Show All 30 Lines
#include "llvm/IR/Mangler.h"		#include "llvm/IR/Mangler.h"
#include "llvm/MC/MCAsmInfo.h"		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCCodeEmitter.h"		#include "llvm/MC/MCCodeEmitter.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCExpr.h"		#include "llvm/MC/MCExpr.h"
#include "llvm/MC/MCFixup.h"		#include "llvm/MC/MCFixup.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCInstBuilder.h"		#include "llvm/MC/MCInstBuilder.h"
		#include "llvm/MC/MCSection.h"
#include "llvm/MC/MCStreamer.h"		#include "llvm/MC/MCStreamer.h"
#include "llvm/MC/MCSymbol.h"		#include "llvm/MC/MCSymbol.h"
		#include "llvm/MC/MCSymbolELF.h"
#include "llvm/Support/TargetRegistry.h"		#include "llvm/Support/TargetRegistry.h"
		#include "llvm/Support/ELF.h"
		#include "llvm/Target/TargetLoweringObjectFile.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

/// X86MCInstLower - This class is used to lower an MachineInstr into an MCInst.		/// X86MCInstLower - This class is used to lower an MachineInstr into an MCInst.
class X86MCInstLower {		class X86MCInstLower {
MCContext &Ctx;		MCContext &Ctx;
const MachineFunction &MF;		const MachineFunction &MF;
▲ Show 20 Lines • Show All 990 Lines • ▼ Show 20 Lines	void X86AsmPrinter::LowerPATCHPOINT(const MachineInstr &MI,
unsigned NumBytes = opers.getMetaOper(PatchPointOpers::NBytesPos).getImm();		unsigned NumBytes = opers.getMetaOper(PatchPointOpers::NBytesPos).getImm();
assert(NumBytes >= EncodedBytes &&		assert(NumBytes >= EncodedBytes &&
"Patchpoint can't request size less than the length of a call.");		"Patchpoint can't request size less than the length of a call.");

EmitNops(*OutStreamer, NumBytes - EncodedBytes, Subtarget->is64Bit(),		EmitNops(*OutStreamer, NumBytes - EncodedBytes, Subtarget->is64Bit(),
getSubtargetInfo());		getSubtargetInfo());
}		}

		void X86AsmPrinter::recordSled(MCSymbol *Sled, const MachineInstr &MI,
		SledKind Kind) {
		auto Fn = MI.getParent()->getParent()->getFunction();
		auto Attr = Fn->getFnAttribute("function-instrument");
		sanjoyUnsubmitted Done Reply Inline Actions Why not AlwaysInstrument = !Attr.hasAttribute(Attribute::None) && Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always"; ? sanjoy: Why not ``` AlwaysInstrument = !Attr.hasAttribute(Attribute::None) && Attr.isStringAttribute()…
		bool AlwaysInstrument =
		sanjoyUnsubmitted Done Reply Inline Actions I don't quite grok the Attributes API, but I think the `!Attr.hasAttribute(Attribute::None)` bit is redundant -- if `isStringAttribute` return `true`, then `Attr.hasAttribute(Attribute::None)` will be false. sanjoy: I don't quite grok the Attributes API, but I think the `!Attr.hasAttribute(Attribute::None)`…
		Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always";
		Sleds.emplace_back(
		XRayFunctionEntry{Sled, CurrentFnSym, Kind, AlwaysInstrument, Fn});
		sanjoyUnsubmitted Done Reply Inline Actions emplace_back ? sanjoy: emplace_back ?
		}

		void X86AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,
		X86MCInstLower &MCIL) {
		// We want to emit the following pattern:
		//
		// .Lxray_sled_N:
		// .palign 2, ...
		// jmp .tmpN
		// # 9 bytes worth of noops
		bmakamUnsubmitted Done Reply Inline Actions Could you please expand on why you need 9 bytes of noops here? I am not quite familiar with x86_64 but was under the impression that on x86_64 the jmp instruction is 1 byte for opcode and 4 bytes for signed relative displacement, so shouldn't 5 bytes worth of nops be sufficient? bmakam: Could you please expand on why you need 9 bytes of noops here? I am not quite familiar with…
		dberrisAuthorUnsubmitted Done Reply Inline Actions Good question, thanks. I have to check whether we're using the right version of JMP, but I'm specifically looking for the version that's one byte for the JMP instruction, and 8 bits (1 byte) for the relative offset. So far I haven't been able to spell `jmp +0x09` correctly and have it work, without having an additional symbol as a target for the jump instruction. If we get that right, we can then add the 9 byte nops we need to get exactly 11 bytes for the function entry. Is there a fool-proof way of spelling "JMP +0x09" with the builder interface? Or should I add another JMP instruction in X86 that supports the 8-bit displacement immediate operand? The reason why I can't use a JMP that isn't 2 bytes, is because it's really hard to write just 5 bytes atomically. I can probably do something with an 8-byte atomic write, but enforcing that 8-byte write doesn't span cache lines is also very tricky to make safe. dberris: Good question, thanks. I have to check whether we're using the right version of JMP, but I'm…
		dberrisAuthorUnsubmitted Done Reply Inline Actions Actually now that I've had a look at the generated object file, I can confirm that we're using the two-byte version of `JMP` with this construct. Quick test: test.cc: #include <cstdio> [[clang::xray_always_instrument]] void foo() { std::printf("Hello, XRay!\n"); } int main(int argc, char* argv[]) { foo(); } Compiled with (modified clang to emit IR that has annotated functions for XRay): ./bin/clang -fxray-instrument -fxray-instruction-threshold=1 -std=c++11 -x c++ -S test.cc -emit-llvm Creates this IR listing: ; ModuleID = 'test.cc' source_filename = "test.cc" target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128" target triple = "x86_64-unknown-linux-gnu" @.str = private unnamed_addr constant [14 x i8] c"Hello, XRay!\0A\00", align 1 ; Function Attrs: uwtable define void @_Z3foov() #0 { entry: %call = call i32 (i8, ...) @printf(i8 getelementptr inbounds ([14 x i8], [14 x i8]* @.str, i32 0, i32 0)) ret void } declare i32 @printf(i8, ...) #1 ; Function Attrs: norecurse uwtable define i32 @main(i32 %argc, i8* %argv) #2 { entry: %argc.addr = alloca i32, align 4 %argv.addr = alloca i8*, align 8 store i32 %argc, i32 %argc.addr, align 4 store i8 %argv, i8* %argv.addr, align 8 call void @_Z3foov() ret i32 0 } attributes #0 = { uwtable "disable-tail-calls"="false" "function-instrument"="xray-always" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" } attributes #1 = { "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" } attributes #2 = { norecurse uwtable "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" "xray-instruction-threshold"="1" } !llvm.ident = !{!0} !0 = !{!"clang version 3.9.0 (http://llvm.org/git/clang.git 2b9ed9227330789f24e56fe9e800e7be2111073b) (http://llvm.org/git/llvm.git 1f3e353113ff9e2f835955bf005a8a5e25f16ad1)"} Then compiled+disassembled this way: ./bin/llc -filetype=obj -o - < test.ll \| ./bin/llvm-objdump -disassemble - Produces the following output: <stdin>: file format ELF64-x86-64 Disassembly of section .text: _Z3foov: 0: eb 09 jmp 9 <_Z3foov+0xB> 2: 66 0f 1f 84 00 00 02 00 00 nopw 512(%rax,%rax) b: 55 pushq %rbp c: 48 89 e5 movq %rsp, %rbp f: bf 00 00 00 00 movl $0, %edi 14: 31 c0 xorl %eax, %eax 16: e8 00 00 00 00 callq 0 <_Z3foov+0x1B> 1b: 5d popq %rbp 1c: c3 retq 1d: 2e 66 0f 1f 84 00 00 02 00 00 nopw %cs:512(%rax,%rax) 27: 66 0f 1f 84 00 00 00 00 00 nopw (%rax,%rax) main: 30: eb 09 jmp 9 <main+0xB> 32: 66 0f 1f 84 00 00 02 00 00 nopw 512(%rax,%rax) 3b: 55 pushq %rbp 3c: 48 89 e5 movq %rsp, %rbp 3f: 48 83 ec 10 subq $16, %rsp 43: 89 7d fc movl %edi, -4(%rbp) 46: 48 89 75 f0 movq %rsi, -16(%rbp) 4a: e8 b1 ff ff ff callq -79 <_Z3foov> 4f: 31 c0 xorl %eax, %eax 51: 48 83 c4 10 addq $16, %rsp 55: 5d popq %rbp 56: c3 retq 57: 2e 66 0f 1f 84 00 00 02 00 00 nopw %cs:512(%rax,%rax) dberris: Actually now that I've had a look at the generated object file, I can confirm that we're using…
		dberrisAuthorUnsubmitted Done Reply Inline Actions Updated the comments now to make it clear and specific. Thanks! dberris: Updated the comments now to make it clear and specific. Thanks!
		// .tmpN
		//
		// We need the 9 bytes because at runtime, we'd be patching over the full 11
		// bytes with the following pattern:
		//
		// mov %r10, <function id, 32-bit> // 6 bytes
		// call <relative offset, 32-bits> // 5 bytes
		rnkUnsubmitted Done Reply Inline Actions Does XRay support multiple DSOs in the process? If so, the XRay runtime may be more than 2GB away from the code being patched, and this offset will overflow. rnk: Does XRay support multiple DSOs in the process? If so, the XRay runtime may be more than 2GB…
		dberrisAuthorUnsubmitted Not Done Reply Inline Actions That's true -- the runtime patching code will fail to patch certain sleds that are farther than 32-bits away. In case we need more, in the future we can tweak this so that we can try to force the XRay runtime trampoline in the first 2GB of the process' memory and use an "absolute" address (which may need more bytes in the sled), or have a mode to support larger binaries (a flag that changes the emitted sleds, and have the runtime support both kinds of sleds). I suspect if it comes up in practice that we can deal with this more effectively. dberris: That's true -- the runtime patching code will fail to patch certain sleds that are farther than…
		//
		auto CurSled = OutContext.createTempSymbol("xray_sled_", true);
		OutStreamer->EmitLabel(CurSled);
		OutStreamer->EmitCodeAlignment(4);
		auto Target = OutContext.createTempSymbol();

		// Use a two-byte `jmp`. This version of JMP takes an 8-bit relative offset as
		// an operand (computed as an offset from the jmp instruction).
		// FIXME: Find another less hacky way do force the relative jump.
		OutStreamer->EmitBytes("\xeb\x09");
		rnkUnsubmitted Done Reply Inline Actions I guess we can deal with this separately. IMO we should have a 'jmpb' instruction or something that forces a short jump or assembler error if the displacement is too large. rnk: I guess we can deal with this separately. IMO we should have a 'jmpb' instruction or something…
		dberrisAuthorUnsubmitted Not Done Reply Inline Actions I'm happy to do this as a refactoring or a follow-up change. A new instruction sounds like the right action to take anyway. dberris: I'm happy to do this as a refactoring or a follow-up change. A new instruction sounds like the…
		EmitNops(*OutStreamer, 9, Subtarget->is64Bit(), getSubtargetInfo());
		sanjoyUnsubmitted Done Reply Inline Actions Indent is off. sanjoy: Indent is off.
		OutStreamer->EmitLabel(Target);
		recordSled(CurSled, MI, SledKind::FUNCTION_ENTER);
		}

		void X86AsmPrinter::LowerPATCHABLE_RET(const MachineInstr &MI,
		X86MCInstLower &MCIL) {
		// Since PATCHABLE_RET takes the opcode of the return statement as an
		// argument, we use that to emit the correct form of the RET that we want.
		// i.e. when we see this:
		//
		// PATCHABLE_RET X86::RET ...
		//
		// We should emit the RET followed by sleds.
		//
		// .Lxray_sled_N:
		// ret # or equivalent instruction
		// # 10 bytes worth of noops
		//
		// This just makes sure that the alignment for the next instruction is 2.
		auto CurSled = OutContext.createTempSymbol("xray_sled_", true);
		OutStreamer->EmitLabel(CurSled);
		unsigned OpCode = MI.getOperand(0).getImm();
		MCInst Ret;
		Ret.setOpcode(OpCode);
		for (auto &MO : make_range(MI.operands_begin() + 1, MI.operands_end()))
		if (auto MaybeOperand = MCIL.LowerMachineOperand(&MI, MO))
		Ret.addOperand(MaybeOperand.getValue());
		OutStreamer->EmitInstruction(Ret, getSubtargetInfo());
		EmitNops(*OutStreamer, 10, Subtarget->is64Bit(), getSubtargetInfo());
		recordSled(CurSled, MI, SledKind::FUNCTION_EXIT);
		}

		void X86AsmPrinter::EmitXRayTable() {
		for (const auto &Sled : Sleds) {
		if (Subtarget->isTargetELF()) {
		MCSection *Section = OutStreamer->getNamedGroupSection(
		"xray_instr_map", CurrentFnSym->getName(), 32);
		auto PrevSection = OutStreamer->getCurrentSectionOnly();
		OutStreamer->SwitchSection(Section);
		OutStreamer->EmitSymbolValue(Sled.Sled, 8);
		OutStreamer->EmitSymbolValue(CurrentFnSym, 8);
		auto Kind = static_cast<uint8_t>(Sled.Kind);
		OutStreamer->EmitBytes(
		StringRef(reinterpret_cast<const char *>(&Kind), 1));
		OutStreamer->EmitBytes(
		StringRef(reinterpret_cast<const char *>(&Sled.AlwaysInstrument), 1));
		OutStreamer->EmitZeros(14);
		OutStreamer->SwitchSection(PrevSection);
		}
		}
		Sleds.clear();
		}

// Returns instruction preceding MBBI in MachineFunction.		// Returns instruction preceding MBBI in MachineFunction.
// If MBBI is the first instruction of the first basic block, returns null.		// If MBBI is the first instruction of the first basic block, returns null.
static MachineBasicBlock::const_iterator		static MachineBasicBlock::const_iterator
PrevCrossBBInst(MachineBasicBlock::const_iterator MBBI) {		PrevCrossBBInst(MachineBasicBlock::const_iterator MBBI) {
const MachineBasicBlock *MBB = MBBI->getParent();		const MachineBasicBlock *MBB = MBBI->getParent();
while (MBBI == MBB->begin()) {		while (MBBI == MBB->begin()) {
if (MBB == &MBB->getParent()->front())		if (MBB == &MBB->getParent()->front())
return nullptr;		return nullptr;
▲ Show 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	case TargetOpcode::PATCHABLE_OP:
return LowerPATCHABLE_OP(*MI, MCInstLowering);		return LowerPATCHABLE_OP(*MI, MCInstLowering);

case TargetOpcode::STACKMAP:		case TargetOpcode::STACKMAP:
return LowerSTACKMAP(*MI);		return LowerSTACKMAP(*MI);

case TargetOpcode::PATCHPOINT:		case TargetOpcode::PATCHPOINT:
return LowerPATCHPOINT(*MI, MCInstLowering);		return LowerPATCHPOINT(*MI, MCInstLowering);

		case TargetOpcode::PATCHABLE_FUNCTION_ENTER:
		return LowerPATCHABLE_FUNCTION_ENTER(*MI, MCInstLowering);

		case TargetOpcode::PATCHABLE_RET:
		return LowerPATCHABLE_RET(*MI, MCInstLowering);

case X86::MORESTACK_RET:		case X86::MORESTACK_RET:
EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));		EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));
return;		return;

case X86::MORESTACK_RET_RESTORE_R10:		case X86::MORESTACK_RET_RESTORE_R10:
// Return, then restore R10.		// Return, then restore R10.
EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));		EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));
EmitAndCountInstruction(MCInstBuilder(X86::MOV64rr)		EmitAndCountInstruction(MCInstBuilder(X86::MOV64rr)
▲ Show 20 Lines • Show All 293 Lines • Show Last 20 Lines

test/CodeGen/X86/xray-attribute-instrumentation.ll

This file was added.

				; RUN: llc -filetype=asm -o - -mtriple=x86_64-apple-macosx < %s \| FileCheck %s

				define i32 @foo() nounwind noinline uwtable "function-instrument"="xray-always" {
				; CHECK-LABEL: Lxray_sled_0:
				; CHECK-NEXT: .p2align 2, 0x90
				; CHECK-NEXT: jmp ltmp0
				; CHECK-NEXT: nopw 512(%rax,%rax)
				; CHECK-LABEL: ltmp0:
				ret i32 0
				; CHECK-LABEL: Lxray_sled_1:
				; CHECK-NEXT: retq
				; CHECK-NEXT: nopw %cs:512(%rax,%rax)
				}

test/CodeGen/X86/xray-selective-instrumentation-miss.ll

This file was added.

				; RUN: llc -mcpu=nehalem < %s \| not grep xray_sled_

				target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128"
				target triple = "x86_64-apple-darwin8"

				define i32 @foo() nounwind uwtable "xray-instruction-threshold"="3" {
				entry:
				ret i32 0
				}

test/CodeGen/X86/xray-selective-instrumentation.ll

This file was added.

				; RUN: llc -mcpu=nehalem < %s \| grep xray_sled_

				target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128"
				target triple = "x86_64-apple-darwin8"

				define i32 @foo() nounwind uwtable "xray-instruction-threshold"="1" {
				entry:
				ret i32 0
				}

test/TableGen/trydecode-emission.td

Show All 30 Lines	def InstB : TestInstruction {
let AsmString = "InstB";		let AsmString = "InstB";
let DecoderMethod = "DecodeInstB";		let DecoderMethod = "DecodeInstB";
let hasCompleteDecoder = 0;		let hasCompleteDecoder = 0;
}		}

// CHECK: /* 0 */ MCD::OPC_ExtractField, 4, 4, // Inst{7-4} ...		// CHECK: /* 0 */ MCD::OPC_ExtractField, 4, 4, // Inst{7-4} ...
// CHECK-NEXT: /* 3 */ MCD::OPC_FilterValue, 0, 14, 0, // Skip to: 21		// CHECK-NEXT: /* 3 */ MCD::OPC_FilterValue, 0, 14, 0, // Skip to: 21
// CHECK-NEXT: /* 7 */ MCD::OPC_CheckField, 2, 2, 0, 5, 0, // Skip to: 18		// CHECK-NEXT: /* 7 */ MCD::OPC_CheckField, 2, 2, 0, 5, 0, // Skip to: 18
// CHECK-NEXT: /* 13 */ MCD::OPC_TryDecode, 28, 0, 0, 0, // Opcode: InstB, skip to: 18		// CHECK-NEXT: /* 13 */ MCD::OPC_TryDecode, 30, 0, 0, 0, // Opcode: InstB, skip to: 18
// CHECK-NEXT: /* 18 */ MCD::OPC_Decode, 27, 1, // Opcode: InstA		// CHECK-NEXT: /* 18 */ MCD::OPC_Decode, 28, 1, // Opcode: InstA
// CHECK-NEXT: /* 21 */ MCD::OPC_Fail,		// CHECK-NEXT: /* 21 */ MCD::OPC_Fail,

// CHECK: if (DecodeInstB(MI, insn, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }		// CHECK: if (DecodeInstB(MI, insn, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }

test/TableGen/trydecode-emission2.td

Show All 29 Lines	def InstB : TestInstruction {
let hasCompleteDecoder = 0;		let hasCompleteDecoder = 0;
}		}

// CHECK: /* 0 */ MCD::OPC_ExtractField, 2, 1, // Inst{2} ...		// CHECK: /* 0 */ MCD::OPC_ExtractField, 2, 1, // Inst{2} ...
// CHECK-NEXT: /* 3 */ MCD::OPC_FilterValue, 0, 29, 0, // Skip to: 36		// CHECK-NEXT: /* 3 */ MCD::OPC_FilterValue, 0, 29, 0, // Skip to: 36
// CHECK-NEXT: /* 7 */ MCD::OPC_ExtractField, 5, 3, // Inst{7-5} ...		// CHECK-NEXT: /* 7 */ MCD::OPC_ExtractField, 5, 3, // Inst{7-5} ...
// CHECK-NEXT: /* 10 */ MCD::OPC_FilterValue, 0, 22, 0, // Skip to: 36		// CHECK-NEXT: /* 10 */ MCD::OPC_FilterValue, 0, 22, 0, // Skip to: 36
// CHECK-NEXT: /* 14 */ MCD::OPC_CheckField, 0, 2, 3, 5, 0, // Skip to: 25		// CHECK-NEXT: /* 14 */ MCD::OPC_CheckField, 0, 2, 3, 5, 0, // Skip to: 25
// CHECK-NEXT: /* 20 */ MCD::OPC_TryDecode, 28, 0, 0, 0, // Opcode: InstB, skip to: 25		// CHECK-NEXT: /* 20 */ MCD::OPC_TryDecode, 30, 0, 0, 0, // Opcode: InstB, skip to: 25
// CHECK-NEXT: /* 25 */ MCD::OPC_CheckField, 3, 2, 0, 5, 0, // Skip to: 36		// CHECK-NEXT: /* 25 */ MCD::OPC_CheckField, 3, 2, 0, 5, 0, // Skip to: 36
// CHECK-NEXT: /* 31 */ MCD::OPC_TryDecode, 27, 1, 0, 0, // Opcode: InstA, skip to: 36		// CHECK-NEXT: /* 31 */ MCD::OPC_TryDecode, 29, 1, 0, 0, // Opcode: InstA, skip to: 36
// CHECK-NEXT: /* 36 */ MCD::OPC_Fail,		// CHECK-NEXT: /* 36 */ MCD::OPC_Fail,

// CHECK: if (DecodeInstB(MI, insn, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }		// CHECK: if (DecodeInstB(MI, insn, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }
// CHECK: if (DecodeInstA(MI, insn, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }		// CHECK: if (DecodeInstA(MI, insn, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }

test/TableGen/trydecode-emission3.td

Show All 31 Lines	def InstB : TestInstruction {
let Inst{1-0} = op;		let Inst{1-0} = op;
let OutOperandList = (outs InstBOp:$op);		let OutOperandList = (outs InstBOp:$op);
let AsmString = "InstB";		let AsmString = "InstB";
}		}

// CHECK: /* 0 */ MCD::OPC_ExtractField, 4, 4, // Inst{7-4} ...		// CHECK: /* 0 */ MCD::OPC_ExtractField, 4, 4, // Inst{7-4} ...
// CHECK-NEXT: /* 3 */ MCD::OPC_FilterValue, 0, 14, 0, // Skip to: 21		// CHECK-NEXT: /* 3 */ MCD::OPC_FilterValue, 0, 14, 0, // Skip to: 21
// CHECK-NEXT: /* 7 */ MCD::OPC_CheckField, 2, 2, 0, 5, 0, // Skip to: 18		// CHECK-NEXT: /* 7 */ MCD::OPC_CheckField, 2, 2, 0, 5, 0, // Skip to: 18
// CHECK-NEXT: /* 13 */ MCD::OPC_TryDecode, 28, 0, 0, 0, // Opcode: InstB, skip to: 18		// CHECK-NEXT: /* 13 */ MCD::OPC_TryDecode, 30, 0, 0, 0, // Opcode: InstB, skip to: 18
// CHECK-NEXT: /* 18 */ MCD::OPC_Decode, 27, 1, // Opcode: InstA		// CHECK-NEXT: /* 18 */ MCD::OPC_Decode, 28, 1, // Opcode: InstA
// CHECK-NEXT: /* 21 */ MCD::OPC_Fail,		// CHECK-NEXT: /* 21 */ MCD::OPC_Fail,

// CHECK: if (DecodeInstBOp(MI, tmp, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }		// CHECK: if (DecodeInstBOp(MI, tmp, Address, Decoder) == MCDisassembler::Fail) { DecodeComplete = false; return MCDisassembler::Fail; }

utils/TableGen/InstrInfoEmitter.cpp

Show First 20 Lines • Show All 422 Lines • ▼ Show 20 Lines	void InstrInfoEmitter::run(raw_ostream &OS) {
// Create a TargetInstrInfo subclass to hide the MC layer initialization.		// Create a TargetInstrInfo subclass to hide the MC layer initialization.
OS << "#ifdef GET_INSTRINFO_HEADER\n";		OS << "#ifdef GET_INSTRINFO_HEADER\n";
OS << "#undef GET_INSTRINFO_HEADER\n";		OS << "#undef GET_INSTRINFO_HEADER\n";

std::string ClassName = TargetName + "GenInstrInfo";		std::string ClassName = TargetName + "GenInstrInfo";
OS << "namespace llvm {\n";		OS << "namespace llvm {\n";
OS << "struct " << ClassName << " : public TargetInstrInfo {\n"		OS << "struct " << ClassName << " : public TargetInstrInfo {\n"
<< " explicit " << ClassName		<< " explicit " << ClassName
<< "(int CFSetupOpcode = -1, int CFDestroyOpcode = -1, int CatchRetOpcode = -1);\n"		<< "(int CFSetupOpcode = -1, int CFDestroyOpcode = -1, int CatchRetOpcode = -1, int ReturnOpcode = -1);\n"
<< " ~" << ClassName << "() override {}\n"		<< " ~" << ClassName << "() override {}\n"
<< "};\n";		<< "};\n";
OS << "} // end llvm namespace\n";		OS << "} // end llvm namespace\n";

OS << "#endif // GET_INSTRINFO_HEADER\n\n";		OS << "#endif // GET_INSTRINFO_HEADER\n\n";

OS << "#ifdef GET_INSTRINFO_CTOR_DTOR\n";		OS << "#ifdef GET_INSTRINFO_CTOR_DTOR\n";
OS << "#undef GET_INSTRINFO_CTOR_DTOR\n";		OS << "#undef GET_INSTRINFO_CTOR_DTOR\n";

OS << "namespace llvm {\n";		OS << "namespace llvm {\n";
OS << "extern const MCInstrDesc " << TargetName << "Insts[];\n";		OS << "extern const MCInstrDesc " << TargetName << "Insts[];\n";
OS << "extern const unsigned " << TargetName << "InstrNameIndices[];\n";		OS << "extern const unsigned " << TargetName << "InstrNameIndices[];\n";
OS << "extern const char " << TargetName << "InstrNameData[];\n";		OS << "extern const char " << TargetName << "InstrNameData[];\n";
OS << ClassName << "::" << ClassName		OS << ClassName << "::" << ClassName
<< "(int CFSetupOpcode, int CFDestroyOpcode, int CatchRetOpcode)\n"		<< "(int CFSetupOpcode, int CFDestroyOpcode, int CatchRetOpcode, int ReturnOpcode)\n"
<< " : TargetInstrInfo(CFSetupOpcode, CFDestroyOpcode, CatchRetOpcode) {\n"		<< " : TargetInstrInfo(CFSetupOpcode, CFDestroyOpcode, CatchRetOpcode, ReturnOpcode) {\n"
<< " InitMCInstrInfo(" << TargetName << "Insts, " << TargetName		<< " InitMCInstrInfo(" << TargetName << "Insts, " << TargetName
<< "InstrNameIndices, " << TargetName << "InstrNameData, "		<< "InstrNameIndices, " << TargetName << "InstrNameData, "
<< NumberedInstructions.size() << ");\n}\n";		<< NumberedInstructions.size() << ");\n}\n";
OS << "} // end llvm namespace\n";		OS << "} // end llvm namespace\n";

OS << "#endif // GET_INSTRINFO_CTOR_DTOR\n\n";		OS << "#endif // GET_INSTRINFO_CTOR_DTOR\n\n";

emitOperandNameMappings(OS, Target, NumberedInstructions);		emitOperandNameMappings(OS, Target, NumberedInstructions);
▲ Show 20 Lines • Show All 151 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

XRay: Add entry and exit sledsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 61929

include/llvm/CodeGen/Passes.h

include/llvm/InitializePasses.h

include/llvm/MC/MCELFStreamer.h

include/llvm/MC/MCObjectFileInfo.h

include/llvm/MC/MCStreamer.h

include/llvm/Target/Target.td

include/llvm/Target/TargetInstrInfo.h

include/llvm/Target/TargetOpcodes.def

lib/CodeGen/CMakeLists.txt

lib/CodeGen/CodeGen.cpp

lib/CodeGen/TargetPassConfig.cpp

lib/CodeGen/XRayInstrumentation.cpp

lib/MC/MCELFStreamer.cpp

lib/MC/MCStreamer.cpp

lib/Target/X86/X86AsmPrinter.h

lib/Target/X86/X86AsmPrinter.cpp

lib/Target/X86/X86FrameLowering.cpp

lib/Target/X86/X86InstrInfo.cpp

lib/Target/X86/X86MCInstLower.cpp

test/CodeGen/X86/xray-attribute-instrumentation.ll

test/CodeGen/X86/xray-selective-instrumentation-miss.ll

test/CodeGen/X86/xray-selective-instrumentation.ll

test/TableGen/trydecode-emission.td

test/TableGen/trydecode-emission2.td

test/TableGen/trydecode-emission3.td

utils/TableGen/InstrInfoEmitter.cpp

XRay: Add entry and exit sleds
ClosedPublic