This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/trunk/
-
trunk/
-
include/llvm/
-
llvm/
-
CodeGen/
-
Passes.h
-
InitializePasses.h
-
Target/
-
Target.td
-
TargetInstrInfo.h
-
TargetOpcodes.def
-
lib/
-
CodeGen/
-
CMakeLists.txt
-
CodeGen.cpp
-
TargetPassConfig.cpp
-
XRayInstrumentation.cpp
-
Target/X86/
-
X86/
-
X86AsmPrinter.h
-
X86AsmPrinter.cpp
-
X86FrameLowering.cpp
-
X86InstrInfo.cpp
-
X86MCInstLower.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
-
xray-attribute-instrumentation.ll
-
xray-selective-instrumentation-miss.ll
-
xray-selective-instrumentation.ll
-
utils/TableGen/
-
TableGen/
-
InstrInfoEmitter.cpp

Differential D19904

XRay: Add entry and exit sleds
ClosedPublic

Authored by dberris on May 4 2016, 12:01 AM.

Download Raw Diff

Details

Reviewers

kcc
eugenis
pcc
echristo
sanjoy
rnk

Commits

rG52735fc435b1: XRay: Add entry and exit sleds
rL275367: XRay: Add entry and exit sleds

Summary

In this patch we implement the following parts of XRay:

Supporting a function attribute named 'function-instrument' which currently only supports 'xray-always'. We should be able to use this attribute for other instrumentation approaches.
Supporting a function attribute named 'xray-instruction-threshold' used to determine whether a function is instrumented with a minimum number of instructions (IR instruction counts).
X86-specific nop sleds as described in the white paper.
A machine function pass that adds the different instrumentation marker instructions at a very late stage.
A way of identifying which return opcode is considered "normal" for each architecture.

There are some caveats here:

We don't handle PATCHABLE_RET in platforms other than x86_64 yet -- this means if IR used PATCHABLE_RET directly instead of a normal ret, instruction lowering for that platform might do the wrong thing. We think this should be handled at instruction selection time to by default be unpacked for platforms where XRay is not availble yet.

The generated section for X86 is different from what is described from the white paper for the sole reason that LLVM allows us to do this neatly. We're taking the opportunity to deviate from the white paper from this perspective to allow us to get richer information from the runtime library.

Diff Detail

Repository: rL LLVM

Event Timeline

dberris updated this revision to Diff 56097.May 4 2016, 12:01 AM

dberris retitled this revision from to XRay: Add entry and exit sleds.

dberris updated this object.

dberris added reviewers: echristo, sanjoy.

dberris set the repository for this revision to rL LLVM.

dberris added a subscriber: llvm-commits.

Herald added a subscriber: mehdi_amini. · View Herald TranscriptMay 4 2016, 12:01 AM

BTW, It's worth mentioning that this is a prototype implementation and we're fine with making large changes for this patch.

mcrosier added a subscriber: mcrosier.May 4 2016, 6:09 AM

mcrosier added a subscriber: bmakam.May 4 2016, 6:09 AM

emaste added a subscriber: emaste.May 4 2016, 2:16 PM

Some minor comments inline. I'm not familiar with all of the intricacies of the backend, so I'll CC some other people who are likely to have more perspective on this.

include/llvm/Target/Target.td
957 ↗	(On Diff #56097)	Looks like this isn't used yet? If so, I'd suggest dropping it completely for now.
lib/CodeGen/XRayInstrumentation.cpp
80 ↗	(On Diff #56097)	Why do you care only about register operands here?
lib/Target/X86/X86MCInstLower.cpp
1068 ↗	(On Diff #56097)	Why not AlwaysInstrument = !Attr.hasAttribute(Attribute::None) && Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always"; ?
1069 ↗	(On Diff #56097)	I don't quite grok the Attributes API, but I think the `!Attr.hasAttribute(Attribute::None)` bit is redundant -- if `isStringAttribute` return `true`, then `Attr.hasAttribute(Attribute::None)` will be false.
1072 ↗	(On Diff #56097)	emplace_back ?
1100 ↗	(On Diff #56097)	Indent is off.

sanjoy edited edge metadata.May 5 2016, 3:40 PM

sanjoy added subscribers: rnk, atrick, majnemer.

majnemer added inline comments.May 5 2016, 4:27 PM

lib/CodeGen/XRayInstrumentation.cpp
72–77 ↗	(On Diff #56097)	I don't think you want to replace whacky constructs like `EH_RETURN` and `CLEANUPRET` with `PATCHABLE_RET`, do you? Just "normal" returns, right?

Address comments from sanjoy and majnemer

lib/CodeGen/XRayInstrumentation.cpp
72–77 ↗	(On Diff #56097)	Yes, that's right -- does this version exclude those appropriately?
80 ↗	(On Diff #56097)	Oops, I had assumed that the RET instructions only ever had register operands. That's a faulty assumption, good catch.

majnemer added inline comments.May 6 2016, 10:37 AM

lib/CodeGen/XRayInstrumentation.cpp
72–77 ↗	(On Diff #56381)	I would recommend white-listing the opcodes you know are safe to replace instead of blacklisting the ones you know which are unsafe. This would make the pass conservatively correct in the face of future changes to LLVM.

dberris added inline comments.May 8 2016, 9:19 PM

lib/CodeGen/XRayInstrumentation.cpp
72–77 ↗	(On Diff #56381)	That makes sense, thanks majnemer -- however it's not clear to me how I'd refer to the target-specific opcodes from here. Pardon the newbie question, but how do I say "I just want return instructions here"? The only options I can see are: Look at the original LLVM IR to see if it's a ReturnInst. Figure out a way for including the X86 opcodes here. Maybe I'm missing something simpler here?

Update operand unpacking properly.

clang-format

majnemer added inline comments.May 8 2016, 10:25 PM

lib/CodeGen/XRayInstrumentation.cpp
72–76 ↗	(On Diff #56525)	The typical way of doing this sort of thing, AFAICT, is to teach `TargetInstrInfo` about it. For example, it implements `getCatchReturnOpcode` and `getCallFrameSetupOpcode`.

Add a check in TargetInstrInfo for whether a return is a "normal" return.

We also implement this somewhat correctly in X86, but use the default
implementation for other platforms.

dberris marked an inline comment as done.May 8 2016, 11:06 PM

dberris added inline comments.

lib/CodeGen/XRayInstrumentation.cpp
72–74 ↗	(On Diff #56527)	Thanks! I've updated the patch to do this. PTAL?

Fix the uploaded diff.

majnemer added inline comments.May 9 2016, 7:45 AM

include/llvm/Target/TargetInstrInfo.h
155–166 ↗	(On Diff #56531)	I was thinking more of a whilelist-oriented solution, something like: `unsigned getNormalReturnOpcode() const { return NormalRetOpcode; }`

bmakam added inline comments.May 9 2016, 9:56 AM

lib/Target/X86/X86MCInstLower.cpp
1082 ↗	(On Diff #56531)	Could you please expand on why you need 9 bytes of noops here? I am not quite familiar with x86_64 but was under the impression that on x86_64 the jmp instruction is 1 byte for opcode and 4 bytes for signed relative displacement, so shouldn't 5 bytes worth of nops be sufficient?

dberris added inline comments.May 9 2016, 7:10 PM

include/llvm/Target/TargetInstrInfo.h
155–166 ↗	(On Diff #56531)	But how does this work on platforms that can spell 'return' many different normal ways (like in X86)? There's RETL, RETQ, and all other versions of RET which are considered "normal"?
lib/Target/X86/X86MCInstLower.cpp
1082 ↗	(On Diff #56531)	Good question, thanks. I have to check whether we're using the right version of JMP, but I'm specifically looking for the version that's one byte for the JMP instruction, and 8 bits (1 byte) for the relative offset. So far I haven't been able to spell `jmp +0x09` correctly and have it work, without having an additional symbol as a target for the jump instruction. If we get that right, we can then add the 9 byte nops we need to get exactly 11 bytes for the function entry. Is there a fool-proof way of spelling "JMP +0x09" with the builder interface? Or should I add another JMP instruction in X86 that supports the 8-bit displacement immediate operand? The reason why I can't use a JMP that isn't 2 bytes, is because it's really hard to write just 5 bytes atomically. I can probably do something with an 8-byte atomic write, but enforcing that 8-byte write doesn't span cache lines is also very tricky to make safe.

dberris added inline comments.May 9 2016, 9:49 PM

lib/Target/X86/X86MCInstLower.cpp

1082 ↗

(On Diff #56531)

Actually now that I've had a look at the generated object file, I can confirm that we're using the two-byte version of JMP with this construct. Quick test:

test.cc:

#include <cstdio>

[[clang::xray_always_instrument]] void foo() { std::printf("Hello, XRay!\n"); }

int main(int argc, char* argv[]) { foo(); }

Compiled with (modified clang to emit IR that has annotated functions for XRay):

./bin/clang -fxray-instrument -fxray-instruction-threshold=1 -std=c++11 -x c++ -S test.cc -emit-llvm

Creates this IR listing:

; ModuleID = 'test.cc'
source_filename = "test.cc"
target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

@.str = private unnamed_addr constant [14 x i8] c"Hello, XRay!\0A\00", align 1

; Function Attrs: uwtable
define void @_Z3foov() #0 {
entry:
  %call = call i32 (i8*, ...) @printf(i8* getelementptr inbounds ([14 x i8], [14 x i8]* @.str, i32 0, i32 0))
  ret void
}

declare i32 @printf(i8*, ...) #1

; Function Attrs: norecurse uwtable
define i32 @main(i32 %argc, i8** %argv) #2 {
entry:
  %argc.addr = alloca i32, align 4
  %argv.addr = alloca i8**, align 8
  store i32 %argc, i32* %argc.addr, align 4
  store i8** %argv, i8*** %argv.addr, align 8
  call void @_Z3foov()
  ret i32 0
}

attributes #0 = { uwtable "disable-tail-calls"="false" "function-instrument"="xray-always" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #2 = { norecurse uwtable "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="true" "no-frame-pointer-elim-non-leaf" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" "xray-instruction-threshold"="1" }

!llvm.ident = !{!0}

!0 = !{!"clang version 3.9.0 (http://llvm.org/git/clang.git 2b9ed9227330789f24e56fe9e800e7be2111073b) (http://llvm.org/git/llvm.git 1f3e353113ff9e2f835955bf005a8a5e25f16ad1)"}

Then compiled+disassembled this way:

./bin/llc -filetype=obj -o - < test.ll | ./bin/llvm-objdump -disassemble -

Produces the following output:

<stdin>:        file format ELF64-x86-64

Disassembly of section .text:
_Z3foov:
       0:       eb 09   jmp     9 <_Z3foov+0xB>
       2:       66 0f 1f 84 00 00 02 00 00      nopw    512(%rax,%rax)
       b:       55      pushq   %rbp
       c:       48 89 e5        movq    %rsp, %rbp
       f:       bf 00 00 00 00  movl    $0, %edi
      14:       31 c0   xorl    %eax, %eax
      16:       e8 00 00 00 00  callq   0 <_Z3foov+0x1B>
      1b:       5d      popq    %rbp
      1c:       c3      retq
      1d:       2e 66 0f 1f 84 00 00 02 00 00   nopw    %cs:512(%rax,%rax)
      27:       66 0f 1f 84 00 00 00 00 00      nopw    (%rax,%rax)

main:
      30:       eb 09   jmp     9 <main+0xB>
      32:       66 0f 1f 84 00 00 02 00 00      nopw    512(%rax,%rax)
      3b:       55      pushq   %rbp
      3c:       48 89 e5        movq    %rsp, %rbp
      3f:       48 83 ec 10     subq    $16, %rsp
      43:       89 7d fc        movl    %edi, -4(%rbp)
      46:       48 89 75 f0     movq    %rsi, -16(%rbp)
      4a:       e8 b1 ff ff ff  callq   -79 <_Z3foov>
      4f:       31 c0   xorl    %eax, %eax
      51:       48 83 c4 10     addq    $16, %rsp
      55:       5d      popq    %rbp
      56:       c3      retq
      57:       2e 66 0f 1f 84 00 00 02 00 00   nopw    %cs:512(%rax,%rax)

Add documentation on why we need 9 bytes of nops

dberris marked an inline comment as done.May 10 2016, 9:02 PM

dberris added inline comments.

lib/Target/X86/X86MCInstLower.cpp
1082 ↗	(On Diff #56850)	Updated the comments now to make it clear and specific. Thanks!

majnemer added inline comments.May 10 2016, 9:03 PM

include/llvm/Target/TargetInstrInfo.h
155–166 ↗	(On Diff #56850)	The same way it is handled for `X86::ADJCALLSTACKDOWN64` vs `X86::ADJCALLSTACKDOWN32`: https://github.com/llvm-mirror/llvm/blob/master/lib/Target/X86/X86InstrInfo.cpp#L105

dberris added inline comments.May 10 2016, 9:08 PM

include/llvm/Target/TargetInstrInfo.h
155–166 ↗	(On Diff #56850)	That makes a lot of sense, thanks! I'll rework this a bit to make it more of a targeted specific OpCode. I suppose it's fine to have multiples of these functions, each for relevant flavours of fairly common instruction classes, yes? So I suppose, for `tail call` exits/returns, we would have something similar for that?

Make ReturnOpcode an argument to TargetInstrInfo constructor and whitelist one ReturnOpcode instead

dberris marked an inline comment as done.May 10 2016, 10:15 PM

dberris added inline comments.

include/llvm/Target/TargetInstrInfo.h
157–168 ↗	(On Diff #56851)	Updated now to white-list a specific return OpCode.

dberris updated this object.May 12 2016, 2:27 AM

dberris added reviewers: rnk, eugenis, kcc, pcc.

Ping -- is there something else missing for this change?

Marked more inline comments as "Done".

Drive by comment.

include/llvm/Target/TargetOpcodes.def
145 ↗	(On Diff #56851)	Remove the XRay specific stuff here and rewrite in a more generic fashion (and below).

Update documentation to remove XRay references

dberris marked an inline comment as done.May 17 2016, 7:50 PM

dberris added inline comments.

include/llvm/Target/TargetOpcodes.def
145 ↗	(On Diff #57556)	Done

dberris added a child revision: D20352: Add XRay flags to Clang.May 17 2016, 10:34 PM

Ping?

Sync to master again.

Fixes after merge
Emit a common symbol for the instrumentation map
Improve table creation and allow access to the table via xray_instr_map and xray_instr_map_end
Fix the sled type encoding

niravd added a subscriber: niravd.Jun 22 2016, 6:17 AM

dberris added a child revision: D21612: [compiler-rt] [XRay] Basic initialization and flag definition for XRay runtime.Jun 22 2016, 10:29 AM

Use an explicit byte sequence for the jmp

dberris added a parent revision: D21743: Convenience function for MCContext to get named sections when lowering..Jun 26 2016, 8:58 PM

Make XRay write out tables per function

Clean up local history, squashing to a single revision (rebased too)

Rebase
Add ELF::SHF_MERGE to flags for XRay section
Update tests to reflect adjustments in implementation

Un-break XRay section creation.

dberris mentioned this in D20352: Add XRay flags to Clang.Jul 5 2016, 10:36 PM

lgtm

lib/Target/X86/X86MCInstLower.cpp
1051 ↗	(On Diff #62716)	Does XRay support multiple DSOs in the process? If so, the XRay runtime may be more than 2GB away from the code being patched, and this offset will overflow.
1060–1061 ↗	(On Diff #62716)	I guess we can deal with this separately. IMO we should have a 'jmpb' instruction or something that forces a short jump or assembler error if the displacement is too large.

This revision is now accepted and ready to land.Jul 12 2016, 9:18 AM

Thanks Reid -- do you or echristo@ mind landing this for me? I don't think I have commit powers yet.

lib/Target/X86/X86MCInstLower.cpp
1051 ↗	(On Diff #62716)	That's true -- the runtime patching code will fail to patch certain sleds that are farther than 32-bits away. In case we need more, in the future we can tweak this so that we can try to force the XRay runtime trampoline in the first 2GB of the process' memory and use an "absolute" address (which may need more bytes in the sled), or have a mode to support larger binaries (a flag that changes the emitted sleds, and have the runtime support both kinds of sleds). I suspect if it comes up in practice that we can deal with this more effectively.
1060–1061 ↗	(On Diff #62716)	I'm happy to do this as a refactoring or a follow-up change. A new instruction sounds like the right action to take anyway.

Rebase

Closed by commit rL275367: XRay: Add entry and exit sleds (authored by dberris). · Explain WhyJul 13 2016, 9:13 PM

This revision was automatically updated to reflect the committed changes.

dberris mentioned this in D21612: [compiler-rt] [XRay] Basic initialization and flag definition for XRay runtime.Aug 2 2016, 4:56 PM

rSerge added a child revision: D23931: [XRay] ARM 32-bit no-Thumb support in LLVM.Aug 26 2016, 9:59 AM

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

CodeGen/

Passes.h

4 lines

InitializePasses.h

1 line

Target/

Target.td

15 lines

TargetInstrInfo.h

7 lines

TargetOpcodes.def

13 lines

lib/

CodeGen/

CMakeLists.txt

1 line

CodeGen.cpp

1 line

TargetPassConfig.cpp

1 line

XRayInstrumentation.cpp

96 lines

Target/

X86/

41 lines

3 lines

1 line

3 lines

105 lines

test/

CodeGen/

X86/

xray-attribute-instrumentation.ll

13 lines

xray-selective-instrumentation-miss.ll

9 lines

xray-selective-instrumentation.ll

9 lines

utils/

TableGen/

InstrInfoEmitter.cpp

6 lines

Diff 63912

llvm/trunk/include/llvm/CodeGen/Passes.h

Show First 20 Lines • Show All 257 Lines • ▼ Show 20 Lines	/// MachineDominanaceFrontier - This pass is a machine dominators analysis pass.
extern char &OptimizePHIsID;		extern char &OptimizePHIsID;

/// StackSlotColoring - This pass performs stack slot coloring.		/// StackSlotColoring - This pass performs stack slot coloring.
extern char &StackSlotColoringID;		extern char &StackSlotColoringID;

/// \brief This pass lays out funclets contiguously.		/// \brief This pass lays out funclets contiguously.
extern char &FuncletLayoutID;		extern char &FuncletLayoutID;

		/// This pass inserts the XRay instrumentation sleds if they are supported by
		/// the target platform.
		extern char &XRayInstrumentationID;

/// \brief This pass implements the "patchable-function" attribute.		/// \brief This pass implements the "patchable-function" attribute.
extern char &PatchableFunctionID;		extern char &PatchableFunctionID;

/// createStackProtectorPass - This pass adds stack protectors to functions.		/// createStackProtectorPass - This pass adds stack protectors to functions.
///		///
FunctionPass createStackProtectorPass(const TargetMachine TM);		FunctionPass createStackProtectorPass(const TargetMachine TM);

/// createMachineVerifierPass - This pass verifies cenerated machine code		/// createMachineVerifierPass - This pass verifies cenerated machine code
▲ Show 20 Lines • Show All 131 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/InitializePasses.h

	Show First 20 Lines • Show All 329 Lines • ▼ Show 20 Lines
	void initializeUnreachableBlockElimLegacyPassPass(PassRegistry&);			void initializeUnreachableBlockElimLegacyPassPass(PassRegistry&);
	void initializeUnreachableMachineBlockElimPass(PassRegistry&);			void initializeUnreachableMachineBlockElimPass(PassRegistry&);
	void initializeVerifierLegacyPassPass(PassRegistry&);			void initializeVerifierLegacyPassPass(PassRegistry&);
	void initializeVirtRegMapPass(PassRegistry&);			void initializeVirtRegMapPass(PassRegistry&);
	void initializeVirtRegRewriterPass(PassRegistry&);			void initializeVirtRegRewriterPass(PassRegistry&);
	void initializeWholeProgramDevirtPass(PassRegistry &);			void initializeWholeProgramDevirtPass(PassRegistry &);
	void initializeWinEHPreparePass(PassRegistry&);			void initializeWinEHPreparePass(PassRegistry&);
	void initializeWriteBitcodePassPass(PassRegistry &);			void initializeWriteBitcodePassPass(PassRegistry &);
				void initializeXRayInstrumentationPass(PassRegistry &);
	}			}

	#endif			#endif

llvm/trunk/include/llvm/Target/Target.td

	Show First 20 Lines • Show All 940 Lines • ▼ Show 20 Lines
	def PATCHABLE_OP : Instruction {			def PATCHABLE_OP : Instruction {
	let OutOperandList = (outs unknown:$dst);			let OutOperandList = (outs unknown:$dst);
	let InOperandList = (ins variable_ops);			let InOperandList = (ins variable_ops);
	let usesCustomInserter = 1;			let usesCustomInserter = 1;
	let mayLoad = 1;			let mayLoad = 1;
	let mayStore = 1;			let mayStore = 1;
	let hasSideEffects = 1;			let hasSideEffects = 1;
	}			}
				def PATCHABLE_FUNCTION_ENTER : Instruction {
				let OutOperandList = (outs);
				let InOperandList = (ins);
				let AsmString = "# XRay Function Enter.";
				let usesCustomInserter = 1;
				let hasSideEffects = 0;
				}
				def PATCHABLE_RET : Instruction {
				let OutOperandList = (outs unknown:$dst);
				let InOperandList = (ins variable_ops);
				let AsmString = "# XRay Function Exit.";
				let usesCustomInserter = 1;
				let hasSideEffects = 1;
				let isReturn = 1;
				}

	// Generic opcodes used in GlobalISel.			// Generic opcodes used in GlobalISel.
	include "llvm/Target/GenericOpcodes.td"			include "llvm/Target/GenericOpcodes.td"

	}			}

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// AsmParser - This class can be implemented by targets that wish to implement			// AsmParser - This class can be implemented by targets that wish to implement
	▲ Show 20 Lines • Show All 330 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/Target/TargetInstrInfo.h

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
///		///
/// TargetInstrInfo - Interface to description of machine instruction set		/// TargetInstrInfo - Interface to description of machine instruction set
///		///
class TargetInstrInfo : public MCInstrInfo {		class TargetInstrInfo : public MCInstrInfo {
TargetInstrInfo(const TargetInstrInfo &) = delete;		TargetInstrInfo(const TargetInstrInfo &) = delete;
void operator=(const TargetInstrInfo &) = delete;		void operator=(const TargetInstrInfo &) = delete;
public:		public:
TargetInstrInfo(unsigned CFSetupOpcode = ~0u, unsigned CFDestroyOpcode = ~0u,		TargetInstrInfo(unsigned CFSetupOpcode = ~0u, unsigned CFDestroyOpcode = ~0u,
unsigned CatchRetOpcode = ~0u)		unsigned CatchRetOpcode = ~0u, unsigned ReturnOpcode = ~0u)
: CallFrameSetupOpcode(CFSetupOpcode),		: CallFrameSetupOpcode(CFSetupOpcode),
CallFrameDestroyOpcode(CFDestroyOpcode),		CallFrameDestroyOpcode(CFDestroyOpcode),
CatchRetOpcode(CatchRetOpcode) {}		CatchRetOpcode(CatchRetOpcode),
		ReturnOpcode(ReturnOpcode) {}

virtual ~TargetInstrInfo();		virtual ~TargetInstrInfo();

static bool isGenericOpcode(unsigned Opc) {		static bool isGenericOpcode(unsigned Opc) {
return Opc <= TargetOpcode::GENERIC_OP_END;		return Opc <= TargetOpcode::GENERIC_OP_END;
}		}

/// Given a machine instruction descriptor, returns the register		/// Given a machine instruction descriptor, returns the register
▲ Show 20 Lines • Show All 76 Lines • ▼ Show 20 Lines	public:
/// if they exist (-1 otherwise). Some targets use pseudo instructions in		/// if they exist (-1 otherwise). Some targets use pseudo instructions in
/// order to abstract away the difference between operating with a frame		/// order to abstract away the difference between operating with a frame
/// pointer and operating without, through the use of these two instructions.		/// pointer and operating without, through the use of these two instructions.
///		///
unsigned getCallFrameSetupOpcode() const { return CallFrameSetupOpcode; }		unsigned getCallFrameSetupOpcode() const { return CallFrameSetupOpcode; }
unsigned getCallFrameDestroyOpcode() const { return CallFrameDestroyOpcode; }		unsigned getCallFrameDestroyOpcode() const { return CallFrameDestroyOpcode; }

unsigned getCatchReturnOpcode() const { return CatchRetOpcode; }		unsigned getCatchReturnOpcode() const { return CatchRetOpcode; }
		unsigned getReturnOpcode() const { return ReturnOpcode; }

/// Returns the actual stack pointer adjustment made by an instruction		/// Returns the actual stack pointer adjustment made by an instruction
/// as part of a call sequence. By default, only call frame setup/destroy		/// as part of a call sequence. By default, only call frame setup/destroy
/// instructions adjust the stack, but targets may want to override this		/// instructions adjust the stack, but targets may want to override this
/// to enable more fine-grained adjustment, or adjust by a different value.		/// to enable more fine-grained adjustment, or adjust by a different value.
virtual int getSPAdjust(const MachineInstr &MI) const;		virtual int getSPAdjust(const MachineInstr &MI) const;

/// Return true if the instruction is a "coalescable" extension instruction.		/// Return true if the instruction is a "coalescable" extension instruction.
▲ Show 20 Lines • Show All 1,273 Lines • ▼ Show 20 Lines	public:
virtual ArrayRef<std::pair<unsigned, const char *>>		virtual ArrayRef<std::pair<unsigned, const char *>>
getSerializableBitmaskMachineOperandTargetFlags() const {		getSerializableBitmaskMachineOperandTargetFlags() const {
return None;		return None;
}		}

private:		private:
unsigned CallFrameSetupOpcode, CallFrameDestroyOpcode;		unsigned CallFrameSetupOpcode, CallFrameDestroyOpcode;
unsigned CatchRetOpcode;		unsigned CatchRetOpcode;
		unsigned ReturnOpcode;
};		};

/// \brief Provide DenseMapInfo for TargetInstrInfo::RegSubRegPair.		/// \brief Provide DenseMapInfo for TargetInstrInfo::RegSubRegPair.
template<>		template<>
struct DenseMapInfo<TargetInstrInfo::RegSubRegPair> {		struct DenseMapInfo<TargetInstrInfo::RegSubRegPair> {
typedef DenseMapInfo<unsigned> RegInfo;		typedef DenseMapInfo<unsigned> RegInfo;

static inline TargetInstrInfo::RegSubRegPair getEmptyKey() {		static inline TargetInstrInfo::RegSubRegPair getEmptyKey() {
Show All 24 Lines

llvm/trunk/include/llvm/Target/TargetOpcodes.def

	Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines
	/// instruction wrapped in PATCHABLE_OP has to either have a minimum			/// instruction wrapped in PATCHABLE_OP has to either have a minimum
	/// size or be preceded with a nop of that size. The first operand is			/// size or be preceded with a nop of that size. The first operand is
	/// an immediate denoting the minimum size of the instruction, the			/// an immediate denoting the minimum size of the instruction, the
	/// second operand is an immediate denoting the opcode of the original			/// second operand is an immediate denoting the opcode of the original
	/// instruction. The rest of the operands are the operands of the			/// instruction. The rest of the operands are the operands of the
	/// original instruction.			/// original instruction.
	HANDLE_TARGET_OPCODE(PATCHABLE_OP, 23)			HANDLE_TARGET_OPCODE(PATCHABLE_OP, 23)

				/// This is a marker instruction which gets translated into a nop sled, useful
				/// for inserting instrumentation instructions at runtime.
				HANDLE_TARGET_OPCODE(PATCHABLE_FUNCTION_ENTER, 24)

				/// Wraps a return instruction and its operands to enable adding nop sleds
				/// either before or after the return. The nop sleds are useful for inserting
				/// instrumentation instructions at runtime.
				HANDLE_TARGET_OPCODE(PATCHABLE_RET, 25)

	/// The following generic opcodes are not supposed to appear after ISel.			/// The following generic opcodes are not supposed to appear after ISel.
	/// This is something we might want to relax, but for now, this is convenient			/// This is something we might want to relax, but for now, this is convenient
	/// to produce diagnostics.			/// to produce diagnostics.

	/// Generic ADD instruction. This is an integer add.			/// Generic ADD instruction. This is an integer add.
	HANDLE_TARGET_OPCODE(G_ADD, 24)			HANDLE_TARGET_OPCODE(G_ADD, 26)
	HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_START, G_ADD)			HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_START, G_ADD)

	/// Generic Bitwise-OR instruction.			/// Generic Bitwise-OR instruction.
	HANDLE_TARGET_OPCODE(G_OR, 25)			HANDLE_TARGET_OPCODE(G_OR, 25)

	/// Generic BRANCH instruction. This is an unconditional branch.			/// Generic BRANCH instruction. This is an unconditional branch.
	HANDLE_TARGET_OPCODE(G_BR, 26)			HANDLE_TARGET_OPCODE(G_BR, 27)

	// TODO: Add more generic opcodes as we move along.			// TODO: Add more generic opcodes as we move along.

	/// Marker for the end of the generic opcode.			/// Marker for the end of the generic opcode.
	/// This is used to check if an opcode is in the range of the			/// This is used to check if an opcode is in the range of the
	/// generic opcodes.			/// generic opcodes.
	HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_END, G_BR)			HANDLE_TARGET_OPCODE_MARKER(PRE_ISEL_GENERIC_OPCODE_END, G_BR)

	/// BUILTIN_OP_END - This must be the last enum value in this list.			/// BUILTIN_OP_END - This must be the last enum value in this list.
	/// The target-specific post-isel opcode values start here.			/// The target-specific post-isel opcode values start here.
	HANDLE_TARGET_OPCODE_MARKER(GENERIC_OP_END, PRE_ISEL_GENERIC_OPCODE_END)			HANDLE_TARGET_OPCODE_MARKER(GENERIC_OP_END, PRE_ISEL_GENERIC_OPCODE_END)

llvm/trunk/lib/CodeGen/CMakeLists.txt

Show First 20 Lines • Show All 130 Lines • ▼ Show 20 Lines	add_llvm_library(LLVMCodeGen
TargetOptionsImpl.cpp		TargetOptionsImpl.cpp
TargetPassConfig.cpp		TargetPassConfig.cpp
TargetRegisterInfo.cpp		TargetRegisterInfo.cpp
TargetSchedule.cpp		TargetSchedule.cpp
TwoAddressInstructionPass.cpp		TwoAddressInstructionPass.cpp
UnreachableBlockElim.cpp		UnreachableBlockElim.cpp
VirtRegMap.cpp		VirtRegMap.cpp
WinEHPrepare.cpp		WinEHPrepare.cpp
		XRayInstrumentation.cpp

ADDITIONAL_HEADER_DIRS		ADDITIONAL_HEADER_DIRS
${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen		${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen
${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen/PBQP		${LLVM_MAIN_INCLUDE_DIR}/llvm/CodeGen/PBQP

LINK_LIBS ${PTHREAD_LIB}		LINK_LIBS ${PTHREAD_LIB}
)		)

add_dependencies(LLVMCodeGen intrinsics_gen)		add_dependencies(LLVMCodeGen intrinsics_gen)

add_subdirectory(SelectionDAG)		add_subdirectory(SelectionDAG)
add_subdirectory(AsmPrinter)		add_subdirectory(AsmPrinter)
add_subdirectory(MIRParser)		add_subdirectory(MIRParser)
add_subdirectory(GlobalISel)		add_subdirectory(GlobalISel)

llvm/trunk/lib/CodeGen/CodeGen.cpp

Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines	void llvm::initializeCodeGen(PassRegistry &Registry) {
initializeMachineFunctionPrinterPassPass(Registry);		initializeMachineFunctionPrinterPassPass(Registry);
initializeMachineLICMPass(Registry);		initializeMachineLICMPass(Registry);
initializeMachineLoopInfoPass(Registry);		initializeMachineLoopInfoPass(Registry);
initializeMachineModuleInfoPass(Registry);		initializeMachineModuleInfoPass(Registry);
initializeMachinePostDominatorTreePass(Registry);		initializeMachinePostDominatorTreePass(Registry);
initializeMachineSchedulerPass(Registry);		initializeMachineSchedulerPass(Registry);
initializeMachineSinkingPass(Registry);		initializeMachineSinkingPass(Registry);
initializeMachineVerifierPassPass(Registry);		initializeMachineVerifierPassPass(Registry);
		initializeXRayInstrumentationPass(Registry);
initializePatchableFunctionPass(Registry);		initializePatchableFunctionPass(Registry);
initializeOptimizePHIsPass(Registry);		initializeOptimizePHIsPass(Registry);
initializePEIPass(Registry);		initializePEIPass(Registry);
initializePHIEliminationPass(Registry);		initializePHIEliminationPass(Registry);
initializePeepholeOptimizerPass(Registry);		initializePeepholeOptimizerPass(Registry);
initializePostMachineSchedulerPass(Registry);		initializePostMachineSchedulerPass(Registry);
initializePostRAHazardRecognizerPass(Registry);		initializePostRAHazardRecognizerPass(Registry);
initializePostRASchedulerPass(Registry);		initializePostRASchedulerPass(Registry);
Show All 26 Lines

llvm/trunk/lib/CodeGen/TargetPassConfig.cpp

Show First 20 Lines • Show All 649 Lines • ▼ Show 20 Lines	if (TM->Options.EnableIPRA)
// clobbered registers, to be used to optimize call sites.		// clobbered registers, to be used to optimize call sites.
addPass(createRegUsageInfoCollector());		addPass(createRegUsageInfoCollector());

addPass(&FuncletLayoutID, false);		addPass(&FuncletLayoutID, false);

addPass(&StackMapLivenessID, false);		addPass(&StackMapLivenessID, false);
addPass(&LiveDebugValuesID, false);		addPass(&LiveDebugValuesID, false);

		addPass(&XRayInstrumentationID, false);
addPass(&PatchableFunctionID, false);		addPass(&PatchableFunctionID, false);

AddingMachinePasses = false;		AddingMachinePasses = false;
}		}

/// Add passes that optimize machine instructions in SSA form.		/// Add passes that optimize machine instructions in SSA form.
void TargetPassConfig::addMachineSSAOptimization() {		void TargetPassConfig::addMachineSSAOptimization() {
// Pre-ra tail duplication.		// Pre-ra tail duplication.
▲ Show 20 Lines • Show All 222 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/XRayInstrumentation.cpp

				//===-- XRayInstrumentation.cpp - Adds XRay instrumentation to functions. -===//
				//
				// The LLVM Compiler Infrastructure
				//
				// This file is distributed under the University of Illinois Open Source
				// License. See LICENSE.TXT for details.
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements a MachineFunctionPass that inserts the appropriate
				// XRay instrumentation instructions. We look for XRay-specific attributes
				// on the function to determine whether we should insert the replacement
				// operations.
				//
				//===---------------------------------------------------------------------===//

				#include "llvm/CodeGen/Analysis.h"
				#include "llvm/CodeGen/MachineFunction.h"
				#include "llvm/CodeGen/MachineFunctionPass.h"
				#include "llvm/CodeGen/MachineInstrBuilder.h"
				#include "llvm/CodeGen/Passes.h"
				#include "llvm/Support/TargetRegistry.h"
				#include "llvm/Target/TargetInstrInfo.h"
				#include "llvm/Target/TargetSubtargetInfo.h"

				using namespace llvm;

				namespace {
				struct XRayInstrumentation : public MachineFunctionPass {
				static char ID;

				XRayInstrumentation() : MachineFunctionPass(ID) {
				initializeXRayInstrumentationPass(*PassRegistry::getPassRegistry());
				}

				bool runOnMachineFunction(MachineFunction &MF) override;
				};
				}

				bool XRayInstrumentation::runOnMachineFunction(MachineFunction &MF) {
				auto &F = *MF.getFunction();
				auto InstrAttr = F.getFnAttribute("function-instrument");
				bool AlwaysInstrument = !InstrAttr.hasAttribute(Attribute::None) &&
				InstrAttr.isStringAttribute() &&
				InstrAttr.getValueAsString() == "xray-always";
				Attribute Attr = F.getFnAttribute("xray-instruction-threshold");
				unsigned XRayThreshold = 0;
				if (!AlwaysInstrument) {
				if (Attr.hasAttribute(Attribute::None) \|\| !Attr.isStringAttribute())
				return false; // XRay threshold attribute not found.
				if (Attr.getValueAsString().getAsInteger(10, XRayThreshold))
				return false; // Invalid value for threshold.
				if (F.size() < XRayThreshold)
				return false; // Function is too small.
				}

				// FIXME: Do the loop triviality analysis here or in an earlier pass.

				// First, insert an PATCHABLE_FUNCTION_ENTER as the first instruction of the
				// MachineFunction.
				auto &FirstMBB = *MF.begin();
				auto &FirstMI = *FirstMBB.begin();
				auto *TII = MF.getSubtarget().getInstrInfo();
				BuildMI(FirstMBB, FirstMI, FirstMI.getDebugLoc(),
				TII->get(TargetOpcode::PATCHABLE_FUNCTION_ENTER));

				// Then we look for all terminators and returns, then replace those with
				// PATCHABLE_RET instructions.
				SmallVector<MachineInstr *, 4> Terminators;
				for (auto &MBB : MF) {
				for (auto &T : MBB.terminators()) {
				// FIXME: Handle tail calls here too?
				if (T.isReturn() && T.getOpcode() == TII->getReturnOpcode()) {
				// Replace return instructions with:
				// PATCHABLE_RET <Opcode>, <Operand>...
				auto MIB = BuildMI(MBB, T, T.getDebugLoc(),
				TII->get(TargetOpcode::PATCHABLE_RET))
				.addImm(T.getOpcode());
				for (auto &MO : T.operands())
				MIB.addOperand(MO);
				Terminators.push_back(&T);
				break;
				}
				}
				}

				for (auto &I : Terminators)
				I->eraseFromParent();

				return true;
				}

				char XRayInstrumentation::ID = 0;
				char &llvm::XRayInstrumentationID = XRayInstrumentation::ID;
				INITIALIZE_PASS(XRayInstrumentation, "xray-instrumentation", "Insert XRay ops",
				false, false);

llvm/trunk/lib/Target/X86/X86AsmPrinter.h

Show First 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	private:
// CurrentShadowSize counts the number of bytes encoded since the most		// CurrentShadowSize counts the number of bytes encoded since the most
// recently encountered STACKMAP, stopping when that number is greater than		// recently encountered STACKMAP, stopping when that number is greater than
// or equal to RequiredShadowSize.		// or equal to RequiredShadowSize.
unsigned RequiredShadowSize = 0, CurrentShadowSize = 0;		unsigned RequiredShadowSize = 0, CurrentShadowSize = 0;
};		};

StackMapShadowTracker SMShadowTracker;		StackMapShadowTracker SMShadowTracker;

		// This describes the kind of sled we're storing in the XRay table.
		enum class SledKind : uint8_t {
		FUNCTION_ENTER = 0,
		FUNCTION_EXIT = 1,
		TAIL_CALL = 2,
		};

		// The table will contain these structs that point to the sled, the function
		// containing the sled, and what kind of sled (and whether they should always
		// be instrumented).
		struct XRayFunctionEntry {
		const MCSymbol *Sled;
		const MCSymbol *Function;
		SledKind Kind;
		bool AlwaysInstrument;
		const class Function *Fn;
		};

		// All the sleds to be emitted.
		std::vector<XRayFunctionEntry> Sleds;

// All instructions emitted by the X86AsmPrinter should use this helper		// All instructions emitted by the X86AsmPrinter should use this helper
// method.		// method.
//		//
// This helper function invokes the SMShadowTracker on each instruction before		// This helper function invokes the SMShadowTracker on each instruction before
// outputting it to the OutStream. This allows the shadow tracker to minimise		// outputting it to the OutStream. This allows the shadow tracker to minimise
// the number of NOPs used for stackmap padding.		// the number of NOPs used for stackmap padding.
void EmitAndCountInstruction(MCInst &Inst);		void EmitAndCountInstruction(MCInst &Inst);
void LowerSTACKMAP(const MachineInstr &MI);		void LowerSTACKMAP(const MachineInstr &MI);
void LowerPATCHPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerPATCHPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);
void LowerSTATEPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerSTATEPOINT(const MachineInstr &MI, X86MCInstLower &MCIL);
void LowerFAULTING_LOAD_OP(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerFAULTING_LOAD_OP(const MachineInstr &MI, X86MCInstLower &MCIL);
void LowerPATCHABLE_OP(const MachineInstr &MI, X86MCInstLower &MCIL);		void LowerPATCHABLE_OP(const MachineInstr &MI, X86MCInstLower &MCIL);

void LowerTlsAddr(X86MCInstLower &MCInstLowering, const MachineInstr &MI);		void LowerTlsAddr(X86MCInstLower &MCInstLowering, const MachineInstr &MI);

		// XRay-specific lowering for X86.
		void LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,
		X86MCInstLower &MCIL);
		void LowerPATCHABLE_RET(const MachineInstr &MI, X86MCInstLower &MCIL);
		void LowerPATCHABLE_TAIL_CALL(const MachineInstr &MI, X86MCInstLower &MCIL);

		// Helper function that emits the XRay sleds we've collected for a particular
		// function.
		void EmitXRayTable();

		// Helper function to record a given XRay sled.
		void recordSled(MCSymbol *Sled, const MachineInstr &MI, SledKind Kind);
public:		public:
explicit X86AsmPrinter(TargetMachine &TM,		explicit X86AsmPrinter(TargetMachine &TM,
std::unique_ptr<MCStreamer> Streamer)		std::unique_ptr<MCStreamer> Streamer)
: AsmPrinter(TM, std::move(Streamer)), SM(this), FM(this) {}		: AsmPrinter(TM, std::move(Streamer)), SM(this), FM(this) {}

const char *getPassName() const override {		const char *getPassName() const override {
return "X86 Assembly / Object Emitter";		return "X86 Assembly / Object Emitter";
}		}

const X86Subtarget &getSubtarget() const { return *Subtarget; }		const X86Subtarget &getSubtarget() const { return *Subtarget; }

void EmitStartOfAsmFile(Module &M) override;		void EmitStartOfAsmFile(Module &M) override;
Show All 31 Lines

llvm/trunk/lib/Target/X86/X86AsmPrinter.cpp

Show First 20 Lines • Show All 63 Lines • ▼ Show 20 Lines	if (Subtarget->isTargetCOFF()) {
OutStreamer->EmitCOFFSymbolType(COFF::IMAGE_SYM_DTYPE_FUNCTION		OutStreamer->EmitCOFFSymbolType(COFF::IMAGE_SYM_DTYPE_FUNCTION
<< COFF::SCT_COMPLEX_TYPE_SHIFT);		<< COFF::SCT_COMPLEX_TYPE_SHIFT);
OutStreamer->EndCOFFSymbolDef();		OutStreamer->EndCOFFSymbolDef();
}		}

// Emit the rest of the function body.		// Emit the rest of the function body.
EmitFunctionBody();		EmitFunctionBody();

		// Emit the XRay table for this function.
		EmitXRayTable();

// We didn't modify anything.		// We didn't modify anything.
return false;		return false;
}		}

/// printSymbolOperand - Print a raw symbol reference operand. This handles		/// printSymbolOperand - Print a raw symbol reference operand. This handles
/// jump tables, constant pools, global address and external symbols, all of		/// jump tables, constant pools, global address and external symbols, all of
/// which print to a label with various suffixes for relocation types etc.		/// which print to a label with various suffixes for relocation types etc.
static void printSymbolOperand(X86AsmPrinter &P, const MachineOperand &MO,		static void printSymbolOperand(X86AsmPrinter &P, const MachineOperand &MO,
▲ Show 20 Lines • Show All 579 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86FrameLowering.cpp

Show First 20 Lines • Show All 153 Lines • ▼ Show 20 Lines	static unsigned findDeadCallerSavedReg(MachineBasicBlock &MBB,
if (!F \|\| MF->getMMI().callsEHReturn())		if (!F \|\| MF->getMMI().callsEHReturn())
return 0;		return 0;

const TargetRegisterClass &AvailableRegs = TRI->getGPRsForTailCall(MF);		const TargetRegisterClass &AvailableRegs = TRI->getGPRsForTailCall(MF);

unsigned Opc = MBBI->getOpcode();		unsigned Opc = MBBI->getOpcode();
switch (Opc) {		switch (Opc) {
default: return 0;		default: return 0;
		case TargetOpcode::PATCHABLE_RET:
case X86::RET:		case X86::RET:
case X86::RETL:		case X86::RETL:
case X86::RETQ:		case X86::RETQ:
case X86::RETIL:		case X86::RETIL:
case X86::RETIQ:		case X86::RETIQ:
case X86::TCRETURNdi:		case X86::TCRETURNdi:
case X86::TCRETURNri:		case X86::TCRETURNri:
case X86::TCRETURNmi:		case X86::TCRETURNmi:
▲ Show 20 Lines • Show All 2,817 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86InstrInfo.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show First 20 Lines • Show All 111 Lines • ▼ Show 20 Lines
	// Pin the vtable to this file.			// Pin the vtable to this file.
	void X86InstrInfo::anchor() {}			void X86InstrInfo::anchor() {}

	X86InstrInfo::X86InstrInfo(X86Subtarget &STI)			X86InstrInfo::X86InstrInfo(X86Subtarget &STI)
	: X86GenInstrInfo((STI.isTarget64BitLP64() ? X86::ADJCALLSTACKDOWN64			: X86GenInstrInfo((STI.isTarget64BitLP64() ? X86::ADJCALLSTACKDOWN64
	: X86::ADJCALLSTACKDOWN32),			: X86::ADJCALLSTACKDOWN32),
	(STI.isTarget64BitLP64() ? X86::ADJCALLSTACKUP64			(STI.isTarget64BitLP64() ? X86::ADJCALLSTACKUP64
	: X86::ADJCALLSTACKUP32),			: X86::ADJCALLSTACKUP32),
	X86::CATCHRET),			X86::CATCHRET,
				(STI.is64Bit() ? X86::RETQ : X86::RETL)),
	Subtarget(STI), RI(STI.getTargetTriple()) {			Subtarget(STI), RI(STI.getTargetTriple()) {

	static const X86MemoryFoldTableEntry MemoryFoldTable2Addr[] = {			static const X86MemoryFoldTableEntry MemoryFoldTable2Addr[] = {
	{ X86::ADC32ri, X86::ADC32mi, 0 },			{ X86::ADC32ri, X86::ADC32mi, 0 },
	{ X86::ADC32ri8, X86::ADC32mi8, 0 },			{ X86::ADC32ri8, X86::ADC32mi8, 0 },
	{ X86::ADC32rr, X86::ADC32mr, 0 },			{ X86::ADC32rr, X86::ADC32mr, 0 },
	{ X86::ADC64ri32, X86::ADC64mi32, 0 },			{ X86::ADC64ri32, X86::ADC64mi32, 0 },
	{ X86::ADC64ri8, X86::ADC64mi8, 0 },			{ X86::ADC64ri8, X86::ADC64mi8, 0 },
	▲ Show 20 Lines • Show All 7,421 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86MCInstLower.cpp

Show All 30 Lines
#include "llvm/IR/Mangler.h"		#include "llvm/IR/Mangler.h"
#include "llvm/MC/MCAsmInfo.h"		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCCodeEmitter.h"		#include "llvm/MC/MCCodeEmitter.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCExpr.h"		#include "llvm/MC/MCExpr.h"
#include "llvm/MC/MCFixup.h"		#include "llvm/MC/MCFixup.h"
#include "llvm/MC/MCInst.h"		#include "llvm/MC/MCInst.h"
#include "llvm/MC/MCInstBuilder.h"		#include "llvm/MC/MCInstBuilder.h"
		#include "llvm/MC/MCSection.h"
#include "llvm/MC/MCStreamer.h"		#include "llvm/MC/MCStreamer.h"
#include "llvm/MC/MCSymbol.h"		#include "llvm/MC/MCSymbol.h"
		#include "llvm/MC/MCSymbolELF.h"
		#include "llvm/MC/MCSectionELF.h"
#include "llvm/Support/TargetRegistry.h"		#include "llvm/Support/TargetRegistry.h"
		#include "llvm/Support/ELF.h"
		#include "llvm/Target/TargetLoweringObjectFile.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

/// X86MCInstLower - This class is used to lower an MachineInstr into an MCInst.		/// X86MCInstLower - This class is used to lower an MachineInstr into an MCInst.
class X86MCInstLower {		class X86MCInstLower {
MCContext &Ctx;		MCContext &Ctx;
const MachineFunction &MF;		const MachineFunction &MF;
▲ Show 20 Lines • Show All 963 Lines • ▼ Show 20 Lines	void X86AsmPrinter::LowerPATCHPOINT(const MachineInstr &MI,
unsigned NumBytes = opers.getMetaOper(PatchPointOpers::NBytesPos).getImm();		unsigned NumBytes = opers.getMetaOper(PatchPointOpers::NBytesPos).getImm();
assert(NumBytes >= EncodedBytes &&		assert(NumBytes >= EncodedBytes &&
"Patchpoint can't request size less than the length of a call.");		"Patchpoint can't request size less than the length of a call.");

EmitNops(*OutStreamer, NumBytes - EncodedBytes, Subtarget->is64Bit(),		EmitNops(*OutStreamer, NumBytes - EncodedBytes, Subtarget->is64Bit(),
getSubtargetInfo());		getSubtargetInfo());
}		}

		void X86AsmPrinter::recordSled(MCSymbol *Sled, const MachineInstr &MI,
		SledKind Kind) {
		auto Fn = MI.getParent()->getParent()->getFunction();
		auto Attr = Fn->getFnAttribute("function-instrument");
		bool AlwaysInstrument =
		Attr.isStringAttribute() && Attr.getValueAsString() == "xray-always";
		Sleds.emplace_back(
		XRayFunctionEntry{Sled, CurrentFnSym, Kind, AlwaysInstrument, Fn});
		}

		void X86AsmPrinter::LowerPATCHABLE_FUNCTION_ENTER(const MachineInstr &MI,
		X86MCInstLower &MCIL) {
		// We want to emit the following pattern:
		//
		// .Lxray_sled_N:
		// .palign 2, ...
		// jmp .tmpN
		// # 9 bytes worth of noops
		// .tmpN
		//
		// We need the 9 bytes because at runtime, we'd be patching over the full 11
		// bytes with the following pattern:
		//
		// mov %r10, <function id, 32-bit> // 6 bytes
		// call <relative offset, 32-bits> // 5 bytes
		//
		auto CurSled = OutContext.createTempSymbol("xray_sled_", true);
		OutStreamer->EmitLabel(CurSled);
		OutStreamer->EmitCodeAlignment(4);
		auto Target = OutContext.createTempSymbol();

		// Use a two-byte `jmp`. This version of JMP takes an 8-bit relative offset as
		// an operand (computed as an offset from the jmp instruction).
		// FIXME: Find another less hacky way do force the relative jump.
		OutStreamer->EmitBytes("\xeb\x09");
		EmitNops(*OutStreamer, 9, Subtarget->is64Bit(), getSubtargetInfo());
		OutStreamer->EmitLabel(Target);
		recordSled(CurSled, MI, SledKind::FUNCTION_ENTER);
		}

		void X86AsmPrinter::LowerPATCHABLE_RET(const MachineInstr &MI,
		X86MCInstLower &MCIL) {
		// Since PATCHABLE_RET takes the opcode of the return statement as an
		// argument, we use that to emit the correct form of the RET that we want.
		// i.e. when we see this:
		//
		// PATCHABLE_RET X86::RET ...
		//
		// We should emit the RET followed by sleds.
		//
		// .Lxray_sled_N:
		// ret # or equivalent instruction
		// # 10 bytes worth of noops
		//
		// This just makes sure that the alignment for the next instruction is 2.
		auto CurSled = OutContext.createTempSymbol("xray_sled_", true);
		OutStreamer->EmitLabel(CurSled);
		unsigned OpCode = MI.getOperand(0).getImm();
		MCInst Ret;
		Ret.setOpcode(OpCode);
		for (auto &MO : make_range(MI.operands_begin() + 1, MI.operands_end()))
		if (auto MaybeOperand = MCIL.LowerMachineOperand(&MI, MO))
		Ret.addOperand(MaybeOperand.getValue());
		OutStreamer->EmitInstruction(Ret, getSubtargetInfo());
		EmitNops(*OutStreamer, 10, Subtarget->is64Bit(), getSubtargetInfo());
		recordSled(CurSled, MI, SledKind::FUNCTION_EXIT);
		}

		void X86AsmPrinter::EmitXRayTable() {
		if (Sleds.empty())
		return;
		if (Subtarget->isTargetELF()) {
		auto *Section = OutContext.getELFSection(
		"xray_instr_map", ELF::SHT_PROGBITS,
		ELF::SHF_ALLOC \| ELF::SHF_GROUP \| ELF::SHF_MERGE, 0,
		CurrentFnSym->getName());
		auto PrevSection = OutStreamer->getCurrentSectionOnly();
		OutStreamer->SwitchSection(Section);
		for (const auto &Sled : Sleds) {
		OutStreamer->EmitSymbolValue(Sled.Sled, 8);
		OutStreamer->EmitSymbolValue(CurrentFnSym, 8);
		auto Kind = static_cast<uint8_t>(Sled.Kind);
		OutStreamer->EmitBytes(
		StringRef(reinterpret_cast<const char *>(&Kind), 1));
		OutStreamer->EmitBytes(
		StringRef(reinterpret_cast<const char *>(&Sled.AlwaysInstrument), 1));
		OutStreamer->EmitZeros(14);
		}
		OutStreamer->SwitchSection(PrevSection);
		}
		Sleds.clear();
		}

// Returns instruction preceding MBBI in MachineFunction.		// Returns instruction preceding MBBI in MachineFunction.
// If MBBI is the first instruction of the first basic block, returns null.		// If MBBI is the first instruction of the first basic block, returns null.
static MachineBasicBlock::const_iterator		static MachineBasicBlock::const_iterator
PrevCrossBBInst(MachineBasicBlock::const_iterator MBBI) {		PrevCrossBBInst(MachineBasicBlock::const_iterator MBBI) {
const MachineBasicBlock *MBB = MBBI->getParent();		const MachineBasicBlock *MBB = MBBI->getParent();
while (MBBI == MBB->begin()) {		while (MBBI == MBB->begin()) {
if (MBB == &MBB->getParent()->front())		if (MBB == &MBB->getParent()->front())
return MachineBasicBlock::const_iterator();		return MachineBasicBlock::const_iterator();
▲ Show 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	case TargetOpcode::PATCHABLE_OP:
return LowerPATCHABLE_OP(*MI, MCInstLowering);		return LowerPATCHABLE_OP(*MI, MCInstLowering);

case TargetOpcode::STACKMAP:		case TargetOpcode::STACKMAP:
return LowerSTACKMAP(*MI);		return LowerSTACKMAP(*MI);

case TargetOpcode::PATCHPOINT:		case TargetOpcode::PATCHPOINT:
return LowerPATCHPOINT(*MI, MCInstLowering);		return LowerPATCHPOINT(*MI, MCInstLowering);

		case TargetOpcode::PATCHABLE_FUNCTION_ENTER:
		return LowerPATCHABLE_FUNCTION_ENTER(*MI, MCInstLowering);

		case TargetOpcode::PATCHABLE_RET:
		return LowerPATCHABLE_RET(*MI, MCInstLowering);

case X86::MORESTACK_RET:		case X86::MORESTACK_RET:
EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));		EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));
return;		return;

case X86::MORESTACK_RET_RESTORE_R10:		case X86::MORESTACK_RET_RESTORE_R10:
// Return, then restore R10.		// Return, then restore R10.
EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));		EmitAndCountInstruction(MCInstBuilder(getRetOpcode(*Subtarget)));
EmitAndCountInstruction(MCInstBuilder(X86::MOV64rr)		EmitAndCountInstruction(MCInstBuilder(X86::MOV64rr)
▲ Show 20 Lines • Show All 312 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/X86/xray-attribute-instrumentation.ll

				; RUN: llc -filetype=asm -o - -mtriple=x86_64-unknown-linux-gnu < %s \| FileCheck %s

				define i32 @foo() nounwind noinline uwtable "function-instrument"="xray-always" {
				; CHECK-LABEL: Lxray_sled_0:
				; CHECK-NEXT: .p2align 2, 0x90
				; CHECK-NEXT: .ascii "\353\t"
				; CHECK-NEXT: nopw 512(%rax,%rax)
				; CHECK-LABEL: Ltmp0:
				ret i32 0
				; CHECK-LABEL: Lxray_sled_1:
				; CHECK-NEXT: retq
				; CHECK-NEXT: nopw %cs:512(%rax,%rax)
				}

llvm/trunk/test/CodeGen/X86/xray-selective-instrumentation-miss.ll

				; RUN: llc -mcpu=nehalem < %s \| not grep xray_sled_

				target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128"
				target triple = "x86_64-apple-darwin8"

				define i32 @foo() nounwind uwtable "xray-instruction-threshold"="3" {
				entry:
				ret i32 0
				}

llvm/trunk/test/CodeGen/X86/xray-selective-instrumentation.ll

				; RUN: llc -mcpu=nehalem < %s \| grep xray_sled_

				target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128"
				target triple = "x86_64-apple-darwin8"

				define i32 @foo() nounwind uwtable "xray-instruction-threshold"="1" {
				entry:
				ret i32 0
				}

llvm/trunk/utils/TableGen/InstrInfoEmitter.cpp

Show First 20 Lines • Show All 422 Lines • ▼ Show 20 Lines	void InstrInfoEmitter::run(raw_ostream &OS) {
// Create a TargetInstrInfo subclass to hide the MC layer initialization.		// Create a TargetInstrInfo subclass to hide the MC layer initialization.
OS << "#ifdef GET_INSTRINFO_HEADER\n";		OS << "#ifdef GET_INSTRINFO_HEADER\n";
OS << "#undef GET_INSTRINFO_HEADER\n";		OS << "#undef GET_INSTRINFO_HEADER\n";

std::string ClassName = TargetName + "GenInstrInfo";		std::string ClassName = TargetName + "GenInstrInfo";
OS << "namespace llvm {\n";		OS << "namespace llvm {\n";
OS << "struct " << ClassName << " : public TargetInstrInfo {\n"		OS << "struct " << ClassName << " : public TargetInstrInfo {\n"
<< " explicit " << ClassName		<< " explicit " << ClassName
<< "(int CFSetupOpcode = -1, int CFDestroyOpcode = -1, int CatchRetOpcode = -1);\n"		<< "(int CFSetupOpcode = -1, int CFDestroyOpcode = -1, int CatchRetOpcode = -1, int ReturnOpcode = -1);\n"
<< " ~" << ClassName << "() override {}\n"		<< " ~" << ClassName << "() override {}\n"
<< "};\n";		<< "};\n";
OS << "} // end llvm namespace\n";		OS << "} // end llvm namespace\n";

OS << "#endif // GET_INSTRINFO_HEADER\n\n";		OS << "#endif // GET_INSTRINFO_HEADER\n\n";

OS << "#ifdef GET_INSTRINFO_CTOR_DTOR\n";		OS << "#ifdef GET_INSTRINFO_CTOR_DTOR\n";
OS << "#undef GET_INSTRINFO_CTOR_DTOR\n";		OS << "#undef GET_INSTRINFO_CTOR_DTOR\n";

OS << "namespace llvm {\n";		OS << "namespace llvm {\n";
OS << "extern const MCInstrDesc " << TargetName << "Insts[];\n";		OS << "extern const MCInstrDesc " << TargetName << "Insts[];\n";
OS << "extern const unsigned " << TargetName << "InstrNameIndices[];\n";		OS << "extern const unsigned " << TargetName << "InstrNameIndices[];\n";
OS << "extern const char " << TargetName << "InstrNameData[];\n";		OS << "extern const char " << TargetName << "InstrNameData[];\n";
OS << ClassName << "::" << ClassName		OS << ClassName << "::" << ClassName
<< "(int CFSetupOpcode, int CFDestroyOpcode, int CatchRetOpcode)\n"		<< "(int CFSetupOpcode, int CFDestroyOpcode, int CatchRetOpcode, int ReturnOpcode)\n"
<< " : TargetInstrInfo(CFSetupOpcode, CFDestroyOpcode, CatchRetOpcode) {\n"		<< " : TargetInstrInfo(CFSetupOpcode, CFDestroyOpcode, CatchRetOpcode, ReturnOpcode) {\n"
<< " InitMCInstrInfo(" << TargetName << "Insts, " << TargetName		<< " InitMCInstrInfo(" << TargetName << "Insts, " << TargetName
<< "InstrNameIndices, " << TargetName << "InstrNameData, "		<< "InstrNameIndices, " << TargetName << "InstrNameData, "
<< NumberedInstructions.size() << ");\n}\n";		<< NumberedInstructions.size() << ");\n}\n";
OS << "} // end llvm namespace\n";		OS << "} // end llvm namespace\n";

OS << "#endif // GET_INSTRINFO_CTOR_DTOR\n\n";		OS << "#endif // GET_INSTRINFO_CTOR_DTOR\n\n";

emitOperandNameMappings(OS, Target, NumberedInstructions);		emitOperandNameMappings(OS, Target, NumberedInstructions);
▲ Show 20 Lines • Show All 151 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

XRay: Add entry and exit sledsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 63912

llvm/trunk/include/llvm/CodeGen/Passes.h

llvm/trunk/include/llvm/InitializePasses.h

llvm/trunk/include/llvm/Target/Target.td

llvm/trunk/include/llvm/Target/TargetInstrInfo.h

llvm/trunk/include/llvm/Target/TargetOpcodes.def

llvm/trunk/lib/CodeGen/CMakeLists.txt

llvm/trunk/lib/CodeGen/CodeGen.cpp

llvm/trunk/lib/CodeGen/TargetPassConfig.cpp

llvm/trunk/lib/CodeGen/XRayInstrumentation.cpp

llvm/trunk/lib/Target/X86/X86AsmPrinter.h

llvm/trunk/lib/Target/X86/X86AsmPrinter.cpp

llvm/trunk/lib/Target/X86/X86FrameLowering.cpp

llvm/trunk/lib/Target/X86/X86InstrInfo.cpp

llvm/trunk/lib/Target/X86/X86MCInstLower.cpp

llvm/trunk/test/CodeGen/X86/xray-attribute-instrumentation.ll

llvm/trunk/test/CodeGen/X86/xray-selective-instrumentation-miss.ll

llvm/trunk/test/CodeGen/X86/xray-selective-instrumentation.ll

llvm/trunk/utils/TableGen/InstrInfoEmitter.cpp

XRay: Add entry and exit sleds
ClosedPublic