This is an archive of the discontinued LLVM Phabricator instance.

Generate SEH unwinding info on Win64
ClosedPublic

Authored by vadimcn on Jun 9 2014, 9:27 PM.

Download Raw Diff

Details

Reviewers

nrieck
rnk
asl
chapuni

Commits

rG1db5995d1419: Re-apply r211399, "Generate native unwind info on Win64" with a fix to ignore…
rG4a01230db47e: Generate native unwind info on Win64
rL211691: Re-apply r211399, "Generate native unwind info on Win64" with a fix to ignore…
rL211399: Generate native unwind info on Win64

Summary

This patch enables LLVM to emit Win64-native SEH unwind info rather than DWARF CFI. It handles all corner cases (I hope), including stack realignment.
Because SEH is not flexible enough to describe stack frames with a gap of unknown size in the middle, such as the one caused by stack realignment, I modified register spilling code to place all spills into the fixed frame slots, so that they can be accessed relative to the frame pointer.

Diff Detail

Repository: rL LLVM

Event Timeline

vadimcn updated this revision to Diff 10261.Jun 9 2014, 9:27 PM

vadimcn retitled this revision from to Generate SEH unwinding info on Win64.

vadimcn updated this object.

vadimcn edited the test plan for this revision. (Show Details)

vadimcn added reviewers: nrieck, rnk, asl, chapuni.Jun 9 2014, 9:34 PM

vadimcn added subscribers: Unknown Object (MLST), Kai, sanjoy, mingwandroid.

Excellent. This replaces http://reviews.llvm.org/D3418 and http://reviews.llvm.org/D3419 is still needed, right?

Whoops, looks like I got .cfi_offset and .cfi_rel_offset mixed up. The original offsets in bigstructret2.ll test were actually correct.
Also, rebased on top of master.

vadimcn updated this revision to Diff 10279.Jun 10 2014, 8:29 AM

rnk added inline comments.Jun 10 2014, 11:53 AM

include/llvm/CodeGen/MachineFrameInfo.h
486 ↗	(On Diff #10279)	Let's try to avoid contiguous boolean parameters, especially when some have default arguments. They make it hard to do refactorings like removing the 'Immutable' parameter here. Instead, CreateFixedSpillSlot(Size, SPOffset) sounds like the right interface to me.
lib/MC/MCAsmStreamer.cpp
1138 ↗	(On Diff #10279)	This should be a separate revert. The existing test case is incorrect, it was doing '.seh_savereg %rsi, 16' and disassembling that with %rbp.
lib/Target/X86/MCTargetDesc/X86MCAsmInfo.cpp
148 ↗	(On Diff #10279)	I think this means we will start attempting to emit cleanups for x86_64-pc-windows-msvc, which we probably aren't ready for. Can we hold off on this change, or are you reasonably sure it's safe?
lib/Target/X86/X86FrameLowering.cpp
402–410 ↗	(On Diff #10279)	Are these comments accurate when we have an FP and the stack needs realignment? It seems like we should move the win64-specific XMM CSR saves to happen before the stack realignment. This should be safe since win64 also gives us 16-byte stack alignment.
738–742 ↗	(On Diff #10279)	The comment below about "Don't care about the rest of the stack allocation" should get merged in here. I spent a while trying to understand why this scary comment is OK, and the comment below explains it.
744 ↗	(On Diff #10279)	We have range based for loops now, so this can be: for (const CalleeSavedInfo &Info : CSI) { ... }
773 ↗	(On Diff #10279)	Format. You can drop the braces.
910 ↗	(On Diff #10279)	Seems like a superfluous formatting change
1092 ↗	(On Diff #10279)	:( Is TargetFrameLowering::getCalleeSavedSpillSlots() not powerful enough to express the offsets you need? If it isn't, I think the right way forward is to add a more flexible interface to TFL, like TFL::appendCalleeSavedSpillSlot(std::vector<CSI> &CSIs), where the default implementation forwards on to check 'TFL->getCalleeSavedSpillSlots(NumFixedSpillSlots)' so existing targets don't require changes.
lib/Target/X86/X86ISelLowering.cpp
600–603 ↗	(On Diff #10279)	Can we flip this to the positive sense? What targets actually need it? win32 only?

ivan171 added a subscriber: ivan171.Jun 10 2014, 12:41 PM

vadimcn added inline comments.Jun 10 2014, 12:42 PM

include/llvm/CodeGen/MachineFrameInfo.h
486 ↗	(On Diff #10279)	Will do.
lib/MC/MCAsmStreamer.cpp
1138 ↗	(On Diff #10279)	I have it as a separate commit in git, it just got squashed when posting diff to Phabricator. Do you want me to actually submit it as a separate review?
lib/Target/X86/MCTargetDesc/X86MCAsmInfo.cpp
148 ↗	(On Diff #10279)	Not sure I understand... Can you please elaborate?
lib/Target/X86/X86FrameLowering.cpp
402–410 ↗	(On Diff #10279)	Stack realignment happens before stack pointer decrement, so XMM slots would be be in "invalid" zone at that point. We'd have to swap re-alignment and SP decrement in order to do what you suggest. I did not want to mess with existing frame setup order too much... But I don't see the problem with the way it is now, because (re-alignment + SP decrement) is guaranteed to allocate at least as much space as SP decrement alone. And the xmm spill slots are accessed via rbp, which is set up before stack realignment, so no problem there either.
738–742 ↗	(On Diff #10279)	okay
910 ↗	(On Diff #10279)	okay, will undo
1092 ↗	(On Diff #10279)	The problem with getCalleeSavedSpillSlots() is that it would reserve spill slots regardless of whether the register actually needs to be spilled, which would increase stack usage unnecessarily. I was thinking of using hasReservedSpillSlot() for this purpose, because, in principle, it could create spill slots on-the-fly, but then we'd be relying on the order of register enumeration. Also, it'd need to store state somewhere, and the code overall would be more convoluted. How about changing getCalleeSavedSpillSlots() signature to take non-const vector<CalleeSavedInfo>? If that's too hacky, I'd propose to create TargetFrameLowering::assignSpillSlots(vector<CSI>& ) callback, that would allow to override PEI::calculateCalleeSavedRegisters() in a target-specific manner.

rnk added inline comments.Jun 10 2014, 1:03 PM

lib/MC/MCAsmStreamer.cpp
1138 ↗	(On Diff #10279)	No, I can commit it.
lib/Target/X86/MCTargetDesc/X86MCAsmInfo.cpp
148 ↗	(On Diff #10279)	Ignore this comment. I worry this will break 'clang-cl -m64' because we emit 'invoke' instructions in the frontend to run C++ destructors. Clang is currently relying on the fact that LLVM drops the landing pads on the floor and generates regular calls. Clang shouldn't be relying on that, though.
lib/Target/X86/X86FrameLowering.cpp
402–410 ↗	(On Diff #10279)	OK, that makes sense. We may end up spilling xmm registers into the stack realignment gap, but that should be OK. Can you add a comment something like: ; It's OK if the stack was realigned and xmm registers are spilled into the realignment gap.
779 ↗	(On Diff #10279)	This can also be range-based.
1092 ↗	(On Diff #10279)	If that's too hacky, I'd propose to create TargetFrameLowering::assignSpillSlots(vector<CSI>& ) callback, that would allow to override PEI::calculateCalleeSavedRegisters() in a target-specific manner. This is basically what I was trying to suggest. The default implementation should do what calculateCalleeSavedRegisters() does now, so you don't have to change every target.

compnerd added a subscriber: compnerd.Jun 10 2014, 6:48 PM

compnerd added inline comments.

lib/MC/MCObjectFileInfo.cpp
635 ↗	(On Diff #10279)	AFAIK, the COFF backend only supports Windows ATM. Why not make this an assert? assert(T.isOSWindows() && "Windows is the only supported COFF target"); if (T.getArch() != Triple::x86_64) { LSDASection = ... }
lib/Target/X86/MCTargetDesc/X86MCAsmInfo.cpp
168 ↗	(On Diff #10279)	Convert to an assert and simplify?

loladiro added a subscriber: loladiro.Jun 11 2014, 11:35 AM

I went and tried this out but ran into an assertion. This is odd because at that stage I wasn't actually using COFF, but rather ELF in memory for MCJIT. Maybe there's a check missing whether we are using COFF in addition to Win64? See below:

Program: C:\mingw-builds\msys64\home\kfischer\julia\usr\bin\julia.exe
File: C:/mingw-builds/msys64/home/kfischer/julia/deps/llvm-svn/lib/CodeGen/MachineInstr.cpp, Line 674

Expression: (isImpReg || Op.isRegMask() || MCID->isVariadic() || OpNo < MCID->getNumOperands() || isMetaDataOp) && "Trying to add an operand to a machine instr that is already done!"

In D4081#16, @loladiro wrote:
I went and tried this out but ran into an assertion. This is odd because at that stage I wasn't actually using COFF, but rather ELF in memory for MCJIT. Maybe there's a check missing whether we are using COFF in addition to Win64? See below:
Program: C:\mingw-builds\msys64\home\kfischer\julia\usr\bin\julia.exe
File: C:/mingw-builds/msys64/home/kfischer/julia/deps/llvm-svn/lib/CodeGen/MachineInstr.cpp, Line 674

Expression: (isImpReg || Op.isRegMask() || MCID->isVariadic() || OpNo < MCID->getNumOperands() || isMetaDataOp) && "Trying to add an operand to a machine instr that is already done!"

Did you do a clean build? I ran into something similar after adding SEH instructions, and the solution was to delete all .o and .inc files under Target\X86. Apparently make dependencies are not quite reliable.

I did not. Doing that now. Will report back. The make dependencies should really be figured out though.

Ok, now I'm getting "LLVM ERROR: No open Win64 EH frame function!" (again with ELF in the first stage).

Have you run the example at http://llvm.org/bugs/show_bug.cgi?id=19905 through llvm with this change in?

lib/Target/X86/X86FrameLowering.cpp
1131 ↗	(On Diff #10279)	One nice effect of having a "TargetFrameLowering::assignSpillSlots" is that you won't have to create frame indices only to delete them i.e. you'd have CSR's spilled to fixed slots to begin with.

vadimcn added inline comments.Jun 13 2014, 6:18 PM

lib/Target/X86/X86ISelLowering.cpp
600–603 ↗	(On Diff #10279)	This is to make sure that landing pads do get emitted for win64. Honestly, I have no idea what the inverse of this set looks like.

Addressed review comments.

logan added a subscriber: logan.Jun 16 2014, 7:25 AM

@rnk, ping...

lgtm, do you want me to commit this?

lib/Target/X86/X86FrameLowering.cpp
1084 ↗	(On Diff #10416)	indentation, clang-format can fix it.
1113 ↗	(On Diff #10416)	s/GRPs/GPRs/

This revision is now accepted and ready to land.Jun 18 2014, 1:07 PM

vadimcn added inline comments.Jun 18 2014, 1:39 PM

lib/Target/X86/X86FrameLowering.cpp
1084 ↗	(On Diff #10416)	Do you want me to run clang-format on all files I've changed? I am a bit worried about doing that, because it reformats quite a bit more code beyond what I touched.

You can run clang-format from the VC IDE on selected (marked) text only.However, it may be a good idea to reformat all the file if it does not conform to clang format.

Reid, what do you think?

rnk added inline comments.Jun 18 2014, 2:04 PM

lib/Target/X86/X86FrameLowering.cpp
1084 ↗	(On Diff #10416)	No, typically it's used interactively only on lines that are being changed. See git-clang-format. Anyway, I can do that if this is done.

Yes, please commit, if you don't mind running format yourself.

I tried to commit today, but there were conflicts while rebasing. I attempted to resolve them, but I still had this test failure after that:

test/CodeGen/X86/x86-64-static-relo-movl.ll

Can you look into this? The test uses a x86_64-pc-win32-macho triple, which makes us hit "LLVM ERROR: No open Win64 EH frame function!". We probably shouldn't emit win64 style unwind info when generating MachO object files.

Is 'x86_64-pc-win32-macho' a valid target? Windows 64 ABI in OSX object file seems like a weird combination.

Not necessarily by itself, but it's not uncommon to have macho or elf in memory for MCJIT (as I mentioned above). Therefore, every effort should be made to make sure it doesn't break.

Fixed x86-64-static-relo-movl.ll and rebased.

Closed by commit rL211399 (authored by @rnk).

Looks like something wrong with .pdata segment. It contains Function start address == Function end address and not always sorted by start address.

In D4081#42, @Twobit wrote:

Looks like something wrong with .pdata segment. It contains Function start address == Function end address and not always sorted by start address.

@Twobit, do you have an example?

Yes, of course. I trying to compile main.ll and get main.obj. llvm toolchain is builded from " git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@211430 "

main.obj858 BDownload

main.ll703 BDownload

Hmm, something must be wrong at the MC layer. When I assemble the output of llc with gnu assembler (after replacing register numbers with names in seh directives), it seems to produce a correct .obj.

This got reverted in r211480, you should probably look into it.

@chapuni, Which tests broke? Is there a PR?
I saw that MultiJitTest.JitPool fails, but it was failing before my change too, so I ignored it.

I met entire JITTests.exe and lli.exe (and clang-interpreter) got broken, not only MultiJitTest.JitPool.

Command 0 Stderr:
pseudo instructions should be removed before code emission

UNREACHABLE executed at C:\bb-win7\msbuild-llvmclang-x64-msc17-DA\llvm-project\llvm\lib\Target\X86\X86CodeEmitter.cpp:1106!

FYI,
mingw x64 Release+Asserts http://bb.pgr.jp/builders/ninja-clang-x64-mingw64-RA/builds/3572
msc17 x64 Debug+Asserts http://bb.pgr.jp/builders/msbuild-llvmclang-x64-msc17-DA/builds/731

I could help investigating if you couldn't reproduce failures.

I was wrong with example above. Looks like the only problem is sorting and sometimes overlapping ranges of Function start & end. For example, in cases when exist two or more .text sections there is no way at all to sort function starts (in one .pdata section)
I think, solution is to create one .pdata (and maybe .xdata) for each exception info as MS VC compiler do

This was relanded in r211691, so everything is good to go.

My small patch (can be applied with git or by hand) with workaround some of "invalid .pdata contributions" issues. It creates new pdata and xdata section for every function

pdata_patch.patch4 KBDownload

Where is the patch? I don't see it in my inbox. That sounds like the
correct solution: the individual .pdata sections should be comdat
associative with the individual .text sections.

Patch was attached to phabricator (
http://reviews.llvm.org/file/data/h36ma4imbeunwhbu56fp/PHID-FILE-2w25ko3anof4isbyyzka/pdata_patch.patch
). Also I attach it to this email.

the individual .pdata sections should be comdat associative with the individual .text sections.

Maybe yes or maybe not. Having only one element in .pdata totally eliminate
sort routine.
Anyway this is a temporary solution. I hope someone of llvm maintainers
reimplement this patch as it should

2014-07-04 9:05 GMT+06:00 Reid Kleckner <rnk@google.com>:

Where is the patch? I don't see it in my inbox. That sounds like the
correct solution: the individual .pdata sections should be comdat
associative with the individual .text sections.

http://reviews.llvm.org/D4081

pdata_patch.patch4 KBDownload

The patch solves the problem but need to be updated to latest LLVM changes.

listhex added a subscriber: listhex.Aug 14 2014, 1:57 AM

Revision Contents

Path

Size

llvm/

trunk/

include/

llvm/

CodeGen/

MachineFrameInfo.h

3 lines

MC/

MCAsmInfo.h

1 line

Target/

TargetFrameLowering.h

13 lines

lib/

CodeGen/

AsmPrinter/

Win64Exception.cpp

13 lines

MachineFunction.cpp

45 lines

PrologEpilogInserter.cpp

83 lines

MC/

MCObjectFileInfo.cpp

15 lines

MCStreamer.cpp

2 lines

Target/

X86/

MCTargetDesc/

12 lines

9 lines

355 lines

5 lines

24 lines

37 lines

test/

CodeGen/

X86/

2007-05-05-Personality.ll

6 lines

2009-06-03-Win64SpillXMM.ll

4 lines

avx-intel-ocl.ll

60 lines

gcc_except_table.ll

10 lines

win64_eh.ll

170 lines

Diff 10704

llvm/trunk/include/llvm/CodeGen/MachineFrameInfo.h

Show First 20 Lines • Show All 478 Lines • ▼ Show 20 Lines	public:

/// CreateFixedObject - Create a new object at a fixed location on the stack.		/// CreateFixedObject - Create a new object at a fixed location on the stack.
/// All fixed objects should be created before other objects are created for		/// All fixed objects should be created before other objects are created for
/// efficiency. By default, fixed objects are immutable. This returns an		/// efficiency. By default, fixed objects are immutable. This returns an
/// index with a negative value.		/// index with a negative value.
///		///
int CreateFixedObject(uint64_t Size, int64_t SPOffset, bool Immutable);		int CreateFixedObject(uint64_t Size, int64_t SPOffset, bool Immutable);

		/// CreateFixedSpillStackObject - Create a spill slot at a fixed location
		/// on the stack. Returns an index with a negative value.
		int CreateFixedSpillStackObject(uint64_t Size, int64_t SPOffset);

/// isFixedObjectIndex - Returns true if the specified index corresponds to a		/// isFixedObjectIndex - Returns true if the specified index corresponds to a
/// fixed stack object.		/// fixed stack object.
bool isFixedObjectIndex(int ObjectIdx) const {		bool isFixedObjectIndex(int ObjectIdx) const {
return ObjectIdx < 0 && (ObjectIdx >= -(int)NumFixedObjects);		return ObjectIdx < 0 && (ObjectIdx >= -(int)NumFixedObjects);
}		}

/// isImmutableObjectIndex - Returns true if the specified index corresponds		/// isImmutableObjectIndex - Returns true if the specified index corresponds
▲ Show 20 Lines • Show All 89 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/MC/MCAsmInfo.h

Show First 20 Lines • Show All 475 Lines • ▼ Show 20 Lines	ExceptionHandling::ExceptionsType getExceptionHandlingType() const {
return ExceptionsType;		return ExceptionsType;
}		}
WinEH::EncodingType getWinEHEncodingType() const {		WinEH::EncodingType getWinEHEncodingType() const {
return WinEHEncodingType;		return WinEHEncodingType;
}		}
bool isExceptionHandlingDwarf() const {		bool isExceptionHandlingDwarf() const {
return (ExceptionsType == ExceptionHandling::DwarfCFI \|\|		return (ExceptionsType == ExceptionHandling::DwarfCFI \|\|
ExceptionsType == ExceptionHandling::ARM \|\|		ExceptionsType == ExceptionHandling::ARM \|\|
		// Win64 handler data still uses DWARF LSDA encoding.
ExceptionsType == ExceptionHandling::Win64);		ExceptionsType == ExceptionHandling::Win64);
}		}
bool doesDwarfUseRelocationsAcrossSections() const {		bool doesDwarfUseRelocationsAcrossSections() const {
return DwarfUsesRelocationsAcrossSections;		return DwarfUsesRelocationsAcrossSections;
}		}
bool doDwarfFDESymbolsUseAbsDiff() const { return DwarfFDESymbolsUseAbsDiff; }		bool doDwarfFDESymbolsUseAbsDiff() const { return DwarfFDESymbolsUseAbsDiff; }
bool useDwarfRegNumForCFI() const { return DwarfRegNumForCFI; }		bool useDwarfRegNumForCFI() const { return DwarfRegNumForCFI; }
bool useParensForSymbolVariant() const { return UseParensForSymbolVariant; }		bool useParensForSymbolVariant() const { return UseParensForSymbolVariant; }
Show All 26 Lines

llvm/trunk/include/llvm/Target/TargetFrameLowering.h

Show First 20 Lines • Show All 87 Lines • ▼ Show 20 Lines	public:
///		///
int getOffsetOfLocalArea() const { return LocalAreaOffset; }		int getOffsetOfLocalArea() const { return LocalAreaOffset; }

/// isFPCloseToIncomingSP - Return true if the frame pointer is close to		/// isFPCloseToIncomingSP - Return true if the frame pointer is close to
/// the incoming stack pointer, false if it is close to the post-prologue		/// the incoming stack pointer, false if it is close to the post-prologue
/// stack pointer.		/// stack pointer.
virtual bool isFPCloseToIncomingSP() const { return true; }		virtual bool isFPCloseToIncomingSP() const { return true; }

		/// assignCalleeSavedSpillSlots - Allows target to override spill slot
		/// assignment logic. If implemented, assignCalleeSavedSpillSlots() should
		/// assign frame slots to all CSI entries and return true. If this method
		/// returns false, spill slots will be assigned using generic implementation.
		/// assignCalleeSavedSpillSlots() may add, delete or rearrange elements of
		/// CSI.
		virtual bool
		assignCalleeSavedSpillSlots(MachineFunction &MF,
		const TargetRegisterInfo *TRI,
		std::vector<CalleeSavedInfo> &CSI) const {
		return false;
		}

/// getCalleeSavedSpillSlots - This method returns a pointer to an array of		/// getCalleeSavedSpillSlots - This method returns a pointer to an array of
/// pairs, that contains an entry for each callee saved register that must be		/// pairs, that contains an entry for each callee saved register that must be
/// spilled to a particular stack location if it is spilled.		/// spilled to a particular stack location if it is spilled.
///		///
/// Each entry in this array contains a <register,offset> pair, indicating the		/// Each entry in this array contains a <register,offset> pair, indicating the
/// fixed offset from the incoming stack pointer that each register should be		/// fixed offset from the incoming stack pointer that each register should be
/// spilled at. If a register is not listed here, the code generator is		/// spilled at. If a register is not listed here, the code generator is
/// allowed to spill it anywhere it chooses.		/// allowed to spill it anywhere it chooses.
▲ Show 20 Lines • Show All 121 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/AsmPrinter/Win64Exception.cpp

Show First 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	void Win64Exception::beginFunction(const MachineFunction *MF) {
if (!shouldEmitPersonality && !shouldEmitMoves)		if (!shouldEmitPersonality && !shouldEmitMoves)
return;		return;

Asm->OutStreamer.EmitWin64EHStartProc(Asm->CurrentFnSym);		Asm->OutStreamer.EmitWin64EHStartProc(Asm->CurrentFnSym);

if (!shouldEmitPersonality)		if (!shouldEmitPersonality)
return;		return;

MCSymbol *GCCHandlerSym =		const MCSymbol *PersHandlerSym =
Asm->GetExternalSymbolSymbol("_GCC_specific_handler");		TLOF.getCFIPersonalitySymbol(Per, *Asm->Mang, Asm->TM, MMI);
Asm->OutStreamer.EmitWin64EHHandler(GCCHandlerSym, true, true);		Asm->OutStreamer.EmitWin64EHHandler(PersHandlerSym, true, true);

Asm->OutStreamer.EmitLabel(Asm->GetTempSymbol("eh_func_begin",		Asm->OutStreamer.EmitLabel(Asm->GetTempSymbol("eh_func_begin",
Asm->getFunctionNumber()));		Asm->getFunctionNumber()));
}		}

/// endFunction - Gather and emit post-function exception information.		/// endFunction - Gather and emit post-function exception information.
///		///
void Win64Exception::endFunction(const MachineFunction *) {		void Win64Exception::endFunction(const MachineFunction *) {
if (!shouldEmitPersonality && !shouldEmitMoves)		if (!shouldEmitPersonality && !shouldEmitMoves)
return;		return;

Asm->OutStreamer.EmitLabel(Asm->GetTempSymbol("eh_func_end",		Asm->OutStreamer.EmitLabel(Asm->GetTempSymbol("eh_func_end",
Asm->getFunctionNumber()));		Asm->getFunctionNumber()));

// Map all labels and get rid of any dead landing pads.		// Map all labels and get rid of any dead landing pads.
MMI->TidyLandingPads();		MMI->TidyLandingPads();

if (shouldEmitPersonality) {		if (shouldEmitPersonality) {
const TargetLoweringObjectFile &TLOF = Asm->getObjFileLowering();
const Function *Per = MMI->getPersonalities()[MMI->getPersonalityIndex()];
const MCSymbol *Sym =
TLOF.getCFIPersonalitySymbol(Per, *Asm->Mang, Asm->TM, MMI);

Asm->OutStreamer.PushSection();		Asm->OutStreamer.PushSection();
Asm->OutStreamer.EmitWin64EHHandlerData();		Asm->OutStreamer.EmitWin64EHHandlerData();
Asm->OutStreamer.EmitValue(MCSymbolRefExpr::Create(Sym, Asm->OutContext),
4);
emitExceptionTable();		emitExceptionTable();
Asm->OutStreamer.PopSection();		Asm->OutStreamer.PopSection();
}		}
Asm->OutStreamer.EmitWin64EHEndProc();		Asm->OutStreamer.EmitWin64EHEndProc();
}		}

llvm/trunk/lib/CodeGen/MachineFunction.cpp

Show First 20 Lines • Show All 451 Lines • ▼ Show 20 Lines	unsigned MachineFunction::addLiveIn(unsigned PReg,
VReg = MRI.createVirtualRegister(RC);		VReg = MRI.createVirtualRegister(RC);
MRI.addLiveIn(PReg, VReg);		MRI.addLiveIn(PReg, VReg);
return VReg;		return VReg;
}		}

/// getJTISymbol - Return the MCSymbol for the specified non-empty jump table.		/// getJTISymbol - Return the MCSymbol for the specified non-empty jump table.
/// If isLinkerPrivate is specified, an 'l' label is returned, otherwise a		/// If isLinkerPrivate is specified, an 'l' label is returned, otherwise a
/// normal 'L' label is returned.		/// normal 'L' label is returned.
MCSymbol *MachineFunction::getJTISymbol(unsigned JTI, MCContext &Ctx,		MCSymbol *MachineFunction::getJTISymbol(unsigned JTI, MCContext &Ctx,
bool isLinkerPrivate) const {		bool isLinkerPrivate) const {
const DataLayout *DL = getTarget().getDataLayout();		const DataLayout *DL = getTarget().getDataLayout();
assert(JumpTableInfo && "No jump tables");		assert(JumpTableInfo && "No jump tables");
assert(JTI < JumpTableInfo->getJumpTables().size() && "Invalid JTI!");		assert(JTI < JumpTableInfo->getJumpTables().size() && "Invalid JTI!");

const char *Prefix = isLinkerPrivate ? DL->getLinkerPrivateGlobalPrefix() :		const char *Prefix = isLinkerPrivate ? DL->getLinkerPrivateGlobalPrefix() :
DL->getPrivateGlobalPrefix();		DL->getPrivateGlobalPrefix();
SmallString<60> Name;		SmallString<60> Name;
▲ Show 20 Lines • Show All 56 Lines • ▼ Show 20 Lines
}		}

/// CreateSpillStackObject - Create a new statically sized stack object that		/// CreateSpillStackObject - Create a new statically sized stack object that
/// represents a spill slot, returning a nonnegative identifier to represent		/// represents a spill slot, returning a nonnegative identifier to represent
/// it.		/// it.
///		///
int MachineFrameInfo::CreateSpillStackObject(uint64_t Size,		int MachineFrameInfo::CreateSpillStackObject(uint64_t Size,
unsigned Alignment) {		unsigned Alignment) {
Alignment =		Alignment = clampStackAlignment(
clampStackAlignment(!getFrameLowering()->isStackRealignable() \|\|		!getFrameLowering()->isStackRealignable() \|\| !RealignOption, Alignment,
!RealignOption,		getFrameLowering()->getStackAlignment());
Alignment, getFrameLowering()->getStackAlignment());
CreateStackObject(Size, Alignment, true);		CreateStackObject(Size, Alignment, true);
int Index = (int)Objects.size() - NumFixedObjects - 1;		int Index = (int)Objects.size() - NumFixedObjects - 1;
ensureMaxAlignment(Alignment);		ensureMaxAlignment(Alignment);
return Index;		return Index;
}		}

/// CreateVariableSizedObject - Notify the MachineFrameInfo object that a		/// CreateVariableSizedObject - Notify the MachineFrameInfo object that a
/// variable sized object has been created. This must be created whenever a		/// variable sized object has been created. This must be created whenever a
/// variable sized object is created, whether or not the index returned is		/// variable sized object is created, whether or not the index returned is
/// actually used.		/// actually used.
///		///
int MachineFrameInfo::CreateVariableSizedObject(unsigned Alignment,		int MachineFrameInfo::CreateVariableSizedObject(unsigned Alignment,
const AllocaInst *Alloca) {		const AllocaInst *Alloca) {
HasVarSizedObjects = true;		HasVarSizedObjects = true;
Alignment =		Alignment = clampStackAlignment(
clampStackAlignment(!getFrameLowering()->isStackRealignable() \|\|		!getFrameLowering()->isStackRealignable() \|\| !RealignOption, Alignment,
!RealignOption,		getFrameLowering()->getStackAlignment());
Alignment, getFrameLowering()->getStackAlignment());
Objects.push_back(StackObject(0, Alignment, 0, false, false, Alloca));		Objects.push_back(StackObject(0, Alignment, 0, false, false, Alloca));
ensureMaxAlignment(Alignment);		ensureMaxAlignment(Alignment);
return (int)Objects.size()-NumFixedObjects-1;		return (int)Objects.size()-NumFixedObjects-1;
}		}

/// CreateFixedObject - Create a new object at a fixed location on the stack.		/// CreateFixedObject - Create a new object at a fixed location on the stack.
/// All fixed objects should be created before other objects are created for		/// All fixed objects should be created before other objects are created for
/// efficiency. By default, fixed objects are immutable. This returns an		/// efficiency. By default, fixed objects are immutable. This returns an
/// index with a negative value.		/// index with a negative value.
///		///
int MachineFrameInfo::CreateFixedObject(uint64_t Size, int64_t SPOffset,		int MachineFrameInfo::CreateFixedObject(uint64_t Size, int64_t SPOffset,
bool Immutable) {		bool Immutable) {
assert(Size != 0 && "Cannot allocate zero size fixed stack objects!");		assert(Size != 0 && "Cannot allocate zero size fixed stack objects!");
// The alignment of the frame index can be determined from its offset from		// The alignment of the frame index can be determined from its offset from
// the incoming frame position. If the frame object is at offset 32 and		// the incoming frame position. If the frame object is at offset 32 and
// the stack is guaranteed to be 16-byte aligned, then we know that the		// the stack is guaranteed to be 16-byte aligned, then we know that the
// object is 16-byte aligned.		// object is 16-byte aligned.
unsigned StackAlign = getFrameLowering()->getStackAlignment();		unsigned StackAlign = getFrameLowering()->getStackAlignment();
unsigned Align = MinAlign(SPOffset, StackAlign);		unsigned Align = MinAlign(SPOffset, StackAlign);
Align =		Align = clampStackAlignment(!getFrameLowering()->isStackRealignable() \|\|
clampStackAlignment(!getFrameLowering()->isStackRealignable() \|\|
!RealignOption,		!RealignOption,
Align, getFrameLowering()->getStackAlignment());		Align, getFrameLowering()->getStackAlignment());
Objects.insert(Objects.begin(), StackObject(Size, Align, SPOffset, Immutable,		Objects.insert(Objects.begin(), StackObject(Size, Align, SPOffset, Immutable,
/isSS/ false,		/isSS/ false,
/Alloca/ nullptr));		/Alloca/ nullptr));
return -++NumFixedObjects;		return -++NumFixedObjects;
}		}

		/// CreateFixedSpillStackObject - Create a spill slot at a fixed location
		/// on the stack. Returns an index with a negative value.
		int MachineFrameInfo::CreateFixedSpillStackObject(uint64_t Size,
		int64_t SPOffset) {
		unsigned StackAlign = getFrameLowering()->getStackAlignment();
		unsigned Align = MinAlign(SPOffset, StackAlign);
		Align = clampStackAlignment(!getFrameLowering()->isStackRealignable() \|\|
		!RealignOption,
		Align, getFrameLowering()->getStackAlignment());
		Objects.insert(Objects.begin(), StackObject(Size, Align, SPOffset,
		/Immutable/ true,
		/isSS/ true,
		/Alloca/ nullptr));
		return -++NumFixedObjects;
		}

BitVector		BitVector
MachineFrameInfo::getPristineRegs(const MachineBasicBlock *MBB) const {		MachineFrameInfo::getPristineRegs(const MachineBasicBlock *MBB) const {
assert(MBB && "MBB must be valid");		assert(MBB && "MBB must be valid");
const MachineFunction *MF = MBB->getParent();		const MachineFunction *MF = MBB->getParent();
assert(MF && "MBB must be part of a MachineFunction");		assert(MF && "MBB must be part of a MachineFunction");
const TargetMachine &TM = MF->getTarget();		const TargetMachine &TM = MF->getTarget();
const TargetRegisterInfo *TRI = TM.getRegisterInfo();		const TargetRegisterInfo *TRI = TM.getRegisterInfo();
▲ Show 20 Lines • Show All 252 Lines • ▼ Show 20 Lines	static bool CanShareConstantPoolEntry(const Constant A, const Constant B,
// If they have the same type but weren't the same constant, quickly		// If they have the same type but weren't the same constant, quickly
// reject them.		// reject them.
if (A->getType() == B->getType()) return false;		if (A->getType() == B->getType()) return false;

// We can't handle structs or arrays.		// We can't handle structs or arrays.
if (isa<StructType>(A->getType()) \|\| isa<ArrayType>(A->getType()) \|\|		if (isa<StructType>(A->getType()) \|\| isa<ArrayType>(A->getType()) \|\|
isa<StructType>(B->getType()) \|\| isa<ArrayType>(B->getType()))		isa<StructType>(B->getType()) \|\| isa<ArrayType>(B->getType()))
return false;		return false;

// For now, only support constants with the same size.		// For now, only support constants with the same size.
uint64_t StoreSize = TD->getTypeStoreSize(A->getType());		uint64_t StoreSize = TD->getTypeStoreSize(A->getType());
if (StoreSize != TD->getTypeStoreSize(B->getType()) \|\|		if (StoreSize != TD->getTypeStoreSize(B->getType()) \|\| StoreSize > 128)
StoreSize > 128)
return false;		return false;

Type IntTy = IntegerType::get(A->getContext(), StoreSize8);		Type IntTy = IntegerType::get(A->getContext(), StoreSize8);

// Try constant folding a bitcast of both instructions to an integer. If we		// Try constant folding a bitcast of both instructions to an integer. If we
// get two identical ConstantInt's, then we are good to share them. We use		// get two identical ConstantInt's, then we are good to share them. We use
// the constant folding APIs to do this so that we get the benefit of		// the constant folding APIs to do this so that we get the benefit of
// DataLayout.		// DataLayout.
Show All 12 Lines	static bool CanShareConstantPoolEntry(const Constant A, const Constant B,

return A == B;		return A == B;
}		}

/// getConstantPoolIndex - Create a new entry in the constant pool or return		/// getConstantPoolIndex - Create a new entry in the constant pool or return
/// an existing one. User must specify the log2 of the minimum required		/// an existing one. User must specify the log2 of the minimum required
/// alignment for the object.		/// alignment for the object.
///		///
unsigned MachineConstantPool::getConstantPoolIndex(const Constant *C,		unsigned MachineConstantPool::getConstantPoolIndex(const Constant *C,
unsigned Alignment) {		unsigned Alignment) {
assert(Alignment && "Alignment must be specified!");		assert(Alignment && "Alignment must be specified!");
if (Alignment > PoolAlignment) PoolAlignment = Alignment;		if (Alignment > PoolAlignment) PoolAlignment = Alignment;

// Check to see if we already have this constant.		// Check to see if we already have this constant.
//		//
// FIXME, this could be made much more efficient for large constant pools.		// FIXME, this could be made much more efficient for large constant pools.
for (unsigned i = 0, e = Constants.size(); i != e; ++i)		for (unsigned i = 0, e = Constants.size(); i != e; ++i)
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

llvm/trunk/lib/CodeGen/PrologEpilogInserter.cpp

Show First 20 Lines • Show All 262 Lines • ▼ Show 20 Lines	for (unsigned i = 0; CSRegs[i]; ++i) {
unsigned Reg = CSRegs[i];		unsigned Reg = CSRegs[i];
// Functions which call __builtin_unwind_init get all their registers saved.		// Functions which call __builtin_unwind_init get all their registers saved.
if (F.getRegInfo().isPhysRegUsed(Reg) \|\| F.getMMI().callsUnwindInit()) {		if (F.getRegInfo().isPhysRegUsed(Reg) \|\| F.getMMI().callsUnwindInit()) {
// If the reg is modified, save it!		// If the reg is modified, save it!
CSI.push_back(CalleeSavedInfo(Reg));		CSI.push_back(CalleeSavedInfo(Reg));
}		}
}		}

		if (!TFI->assignCalleeSavedSpillSlots(F, RegInfo, CSI)) {
		// If target doesn't implement this, use generic code.

if (CSI.empty())		if (CSI.empty())
return; // Early exit if no callee saved registers are modified!		return; // Early exit if no callee saved registers are modified!

unsigned NumFixedSpillSlots;		unsigned NumFixedSpillSlots;
const TargetFrameLowering::SpillSlot *FixedSpillSlots =		const TargetFrameLowering::SpillSlot *FixedSpillSlots =
TFI->getCalleeSavedSpillSlots(NumFixedSpillSlots);		TFI->getCalleeSavedSpillSlots(NumFixedSpillSlots);

// Now that we know which registers need to be saved and restored, allocate		// Now that we know which registers need to be saved and restored, allocate
// stack slots for them.		// stack slots for them.
for (std::vector<CalleeSavedInfo>::iterator		for (std::vector<CalleeSavedInfo>::iterator I = CSI.begin(), E = CSI.end();
I = CSI.begin(), E = CSI.end(); I != E; ++I) {		I != E; ++I) {
unsigned Reg = I->getReg();		unsigned Reg = I->getReg();
const TargetRegisterClass *RC = RegInfo->getMinimalPhysRegClass(Reg);		const TargetRegisterClass *RC = RegInfo->getMinimalPhysRegClass(Reg);

int FrameIdx;		int FrameIdx;
if (RegInfo->hasReservedSpillSlot(F, Reg, FrameIdx)) {		if (RegInfo->hasReservedSpillSlot(F, Reg, FrameIdx)) {
I->setFrameIdx(FrameIdx);		I->setFrameIdx(FrameIdx);
continue;		continue;
}		}

// Check to see if this physreg must be spilled to a particular stack slot		// Check to see if this physreg must be spilled to a particular stack slot
// on this target.		// on this target.
const TargetFrameLowering::SpillSlot *FixedSlot = FixedSpillSlots;		const TargetFrameLowering::SpillSlot *FixedSlot = FixedSpillSlots;
while (FixedSlot != FixedSpillSlots+NumFixedSpillSlots &&		while (FixedSlot != FixedSpillSlots + NumFixedSpillSlots &&
FixedSlot->Reg != Reg)		FixedSlot->Reg != Reg)
++FixedSlot;		++FixedSlot;

if (FixedSlot == FixedSpillSlots + NumFixedSpillSlots) {		if (FixedSlot == FixedSpillSlots + NumFixedSpillSlots) {
// Nope, just spill it anywhere convenient.		// Nope, just spill it anywhere convenient.
unsigned Align = RC->getAlignment();		unsigned Align = RC->getAlignment();
unsigned StackAlign = TFI->getStackAlignment();		unsigned StackAlign = TFI->getStackAlignment();

// We may not be able to satisfy the desired alignment specification of		// We may not be able to satisfy the desired alignment specification of
// the TargetRegisterClass if the stack alignment is smaller. Use the		// the TargetRegisterClass if the stack alignment is smaller. Use the
// min.		// min.
Align = std::min(Align, StackAlign);		Align = std::min(Align, StackAlign);
FrameIdx = MFI->CreateStackObject(RC->getSize(), Align, true);		FrameIdx = MFI->CreateStackObject(RC->getSize(), Align, true);
if ((unsigned)FrameIdx < MinCSFrameIndex) MinCSFrameIndex = FrameIdx;		if ((unsigned)FrameIdx < MinCSFrameIndex) MinCSFrameIndex = FrameIdx;
if ((unsigned)FrameIdx > MaxCSFrameIndex) MaxCSFrameIndex = FrameIdx;		if ((unsigned)FrameIdx > MaxCSFrameIndex) MaxCSFrameIndex = FrameIdx;
} else {		} else {
// Spill it to the stack where we must.		// Spill it to the stack where we must.
FrameIdx = MFI->CreateFixedObject(RC->getSize(), FixedSlot->Offset, true);		FrameIdx =
		MFI->CreateFixedSpillStackObject(RC->getSize(), FixedSlot->Offset);
}		}

I->setFrameIdx(FrameIdx);		I->setFrameIdx(FrameIdx);
}		}
		}

MFI->setCalleeSavedInfo(CSI);		MFI->setCalleeSavedInfo(CSI);
}		}

/// insertCSRSpillsAndRestores - Insert spill and restore code for		/// insertCSRSpillsAndRestores - Insert spill and restore code for
/// callee saved registers used in the function.		/// callee saved registers used in the function.
///		///
void PEI::insertCSRSpillsAndRestores(MachineFunction &Fn) {		void PEI::insertCSRSpillsAndRestores(MachineFunction &Fn) {
▲ Show 20 Lines • Show All 592 Lines • Show Last 20 Lines

llvm/trunk/lib/MC/MCObjectFileInfo.cpp

Show First 20 Lines • Show All 626 Lines • ▼ Show 20 Lines	StaticDtorSection =
COFF::IMAGE_SCN_MEM_WRITE,		COFF::IMAGE_SCN_MEM_WRITE,
SectionKind::getDataRel());		SectionKind::getDataRel());
}		}

// FIXME: We're emitting LSDA info into a readonly section on COFF, even		// FIXME: We're emitting LSDA info into a readonly section on COFF, even
// though it contains relocatable pointers. In PIC mode, this is probably a		// though it contains relocatable pointers. In PIC mode, this is probably a
// big runtime hit for C++ apps. Either the contents of the LSDA need to be		// big runtime hit for C++ apps. Either the contents of the LSDA need to be
// adjusted or this should be a data section.		// adjusted or this should be a data section.
LSDASection =		assert(T.isOSWindows() && "Windows is the only supported COFF target");
Ctx->getCOFFSection(".gcc_except_table",		if (T.getArch() == Triple::x86_64) {
		// On Windows 64 with SEH, the LSDA is emitted into the .xdata section
		LSDASection = 0;
		} else {
		LSDASection = Ctx->getCOFFSection(".gcc_except_table",
COFF::IMAGE_SCN_CNT_INITIALIZED_DATA \|		COFF::IMAGE_SCN_CNT_INITIALIZED_DATA \|
COFF::IMAGE_SCN_MEM_READ,		COFF::IMAGE_SCN_MEM_READ,
SectionKind::getReadOnly());		SectionKind::getReadOnly());
		}

// Debug info.		// Debug info.
COFFDebugSymbolsSection =		COFFDebugSymbolsSection =
Ctx->getCOFFSection(".debug$S",		Ctx->getCOFFSection(".debug$S",
COFF::IMAGE_SCN_MEM_DISCARDABLE \|		COFF::IMAGE_SCN_MEM_DISCARDABLE \|
COFF::IMAGE_SCN_CNT_INITIALIZED_DATA \|		COFF::IMAGE_SCN_CNT_INITIALIZED_DATA \|
COFF::IMAGE_SCN_MEM_READ,		COFF::IMAGE_SCN_MEM_READ,
SectionKind::getMetadata());		SectionKind::getMetadata());
▲ Show 20 Lines • Show All 211 Lines • Show Last 20 Lines

llvm/trunk/lib/MC/MCStreamer.cpp

	Show First 20 Lines • Show All 520 Lines • ▼ Show 20 Lines

	void MCStreamer::EmitWin64EHSetFrame(unsigned Register, unsigned Offset) {			void MCStreamer::EmitWin64EHSetFrame(unsigned Register, unsigned Offset) {
	EnsureValidW64UnwindInfo();			EnsureValidW64UnwindInfo();
	MCWin64EHUnwindInfo *CurFrame = CurrentW64UnwindInfo;			MCWin64EHUnwindInfo *CurFrame = CurrentW64UnwindInfo;
	if (CurFrame->LastFrameInst >= 0)			if (CurFrame->LastFrameInst >= 0)
	report_fatal_error("Frame register and offset already specified!");			report_fatal_error("Frame register and offset already specified!");
	if (Offset & 0x0F)			if (Offset & 0x0F)
	report_fatal_error("Misaligned frame pointer offset!");			report_fatal_error("Misaligned frame pointer offset!");
				if (Offset > 240)
				report_fatal_error("Frame offset must be less than or equal to 240!");
	MCSymbol *Label = getContext().CreateTempSymbol();			MCSymbol *Label = getContext().CreateTempSymbol();
	MCWin64EHInstruction Inst(Win64EH::UOP_SetFPReg, Label, Register, Offset);			MCWin64EHInstruction Inst(Win64EH::UOP_SetFPReg, Label, Register, Offset);
	EmitLabel(Label);			EmitLabel(Label);
	CurFrame->LastFrameInst = CurFrame->Instructions.size();			CurFrame->LastFrameInst = CurFrame->Instructions.size();
	CurFrame->Instructions.push_back(Inst);			CurFrame->Instructions.push_back(Inst);
	}			}

	void MCStreamer::EmitWin64EHAllocStack(unsigned Size) {			void MCStreamer::EmitWin64EHAllocStack(unsigned Size) {
	▲ Show 20 Lines • Show All 101 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/MCTargetDesc/X86MCAsmInfo.cpp

	Show First 20 Lines • Show All 136 Lines • ▼ Show 20 Lines
	getNonexecutableStackSection(MCContext &Ctx) const {			getNonexecutableStackSection(MCContext &Ctx) const {
	return Ctx.getELFSection(".note.GNU-stack", ELF::SHT_PROGBITS,			return Ctx.getELFSection(".note.GNU-stack", ELF::SHT_PROGBITS,
	0, SectionKind::getMetadata());			0, SectionKind::getMetadata());
	}			}

	void X86MCAsmInfoMicrosoft::anchor() { }			void X86MCAsmInfoMicrosoft::anchor() { }

	X86MCAsmInfoMicrosoft::X86MCAsmInfoMicrosoft(const Triple &Triple) {			X86MCAsmInfoMicrosoft::X86MCAsmInfoMicrosoft(const Triple &Triple) {
	if (Triple.getArch() == Triple::x86_64)			if (Triple.getArch() == Triple::x86_64) {
	PrivateGlobalPrefix = ".L";			PrivateGlobalPrefix = ".L";
				PointerSize = 8;
				ExceptionsType = ExceptionHandling::Win64;
				}

	AssemblerDialect = AsmWriterFlavor;			AssemblerDialect = AsmWriterFlavor;

	TextAlignFillValue = 0x90;			TextAlignFillValue = 0x90;

	AllowAtInName = true;			AllowAtInName = true;

	UseIntegratedAssembler = true;			UseIntegratedAssembler = true;
	}			}

	void X86MCAsmInfoGNUCOFF::anchor() { }			void X86MCAsmInfoGNUCOFF::anchor() { }

	X86MCAsmInfoGNUCOFF::X86MCAsmInfoGNUCOFF(const Triple &Triple) {			X86MCAsmInfoGNUCOFF::X86MCAsmInfoGNUCOFF(const Triple &Triple) {
				assert(Triple.isOSWindows() && "Windows is the only supported COFF target");
	if (Triple.getArch() == Triple::x86_64) {			if (Triple.getArch() == Triple::x86_64) {
	PrivateGlobalPrefix = ".L";			PrivateGlobalPrefix = ".L";
	PointerSize = 8;			PointerSize = 8;
				ExceptionsType = ExceptionHandling::Win64;
				} else {
				ExceptionsType = ExceptionHandling::DwarfCFI;
	}			}

	AssemblerDialect = AsmWriterFlavor;			AssemblerDialect = AsmWriterFlavor;

	TextAlignFillValue = 0x90;			TextAlignFillValue = 0x90;

	// Exceptions handling
	ExceptionsType = ExceptionHandling::DwarfCFI;

	UseIntegratedAssembler = true;			UseIntegratedAssembler = true;
	}			}

llvm/trunk/lib/Target/X86/X86FrameLowering.h

	Show All 21 Lines
	class X86TargetMachine;			class X86TargetMachine;

	class X86FrameLowering : public TargetFrameLowering {			class X86FrameLowering : public TargetFrameLowering {
	public:			public:
	explicit X86FrameLowering(StackDirection D, unsigned StackAl, int LAO)			explicit X86FrameLowering(StackDirection D, unsigned StackAl, int LAO)
	: TargetFrameLowering(StackGrowsDown, StackAl, LAO) {}			: TargetFrameLowering(StackGrowsDown, StackAl, LAO) {}

	void emitCalleeSavedFrameMoves(MachineBasicBlock &MBB,			void emitCalleeSavedFrameMoves(MachineBasicBlock &MBB,
	MachineBasicBlock::iterator MBBI, DebugLoc DL,			MachineBasicBlock::iterator MBBI,
	unsigned FramePtr) const;			DebugLoc DL) const;

	/// emitProlog/emitEpilog - These methods insert prolog and epilog code into			/// emitProlog/emitEpilog - These methods insert prolog and epilog code into
	/// the function.			/// the function.
	void emitPrologue(MachineFunction &MF) const override;			void emitPrologue(MachineFunction &MF) const override;
	void emitEpilogue(MachineFunction &MF, MachineBasicBlock &MBB) const override;			void emitEpilogue(MachineFunction &MF, MachineBasicBlock &MBB) const override;

	void adjustForSegmentedStacks(MachineFunction &MF) const override;			void adjustForSegmentedStacks(MachineFunction &MF) const override;

	void adjustForHiPEPrologue(MachineFunction &MF) const override;			void adjustForHiPEPrologue(MachineFunction &MF) const override;

	void processFunctionBeforeCalleeSavedScan(MachineFunction &MF,			void processFunctionBeforeCalleeSavedScan(MachineFunction &MF,
	RegScavenger *RS = nullptr) const override;			RegScavenger *RS = nullptr) const override;

				bool
				assignCalleeSavedSpillSlots(MachineFunction &MF,
				const TargetRegisterInfo *TRI,
				std::vector<CalleeSavedInfo> &CSI) const override;

	bool spillCalleeSavedRegisters(MachineBasicBlock &MBB,			bool spillCalleeSavedRegisters(MachineBasicBlock &MBB,
	MachineBasicBlock::iterator MI,			MachineBasicBlock::iterator MI,
	const std::vector<CalleeSavedInfo> &CSI,			const std::vector<CalleeSavedInfo> &CSI,
	const TargetRegisterInfo *TRI) const override;			const TargetRegisterInfo *TRI) const override;

	bool restoreCalleeSavedRegisters(MachineBasicBlock &MBB,			bool restoreCalleeSavedRegisters(MachineBasicBlock &MBB,
	MachineBasicBlock::iterator MI,			MachineBasicBlock::iterator MI,
	const std::vector<CalleeSavedInfo> &CSI,			const std::vector<CalleeSavedInfo> &CSI,
	Show All 17 Lines

llvm/trunk/lib/Target/X86/X86FrameLowering.cpp

Show All 23 Lines
#include "llvm/CodeGen/MachineModuleInfo.h"		#include "llvm/CodeGen/MachineModuleInfo.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/MC/MCAsmInfo.h"		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCSymbol.h"		#include "llvm/MC/MCSymbol.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Target/TargetOptions.h"		#include "llvm/Target/TargetOptions.h"
		#include "llvm/Support/Debug.h"

using namespace llvm;		using namespace llvm;

// FIXME: completely move here.		// FIXME: completely move here.
extern cl::opt<bool> ForceStackAlign;		extern cl::opt<bool> ForceStackAlign;

bool X86FrameLowering::hasReservedCallFrame(const MachineFunction &MF) const {		bool X86FrameLowering::hasReservedCallFrame(const MachineFunction &MF) const {
return !MF.getFrameInfo()->hasVarSizedObjects();		return !MF.getFrameInfo()->hasVarSizedObjects();
▲ Show 20 Lines • Show All 260 Lines • ▼ Show 20 Lines	for (MachineRegisterInfo::livein_iterator II = MF.getRegInfo().livein_begin(),
if (Reg == X86::EAX \|\| Reg == X86::AX \|\|		if (Reg == X86::EAX \|\| Reg == X86::AX \|\|
Reg == X86::AH \|\| Reg == X86::AL)		Reg == X86::AH \|\| Reg == X86::AL)
return true;		return true;
}		}

return false;		return false;
}		}

void X86FrameLowering::emitCalleeSavedFrameMoves(		void
MachineBasicBlock &MBB, MachineBasicBlock::iterator MBBI, DebugLoc DL,		X86FrameLowering::emitCalleeSavedFrameMoves(MachineBasicBlock &MBB,
unsigned FramePtr) const {		MachineBasicBlock::iterator MBBI,
		DebugLoc DL) const {
MachineFunction &MF = *MBB.getParent();		MachineFunction &MF = *MBB.getParent();
MachineFrameInfo *MFI = MF.getFrameInfo();		MachineFrameInfo *MFI = MF.getFrameInfo();
MachineModuleInfo &MMI = MF.getMMI();		MachineModuleInfo &MMI = MF.getMMI();
const MCRegisterInfo *MRI = MMI.getContext().getRegisterInfo();		const MCRegisterInfo *MRI = MMI.getContext().getRegisterInfo();
const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();		const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();

// Add callee saved registers to move list.		// Add callee saved registers to move list.
const std::vector<CalleeSavedInfo> &CSI = MFI->getCalleeSavedInfo();		const std::vector<CalleeSavedInfo> &CSI = MFI->getCalleeSavedInfo();
if (CSI.empty()) return;		if (CSI.empty()) return;

const X86RegisterInfo *RegInfo =
static_cast<const X86RegisterInfo *>(MF.getTarget().getRegisterInfo());
bool HasFP = hasFP(MF);

// Calculate amount of bytes used for return address storing.
int stackGrowth = -RegInfo->getSlotSize();

// FIXME: This is dirty hack. The code itself is pretty mess right now.
// It should be rewritten from scratch and generalized sometimes.

// Determine maximum offset (minimum due to stack growth).
int64_t MaxOffset = 0;
for (std::vector<CalleeSavedInfo>::const_iterator
I = CSI.begin(), E = CSI.end(); I != E; ++I)
MaxOffset = std::min(MaxOffset,
MFI->getObjectOffset(I->getFrameIdx()));

// Calculate offsets.		// Calculate offsets.
int64_t saveAreaOffset = (HasFP ? 3 : 2) * stackGrowth;
for (std::vector<CalleeSavedInfo>::const_iterator		for (std::vector<CalleeSavedInfo>::const_iterator
I = CSI.begin(), E = CSI.end(); I != E; ++I) {		I = CSI.begin(), E = CSI.end(); I != E; ++I) {
int64_t Offset = MFI->getObjectOffset(I->getFrameIdx());		int64_t Offset = MFI->getObjectOffset(I->getFrameIdx());
unsigned Reg = I->getReg();		unsigned Reg = I->getReg();
Offset = MaxOffset - Offset + saveAreaOffset;

// Don't output a new machine move if we're re-saving the frame
// pointer. This happens when the PrologEpilogInserter has inserted an extra
// "PUSH" of the frame pointer -- the "emitPrologue" method automatically
// generates one when frame pointers are used. If we generate a "machine
// move" for this extra "PUSH", the linker will lose track of the fact that
// the frame pointer should have the value of the first "PUSH" when it's
// trying to unwind.
//
// FIXME: This looks inelegant. It's possibly correct, but it's covering up
// another bug. I.e., one where we generate a prolog like this:
//
// pushl %ebp
// movl %esp, %ebp
// pushl %ebp
// pushl %esi
// ...
//
// The immediate re-push of EBP is unnecessary. At the least, it's an
// optimization bug. EBP can be used as a scratch register in certain
// cases, but probably not when we have a frame pointer.
if (HasFP && FramePtr == Reg)
continue;

unsigned DwarfReg = MRI->getDwarfRegNum(Reg, true);		unsigned DwarfReg = MRI->getDwarfRegNum(Reg, true);
unsigned CFIIndex =		unsigned CFIIndex =
MMI.addFrameInst(MCCFIInstruction::createOffset(nullptr, DwarfReg,		MMI.addFrameInst(MCCFIInstruction::createOffset(nullptr, DwarfReg,
Offset));		Offset));
BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
}		}
Show All 15 Lines	static bool usesTheStack(const MachineFunction &MF) {

return false;		return false;
}		}

/// emitPrologue - Push callee-saved registers onto the stack, which		/// emitPrologue - Push callee-saved registers onto the stack, which
/// automatically adjust the stack pointer. Adjust the stack pointer to allocate		/// automatically adjust the stack pointer. Adjust the stack pointer to allocate
/// space for local variables. Also emit labels used by the exception handler to		/// space for local variables. Also emit labels used by the exception handler to
/// generate the exception handling frames.		/// generate the exception handling frames.

		/*
		Here's a gist of what gets emitted:

		; Establish frame pointer, if needed
		[if needs FP]
		push %rbp
		.cfi_def_cfa_offset 16
		.cfi_offset %rbp, -16
		.seh_pushreg %rpb
		mov %rsp, %rbp
		.cfi_def_cfa_register %rbp

		; Spill general-purpose registers
		[for all callee-saved GPRs]
		pushq %<reg>
		[if not needs FP]
		.cfi_def_cfa_offset (offset from RETADDR)
		.seh_pushreg %<reg>

		; If the required stack alignment > default stack alignment
		; rsp needs to be re-aligned. This creates a "re-alignment gap"
		; of unknown size in the stack frame.
		[if stack needs re-alignment]
		and $MASK, %rsp

		; Allocate space for locals
		[if target is Windows and allocated space > 4096 bytes]
		; Windows needs special care for allocations larger
		; than one page.
		mov $NNN, %rax
		call ___chkstk_ms/___chkstk
		sub %rax, %rsp
		[else]
		sub $NNN, %rsp

		[if needs FP]
		.seh_stackalloc (size of XMM spill slots)
		.seh_setframe %rbp, SEHFrameOffset ; = size of all spill slots
		[else]
		.seh_stackalloc NNN

		; Spill XMMs
		; Note, that while only Windows 64 ABI specifies XMMs as callee-preserved,
		; they may get spilled on any platform, if the current function
		; calls @llvm.eh.unwind.init
		[if needs FP]
		[for all callee-saved XMM registers]
		movaps %<xmm reg>, -MMM(%rbp)
		[for all callee-saved XMM registers]
		.seh_savexmm %<xmm reg>, (-MMM + SEHFrameOffset)
		; i.e. the offset relative to (%rbp - SEHFrameOffset)
		[else]
		[for all callee-saved XMM registers]
		movaps %<xmm reg>, KKK(%rsp)
		[for all callee-saved XMM registers]
		.seh_savexmm %<xmm reg>, KKK

		.seh_endprologue

		[if needs base pointer]
		mov %rsp, %rbx

		; Emit CFI info
		[if needs FP]
		[for all callee-saved registers]
		.cfi_offset %<reg>, (offset from %rbp)
		[else]
		.cfi_def_cfa_offset (offset from RETADDR)
		[for all callee-saved registers]
		.cfi_offset %<reg>, (offset from %rsp)

		Notes:
		- .seh directives are emitted only for Windows 64 ABI
		- .cfi directives are emitted for all other ABIs
		- for 32-bit code, substitute %e?? registers for %r??
		*/

void X86FrameLowering::emitPrologue(MachineFunction &MF) const {		void X86FrameLowering::emitPrologue(MachineFunction &MF) const {
MachineBasicBlock &MBB = MF.front(); // Prologue goes in entry BB.		MachineBasicBlock &MBB = MF.front(); // Prologue goes in entry BB.
MachineBasicBlock::iterator MBBI = MBB.begin();		MachineBasicBlock::iterator MBBI = MBB.begin();
MachineFrameInfo *MFI = MF.getFrameInfo();		MachineFrameInfo *MFI = MF.getFrameInfo();
const Function *Fn = MF.getFunction();		const Function *Fn = MF.getFunction();
const X86RegisterInfo *RegInfo =		const X86RegisterInfo *RegInfo =
static_cast<const X86RegisterInfo *>(MF.getTarget().getRegisterInfo());		static_cast<const X86RegisterInfo *>(MF.getTarget().getRegisterInfo());
const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();		const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();
MachineModuleInfo &MMI = MF.getMMI();		MachineModuleInfo &MMI = MF.getMMI();
X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();		X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();
bool needsFrameMoves = MMI.hasDebugInfo() \|\|
Fn->needsUnwindTableEntry();
uint64_t MaxAlign = MFI->getMaxAlignment(); // Desired stack alignment.		uint64_t MaxAlign = MFI->getMaxAlignment(); // Desired stack alignment.
uint64_t StackSize = MFI->getStackSize(); // Number of bytes to allocate.		uint64_t StackSize = MFI->getStackSize(); // Number of bytes to allocate.
bool HasFP = hasFP(MF);		bool HasFP = hasFP(MF);
const X86Subtarget &STI = MF.getTarget().getSubtarget<X86Subtarget>();		const X86Subtarget &STI = MF.getTarget().getSubtarget<X86Subtarget>();
bool Is64Bit = STI.is64Bit();		bool Is64Bit = STI.is64Bit();
bool IsLP64 = STI.isTarget64BitLP64();		bool IsLP64 = STI.isTarget64BitLP64();
bool IsWin64 = STI.isTargetWin64();		bool IsWin64 = STI.isTargetWin64();
		bool IsSEH =
		MF.getTarget().getMCAsmInfo()->getExceptionHandlingType() ==
		ExceptionHandling::Win64; // Not necessarily synonymous with IsWin64.
		bool NeedsWin64SEH = IsSEH && Fn->needsUnwindTableEntry();
		bool NeedsDwarfCFI =
		!IsSEH && (MMI.hasDebugInfo() \|\| Fn->needsUnwindTableEntry());
bool UseLEA = STI.useLeaForSP();		bool UseLEA = STI.useLeaForSP();
unsigned StackAlign = getStackAlignment();		unsigned StackAlign = getStackAlignment();
unsigned SlotSize = RegInfo->getSlotSize();		unsigned SlotSize = RegInfo->getSlotSize();
unsigned FramePtr = RegInfo->getFrameRegister(MF);		unsigned FramePtr = RegInfo->getFrameRegister(MF);
unsigned StackPtr = RegInfo->getStackRegister();		unsigned StackPtr = RegInfo->getStackRegister();
unsigned BasePtr = RegInfo->getBaseRegister();		unsigned BasePtr = RegInfo->getBaseRegister();
DebugLoc DL;		DebugLoc DL;

▲ Show 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	if (HasFP) {
// Update the frame offset adjustment.		// Update the frame offset adjustment.
MFI->setOffsetAdjustment(-NumBytes);		MFI->setOffsetAdjustment(-NumBytes);

// Save EBP/RBP into the appropriate stack slot.		// Save EBP/RBP into the appropriate stack slot.
BuildMI(MBB, MBBI, DL, TII.get(Is64Bit ? X86::PUSH64r : X86::PUSH32r))		BuildMI(MBB, MBBI, DL, TII.get(Is64Bit ? X86::PUSH64r : X86::PUSH32r))
.addReg(FramePtr, RegState::Kill)		.addReg(FramePtr, RegState::Kill)
.setMIFlag(MachineInstr::FrameSetup);		.setMIFlag(MachineInstr::FrameSetup);

if (needsFrameMoves) {		if (NeedsDwarfCFI) {
// Mark the place where EBP/RBP was saved.		// Mark the place where EBP/RBP was saved.
// Define the current CFA rule to use the provided offset.		// Define the current CFA rule to use the provided offset.
assert(StackSize);		assert(StackSize);
unsigned CFIIndex = MMI.addFrameInst(		unsigned CFIIndex = MMI.addFrameInst(
MCCFIInstruction::createDefCfaOffset(nullptr, 2 * stackGrowth));		MCCFIInstruction::createDefCfaOffset(nullptr, 2 * stackGrowth));
BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);

// Change the rule for the FramePtr to be an "offset" rule.		// Change the rule for the FramePtr to be an "offset" rule.
unsigned DwarfFramePtr = RegInfo->getDwarfRegNum(FramePtr, true);		unsigned DwarfFramePtr = RegInfo->getDwarfRegNum(FramePtr, true);
CFIIndex = MMI.addFrameInst(		CFIIndex = MMI.addFrameInst(
MCCFIInstruction::createOffset(nullptr,		MCCFIInstruction::createOffset(nullptr,
DwarfFramePtr, 2 * stackGrowth));		DwarfFramePtr, 2 * stackGrowth));
BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
}		}

		if (NeedsWin64SEH) {
		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_PushReg))
		.addImm(FramePtr)
		.setMIFlag(MachineInstr::FrameSetup);
		}

// Update EBP with the new base value.		// Update EBP with the new base value.
BuildMI(MBB, MBBI, DL,		BuildMI(MBB, MBBI, DL,
TII.get(Is64Bit ? X86::MOV64rr : X86::MOV32rr), FramePtr)		TII.get(Is64Bit ? X86::MOV64rr : X86::MOV32rr), FramePtr)
.addReg(StackPtr)		.addReg(StackPtr)
.setMIFlag(MachineInstr::FrameSetup);		.setMIFlag(MachineInstr::FrameSetup);

if (needsFrameMoves) {		if (NeedsDwarfCFI) {
// Mark effective beginning of when frame pointer becomes valid.		// Mark effective beginning of when frame pointer becomes valid.
// Define the current CFA to use the EBP/RBP register.		// Define the current CFA to use the EBP/RBP register.
unsigned DwarfFramePtr = RegInfo->getDwarfRegNum(FramePtr, true);		unsigned DwarfFramePtr = RegInfo->getDwarfRegNum(FramePtr, true);
unsigned CFIIndex = MMI.addFrameInst(		unsigned CFIIndex = MMI.addFrameInst(
MCCFIInstruction::createDefCfaRegister(nullptr, DwarfFramePtr));		MCCFIInstruction::createDefCfaRegister(nullptr, DwarfFramePtr));
BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
}		}

// Mark the FramePtr as live-in in every block except the entry.		// Mark the FramePtr as live-in in every block.
for (MachineFunction::iterator I = std::next(MF.begin()), E = MF.end();		for (MachineFunction::iterator I = MF.begin(), E = MF.end(); I != E; ++I)
I != E; ++I)
I->addLiveIn(FramePtr);		I->addLiveIn(FramePtr);
} else {		} else {
NumBytes = StackSize - X86FI->getCalleeSavedFrameSize();		NumBytes = StackSize - X86FI->getCalleeSavedFrameSize();
}		}

// Skip the callee-saved push instructions.		// Skip the callee-saved push instructions.
bool PushedRegs = false;		bool PushedRegs = false;
int StackOffset = 2 * stackGrowth;		int StackOffset = 2 * stackGrowth;

while (MBBI != MBB.end() &&		while (MBBI != MBB.end() &&
(MBBI->getOpcode() == X86::PUSH32r \|\|		(MBBI->getOpcode() == X86::PUSH32r \|\|
MBBI->getOpcode() == X86::PUSH64r)) {		MBBI->getOpcode() == X86::PUSH64r)) {
PushedRegs = true;		PushedRegs = true;
MBBI->setFlag(MachineInstr::FrameSetup);		unsigned Reg = MBBI->getOperand(0).getReg();
++MBBI;		++MBBI;

if (!HasFP && needsFrameMoves) {		if (!HasFP && NeedsDwarfCFI) {
// Mark callee-saved push instruction.		// Mark callee-saved push instruction.
// Define the current CFA rule to use the provided offset.		// Define the current CFA rule to use the provided offset.
assert(StackSize);		assert(StackSize);
unsigned CFIIndex = MMI.addFrameInst(		unsigned CFIIndex = MMI.addFrameInst(
MCCFIInstruction::createDefCfaOffset(nullptr, StackOffset));		MCCFIInstruction::createDefCfaOffset(nullptr, StackOffset));
BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
StackOffset += stackGrowth;		StackOffset += stackGrowth;
}		}

		if (NeedsWin64SEH) {
		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_PushReg)).addImm(Reg).setMIFlag(
		MachineInstr::FrameSetup);
		}
}		}

// Realign stack after we pushed callee-saved registers (so that we'll be		// Realign stack after we pushed callee-saved registers (so that we'll be
// able to calculate their offsets from the frame pointer).		// able to calculate their offsets from the frame pointer).

// NOTE: We push the registers before realigning the stack, so
// vector callee-saved (xmm) registers may be saved w/o proper
// alignment in this way. However, currently these regs are saved in
// stack slots (see X86FrameLowering::spillCalleeSavedRegisters()), so
// this shouldn't be a problem.
if (RegInfo->needsStackRealignment(MF)) {		if (RegInfo->needsStackRealignment(MF)) {
assert(HasFP && "There should be a frame pointer if stack is realigned.");		assert(HasFP && "There should be a frame pointer if stack is realigned.");
MachineInstr *MI =		MachineInstr *MI =
BuildMI(MBB, MBBI, DL,		BuildMI(MBB, MBBI, DL,
TII.get(Is64Bit ? X86::AND64ri32 : X86::AND32ri), StackPtr)		TII.get(Is64Bit ? X86::AND64ri32 : X86::AND32ri), StackPtr)
.addReg(StackPtr)		.addReg(StackPtr)
.addImm(-MaxAlign)		.addImm(-MaxAlign)
.setMIFlag(MachineInstr::FrameSetup);		.setMIFlag(MachineInstr::FrameSetup);
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	if (NumBytes >= 4096 && STI.isOSWindows() && !STI.isTargetMacho()) {
if (isEAXAlive) {		if (isEAXAlive) {
// Restore EAX		// Restore EAX
MachineInstr *MI = addRegOffset(BuildMI(MF, DL, TII.get(X86::MOV32rm),		MachineInstr *MI = addRegOffset(BuildMI(MF, DL, TII.get(X86::MOV32rm),
X86::EAX),		X86::EAX),
StackPtr, false, NumBytes - 4);		StackPtr, false, NumBytes - 4);
MI->setFlag(MachineInstr::FrameSetup);		MI->setFlag(MachineInstr::FrameSetup);
MBB.insert(MBBI, MI);		MBB.insert(MBBI, MI);
}		}
} else if (NumBytes)		} else if (NumBytes) {
emitSPUpdate(MBB, MBBI, StackPtr, -(int64_t)NumBytes, Is64Bit, IsLP64,		emitSPUpdate(MBB, MBBI, StackPtr, -(int64_t)NumBytes, Is64Bit, IsLP64,
UseLEA, TII, *RegInfo);		UseLEA, TII, *RegInfo);
		}

		int SEHFrameOffset = 0;
		if (NeedsWin64SEH) {
		if (HasFP) {
		// We need to set frame base offset low enough such that all saved
		// register offsets would be positive relative to it, but we can't
		// just use NumBytes, because .seh_setframe offset must be <=240.
		// So we pretend to have only allocated enough space to spill the
		// non-volatile registers.
		// We don't care about the rest of stack allocation, because unwinder
		// will restore SP to (BP - SEHFrameOffset)
		for (const CalleeSavedInfo &Info : MFI->getCalleeSavedInfo()) {
		int offset = MFI->getObjectOffset(Info.getFrameIdx());
		SEHFrameOffset = std::max(SEHFrameOffset, abs(offset));
		}
		SEHFrameOffset += SEHFrameOffset % 16; // ensure alignmant

		// This only needs to account for XMM spill slots, GPR slots
		// are covered by .seh_pushreg's emitted above.
		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_StackAlloc))
		.addImm(SEHFrameOffset - X86FI->getCalleeSavedFrameSize())
		.setMIFlag(MachineInstr::FrameSetup);

		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_SetFrame))
		.addImm(FramePtr)
		.addImm(SEHFrameOffset)
		.setMIFlag(MachineInstr::FrameSetup);
		} else {
		// SP will be the base register for restoring XMMs
		if (NumBytes) {
		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_StackAlloc))
		.addImm(NumBytes)
		.setMIFlag(MachineInstr::FrameSetup);
		}
		}
		}

		// Skip the rest of register spilling code
		while (MBBI != MBB.end() && MBBI->getFlag(MachineInstr::FrameSetup))
		++MBBI;

		// Emit SEH info for non-GPRs
		if (NeedsWin64SEH) {
		for (const CalleeSavedInfo &Info : MFI->getCalleeSavedInfo()) {
		unsigned Reg = Info.getReg();
		if (X86::GR64RegClass.contains(Reg) \|\| X86::GR32RegClass.contains(Reg))
		continue;
		assert(X86::FR64RegClass.contains(Reg) && "Unexpected register class");

		int Offset = getFrameIndexOffset(MF, Info.getFrameIdx());
		Offset += SEHFrameOffset;

		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_SaveXMM))
		.addImm(Reg)
		.addImm(Offset)
		.setMIFlag(MachineInstr::FrameSetup);
		}

		BuildMI(MBB, MBBI, DL, TII.get(X86::SEH_EndPrologue))
		.setMIFlag(MachineInstr::FrameSetup);
		}

// If we need a base pointer, set it up here. It's whatever the value		// If we need a base pointer, set it up here. It's whatever the value
// of the stack pointer is at this point. Any variable size objects		// of the stack pointer is at this point. Any variable size objects
// will be allocated after this, so we can still use the base pointer		// will be allocated after this, so we can still use the base pointer
// to reference locals.		// to reference locals.
if (RegInfo->hasBasePointer(MF)) {		if (RegInfo->hasBasePointer(MF)) {
// Update the frame pointer with the current stack pointer.		// Update the base pointer with the current stack pointer.
unsigned Opc = Is64Bit ? X86::MOV64rr : X86::MOV32rr;		unsigned Opc = Is64Bit ? X86::MOV64rr : X86::MOV32rr;
BuildMI(MBB, MBBI, DL, TII.get(Opc), BasePtr)		BuildMI(MBB, MBBI, DL, TII.get(Opc), BasePtr)
.addReg(StackPtr)		.addReg(StackPtr)
.setMIFlag(MachineInstr::FrameSetup);		.setMIFlag(MachineInstr::FrameSetup);
}		}

if (( (!HasFP && NumBytes) \|\| PushedRegs) && needsFrameMoves) {		if (((!HasFP && NumBytes) \|\| PushedRegs) && NeedsDwarfCFI) {
// Mark end of stack pointer adjustment.		// Mark end of stack pointer adjustment.
if (!HasFP && NumBytes) {		if (!HasFP && NumBytes) {
// Define the current CFA rule to use the provided offset.		// Define the current CFA rule to use the provided offset.
assert(StackSize);		assert(StackSize);
unsigned CFIIndex = MMI.addFrameInst(		unsigned CFIIndex = MMI.addFrameInst(
MCCFIInstruction::createDefCfaOffset(nullptr,		MCCFIInstruction::createDefCfaOffset(nullptr,
-StackSize + stackGrowth));		-StackSize + stackGrowth));

BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, MBBI, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
}		}

// Emit DWARF info specifying the offsets of the callee-saved registers.		// Emit DWARF info specifying the offsets of the callee-saved registers.
if (PushedRegs)		if (PushedRegs)
emitCalleeSavedFrameMoves(MBB, MBBI, DL, HasFP ? FramePtr : StackPtr);		emitCalleeSavedFrameMoves(MBB, MBBI, DL);
}		}
}		}

void X86FrameLowering::emitEpilogue(MachineFunction &MF,		void X86FrameLowering::emitEpilogue(MachineFunction &MF,
MachineBasicBlock &MBB) const {		MachineBasicBlock &MBB) const {
const MachineFrameInfo *MFI = MF.getFrameInfo();		const MachineFrameInfo *MFI = MF.getFrameInfo();
X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();		X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();
const X86RegisterInfo *RegInfo =		const X86RegisterInfo *RegInfo =
▲ Show 20 Lines • Show All 243 Lines • ▼ Show 20 Lines	if (RegInfo->hasBasePointer(MF))
FrameReg = RegInfo->getBaseRegister();		FrameReg = RegInfo->getBaseRegister();
else if (RegInfo->needsStackRealignment(MF))		else if (RegInfo->needsStackRealignment(MF))
FrameReg = RegInfo->getStackRegister();		FrameReg = RegInfo->getStackRegister();
else		else
FrameReg = RegInfo->getFrameRegister(MF);		FrameReg = RegInfo->getFrameRegister(MF);
return getFrameIndexOffset(MF, FI);		return getFrameIndexOffset(MF, FI);
}		}

bool X86FrameLowering::spillCalleeSavedRegisters(MachineBasicBlock &MBB,		bool X86FrameLowering::assignCalleeSavedSpillSlots(
MachineBasicBlock::iterator MI,		MachineFunction &MF, const TargetRegisterInfo *TRI,
		std::vector<CalleeSavedInfo> &CSI) const {
		MachineFrameInfo *MFI = MF.getFrameInfo();
		const X86RegisterInfo *RegInfo =
		static_cast<const X86RegisterInfo *>(MF.getTarget().getRegisterInfo());
		unsigned SlotSize = RegInfo->getSlotSize();
		X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();

		unsigned CalleeSavedFrameSize = 0;
		int SpillSlotOffset = getOffsetOfLocalArea() + X86FI->getTCReturnAddrDelta();

		if (hasFP(MF)) {
		// emitPrologue always spills frame register the first thing.
		SpillSlotOffset -= SlotSize;
		MFI->CreateFixedSpillStackObject(SlotSize, SpillSlotOffset);

		// Since emitPrologue and emitEpilogue will handle spilling and restoring of
		// the frame register, we can delete it from CSI list and not have to worry
		// about avoiding it later.
		unsigned FPReg = RegInfo->getFrameRegister(MF);
		for (unsigned i = 0; i < CSI.size(); ++i) {
		if (CSI[i].getReg() == FPReg) {
		CSI.erase(CSI.begin() + i);
		break;
		}
		}
		}

		// Assign slots for GPRs. It increases frame size.
		for (unsigned i = CSI.size(); i != 0; --i) {
		unsigned Reg = CSI[i - 1].getReg();

		if (!X86::GR64RegClass.contains(Reg) && !X86::GR32RegClass.contains(Reg))
		continue;

		SpillSlotOffset -= SlotSize;
		CalleeSavedFrameSize += SlotSize;

		int SlotIndex = MFI->CreateFixedSpillStackObject(SlotSize, SpillSlotOffset);
		CSI[i - 1].setFrameIdx(SlotIndex);
		}

		X86FI->setCalleeSavedFrameSize(CalleeSavedFrameSize);

		// Assign slots for XMMs.
		for (unsigned i = CSI.size(); i != 0; --i) {
		unsigned Reg = CSI[i - 1].getReg();
		if (X86::GR64RegClass.contains(Reg) \|\| X86::GR32RegClass.contains(Reg))
		continue;

		const TargetRegisterClass *RC = RegInfo->getMinimalPhysRegClass(Reg);
		// ensure alignment
		SpillSlotOffset -= abs(SpillSlotOffset) % RC->getAlignment();
		// spill into slot
		SpillSlotOffset -= RC->getSize();
		int SlotIndex =
		MFI->CreateFixedSpillStackObject(RC->getSize(), SpillSlotOffset);
		CSI[i - 1].setFrameIdx(SlotIndex);
		MFI->ensureMaxAlignment(RC->getAlignment());
		}

		return true;
		}

		bool X86FrameLowering::spillCalleeSavedRegisters(
		MachineBasicBlock &MBB, MachineBasicBlock::iterator MI,
const std::vector<CalleeSavedInfo> &CSI,		const std::vector<CalleeSavedInfo> &CSI,
const TargetRegisterInfo *TRI) const {		const TargetRegisterInfo *TRI) const {
if (CSI.empty())
return false;

DebugLoc DL = MBB.findDebugLoc(MI);		DebugLoc DL = MBB.findDebugLoc(MI);

MachineFunction &MF = *MBB.getParent();		MachineFunction &MF = *MBB.getParent();
const X86RegisterInfo *RegInfo =
static_cast<const X86RegisterInfo *>(MF.getTarget().getRegisterInfo());
unsigned SlotSize = RegInfo->getSlotSize();
unsigned FPReg = TRI->getFrameRegister(MF);
unsigned CalleeFrameSize = 0;

const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();		const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();
X86MachineFunctionInfo *X86FI = MF.getInfo<X86MachineFunctionInfo>();
const X86Subtarget &STI = MF.getTarget().getSubtarget<X86Subtarget>();		const X86Subtarget &STI = MF.getTarget().getSubtarget<X86Subtarget>();

// Push GPRs. It increases frame size.		// Push GPRs. It increases frame size.
unsigned Opc = STI.is64Bit() ? X86::PUSH64r : X86::PUSH32r;		unsigned Opc = STI.is64Bit() ? X86::PUSH64r : X86::PUSH32r;
for (unsigned i = CSI.size(); i != 0; --i) {		for (unsigned i = CSI.size(); i != 0; --i) {
unsigned Reg = CSI[i-1].getReg();		unsigned Reg = CSI[i-1].getReg();

if (!X86::GR64RegClass.contains(Reg) &&		if (!X86::GR64RegClass.contains(Reg) &&
!X86::GR32RegClass.contains(Reg))		!X86::GR32RegClass.contains(Reg))
continue;		continue;
// Add the callee-saved register as live-in. It's killed at the spill.		// Add the callee-saved register as live-in. It's killed at the spill.
MBB.addLiveIn(Reg);		MBB.addLiveIn(Reg);
if (Reg == FPReg)
// X86RegisterInfo::emitPrologue will handle spilling of frame register.
continue;
CalleeFrameSize += SlotSize;
BuildMI(MBB, MI, DL, TII.get(Opc)).addReg(Reg, RegState::Kill)		BuildMI(MBB, MI, DL, TII.get(Opc)).addReg(Reg, RegState::Kill)
.setMIFlag(MachineInstr::FrameSetup);		.setMIFlag(MachineInstr::FrameSetup);
}		}

X86FI->setCalleeSavedFrameSize(CalleeFrameSize);

// Make XMM regs spilled. X86 does not have ability of push/pop XMM.		// Make XMM regs spilled. X86 does not have ability of push/pop XMM.
// It can be done by spilling XMMs to stack frame.		// It can be done by spilling XMMs to stack frame.
// Note that only Win64 ABI might spill XMMs.
for (unsigned i = CSI.size(); i != 0; --i) {		for (unsigned i = CSI.size(); i != 0; --i) {
unsigned Reg = CSI[i-1].getReg();		unsigned Reg = CSI[i-1].getReg();
if (X86::GR64RegClass.contains(Reg) \|\|		if (X86::GR64RegClass.contains(Reg) \|\|
X86::GR32RegClass.contains(Reg))		X86::GR32RegClass.contains(Reg))
continue;		continue;
// Add the callee-saved register as live-in. It's killed at the spill.		// Add the callee-saved register as live-in. It's killed at the spill.
MBB.addLiveIn(Reg);		MBB.addLiveIn(Reg);
const TargetRegisterClass *RC = TRI->getMinimalPhysRegClass(Reg);		const TargetRegisterClass *RC = TRI->getMinimalPhysRegClass(Reg);

TII.storeRegToStackSlot(MBB, MI, Reg, true, CSI[i-1].getFrameIdx(),		TII.storeRegToStackSlot(MBB, MI, Reg, true, CSI[i-1].getFrameIdx(),
RC, TRI);		RC, TRI);
		--MI;
		MI->setFlag(MachineInstr::FrameSetup);
		++MI;
}		}

return true;		return true;
}		}

bool X86FrameLowering::restoreCalleeSavedRegisters(MachineBasicBlock &MBB,		bool X86FrameLowering::restoreCalleeSavedRegisters(MachineBasicBlock &MBB,
MachineBasicBlock::iterator MI,		MachineBasicBlock::iterator MI,
const std::vector<CalleeSavedInfo> &CSI,		const std::vector<CalleeSavedInfo> &CSI,
const TargetRegisterInfo *TRI) const {		const TargetRegisterInfo *TRI) const {
if (CSI.empty())		if (CSI.empty())
return false;		return false;

DebugLoc DL = MBB.findDebugLoc(MI);		DebugLoc DL = MBB.findDebugLoc(MI);

MachineFunction &MF = *MBB.getParent();		MachineFunction &MF = *MBB.getParent();
const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();		const TargetInstrInfo &TII = *MF.getTarget().getInstrInfo();
const X86Subtarget &STI = MF.getTarget().getSubtarget<X86Subtarget>();		const X86Subtarget &STI = MF.getTarget().getSubtarget<X86Subtarget>();

// Reload XMMs from stack frame.		// Reload XMMs from stack frame.
for (unsigned i = 0, e = CSI.size(); i != e; ++i) {		for (unsigned i = 0, e = CSI.size(); i != e; ++i) {
unsigned Reg = CSI[i].getReg();		unsigned Reg = CSI[i].getReg();
if (X86::GR64RegClass.contains(Reg) \|\|		if (X86::GR64RegClass.contains(Reg) \|\|
X86::GR32RegClass.contains(Reg))		X86::GR32RegClass.contains(Reg))
continue;		continue;

const TargetRegisterClass *RC = TRI->getMinimalPhysRegClass(Reg);		const TargetRegisterClass *RC = TRI->getMinimalPhysRegClass(Reg);
TII.loadRegFromStackSlot(MBB, MI, Reg, CSI[i].getFrameIdx(),		TII.loadRegFromStackSlot(MBB, MI, Reg, CSI[i].getFrameIdx(), RC, TRI);
RC, TRI);
}		}

// POP GPRs.		// POP GPRs.
unsigned FPReg = TRI->getFrameRegister(MF);
unsigned Opc = STI.is64Bit() ? X86::POP64r : X86::POP32r;		unsigned Opc = STI.is64Bit() ? X86::POP64r : X86::POP32r;
for (unsigned i = 0, e = CSI.size(); i != e; ++i) {		for (unsigned i = 0, e = CSI.size(); i != e; ++i) {
unsigned Reg = CSI[i].getReg();		unsigned Reg = CSI[i].getReg();
if (!X86::GR64RegClass.contains(Reg) &&		if (!X86::GR64RegClass.contains(Reg) &&
!X86::GR32RegClass.contains(Reg))		!X86::GR32RegClass.contains(Reg))
continue;		continue;
if (Reg == FPReg)
// X86RegisterInfo::emitEpilogue will handle restoring of frame register.
continue;
BuildMI(MBB, MI, DL, TII.get(Opc), Reg);		BuildMI(MBB, MI, DL, TII.get(Opc), Reg);
}		}
return true;		return true;
}		}

void		void
X86FrameLowering::processFunctionBeforeCalleeSavedScan(MachineFunction &MF,		X86FrameLowering::processFunctionBeforeCalleeSavedScan(MachineFunction &MF,
RegScavenger *RS) const {		RegScavenger *RS) const {
Show All 14 Lines	if (TailCallReturnAddrDelta < 0) {
// RETADDR area		// RETADDR area
// ...		// ...
// }		// }
// [EBP]		// [EBP]
MFI->CreateFixedObject(-TailCallReturnAddrDelta,		MFI->CreateFixedObject(-TailCallReturnAddrDelta,
TailCallReturnAddrDelta - SlotSize, true);		TailCallReturnAddrDelta - SlotSize, true);
}		}

if (hasFP(MF)) {
assert((TailCallReturnAddrDelta <= 0) &&
"The Delta should always be zero or negative");
const TargetFrameLowering &TFI = *MF.getTarget().getFrameLowering();

// Create a frame entry for the EBP register that must be saved.
int FrameIdx = MFI->CreateFixedObject(SlotSize,
-(int)SlotSize +
TFI.getOffsetOfLocalArea() +
TailCallReturnAddrDelta,
true);
assert(FrameIdx == MFI->getObjectIndexBegin() &&
"Slot for EBP register must be last in order to be found!");
(void)FrameIdx;
}

// Spill the BasePtr if it's used.		// Spill the BasePtr if it's used.
if (RegInfo->hasBasePointer(MF))		if (RegInfo->hasBasePointer(MF))
MF.getRegInfo().setPhysRegUsed(RegInfo->getBaseRegister());		MF.getRegInfo().setPhysRegUsed(RegInfo->getBaseRegister());
}		}

static bool		static bool
HasNestArgument(const MachineFunction *MF) {		HasNestArgument(const MachineFunction *MF) {
const Function *F = MF->getFunction();		const Function *F = MF->getFunction();
▲ Show 20 Lines • Show All 472 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 599 Lines • ▼ Show 20 Lines	if (!Subtarget->is64Bit()) {
setOperationAction(ISD::ATOMIC_LOAD_UMIN, MVT::i64, Custom);		setOperationAction(ISD::ATOMIC_LOAD_UMIN, MVT::i64, Custom);
}		}

if (Subtarget->hasCmpxchg16b()) {		if (Subtarget->hasCmpxchg16b()) {
setOperationAction(ISD::ATOMIC_CMP_SWAP_WITH_SUCCESS, MVT::i128, Custom);		setOperationAction(ISD::ATOMIC_CMP_SWAP_WITH_SUCCESS, MVT::i128, Custom);
}		}

// FIXME - use subtarget debug flags		// FIXME - use subtarget debug flags
if (!Subtarget->isTargetDarwin() &&		if (!Subtarget->isTargetDarwin() && !Subtarget->isTargetELF() &&
!Subtarget->isTargetELF() &&		!Subtarget->isTargetCygMing() && !Subtarget->isTargetWin64()) {
!Subtarget->isTargetCygMing()) {
setOperationAction(ISD::EH_LABEL, MVT::Other, Expand);		setOperationAction(ISD::EH_LABEL, MVT::Other, Expand);
}		}

if (Subtarget->is64Bit()) {		if (Subtarget->is64Bit()) {
setExceptionPointerRegister(X86::RAX);		setExceptionPointerRegister(X86::RAX);
setExceptionSelectorRegister(X86::RDX);		setExceptionSelectorRegister(X86::RDX);
} else {		} else {
setExceptionPointerRegister(X86::EAX);		setExceptionPointerRegister(X86::EAX);
▲ Show 20 Lines • Show All 21,259 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86InstrCompiler.td

Show First 20 Lines • Show All 104 Lines • ▼ Show 20 Lines

let Defs = [EAX, ESP, EFLAGS], Uses = [ESP] in		let Defs = [EAX, ESP, EFLAGS], Uses = [ESP] in
def WIN_ALLOCA : I<0, Pseudo, (outs), (ins),		def WIN_ALLOCA : I<0, Pseudo, (outs), (ins),
"# dynamic stack allocation",		"# dynamic stack allocation",
[(X86WinAlloca)]>;		[(X86WinAlloca)]>;

// When using segmented stacks these are lowered into instructions which first		// When using segmented stacks these are lowered into instructions which first
// check if the current stacklet has enough free memory. If it does, memory is		// check if the current stacklet has enough free memory. If it does, memory is
// allocated by bumping the stack pointer. Otherwise memory is allocated from		// allocated by bumping the stack pointer. Otherwise memory is allocated from
// the heap.		// the heap.

let Defs = [EAX, ESP, EFLAGS], Uses = [ESP] in		let Defs = [EAX, ESP, EFLAGS], Uses = [ESP] in
def SEG_ALLOCA_32 : I<0, Pseudo, (outs GR32:$dst), (ins GR32:$size),		def SEG_ALLOCA_32 : I<0, Pseudo, (outs GR32:$dst), (ins GR32:$size),
"# variable sized alloca for segmented stacks",		"# variable sized alloca for segmented stacks",
[(set GR32:$dst,		[(set GR32:$dst,
(X86SegAlloca GR32:$size))]>,		(X86SegAlloca GR32:$size))]>,
Requires<[Not64BitMode]>;		Requires<[Not64BitMode]>;
▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines
} // SchedRW		} // SchedRW

let isBranch = 1, isTerminator = 1, isCodeGenOnly = 1 in {		let isBranch = 1, isTerminator = 1, isCodeGenOnly = 1 in {
def EH_SjLj_Setup : I<0, Pseudo, (outs), (ins brtarget:$dst),		def EH_SjLj_Setup : I<0, Pseudo, (outs), (ins brtarget:$dst),
"#EH_SjLj_Setup\t$dst", []>;		"#EH_SjLj_Setup\t$dst", []>;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
		// Pseudo instructions used by unwind info.
		//
		let isPseudo = 1 in {
		def SEH_PushReg : I<0, Pseudo, (outs), (ins i32imm:$reg),
		"#SEH_PushReg $reg", []>;
		def SEH_SaveReg : I<0, Pseudo, (outs), (ins i32imm:$reg, i32imm:$dst),
		"#SEH_SaveReg $reg, $dst", []>;
		def SEH_SaveXMM : I<0, Pseudo, (outs), (ins i32imm:$reg, i32imm:$dst),
		"#SEH_SaveXMM $reg, $dst", []>;
		def SEH_StackAlloc : I<0, Pseudo, (outs), (ins i32imm:$size),
		"#SEH_StackAlloc $size", []>;
		def SEH_SetFrame : I<0, Pseudo, (outs), (ins i32imm:$reg, i32imm:$offset),
		"#SEH_SetFrame $reg, $offset", []>;
		def SEH_PushFrame : I<0, Pseudo, (outs), (ins i1imm:$mode),
		"#SEH_PushFrame $mode", []>;
		def SEH_EndPrologue : I<0, Pseudo, (outs), (ins),
		"#SEH_EndPrologue", []>;
		}

		//===----------------------------------------------------------------------===//
// Pseudo instructions used by segmented stacks.		// Pseudo instructions used by segmented stacks.
//		//

// This is lowered into a RET instruction by MCInstLower. We need		// This is lowered into a RET instruction by MCInstLower. We need
// this so that we don't have to have a MachineBasicBlock which ends		// this so that we don't have to have a MachineBasicBlock which ends
// with a RET and also has successors.		// with a RET and also has successors.
let isPseudo = 1 in {		let isPseudo = 1 in {
def MORESTACK_RET: I<0, Pseudo, (outs), (ins),		def MORESTACK_RET: I<0, Pseudo, (outs), (ins),
▲ Show 20 Lines • Show All 158 Lines • ▼ Show 20 Lines	let Defs = [RCX,RDI], isCodeGenOnly = 1 in {
let Uses = [AX,RCX,RDI] in		let Uses = [AX,RCX,RDI] in
def REP_STOSW_64 : I<0xAB, RawFrm, (outs), (ins), "{rep;stosw\|rep stosw}",		def REP_STOSW_64 : I<0xAB, RawFrm, (outs), (ins), "{rep;stosw\|rep stosw}",
[(X86rep_stos i16)], IIC_REP_STOS>, REP, OpSize16,		[(X86rep_stos i16)], IIC_REP_STOS>, REP, OpSize16,
Requires<[In64BitMode]>;		Requires<[In64BitMode]>;
let Uses = [RAX,RCX,RDI] in		let Uses = [RAX,RCX,RDI] in
def REP_STOSD_64 : I<0xAB, RawFrm, (outs), (ins), "{rep;stosl\|rep stosd}",		def REP_STOSD_64 : I<0xAB, RawFrm, (outs), (ins), "{rep;stosl\|rep stosd}",
[(X86rep_stos i32)], IIC_REP_STOS>, REP, OpSize32,		[(X86rep_stos i32)], IIC_REP_STOS>, REP, OpSize32,
Requires<[In64BitMode]>;		Requires<[In64BitMode]>;

let Uses = [RAX,RCX,RDI] in		let Uses = [RAX,RCX,RDI] in
def REP_STOSQ_64 : RI<0xAB, RawFrm, (outs), (ins), "{rep;stosq\|rep stosq}",		def REP_STOSQ_64 : RI<0xAB, RawFrm, (outs), (ins), "{rep;stosq\|rep stosq}",
[(X86rep_stos i64)], IIC_REP_STOS>, REP,		[(X86rep_stos i64)], IIC_REP_STOS>, REP,
Requires<[In64BitMode]>;		Requires<[In64BitMode]>;
}		}
} // SchedRW		} // SchedRW

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
▲ Show 20 Lines • Show All 1,451 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/X86/X86MCInstLower.cpp

//===-- X86MCInstLower.cpp - Convert X86 MachineInstr to an MCInst --------===//		//===-- X86MCInstLower.cpp - Convert X86 MachineInstr to an MCInst --------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
//		//
// This file contains code to lower X86 MachineInstrs to their corresponding		// This file contains code to lower X86 MachineInstrs to their corresponding
// MCInst records.		// MCInst records.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "X86AsmPrinter.h"		#include "X86AsmPrinter.h"
		#include "X86RegisterInfo.h"
#include "InstPrinter/X86ATTInstPrinter.h"		#include "InstPrinter/X86ATTInstPrinter.h"
#include "MCTargetDesc/X86BaseInfo.h"		#include "MCTargetDesc/X86BaseInfo.h"
#include "llvm/ADT/SmallString.h"		#include "llvm/ADT/SmallString.h"
#include "llvm/CodeGen/MachineFunction.h"		#include "llvm/CodeGen/MachineFunction.h"
#include "llvm/CodeGen/MachineModuleInfoImpls.h"		#include "llvm/CodeGen/MachineModuleInfoImpls.h"
#include "llvm/CodeGen/StackMaps.h"		#include "llvm/CodeGen/StackMaps.h"
#include "llvm/IR/DataLayout.h"		#include "llvm/IR/DataLayout.h"
#include "llvm/IR/GlobalValue.h"		#include "llvm/IR/GlobalValue.h"
▲ Show 20 Lines • Show All 750 Lines • ▼ Show 20 Lines	static void LowerPATCHPOINT(MCStreamer &OS, StackMaps &SM,
assert(NumBytes >= EncodedBytes &&		assert(NumBytes >= EncodedBytes &&
"Patchpoint can't request size less than the length of a call.");		"Patchpoint can't request size less than the length of a call.");

EmitNops(OS, NumBytes - EncodedBytes, Is64Bit, STI);		EmitNops(OS, NumBytes - EncodedBytes, Is64Bit, STI);
}		}

void X86AsmPrinter::EmitInstruction(const MachineInstr *MI) {		void X86AsmPrinter::EmitInstruction(const MachineInstr *MI) {
X86MCInstLower MCInstLowering(MF, this);		X86MCInstLower MCInstLowering(MF, this);
		const X86RegisterInfo *RI =
		static_cast<const X86RegisterInfo *>(TM.getRegisterInfo());

switch (MI->getOpcode()) {		switch (MI->getOpcode()) {
case TargetOpcode::DBG_VALUE:		case TargetOpcode::DBG_VALUE:
llvm_unreachable("Should be handled target independently");		llvm_unreachable("Should be handled target independently");

// Emit nothing here but a comment if we can.		// Emit nothing here but a comment if we can.
case X86::Int_MemBarrier:		case X86::Int_MemBarrier:
OutStreamer.emitRawComment("MEMBARRIER");		OutStreamer.emitRawComment("MEMBARRIER");
return;		return;
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	void X86AsmPrinter::EmitInstruction(const MachineInstr *MI) {

case X86::MORESTACK_RET_RESTORE_R10:		case X86::MORESTACK_RET_RESTORE_R10:
// Return, then restore R10.		// Return, then restore R10.
EmitToStreamer(OutStreamer, MCInstBuilder(getRetOpcode(*Subtarget)));		EmitToStreamer(OutStreamer, MCInstBuilder(getRetOpcode(*Subtarget)));
EmitToStreamer(OutStreamer, MCInstBuilder(X86::MOV64rr)		EmitToStreamer(OutStreamer, MCInstBuilder(X86::MOV64rr)
.addReg(X86::R10)		.addReg(X86::R10)
.addReg(X86::RAX));		.addReg(X86::RAX));
return;		return;

		case X86::SEH_PushReg:
		OutStreamer.EmitWin64EHPushReg(
		RI->getSEHRegNum(MI->getOperand(0).getImm()));
		return;

		case X86::SEH_SaveReg:
		OutStreamer.EmitWin64EHSaveReg(RI->getSEHRegNum(MI->getOperand(0).getImm()),
		MI->getOperand(1).getImm());
		return;

		case X86::SEH_SaveXMM:
		OutStreamer.EmitWin64EHSaveXMM(RI->getSEHRegNum(MI->getOperand(0).getImm()),
		MI->getOperand(1).getImm());
		return;

		case X86::SEH_StackAlloc:
		OutStreamer.EmitWin64EHAllocStack(MI->getOperand(0).getImm());
		return;

		case X86::SEH_SetFrame:
		OutStreamer.EmitWin64EHSetFrame(
		RI->getSEHRegNum(MI->getOperand(0).getImm()),
		MI->getOperand(1).getImm());
		return;

		case X86::SEH_PushFrame:
		OutStreamer.EmitWin64EHPushFrame(MI->getOperand(0).getImm());
		return;

		case X86::SEH_EndPrologue:
		OutStreamer.EmitWin64EHEndProlog();
		return;
}		}

MCInst TmpInst;		MCInst TmpInst;
MCInstLowering.Lower(MI, TmpInst);		MCInstLowering.Lower(MI, TmpInst);
EmitToStreamer(OutStreamer, TmpInst);		EmitToStreamer(OutStreamer, TmpInst);
}		}

llvm/trunk/test/CodeGen/X86/2007-05-05-Personality.ll

	; RUN: llc < %s -mtriple=i686-pc-linux-gnu -o - \| FileCheck %s --check-prefix=LIN			; RUN: llc < %s -mtriple=i686-pc-linux-gnu -o - \| FileCheck %s --check-prefix=LIN
	; RUN: llc < %s -mtriple=x86_64-pc-windows-gnu -o - \| FileCheck %s --check-prefix=LIN
	; RUN: llc < %s -mtriple=i386-pc-mingw32 -o - \| FileCheck %s --check-prefix=WIN			; RUN: llc < %s -mtriple=i386-pc-mingw32 -o - \| FileCheck %s --check-prefix=WIN
	; RUN: llc < %s -mtriple=i686-pc-windows-gnu -o - \| FileCheck %s --check-prefix=WIN			; RUN: llc < %s -mtriple=i686-pc-windows-gnu -o - \| FileCheck %s --check-prefix=WIN
				; RUN: llc < %s -mtriple=x86_64-pc-windows-gnu -o - \| FileCheck %s --check-prefix=WIN64

	; LIN: .cfi_personality 0, __gnat_eh_personality			; LIN: .cfi_personality 0, __gnat_eh_personality
	; LIN: .cfi_lsda 0, .Lexception0			; LIN: .cfi_lsda 0, .Lexception0
	; WIN: .cfi_personality 0, ___gnat_eh_personality			; WIN: .cfi_personality 0, ___gnat_eh_personality
	; WIN: .cfi_lsda 0, Lexception0			; WIN: .cfi_lsda 0, Lexception0
				; WIN64: .seh_handler __gnat_eh_personality
				; WIN64: .seh_handlerdata

	@error = external global i8			@error = external global i8

	define void @_ada_x() {			define void @_ada_x() {
	entry:			entry:
	invoke void @raise()			invoke void @raise()
	to label %eh_then unwind label %unwind			to label %eh_then unwind label %unwind

	unwind: ; preds = %entry			unwind: ; preds = %entry
	%eh_ptr = landingpad { i8, i32 } personality i8 bitcast (i32 (...)* @__gnat_eh_personality to i8*)			%eh_ptr = landingpad { i8, i32 } personality i8 bitcast (i32 (...)* @__gnat_eh_personality to i8*)
	catch i8* @error			catch i8* @error
	%eh_select = extractvalue { i8*, i32 } %eh_ptr, 1			%eh_select = extractvalue { i8*, i32 } %eh_ptr, 1
	%eh_typeid = tail call i32 @llvm.eh.typeid.for(i8* @error)			%eh_typeid = tail call i32 @llvm.eh.typeid.for(i8* @error)
	%tmp2 = icmp eq i32 %eh_select, %eh_typeid			%tmp2 = icmp eq i32 %eh_select, %eh_typeid
	br i1 %tmp2, label %eh_then, label %Unwind			br i1 %tmp2, label %eh_then, label %Unwind

	eh_then: ; preds = %unwind, %entry			eh_then: ; preds = %unwind, %entry
	Show All 13 Lines

llvm/trunk/test/CodeGen/X86/2009-06-03-Win64SpillXMM.ll

	; RUN: llc -mcpu=generic -mtriple=x86_64-mingw32 < %s \| FileCheck %s			; RUN: llc -mcpu=generic -mtriple=x86_64-mingw32 < %s \| FileCheck %s
	; CHECK: subq $40, %rsp			; CHECK: subq $40, %rsp
	; CHECK: movaps %xmm8, (%rsp)			; CHECK: movaps %xmm8, 16(%rsp)
	; CHECK: movaps %xmm7, 16(%rsp)			; CHECK: movaps %xmm7, (%rsp)

	define i32 @a() nounwind {			define i32 @a() nounwind {
	entry:			entry:
	tail call void asm sideeffect "", "~{xmm7},~{xmm8},~{dirflag},~{fpsr},~{flags}"() nounwind			tail call void asm sideeffect "", "~{xmm7},~{xmm8},~{dirflag},~{fpsr},~{flags}"() nounwind
	ret i32 undef			ret i32 undef
	}			}

llvm/trunk/test/CodeGen/X86/avx-intel-ocl.ll

	; RUN: llc < %s -mtriple=i686-apple-darwin -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=X32 %s			; RUN: llc < %s -mtriple=i686-apple-darwin -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=X32 %s
	; RUN: llc < %s -mtriple=i386-pc-win32 -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=X32 %s			; RUN: llc < %s -mtriple=i386-pc-win32 -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=X32 %s
	; RUN: llc < %s -mtriple=x86_64-win32 -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=WIN64 %s			; RUN: llc < %s -mtriple=x86_64-win32 -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=WIN64 %s
	; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=X64 %s			; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=corei7-avx -mattr=+avx \| FileCheck -check-prefix=X64 %s

	declare <16 x float> @func_float16_ptr(<16 x float>, <16 x float> *)			declare <16 x float> @func_float16_ptr(<16 x float>, <16 x float> *)
	declare <16 x float> @func_float16(<16 x float>, <16 x float>)			declare <16 x float> @func_float16(<16 x float>, <16 x float>)
	declare i32 @func_int(i32, i32)			declare i32 @func_int(i32, i32)

	; WIN64: testf16_inp			; WIN64-LABEL: testf16_inp
	; WIN64: vaddps {{.*}}, {{%ymm[0-1]}}			; WIN64: vaddps {{.*}}, {{%ymm[0-1]}}
	; WIN64: vaddps {{.*}}, {{%ymm[0-1]}}			; WIN64: vaddps {{.*}}, {{%ymm[0-1]}}
	; WIN64: leaq {{.*}}(%rsp), %rcx			; WIN64: leaq {{.*}}(%rsp), %rcx
	; WIN64: call			; WIN64: call
	; WIN64: ret			; WIN64: ret

	; X32: testf16_inp			; X32-LABEL: testf16_inp
	; X32: movl %eax, (%esp)			; X32: movl %eax, (%esp)
	; X32: vaddps {{.*}}, {{%ymm[0-1]}}			; X32: vaddps {{.*}}, {{%ymm[0-1]}}
	; X32: vaddps {{.*}}, {{%ymm[0-1]}}			; X32: vaddps {{.*}}, {{%ymm[0-1]}}
	; X32: call			; X32: call
	; X32: ret			; X32: ret

	; X64: testf16_inp			; X64-LABEL: testf16_inp
	; X64: vaddps {{.*}}, {{%ymm[0-1]}}			; X64: vaddps {{.*}}, {{%ymm[0-1]}}
	; X64: vaddps {{.*}}, {{%ymm[0-1]}}			; X64: vaddps {{.*}}, {{%ymm[0-1]}}
	; X64: leaq {{.*}}(%rsp), %rdi			; X64: leaq {{.*}}(%rsp), %rdi
	; X64: call			; X64: call
	; X64: ret			; X64: ret

	;test calling conventions - input parameters			;test calling conventions - input parameters
	define <16 x float> @testf16_inp(<16 x float> %a, <16 x float> %b) nounwind {			define <16 x float> @testf16_inp(<16 x float> %a, <16 x float> %b) nounwind {
	%y = alloca <16 x float>, align 16			%y = alloca <16 x float>, align 16
	%x = fadd <16 x float> %a, %b			%x = fadd <16 x float> %a, %b
	%1 = call intel_ocl_bicc <16 x float> @func_float16_ptr(<16 x float> %x, <16 x float>* %y)			%1 = call intel_ocl_bicc <16 x float> @func_float16_ptr(<16 x float> %x, <16 x float>* %y)
	%2 = load <16 x float>* %y, align 16			%2 = load <16 x float>* %y, align 16
	%3 = fadd <16 x float> %2, %1			%3 = fadd <16 x float> %2, %1
	ret <16 x float> %3			ret <16 x float> %3
	}			}

	;test calling conventions - preserved registers			;test calling conventions - preserved registers

	; preserved ymm6-ymm15			; preserved ymm6-ymm15
	; WIN64: testf16_regs			; WIN64-LABEL: testf16_regs
	; WIN64: call			; WIN64: call
	; WIN64: vaddps {{%ymm[6-7]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}			; WIN64: vaddps {{%ymm[6-7]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}
	; WIN64: vaddps {{%ymm[6-7]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}			; WIN64: vaddps {{%ymm[6-7]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}
	; WIN64: ret			; WIN64: ret

	; preserved ymm8-ymm15			; preserved ymm8-ymm15
	; X64: testf16_regs			; X64-LABEL: testf16_regs
	; X64: call			; X64: call
	; X64: vaddps {{%ymm[8-9]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}			; X64: vaddps {{%ymm[8-9]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}
	; X64: vaddps {{%ymm[8-9]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}			; X64: vaddps {{%ymm[8-9]}}, {{%ymm[0-1]}}, {{%ymm[0-1]}}
	; X64: ret			; X64: ret

	define <16 x float> @testf16_regs(<16 x float> %a, <16 x float> %b) nounwind {			define <16 x float> @testf16_regs(<16 x float> %a, <16 x float> %b) nounwind {
	%y = alloca <16 x float>, align 16			%y = alloca <16 x float>, align 16
	%x = fadd <16 x float> %a, %b			%x = fadd <16 x float> %a, %b
	%1 = call intel_ocl_bicc <16 x float> @func_float16_ptr(<16 x float> %x, <16 x float>* %y)			%1 = call intel_ocl_bicc <16 x float> @func_float16_ptr(<16 x float> %x, <16 x float>* %y)
	%2 = load <16 x float>* %y, align 16			%2 = load <16 x float>* %y, align 16
	%3 = fadd <16 x float> %1, %b			%3 = fadd <16 x float> %1, %b
	%4 = fadd <16 x float> %2, %3			%4 = fadd <16 x float> %2, %3
	ret <16 x float> %4			ret <16 x float> %4
	}			}

	; test calling conventions - prolog and epilog			; test calling conventions - prolog and epilog
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64-LABEL: test_prolog_epilog
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rsp).}} # 32-byte Spill			; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
				; WIN64: vmovaps {{%ymm([6-9]\|1[0-5])}}, {{.(%rbp).}} # 32-byte Spill
	; WIN64: call			; WIN64: call
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload
	; WIN64: vmovaps {{.(%rsp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload			; WIN64: vmovaps {{.(%rbp).}}, {{%ymm([6-9]\|1[0-5])}} # 32-byte Reload

				; X64-LABEL: test_prolog_epilog
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill			; X64: vmovups {{%ymm([8-9]\|1[0-5])}}, {{.*}}(%rsp) ## 32-byte Folded Spill
	; X64: call			; X64: call
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload			; X64: vmovups {{.*}}(%rsp), {{%ymm([8-9]\|1[0-5])}} ## 32-byte Folded Reload
	define intel_ocl_bicc <16 x float> @test_prolog_epilog(<16 x float> %a, <16 x float> %b) nounwind {			define intel_ocl_bicc <16 x float> @test_prolog_epilog(<16 x float> %a, <16 x float> %b) nounwind {
	%c = call <16 x float> @func_float16(<16 x float> %a, <16 x float> %b)			%c = call <16 x float> @func_float16(<16 x float> %a, <16 x float> %b)
	ret <16 x float> %c			ret <16 x float> %c
	}			}

	; test functions with integer parameters			; test functions with integer parameters
	; pass parameters on stack for 32-bit platform			; pass parameters on stack for 32-bit platform
				; X32-LABEL: test_int
	; X32: movl {{.*}}, 4(%esp)			; X32: movl {{.*}}, 4(%esp)
	; X32: movl {{.*}}, (%esp)			; X32: movl {{.*}}, (%esp)
	; X32: call			; X32: call
	; X32: addl {{.*}}, %eax			; X32: addl {{.*}}, %eax

	; pass parameters in registers for 64-bit platform			; pass parameters in registers for 64-bit platform
				; X64-LABEL: test_int
	; X64: leal {{.*}}, %edi			; X64: leal {{.*}}, %edi
	; X64: movl {{.*}}, %esi			; X64: movl {{.*}}, %esi
	; X64: call			; X64: call
	; X64: addl {{.*}}, %eax			; X64: addl {{.*}}, %eax
	define i32 @test_int(i32 %a, i32 %b) nounwind {			define i32 @test_int(i32 %a, i32 %b) nounwind {
	%c1 = add i32 %a, %b			%c1 = add i32 %a, %b
	%c2 = call intel_ocl_bicc i32 @func_int(i32 %c1, i32 %a)			%c2 = call intel_ocl_bicc i32 @func_int(i32 %c1, i32 %a)
	%c = add i32 %c2, %b			%c = add i32 %c2, %b
	ret i32 %c			ret i32 %c
	}			}

	; WIN64: test_float4			; WIN64-LABEL: test_float4
	; WIN64-NOT: vzeroupper			; WIN64-NOT: vzeroupper
	; WIN64: call			; WIN64: call
	; WIN64-NOT: vzeroupper			; WIN64-NOT: vzeroupper
	; WIN64: call			; WIN64: call
	; WIN64: ret			; WIN64: ret

	; X64: test_float4			; X64-LABEL: test_float4
	; X64-NOT: vzeroupper			; X64-NOT: vzeroupper
	; X64: call			; X64: call
	; X64-NOT: vzeroupper			; X64-NOT: vzeroupper
	; X64: call			; X64: call
	; X64: ret			; X64: ret

	; X32: test_float4			; X32-LABEL: test_float4
	; X32: vzeroupper			; X32: vzeroupper
	; X32: call			; X32: call
	; X32: vzeroupper			; X32: vzeroupper
	; X32: call			; X32: call
	; X32: ret			; X32: ret

	declare <4 x float> @func_float4(<4 x float>, <4 x float>, <4 x float>)			declare <4 x float> @func_float4(<4 x float>, <4 x float>, <4 x float>)

	Show All 15 Lines

llvm/trunk/test/CodeGen/X86/gcc_except_table.ll

	; RUN: llc -mtriple x86_64-apple-darwin %s -o - \| FileCheck %s --check-prefix=APPLE			; RUN: llc -mtriple x86_64-apple-darwin %s -o - \| FileCheck %s --check-prefix=APPLE
	; RUN: llc -mtriple x86_64-pc-windows-gnu %s -o - \| FileCheck %s --check-prefix=MINGW64			; RUN: llc -mtriple x86_64-pc-windows-gnu %s -o - \| FileCheck %s --check-prefix=MINGW64
	; RUN: llc -mtriple i686-pc-windows-gnu %s -o - \| FileCheck %s --check-prefix=MINGW32			; RUN: llc -mtriple i686-pc-windows-gnu %s -o - \| FileCheck %s --check-prefix=MINGW32
	@_ZTIi = external constant i8*			@_ZTIi = external constant i8*

	define i32 @main() uwtable optsize ssp {			define i32 @main() uwtable optsize ssp {
	; APPLE: .cfi_startproc			; APPLE: .cfi_startproc
	; APPLE: .cfi_personality 155, ___gxx_personality_v0			; APPLE: .cfi_personality 155, ___gxx_personality_v0
	; APPLE: .cfi_lsda 16, Lexception0			; APPLE: .cfi_lsda 16, Lexception0
	; APPLE: .cfi_def_cfa_offset 16			; APPLE: .cfi_def_cfa_offset 16
	; APPLE: callq __Unwind_Resume			; APPLE: callq __Unwind_Resume
	; APPLE: .cfi_endproc			; APPLE: .cfi_endproc
	; APPLE: GCC_except_table0:			; APPLE: GCC_except_table0:
	; APPLE: Lexception0:			; APPLE: Lexception0:

	; MINGW64: .cfi_startproc			; MINGW64: .seh_proc
	; MINGW64: .cfi_personality 0, __gxx_personality_v0			; MINGW64: .seh_handler __gxx_personality_v0
	; MINGW64: .cfi_lsda 0, .Lexception0			; MINGW64: .seh_setframe 5, 0
	; MINGW64: .cfi_def_cfa_offset 16
	; MINGW64: callq _Unwind_Resume			; MINGW64: callq _Unwind_Resume
	; MINGW64: .cfi_endproc			; MINGW64: .seh_handlerdata
	; MINGW64: GCC_except_table0:			; MINGW64: GCC_except_table0:
	; MINGW64: Lexception0:			; MINGW64: Lexception0:
				; MINGW64: .seh_endproc

	; MINGW32: .cfi_startproc			; MINGW32: .cfi_startproc
	; MINGW32: .cfi_personality 0, ___gxx_personality_v0			; MINGW32: .cfi_personality 0, ___gxx_personality_v0
	; MINGW32: .cfi_lsda 0, Lexception0			; MINGW32: .cfi_lsda 0, Lexception0
	; MINGW32: .cfi_def_cfa_offset 8			; MINGW32: .cfi_def_cfa_offset 8
	; MINGW32: calll __Unwind_Resume			; MINGW32: calll __Unwind_Resume
	; MINGW32: .cfi_endproc			; MINGW32: .cfi_endproc
	; MINGW32: GCC_except_table0:			; MINGW32: GCC_except_table0:
	Show All 21 Lines

llvm/trunk/test/CodeGen/X86/win64_eh.ll

				; RUN: llc < %s -O0 -mcpu=corei7 -mtriple=x86_64-pc-win32 \| FileCheck %s -check-prefix=WIN64
				; RUN: llc < %s -O0 -mcpu=corei7 -mtriple=x86_64-pc-mingw32 \| FileCheck %s -check-prefix=WIN64

				; Check function without prolog
				define void @foo0() uwtable {
				entry:
				ret void
				}
				; WIN64-LABEL: foo0:
				; WIN64: .seh_proc foo0
				; WIN64: .seh_endprologue
				; WIN64: ret
				; WIN64: .seh_endproc

				; Checks a small stack allocation
				define void @foo1() uwtable {
				entry:
				%baz = alloca [2000 x i16], align 2
				ret void
				}
				; WIN64-LABEL: foo1:
				; WIN64: .seh_proc foo1
				; WIN64: subq $4000, %rsp
				; WIN64: .seh_stackalloc 4000
				; WIN64: .seh_endprologue
				; WIN64: addq $4000, %rsp
				; WIN64: ret
				; WIN64: .seh_endproc

				; Checks a stack allocation requiring call to __chkstk/___chkstk_ms
				define void @foo2() uwtable {
				entry:
				%baz = alloca [4000 x i16], align 2
				ret void
				}
				; WIN64-LABEL: foo2:
				; WIN64: .seh_proc foo2
				; WIN64: movabsq $8000, %rax
				; WIN64: callq {{__chkstk\|___chkstk_ms}}
				; WIN64: subq %rax, %rsp
				; WIN64: .seh_stackalloc 8000
				; WIN64: .seh_endprologue
				; WIN64: addq $8000, %rsp
				; WIN64: ret
				; WIN64: .seh_endproc


				; Checks stack push
				define i32 @foo3(i32 %f_arg, i32 %e_arg, i32 %d_arg, i32 %c_arg, i32 %b_arg, i32 %a_arg) uwtable {
				entry:
				%a = alloca i32
				%b = alloca i32
				%c = alloca i32
				%d = alloca i32
				%e = alloca i32
				%f = alloca i32
				store i32 %a_arg, i32* %a
				store i32 %b_arg, i32* %b
				store i32 %c_arg, i32* %c
				store i32 %d_arg, i32* %d
				store i32 %e_arg, i32* %e
				store i32 %f_arg, i32* %f
				%tmp = load i32* %a
				%tmp1 = mul i32 %tmp, 2
				%tmp2 = load i32* %b
				%tmp3 = mul i32 %tmp2, 3
				%tmp4 = add i32 %tmp1, %tmp3
				%tmp5 = load i32* %c
				%tmp6 = mul i32 %tmp5, 5
				%tmp7 = add i32 %tmp4, %tmp6
				%tmp8 = load i32* %d
				%tmp9 = mul i32 %tmp8, 7
				%tmp10 = add i32 %tmp7, %tmp9
				%tmp11 = load i32* %e
				%tmp12 = mul i32 %tmp11, 11
				%tmp13 = add i32 %tmp10, %tmp12
				%tmp14 = load i32* %f
				%tmp15 = mul i32 %tmp14, 13
				%tmp16 = add i32 %tmp13, %tmp15
				ret i32 %tmp16
				}
				; WIN64-LABEL: foo3:
				; WIN64: .seh_proc foo3
				; WIN64: pushq %rsi
				; WIN64: .seh_pushreg 6
				; WIN64: subq $24, %rsp
				; WIN64: .seh_stackalloc 24
				; WIN64: .seh_endprologue
				; WIN64: addq $24, %rsp
				; WIN64: popq %rsi
				; WIN64: ret
				; WIN64: .seh_endproc


				; Check emission of eh handler and handler data
				declare i32 @_d_eh_personality(i32, i32, i64, i8, i8)
				declare void @_d_eh_resume_unwind(i8*)

				declare i32 @bar()

				define i32 @foo4() #0 {
				entry:
				%step = alloca i32, align 4
				store i32 0, i32* %step
				%tmp = load i32* %step

				%tmp1 = invoke i32 @bar()
				to label %finally unwind label %landingpad

				finally:
				store i32 1, i32* %step
				br label %endtryfinally

				landingpad:
				%landing_pad = landingpad { i8, i32 } personality i32 (i32, i32, i64, i8, i8) @_d_eh_personality
				cleanup
				%tmp3 = extractvalue { i8*, i32 } %landing_pad, 0
				store i32 2, i32* %step
				call void @_d_eh_resume_unwind(i8* %tmp3)
				unreachable

				endtryfinally:
				%tmp10 = load i32* %step
				ret i32 %tmp10
				}
				; WIN64-LABEL: foo4:
				; WIN64: .seh_proc foo4
				; WIN64: .seh_handler _d_eh_personality, @unwind, @except
				; WIN64: subq $56, %rsp
				; WIN64: .seh_stackalloc 56
				; WIN64: .seh_endprologue
				; WIN64: addq $56, %rsp
				; WIN64: ret
				; WIN64: .seh_handlerdata
				; WIN64: .seh_endproc


				; Check stack re-alignment and xmm spilling
				define void @foo5() uwtable {
				entry:
				%s = alloca i32, align 64
				call void asm sideeffect "", "~{rbx},~{rdi},~{xmm6},~{xmm7}"()
				ret void
				}
				; WIN64-LABEL: foo5:
				; WIN64: .seh_proc foo5
				; WIN64: pushq %rbp
				; WIN64: .seh_pushreg 5
				; WIN64: movq %rsp, %rbp
				; WIN64: pushq %rdi
				; WIN64: .seh_pushreg 7
				; WIN64: pushq %rbx
				; WIN64: .seh_pushreg 3
				; WIN64: andq $-64, %rsp
				; WIN64: subq $128, %rsp
				; WIN64: .seh_stackalloc 48
				; WIN64: .seh_setframe 5, 64
				; WIN64: movaps %xmm7, -32(%rbp) # 16-byte Spill
				; WIN64: movaps %xmm6, -48(%rbp) # 16-byte Spill
				; WIN64: .seh_savexmm 6, 16
				; WIN64: .seh_savexmm 7, 32
				; WIN64: .seh_endprologue
				; WIN64: movaps -48(%rbp), %xmm6 # 16-byte Reload
				; WIN64: movaps -32(%rbp), %xmm7 # 16-byte Reload
				; WIN64: leaq -16(%rbp), %rsp
				; WIN64: popq %rbx
				; WIN64: popq %rdi
				; WIN64: popq %rbp
				; WIN64: retq
				; WIN64: .seh_endproc

This is an archive of the discontinued LLVM Phabricator instance.

Generate SEH unwinding info on Win64ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 10704

llvm/trunk/include/llvm/CodeGen/MachineFrameInfo.h

llvm/trunk/include/llvm/MC/MCAsmInfo.h

llvm/trunk/include/llvm/Target/TargetFrameLowering.h

llvm/trunk/lib/CodeGen/AsmPrinter/Win64Exception.cpp

llvm/trunk/lib/CodeGen/MachineFunction.cpp

llvm/trunk/lib/CodeGen/PrologEpilogInserter.cpp

llvm/trunk/lib/MC/MCObjectFileInfo.cpp

llvm/trunk/lib/MC/MCStreamer.cpp

llvm/trunk/lib/Target/X86/MCTargetDesc/X86MCAsmInfo.cpp

llvm/trunk/lib/Target/X86/X86FrameLowering.h

llvm/trunk/lib/Target/X86/X86FrameLowering.cpp

llvm/trunk/lib/Target/X86/X86ISelLowering.cpp

llvm/trunk/lib/Target/X86/X86InstrCompiler.td

llvm/trunk/lib/Target/X86/X86MCInstLower.cpp

llvm/trunk/test/CodeGen/X86/2007-05-05-Personality.ll

llvm/trunk/test/CodeGen/X86/2009-06-03-Win64SpillXMM.ll

llvm/trunk/test/CodeGen/X86/avx-intel-ocl.ll

llvm/trunk/test/CodeGen/X86/gcc_except_table.ll

llvm/trunk/test/CodeGen/X86/win64_eh.ll

Generate SEH unwinding info on Win64
ClosedPublic