This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Target/ARM/
-
Target/
-
ARM/
-
ARMAsmPrinter.cpp
-
ARMBaseInstrInfo.h
-
ARMBaseInstrInfo.cpp
-
ARMBaseRegisterInfo.h
-
ARMExpandPseudoInsts.cpp
2/7
ARMFrameLowering.cpp
-
ARMInstrInfo.td
-
ARMInstrThumb2.td
-
Thumb2SizeReduction.cpp
-
test/
-
CodeGen/ARM/Windows/
-
ARM/
-
Windows/
-
dllimport.ll
-
wineh-opcodes.ll
-
wineh-tailmerge.ll
-
DebugInfo/COFF/ARMNT/
-
COFF/
-
ARMNT/
-
arm-register-variables.ll

Differential D125648

[ARM SEH 6] [ARM] Add SEH opcodes in frame lowering
ClosedPublic

Authored by mstorsjo on May 15 2022, 2:38 PM.

Download Raw Diff

Details

Reviewers

efriedma
rnk
zzheng

Commits

rGd8e67c1cccd8: [ARM] Add SEH opcodes in frame lowering

Summary

Skip inserting regular CFI instructions if using WinCFI.

This is based a fair amount on the corresponding ARM64 implementation,
but instead of trying to insert the SEH opcodes one by one where
we generate other prolog/epilog instructions, we try to walk over the
whole prolog/epilog range and insert them. This is done because in
many cases, the exact number of instructions inserted is abstracted
away deeper.

For some cases, we manually insert specific SEH opcodes directly where
instructions are generated, where the automatic mapping of instructions
to SEH opcodes doesn't hold up (e.g. for __chkstk stack probes).

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

mstorsjo created this revision.May 15 2022, 2:38 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 15 2022, 2:38 PM

Herald added subscribers: hiraditya, kristof.beyls. · View Herald Transcript

mstorsjo requested review of this revision.May 15 2022, 2:38 PM

Herald added a project: Restricted Project. · View Herald TranscriptMay 15 2022, 2:38 PM

mstorsjo added a parent revision: D125647: [ARM SEH 5] [MC] [Win64EH] Check that the SEH unwind opcodes match the actual instructions.May 15 2022, 2:38 PM

mstorsjo added a child revision: D125649: [ARM SEH 7] [ARM] Adjust the frame pointer when it's needed for SEH unwinding.

FYI regarding this patchset; this set of patches allows generating SEH unwind info for plain unwind tables, and it works fine for itanium exception handling in mingw mode.

For the MSVC mode SEH __try, and MSVC mode C++ exceptions, more code generation changes are needed. Hopefully those changes are kinda straightforward (I presume it should be possible to just borrow bits from the AArch64 target), but I haven't implemented them - hopefully someone else can pick that up if the rest of these changes end up mergeable at some point.

Harbormaster completed remote builds in B164540: Diff 429566.May 15 2022, 3:13 PM

efriedma added inline comments.May 16 2022, 12:40 PM

llvm/lib/Target/ARM/ARMFrameLowering.cpp
1091	We shouldn't be encoding stack realignment into the unwind data. It's basically a dynamic allocation: we have to emit a frame pointer before we realign the stack, and we should cut off the unwind prologue immediately after the frame pointer is set up.

mstorsjo added inline comments.May 16 2022, 1:17 PM

llvm/lib/Target/ARM/ARMFrameLowering.cpp
1091	Right, that'd be even cleaner. (This part gets tested only in the next patch which tweaks the frame pointers, and all the later opcodes are nops in that case. So cutting off the prologue at that point sounds like a good strategy.)

mstorsjo mentioned this in D125645: [ARM SEH 3] [ARM] [MC] Add support for writing ARM WinEH unwind info.May 22 2022, 2:37 PM

Updated to cut the prologue short, omitting parts that realign the stack. Updated to use separate .seh_nop directives for epilogues.

Since the beginning, I've had to disable PostRAScheduler, because it would reshuffle instructions independently of their associated SEH_* machineinstructions. I tried to compare it to the AArch64 implementation, but there, PostRAScheduler doesn't seem to be executed at all, so that didn't give any extra info about how to avoid it. So for this case, I tweaked ARMSubtarget::enablePostRAScheduler to disable the pass when producing SEH.

Compared to the AArch64 case, the ends of epilogues are slightly more problematic here. On AArch64, the epilogue end is before the ret instruction. On ARM, e.g. an bx lr (ARM::tBX_RET) is now followed by a ARM::SEH_Nop and ARM::SEH_EpilogEnd. As the ARM::tBX_RET is a terminator, the SEH_Nop and SEH_EpilogEnd that follows it also must be terminators, otherwise the machine instruction verifier bails out. Due to this, I've added a separate ARM::SEH_Nop_Ret which is marked a terminator.

After updating to use separate nop instructions, the end of an epilogue tBX_RET, SEH_EpilogEnd (nop=1) was changed into tBX_RET, SEH_Nop_Ret, SEH_EpilogEnd. This causes the machine block placement pass to do tail merging of multiple such epilogues in one function (where it previously didn't), which loses the SEH_Nop_ Ret and SEH_EpilogEnd for all but one of those epilogues.

What's the correct way of making sure that tail merging doesn't try to touch the SEH_* instructions? This doesn't seem to be happening on AArch64.

I looked into making a bundle of the rBX_RET, SEH_Nop_Ret, SEH_EpilogEnd (with an finalizeBundle over those three instructions), but that later breaks assembly output with a failed assert "Cannot print this instruction.". I presume that would require something to unbundle them later?

When looking into the tail merging pass, I noticed that if MachineInstr::isCFIInstruction() would return true for the SEH_* instructions, it could be handled differently and maybe the pass wouldn't break them. But it doesn't seem trivial to test out making that return true for the SEH_* instructions, so I don't know if that would fix any of these issues or not.

Harbormaster completed remote builds in B165767: Diff 431271.May 22 2022, 3:30 PM

In D125648#3530563, @mstorsjo wrote:

When looking into the tail merging pass, I noticed that if MachineInstr::isCFIInstruction() would return true for the SEH_* instructions, it could be handled differently and maybe the pass wouldn't break them. But it doesn't seem trivial to test out making that return true for the SEH_* instructions, so I don't know if that would fix any of these issues or not.

I managed to make a PoC of that, where the core of the changes were this:

diff --git a/llvm/include/llvm/CodeGen/MachineInstr.h b/llvm/include/llvm/CodeGen/MachineInstr.h
index cb6698c12d8e..15bdfdb5d7eb 100644
--- a/llvm/include/llvm/CodeGen/MachineInstr.h
+++ b/llvm/include/llvm/CodeGen/MachineInstr.h 
@@ -112,6 +112,7 @@ public:
     NoMerge      = 1 << 15,             // Passes that drop source location info
                                         // (e.g. branch folding) should skip
                                         // this instruction.
+    CFILike      = 1 << 16,
   };
 
 private:
@@ -1207,7 +1208,7 @@ public:
   }
 
   // True if the instruction represents a position in the function.
-  bool isPosition() const { return isLabel() || isCFIInstruction(); }
+  bool isPosition() const { return isLabel() || isCFIInstruction() || getFlag(C
FILike); }
 
   bool isNonListDebugValue() const {
     return getOpcode() == TargetOpcode::DBG_VALUE;

diff --git a/llvm/lib/CodeGen/BranchFolding.cpp b/llvm/lib/CodeGen/BranchFolding
.cpp
index 76f6a00b718e..de81ca874800 100644
--- a/llvm/lib/CodeGen/BranchFolding.cpp
+++ b/llvm/lib/CodeGen/BranchFolding.cpp
@@ -294,7 +294,7 @@ static unsigned HashEndOfMBB(const MachineBasicBlock &MBB) {
 
 /// Whether MI should be counted as an instruction when calculating common tail
.
 static bool countsAsInstruction(const MachineInstr &MI) {
-  return !(MI.isDebugInstr() || MI.isCFIInstruction());
+  return !(MI.isDebugInstr() || MI.isCFIInstruction() || MI.getFlag(MachineInst
r::CFILike));
 }
 
 /// Iterate backwards from the given iterator \p I, towards the beginning of th
e

This fixes the issue I was seeing (by setting that flag on all the SEH_* instructions). However, I'm not sure if that actually avoids the real issue (of splitting off the SEH_* instructions from the regular instructions) or if it just makes the tail merging no longer seem worthwhile doing. It doesn't fix the need to disable the PostRAScheduler in any case.

(Also, adding this new MachineInstr flag is problematic, as it requires widening MachineInstr::Flags from uint16_t to uint32_t.)

For scheduling, the AArch64 backend overrides "isSchedulingBoundary()" for SEH instructions.

I don't think we ever ran into issues with tail merge... maybe setting MachineInstr:::NoMerge would work?

In D125648#3531768, @efriedma wrote:

For scheduling, the AArch64 backend overrides "isSchedulingBoundary()" for SEH instructions.

Oh, thanks, that does fix the issue with PostRAScheduling - thanks!

I don't think we ever ran into issues with tail merge... maybe setting MachineInstr:::NoMerge would work?

Yup, that does seem to do the trick too. Thanks!

No longer need to disable PostRAScheduler, and fixed tail merging by setting the MachineInst::NoMerge flag.

Harbormaster completed remote builds in B165909: Diff 431472.May 23 2022, 2:19 PM

Plain rebase, no functional change.

Harbormaster completed remote builds in B166447: Diff 432237.May 26 2022, 4:13 AM

efriedma added inline comments.May 31 2022, 5:45 PM

llvm/lib/Target/ARM/ARMFrameLowering.cpp
300	report_fatal_error (here and other places you abort()).
304	Maybe add t2MOVi16/t2MOVTi16 here?

Moved all SEH_Nop insertion for __chkstk into insertSEH (with only one SEH instruction being manually added there, for the SEH_StackAlloc after the __chkstk call).

Switched to report_fatal_error instead of a manual printout and std::abort().

Harbormaster completed remote builds in B167215: Diff 433335.Jun 1 2022, 3:30 AM

efriedma added inline comments.Jun 1 2022, 11:43 AM

llvm/lib/Target/ARM/ARMFrameLowering.cpp
315	Looking at this again, this is actually sort of scary. In particular, this is dependent on looking into the future: trying to predict what Thumb2SizeReduction will do with a given instruction. Which is at best fragile, at worst broken if Thumb2SizeReduction doesn't run, or decides to do something different. I guess you can sort of predict what will happen for t2MOVi16 and t2LDMIA_RET/t2LDMIA_UPD/t2STMDB_UPD. But it's less clear in other cases; we currently don't optimize t2SUBspImm, but we could. Or for TCRETURNdi, we don't actually decide the size until we hit the assembler. I'm thinking we might want to disable Thumb2SizeReduction on instructions with SEH opcodes. (Or equivalently, on FrameSetup instructions if SEH unwind is enabled.)

mstorsjo added inline comments.Jun 1 2022, 12:48 PM

llvm/lib/Target/ARM/ARMFrameLowering.cpp
315	Thanks - this was indeed one of my fears initially. In practice, these guesses for what it will end up like have worked for all the code I've tested this on so far. But it's indeed brittle. Skipping Thumb2SizeReduction for FrameSetup/FrameDestroy when SEH unwind is enabled seems to work fine though, so that alleviates most of the issue. (As a future TODO, one could maybe consider rewriting the MI to a narrow form already at this point, for the few opcodes where it matters?) For TCRETURNdi, I also feared that it would be an issue, but it hasn't cropped up. (Or maybe the nondeterminate length of the instruction makes it unable to calculate the length of the epilogue at that point? And thus just skips the check...) But it seems like the pseudo expansion of TCRETURNdi already has got such a case; MachO also requires strictly Thumb2 wide branches for tail calls, so we can opt in to that logic for SEH too.

Skip Thumb2SizeReduction for SEH prologs/epilogs, and force tail calls to wide instructions (just like on MachO), to make sure that the unwind info actually matches the width of the final instructions without heuristics about what later passes will do.

LGTM

llvm/lib/Target/ARM/ARMFrameLowering.cpp
315	I think you might need to implement narrowing for "push" and "pop", but probably not anything else. But in any case, it doesn't need to be in this patch.

This revision is now accepted and ready to land.Jun 1 2022, 1:25 PM

Harbormaster completed remote builds in B167336: Diff 433506.Jun 1 2022, 1:49 PM

This revision was landed with ongoing or failed builds.Jun 2 2022, 2:29 AM

Closed by commit rGd8e67c1cccd8: [ARM] Add SEH opcodes in frame lowering (authored by mstorsjo). · Explain Why

This revision was automatically updated to reflect the committed changes.

mstorsjo added a commit: rGd8e67c1cccd8: [ARM] Add SEH opcodes in frame lowering.

efriedma mentioned this in D149367: Emit the CodeView `S_ARMSWITCHTABLE` debug symbol for jump tables.May 17 2023, 2:26 PM

Revision Contents

Path

Size

llvm/

lib/

Target/

ARM/

ARMAsmPrinter.cpp

41 lines

ARMBaseInstrInfo.h

20 lines

ARMBaseInstrInfo.cpp

3 lines

ARMBaseRegisterInfo.h

2 lines

ARMExpandPseudoInsts.cpp

15 lines

ARMFrameLowering.cpp

375 lines

ARMInstrInfo.td

21 lines

ARMInstrThumb2.td

3 lines

Thumb2SizeReduction.cpp

22 lines

test/

CodeGen/

ARM/

Windows/

dllimport.ll

2 lines

wineh-opcodes.ll

313 lines

wineh-tailmerge.ll

59 lines

DebugInfo/

COFF/

ARMNT/

arm-register-variables.ll

4 lines

Diff 433689

llvm/lib/Target/ARM/ARMAsmPrinter.cpp

Show First 20 Lines • Show All 2,268 Lines • ▼ Show 20 Lines	void ARMAsmPrinter::emitInstruction(const MachineInstr *MI) {
}		}
case ARM::t2SpeculationBarrierSBEndBB: {		case ARM::t2SpeculationBarrierSBEndBB: {
// Print SB		// Print SB
MCInst TmpInstSB;		MCInst TmpInstSB;
TmpInstSB.setOpcode(ARM::t2SB);		TmpInstSB.setOpcode(ARM::t2SB);
EmitToStreamer(*OutStreamer, TmpInstSB);		EmitToStreamer(*OutStreamer, TmpInstSB);
return;		return;
}		}

		case ARM::SEH_StackAlloc:
		ATS.emitARMWinCFIAllocStack(MI->getOperand(0).getImm(),
		MI->getOperand(1).getImm());
		return;

		case ARM::SEH_SaveRegs:
		case ARM::SEH_SaveRegs_Ret:
		ATS.emitARMWinCFISaveRegMask(MI->getOperand(0).getImm(),
		MI->getOperand(1).getImm());
		return;

		case ARM::SEH_SaveSP:
		ATS.emitARMWinCFISaveSP(MI->getOperand(0).getImm());
		return;

		case ARM::SEH_SaveFRegs:
		ATS.emitARMWinCFISaveFRegs(MI->getOperand(0).getImm(),
		MI->getOperand(1).getImm());
		return;

		case ARM::SEH_SaveLR:
		ATS.emitARMWinCFISaveLR(MI->getOperand(0).getImm());
		return;

		case ARM::SEH_Nop:
		case ARM::SEH_Nop_Ret:
		ATS.emitARMWinCFINop(MI->getOperand(0).getImm());
		return;

		case ARM::SEH_PrologEnd:
		ATS.emitARMWinCFIPrologEnd(/Fragment=/false);
		return;

		case ARM::SEH_EpilogStart:
		ATS.emitARMWinCFIEpilogStart(ARMCC::AL);
		return;

		case ARM::SEH_EpilogEnd:
		ATS.emitARMWinCFIEpilogEnd();
		return;
}		}

MCInst TmpInst;		MCInst TmpInst;
LowerARMMachineInstrToMCInst(MI, TmpInst, *this);		LowerARMMachineInstrToMCInst(MI, TmpInst, *this);

EmitToStreamer(*OutStreamer, TmpInst);		EmitToStreamer(*OutStreamer, TmpInst);
}		}

Show All 11 Lines

llvm/lib/Target/ARM/ARMBaseInstrInfo.h

Show First 20 Lines • Show All 751 Lines • ▼ Show 20 Lines	static inline bool isValidCoprocessorNumber(unsigned Num,
// which clash with MVE.		// which clash with MVE.
if (featureBits[ARM::HasV8_1MMainlineOps] &&		if (featureBits[ARM::HasV8_1MMainlineOps] &&
((Num & 0xE) == 0x8 \|\| (Num & 0xE) == 0xE))		((Num & 0xE) == 0x8 \|\| (Num & 0xE) == 0xE))
return false;		return false;

return true;		return true;
}		}

		static inline bool isSEHInstruction(const MachineInstr &MI) {
		unsigned Opc = MI.getOpcode();
		switch (Opc) {
		case ARM::SEH_StackAlloc:
		case ARM::SEH_SaveRegs:
		case ARM::SEH_SaveRegs_Ret:
		case ARM::SEH_SaveSP:
		case ARM::SEH_SaveFRegs:
		case ARM::SEH_SaveLR:
		case ARM::SEH_Nop:
		case ARM::SEH_Nop_Ret:
		case ARM::SEH_PrologEnd:
		case ARM::SEH_EpilogStart:
		case ARM::SEH_EpilogEnd:
		return true;
		default:
		return false;
		}
		}

/// getInstrPredicate - If instruction is predicated, returns its predicate		/// getInstrPredicate - If instruction is predicated, returns its predicate
/// condition, otherwise returns AL. It also returns the condition code		/// condition, otherwise returns AL. It also returns the condition code
/// register by reference.		/// register by reference.
ARMCC::CondCodes getInstrPredicate(const MachineInstr &MI, Register &PredReg);		ARMCC::CondCodes getInstrPredicate(const MachineInstr &MI, Register &PredReg);

unsigned getMatchingCondBranchOpcode(unsigned Opc);		unsigned getMatchingCondBranchOpcode(unsigned Opc);

/// Map pseudo instructions that imply an 'S' bit onto real opcodes. Whether		/// Map pseudo instructions that imply an 'S' bit onto real opcodes. Whether
▲ Show 20 Lines • Show All 178 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMBaseInstrInfo.cpp

Show First 20 Lines • Show All 2,065 Lines • ▼ Show 20 Lines	bool ARMBaseInstrInfo::isSchedulingBoundary(const MachineInstr &MI,
// Terminators and labels can't be scheduled around.		// Terminators and labels can't be scheduled around.
if (MI.isTerminator() \|\| MI.isPosition())		if (MI.isTerminator() \|\| MI.isPosition())
return true;		return true;

// INLINEASM_BR can jump to another block		// INLINEASM_BR can jump to another block
if (MI.getOpcode() == TargetOpcode::INLINEASM_BR)		if (MI.getOpcode() == TargetOpcode::INLINEASM_BR)
return true;		return true;

		if (isSEHInstruction(MI))
		return true;

// Treat the start of the IT block as a scheduling boundary, but schedule		// Treat the start of the IT block as a scheduling boundary, but schedule
// t2IT along with all instructions following it.		// t2IT along with all instructions following it.
// FIXME: This is a big hammer. But the alternative is to add all potential		// FIXME: This is a big hammer. But the alternative is to add all potential
// true and anti dependencies to IT block instructions as implicit operands		// true and anti dependencies to IT block instructions as implicit operands
// to the t2IT instruction. The added compile time and complexity does not		// to the t2IT instruction. The added compile time and complexity does not
// seem worth it.		// seem worth it.
MachineBasicBlock::const_iterator I = MI;		MachineBasicBlock::const_iterator I = MI;
// Make sure to skip any debug instructions		// Make sure to skip any debug instructions
▲ Show 20 Lines • Show All 4,788 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMBaseRegisterInfo.h

Show First 20 Lines • Show All 208 Lines • ▼ Show 20 Lines	bool shouldCoalesce(MachineInstr *MI,
unsigned DstSubReg,		unsigned DstSubReg,
const TargetRegisterClass *NewRC,		const TargetRegisterClass *NewRC,
LiveIntervals &LIS) const override;		LiveIntervals &LIS) const override;

bool shouldRewriteCopySrc(const TargetRegisterClass *DefRC,		bool shouldRewriteCopySrc(const TargetRegisterClass *DefRC,
unsigned DefSubReg,		unsigned DefSubReg,
const TargetRegisterClass *SrcRC,		const TargetRegisterClass *SrcRC,
unsigned SrcSubReg) const override;		unsigned SrcSubReg) const override;

		int getSEHRegNum(unsigned i) const { return getEncodingValue(i); }
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_LIB_TARGET_ARM_ARMBASEREGISTERINFO_H		#endif // LLVM_LIB_TARGET_ARM_ARMBASEREGISTERINFO_H

llvm/lib/Target/ARM/ARMExpandPseudoInsts.cpp

Show All 17 Lines
#include "ARMBaseRegisterInfo.h"		#include "ARMBaseRegisterInfo.h"
#include "ARMConstantPoolValue.h"		#include "ARMConstantPoolValue.h"
#include "ARMMachineFunctionInfo.h"		#include "ARMMachineFunctionInfo.h"
#include "ARMSubtarget.h"		#include "ARMSubtarget.h"
#include "MCTargetDesc/ARMAddressingModes.h"		#include "MCTargetDesc/ARMAddressingModes.h"
#include "llvm/CodeGen/LivePhysRegs.h"		#include "llvm/CodeGen/LivePhysRegs.h"
#include "llvm/CodeGen/MachineFrameInfo.h"		#include "llvm/CodeGen/MachineFrameInfo.h"
#include "llvm/CodeGen/MachineFunctionPass.h"		#include "llvm/CodeGen/MachineFunctionPass.h"
		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "arm-pseudo"		#define DEBUG_TYPE "arm-pseudo"

static cl::opt<bool>		static cl::opt<bool>
VerifyARMPseudo("verify-arm-pseudo-expand", cl::Hidden,		VerifyARMPseudo("verify-arm-pseudo-expand", cl::Hidden,
▲ Show 20 Lines • Show All 2,068 Lines • ▼ Show 20 Lines	case ARM::VBSPq: {
}		}
MI.eraseFromParent();		MI.eraseFromParent();
return true;		return true;
}		}

case ARM::TCRETURNdi:		case ARM::TCRETURNdi:
case ARM::TCRETURNri: {		case ARM::TCRETURNri: {
MachineBasicBlock::iterator MBBI = MBB.getLastNonDebugInstr();		MachineBasicBlock::iterator MBBI = MBB.getLastNonDebugInstr();
		if (MBBI->getOpcode() == ARM::SEH_EpilogEnd)
		MBBI--;
		if (MBBI->getOpcode() == ARM::SEH_Nop_Ret)
		MBBI--;
assert(MBBI->isReturn() &&		assert(MBBI->isReturn() &&
"Can only insert epilog into returning blocks");		"Can only insert epilog into returning blocks");
unsigned RetOpcode = MBBI->getOpcode();		unsigned RetOpcode = MBBI->getOpcode();
DebugLoc dl = MBBI->getDebugLoc();		DebugLoc dl = MBBI->getDebugLoc();
const ARMBaseInstrInfo &TII = static_cast<const ARMBaseInstrInfo >(		const ARMBaseInstrInfo &TII = static_cast<const ARMBaseInstrInfo >(
MBB.getParent()->getSubtarget().getInstrInfo());		MBB.getParent()->getSubtarget().getInstrInfo());

// Tail call return: adjust the stack pointer and jump to callee.		// Tail call return: adjust the stack pointer and jump to callee.
MBBI = MBB.getLastNonDebugInstr();		MBBI = MBB.getLastNonDebugInstr();
		if (MBBI->getOpcode() == ARM::SEH_EpilogEnd)
		MBBI--;
		if (MBBI->getOpcode() == ARM::SEH_Nop_Ret)
		MBBI--;
MachineOperand &JumpTarget = MBBI->getOperand(0);		MachineOperand &JumpTarget = MBBI->getOperand(0);

// Jump to label or value in register.		// Jump to label or value in register.
if (RetOpcode == ARM::TCRETURNdi) {		if (RetOpcode == ARM::TCRETURNdi) {
		MachineFunction *MF = MBB.getParent();
		bool NeedsWinCFI = MF->getTarget().getMCAsmInfo()->usesWindowsCFI() &&
		MF->getFunction().needsUnwindTableEntry();
unsigned TCOpcode =		unsigned TCOpcode =
STI->isThumb()		STI->isThumb()
? (STI->isTargetMachO() ? ARM::tTAILJMPd : ARM::tTAILJMPdND)		? ((STI->isTargetMachO() \|\| NeedsWinCFI) ? ARM::tTAILJMPd
		: ARM::tTAILJMPdND)
: ARM::TAILJMPd;		: ARM::TAILJMPd;
MachineInstrBuilder MIB = BuildMI(MBB, MBBI, dl, TII.get(TCOpcode));		MachineInstrBuilder MIB = BuildMI(MBB, MBBI, dl, TII.get(TCOpcode));
if (JumpTarget.isGlobal())		if (JumpTarget.isGlobal())
MIB.addGlobalAddress(JumpTarget.getGlobal(), JumpTarget.getOffset(),		MIB.addGlobalAddress(JumpTarget.getGlobal(), JumpTarget.getOffset(),
JumpTarget.getTargetFlags());		JumpTarget.getTargetFlags());
else {		else {
assert(JumpTarget.isSymbol());		assert(JumpTarget.isSymbol());
MIB.addExternalSymbol(JumpTarget.getSymbolName(),		MIB.addExternalSymbol(JumpTarget.getSymbolName(),
▲ Show 20 Lines • Show All 1,024 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMFrameLowering.cpp

Show First 20 Lines • Show All 132 Lines • ▼ Show 20 Lines
#include "llvm/CodeGen/TargetInstrInfo.h"		#include "llvm/CodeGen/TargetInstrInfo.h"
#include "llvm/CodeGen/TargetOpcodes.h"		#include "llvm/CodeGen/TargetOpcodes.h"
#include "llvm/CodeGen/TargetRegisterInfo.h"		#include "llvm/CodeGen/TargetRegisterInfo.h"
#include "llvm/CodeGen/TargetSubtargetInfo.h"		#include "llvm/CodeGen/TargetSubtargetInfo.h"
#include "llvm/IR/Attributes.h"		#include "llvm/IR/Attributes.h"
#include "llvm/IR/CallingConv.h"		#include "llvm/IR/CallingConv.h"
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCContext.h"		#include "llvm/MC/MCContext.h"
#include "llvm/MC/MCDwarf.h"		#include "llvm/MC/MCDwarf.h"
#include "llvm/MC/MCInstrDesc.h"		#include "llvm/MC/MCInstrDesc.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/Support/CodeGen.h"		#include "llvm/Support/CodeGen.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
▲ Show 20 Lines • Show All 118 Lines • ▼ Show 20 Lines	if (IsTailCallReturn) {
// LowerFormalArguments. This will, of course, be zero for the C calling		// LowerFormalArguments. This will, of course, be zero for the C calling
// convention.		// convention.
ArgumentPopSize = AFI->getArgumentStackToRestore();		ArgumentPopSize = AFI->getArgumentStackToRestore();
}		}

return ArgumentPopSize;		return ArgumentPopSize;
}		}

		static bool needsWinCFI(const MachineFunction &MF) {
		const Function &F = MF.getFunction();
		return MF.getTarget().getMCAsmInfo()->usesWindowsCFI() &&
		F.needsUnwindTableEntry();
		}

		// Given a load or a store instruction, generate an appropriate unwinding SEH
		// code on Windows.
		static MachineBasicBlock::iterator insertSEH(MachineBasicBlock::iterator MBBI,
		const TargetInstrInfo &TII,
		unsigned Flags) {
		unsigned Opc = MBBI->getOpcode();
		MachineBasicBlock *MBB = MBBI->getParent();
		MachineFunction &MF = *MBB->getParent();
		DebugLoc DL = MBBI->getDebugLoc();
		MachineInstrBuilder MIB;
		const ARMSubtarget &Subtarget = MF.getSubtarget<ARMSubtarget>();
		const ARMBaseRegisterInfo *RegInfo = Subtarget.getRegisterInfo();

		Flags \|= MachineInstr::NoMerge;

		switch (Opc) {
		default:
		report_fatal_error("No SEH Opcode for instruction " + TII.getName(Opc));
		break;
		efriedmaUnsubmitted Not Done Reply Inline Actions report_fatal_error (here and other places you abort()). efriedma: report_fatal_error (here and other places you abort()).
		case ARM::t2ADDri: // add.w r11, sp, #xx
		case ARM::t2ADDri12: // add.w r11, sp, #xx
		case ARM::t2SUBri: // sub.w r4, r11, #xx
		case ARM::t2MOVTi16: // movt r4, #xx
		efriedmaUnsubmitted Not Done Reply Inline Actions Maybe add t2MOVi16/t2MOVTi16 here? efriedma: Maybe add t2MOVi16/t2MOVTi16 here?
		case ARM::t2MOVi16: // movw r4, #xx
		case ARM::tBL: // bl __chkstk
		// These are harmless if used for just setting up a frame pointer,
		// but that frame pointer can't be relied upon for unwinding, unless
		// set up with SEH_SaveSP.
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_Nop))
		.addImm(/Wide=/1)
		.setMIFlags(Flags);
		break;

		case ARM::tBLXr: // blx r12 (__chkstk)
		efriedmaUnsubmitted Not Done Reply Inline Actions Looking at this again, this is actually sort of scary. In particular, this is dependent on looking into the future: trying to predict what Thumb2SizeReduction will do with a given instruction. Which is at best fragile, at worst broken if Thumb2SizeReduction doesn't run, or decides to do something different. I guess you can sort of predict what will happen for t2MOVi16 and t2LDMIA_RET/t2LDMIA_UPD/t2STMDB_UPD. But it's less clear in other cases; we currently don't optimize t2SUBspImm, but we could. Or for TCRETURNdi, we don't actually decide the size until we hit the assembler. I'm thinking we might want to disable Thumb2SizeReduction on instructions with SEH opcodes. (Or equivalently, on FrameSetup instructions if SEH unwind is enabled.) efriedma: Looking at this again, this is actually sort of scary. In particular, this is dependent on…
		mstorsjoAuthorUnsubmitted Done Reply Inline Actions Thanks - this was indeed one of my fears initially. In practice, these guesses for what it will end up like have worked for all the code I've tested this on so far. But it's indeed brittle. Skipping Thumb2SizeReduction for FrameSetup/FrameDestroy when SEH unwind is enabled seems to work fine though, so that alleviates most of the issue. (As a future TODO, one could maybe consider rewriting the MI to a narrow form already at this point, for the few opcodes where it matters?) For TCRETURNdi, I also feared that it would be an issue, but it hasn't cropped up. (Or maybe the nondeterminate length of the instruction makes it unable to calculate the length of the epilogue at that point? And thus just skips the check...) But it seems like the pseudo expansion of TCRETURNdi already has got such a case; MachO also requires strictly Thumb2 wide branches for tail calls, so we can opt in to that logic for SEH too. mstorsjo: Thanks - this was indeed one of my fears initially. In practice, these guesses for what it will…
		efriedmaUnsubmitted Not Done Reply Inline Actions I think you might need to implement narrowing for "push" and "pop", but probably not anything else. But in any case, it doesn't need to be in this patch. efriedma: I think you might need to implement narrowing for "push" and "pop", but probably not anything…
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_Nop))
		.addImm(/Wide=/0)
		.setMIFlags(Flags);
		break;

		case ARM::t2MOVi32imm: // movw+movt
		// This pseudo instruction expands into two mov instructions. If the
		// second operand is a symbol reference, this will stay as two wide
		// instructions, movw+movt. If they're immediates, the first one can
		// end up as a narrow mov though.
		// As two SEH instructions are appended here, they won't get interleaved
		// between the two final movw/movt instructions, but it doesn't make any
		// practical difference.
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_Nop))
		.addImm(/Wide=/1)
		.setMIFlags(Flags);
		MBB->insertAfter(MBBI, MIB);
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_Nop))
		.addImm(/Wide=/1)
		.setMIFlags(Flags);
		break;

		case ARM::t2LDMIA_RET:
		case ARM::t2LDMIA_UPD:
		case ARM::t2STMDB_UPD: {
		unsigned Mask = 0;
		for (unsigned i = 4, NumOps = MBBI->getNumOperands(); i != NumOps; ++i) {
		const MachineOperand &MO = MBBI->getOperand(i);
		if (!MO.isReg() \|\| MO.isImplicit())
		continue;
		unsigned Reg = RegInfo->getSEHRegNum(MO.getReg());
		if (Reg == 15)
		Reg = 14;
		Mask \|= 1 << Reg;
		}
		unsigned SEHOpc =
		(Opc == ARM::t2LDMIA_RET) ? ARM::SEH_SaveRegs_Ret : ARM::SEH_SaveRegs;
		MIB = BuildMI(MF, DL, TII.get(SEHOpc))
		.addImm(Mask)
		.addImm(/Wide=/1)
		.setMIFlags(Flags);
		break;
		}
		case ARM::VSTMDDB_UPD:
		case ARM::VLDMDIA_UPD: {
		int First = -1, Last = 0;
		for (unsigned i = 4, NumOps = MBBI->getNumOperands(); i != NumOps; ++i) {
		const MachineOperand &MO = MBBI->getOperand(i);
		unsigned Reg = RegInfo->getSEHRegNum(MO.getReg());
		if (First == -1)
		First = Reg;
		Last = Reg;
		}
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_SaveFRegs))
		.addImm(First)
		.addImm(Last)
		.setMIFlags(Flags);
		break;
		}
		case ARM::tSUBspi:
		case ARM::tADDspi:
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_StackAlloc))
		.addImm(MBBI->getOperand(2).getImm() * 4)
		.addImm(/Wide=/0)
		.setMIFlags(Flags);
		break;
		case ARM::t2SUBspImm:
		case ARM::t2SUBspImm12:
		case ARM::t2ADDspImm:
		case ARM::t2ADDspImm12:
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_StackAlloc))
		.addImm(MBBI->getOperand(2).getImm())
		.addImm(/Wide=/1)
		.setMIFlags(Flags);
		break;

		case ARM::tMOVr:
		if (MBBI->getOperand(1).getReg() == ARM::SP &&
		(Flags & MachineInstr::FrameSetup)) {
		unsigned Reg = RegInfo->getSEHRegNum(MBBI->getOperand(0).getReg());
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_SaveSP))
		.addImm(Reg)
		.setMIFlags(Flags);
		} else if (MBBI->getOperand(0).getReg() == ARM::SP &&
		(Flags & MachineInstr::FrameDestroy)) {
		unsigned Reg = RegInfo->getSEHRegNum(MBBI->getOperand(1).getReg());
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_SaveSP))
		.addImm(Reg)
		.setMIFlags(Flags);
		} else {
		report_fatal_error("No SEH Opcode for MOV");
		}
		break;

		case ARM::tBX_RET:
		case ARM::TCRETURNri:
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_Nop_Ret))
		.addImm(/Wide=/0)
		.setMIFlags(Flags);
		break;

		case ARM::TCRETURNdi:
		MIB = BuildMI(MF, DL, TII.get(ARM::SEH_Nop_Ret))
		.addImm(/Wide=/1)
		.setMIFlags(Flags);
		break;
		}
		return MBB->insertAfter(MBBI, MIB);
		}

		static MachineBasicBlock::iterator
		initMBBRange(MachineBasicBlock &MBB, const MachineBasicBlock::iterator &MBBI) {
		if (MBBI == MBB.begin())
		return MachineBasicBlock::iterator();
		return std::prev(MBBI);
		}

		static void insertSEHRange(MachineBasicBlock &MBB,
		MachineBasicBlock::iterator Start,
		const MachineBasicBlock::iterator &End,
		const ARMBaseInstrInfo &TII, unsigned MIFlags) {
		if (Start.isValid())
		Start = std::next(Start);
		else
		Start = MBB.begin();

		for (auto MI = Start; MI != End;) {
		auto Next = std::next(MI);
		// Check if this instruction already has got a SEH opcode added. In that
		// case, don't do this generic mapping.
		if (Next != End && isSEHInstruction(*Next)) {
		MI = std::next(Next);
		while (MI != End && isSEHInstruction(*MI))
		++MI;
		continue;
		}
		insertSEH(MI, TII, MIFlags);
		MI = Next;
		}
		}

static void emitRegPlusImmediate(		static void emitRegPlusImmediate(
bool isARM, MachineBasicBlock &MBB, MachineBasicBlock::iterator &MBBI,		bool isARM, MachineBasicBlock &MBB, MachineBasicBlock::iterator &MBBI,
const DebugLoc &dl, const ARMBaseInstrInfo &TII, unsigned DestReg,		const DebugLoc &dl, const ARMBaseInstrInfo &TII, unsigned DestReg,
unsigned SrcReg, int NumBytes, unsigned MIFlags = MachineInstr::NoFlags,		unsigned SrcReg, int NumBytes, unsigned MIFlags = MachineInstr::NoFlags,
ARMCC::CondCodes Pred = ARMCC::AL, unsigned PredReg = 0) {		ARMCC::CondCodes Pred = ARMCC::AL, unsigned PredReg = 0) {
if (isARM)		if (isARM)
emitARMRegPlusImmediate(MBB, MBBI, dl, DestReg, SrcReg, NumBytes,		emitARMRegPlusImmediate(MBB, MBBI, dl, DestReg, SrcReg, NumBytes,
Pred, PredReg, TII, MIFlags);		Pred, PredReg, TII, MIFlags);
▲ Show 20 Lines • Show All 193 Lines • ▼ Show 20 Lines	void ARMFrameLowering::emitPrologue(MachineFunction &MF,
assert(!AFI->isThumb1OnlyFunction() &&		assert(!AFI->isThumb1OnlyFunction() &&
"This emitPrologue does not support Thumb1!");		"This emitPrologue does not support Thumb1!");
bool isARM = !AFI->isThumbFunction();		bool isARM = !AFI->isThumbFunction();
Align Alignment = STI.getFrameLowering()->getStackAlign();		Align Alignment = STI.getFrameLowering()->getStackAlign();
unsigned ArgRegsSaveSize = AFI->getArgRegsSaveSize();		unsigned ArgRegsSaveSize = AFI->getArgRegsSaveSize();
unsigned NumBytes = MFI.getStackSize();		unsigned NumBytes = MFI.getStackSize();
const std::vector<CalleeSavedInfo> &CSI = MFI.getCalleeSavedInfo();		const std::vector<CalleeSavedInfo> &CSI = MFI.getCalleeSavedInfo();
int FPCXTSaveSize = 0;		int FPCXTSaveSize = 0;
		bool NeedsWinCFI = needsWinCFI(MF);

// Debug location must be unknown since the first debug location is used		// Debug location must be unknown since the first debug location is used
// to determine the end of the prologue.		// to determine the end of the prologue.
DebugLoc dl;		DebugLoc dl;

Register FramePtr = RegInfo->getFrameRegister(MF);		Register FramePtr = RegInfo->getFrameRegister(MF);

// Determine the sizes of each callee-save spill areas and record which frame		// Determine the sizes of each callee-save spill areas and record which frame
Show All 12 Lines	void ARMFrameLowering::emitPrologue(MachineFunction &MF,

if (!AFI->hasStackFrame() &&		if (!AFI->hasStackFrame() &&
(!STI.isTargetWindows() \|\| !WindowsRequiresStackProbe(MF, NumBytes))) {		(!STI.isTargetWindows() \|\| !WindowsRequiresStackProbe(MF, NumBytes))) {
if (NumBytes != 0) {		if (NumBytes != 0) {
emitSPUpdate(isARM, MBB, MBBI, dl, TII, -NumBytes,		emitSPUpdate(isARM, MBB, MBBI, dl, TII, -NumBytes,
MachineInstr::FrameSetup);		MachineInstr::FrameSetup);
DefCFAOffsetCandidates.addInst(std::prev(MBBI), NumBytes, true);		DefCFAOffsetCandidates.addInst(std::prev(MBBI), NumBytes, true);
}		}
		if (!NeedsWinCFI)
DefCFAOffsetCandidates.emitDefCFAOffsets(MBB, dl, TII, HasFP);		DefCFAOffsetCandidates.emitDefCFAOffsets(MBB, dl, TII, HasFP);
		if (NeedsWinCFI && MBBI != MBB.begin()) {
		insertSEHRange(MBB, {}, MBBI, TII, MachineInstr::FrameSetup);
		BuildMI(MBB, MBBI, dl, TII.get(ARM::SEH_PrologEnd))
		.setMIFlag(MachineInstr::FrameSetup);
		MF.setHasWinCFI(true);
		}
return;		return;
}		}

// Determine spill area sizes.		// Determine spill area sizes.
for (const CalleeSavedInfo &I : CSI) {		for (const CalleeSavedInfo &I : CSI) {
Register Reg = I.getReg();		Register Reg = I.getReg();
int FI = I.getFrameIdx();		int FI = I.getFrameIdx();
switch (Reg) {		switch (Reg) {
▲ Show 20 Lines • Show All 120 Lines • ▼ Show 20 Lines	if (AFI->getNumAlignedDPRCS2Regs() > 0) {
// Adjust NumBytes to represent the stack slots below the DPRCS2 area.		// Adjust NumBytes to represent the stack slots below the DPRCS2 area.
NumBytes += MFI.getObjectOffset(D8SpillFI);		NumBytes += MFI.getObjectOffset(D8SpillFI);
} else		} else
NumBytes = DPRCSOffset;		NumBytes = DPRCSOffset;

if (STI.isTargetWindows() && WindowsRequiresStackProbe(MF, NumBytes)) {		if (STI.isTargetWindows() && WindowsRequiresStackProbe(MF, NumBytes)) {
uint32_t NumWords = NumBytes >> 2;		uint32_t NumWords = NumBytes >> 2;

if (NumWords < 65536)		if (NumWords < 65536) {
BuildMI(MBB, MBBI, dl, TII.get(ARM::t2MOVi16), ARM::R4)		BuildMI(MBB, MBBI, dl, TII.get(ARM::t2MOVi16), ARM::R4)
.addImm(NumWords)		.addImm(NumWords)
.setMIFlags(MachineInstr::FrameSetup)		.setMIFlags(MachineInstr::FrameSetup)
.add(predOps(ARMCC::AL));		.add(predOps(ARMCC::AL));
else		} else {
BuildMI(MBB, MBBI, dl, TII.get(ARM::t2MOVi32imm), ARM::R4)		// Split into two instructions here, instead of using t2MOVi32imm,
.addImm(NumWords)		// to allow inserting accurate SEH instructions (including accurate
.setMIFlags(MachineInstr::FrameSetup);		// instruction size for each of them).
		BuildMI(MBB, MBBI, dl, TII.get(ARM::t2MOVi16), ARM::R4)
		.addImm(NumWords & 0xffff)
		.setMIFlags(MachineInstr::FrameSetup)
		.add(predOps(ARMCC::AL));
		BuildMI(MBB, MBBI, dl, TII.get(ARM::t2MOVTi16), ARM::R4)
		.addReg(ARM::R4)
		.addImm(NumWords >> 16)
		.setMIFlags(MachineInstr::FrameSetup)
		.add(predOps(ARMCC::AL));
		}

switch (TM.getCodeModel()) {		switch (TM.getCodeModel()) {
case CodeModel::Tiny:		case CodeModel::Tiny:
llvm_unreachable("Tiny code model not available on ARM.");		llvm_unreachable("Tiny code model not available on ARM.");
case CodeModel::Small:		case CodeModel::Small:
case CodeModel::Medium:		case CodeModel::Medium:
case CodeModel::Kernel:		case CodeModel::Kernel:
BuildMI(MBB, MBBI, dl, TII.get(ARM::tBL))		BuildMI(MBB, MBBI, dl, TII.get(ARM::tBL))
Show All 10 Lines	case CodeModel::Large:
BuildMI(MBB, MBBI, dl, TII.get(ARM::tBLXr))		BuildMI(MBB, MBBI, dl, TII.get(ARM::tBLXr))
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addReg(ARM::R12, RegState::Kill)		.addReg(ARM::R12, RegState::Kill)
.addReg(ARM::R4, RegState::Implicit)		.addReg(ARM::R4, RegState::Implicit)
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
break;		break;
}		}

BuildMI(MBB, MBBI, dl, TII.get(ARM::t2SUBrr), ARM::SP)		MachineInstrBuilder Instr, SEH;
		Instr = BuildMI(MBB, MBBI, dl, TII.get(ARM::t2SUBrr), ARM::SP)
.addReg(ARM::SP, RegState::Kill)		.addReg(ARM::SP, RegState::Kill)
.addReg(ARM::R4, RegState::Kill)		.addReg(ARM::R4, RegState::Kill)
.setMIFlags(MachineInstr::FrameSetup)		.setMIFlags(MachineInstr::FrameSetup)
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.add(condCodeOp());		.add(condCodeOp());
		if (NeedsWinCFI) {
		SEH = BuildMI(MF, dl, TII.get(ARM::SEH_StackAlloc))
		.addImm(NumBytes)
		.addImm(/Wide=/1)
		.setMIFlags(MachineInstr::FrameSetup);
		MBB.insertAfter(Instr, SEH);
		}
NumBytes = 0;		NumBytes = 0;
}		}

if (NumBytes) {		if (NumBytes) {
// Adjust SP after all the callee-save spills.		// Adjust SP after all the callee-save spills.
if (AFI->getNumAlignedDPRCS2Regs() == 0 &&		if (AFI->getNumAlignedDPRCS2Regs() == 0 &&
tryFoldSPUpdateIntoPushPop(STI, MF, &*LastPush, NumBytes))		tryFoldSPUpdateIntoPushPop(STI, MF, &*LastPush, NumBytes))
DefCFAOffsetCandidates.addExtraBytes(LastPush, NumBytes);		DefCFAOffsetCandidates.addExtraBytes(LastPush, NumBytes);
Show All 23 Lines	void ARMFrameLowering::emitPrologue(MachineFunction &MF,
// that push.		// that push.
if (HasFP) {		if (HasFP) {
MachineBasicBlock::iterator AfterPush = std::next(GPRCS1Push);		MachineBasicBlock::iterator AfterPush = std::next(GPRCS1Push);
unsigned PushSize = sizeOfSPAdjustment(*GPRCS1Push);		unsigned PushSize = sizeOfSPAdjustment(*GPRCS1Push);
emitRegPlusImmediate(!AFI->isThumbFunction(), MBB, AfterPush,		emitRegPlusImmediate(!AFI->isThumbFunction(), MBB, AfterPush,
dl, TII, FramePtr, ARM::SP,		dl, TII, FramePtr, ARM::SP,
PushSize + FramePtrOffsetInPush,		PushSize + FramePtrOffsetInPush,
MachineInstr::FrameSetup);		MachineInstr::FrameSetup);
		if (!NeedsWinCFI) {
if (FramePtrOffsetInPush + PushSize != 0) {		if (FramePtrOffsetInPush + PushSize != 0) {
unsigned CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfa(		unsigned CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfa(
nullptr, MRI->getDwarfRegNum(FramePtr, true),		nullptr, MRI->getDwarfRegNum(FramePtr, true),
FPCXTSaveSize + ArgRegsSaveSize - FramePtrOffsetInPush));		FPCXTSaveSize + ArgRegsSaveSize - FramePtrOffsetInPush));
BuildMI(MBB, AfterPush, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, AfterPush, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex)		.addCFIIndex(CFIIndex)
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
} else {		} else {
unsigned CFIIndex =		unsigned CFIIndex =
MF.addFrameInst(MCCFIInstruction::createDefCfaRegister(		MF.addFrameInst(MCCFIInstruction::createDefCfaRegister(
nullptr, MRI->getDwarfRegNum(FramePtr, true)));		nullptr, MRI->getDwarfRegNum(FramePtr, true)));
BuildMI(MBB, AfterPush, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, AfterPush, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex)		.addCFIIndex(CFIIndex)
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
}		}
}		}
		}

		// Emit a SEH opcode indicating the prologue end. The rest of the prologue
		// instructions below don't need to be replayed to unwind the stack.
		if (NeedsWinCFI && MBBI != MBB.begin()) {
		insertSEHRange(MBB, {}, MBBI, TII, MachineInstr::FrameSetup);
		BuildMI(MBB, MBBI, dl, TII.get(ARM::SEH_PrologEnd))
		.setMIFlag(MachineInstr::FrameSetup);
		MF.setHasWinCFI(true);
		}

// Now that the prologue's actual instructions are finalised, we can insert		// Now that the prologue's actual instructions are finalised, we can insert
// the necessary DWARF cf instructions to describe the situation. Start by		// the necessary DWARF cf instructions to describe the situation. Start by
// recording where each register ended up:		// recording where each register ended up:
if (GPRCS1Size > 0) {		if (GPRCS1Size > 0 && !NeedsWinCFI) {
MachineBasicBlock::iterator Pos = std::next(GPRCS1Push);		MachineBasicBlock::iterator Pos = std::next(GPRCS1Push);
int CFIIndex;		int CFIIndex;
for (const auto &Entry : CSI) {		for (const auto &Entry : CSI) {
Register Reg = Entry.getReg();		Register Reg = Entry.getReg();
int FI = Entry.getFrameIdx();		int FI = Entry.getFrameIdx();
switch (Reg) {		switch (Reg) {
case ARM::R8:		case ARM::R8:
case ARM::R9:		case ARM::R9:
Show All 17 Lines	for (const auto &Entry : CSI) {
BuildMI(MBB, Pos, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, Pos, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex)		.addCFIIndex(CFIIndex)
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
break;		break;
}		}
}		}
}		}

if (GPRCS2Size > 0) {		if (GPRCS2Size > 0 && !NeedsWinCFI) {
MachineBasicBlock::iterator Pos = std::next(GPRCS2Push);		MachineBasicBlock::iterator Pos = std::next(GPRCS2Push);
for (const auto &Entry : CSI) {		for (const auto &Entry : CSI) {
Register Reg = Entry.getReg();		Register Reg = Entry.getReg();
int FI = Entry.getFrameIdx();		int FI = Entry.getFrameIdx();
switch (Reg) {		switch (Reg) {
case ARM::R8:		case ARM::R8:
case ARM::R9:		case ARM::R9:
case ARM::R10:		case ARM::R10:
Show All 9 Lines	for (const auto &Entry : CSI) {
.addCFIIndex(CFIIndex)		.addCFIIndex(CFIIndex)
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
}		}
break;		break;
}		}
}		}
}		}

if (DPRCSSize > 0) {		if (DPRCSSize > 0 && !NeedsWinCFI) {
// Since vpush register list cannot have gaps, there may be multiple vpush		// Since vpush register list cannot have gaps, there may be multiple vpush
// instructions in the prologue.		// instructions in the prologue.
MachineBasicBlock::iterator Pos = std::next(LastPush);		MachineBasicBlock::iterator Pos = std::next(LastPush);
for (const auto &Entry : CSI) {		for (const auto &Entry : CSI) {
Register Reg = Entry.getReg();		Register Reg = Entry.getReg();
int FI = Entry.getFrameIdx();		int FI = Entry.getFrameIdx();
if ((Reg >= ARM::D0 && Reg <= ARM::D31) &&		if ((Reg >= ARM::D0 && Reg <= ARM::D31) &&
(Reg < ARM::D8 \|\| Reg >= ARM::D8 + AFI->getNumAlignedDPRCS2Regs())) {		(Reg < ARM::D8 \|\| Reg >= ARM::D8 + AFI->getNumAlignedDPRCS2Regs())) {
unsigned DwarfReg = MRI->getDwarfRegNum(Reg, true);		unsigned DwarfReg = MRI->getDwarfRegNum(Reg, true);
unsigned Offset = MFI.getObjectOffset(FI);		unsigned Offset = MFI.getObjectOffset(FI);
unsigned CFIIndex = MF.addFrameInst(		unsigned CFIIndex = MF.addFrameInst(
MCCFIInstruction::createOffset(nullptr, DwarfReg, Offset));		MCCFIInstruction::createOffset(nullptr, DwarfReg, Offset));
BuildMI(MBB, Pos, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(MBB, Pos, dl, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex)		.addCFIIndex(CFIIndex)
.setMIFlags(MachineInstr::FrameSetup);		.setMIFlags(MachineInstr::FrameSetup);
}		}
}		}
}		}

// Now we can emit descriptions of where the canonical frame address was		// Now we can emit descriptions of where the canonical frame address was
// throughout the process. If we have a frame pointer, it takes over the job		// throughout the process. If we have a frame pointer, it takes over the job
// half-way through, so only the first few .cfi_def_cfa_offset instructions		// half-way through, so only the first few .cfi_def_cfa_offset instructions
// actually get emitted.		// actually get emitted.
		if (!NeedsWinCFI)
DefCFAOffsetCandidates.emitDefCFAOffsets(MBB, dl, TII, HasFP);		DefCFAOffsetCandidates.emitDefCFAOffsets(MBB, dl, TII, HasFP);

if (STI.isTargetELF() && hasFP(MF))		if (STI.isTargetELF() && hasFP(MF))
MFI.setOffsetAdjustment(MFI.getOffsetAdjustment() -		MFI.setOffsetAdjustment(MFI.getOffsetAdjustment() -
AFI->getFramePtrSpillOffset());		AFI->getFramePtrSpillOffset());

AFI->setFPCXTSaveAreaSize(FPCXTSaveSize);		AFI->setFPCXTSaveAreaSize(FPCXTSaveSize);
AFI->setGPRCalleeSavedArea1Size(GPRCS1Size);		AFI->setGPRCalleeSavedArea1Size(GPRCS1Size);
AFI->setGPRCalleeSavedArea2Size(GPRCS2Size);		AFI->setGPRCalleeSavedArea2Size(GPRCS2Size);
Show All 21 Lines	if (!AFI->isThumbFunction()) {
BuildMI(MBB, MBBI, dl, TII.get(ARM::tMOVr), ARM::R4)		BuildMI(MBB, MBBI, dl, TII.get(ARM::tMOVr), ARM::R4)
.addReg(ARM::SP, RegState::Kill)		.addReg(ARM::SP, RegState::Kill)
.add(predOps(ARMCC::AL));		.add(predOps(ARMCC::AL));
emitAligningInstructions(MF, AFI, TII, MBB, MBBI, dl, ARM::R4, MaxAlign,		emitAligningInstructions(MF, AFI, TII, MBB, MBBI, dl, ARM::R4, MaxAlign,
false);		false);
BuildMI(MBB, MBBI, dl, TII.get(ARM::tMOVr), ARM::SP)		BuildMI(MBB, MBBI, dl, TII.get(ARM::tMOVr), ARM::SP)
.addReg(ARM::R4, RegState::Kill)		.addReg(ARM::R4, RegState::Kill)
.add(predOps(ARMCC::AL));		.add(predOps(ARMCC::AL));
}		}
		efriedmaUnsubmitted Not Done Reply Inline Actions We shouldn't be encoding stack realignment into the unwind data. It's basically a dynamic allocation: we have to emit a frame pointer before we realign the stack, and we should cut off the unwind prologue immediately after the frame pointer is set up. efriedma: We shouldn't be encoding stack realignment into the unwind data. It's basically a dynamic…
		mstorsjoAuthorUnsubmitted Done Reply Inline Actions Right, that'd be even cleaner. (This part gets tested only in the next patch which tweaks the frame pointers, and all the later opcodes are nops in that case. So cutting off the prologue at that point sounds like a good strategy.) mstorsjo: Right, that'd be even cleaner. (This part gets tested only in the next patch which tweaks the…

AFI->setShouldRestoreSPFromFP(true);		AFI->setShouldRestoreSPFromFP(true);
}		}

// If we need a base pointer, set it up here. It's whatever the value		// If we need a base pointer, set it up here. It's whatever the value
// of the stack pointer is at this point. Any variable size objects		// of the stack pointer is at this point. Any variable size objects
// will be allocated after this, so we can still use the base pointer		// will be allocated after this, so we can still use the base pointer
// to reference locals.		// to reference locals.
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	void ARMFrameLowering::emitEpilogue(MachineFunction &MF,
// prologue/epilogue.		// prologue/epilogue.
if (MF.getFunction().getCallingConv() == CallingConv::GHC)		if (MF.getFunction().getCallingConv() == CallingConv::GHC)
return;		return;

// First put ourselves on the first (from top) terminator instructions.		// First put ourselves on the first (from top) terminator instructions.
MachineBasicBlock::iterator MBBI = MBB.getFirstTerminator();		MachineBasicBlock::iterator MBBI = MBB.getFirstTerminator();
DebugLoc dl = MBBI != MBB.end() ? MBBI->getDebugLoc() : DebugLoc();		DebugLoc dl = MBBI != MBB.end() ? MBBI->getDebugLoc() : DebugLoc();

		MachineBasicBlock::iterator RangeStart;
if (!AFI->hasStackFrame()) {		if (!AFI->hasStackFrame()) {
		if (MF.hasWinCFI()) {
		BuildMI(MBB, MBBI, dl, TII.get(ARM::SEH_EpilogStart))
		.setMIFlag(MachineInstr::FrameDestroy);
		RangeStart = initMBBRange(MBB, MBBI);
		}

if (NumBytes + IncomingArgStackToRestore != 0)		if (NumBytes + IncomingArgStackToRestore != 0)
emitSPUpdate(isARM, MBB, MBBI, dl, TII,		emitSPUpdate(isARM, MBB, MBBI, dl, TII,
NumBytes + IncomingArgStackToRestore,		NumBytes + IncomingArgStackToRestore,
MachineInstr::FrameDestroy);		MachineInstr::FrameDestroy);
} else {		} else {
// Unwind MBBI to point to first LDR / VLDRD.		// Unwind MBBI to point to first LDR / VLDRD.
if (MBBI != MBB.begin()) {		if (MBBI != MBB.begin()) {
do {		do {
--MBBI;		--MBBI;
} while (MBBI != MBB.begin() &&		} while (MBBI != MBB.begin() &&
MBBI->getFlag(MachineInstr::FrameDestroy));		MBBI->getFlag(MachineInstr::FrameDestroy));
if (!MBBI->getFlag(MachineInstr::FrameDestroy))		if (!MBBI->getFlag(MachineInstr::FrameDestroy))
++MBBI;		++MBBI;
}		}

		if (MF.hasWinCFI()) {
		BuildMI(MBB, MBBI, dl, TII.get(ARM::SEH_EpilogStart))
		.setMIFlag(MachineInstr::FrameDestroy);
		RangeStart = initMBBRange(MBB, MBBI);
		}

// Move SP to start of FP callee save spill area.		// Move SP to start of FP callee save spill area.
NumBytes -= (ReservedArgStack +		NumBytes -= (ReservedArgStack +
AFI->getFPCXTSaveAreaSize() +		AFI->getFPCXTSaveAreaSize() +
AFI->getGPRCalleeSavedArea1Size() +		AFI->getGPRCalleeSavedArea1Size() +
AFI->getGPRCalleeSavedArea2Size() +		AFI->getGPRCalleeSavedArea2Size() +
AFI->getDPRCalleeSavedGapSize() +		AFI->getDPRCalleeSavedGapSize() +
AFI->getDPRCalleeSavedAreaSize());		AFI->getDPRCalleeSavedAreaSize());

▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	if (!AFI->hasStackFrame()) {

// Validate PAC, It should have been already popped into R12. For CMSE entry		// Validate PAC, It should have been already popped into R12. For CMSE entry
// function, the validation instruction is emitted during expansion of the		// function, the validation instruction is emitted during expansion of the
// tBXNS_RET, since the validation must use the value of SP at function		// tBXNS_RET, since the validation must use the value of SP at function
// entry, before saving, resp. after restoring, FPCXTNS.		// entry, before saving, resp. after restoring, FPCXTNS.
if (AFI->shouldSignReturnAddress() && !AFI->isCmseNSEntryFunction())		if (AFI->shouldSignReturnAddress() && !AFI->isCmseNSEntryFunction())
BuildMI(MBB, MBBI, DebugLoc(), STI.getInstrInfo()->get(ARM::t2AUT));		BuildMI(MBB, MBBI, DebugLoc(), STI.getInstrInfo()->get(ARM::t2AUT));
}		}

		if (MF.hasWinCFI()) {
		insertSEHRange(MBB, RangeStart, MBB.end(), TII, MachineInstr::FrameDestroy);
		BuildMI(MBB, MBB.end(), dl, TII.get(ARM::SEH_EpilogEnd))
		.setMIFlag(MachineInstr::FrameDestroy);
		}
}		}

/// getFrameIndexReference - Provide a base+offset reference to an FI slot for		/// getFrameIndexReference - Provide a base+offset reference to an FI slot for
/// debug info. It's the same as what we use for resolving the code-gen		/// debug info. It's the same as what we use for resolving the code-gen
/// references for now. FIXME: This can go wrong when references are		/// references for now. FIXME: This can go wrong when references are
/// SP-relative and simple call frames aren't used.		/// SP-relative and simple call frames aren't used.
StackOffset ARMFrameLowering::getFrameIndexReference(const MachineFunction &MF,		StackOffset ARMFrameLowering::getFrameIndexReference(const MachineFunction &MF,
int FI,		int FI,
▲ Show 20 Lines • Show All 1,551 Lines • ▼ Show 20 Lines	BuildMI(PrevStackMBB, DL, TII.get(ARM::STMDB_UPD))
.addReg(ARM::SP)		.addReg(ARM::SP)
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addReg(ScratchReg0)		.addReg(ScratchReg0)
.addReg(ScratchReg1);		.addReg(ScratchReg1);
}		}

// Emit the relevant DWARF information about the change in stack pointer as		// Emit the relevant DWARF information about the change in stack pointer as
// well as where to find both r4 and r5 (the callee-save registers)		// well as where to find both r4 and r5 (the callee-save registers)
		if (!MF.getTarget().getMCAsmInfo()->usesWindowsCFI()) {
CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 8));		CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 8));
BuildMI(PrevStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(PrevStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
CFIIndex = MF.addFrameInst(MCCFIInstruction::createOffset(		CFIIndex = MF.addFrameInst(MCCFIInstruction::createOffset(
nullptr, MRI->getDwarfRegNum(ScratchReg1, true), -4));		nullptr, MRI->getDwarfRegNum(ScratchReg1, true), -4));
BuildMI(PrevStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(PrevStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
CFIIndex = MF.addFrameInst(MCCFIInstruction::createOffset(		CFIIndex = MF.addFrameInst(MCCFIInstruction::createOffset(
nullptr, MRI->getDwarfRegNum(ScratchReg0, true), -8));		nullptr, MRI->getDwarfRegNum(ScratchReg0, true), -8));
BuildMI(PrevStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(PrevStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
		}

// mov SR1, sp		// mov SR1, sp
if (Thumb) {		if (Thumb) {
BuildMI(McrMBB, DL, TII.get(ARM::tMOVr), ScratchReg1)		BuildMI(McrMBB, DL, TII.get(ARM::tMOVr), ScratchReg1)
.addReg(ARM::SP)		.addReg(ARM::SP)
.add(predOps(ARMCC::AL));		.add(predOps(ARMCC::AL));
} else if (CompareStackPointer) {		} else if (CompareStackPointer) {
BuildMI(McrMBB, DL, TII.get(ARM::MOVr), ScratchReg1)		BuildMI(McrMBB, DL, TII.get(ARM::MOVr), ScratchReg1)
▲ Show 20 Lines • Show All 185 Lines • ▼ Show 20 Lines	BuildMI(AllocMBB, DL, TII.get(ARM::STMDB_UPD))
.addReg(ARM::SP, RegState::Define)		.addReg(ARM::SP, RegState::Define)
.addReg(ARM::SP)		.addReg(ARM::SP)
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addReg(ARM::LR);		.addReg(ARM::LR);
}		}

// Emit the DWARF info about the change in stack as well as where to find the		// Emit the DWARF info about the change in stack as well as where to find the
// previous link register		// previous link register
		if (!MF.getTarget().getMCAsmInfo()->usesWindowsCFI()) {
CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 12));		CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 12));
BuildMI(AllocMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(AllocMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
CFIIndex = MF.addFrameInst(MCCFIInstruction::createOffset(		CFIIndex = MF.addFrameInst(MCCFIInstruction::createOffset(
nullptr, MRI->getDwarfRegNum(ARM::LR, true), -12));		nullptr, MRI->getDwarfRegNum(ARM::LR, true), -12));
BuildMI(AllocMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(AllocMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
		}

// Call __morestack().		// Call __morestack().
if (Thumb) {		if (Thumb) {
BuildMI(AllocMBB, DL, TII.get(ARM::tBL))		BuildMI(AllocMBB, DL, TII.get(ARM::tBL))
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addExternalSymbol("__morestack");		.addExternalSymbol("__morestack");
} else {		} else {
BuildMI(AllocMBB, DL, TII.get(ARM::BL))		BuildMI(AllocMBB, DL, TII.get(ARM::BL))
Show All 39 Lines	BuildMI(AllocMBB, DL, TII.get(ARM::LDMIA_UPD))
.addReg(ARM::SP, RegState::Define)		.addReg(ARM::SP, RegState::Define)
.addReg(ARM::SP)		.addReg(ARM::SP)
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addReg(ScratchReg0)		.addReg(ScratchReg0)
.addReg(ScratchReg1);		.addReg(ScratchReg1);
}		}

// Update the CFA offset now that we've popped		// Update the CFA offset now that we've popped
		if (!MF.getTarget().getMCAsmInfo()->usesWindowsCFI()) {
CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 0));		CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 0));
BuildMI(AllocMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(AllocMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
		}

// Return from this function.		// Return from this function.
BuildMI(AllocMBB, DL, TII.get(ST->getReturnOpcode())).add(predOps(ARMCC::AL));		BuildMI(AllocMBB, DL, TII.get(ST->getReturnOpcode())).add(predOps(ARMCC::AL));

// Restore SR0 and SR1 in case of __morestack() was not called.		// Restore SR0 and SR1 in case of __morestack() was not called.
// pop {SR0, SR1}		// pop {SR0, SR1}
if (Thumb) {		if (Thumb) {
BuildMI(PostStackMBB, DL, TII.get(ARM::tPOP))		BuildMI(PostStackMBB, DL, TII.get(ARM::tPOP))
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addReg(ScratchReg0)		.addReg(ScratchReg0)
.addReg(ScratchReg1);		.addReg(ScratchReg1);
} else {		} else {
BuildMI(PostStackMBB, DL, TII.get(ARM::LDMIA_UPD))		BuildMI(PostStackMBB, DL, TII.get(ARM::LDMIA_UPD))
.addReg(ARM::SP, RegState::Define)		.addReg(ARM::SP, RegState::Define)
.addReg(ARM::SP)		.addReg(ARM::SP)
.add(predOps(ARMCC::AL))		.add(predOps(ARMCC::AL))
.addReg(ScratchReg0)		.addReg(ScratchReg0)
.addReg(ScratchReg1);		.addReg(ScratchReg1);
}		}

// Update the CFA offset now that we've popped		// Update the CFA offset now that we've popped
		if (!MF.getTarget().getMCAsmInfo()->usesWindowsCFI()) {
CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 0));		CFIIndex = MF.addFrameInst(MCCFIInstruction::cfiDefCfaOffset(nullptr, 0));
BuildMI(PostStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(PostStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);

// Tell debuggers that r4 and r5 are now the same as they were in the		// Tell debuggers that r4 and r5 are now the same as they were in the
// previous function, that they're the "Same Value".		// previous function, that they're the "Same Value".
CFIIndex = MF.addFrameInst(MCCFIInstruction::createSameValue(		CFIIndex = MF.addFrameInst(MCCFIInstruction::createSameValue(
nullptr, MRI->getDwarfRegNum(ScratchReg0, true)));		nullptr, MRI->getDwarfRegNum(ScratchReg0, true)));
BuildMI(PostStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(PostStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
CFIIndex = MF.addFrameInst(MCCFIInstruction::createSameValue(		CFIIndex = MF.addFrameInst(MCCFIInstruction::createSameValue(
nullptr, MRI->getDwarfRegNum(ScratchReg1, true)));		nullptr, MRI->getDwarfRegNum(ScratchReg1, true)));
BuildMI(PostStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))		BuildMI(PostStackMBB, DL, TII.get(TargetOpcode::CFI_INSTRUCTION))
.addCFIIndex(CFIIndex);		.addCFIIndex(CFIIndex);
		}

// Organizing MBB lists		// Organizing MBB lists
PostStackMBB->addSuccessor(&PrologueMBB);		PostStackMBB->addSuccessor(&PrologueMBB);

AllocMBB->addSuccessor(PostStackMBB);		AllocMBB->addSuccessor(PostStackMBB);

GetMBB->addSuccessor(PostStackMBB);		GetMBB->addSuccessor(PostStackMBB);
GetMBB->addSuccessor(AllocMBB);		GetMBB->addSuccessor(AllocMBB);
Show All 9 Lines

llvm/lib/Target/ARM/ARMInstrInfo.td

	Show First 20 Lines • Show All 6,470 Lines • ▼ Show 20 Lines

	def CompilerBarrier : PseudoInst<(outs), (ins i32imm:$ordering), NoItinerary,			def CompilerBarrier : PseudoInst<(outs), (ins i32imm:$ordering), NoItinerary,
	[(atomic_fence timm:$ordering, 0)]> {			[(atomic_fence timm:$ordering, 0)]> {
	let hasSideEffects = 1;			let hasSideEffects = 1;
	let Size = 0;			let Size = 0;
	let AsmString = "@ COMPILER BARRIER";			let AsmString = "@ COMPILER BARRIER";
	let hasNoSchedulingInfo = 1;			let hasNoSchedulingInfo = 1;
	}			}

				//===----------------------------------------------------------------------===//
				// Instructions used for emitting unwind opcodes on Windows.
				//===----------------------------------------------------------------------===//
				let isPseudo = 1 in {
				def SEH_StackAlloc : PseudoInst<(outs), (ins i32imm:$size, i32imm:$wide), NoItinerary, []>, Sched<[]>;
				def SEH_SaveRegs : PseudoInst<(outs), (ins i32imm:$mask, i32imm:$wide), NoItinerary, []>, Sched<[]>;
				let isTerminator = 1 in
				def SEH_SaveRegs_Ret : PseudoInst<(outs), (ins i32imm:$mask, i32imm:$wide), NoItinerary, []>, Sched<[]>;
				def SEH_SaveSP : PseudoInst<(outs), (ins i32imm:$reg), NoItinerary, []>, Sched<[]>;
				def SEH_SaveFRegs : PseudoInst<(outs), (ins i32imm:$first, i32imm:$last), NoItinerary, []>, Sched<[]>;
				let isTerminator = 1 in
				def SEH_SaveLR : PseudoInst<(outs), (ins i32imm:$offst), NoItinerary, []>, Sched<[]>;
				def SEH_Nop : PseudoInst<(outs), (ins i32imm:$wide), NoItinerary, []>, Sched<[]>;
				let isTerminator = 1 in
				def SEH_Nop_Ret : PseudoInst<(outs), (ins i32imm:$wide), NoItinerary, []>, Sched<[]>;
				def SEH_PrologEnd : PseudoInst<(outs), (ins), NoItinerary, []>, Sched<[]>;
				def SEH_EpilogStart : PseudoInst<(outs), (ins), NoItinerary, []>, Sched<[]>;
				let isTerminator = 1 in
				def SEH_EpilogEnd : PseudoInst<(outs), (ins), NoItinerary, []>, Sched<[]>;
				}

llvm/lib/Target/ARM/ARMInstrThumb2.td

Show First 20 Lines • Show All 3,945 Lines • ▼ Show 20 Lines	def t2Bcc : T2I<(outs), (ins brtarget:$target), IIC_Br,
let Inst{10-0} = target{11-1};		let Inst{10-0} = target{11-1};

let DecoderMethod = "DecodeThumb2BCCInstruction";		let DecoderMethod = "DecodeThumb2BCCInstruction";
let AsmMatchConverter = "cvtThumbBranches";		let AsmMatchConverter = "cvtThumbBranches";
}		}

// Tail calls. The MachO version of thumb tail calls uses a t2 branch, so		// Tail calls. The MachO version of thumb tail calls uses a t2 branch, so
// it goes here.		// it goes here.
		// Windows SEH unwinding also needs a strict t2 branch for tail calls.
let isCall = 1, isTerminator = 1, isReturn = 1, isBarrier = 1 in {		let isCall = 1, isTerminator = 1, isReturn = 1, isBarrier = 1 in {
// IOS version.		// IOS version.
let Uses = [SP] in		let Uses = [SP] in
def tTAILJMPd: tPseudoExpand<(outs),		def tTAILJMPd: tPseudoExpand<(outs),
(ins thumb_br_target:$dst, pred:$p),		(ins thumb_br_target:$dst, pred:$p),
4, IIC_Br, [],		4, IIC_Br, [],
(t2B thumb_br_target:$dst, pred:$p)>,		(t2B thumb_br_target:$dst, pred:$p)>,
Requires<[IsThumb2, IsMachO]>, Sched<[WriteBr]>;		Requires<[IsThumb2]>, Sched<[WriteBr]>;
}		}

// IT block		// IT block
let Defs = [ITSTATE] in		let Defs = [ITSTATE] in
def t2IT : Thumb2XI<(outs), (ins it_pred:$cc, it_mask:$mask),		def t2IT : Thumb2XI<(outs), (ins it_pred:$cc, it_mask:$mask),
AddrModeNone, 2, IIC_iALUx,		AddrModeNone, 2, IIC_iALUx,
"it$mask\t$cc", "", []> {		"it$mask\t$cc", "", []> {
// 16-bit instruction.		// 16-bit instruction.
▲ Show 20 Lines • Show All 1,786 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/Thumb2SizeReduction.cpp

Show All 21 Lines
#include "llvm/CodeGen/MachineFunction.h"		#include "llvm/CodeGen/MachineFunction.h"
#include "llvm/CodeGen/MachineFunctionPass.h"		#include "llvm/CodeGen/MachineFunctionPass.h"
#include "llvm/CodeGen/MachineInstr.h"		#include "llvm/CodeGen/MachineInstr.h"
#include "llvm/CodeGen/MachineInstrBuilder.h"		#include "llvm/CodeGen/MachineInstrBuilder.h"
#include "llvm/CodeGen/MachineOperand.h"		#include "llvm/CodeGen/MachineOperand.h"
#include "llvm/CodeGen/TargetInstrInfo.h"		#include "llvm/CodeGen/TargetInstrInfo.h"
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
		#include "llvm/MC/MCAsmInfo.h"
#include "llvm/MC/MCInstrDesc.h"		#include "llvm/MC/MCInstrDesc.h"
#include "llvm/MC/MCRegisterInfo.h"		#include "llvm/MC/MCRegisterInfo.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
▲ Show 20 Lines • Show All 162 Lines • ▼ Show 20 Lines	private:

/// ReduceToNarrow - Reduce a 32-bit instruction to a 16-bit		/// ReduceToNarrow - Reduce a 32-bit instruction to a 16-bit
/// non-two-address instruction.		/// non-two-address instruction.
bool ReduceToNarrow(MachineBasicBlock &MBB, MachineInstr *MI,		bool ReduceToNarrow(MachineBasicBlock &MBB, MachineInstr *MI,
const ReduceEntry &Entry, bool LiveCPSR,		const ReduceEntry &Entry, bool LiveCPSR,
bool IsSelfLoop);		bool IsSelfLoop);

/// ReduceMI - Attempt to reduce MI, return true on success.		/// ReduceMI - Attempt to reduce MI, return true on success.
bool ReduceMI(MachineBasicBlock &MBB, MachineInstr *MI,		bool ReduceMI(MachineBasicBlock &MBB, MachineInstr *MI, bool LiveCPSR,
bool LiveCPSR, bool IsSelfLoop);		bool IsSelfLoop, bool SkipPrologueEpilogue);

/// ReduceMBB - Reduce width of instructions in the specified basic block.		/// ReduceMBB - Reduce width of instructions in the specified basic block.
bool ReduceMBB(MachineBasicBlock &MBB);		bool ReduceMBB(MachineBasicBlock &MBB, bool SkipPrologueEpilogue);

bool OptimizeSize;		bool OptimizeSize;
bool MinimizeSize;		bool MinimizeSize;

// Last instruction to define CPSR in the current block.		// Last instruction to define CPSR in the current block.
MachineInstr *CPSRDef;		MachineInstr *CPSRDef;
// Was CPSR last defined by a high latency instruction?		// Was CPSR last defined by a high latency instruction?
// When CPSRDef is null, this refers to CPSR defs in predecessors.		// When CPSRDef is null, this refers to CPSR defs in predecessors.
▲ Show 20 Lines • Show All 786 Lines • ▼ Show 20 Lines	if (MO.isKill()) {
break;		break;
}		}
}		}

return LiveCPSR;		return LiveCPSR;
}		}

bool Thumb2SizeReduce::ReduceMI(MachineBasicBlock &MBB, MachineInstr *MI,		bool Thumb2SizeReduce::ReduceMI(MachineBasicBlock &MBB, MachineInstr *MI,
bool LiveCPSR, bool IsSelfLoop) {		bool LiveCPSR, bool IsSelfLoop,
		bool SkipPrologueEpilogue) {
unsigned Opcode = MI->getOpcode();		unsigned Opcode = MI->getOpcode();
DenseMap<unsigned, unsigned>::iterator OPI = ReduceOpcodeMap.find(Opcode);		DenseMap<unsigned, unsigned>::iterator OPI = ReduceOpcodeMap.find(Opcode);
if (OPI == ReduceOpcodeMap.end())		if (OPI == ReduceOpcodeMap.end())
return false;		return false;
		if (SkipPrologueEpilogue && (MI->getFlag(MachineInstr::FrameSetup) \|\|
		MI->getFlag(MachineInstr::FrameDestroy)))
		return false;
const ReduceEntry &Entry = ReduceTable[OPI->second];		const ReduceEntry &Entry = ReduceTable[OPI->second];

// Don't attempt normal reductions on "special" cases for now.		// Don't attempt normal reductions on "special" cases for now.
if (Entry.Special)		if (Entry.Special)
return ReduceSpecial(MBB, MI, Entry, LiveCPSR, IsSelfLoop);		return ReduceSpecial(MBB, MI, Entry, LiveCPSR, IsSelfLoop);

// Try to transform to a 16-bit two-address instruction.		// Try to transform to a 16-bit two-address instruction.
if (Entry.NarrowOpc2 &&		if (Entry.NarrowOpc2 &&
ReduceTo2Addr(MBB, MI, Entry, LiveCPSR, IsSelfLoop))		ReduceTo2Addr(MBB, MI, Entry, LiveCPSR, IsSelfLoop))
return true;		return true;

// Try to transform to a 16-bit non-two-address instruction.		// Try to transform to a 16-bit non-two-address instruction.
if (Entry.NarrowOpc1 &&		if (Entry.NarrowOpc1 &&
ReduceToNarrow(MBB, MI, Entry, LiveCPSR, IsSelfLoop))		ReduceToNarrow(MBB, MI, Entry, LiveCPSR, IsSelfLoop))
return true;		return true;

return false;		return false;
}		}

bool Thumb2SizeReduce::ReduceMBB(MachineBasicBlock &MBB) {		bool Thumb2SizeReduce::ReduceMBB(MachineBasicBlock &MBB,
		bool SkipPrologueEpilogue) {
bool Modified = false;		bool Modified = false;

// Yes, CPSR could be livein.		// Yes, CPSR could be livein.
bool LiveCPSR = MBB.isLiveIn(ARM::CPSR);		bool LiveCPSR = MBB.isLiveIn(ARM::CPSR);
MachineInstr *BundleMI = nullptr;		MachineInstr *BundleMI = nullptr;

CPSRDef = nullptr;		CPSRDef = nullptr;
HighLatencyCPSR = false;		HighLatencyCPSR = false;
Show All 27 Lines	for (; MII != E; MII = NextMII) {
if (MI->isDebugInstr())		if (MI->isDebugInstr())
continue;		continue;

LiveCPSR = UpdateCPSRUse(*MI, LiveCPSR);		LiveCPSR = UpdateCPSRUse(*MI, LiveCPSR);

// Does NextMII belong to the same bundle as MI?		// Does NextMII belong to the same bundle as MI?
bool NextInSameBundle = NextMII != E && NextMII->isBundledWithPred();		bool NextInSameBundle = NextMII != E && NextMII->isBundledWithPred();

if (ReduceMI(MBB, MI, LiveCPSR, IsSelfLoop)) {		if (ReduceMI(MBB, MI, LiveCPSR, IsSelfLoop, SkipPrologueEpilogue)) {
Modified = true;		Modified = true;
MachineBasicBlock::instr_iterator I = std::prev(NextMII);		MachineBasicBlock::instr_iterator I = std::prev(NextMII);
MI = &*I;		MI = &*I;
// Removing and reinserting the first instruction in a bundle will break		// Removing and reinserting the first instruction in a bundle will break
// up the bundle. Fix the bundling if it was broken.		// up the bundle. Fix the bundling if it was broken.
if (NextInSameBundle && !NextMII->isBundledWithPred())		if (NextInSameBundle && !NextMII->isBundledWithPred())
NextMII->bundleWithPred();		NextMII->bundleWithPred();
}		}
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	bool Thumb2SizeReduce::runOnMachineFunction(MachineFunction &MF) {

BlockInfo.clear();		BlockInfo.clear();
BlockInfo.resize(MF.getNumBlockIDs());		BlockInfo.resize(MF.getNumBlockIDs());

// Visit blocks in reverse post-order so LastCPSRDef is known for all		// Visit blocks in reverse post-order so LastCPSRDef is known for all
// predecessors.		// predecessors.
ReversePostOrderTraversal<MachineFunction*> RPOT(&MF);		ReversePostOrderTraversal<MachineFunction*> RPOT(&MF);
bool Modified = false;		bool Modified = false;
		bool NeedsWinCFI = MF.getTarget().getMCAsmInfo()->usesWindowsCFI() &&
		MF.getFunction().needsUnwindTableEntry();
for (MachineBasicBlock *MBB : RPOT)		for (MachineBasicBlock *MBB : RPOT)
Modified \|= ReduceMBB(*MBB);		Modified \|= ReduceMBB(MBB, /SkipPrologueEpilogue=*/NeedsWinCFI);
return Modified;		return Modified;
}		}

/// createThumb2SizeReductionPass - Returns an instance of the Thumb2 size		/// createThumb2SizeReductionPass - Returns an instance of the Thumb2 size
/// reduction pass.		/// reduction pass.
FunctionPass *llvm::createThumb2SizeReductionPass(		FunctionPass *llvm::createThumb2SizeReductionPass(
std::function<bool(const Function &)> Ftor) {		std::function<bool(const Function &)> Ftor) {
return new Thumb2SizeReduce(std::move(Ftor));		return new Thumb2SizeReduce(std::move(Ftor));
}		}

llvm/test/CodeGen/ARM/Windows/dllimport.ll

	Show First 20 Lines • Show All 51 Lines • ▼ Show 20 Lines
	; CHECK: bx r0			; CHECK: bx r0

	define arm_aapcs_vfpcc i32 @call_internal() {			define arm_aapcs_vfpcc i32 @call_internal() {
	%call = tail call arm_aapcs_vfpcc i32 @internal()			%call = tail call arm_aapcs_vfpcc i32 @internal()
	ret i32 %call			ret i32 %call
	}			}

	; CHECK-LABEL: call_internal			; CHECK-LABEL: call_internal
	; CHECK: b internal			; CHECK: b.w internal

llvm/test/CodeGen/ARM/Windows/wineh-opcodes.ll

This file was added.

				;; Check that this produces the expected assembly output
				; RUN: llc -mtriple=thumbv7-windows -o - %s -verify-machineinstrs \| FileCheck %s
				;; Also try to write an object file, which verifies that the SEH opcodes
				;; match the actual prologue/epilogue length.
				; RUN: llc -mtriple=thumbv7-windows -filetype=obj -o %t.obj %s -verify-machineinstrs

				; CHECK-LABEL: clobberR4Frame:
				; CHECK-NEXT: .seh_proc clobberR4Frame
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r4, r7, r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4, r7, r11, lr}
				; CHECK-NEXT: add.w r11, sp, #8
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: .seh_endprologue
				; CHECK-NEXT: bl other

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: pop.w {r4, r7, r11, pc}
				; CHECK-NEXT: .seh_save_regs_w {r4, r7, r11, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @clobberR4Frame() uwtable "frame-pointer"="all" {
				entry:
				call arm_aapcs_vfpcc void @other()
				call void asm sideeffect "", "~{r4}"()
				ret void
				}

				; CHECK-LABEL: clobberR4NoFrame:
				; CHECK-NEXT: .seh_proc clobberR4NoFrame
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r4, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4, lr}
				; CHECK-NEXT: .seh_endprologue
				; CHECK-NEXT: bl other

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: pop.w {r4, pc}
				; CHECK-NEXT: .seh_save_regs_w {r4, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @clobberR4NoFrame() uwtable "frame-pointer"="none" {
				entry:
				call arm_aapcs_vfpcc void @other()
				call void asm sideeffect "", "~{r4}"()
				ret void
				}

				; CHECK-LABEL: clobberR4Tail:
				; CHECK-NEXT: .seh_proc clobberR4Tail
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r4, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4, lr}
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: pop.w {r4, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4, lr}
				; CHECK-NEXT: b.w other
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @clobberR4Tail() uwtable "frame-pointer"="none" {
				entry:
				call void asm sideeffect "", "~{r4}"()
				tail call arm_aapcs_vfpcc void @other()
				ret void
				}

				; CHECK-LABEL: clobberD8D10:
				; CHECK-NEXT: .seh_proc clobberD8D10
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: vpush {d8, d9, d10}
				; CHECK-NEXT: .seh_save_fregs {d8-d10}
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: vpop {d8, d9, d10}
				; CHECK-NEXT: .seh_save_fregs {d8-d10}
				; CHECK-NEXT: b.w other
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @clobberD8D10() uwtable "frame-pointer"="none" {
				entry:
				call void asm sideeffect "", "~{d8},~{d9},~{d10}"()
				tail call arm_aapcs_vfpcc void @other()
				ret void
				}

				declare arm_aapcs_vfpcc void @other()

				; CHECK-LABEL: vararg:
				; CHECK-NEXT: .seh_proc vararg
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: sub sp, #12
				; CHECK-NEXT: .seh_stackalloc 12
				; CHECK-NEXT: push.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: sub sp, #4
				; CHECK-NEXT: .seh_stackalloc 4
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add sp, #4
				; CHECK-NEXT: .seh_stackalloc 4
				; CHECK-NEXT: pop.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: add sp, #12
				; CHECK-NEXT: .seh_stackalloc 12
				; CHECK-NEXT: bx lr
				; CHECK-NEXT: .seh_nop
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @vararg(i32 noundef %a, ...) uwtable "frame-pointer"="none" {
				entry:
				%ap = alloca ptr, align 4
				call void @llvm.lifetime.start.p0(i64 4, ptr nonnull %ap)
				call void @llvm.va_start(ptr nonnull %ap)
				%0 = load ptr, ptr %ap
				call arm_aapcs_vfpcc void @useva(ptr noundef %0)
				call void @llvm.va_end(ptr nonnull %ap)
				call void @llvm.lifetime.end.p0(i64 4, ptr nonnull %ap)
				ret void
				}

				declare void @llvm.lifetime.start.p0(i64 immarg, ptr nocapture)
				declare void @llvm.lifetime.end.p0(i64 immarg, ptr nocapture)
				declare void @llvm.va_start(ptr)
				declare void @llvm.va_end(ptr)

				declare arm_aapcs_vfpcc void @useva(ptr noundef)

				; CHECK-LABEL: onlystack:
				; CHECK-NEXT: .seh_proc onlystack
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: sub sp, #4
				; CHECK-NEXT: .seh_stackalloc 4
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add sp, #4
				; CHECK-NEXT: .seh_stackalloc 4
				; CHECK-NEXT: bx lr
				; CHECK-NEXT: .seh_nop
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define dso_local arm_aapcs_vfpcc void @onlystack() uwtable "frame-pointer"="none" {
				entry:
				%buf = alloca [4 x i8], align 1
				call void @llvm.lifetime.start.p0(i64 4, ptr nonnull %buf)
				call void asm sideeffect "", "r"(ptr nonnull %buf)
				call void @llvm.lifetime.end.p0(i64 4, ptr nonnull %buf)
				ret void
				}

				; CHECK-LABEL: func50:
				; CHECK-NEXT: .seh_proc func50
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: sub sp, #56
				; CHECK-NEXT: .seh_stackalloc 56
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add sp, #56
				; CHECK-NEXT: .seh_stackalloc 56
				; CHECK-NEXT: pop.w {r11, pc}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @func50() {
				entry:
				%buf = alloca [50 x i8], align 1
				call void @llvm.lifetime.start.p0(i64 50, ptr nonnull %buf)
				call arm_aapcs_vfpcc void @useptr(ptr noundef nonnull %buf)
				call void @llvm.lifetime.end.p0(i64 50, ptr nonnull %buf)
				ret void
				}

				; CHECK-LABEL: func4000:
				; CHECK-NEXT: .seh_proc func4000
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: sub.w sp, sp, #4000
				; CHECK-NEXT: .seh_stackalloc_w 4000
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add.w sp, sp, #4000
				; CHECK-NEXT: .seh_stackalloc_w 4000
				; CHECK-NEXT: pop.w {r11, pc}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @func4000() {
				entry:
				%buf = alloca [4000 x i8], align 1
				call void @llvm.lifetime.start.p0(i64 4000, ptr nonnull %buf)
				call arm_aapcs_vfpcc void @useptr(ptr noundef nonnull %buf)
				call void @llvm.lifetime.end.p0(i64 4000, ptr nonnull %buf)
				ret void
				}

				; CHECK-LABEL: func5000:
				; CHECK-NEXT: .seh_proc func5000
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r4, r5, r6, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4-r6, lr}
				; CHECK-NEXT: movw r4, #1250
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: bl __chkstk
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: sub.w sp, sp, r4
				; CHECK-NEXT: .seh_stackalloc_w 5000
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add.w sp, sp, #4992
				; CHECK-NEXT: .seh_stackalloc_w 4992
				; CHECK-NEXT: add sp, #8
				; CHECK-NEXT: .seh_stackalloc 8
				; CHECK-NEXT: pop.w {r4, r5, r6, pc}
				; CHECK-NEXT: .seh_save_regs_w {r4-r6, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @func5000() {
				entry:
				%buf = alloca [5000 x i8], align 1
				call void @llvm.lifetime.start.p0(i64 5000, ptr nonnull %buf)
				call arm_aapcs_vfpcc void @useptr(ptr noundef nonnull %buf)
				call void @llvm.lifetime.end.p0(i64 5000, ptr nonnull %buf)
				ret void
				}

				; CHECK-LABEL: func262144:
				; CHECK-NEXT: .seh_proc func262144
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r4, r5, r6, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4-r6, lr}
				; CHECK-NEXT: movw r4, #0
				; CHECK-NEXT: .seh_nop
				; CHECK-NEXT: movt r4, #1
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: bl __chkstk
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: sub.w sp, sp, r4
				; CHECK-NEXT: .seh_stackalloc_w 262144
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add.w sp, sp, #262144
				; CHECK-NEXT: .seh_stackalloc_w 262144
				; CHECK-NEXT: pop.w {r4, r5, r6, pc}
				; CHECK-NEXT: .seh_save_regs_w {r4-r6, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @func262144() {
				entry:
				%buf = alloca [262144 x i8], align 1
				call void @llvm.lifetime.start.p0(i64 262144, ptr nonnull %buf)
				call arm_aapcs_vfpcc void @useptr(ptr noundef nonnull %buf)
				call void @llvm.lifetime.end.p0(i64 262144, ptr nonnull %buf)
				ret void
				}

				; CHECK-LABEL: func270000:
				; CHECK-NEXT: .seh_proc func270000
				; CHECK-NEXT: @ %bb.0: @ %entry
				; CHECK-NEXT: push.w {r4, r5, r6, lr}
				; CHECK-NEXT: .seh_save_regs_w {r4-r6, lr}
				; CHECK-NEXT: movw r4, #1964
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: movt r4, #1
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: bl __chkstk
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: sub.w sp, sp, r4
				; CHECK-NEXT: .seh_stackalloc_w 270000
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: add.w sp, sp, #268288
				; CHECK-NEXT: .seh_stackalloc_w 268288
				; CHECK-NEXT: add.w sp, sp, #1712
				; CHECK-NEXT: .seh_stackalloc_w 1712
				; CHECK-NEXT: pop.w {r4, r5, r6, pc}
				; CHECK-NEXT: .seh_save_regs_w {r4-r6, lr}
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				define arm_aapcs_vfpcc void @func270000() {
				entry:
				%buf = alloca [270000 x i8], align 1
				call void @llvm.lifetime.start.p0(i64 270000, ptr nonnull %buf)
				call arm_aapcs_vfpcc void @useptr(ptr noundef nonnull %buf)
				call void @llvm.lifetime.end.p0(i64 270000, ptr nonnull %buf)
				ret void
				}

				declare arm_aapcs_vfpcc void @useptr(ptr noundef)

llvm/test/CodeGen/ARM/Windows/wineh-tailmerge.ll

This file was added.

				;; Check that epilogues aren't tail merged.

				;; Check that this produces the expected assembly output
				; RUN: llc -mtriple=thumbv7-windows -o - %s -verify-machineinstrs \| FileCheck %s
				;; Also try to write an object file, which verifies that the SEH opcodes
				;; match the actual prologue/epilogue length.
				; RUN: llc -mtriple=thumbv7-windows -filetype=obj -o %t.obj %s -verify-machineinstrs

				; CHECK-LABEL: d:
				; CHECK: .seh_proc d

				; CHECK: push.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: .seh_endprologue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: pop.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: b.w b
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: .seh_endepilogue

				; CHECK: .seh_startepilogue
				; CHECK-NEXT: pop.w {r11, lr}
				; CHECK-NEXT: .seh_save_regs_w {r11, lr}
				; CHECK-NEXT: b.w c
				; CHECK-NEXT: .seh_nop_w
				; CHECK-NEXT: .seh_endepilogue
				; CHECK-NEXT: .seh_endproc

				@a = global i32 0, align 4

				define arm_aapcs_vfpcc void @d() optsize uwtable "frame-pointer"="none" {
				entry:
				%0 = load i32, ptr @a, align 4
				switch i32 %0, label %if.then1 [
				i32 10, label %if.then
				i32 0, label %if.end2
				]

				if.then:
				tail call arm_aapcs_vfpcc void @b()
				br label %return

				if.then1:
				tail call arm_aapcs_vfpcc void @b()
				br label %if.end2

				if.end2:
				tail call arm_aapcs_vfpcc void @c()
				br label %return

				return:
				ret void
				}

				declare arm_aapcs_vfpcc void @b(...)

				declare arm_aapcs_vfpcc void @c(...)

llvm/test/DebugInfo/COFF/ARMNT/arm-register-variables.ll

	Show All 38 Lines
	; OBJ-NEXT: }			; OBJ-NEXT: }
	; OBJ-NEXT: DefRangeRegisterRelSym {			; OBJ-NEXT: DefRangeRegisterRelSym {
	; OBJ-NEXT: Kind: S_DEFRANGE_REGISTER_REL (0x1145)			; OBJ-NEXT: Kind: S_DEFRANGE_REGISTER_REL (0x1145)
	; OBJ-NEXT: BaseRegister: ARM_SP (0x17)			; OBJ-NEXT: BaseRegister: ARM_SP (0x17)
	; OBJ-NEXT: HasSpilledUDTMember: No			; OBJ-NEXT: HasSpilledUDTMember: No
	; OBJ-NEXT: OffsetInParent: 0			; OBJ-NEXT: OffsetInParent: 0
	; OBJ-NEXT: BasePointerOffset: 12			; OBJ-NEXT: BasePointerOffset: 12
	; OBJ-NEXT: LocalVariableAddrRange {			; OBJ-NEXT: LocalVariableAddrRange {
	; OBJ-NEXT: OffsetStart: .text+0x8			; OBJ-NEXT: OffsetStart: .text+0xA
	; OBJ-NEXT: ISectStart: 0x0			; OBJ-NEXT: ISectStart: 0x0
	; OBJ-NEXT: Range: 0x1A			; OBJ-NEXT: Range: 0x1C
	; OBJ-NEXT: }			; OBJ-NEXT: }
	; OBJ-NEXT: }			; OBJ-NEXT: }

	; ModuleID = 'test.c'			; ModuleID = 'test.c'
	source_filename = "test.c"			source_filename = "test.c"
	target datalayout = "e-m:w-p:32:32-Fi8-i64:64-v128:64:128-a:0:32-n32-S64"			target datalayout = "e-m:w-p:32:32-Fi8-i64:64-v128:64:128-a:0:32-n32-S64"
	target triple = "thumbv7-pc-windows-msvc19.11.0"			target triple = "thumbv7-pc-windows-msvc19.11.0"

	▲ Show 20 Lines • Show All 111 Lines • Show Last 20 Lines