Download Raw Diff

Details

Reviewers

labrinea
rengolin
efriedma

Commits

rG7e8af2fc0c06: [ARM] Support -mexecute-only with -mlong-calls.

Summary

Instead of using constant pools, use movw movt pair.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ZhiyaoMa98 created this revision.Oct 18 2022, 2:51 PM

Herald added a project: Restricted Project. · View Herald TranscriptOct 18 2022, 2:51 PM

Herald added subscribers: hiraditya, kristof.beyls. · View Herald Transcript

ZhiyaoMa98 requested review of this revision.Oct 18 2022, 2:51 PM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptOct 18 2022, 2:51 PM

Herald added subscribers: cfe-commits, MaskRay. · View Herald Transcript

efriedma added a subscriber: efriedma.Oct 18 2022, 2:58 PM

efriedma added inline comments.

llvm/test/CodeGen/Thumb2/thumb2-execute-only-long-calls.ll
20	Is there some reason we can't just generate `movw r0, :lower16:bar; movt r0, :upper16:bar`?

Harbormaster completed remote builds in B192856: Diff 468707.Oct 18 2022, 3:18 PM

@efriedma Thank you for your suggestion. I will remove the extra indirection.

I was wondering if you could also provide some insights about the RWPI case. I believe the same optimization also applies to RWPI. However, I actually want to store the function address as a global variable when using RWPI, because I want the address to live in RAM instead of Flash, so that I can redirect the function call at runtime, for dynamic linking purpose.

Should I create a new target feature to indicate that I want to store function address in RAM?

that I can redirect the function call at runtime, for dynamic linking purpose.

Can you describe a little more what you're trying to do here?

If you want to replace the implementation of an existing function at runtime, you'd be better off implementing the indirection as a frontend feature; by the time you get to the backend, optimizations have destroyed the semantics you want.

Can you describe a little more what you're trying to do here?

Sure. My eventual goal is to enable fine-granular live-update on ARM based microcontrollers, which requires the system to do some relocation at runtime. Below I will describe the challenge with a simple C example.

Consider the following C snippet:

extern void global_func(void); // A global function whose symbol is exported by the system at runtime.
static void local_func(void) { ... }

static void main_entry(void) {
    local_func();
    global_func();
}

I want to load and run the compiled object file at runtime, which requires two steps.

Burn the object file into Flash storage.
Perform a runtime symbol resolution and relocation so that global_func is set to the runtime address.

The reason why I must store code in Flash storage is that the microcontroller I am using, as well as many other ARM based microcontrollers, has Flash storage 5x greater than RAM, and code typically directly runs from Flash.

local_func requires the compiler to use position independent code, which has already been handled by -fropi. global_func however, is the case I am trying to solve here.

Existing compiler options always store the address of global_func in Flash.

The default case:

main_entry:
    bl  local_func
    b.w global_func // Relative address is hardcoded in the instruction, in Flash.

If compile with -mlong-calls:

main_entry:
    bl  local_func
    ldr r0, [pc, #4] // Load address from constant pool, still in Flash.
    bx  r0
.Lconst_pool:
    .word global_func

In the hypothetical case if the compiler chose to use movw movt pair:

main_entry:
    bl   local_func
    movw r0, :lower16:global_func // Absolute address is hardcoded in the instruction, still in Flash.
    movt r0, :upper16:global_func
    bx.  r0

I was expecting to use the "side effect" of -mexecute-only that promotes constant pools to global variables to achieve my goal of having the function address to live in RAM.

main_entry:
    bl   local_func
    movw r0, :lower16:.const_pool(sbrel)
    movt r0, :upper16:.const_pool(sbrel) // Also using RWPI so that the jump table can be placed anywhere in RAM pointed by r9.
    ldr  r0, [r9, r0] // Absolute address is held in RAM now.
    bx   r0

As you have already pointed out, in the normal case when we do not need to put the address in RAM, the extra indirection is unnecessary and slows down the code.

But if I have a use case like above where I need to store the address in RAM, could you enlighten me about the best approach to achieve my goal?

The construct you want is pretty similar to a GOT. if you compile with -fPIE -fsemantic-interposition, you get basically the code you want, except that the compiler uses a plt by default instead of a got. If we supported -fno-plt for ARM, it would be almost exactly what you want. That said, that won't work with -frwpi... maybe we need some new kind of relocation to represent that.

Unfortunately, -fPIE seems not to be generating the PLT on LLVM for embedded ARM.

C source file (test.c):

extern void bar(void);
void foo(void) {
    bar();
}

LLVM with clang -O2 -fPIE -fsemantic-interposition -mlong-calls --target=armv7em-none-eabi -c test.c:

00000000 <foo>:
   0:   4800            ldr     r0, [pc, #0]    ; (4 <foo+0x4>)
   2:   4700            bx      r0
   4:   00000000        .word   0x00000000

ARM GNU with arm-none-eabi-gcc -O2 -fPIE -mlong-calls -msingle-pic-base -mcpu=cortex-m4 -c test.c:

00000000 <foo>:
   0:   4b01            ldr     r3, [pc, #4]    ; (8 <foo+0x8>)
   2:   f859 3003       ldr.w   r3, [r9, r3]
   6:   4718            bx      r3
   8:   00000000        .word   0x00000000

One, -mlong-calls isn't currently compatible with PIE. Two, on ARM, there are no special plt relocations; the linker just takes care of it. (You can see the differences if you try to take the address of a function without calling it.)

ZhiyaoMa98 updated this revision to Diff 468928.Oct 19 2022, 8:29 AM

ZhiyaoMa98 edited the summary of this revision. (Show Details)

I have updated the diff to avoid the extra indirection. I am thinking about adding a new option, say -mgot-calls to allow code generation with the extra indirection. Is it sensible and shall I create another diff to discuss that?

Harbormaster completed remote builds in B193016: Diff 468928.Oct 19 2022, 9:34 AM

I am thinking about adding a new option, say -mgot-calls to allow code generation with the extra indirection. Is it sensible and shall I create another diff to discuss that?

That probably makes sense, yes.

llvm/lib/Target/ARM/ARMISelLowering.cpp
2655	Can we directly check that movw/movt is available? I think that's what we do in other places? (Then just assert we aren't execute-only in the non-movw path.)

ZhiyaoMa98 updated this revision to Diff 468957.Oct 19 2022, 10:26 AM

Then just assert we aren't execute-only in the non-movw path.

When we are not execute-only, existing code handles it by using constant pools and we are all good.

In the case where we are execute-only and long-calls at the same time, we assert that we have movt like in other places in the same source file.

LGTM with one small change.

clang/lib/Driver/ToolChains/Arch/ARM.cpp
779	Fix this comment?

This revision is now accepted and ready to land.Oct 19 2022, 10:44 AM

Updated the comment to reflect that now we allow using -mlong-calls with -mexecute-only.

efriedma accepted this revision.Oct 19 2022, 11:09 AM

Harbormaster completed remote builds in B193047: Diff 468973.Oct 19 2022, 12:10 PM

Remove the unused GA variable.

Harbormaster completed remote builds in B193099: Diff 469049.Oct 19 2022, 3:07 PM

Just in case you assume that I have push permission, unfortunately I do not. Could you help me merge the patch in? Thanks.

Closed by commit rG7e8af2fc0c06: [ARM] Support -mexecute-only with -mlong-calls. (authored by ZhiyaoMa98, committed by efriedma). · Explain WhyOct 24 2022, 11:41 AM

This revision was automatically updated to reflect the committed changes.

efriedma added a commit: rG7e8af2fc0c06: [ARM] Support -mexecute-only with -mlong-calls..

Diff 470236

clang/lib/Driver/ToolChains/Arch/ARM.cpp

Show First 20 Lines • Show All 770 Lines • ▼ Show 20 Lines	if (A->getOption().matches(options::OPT_mlong_calls))
!Triple.isWatchOS()) {		!Triple.isWatchOS()) {
Features.push_back("+long-calls");		Features.push_back("+long-calls");
}		}

// Generate execute-only output (no data access to code sections).		// Generate execute-only output (no data access to code sections).
// This only makes sense for the compiler, not for the assembler.		// This only makes sense for the compiler, not for the assembler.
if (!ForAS) {		if (!ForAS) {
// Supported only on ARMv6T2 and ARMv7 and above.		// Supported only on ARMv6T2 and ARMv7 and above.
// Cannot be combined with -mno-movt or -mlong-calls		// Cannot be combined with -mno-movt.
		efriedmaUnsubmitted Done Reply Inline Actions Fix this comment? efriedma: Fix this comment?
if (Arg *A = Args.getLastArg(options::OPT_mexecute_only, options::OPT_mno_execute_only)) {		if (Arg *A = Args.getLastArg(options::OPT_mexecute_only, options::OPT_mno_execute_only)) {
if (A->getOption().matches(options::OPT_mexecute_only)) {		if (A->getOption().matches(options::OPT_mexecute_only)) {
if (getARMSubArchVersionNumber(Triple) < 7 &&		if (getARMSubArchVersionNumber(Triple) < 7 &&
llvm::ARM::parseArch(Triple.getArchName()) != llvm::ARM::ArchKind::ARMV6T2)		llvm::ARM::parseArch(Triple.getArchName()) != llvm::ARM::ArchKind::ARMV6T2)
D.Diag(diag::err_target_unsupported_execute_only) << Triple.getArchName();		D.Diag(diag::err_target_unsupported_execute_only) << Triple.getArchName();
else if (Arg *B = Args.getLastArg(options::OPT_mno_movt))		else if (Arg *B = Args.getLastArg(options::OPT_mno_movt))
D.Diag(diag::err_opt_not_valid_with_opt) << A->getAsString(Args) << B->getAsString(Args);		D.Diag(diag::err_opt_not_valid_with_opt)
// Long calls create constant pool entries and have not yet been fixed up		<< A->getAsString(Args) << B->getAsString(Args);
// to play nicely with execute-only. Hence, they cannot be used in
// execute-only code for now
else if (Arg *B = Args.getLastArg(options::OPT_mlong_calls, options::OPT_mno_long_calls)) {
if (B->getOption().matches(options::OPT_mlong_calls))
D.Diag(diag::err_opt_not_valid_with_opt) << A->getAsString(Args) << B->getAsString(Args);
}
Features.push_back("+execute-only");		Features.push_back("+execute-only");
}		}
}		}
}		}

// Kernel code has more strict alignment requirements.		// Kernel code has more strict alignment requirements.
if (KernelOrKext) {		if (KernelOrKext) {
Features.push_back("+strict-align");		Features.push_back("+strict-align");
▲ Show 20 Lines • Show All 210 Lines • Show Last 20 Lines

clang/test/Driver/arm-execute-only.c

	// RUN: not %clang -c -target thumbv6m-eabi -mexecute-only %s 2>&1 \| \			// RUN: not %clang -c -target thumbv6m-eabi -mexecute-only %s 2>&1 \| \
	// RUN: FileCheck --check-prefix CHECK-EXECUTE-ONLY-NOT-SUPPORTED %s			// RUN: FileCheck --check-prefix CHECK-EXECUTE-ONLY-NOT-SUPPORTED %s
	// CHECK-EXECUTE-ONLY-NOT-SUPPORTED: error: execute only is not supported for the thumbv6m sub-architecture			// CHECK-EXECUTE-ONLY-NOT-SUPPORTED: error: execute only is not supported for the thumbv6m sub-architecture

	// RUN: not %clang -target armv8m.main-eabi -mexecute-only -mno-movt %s 2>&1 \			// RUN: not %clang -target armv8m.main-eabi -mexecute-only -mno-movt %s 2>&1 \
	// RUN: \| FileCheck %s -check-prefix CHECK-EXECUTE-ONLY-NO-MOVT			// RUN: \| FileCheck %s -check-prefix CHECK-EXECUTE-ONLY-NO-MOVT
	// CHECK-EXECUTE-ONLY-NO-MOVT: error: option '-mexecute-only' cannot be specified with '-mno-movt'			// CHECK-EXECUTE-ONLY-NO-MOVT: error: option '-mexecute-only' cannot be specified with '-mno-movt'

	// RUN: not %clang -target armv8m.main-eabi -mexecute-only -mlong-calls %s 2>&1 \
	// RUN: \| FileCheck %s -check-prefix CHECK-EXECUTE-ONLY-LONG-CALLS
	// CHECK-EXECUTE-ONLY-LONG-CALLS: error: option '-mexecute-only' cannot be specified with '-mlong-calls'

	// RUN: %clang -target armv7m-eabi -x assembler -mexecute-only %s -c -### 2>&1 \			// RUN: %clang -target armv7m-eabi -x assembler -mexecute-only %s -c -### 2>&1 \
	// RUN: \| FileCheck %s --check-prefix=CHECK-NO-EXECUTE-ONLY-ASM			// RUN: \| FileCheck %s --check-prefix=CHECK-NO-EXECUTE-ONLY-ASM
	// CHECK-NO-EXECUTE-ONLY-ASM: warning: argument unused during compilation: '-mexecute-only'			// CHECK-NO-EXECUTE-ONLY-ASM: warning: argument unused during compilation: '-mexecute-only'

	// -mpure-code flag for GCC compatibility			// -mpure-code flag for GCC compatibility
	// RUN: not %clang -c -target thumbv6m-eabi -mpure-code %s 2>&1 \| \			// RUN: not %clang -c -target thumbv6m-eabi -mpure-code %s 2>&1 \| \
	// RUN: FileCheck --check-prefix CHECK-EXECUTE-ONLY-NOT-SUPPORTED %s			// RUN: FileCheck --check-prefix CHECK-EXECUTE-ONLY-NOT-SUPPORTED %s

	// RUN: not %clang -target armv8m.main-eabi -mpure-code -mno-movt %s 2>&1 \			// RUN: not %clang -target armv8m.main-eabi -mpure-code -mno-movt %s 2>&1 \
	// RUN: \| FileCheck %s -check-prefix CHECK-PURE-CODE-NO-MOVT			// RUN: \| FileCheck %s -check-prefix CHECK-PURE-CODE-NO-MOVT
	// CHECK-PURE-CODE-NO-MOVT: error: option '-mpure-code' cannot be specified with '-mno-movt'			// CHECK-PURE-CODE-NO-MOVT: error: option '-mpure-code' cannot be specified with '-mno-movt'

	// RUN: not %clang -target armv8m.main-eabi -mpure-code -mlong-calls %s 2>&1 \
	// RUN: \| FileCheck %s -check-prefix CHECK-PURE-CODE-LONG-CALLS
	// CHECK-PURE-CODE-LONG-CALLS: error: option '-mpure-code' cannot be specified with '-mlong-calls'

llvm/lib/Target/ARM/ARMISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,624 Lines • ▼ Show 20 Lines	ARMTargetLowering::LowerCall(TargetLowering::CallLoweringInfo &CLI,

// If the callee is a GlobalAddress/ExternalSymbol node (quite common, every		// If the callee is a GlobalAddress/ExternalSymbol node (quite common, every
// direct call is) turn it into a TargetGlobalAddress/TargetExternalSymbol		// direct call is) turn it into a TargetGlobalAddress/TargetExternalSymbol
// node so that legalize doesn't hack it.		// node so that legalize doesn't hack it.
bool isDirect = false;		bool isDirect = false;

const TargetMachine &TM = getTargetMachine();		const TargetMachine &TM = getTargetMachine();
const Module *Mod = MF.getFunction().getParent();		const Module *Mod = MF.getFunction().getParent();
const GlobalValue *GV = nullptr;		const GlobalValue *GVal = nullptr;
if (GlobalAddressSDNode *G = dyn_cast<GlobalAddressSDNode>(Callee))		if (GlobalAddressSDNode *G = dyn_cast<GlobalAddressSDNode>(Callee))
GV = G->getGlobal();		GVal = G->getGlobal();
bool isStub =		bool isStub =
!TM.shouldAssumeDSOLocal(*Mod, GV) && Subtarget->isTargetMachO();		!TM.shouldAssumeDSOLocal(*Mod, GVal) && Subtarget->isTargetMachO();

bool isARMFunc = !Subtarget->isThumb() \|\| (isStub && !Subtarget->isMClass());		bool isARMFunc = !Subtarget->isThumb() \|\| (isStub && !Subtarget->isMClass());
bool isLocalARMFunc = false;		bool isLocalARMFunc = false;
auto PtrVt = getPointerTy(DAG.getDataLayout());		auto PtrVt = getPointerTy(DAG.getDataLayout());

if (Subtarget->genLongCalls()) {		if (Subtarget->genLongCalls()) {
assert((!isPositionIndependent() \|\| Subtarget->isTargetWindows()) &&		assert((!isPositionIndependent() \|\| Subtarget->isTargetWindows()) &&
"long-calls codegen is not position independent!");		"long-calls codegen is not position independent!");
// Handle a global address or an external symbol. If it's not one of		// Handle a global address or an external symbol. If it's not one of
// those, the target's already in a register, so we don't need to do		// those, the target's already in a register, so we don't need to do
// anything extra.		// anything extra.
if (isa<GlobalAddressSDNode>(Callee)) {		if (isa<GlobalAddressSDNode>(Callee)) {
		// When generating execute-only code we use movw movt pair.
		// Currently execute-only is only available for architectures that
		// support movw movt, so we are safe to assume that.
		if (Subtarget->genExecuteOnly()) {
		assert(Subtarget->useMovt() &&
		"long-calls with execute-only requires movt and movw!");
		efriedmaUnsubmitted Done Reply Inline Actions Can we directly check that movw/movt is available? I think that's what we do in other places? (Then just assert we aren't execute-only in the non-movw path.) efriedma: Can we directly check that movw/movt is available? I think that's what we do in other places?
		++NumMovwMovt;
		Callee = DAG.getNode(ARMISD::Wrapper, dl, PtrVt,
		DAG.getTargetGlobalAddress(GVal, dl, PtrVt));
		} else {
// Create a constant pool entry for the callee address		// Create a constant pool entry for the callee address
unsigned ARMPCLabelIndex = AFI->createPICLabelUId();		unsigned ARMPCLabelIndex = AFI->createPICLabelUId();
ARMConstantPoolValue *CPV =		ARMConstantPoolValue *CPV = ARMConstantPoolConstant::Create(
ARMConstantPoolConstant::Create(GV, ARMPCLabelIndex, ARMCP::CPValue, 0);		GVal, ARMPCLabelIndex, ARMCP::CPValue, 0);

// Get the address of the callee into a register		// Get the address of the callee into a register
SDValue CPAddr = DAG.getTargetConstantPool(CPV, PtrVt, Align(4));		SDValue Addr = DAG.getTargetConstantPool(CPV, PtrVt, Align(4));
CPAddr = DAG.getNode(ARMISD::Wrapper, dl, MVT::i32, CPAddr);		Addr = DAG.getNode(ARMISD::Wrapper, dl, MVT::i32, Addr);
Callee = DAG.getLoad(		Callee = DAG.getLoad(
PtrVt, dl, DAG.getEntryNode(), CPAddr,		PtrVt, dl, DAG.getEntryNode(), Addr,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()));		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()));
		}
} else if (ExternalSymbolSDNode *S=dyn_cast<ExternalSymbolSDNode>(Callee)) {		} else if (ExternalSymbolSDNode *S=dyn_cast<ExternalSymbolSDNode>(Callee)) {
const char *Sym = S->getSymbol();		const char *Sym = S->getSymbol();

		// When generating execute-only code we use movw movt pair.
		// Currently execute-only is only available for architectures that
		// support movw movt, so we are safe to assume that.
		if (Subtarget->genExecuteOnly()) {
		assert(Subtarget->useMovt() &&
		"long-calls with execute-only requires movt and movw!");
		++NumMovwMovt;
		Callee = DAG.getNode(ARMISD::Wrapper, dl, PtrVt,
		DAG.getTargetGlobalAddress(GVal, dl, PtrVt));
		} else {
// Create a constant pool entry for the callee address		// Create a constant pool entry for the callee address
unsigned ARMPCLabelIndex = AFI->createPICLabelUId();		unsigned ARMPCLabelIndex = AFI->createPICLabelUId();
ARMConstantPoolValue *CPV =		ARMConstantPoolValue *CPV = ARMConstantPoolSymbol::Create(
ARMConstantPoolSymbol::Create(*DAG.getContext(), Sym,		*DAG.getContext(), Sym, ARMPCLabelIndex, 0);
ARMPCLabelIndex, 0);
// Get the address of the callee into a register		// Get the address of the callee into a register
SDValue CPAddr = DAG.getTargetConstantPool(CPV, PtrVt, Align(4));		SDValue Addr = DAG.getTargetConstantPool(CPV, PtrVt, Align(4));
CPAddr = DAG.getNode(ARMISD::Wrapper, dl, MVT::i32, CPAddr);		Addr = DAG.getNode(ARMISD::Wrapper, dl, MVT::i32, Addr);
Callee = DAG.getLoad(		Callee = DAG.getLoad(
PtrVt, dl, DAG.getEntryNode(), CPAddr,		PtrVt, dl, DAG.getEntryNode(), Addr,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()));		MachinePointerInfo::getConstantPool(DAG.getMachineFunction()));
}		}
		}
} else if (isa<GlobalAddressSDNode>(Callee)) {		} else if (isa<GlobalAddressSDNode>(Callee)) {
if (!PreferIndirect) {		if (!PreferIndirect) {
isDirect = true;		isDirect = true;
bool isDef = GV->isStrongDefinitionForLinker();		bool isDef = GVal->isStrongDefinitionForLinker();

// ARM call to a local ARM function is predicable.		// ARM call to a local ARM function is predicable.
isLocalARMFunc = !Subtarget->isThumb() && (isDef \|\| !ARMInterworking);		isLocalARMFunc = !Subtarget->isThumb() && (isDef \|\| !ARMInterworking);
// tBX takes a register source operand.		// tBX takes a register source operand.
if (isStub && Subtarget->isThumb1Only() && !Subtarget->hasV5TOps()) {		if (isStub && Subtarget->isThumb1Only() && !Subtarget->hasV5TOps()) {
assert(Subtarget->isTargetMachO() && "WrapperPIC use on non-MachO?");		assert(Subtarget->isTargetMachO() && "WrapperPIC use on non-MachO?");
Callee = DAG.getNode(		Callee = DAG.getNode(
ARMISD::WrapperPIC, dl, PtrVt,		ARMISD::WrapperPIC, dl, PtrVt,
DAG.getTargetGlobalAddress(GV, dl, PtrVt, 0, ARMII::MO_NONLAZY));		DAG.getTargetGlobalAddress(GVal, dl, PtrVt, 0, ARMII::MO_NONLAZY));
Callee = DAG.getLoad(		Callee = DAG.getLoad(
PtrVt, dl, DAG.getEntryNode(), Callee,		PtrVt, dl, DAG.getEntryNode(), Callee,
MachinePointerInfo::getGOT(DAG.getMachineFunction()), MaybeAlign(),		MachinePointerInfo::getGOT(DAG.getMachineFunction()), MaybeAlign(),
MachineMemOperand::MODereferenceable \|		MachineMemOperand::MODereferenceable \|
MachineMemOperand::MOInvariant);		MachineMemOperand::MOInvariant);
} else if (Subtarget->isTargetCOFF()) {		} else if (Subtarget->isTargetCOFF()) {
assert(Subtarget->isTargetWindows() &&		assert(Subtarget->isTargetWindows() &&
"Windows is the only supported COFF target");		"Windows is the only supported COFF target");
unsigned TargetFlags = ARMII::MO_NO_FLAG;		unsigned TargetFlags = ARMII::MO_NO_FLAG;
if (GV->hasDLLImportStorageClass())		if (GVal->hasDLLImportStorageClass())
TargetFlags = ARMII::MO_DLLIMPORT;		TargetFlags = ARMII::MO_DLLIMPORT;
else if (!TM.shouldAssumeDSOLocal(*GV->getParent(), GV))		else if (!TM.shouldAssumeDSOLocal(*GVal->getParent(), GVal))
TargetFlags = ARMII::MO_COFFSTUB;		TargetFlags = ARMII::MO_COFFSTUB;
Callee = DAG.getTargetGlobalAddress(GV, dl, PtrVt, /offset=/0,		Callee = DAG.getTargetGlobalAddress(GVal, dl, PtrVt, /offset=/0,
TargetFlags);		TargetFlags);
if (TargetFlags & (ARMII::MO_DLLIMPORT \| ARMII::MO_COFFSTUB))		if (TargetFlags & (ARMII::MO_DLLIMPORT \| ARMII::MO_COFFSTUB))
Callee =		Callee =
DAG.getLoad(PtrVt, dl, DAG.getEntryNode(),		DAG.getLoad(PtrVt, dl, DAG.getEntryNode(),
DAG.getNode(ARMISD::Wrapper, dl, PtrVt, Callee),		DAG.getNode(ARMISD::Wrapper, dl, PtrVt, Callee),
MachinePointerInfo::getGOT(DAG.getMachineFunction()));		MachinePointerInfo::getGOT(DAG.getMachineFunction()));
} else {		} else {
Callee = DAG.getTargetGlobalAddress(GV, dl, PtrVt, 0, 0);		Callee = DAG.getTargetGlobalAddress(GVal, dl, PtrVt, 0, 0);
}		}
}		}
} else if (ExternalSymbolSDNode *S = dyn_cast<ExternalSymbolSDNode>(Callee)) {		} else if (ExternalSymbolSDNode *S = dyn_cast<ExternalSymbolSDNode>(Callee)) {
isDirect = true;		isDirect = true;
// tBX takes a register source operand.		// tBX takes a register source operand.
const char *Sym = S->getSymbol();		const char *Sym = S->getSymbol();
if (isARMFunc && Subtarget->isThumb1Only() && !Subtarget->hasV5TOps()) {		if (isARMFunc && Subtarget->isThumb1Only() && !Subtarget->hasV5TOps()) {
unsigned ARMPCLabelIndex = AFI->createPICLabelUId();		unsigned ARMPCLabelIndex = AFI->createPICLabelUId();
▲ Show 20 Lines • Show All 19,095 Lines • Show Last 20 Lines

llvm/test/CodeGen/Thumb2/thumb2-execute-only-long-calls.ll

This file was added.

				; RUN: llc < %s -mtriple=thumbv7em-arm-none-eabi -relocation-model=static \| FileCheck %s -check-prefixes=CHECK,STATIC
				; RUN: llc < %s -mtriple=thumbv7em-arm-none-eabi -relocation-model=rwpi \| FileCheck %s -check-prefixes=CHECK,RWPI

				define void @fn() #0 {
				entry:
				; CHECK-LABEL: fn:
				; CHECK: ldr [[REG:r[0-9]+]], .LCPI0_0
				; CHECK-NEXT: blx [[REG]]
				; CHECK: .LCPI0_0:
				; CHECK-NEXT: .long bar
				call void @bar()
				ret void
				}

				define void @execute_only_fn() #1 {
				; STATIC-LABEL: execute_only_fn:
				; STATIC: movw [[REG0:r[0-9]+]], :lower16:bar
				; STATIC-NEXT: movt [[REG0]], :upper16:bar
				; STATIC-NEXT: blx [[REG0]]
				; STATIC-NOT: .LCPI0_0:
				efriedmaUnsubmitted Done Reply Inline Actions Is there some reason we can't just generate `movw r0, :lower16:bar; movt r0, :upper16:bar`? efriedma: Is there some reason we can't just generate `movw r0, :lower16:bar; movt r0, :upper16:bar`?

				; RWPI-LABEL: execute_only_fn:
				; RWPI: movw [[REG0:r[0-9]+]], :lower16:bar
				; RWPI-NEXT: movt [[REG0]], :upper16:bar
				; RWPI-NEXT: blx [[REG0]]
				; RWPI-NOT: .LCPI0_0:
				entry:
				call void @bar()
				ret void
				}

				attributes #0 = { noinline optnone "target-features"="+thumb-mode,+long-calls" }
				attributes #1 = { noinline optnone "target-features"="+execute-only,+thumb-mode,+long-calls" }

				declare dso_local void @bar()

This is an archive of the discontinued LLVM Phabricator instance.

[ARM] Support -mexecute-only with -mlong-calls.
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 470236

clang/lib/Driver/ToolChains/Arch/ARM.cpp

clang/test/Driver/arm-execute-only.c

llvm/lib/Target/ARM/ARMISelLowering.cpp

llvm/test/CodeGen/Thumb2/thumb2-execute-only-long-calls.ll

This is an archive of the discontinued LLVM Phabricator instance.

[ARM] Support -mexecute-only with -mlong-calls.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 470236

clang/lib/Driver/ToolChains/Arch/ARM.cpp

clang/test/Driver/arm-execute-only.c

llvm/lib/Target/ARM/ARMISelLowering.cpp

llvm/test/CodeGen/Thumb2/thumb2-execute-only-long-calls.ll

[ARM] Support -mexecute-only with -mlong-calls.
ClosedPublic