This is an archive of the discontinued LLVM Phabricator instance.

llvm/test/CodeGen/Thumb2/v8_IT_3.ll
62 ↗	(On Diff #122333)	This new code is the load from %bb moved to the end of %entry under condition of %tmp1 == 1. This is made possible by this change, because %entry has address of G@GOT precomputed and the second load is just one instruction.

efriedma added subscribers: rovka, efriedma.Nov 9 2017, 3:06 PM

efriedma added inline comments.

llvm/lib/Target/ARM/ARMISelLowering.cpp
3168 ↗	(On Diff #122333)	If I'm following correctly, whether "G" points to the global itself or the GOT entry is implicitly controlled by ARMSubtarget::getCPModifier? That's awfully confusing... can we fix it to use target flags passed to getTargetGlobalAddress() to control it instead?

eugenis added inline comments.Nov 9 2017, 3:26 PM

llvm/lib/Target/ARM/ARMISelLowering.cpp
3168 ↗	(On Diff #122333)	Yes... That's not ideal. To make sure we are on the same page: I could add something like ARMII::MO_GOT to TargetGlobalAddress flags, and use to differentiate between LDRLIT_ga_pcrel and MOV_ga_pcrel. This would let us use movt/movw for dso-local globals, yay! I'm not sure how to access the flag in ARMExpandPseudoInsts, though. Is it copied to the operand's TargetFlags? I'll check.

efriedma added inline comments.Nov 9 2017, 4:12 PM

llvm/lib/Target/ARM/ARMISelLowering.cpp
3168 ↗	(On Diff #122333)	Yes, that's what I was thinking. I think the flag gets copied into TargetFlags. See AArch64II::MO_GOT for inspiration.

Add ARMII::MO_GOT.

Harbormaster completed remote builds in B12048: Diff 122373.Nov 9 2017, 4:52 PM

I've switched to MO_GOT, and killed getCPModifier().

I did not figure out how to select between LDRLIT_ga_pcrel and MOV_ga_pcrel based on TargetFlags. Apparently, I need a ComplexPattern. Anyway, I think that it should be done in a separate patch.

This makes sense.

llvm/lib/Target/ARM/MCTargetDesc/ARMBaseInfo.h
233 ↗	(On Diff #122373)	Missing doc comment.

Added a comment for MO_GOT.

eugenis marked an inline comment as done.Nov 10 2017, 3:18 PM

LGTM, assuming you've done appropriate testing.

llvm/test/CodeGen/ARM/GlobalISel/arm-select-globals-pic.mir
59 ↗	(On Diff #122540)	It would be nice to implement getSerializableDirectMachineOperandTargetFlags() to make this clearer.

This revision is now accepted and ready to land.Nov 10 2017, 3:27 PM

I've done some testing. All sanitizer tests pass on arm/android, both with and without -asan-with-ifunc (which is what I'm doing this for).

Code size of libclang_rt.asan-arm-android.so is down by 3% (!). It is just a regular library (i.e. no asan instrumentation in it), so I assume that all other code is similarly improved.

Unfortunately, even with this change MachineCSE is not good enough to enable -asan-with-ifunc (it brings it from 15% code bloat to "only" 5%). For example, the following code generates a GOT load in each bb:

extern char x;
void bar();
void use(char *p);
void foo(int n) {
  if (n > 10) {
    use(&x);
  } else {
    bar();
    use(&x);
  }
}

Anyway, this change is a clear improvement.

Btw, none of the targets that I've looked at hoists the GOT load in the above example. It's just that on ARM it is particularly expensive.

Shared libraries in ContentShell.apk (chromium) are down by 0.8% in size (and the entire package is down by 0.45%) with this change.

Landing.

Closed by commit rL318081: [arm] Fix Unnecessary reloads from GOT. (authored by eugenis). · Explain WhyNov 13 2017, 12:45 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

ARM/

ARMExpandPseudoInsts.cpp

5 lines

24 lines

16 lines

4 lines

4 lines

ARMInstructionSelector.cpp

6 lines

ARMSubtarget.h

9 lines

ARMSubtarget.cpp

8 lines

MCTargetDesc/

ARMBaseInfo.h

5 lines

test/

CodeGen/

ARM/

GlobalISel/

arm-select-globals-pic.mir

4 lines

load-global2.ll

30 lines

Thumb2/

v8_IT_3.ll

6 lines

Diff 122703

llvm/trunk/lib/Target/ARM/ARMExpandPseudoInsts.cpp

Show First 20 Lines • Show All 1,305 Lines • ▼ Show 20 Lines	switch (Opcode) {
case ARM::LDRLIT_ga_abs:		case ARM::LDRLIT_ga_abs:
case ARM::LDRLIT_ga_pcrel:		case ARM::LDRLIT_ga_pcrel:
case ARM::LDRLIT_ga_pcrel_ldr:		case ARM::LDRLIT_ga_pcrel_ldr:
case ARM::tLDRLIT_ga_abs:		case ARM::tLDRLIT_ga_abs:
case ARM::tLDRLIT_ga_pcrel: {		case ARM::tLDRLIT_ga_pcrel: {
unsigned DstReg = MI.getOperand(0).getReg();		unsigned DstReg = MI.getOperand(0).getReg();
bool DstIsDead = MI.getOperand(0).isDead();		bool DstIsDead = MI.getOperand(0).isDead();
const MachineOperand &MO1 = MI.getOperand(1);		const MachineOperand &MO1 = MI.getOperand(1);
		auto Flags = MO1.getTargetFlags();
const GlobalValue *GV = MO1.getGlobal();		const GlobalValue *GV = MO1.getGlobal();
bool IsARM =		bool IsARM =
Opcode != ARM::tLDRLIT_ga_pcrel && Opcode != ARM::tLDRLIT_ga_abs;		Opcode != ARM::tLDRLIT_ga_pcrel && Opcode != ARM::tLDRLIT_ga_abs;
bool IsPIC =		bool IsPIC =
Opcode != ARM::LDRLIT_ga_abs && Opcode != ARM::tLDRLIT_ga_abs;		Opcode != ARM::LDRLIT_ga_abs && Opcode != ARM::tLDRLIT_ga_abs;
unsigned LDRLITOpc = IsARM ? ARM::LDRi12 : ARM::tLDRpci;		unsigned LDRLITOpc = IsARM ? ARM::LDRi12 : ARM::tLDRpci;
unsigned PICAddOpc =		unsigned PICAddOpc =
IsARM		IsARM
? (Opcode == ARM::LDRLIT_ga_pcrel_ldr ? ARM::PICLDR : ARM::PICADD)		? (Opcode == ARM::LDRLIT_ga_pcrel_ldr ? ARM::PICLDR : ARM::PICADD)
: ARM::tPICADD;		: ARM::tPICADD;

// We need a new const-pool entry to load from.		// We need a new const-pool entry to load from.
MachineConstantPool *MCP = MBB.getParent()->getConstantPool();		MachineConstantPool *MCP = MBB.getParent()->getConstantPool();
unsigned ARMPCLabelIndex = 0;		unsigned ARMPCLabelIndex = 0;
MachineConstantPoolValue *CPV;		MachineConstantPoolValue *CPV;

if (IsPIC) {		if (IsPIC) {
unsigned PCAdj = IsARM ? 8 : 4;		unsigned PCAdj = IsARM ? 8 : 4;
auto Modifier = STI->getCPModifier(GV);		auto Modifier = (Flags & ARMII::MO_GOT)
		? ARMCP::GOT_PREL
		: ARMCP::no_modifier;
ARMPCLabelIndex = AFI->createPICLabelUId();		ARMPCLabelIndex = AFI->createPICLabelUId();
CPV = ARMConstantPoolConstant::Create(		CPV = ARMConstantPoolConstant::Create(
GV, ARMPCLabelIndex, ARMCP::CPValue, PCAdj, Modifier,		GV, ARMPCLabelIndex, ARMCP::CPValue, PCAdj, Modifier,
/AddCurrentAddr/ Modifier == ARMCP::GOT_PREL);		/AddCurrentAddr/ Modifier == ARMCP::GOT_PREL);
} else		} else
CPV = ARMConstantPoolConstant::Create(GV, ARMCP::no_modifier);		CPV = ARMConstantPoolConstant::Create(GV, ARMCP::no_modifier);

MachineInstrBuilder MIB =		MachineInstrBuilder MIB =
▲ Show 20 Lines • Show All 388 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,158 Lines • ▼ Show 20 Lines	SDValue ARMTargetLowering::LowerGlobalAddressELF(SDValue Op,

// promoteToConstantPool only if not generating XO text section		// promoteToConstantPool only if not generating XO text section
if (TM.shouldAssumeDSOLocal(*GV->getParent(), GV) && !Subtarget->genExecuteOnly())		if (TM.shouldAssumeDSOLocal(*GV->getParent(), GV) && !Subtarget->genExecuteOnly())
if (SDValue V = promoteToConstantPool(GV, DAG, PtrVT, dl))		if (SDValue V = promoteToConstantPool(GV, DAG, PtrVT, dl))
return V;		return V;

if (isPositionIndependent()) {		if (isPositionIndependent()) {
bool UseGOT_PREL = !TM.shouldAssumeDSOLocal(*GV->getParent(), GV);		bool UseGOT_PREL = !TM.shouldAssumeDSOLocal(*GV->getParent(), GV);
		SDValue G = DAG.getTargetGlobalAddress(GV, dl, PtrVT, 0,
MachineFunction &MF = DAG.getMachineFunction();		UseGOT_PREL ? ARMII::MO_GOT : 0);
ARMFunctionInfo *AFI = MF.getInfo<ARMFunctionInfo>();		SDValue Result = DAG.getNode(ARMISD::WrapperPIC, dl, PtrVT, G);
unsigned ARMPCLabelIndex = AFI->createPICLabelUId();
EVT PtrVT = getPointerTy(DAG.getDataLayout());
SDLoc dl(Op);
unsigned PCAdj = Subtarget->isThumb() ? 4 : 8;
ARMConstantPoolValue *CPV = ARMConstantPoolConstant::Create(
GV, ARMPCLabelIndex, ARMCP::CPValue, PCAdj,
UseGOT_PREL ? ARMCP::GOT_PREL : ARMCP::no_modifier,
/AddCurrentAddress=/UseGOT_PREL);
SDValue CPAddr = DAG.getTargetConstantPool(CPV, PtrVT, 4);
CPAddr = DAG.getNode(ARMISD::Wrapper, dl, MVT::i32, CPAddr);
SDValue Result = DAG.getLoad(
PtrVT, dl, DAG.getEntryNode(), CPAddr,
MachinePointerInfo::getConstantPool(DAG.getMachineFunction()));
SDValue Chain = Result.getValue(1);
SDValue PICLabel = DAG.getConstant(ARMPCLabelIndex, dl, MVT::i32);
Result = DAG.getNode(ARMISD::PIC_ADD, dl, PtrVT, Result, PICLabel);
if (UseGOT_PREL)		if (UseGOT_PREL)
Result =		Result =
DAG.getLoad(PtrVT, dl, Chain, Result,		DAG.getLoad(PtrVT, dl, DAG.getEntryNode(), Result,
MachinePointerInfo::getGOT(DAG.getMachineFunction()));		MachinePointerInfo::getGOT(DAG.getMachineFunction()));
return Result;		return Result;
} else if (Subtarget->isROPI() && IsRO) {		} else if (Subtarget->isROPI() && IsRO) {
// PC-relative.		// PC-relative.
SDValue G = DAG.getTargetGlobalAddress(GV, dl, PtrVT);		SDValue G = DAG.getTargetGlobalAddress(GV, dl, PtrVT);
SDValue Result = DAG.getNode(ARMISD::WrapperPIC, dl, PtrVT, G);		SDValue Result = DAG.getNode(ARMISD::WrapperPIC, dl, PtrVT, G);
return Result;		return Result;
} else if (Subtarget->isRWPI() && !IsRO) {		} else if (Subtarget->isRWPI() && !IsRO) {
▲ Show 20 Lines • Show All 11,055 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMInstrInfo.td

Show First 20 Lines • Show All 326 Lines • ▼ Show 20 Lines	def UseNegativeImmediates :
Predicate<"false">,		Predicate<"false">,
AssemblerPredicate<"!FeatureNoNegativeImmediates",		AssemblerPredicate<"!FeatureNoNegativeImmediates",
"NegativeImmediates">;		"NegativeImmediates">;

// FIXME: Eventually this will be just "hasV6T2Ops".		// FIXME: Eventually this will be just "hasV6T2Ops".
let RecomputePerFunction = 1 in {		let RecomputePerFunction = 1 in {
def UseMovt : Predicate<"Subtarget->useMovt(*MF)">;		def UseMovt : Predicate<"Subtarget->useMovt(*MF)">;
def DontUseMovt : Predicate<"!Subtarget->useMovt(*MF)">;		def DontUseMovt : Predicate<"!Subtarget->useMovt(*MF)">;
		def UseMovtInPic : Predicate<"Subtarget->useMovt(*MF) && Subtarget->allowPositionIndependentMovt()">;
		def DontUseMovtInPic : Predicate<"!Subtarget->useMovt(*MF) \|\| !Subtarget->allowPositionIndependentMovt()">;
}		}
def UseFPVMLx : Predicate<"Subtarget->useFPVMLx()">;		def UseFPVMLx : Predicate<"Subtarget->useFPVMLx()">;
def UseMulOps : Predicate<"Subtarget->useMulOps()">;		def UseMulOps : Predicate<"Subtarget->useMulOps()">;

// Prefer fused MAC for fp mul + add over fp VMLA / VMLS if they are available.		// Prefer fused MAC for fp mul + add over fp VMLA / VMLS if they are available.
// But only select them if more precision in FP computation is allowed.		// But only select them if more precision in FP computation is allowed.
// Do not use them for Darwin platforms.		// Do not use them for Darwin platforms.
def UseFusedMAC : Predicate<"(TM.Options.AllowFPOpFusion =="		def UseFusedMAC : Predicate<"(TM.Options.AllowFPOpFusion =="
▲ Show 20 Lines • Show All 5,296 Lines • ▼ Show 20 Lines
// Pseudo instruction that combines movw + movt + add pc (if PIC).		// Pseudo instruction that combines movw + movt + add pc (if PIC).
// It also makes it possible to rematerialize the instructions.		// It also makes it possible to rematerialize the instructions.
// FIXME: Remove this when we can do generalized remat and when machine licm		// FIXME: Remove this when we can do generalized remat and when machine licm
// can properly the instructions.		// can properly the instructions.
let isReMaterializable = 1 in {		let isReMaterializable = 1 in {
def MOV_ga_pcrel : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),		def MOV_ga_pcrel : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),
IIC_iMOVix2addpc,		IIC_iMOVix2addpc,
[(set GPR:$dst, (ARMWrapperPIC tglobaladdr:$addr))]>,		[(set GPR:$dst, (ARMWrapperPIC tglobaladdr:$addr))]>,
Requires<[IsARM, UseMovt]>;		Requires<[IsARM, UseMovtInPic]>;

def LDRLIT_ga_pcrel : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),		def LDRLIT_ga_pcrel : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),
IIC_iLoadiALU,		IIC_iLoadiALU,
[(set GPR:$dst,		[(set GPR:$dst,
(ARMWrapperPIC tglobaladdr:$addr))]>,		(ARMWrapperPIC tglobaladdr:$addr))]>,
Requires<[IsARM, DontUseMovt]>;		Requires<[IsARM, DontUseMovtInPic]>;

let AddedComplexity = 10 in		let AddedComplexity = 10 in
def LDRLIT_ga_pcrel_ldr : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),		def LDRLIT_ga_pcrel_ldr : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),
NoItinerary,		NoItinerary,
[(set GPR:$dst,		[(set GPR:$dst,
(load (ARMWrapperPIC tglobaladdr:$addr)))]>,		(load (ARMWrapperPIC tglobaladdr:$addr)))]>,
Requires<[IsARM, DontUseMovt]>;		Requires<[IsARM, DontUseMovtInPic]>;

let AddedComplexity = 10 in		let AddedComplexity = 10 in
def MOV_ga_pcrel_ldr : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),		def MOV_ga_pcrel_ldr : PseudoInst<(outs GPR:$dst), (ins i32imm:$addr),
IIC_iMOVix2ld,		IIC_iMOVix2ld,
[(set GPR:$dst, (load (ARMWrapperPIC tglobaladdr:$addr)))]>,		[(set GPR:$dst, (load (ARMWrapperPIC tglobaladdr:$addr)))]>,
Requires<[IsARM, UseMovt]>;		Requires<[IsARM, UseMovtInPic]>;
} // isReMaterializable		} // isReMaterializable

// The many different faces of TLS access.		// The many different faces of TLS access.
def : ARMPat<(ARMWrapper tglobaltlsaddr :$dst),		def : ARMPat<(ARMWrapper tglobaltlsaddr :$dst),
(MOVi32imm tglobaltlsaddr :$dst)>,		(MOVi32imm tglobaltlsaddr :$dst)>,
Requires<[IsARM, UseMovt]>;		Requires<[IsARM, UseMovt]>;

def : Pat<(ARMWrapper tglobaltlsaddr:$src),		def : Pat<(ARMWrapper tglobaltlsaddr:$src),
(LDRLIT_ga_abs tglobaltlsaddr:$src)>,		(LDRLIT_ga_abs tglobaltlsaddr:$src)>,
Requires<[IsARM, DontUseMovt]>;		Requires<[IsARM, DontUseMovt]>;

def : Pat<(ARMWrapperPIC tglobaltlsaddr:$addr),		def : Pat<(ARMWrapperPIC tglobaltlsaddr:$addr),
(MOV_ga_pcrel tglobaltlsaddr:$addr)>, Requires<[IsARM, UseMovt]>;		(MOV_ga_pcrel tglobaltlsaddr:$addr)>, Requires<[IsARM, UseMovtInPic]>;

def : Pat<(ARMWrapperPIC tglobaltlsaddr:$addr),		def : Pat<(ARMWrapperPIC tglobaltlsaddr:$addr),
(LDRLIT_ga_pcrel tglobaltlsaddr:$addr)>,		(LDRLIT_ga_pcrel tglobaltlsaddr:$addr)>,
Requires<[IsARM, DontUseMovt]>;		Requires<[IsARM, DontUseMovtInPic]>;
let AddedComplexity = 10 in		let AddedComplexity = 10 in
def : Pat<(load (ARMWrapperPIC tglobaltlsaddr:$addr)),		def : Pat<(load (ARMWrapperPIC tglobaltlsaddr:$addr)),
(MOV_ga_pcrel_ldr tglobaltlsaddr:$addr)>,		(MOV_ga_pcrel_ldr tglobaltlsaddr:$addr)>,
Requires<[IsARM, UseMovt]>;		Requires<[IsARM, UseMovtInPic]>;


// ConstantPool, GlobalAddress, and JumpTable		// ConstantPool, GlobalAddress, and JumpTable
def : ARMPat<(ARMWrapper tconstpool :$dst), (LEApcrel tconstpool :$dst)>;		def : ARMPat<(ARMWrapper tconstpool :$dst), (LEApcrel tconstpool :$dst)>;
def : ARMPat<(ARMWrapper tglobaladdr :$dst), (MOVi32imm tglobaladdr :$dst)>,		def : ARMPat<(ARMWrapper tglobaladdr :$dst), (MOVi32imm tglobaladdr :$dst)>,
Requires<[IsARM, UseMovt]>;		Requires<[IsARM, UseMovt]>;
def : ARMPat<(ARMWrapper texternalsym :$dst), (MOVi32imm texternalsym :$dst)>,		def : ARMPat<(ARMWrapper texternalsym :$dst), (MOVi32imm texternalsym :$dst)>,
Requires<[IsARM, UseMovt]>;		Requires<[IsARM, UseMovt]>;
▲ Show 20 Lines • Show All 417 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMInstrThumb.td

	Show First 20 Lines • Show All 1,503 Lines • ▼ Show 20 Lines
	// ConstantPool			// ConstantPool
	def : T1Pat<(ARMWrapper tconstpool :$dst), (tLEApcrel tconstpool :$dst)>;			def : T1Pat<(ARMWrapper tconstpool :$dst), (tLEApcrel tconstpool :$dst)>;

	// GlobalAddress			// GlobalAddress
	def tLDRLIT_ga_pcrel : PseudoInst<(outs tGPR:$dst), (ins i32imm:$addr),			def tLDRLIT_ga_pcrel : PseudoInst<(outs tGPR:$dst), (ins i32imm:$addr),
	IIC_iLoadiALU,			IIC_iLoadiALU,
	[(set tGPR:$dst,			[(set tGPR:$dst,
	(ARMWrapperPIC tglobaladdr:$addr))]>,			(ARMWrapperPIC tglobaladdr:$addr))]>,
	Requires<[IsThumb, DontUseMovt]>;			Requires<[IsThumb, DontUseMovtInPic]>;

	def tLDRLIT_ga_abs : PseudoInst<(outs tGPR:$dst), (ins i32imm:$src),			def tLDRLIT_ga_abs : PseudoInst<(outs tGPR:$dst), (ins i32imm:$src),
	IIC_iLoad_i,			IIC_iLoad_i,
	[(set tGPR:$dst,			[(set tGPR:$dst,
	(ARMWrapper tglobaladdr:$src))]>,			(ARMWrapper tglobaladdr:$src))]>,
	Requires<[IsThumb, DontUseMovt]>;			Requires<[IsThumb, DontUseMovt]>;

	// TLS globals			// TLS globals
	def : Pat<(ARMWrapperPIC tglobaltlsaddr:$addr),			def : Pat<(ARMWrapperPIC tglobaltlsaddr:$addr),
	(tLDRLIT_ga_pcrel tglobaltlsaddr:$addr)>,			(tLDRLIT_ga_pcrel tglobaltlsaddr:$addr)>,
	Requires<[IsThumb, DontUseMovt]>;			Requires<[IsThumb, DontUseMovtInPic]>;
	def : Pat<(ARMWrapper tglobaltlsaddr:$addr),			def : Pat<(ARMWrapper tglobaltlsaddr:$addr),
	(tLDRLIT_ga_abs tglobaltlsaddr:$addr)>,			(tLDRLIT_ga_abs tglobaltlsaddr:$addr)>,
	Requires<[IsThumb, DontUseMovt]>;			Requires<[IsThumb, DontUseMovt]>;


	// JumpTable			// JumpTable
	def : T1Pat<(ARMWrapperJT tjumptable:$dst),			def : T1Pat<(ARMWrapperJT tjumptable:$dst),
	(tLEApcrelJT tjumptable:$dst)>;			(tLEApcrelJT tjumptable:$dst)>;
	▲ Show 20 Lines • Show All 159 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMInstrThumb2.td

	Show First 20 Lines • Show All 3,837 Lines • ▼ Show 20 Lines
	// Pseudo instruction that combines movw + movt + add pc (if pic).			// Pseudo instruction that combines movw + movt + add pc (if pic).
	// It also makes it possible to rematerialize the instructions.			// It also makes it possible to rematerialize the instructions.
	// FIXME: Remove this when we can do generalized remat and when machine licm			// FIXME: Remove this when we can do generalized remat and when machine licm
	// can properly the instructions.			// can properly the instructions.
	let isReMaterializable = 1 in {			let isReMaterializable = 1 in {
	def t2MOV_ga_pcrel : PseudoInst<(outs rGPR:$dst), (ins i32imm:$addr),			def t2MOV_ga_pcrel : PseudoInst<(outs rGPR:$dst), (ins i32imm:$addr),
	IIC_iMOVix2addpc,			IIC_iMOVix2addpc,
	[(set rGPR:$dst, (ARMWrapperPIC tglobaladdr:$addr))]>,			[(set rGPR:$dst, (ARMWrapperPIC tglobaladdr:$addr))]>,
	Requires<[IsThumb, HasV8MBaseline, UseMovt]>;			Requires<[IsThumb, HasV8MBaseline, UseMovtInPic]>;

	}			}

	def : T2Pat<(ARMWrapperPIC tglobaltlsaddr :$dst),			def : T2Pat<(ARMWrapperPIC tglobaltlsaddr :$dst),
	(t2MOV_ga_pcrel tglobaltlsaddr:$dst)>,			(t2MOV_ga_pcrel tglobaltlsaddr:$dst)>,
	Requires<[IsThumb2, UseMovt]>;			Requires<[IsThumb2, UseMovtInPic]>;
	def : T2Pat<(ARMWrapper tglobaltlsaddr:$dst),			def : T2Pat<(ARMWrapper tglobaltlsaddr:$dst),
	(t2MOVi32imm tglobaltlsaddr:$dst)>,			(t2MOVi32imm tglobaltlsaddr:$dst)>,
	Requires<[IsThumb2, UseMovt]>;			Requires<[IsThumb2, UseMovt]>;

	// ConstantPool, GlobalAddress, and JumpTable			// ConstantPool, GlobalAddress, and JumpTable
	def : T2Pat<(ARMWrapper tconstpool :$dst), (t2LEApcrel tconstpool :$dst)>;			def : T2Pat<(ARMWrapper tconstpool :$dst), (t2LEApcrel tconstpool :$dst)>;
	def : T2Pat<(ARMWrapper texternalsym :$dst), (t2MOVi32imm texternalsym :$dst)>,			def : T2Pat<(ARMWrapper texternalsym :$dst), (t2MOVi32imm texternalsym :$dst)>,
	Requires<[IsThumb, HasV8MBaseline, UseMovt]>;			Requires<[IsThumb, HasV8MBaseline, UseMovt]>;
	▲ Show 20 Lines • Show All 961 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMInstructionSelector.cpp

Show First 20 Lines • Show All 532 Lines • ▼ Show 20 Lines	if (TM.isPositionIndependent()) {
// FIXME: Taking advantage of MOVT for ELF is pretty involved, so we don't		// FIXME: Taking advantage of MOVT for ELF is pretty involved, so we don't
// support it yet. See PR28229.		// support it yet. See PR28229.
unsigned Opc =		unsigned Opc =
UseMovt && !STI.isTargetELF()		UseMovt && !STI.isTargetELF()
? (Indirect ? ARM::MOV_ga_pcrel_ldr : ARM::MOV_ga_pcrel)		? (Indirect ? ARM::MOV_ga_pcrel_ldr : ARM::MOV_ga_pcrel)
: (Indirect ? ARM::LDRLIT_ga_pcrel_ldr : ARM::LDRLIT_ga_pcrel);		: (Indirect ? ARM::LDRLIT_ga_pcrel_ldr : ARM::LDRLIT_ga_pcrel);
MIB->setDesc(TII.get(Opc));		MIB->setDesc(TII.get(Opc));

		int TargetFlags = ARMII::MO_NO_FLAG;
if (STI.isTargetDarwin())		if (STI.isTargetDarwin())
MIB->getOperand(1).setTargetFlags(ARMII::MO_NONLAZY);		TargetFlags \|= ARMII::MO_NONLAZY;
		if (STI.isGVInGOT(GV))
		TargetFlags \|= ARMII::MO_GOT;
		MIB->getOperand(1).setTargetFlags(TargetFlags);

if (Indirect)		if (Indirect)
MIB.addMemOperand(MF.getMachineMemOperand(		MIB.addMemOperand(MF.getMachineMemOperand(
MachinePointerInfo::getGOT(MF), MachineMemOperand::MOLoad,		MachinePointerInfo::getGOT(MF), MachineMemOperand::MOLoad,
TM.getPointerSize(), Alignment));		TM.getPointerSize(), Alignment));

return constrainSelectedInstRegOperands(*MIB, TII, TRI, RBI);		return constrainSelectedInstRegOperands(*MIB, TII, TRI, RBI);
}		}
▲ Show 20 Lines • Show All 336 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/ARMSubtarget.h

Show First 20 Lines • Show All 746 Lines • ▼ Show 20 Lines	public:
int getPreISelOperandLatencyAdjustment() const {		int getPreISelOperandLatencyAdjustment() const {
return PreISelOperandLatencyAdjustment;		return PreISelOperandLatencyAdjustment;
}		}

/// True if the GV will be accessed via an indirect symbol.		/// True if the GV will be accessed via an indirect symbol.
bool isGVIndirectSymbol(const GlobalValue *GV) const;		bool isGVIndirectSymbol(const GlobalValue *GV) const;

/// Returns the constant pool modifier needed to access the GV.		/// Returns the constant pool modifier needed to access the GV.
ARMCP::ARMCPModifier getCPModifier(const GlobalValue *GV) const;		bool isGVInGOT(const GlobalValue *GV) const;

/// True if fast-isel is used.		/// True if fast-isel is used.
bool useFastISel() const;		bool useFastISel() const;

/// Returns the correct return opcode for the current feature set.		/// Returns the correct return opcode for the current feature set.
/// Use BX if available to allow mixing thumb/arm code, but fall back		/// Use BX if available to allow mixing thumb/arm code, but fall back
/// to plain mov pc,lr on ARMv4.		/// to plain mov pc,lr on ARMv4.
unsigned getReturnOpcode() const {		unsigned getReturnOpcode() const {
if (isThumb())		if (isThumb())
return ARM::tBX_RET;		return ARM::tBX_RET;
if (hasV4TOps())		if (hasV4TOps())
return ARM::BX_RET;		return ARM::BX_RET;
return ARM::MOVPCLR;		return ARM::MOVPCLR;
}		}

		/// Allow movt+movw for PIC global address calculation.
		/// ELF does not have GOT relocations for movt+movw.
		/// ROPI does not use GOT.
		bool allowPositionIndependentMovt() const {
		return isROPI() \|\| !isTargetELF();
		}
};		};

} // end namespace llvm		} // end namespace llvm

#endif // LLVM_LIB_TARGET_ARM_ARMSUBTARGET_H		#endif // LLVM_LIB_TARGET_ARM_ARMSUBTARGET_H

llvm/trunk/lib/Target/ARM/ARMSubtarget.cpp

Show First 20 Lines • Show All 338 Lines • ▼ Show 20 Lines	bool ARMSubtarget::isGVIndirectSymbol(const GlobalValue *GV) const {
// for GVs that are known to be local to the dso.		// for GVs that are known to be local to the dso.
if (isTargetMachO() && TM.isPositionIndependent() &&		if (isTargetMachO() && TM.isPositionIndependent() &&
(GV->isDeclarationForLinker() \|\| GV->hasCommonLinkage()))		(GV->isDeclarationForLinker() \|\| GV->hasCommonLinkage()))
return true;		return true;

return false;		return false;
}		}

ARMCP::ARMCPModifier ARMSubtarget::getCPModifier(const GlobalValue *GV) const {		bool ARMSubtarget::isGVInGOT(const GlobalValue *GV) const {
if (isTargetELF() && TM.isPositionIndependent() &&		return isTargetELF() && TM.isPositionIndependent() &&
!TM.shouldAssumeDSOLocal(*GV->getParent(), GV))		!TM.shouldAssumeDSOLocal(*GV->getParent(), GV);
return ARMCP::GOT_PREL;
return ARMCP::no_modifier;
}		}

unsigned ARMSubtarget::getMispredictionPenalty() const {		unsigned ARMSubtarget::getMispredictionPenalty() const {
return SchedModel.MispredictPenalty;		return SchedModel.MispredictPenalty;
}		}

bool ARMSubtarget::hasSinCos() const {		bool ARMSubtarget::hasSinCos() const {
return isTargetWatchOS() \|\|		return isTargetWatchOS() \|\|
▲ Show 20 Lines • Show All 48 Lines • Show Last 20 Lines

llvm/trunk/lib/Target/ARM/MCTargetDesc/ARMBaseInfo.h

Show First 20 Lines • Show All 222 Lines • ▼ Show 20 Lines	enum TOF {
MO_LO16 = 0x1,		MO_LO16 = 0x1,

/// MO_HI16 - On a symbol operand, this represents a relocation containing		/// MO_HI16 - On a symbol operand, this represents a relocation containing
/// higher 16 bit of the address. Used only via movt instruction.		/// higher 16 bit of the address. Used only via movt instruction.
MO_HI16 = 0x2,		MO_HI16 = 0x2,

/// MO_OPTION_MASK - Most flags are mutually exclusive; this mask selects		/// MO_OPTION_MASK - Most flags are mutually exclusive; this mask selects
/// just that part of the flag set.		/// just that part of the flag set.
MO_OPTION_MASK = 0x0f,		MO_OPTION_MASK = 0x3,

		/// MO_GOT - On a symbol operand, this represents a GOT relative relocation.
		MO_GOT = 0x8,

/// MO_SBREL - On a symbol operand, this represents a static base relative		/// MO_SBREL - On a symbol operand, this represents a static base relative
/// relocation. Used in movw and movt instructions.		/// relocation. Used in movw and movt instructions.
MO_SBREL = 0x10,		MO_SBREL = 0x10,

/// MO_DLLIMPORT - On a symbol operand, this represents that the reference		/// MO_DLLIMPORT - On a symbol operand, this represents that the reference
/// to the symbol is for an import stub. This is used for DLL import		/// to the symbol is for an import stub. This is used for DLL import
/// storage class indication on Windows.		/// storage class indication on Windows.
▲ Show 20 Lines • Show All 169 Lines • Show Last 20 Lines

llvm/trunk/test/CodeGen/ARM/GlobalISel/arm-select-globals-pic.mir

	Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
	registers:			registers:
	- { id: 0, class: gprb }			- { id: 0, class: gprb }
	- { id: 1, class: gprb }			- { id: 1, class: gprb }
	body: \|			body: \|
	bb.0:			bb.0:
	%0(p0) = G_GLOBAL_VALUE @external_global			%0(p0) = G_GLOBAL_VALUE @external_global
	; DARWIN-MOVT: [[G:%[0-9]+]]:gpr = MOV_ga_pcrel_ldr {{.*}} @external_global :: (load 4 from got)			; DARWIN-MOVT: [[G:%[0-9]+]]:gpr = MOV_ga_pcrel_ldr {{.*}} @external_global :: (load 4 from got)
	; DARWIN-NOMOVT: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr {{.*}}@external_global :: (load 4 from got)			; DARWIN-NOMOVT: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr {{.*}}@external_global :: (load 4 from got)
	; ELF: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr @external_global :: (load 4 from got)			; ELF: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr target-flags(<unknown>) @external_global :: (load 4 from got)

	%1(s32) = G_LOAD %0(p0) :: (load 4 from @external_global)			%1(s32) = G_LOAD %0(p0) :: (load 4 from @external_global)
	; CHECK: [[V:%[0-9]+]]:gpr = LDRi12 [[G]], 0, 14, _ :: (load 4 from @external_global)			; CHECK: [[V:%[0-9]+]]:gpr = LDRi12 [[G]], 0, 14, _ :: (load 4 from @external_global)

	%r0 = COPY %1(s32)			%r0 = COPY %1(s32)
	; CHECK: %r0 = COPY [[V]]			; CHECK: %r0 = COPY [[V]]

	BX_RET 14, _, implicit %r0			BX_RET 14, _, implicit %r0
	Show All 35 Lines
	registers:			registers:
	- { id: 0, class: gprb }			- { id: 0, class: gprb }
	- { id: 1, class: gprb }			- { id: 1, class: gprb }
	body: \|			body: \|
	bb.0:			bb.0:
	%0(p0) = G_GLOBAL_VALUE @external_constant			%0(p0) = G_GLOBAL_VALUE @external_constant
	; DARWIN-MOVT: [[G:%[0-9]+]]:gpr = MOV_ga_pcrel_ldr {{.*}} @external_constant :: (load 4 from got)			; DARWIN-MOVT: [[G:%[0-9]+]]:gpr = MOV_ga_pcrel_ldr {{.*}} @external_constant :: (load 4 from got)
	; DARWIN-NOMOVT: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr {{.*}}@external_constant :: (load 4 from got)			; DARWIN-NOMOVT: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr {{.*}}@external_constant :: (load 4 from got)
	; ELF: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr @external_constant :: (load 4 from got)			; ELF: [[G:%[0-9]+]]:gpr = LDRLIT_ga_pcrel_ldr target-flags(<unknown>) @external_constant :: (load 4 from got)

	%1(s32) = G_LOAD %0(p0) :: (load 4 from @external_constant)			%1(s32) = G_LOAD %0(p0) :: (load 4 from @external_constant)
	; CHECK: [[V:%[0-9]+]]:gpr = LDRi12 [[G]], 0, 14, _ :: (load 4 from @external_constant)			; CHECK: [[V:%[0-9]+]]:gpr = LDRi12 [[G]], 0, 14, _ :: (load 4 from @external_constant)

	%r0 = COPY %1(s32)			%r0 = COPY %1(s32)
	; CHECK: %r0 = COPY [[V]]			; CHECK: %r0 = COPY [[V]]

	BX_RET 14, _, implicit %r0			BX_RET 14, _, implicit %r0
	; CHECK: BX_RET 14, _, implicit %r0			; CHECK: BX_RET 14, _, implicit %r0
	...			...

llvm/trunk/test/CodeGen/ARM/load-global2.ll

				; PR35221. Test that external global address is not reloaded from GOT in each BB.
				; RUN: llc < %s -mtriple=armv7-linux-gnueabi -relocation-model=pic \| FileCheck %s -check-prefix=LINUX-PIC

				@x = external global i8, align 1

				define signext i8 @foo() {
				entry:
				; LINUX-PIC: ldr r[[A:.]], .LCPI0_0
				; LINUX-PIC: ldr r[[B:.]], [pc, r[[A]]]
				; LINUX-PIC: ldrb r{{.}}, [r[[B]]]
				%0 = load i8, i8* @x
				%tobool = icmp eq i8 %0, 0
				br i1 %tobool, label %bb1, label %bb2

				bb1:
				call void @bar()
				; No more pc-relative loads! Reuse r[[B]].
				; LINUX-PIC: bl bar
				; LINUX-PIC-NOT: ldr{{.*}}[pc,
				; LINUX-PIC: ldrsb r{{.}}, [r[[B]]]
				%1 = load i8, i8* @x
				ret i8 %1

				bb2:
				ret i8 0
				}

				declare void @bar()

llvm/trunk/test/CodeGen/Thumb2/v8_IT_3.ll

Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	; CHECK: %bb1
%tmp12 = mul nsw i64 %tmp10, %tmp11		%tmp12 = mul nsw i64 %tmp10, %tmp11
%tmp13 = call i32 @foo(i8* getelementptr inbounds ([6 x i8], [6 x i8]* @.str1, i32 0, i32 0), i64 %tmp12, i32 %tmp5) nounwind		%tmp13 = call i32 @foo(i8* getelementptr inbounds ([6 x i8], [6 x i8]* @.str1, i32 0, i32 0), i64 %tmp12, i32 %tmp5) nounwind
br label %bb8		br label %bb8

bb4:		bb4:
; CHECK-PIC: cmp		; CHECK-PIC: cmp
; CHECK-PIC: cmp		; CHECK-PIC: cmp
; CHECK-PIC: cmp		; CHECK-PIC: cmp
; CHECK-PIC-NEXT: bne		; CHECK-PIC: it eq
		; CHECK-PIC-NEXT: ldreq
		; CHECK-PIC-NEXT: it eq
		; CHECK-PIC-NEXT: cmpeq
		; CHECK-PIC-NEXT: beq
; CHECK-PIC: %bb6		; CHECK-PIC: %bb6
; CHECK-PIC-NEXT: movs		; CHECK-PIC-NEXT: movs
; CHECK-PIC-NEXT: add		; CHECK-PIC-NEXT: add
; CHECK-PIC-NEXT: pop		; CHECK-PIC-NEXT: pop
ret i32 0		ret i32 0

bb6:		bb6:
ret i32 1		ret i32 1
Show All 12 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[arm] Fix Unnecessary reloads from GOT.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 122703

llvm/trunk/lib/Target/ARM/ARMExpandPseudoInsts.cpp

llvm/trunk/lib/Target/ARM/ARMISelLowering.cpp

llvm/trunk/lib/Target/ARM/ARMInstrInfo.td

llvm/trunk/lib/Target/ARM/ARMInstrThumb.td

llvm/trunk/lib/Target/ARM/ARMInstrThumb2.td

llvm/trunk/lib/Target/ARM/ARMInstructionSelector.cpp

llvm/trunk/lib/Target/ARM/ARMSubtarget.h

llvm/trunk/lib/Target/ARM/ARMSubtarget.cpp

llvm/trunk/lib/Target/ARM/MCTargetDesc/ARMBaseInfo.h

llvm/trunk/test/CodeGen/ARM/GlobalISel/arm-select-globals-pic.mir

llvm/trunk/test/CodeGen/ARM/load-global2.ll

llvm/trunk/test/CodeGen/Thumb2/v8_IT_3.ll

[arm] Fix Unnecessary reloads from GOT.
ClosedPublic