This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
CodeGen/
-
CommandFlags.h
-
Target/
-
TargetMachine.h
-
lib/
-
CodeGen/
3/6
CommandFlags.cpp
-
TargetLoweringObjectFileImpl.cpp
-
Target/
1/3
TargetMachine.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
-
code-model-elf-sections.ll
-
tools/llc/
-
llc/
3/8
llc.cpp

Differential D149288

[X86] Introduce a large data threshold for the medium code model
ClosedPublic

Authored by aeubanks on Apr 26 2023, 1:40 PM.

Download Raw Diff

Details

Reviewers

tkoeppe
MaskRay
jyknight
rnk
efriedma

Commits

rG1feb00a28c9f: [X86] Introduce a large data threshold for the medium code model

Summary

Currently clang's medium code model treats all data as large, putting them in a large data section and using more expensive instruction sequences to access them.

Following gcc's -mlarge-data-threshold, which allows putting data under a certain size in a normal data section as opposed to a large data section. This allows using cheaper code sequences to access some portion of data in the binary (which will be implemented in LLVM in a future patch).

And under the medium codel mode, only put data above the large data threshold into large data sections, not all data.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aeubanks created this revision.Apr 26 2023, 1:40 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 26 2023, 1:40 PM

Herald added subscribers: pengfei, hiraditya. · View Herald Transcript

aeubanks requested review of this revision.Apr 26 2023, 1:40 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 26 2023, 1:40 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

depends on D148836

aeubanks added reviewers: tkoeppe, MaskRay, jyknight.Apr 26 2023, 1:40 PM

Harbormaster completed remote builds in B228382: Diff 517301.Apr 26 2023, 1:40 PM

aeubanks mentioned this in D150297: [X86] Use RIP-relative addressing for data under large data threshold for medium code model.May 10 2023, 1:23 PM

large-data-threshold is ABI, and needs to have a default value which is the same as GCC, 65535. (And users should be discouraged from changing it.)

In many common cases you can get away with using different values in different object files, because cross-object references are going via PLT/GOT when the compiler doesn't know the definition is in-DSO. But that's definitely not a 100% solution -- e.g. __attribute__((visibility("hidden"))) or ODR-data defined in a -fPIE object is known to be defined in-DSO, so will use a pc-relative reference.

llvm/lib/Target/TargetMachine.cpp
47	W also must handle unknown-sized objects as large here. E.g. `clang -mcmodel=medium -fPIC` on __attribute__((visibility("hidden"))) extern int x[]; int bar(void) { return x[0]; } must refer to x via GOTOFF not pc-relative, because it _could_ be defined as large.

check size 0

In D149288#4343075, @jyknight wrote:

large-data-threshold is ABI, and needs to have a default value which is the same as GCC, 65535. (And users should be discouraged from changing it.)

In many common cases you can get away with using different values in different object files, because cross-object references are going via PLT/GOT when the compiler doesn't know the definition is in-DSO. But that's definitely not a 100% solution -- e.g. __attribute__((visibility("hidden"))) or ODR-data defined in a -fPIE object is known to be defined in-DSO, so will use a pc-relative reference.

Yes, changing the default value and adding a clang flag will come in a future patch; this patch preserves the current behavior by default

llvm/lib/Target/TargetMachine.cpp
47	ah ok, although that's not observable in this patch since we only use this for determining which section to place a symbol in, not how we reference a symbol (that's D150297). I'll make the change here but add tests in the other patch

Harbormaster completed remote builds in B233699: Diff 524471.May 22 2023, 1:51 PM

aeubanks mentioned this in D148836: [X86] Use "l" prefix for data sections under medium/large code model.May 23 2023, 1:44 PM

MaskRay added inline comments.May 25 2023, 8:27 AM

llvm/lib/CodeGen/CommandFlags.cpp
169	To match GCC, we can use 65536 Consider applying this to large code model as well. https://gcc.gnu.org/pipermail/gcc-patches/2023-May/619695.html
llvm/lib/Target/TargetMachine.cpp
51	I think GCC's condition is `>` instead of `>=`.

tkoeppe added inline comments.May 25 2023, 8:40 AM

llvm/tools/llc/llc.cpp
594	Can we avoid the implicit check for 0 here and spell out `; LDT > 0`? Does LLVM require C++17?

MaskRay added inline comments.May 25 2023, 1:12 PM

llvm/tools/llc/llc.cpp
594	Building llvm-project requires c++17 (libomptarget may be an exception). We can use `if (auto LDT = ...; LDT && LDT > 0)` only if the default is 0.

instead of >=

llvm/lib/CodeGen/CommandFlags.cpp
169	changes to this value will come in a later patch, and same for making the large code model respect this value
llvm/tools/llc/llc.cpp
594	yeah LLVM uses C++17 `codegen::getExplicitLargeDataThreshold` returns a `std::optional`, so we're checking if the user explicitly overrode the value, not if the value is 0. we would want to set the value to 0 if the user explicitly requested it

Harbormaster completed remote builds in B234623: Diff 525770.May 25 2023, 1:24 PM

tkoeppe added inline comments.May 25 2023, 1:30 PM

llvm/tools/llc/llc.cpp
594	Ah yes, sorry, I forgot that it's an optional<int>, not just an int. In that case all I meant was for the check to not be implicit, i.e. I'd recommend: for (auto LDT = ...; LDT.has_vaiue()) { // use *LDT } Just so there's one less implicit thing one needs to keep in mind when reading this.

MaskRay added inline comments.May 25 2023, 1:49 PM

llvm/lib/CodeGen/CommandFlags.cpp
169	This seems strange. Introducing this variable and setting it to an appropriate should come in the same patch, no ? I don't want `clang -mcmodel=medium` emitted data sections for small objects to change from `.data` to `.ldata`, then to `.data` again.

use std::optional<uint64_t> instead of auto

llvm/lib/CodeGen/CommandFlags.cpp
169	this patch doesn't change the current behavior of the medium code model in clang since a large data threshold of zero treats all data as large, which is the current behavior. I'd rather have the implementation in this patch then a separate small patch to actually change the value, especially since not all of the large data threshold functionality is done yet (e.g. D150297)
llvm/tools/llc/llc.cpp
594	I think this is the standard idiom for checking if a `std::optional` has a value, not calling `has_value()`. I did change the type to be explicit though, instead of `auto`

Harbormaster completed remote builds in B234640: Diff 525790.May 25 2023, 2:06 PM

aeubanks added a reviewer: rnk.Jun 1 2023, 12:32 PM

What about storing some kind of "large data attribute" on the global itself in the IR? That would help make sure the medium code model flags don't have to be passed to ThinLTO backend action compilations, for example, and give intuitive semantics to linking together objects compiled in the large, medium, and small code models.

The other consideration is making this feature easy for other frontends (Rust) to use, and your solution does make that easy. The best I can come up with is an LLVM pass or helper function in LLVM that annotates IR globals appropriately given a size threshold from the frontend.

In D149288#4388728, @rnk wrote:

What about storing some kind of "large data attribute" on the global itself in the IR? That would help make sure the medium code model flags don't have to be passed to ThinLTO backend action compilations, for example, and give intuitive semantics to linking together objects compiled in the large, medium, and small code models.

apparently a module metadata for the code model for passing along to LTO backends was already done a while back in https://reviews.llvm.org/D52322

and I verified that you don't need to pass -mcmodel to lld with ThinLTO

aeubanks added a reviewer: efriedma.Sep 13 2023, 11:11 AM

Right, the model should be encoded in the IR for LTO. So as long as we consider the threshold an ABI constraint, and we can accurately compute whether a global has indefinite size, we don't need to tag individual globals.

Tagging individual globals could be useful as an extension, I guess, but it doesn't need to be in this patch.

(Note there's a weird interaction between -mcmodel=medium and the gcc extension that allows initializing flexible arrays: it's possible to define a "small" global that's actually larger than the threshold, and gcc will still treat it as small. But the interaction is consistent with the way this patch works.)

One of our use cases for having a "large" attribute for global data is to have a way to mark instrumentation data as large. We would use this for ASan global metadata and PGO name data, which are not accessed frequently during program execution, but contribute significantly to binary data size.

Regardless, I'm in favor of using a simple module flag for this. It makes it easy for non-Clang frontends. The explicit "large" attribute can come later.

I have a patch that introduces the large data threshold module metadata, but I'd like to separate that patch to keep patches smallish. It has some dependencies so I need to do stacked patches but we're deprecating new patches on phab and GH doesn't support stacked patch reviews yet. Can we move forward with this one first?

rnk added inline comments.Sep 13 2023, 3:56 PM

llvm/tools/llc/llc.cpp
494	Should the threshold be a `TargetOption`? Would that avoid the need for the new TargetMachine field?

aeubanks added inline comments.Sep 13 2023, 5:20 PM

llvm/tools/llc/llc.cpp
494	it should go along with the code model since they're tied together. I do think putting all these options (code model, relocation model) in `TargetOptions` makes sense, it fits with lots of existing options there. if we put all these options into `TargetOptions` it'd simplify the `TargetMachine` constructors. if people think this makes sense I can do that cleanup first

lgtm

llvm/tools/llc/llc.cpp
494	I think what I'd like to do is to minimize the number of changes to the `createTargetMachine` prototype, since it's called in many places. It has many optional parameters, and I suspect that most callers accept the defaults: TargetMachine *createTargetMachine( StringRef TT, StringRef CPU, StringRef Features, const TargetOptions &Options, std::optional<Reloc::Model> RM, // No default, but is std::optional std::optional<CodeModel::Model> CM = std::nullopt, // Optional, with a default. CodeGenOpt::Level OL = CodeGenOpt::Default, // has a default, this will change to an enum class bool JIT = false // Boolean optional parameters are unreadable. ) const { Maybe the first change is to move the relocation and code model into target options. We probably have to leave the optimization level alone, since there are multiple callers of `TargetMachine::setOptLevel`, and it's not really an immutable option, it varies from function to function. That said, if you want to land this now, and defer that work, I'm OK with that, this change doesn't affect the `createTargetMachine` prototype, it's the CodeGenLevel one that does.

This revision is now accepted and ready to land.Sep 14 2023, 10:01 AM

MaskRay accepted this revision.Sep 14 2023, 11:11 AM

MaskRay added inline comments.

llvm/lib/CodeGen/CommandFlags.cpp
168	Perhaps we should say x86-64 medium/large code model. My https://gcc.gnu.org/pipermail/gcc-patches/2023-September/630207.html is still pending after many pings, but I think we can go ahead as the obvious choice.

aeubanks added inline comments.Sep 14 2023, 3:05 PM

llvm/lib/CodeGen/CommandFlags.cpp
168	we can change this to also say "large" once LLVM actually respects it in the large code model

This revision was landed with ongoing or failed builds.Sep 14 2023, 3:09 PM

Closed by commit rG1feb00a28c9f: [X86] Introduce a large data threshold for the medium code model (authored by aeubanks). · Explain Why

This revision was automatically updated to reflect the committed changes.

aeubanks added a commit: rG1feb00a28c9f: [X86] Introduce a large data threshold for the medium code model.

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

CommandFlags.h

3 lines

Target/

TargetMachine.h

4 lines

lib/

CodeGen/

CommandFlags.cpp

7 lines

TargetLoweringObjectFileImpl.cpp

8 lines

Target/

TargetMachine.cpp

6 lines

test/

CodeGen/

X86/

code-model-elf-sections.ll

4 lines

tools/

llc/

llc.cpp

2 lines

Diff 556814

llvm/include/llvm/CodeGen/CommandFlags.h

	Show All 39 Lines
	Reloc::Model getRelocModel();			Reloc::Model getRelocModel();
	std::optional<Reloc::Model> getExplicitRelocModel();			std::optional<Reloc::Model> getExplicitRelocModel();

	ThreadModel::Model getThreadModel();			ThreadModel::Model getThreadModel();

	CodeModel::Model getCodeModel();			CodeModel::Model getCodeModel();
	std::optional<CodeModel::Model> getExplicitCodeModel();			std::optional<CodeModel::Model> getExplicitCodeModel();

				uint64_t getLargeDataThreshold();
				std::optional<uint64_t> getExplicitLargeDataThreshold();

	llvm::ExceptionHandling getExceptionModel();			llvm::ExceptionHandling getExceptionModel();

	std::optional<CodeGenFileType> getExplicitFileType();			std::optional<CodeGenFileType> getExplicitFileType();

	CodeGenFileType getFileType();			CodeGenFileType getFileType();

	FramePointerKind getFramePointerUsage();			FramePointerKind getFramePointerUsage();

	▲ Show 20 Lines • Show All 134 Lines • Show Last 20 Lines

llvm/include/llvm/Target/TargetMachine.h

Show First 20 Lines • Show All 94 Lines • ▼ Show 20 Lines	protected: // Can only create subclasses.
/// Triple string, CPU name, and target feature strings the TargetMachine		/// Triple string, CPU name, and target feature strings the TargetMachine
/// instance is created with.		/// instance is created with.
Triple TargetTriple;		Triple TargetTriple;
std::string TargetCPU;		std::string TargetCPU;
std::string TargetFS;		std::string TargetFS;

Reloc::Model RM = Reloc::Static;		Reloc::Model RM = Reloc::Static;
CodeModel::Model CMModel = CodeModel::Small;		CodeModel::Model CMModel = CodeModel::Small;
		uint64_t LargeDataThreshold = 0;
CodeGenOptLevel OptLevel = CodeGenOptLevel::Default;		CodeGenOptLevel OptLevel = CodeGenOptLevel::Default;

/// Contains target specific asm information.		/// Contains target specific asm information.
std::unique_ptr<const MCAsmInfo> AsmInfo;		std::unique_ptr<const MCAsmInfo> AsmInfo;
std::unique_ptr<const MCRegisterInfo> MRI;		std::unique_ptr<const MCRegisterInfo> MRI;
std::unique_ptr<const MCInstrInfo> MII;		std::unique_ptr<const MCInstrInfo> MII;
std::unique_ptr<const MCSubtargetInfo> STI;		std::unique_ptr<const MCSubtargetInfo> STI;

▲ Show 20 Lines • Show All 122 Lines • ▼ Show 20 Lines	public:
CodeModel::Model getCodeModel() const { return CMModel; }		CodeModel::Model getCodeModel() const { return CMModel; }

/// Returns the maximum code size possible under the code model.		/// Returns the maximum code size possible under the code model.
uint64_t getMaxCodeSize() const;		uint64_t getMaxCodeSize() const;

/// Set the code model.		/// Set the code model.
void setCodeModel(CodeModel::Model CM) { CMModel = CM; }		void setCodeModel(CodeModel::Model CM) { CMModel = CM; }

bool isLargeData() const;		void setLargeDataThreshold(uint64_t LDT) { LargeDataThreshold = LDT; }
		bool isLargeData(const GlobalVariable *GV) const;

bool isPositionIndependent() const;		bool isPositionIndependent() const;

bool shouldAssumeDSOLocal(const Module &M, const GlobalValue *GV) const;		bool shouldAssumeDSOLocal(const Module &M, const GlobalValue *GV) const;

/// Returns true if this target uses emulated TLS.		/// Returns true if this target uses emulated TLS.
bool useEmulatedTLS() const;		bool useEmulatedTLS() const;

▲ Show 20 Lines • Show All 284 Lines • Show Last 20 Lines

llvm/lib/CodeGen/CommandFlags.cpp

Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	#define CGOPT_EXP(TY, NAME) \
}		}

CGOPT(std::string, MArch)		CGOPT(std::string, MArch)
CGOPT(std::string, MCPU)		CGOPT(std::string, MCPU)
CGLIST(std::string, MAttrs)		CGLIST(std::string, MAttrs)
CGOPT_EXP(Reloc::Model, RelocModel)		CGOPT_EXP(Reloc::Model, RelocModel)
CGOPT(ThreadModel::Model, ThreadModel)		CGOPT(ThreadModel::Model, ThreadModel)
CGOPT_EXP(CodeModel::Model, CodeModel)		CGOPT_EXP(CodeModel::Model, CodeModel)
		CGOPT_EXP(uint64_t, LargeDataThreshold)
CGOPT(ExceptionHandling, ExceptionModel)		CGOPT(ExceptionHandling, ExceptionModel)
CGOPT_EXP(CodeGenFileType, FileType)		CGOPT_EXP(CodeGenFileType, FileType)
CGOPT(FramePointerKind, FramePointerUsage)		CGOPT(FramePointerKind, FramePointerUsage)
CGOPT(bool, EnableUnsafeFPMath)		CGOPT(bool, EnableUnsafeFPMath)
CGOPT(bool, EnableNoInfsFPMath)		CGOPT(bool, EnableNoInfsFPMath)
CGOPT(bool, EnableNoNaNsFPMath)		CGOPT(bool, EnableNoNaNsFPMath)
CGOPT(bool, EnableNoSignedZerosFPMath)		CGOPT(bool, EnableNoSignedZerosFPMath)
CGOPT(bool, EnableApproxFuncFPMath)		CGOPT(bool, EnableApproxFuncFPMath)
▲ Show 20 Lines • Show All 88 Lines • ▼ Show 20 Lines	static cl::opt<CodeModel::Model> CodeModel(
"code-model", cl::desc("Choose code model"),		"code-model", cl::desc("Choose code model"),
cl::values(clEnumValN(CodeModel::Tiny, "tiny", "Tiny code model"),		cl::values(clEnumValN(CodeModel::Tiny, "tiny", "Tiny code model"),
clEnumValN(CodeModel::Small, "small", "Small code model"),		clEnumValN(CodeModel::Small, "small", "Small code model"),
clEnumValN(CodeModel::Kernel, "kernel", "Kernel code model"),		clEnumValN(CodeModel::Kernel, "kernel", "Kernel code model"),
clEnumValN(CodeModel::Medium, "medium", "Medium code model"),		clEnumValN(CodeModel::Medium, "medium", "Medium code model"),
clEnumValN(CodeModel::Large, "large", "Large code model")));		clEnumValN(CodeModel::Large, "large", "Large code model")));
CGBINDOPT(CodeModel);		CGBINDOPT(CodeModel);

		static cl::opt<uint64_t> LargeDataThreshold(
		"large-data-threshold",
		cl::desc("Choose large data threshold for x86_64 medium code model"),
		MaskRayUnsubmitted Not Done Reply Inline Actions Perhaps we should say x86-64 medium/large code model. My https://gcc.gnu.org/pipermail/gcc-patches/2023-September/630207.html is still pending after many pings, but I think we can go ahead as the obvious choice. MaskRay: Perhaps we should say x86-64 medium/large code model. My https://gcc.gnu.org/pipermail/gcc…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions we can change this to also say "large" once LLVM actually respects it in the large code model aeubanks: we can change this to also say "large" once LLVM actually respects it in the large code model
		cl::init(0));
		MaskRayUnsubmitted Not Done Reply Inline Actions To match GCC, we can use 65536 Consider applying this to large code model as well. https://gcc.gnu.org/pipermail/gcc-patches/2023-May/619695.html MaskRay: To match GCC, we can use 65536 Consider applying this to large code model as well. https…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions changes to this value will come in a later patch, and same for making the large code model respect this value aeubanks: changes to this value will come in a later patch, and same for making the large code model…
		MaskRayUnsubmitted Not Done Reply Inline Actions This seems strange. Introducing this variable and setting it to an appropriate should come in the same patch, no ? I don't want `clang -mcmodel=medium` emitted data sections for small objects to change from `.data` to `.ldata`, then to `.data` again. MaskRay: This seems strange. Introducing this variable and setting it to an appropriate should come in…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions this patch doesn't change the current behavior of the medium code model in clang since a large data threshold of zero treats all data as large, which is the current behavior. I'd rather have the implementation in this patch then a separate small patch to actually change the value, especially since not all of the large data threshold functionality is done yet (e.g. D150297) aeubanks: this patch doesn't change the current behavior of the medium code model in clang since a large…
		CGBINDOPT(LargeDataThreshold);

static cl::opt<ExceptionHandling> ExceptionModel(		static cl::opt<ExceptionHandling> ExceptionModel(
"exception-model", cl::desc("exception model"),		"exception-model", cl::desc("exception model"),
cl::init(ExceptionHandling::None),		cl::init(ExceptionHandling::None),
cl::values(		cl::values(
clEnumValN(ExceptionHandling::None, "default",		clEnumValN(ExceptionHandling::None, "default",
"default exception handling model"),		"default exception handling model"),
clEnumValN(ExceptionHandling::DwarfCFI, "dwarf",		clEnumValN(ExceptionHandling::DwarfCFI, "dwarf",
"DWARF-like CFI based exception handling"),		"DWARF-like CFI based exception handling"),
▲ Show 20 Lines • Show All 555 Lines • Show Last 20 Lines

llvm/lib/CodeGen/TargetLoweringObjectFileImpl.cpp

Show First 20 Lines • Show All 645 Lines • ▼ Show 20 Lines	if (Kind.isMergeableCString()) {

std::string SizeSpec = ".rodata.str" + utostr(EntrySize) + ".";		std::string SizeSpec = ".rodata.str" + utostr(EntrySize) + ".";
Name = SizeSpec + utostr(Alignment.value());		Name = SizeSpec + utostr(Alignment.value());
} else if (Kind.isMergeableConst()) {		} else if (Kind.isMergeableConst()) {
Name = ".rodata.cst";		Name = ".rodata.cst";
Name += utostr(EntrySize);		Name += utostr(EntrySize);
} else {		} else {
bool IsLarge = false;		bool IsLarge = false;
if (isa<GlobalVariable>(GO))		if (auto *GV = dyn_cast<GlobalVariable>(GO))
IsLarge = TM.isLargeData();		IsLarge = TM.isLargeData(GV);
Name = getSectionPrefixForGlobal(Kind, IsLarge);		Name = getSectionPrefixForGlobal(Kind, IsLarge);
}		}

bool HasPrefix = false;		bool HasPrefix = false;
if (const auto *F = dyn_cast<Function>(GO)) {		if (const auto *F = dyn_cast<Function>(GO)) {
if (std::optional<StringRef> Prefix = F->getSectionPrefix()) {		if (std::optional<StringRef> Prefix = F->getSectionPrefix()) {
raw_svector_ostream(Name) << '.' << *Prefix;		raw_svector_ostream(Name) << '.' << *Prefix;
HasPrefix = true;		HasPrefix = true;
▲ Show 20 Lines • Show All 186 Lines • ▼ Show 20 Lines	static MCSectionELF *selectELFSectionForGlobal(

StringRef Group = "";		StringRef Group = "";
bool IsComdat = false;		bool IsComdat = false;
if (const Comdat *C = getELFComdat(GO)) {		if (const Comdat *C = getELFComdat(GO)) {
Flags \|= ELF::SHF_GROUP;		Flags \|= ELF::SHF_GROUP;
Group = C->getName();		Group = C->getName();
IsComdat = C->getSelectionKind() == Comdat::Any;		IsComdat = C->getSelectionKind() == Comdat::Any;
}		}
if (isa<GlobalVariable>(GO)) {		if (auto *GV = dyn_cast<GlobalVariable>(GO)) {
if (TM.isLargeData()) {		if (TM.isLargeData(GV)) {
assert(TM.getTargetTriple().getArch() == Triple::x86_64);		assert(TM.getTargetTriple().getArch() == Triple::x86_64);
Flags \|= ELF::SHF_X86_64_LARGE;		Flags \|= ELF::SHF_X86_64_LARGE;
}		}
}		}

// Get the section entry size based on the kind.		// Get the section entry size based on the kind.
unsigned EntrySize = getEntrySizeForKind(Kind);		unsigned EntrySize = getEntrySizeForKind(Kind);

▲ Show 20 Lines • Show All 1,813 Lines • Show Last 20 Lines

llvm/lib/Target/TargetMachine.cpp

Show All 33 Lines	TargetMachine::TargetMachine(const Target &T, StringRef DataLayoutString,
const TargetOptions &Options)		const TargetOptions &Options)
: TheTarget(T), DL(DataLayoutString), TargetTriple(TT),		: TheTarget(T), DL(DataLayoutString), TargetTriple(TT),
TargetCPU(std::string(CPU)), TargetFS(std::string(FS)), AsmInfo(nullptr),		TargetCPU(std::string(CPU)), TargetFS(std::string(FS)), AsmInfo(nullptr),
MRI(nullptr), MII(nullptr), STI(nullptr), RequireStructuredCFG(false),		MRI(nullptr), MII(nullptr), STI(nullptr), RequireStructuredCFG(false),
O0WantsFastISel(false), DefaultOptions(Options), Options(Options) {}		O0WantsFastISel(false), DefaultOptions(Options), Options(Options) {}

TargetMachine::~TargetMachine() = default;		TargetMachine::~TargetMachine() = default;

bool TargetMachine::isLargeData() const {		bool TargetMachine::isLargeData(const GlobalVariable *GV) const {
if (getTargetTriple().getArch() != Triple::x86_64)		if (getTargetTriple().getArch() != Triple::x86_64)
return false;		return false;
// Large data under the large code model still needs to be thought about, so		// Large data under the large code model still needs to be thought about, so
// restrict this to medium.		// restrict this to medium.
if (getCodeModel() != CodeModel::Medium)		if (getCodeModel() != CodeModel::Medium)
		jyknightUnsubmitted Not Done Reply Inline Actions W also must handle unknown-sized objects as large here. E.g. `clang -mcmodel=medium -fPIC` on __attribute__((visibility("hidden"))) extern int x[]; int bar(void) { return x[0]; } must refer to x via GOTOFF not pc-relative, because it _could_ be defined as large. jyknight: W also must handle unknown-sized objects as large here. E.g. `clang -mcmodel=medium -fPIC` on…
		aeubanksAuthorUnsubmitted Done Reply Inline Actions ah ok, although that's not observable in this patch since we only use this for determining which section to place a symbol in, not how we reference a symbol (that's D150297). I'll make the change here but add tests in the other patch aeubanks: ah ok, although that's not observable in this patch since we only use this for determining…
return false;		return false;
return true;		const DataLayout &DL = GV->getParent()->getDataLayout();
		uint64_t Size = DL.getTypeSizeInBits(GV->getValueType()) / 8;
		return Size == 0 \|\| Size > LargeDataThreshold;
		MaskRayUnsubmitted Not Done Reply Inline Actions I think GCC's condition is `>` instead of `>=`. MaskRay: I think GCC's condition is `>` instead of `>=`.
}		}

bool TargetMachine::isPositionIndependent() const {		bool TargetMachine::isPositionIndependent() const {
return getRelocationModel() == Reloc::PIC_;		return getRelocationModel() == Reloc::PIC_;
}		}

/// Reset the target options based on the function's attributes.		/// Reset the target options based on the function's attributes.
/// setFunctionAttributes should have made the raw attribute value consistent		/// setFunctionAttributes should have made the raw attribute value consistent
▲ Show 20 Lines • Show All 191 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/code-model-elf-sections.ll

	; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=small -o %t			; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=small -o %t
	; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL			; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL
	; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=medium -o %t			; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=medium -o %t
	; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=LARGE			; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=LARGE
				; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=medium -large-data-threshold=79 -o %t
				; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=LARGE
				; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=medium -large-data-threshold=80 -o %t
				; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL
	; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=large -o %t			; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=large -o %t
	; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL			; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL

	; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=small -data-sections -o %t			; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=small -data-sections -o %t
	; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL-DS			; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=SMALL-DS
	; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=medium -data-sections -o %t			; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=medium -data-sections -o %t
	; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=LARGE-DS			; RUN: llvm-readelf -S %t \| FileCheck %s --check-prefix=LARGE-DS
	; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=large -data-sections -o %t			; RUN: llc < %s -relocation-model=pic -filetype=obj -code-model=large -data-sections -o %t
	Show All 31 Lines

llvm/tools/llc/llc.cpp

Show First 20 Lines • Show All 485 Lines • ▼ Show 20 Lines if (V.consumeInteger(10, Num) || Num == 0 ||

(V.consume_front(".") && !V.consumeInteger(10, Num) && V.empty()))) { (V.consume_front(".") && !V.consumeInteger(10, Num) && V.empty()))) {

WithColor::error(errs(), argv[0]) WithColor::error(errs(), argv[0])

<< "invalid -binutils-version, accepting 'none' or major.minor\n"; << "invalid -binutils-version, accepting 'none' or major.minor\n";

return 1; return 1;

} }

TargetOptions Options; TargetOptions Options;

auto InitializeOptions = [&](const Triple &TheTriple) { auto InitializeOptions = [&](const Triple &TheTriple) {

Options = codegen::InitTargetOptionsFromCodeGenFlags(TheTriple); Options = codegen::InitTargetOptionsFromCodeGenFlags(TheTriple);

rnkUnsubmitted

Not Done

Should the threshold be a TargetOption? Would that avoid the need for the new TargetMachine field?

rnk: Should the threshold be a `TargetOption`? Would that avoid the need for the new TargetMachine…

aeubanksAuthorUnsubmitted

Done

it should go along with the code model since they're tied together. I do think putting all these options (code model, relocation model) in TargetOptions makes sense, it fits with lots of existing options there. if we put all these options into TargetOptions it'd simplify the TargetMachine constructors. if people think this makes sense I can do that cleanup first

aeubanks: it should go along with the code model since they're tied together. I do think putting all…

rnkUnsubmitted

Not Done

I think what I'd like to do is to minimize the number of changes to the createTargetMachine prototype, since it's called in many places. It has many optional parameters, and I suspect that most callers accept the defaults:

TargetMachine *createTargetMachine(
    StringRef TT, StringRef CPU, StringRef Features,
    const TargetOptions &Options,
    std::optional<Reloc::Model> RM, // No default, but is std::optional
    std::optional<CodeModel::Model> CM = std::nullopt, // Optional, with a default.
    CodeGenOpt::Level OL = CodeGenOpt::Default, // has a default, this will change to an enum class
    bool JIT = false // Boolean optional parameters are unreadable.
) const {

Maybe the first change is to move the relocation and code model into target options. We probably have to leave the optimization level alone, since there are multiple callers of TargetMachine::setOptLevel, and it's not really an immutable option, it varies from function to function.

That said, if you want to land this now, and defer that work, I'm OK with that, this change doesn't affect the createTargetMachine prototype, it's the CodeGenLevel one that does.

rnk: I think what I'd like to do is to minimize the number of changes to the `createTargetMachine`…

if (Options.XCOFFReadOnlyPointers) { if (Options.XCOFFReadOnlyPointers) {

if (!TheTriple.isOSAIX()) if (!TheTriple.isOSAIX())

reportError("-mxcoff-roptr option is only supported on AIX", reportError("-mxcoff-roptr option is only supported on AIX",

InputFilename); InputFilename);

// Since the storage mapping class is specified per csect, // Since the storage mapping class is specified per csect,

// without using data sections, it is less effective to use read-only // without using data sections, it is less effective to use read-only

▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines if (!M) {

return 1; return 1;

} }

if (!TargetTriple.empty()) if (!TargetTriple.empty())

M->setTargetTriple(Triple::normalize(TargetTriple)); M->setTargetTriple(Triple::normalize(TargetTriple));

std::optional<CodeModel::Model> CM_IR = M->getCodeModel(); std::optional<CodeModel::Model> CM_IR = M->getCodeModel();

if (!CM && CM_IR) if (!CM && CM_IR)

Target->setCodeModel(*CM_IR); Target->setCodeModel(*CM_IR);

if (std::optional<uint64_t> LDT = codegen::getExplicitLargeDataThreshold())

tkoeppeUnsubmitted

Not Done

Target->setCodeModel(*CM_IR);

- if (auto LDT = codegen::getExplicitLargeDataThreshold())

+ if (auto LDT = codegen::getExplicitLargeDataThreshold(); LDT > 0)

Target->setLargeDataThreshold(*LDT);

Can we avoid the implicit check for 0 here and spell out ; LDT > 0? Does LLVM require C++17?

tkoeppe: Can we avoid the implicit check for 0 here and spell out `; LDT > 0`? Does LLVM require C++17?

aeubanksAuthorUnsubmitted

Done

yeah LLVM uses C++17

codegen::getExplicitLargeDataThreshold returns a std::optional, so we're checking if the user explicitly overrode the value, not if the value is 0. we would want to set the value to 0 if the user explicitly requested it

aeubanks: yeah LLVM uses C++17 `codegen::getExplicitLargeDataThreshold` returns a `std::optional`, so…

MaskRayUnsubmitted

Not Done

Building llvm-project requires c++17 (libomptarget may be an exception).

We can use if (auto LDT = ...; LDT && LDT > 0) only if the default is 0.

MaskRay: Building llvm-project requires c++17 (libomptarget may be an exception). We can use `if (auto…

tkoeppeUnsubmitted

Not Done

Ah yes, sorry, I forgot that it's an optional<int>, not just an int. In that case all I meant was for the check to not be implicit, i.e. I'd recommend:

for (auto LDT = ...; LDT.has_vaiue()) {
  // use *LDT
}

Just so there's one less implicit thing one needs to keep in mind when reading this.

tkoeppe: Ah yes, sorry, I forgot that it's an optional<int>, not just an int. In that case all I meant…

aeubanksAuthorUnsubmitted

Done

I think this is the standard idiom for checking if a std::optional has a value, not calling has_value().
I did change the type to be explicit though, instead of auto

aeubanks: I think this is the standard idiom for checking if a `std::optional` has a value, not calling…

Target->setLargeDataThreshold(*LDT);

} else { } else {

TheTriple = Triple(Triple::normalize(TargetTriple)); TheTriple = Triple(Triple::normalize(TargetTriple));

if (TheTriple.getTriple().empty()) if (TheTriple.getTriple().empty())

TheTriple.setTriple(sys::getDefaultTargetTriple()); TheTriple.setTriple(sys::getDefaultTargetTriple());

// Get the target specific parser. // Get the target specific parser.

std::string Error; std::string Error;

TheTarget = TheTarget =

▲ Show 20 Lines • Show All 185 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[X86] Introduce a large data threshold for the medium code modelClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 556814

llvm/include/llvm/CodeGen/CommandFlags.h

llvm/include/llvm/Target/TargetMachine.h

llvm/lib/CodeGen/CommandFlags.cpp

llvm/lib/CodeGen/TargetLoweringObjectFileImpl.cpp

llvm/lib/Target/TargetMachine.cpp

llvm/test/CodeGen/X86/code-model-elf-sections.ll

llvm/tools/llc/llc.cpp

[X86] Introduce a large data threshold for the medium code model
ClosedPublic