This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/MC/
-
llvm/
-
MC/
-
MCAssembler.h
-
MCObjectStreamer.h
-
lib/MC/
-
MC/
-
MCAssembler.cpp
-
MCObjectStreamer.cpp

Differential D45164

[MC] Change AsmParser to leverage Assembler during evaluation
ClosedPublic

Authored by niravd on Apr 2 2018, 7:37 AM.

Download Raw Diff

Details

Reviewers

echristo
rnk
probinson
• espindola
peter.smith

Commits

rG6c0665e22174: [MC] Change AsmParser to leverage Assembler during evaluation
rL331218: [MC] Change AsmParser to leverage Assembler during evaluation
rC331218: [MC] Change AsmParser to leverage Assembler during evaluation

Summary

Teach AsmParser to check with Assembler for when evaluating constant expressions. This improves the handing of preprocessor expressions that must be resolved at parse time. This idiom can be found as assembling-time assertion checks in Source-level assemblers. Note that this relies on the MCStreamer to keep sufficient tabs on Section / Fragment information which the MCAsmStreamer does not. As a result the textual output may fail where the equivalent object generation would pass. This can most easily be resolved by folding the MCAsmStreamer and MCObjectStreamer together which is planned for in a separate patch.

Diff Detail

Repository

rL LLVM

Build Status

Buildable 16625
Build 16625: arc lint + arc unit

Event Timeline

niravd created this revision.Apr 2 2018, 7:37 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald TranscriptApr 2 2018, 7:37 AM

[MC] Allow MCAssembler to be constructed without all subcomponents. NFCI.
[MC] Modify MCAsmStreamer to always build MCAssembler. NFCI.

[MC] Change AsmParser to leverage Assembler during evaluation.

Harbormaster completed remote builds in B16625: Diff 140622.Apr 2 2018, 7:40 AM

Harbormaster completed remote builds in B16626: Diff 140623.

Harbormaster completed remote builds in B16627: Diff 140624.

niravd retitled this revision from [MC] Allow MCAssembler to be constructed without all subcomponents. NFCI. to [MC] Change AsmParser to leverage Assembler during evaluation.Apr 2 2018, 7:49 AM

niravd edited the summary of this revision. (Show Details)

For ease of reviewing the history reflects 3 stages of patches:

[MC] Allow MCAssembler to be constructed without all subcomponents (NFCI) which is not guaranteed by MCAsmStreamer
[MC] Modify MCAsmStreamer to always build MCAssembler. NFCI.
[MC] Change AsmParser/Streamers to leverage Assembler during expression evaluation.

niravd added reviewers: echristo, rnk, probinson.Apr 2 2018, 7:56 AM

rnk added a reviewer: • rafael.Apr 2 2018, 10:17 AM

This can most easily be resolved by folding the MCAsmStreamer and MCObjectStreamer together which is planned for in a separate patch.

So, basically, gas directives allow the user to do a lot of reflection. In order to support that reflection, we need to lay out an object file, regardless of whether we're emitting textual assembly or an object.

I checked, and this doesn't appear to affect users of -fno-integrated-as, because the AsmPrinter will bypass the MCStreamer API when that flag is set.

• espindola edited reviewers, added: • espindola; removed: • rafael.Apr 2 2018, 4:42 PM

As a result the textual output may fail where the equivalent object generation would pass.

I don't think that is OK.

Why can't the asm streamer blindly print the entire if to the output?

In D45164#1055062, @espindola wrote:

As a result the textual output may fail where the equivalent object generation would pass.

I don't think that is OK.

It's certainly not ideal but this is at least a somewhat reasonable intermediate point until the follow up patch is finished. The divergence between object and text only happens with preprocessor directives in assembly which should mostly happen with .S files which are probably being assembled directly to object.

The follow up patch to requires merging the various ObjectStreamer and AsmStreamer paths and is rather large.

Why can't the asm streamer blindly print the entire if to the output?

The 'if' is a preprocessor directive and is only valid in the input (.S preprocessor assembly) and not the output (.s preprocessed assembly).

In D45164#1055716, @niravd wrote:

In D45164#1055062, @espindola wrote:

As a result the textual output may fail where the equivalent object generation would pass.

I don't think that is OK.

It's certainly not ideal but this is at least a somewhat reasonable intermediate point until the follow up patch is finished. The divergence between object and text only happens with preprocessor directives in assembly which should mostly happen with .S files which are probably being assembled directly to object.

The follow up patch to requires merging the various ObjectStreamer and AsmStreamer paths and is rather large.

That would cause us to compute offsets when producing a .s file, right? If we must process .if directives instead of printing them, I don't think we have another option.

But I still don't think this is a reasonable intermediary step. Producing equivalent output is a big guarantee of MC with very few exceptions (bugs).

Why can't the asm streamer blindly print the entire if to the output?

The 'if' is a preprocessor directive and is only valid in the input (.S preprocessor assembly) and not the output (.s preprocessed assembly).

Is that a documented restriction?

In D45164#1059883, @espindola wrote:

In D45164#1055716, @niravd wrote:

In D45164#1055062, @espindola wrote:

As a result the textual output may fail where the equivalent object generation would pass.

I don't think that is OK.

It's certainly not ideal but this is at least a somewhat reasonable intermediate point until the follow up patch is finished. The divergence between object and text only happens with preprocessor directives in assembly which should mostly happen with .S files which are probably being assembled directly to object.

The follow up patch to requires merging the various ObjectStreamer and AsmStreamer paths and is rather large.

That would cause us to compute offsets when producing a .s file, right? If we must process .if directives instead of printing them, I don't think we have another option.

This is really tricky too...

If you compute offsets when producing a textual assembly file, except in _very_ limited circumstances where the layout is self-evident and not up to interpretation, you're going to risk getting a different answer than the actual assembler. Consider that X86 has many ways to encode a given instruction, and different assemblers may or may not choose to encode a given textual instruction into the same size output.

For example, llvm used to assemble "movw %cs, (%eax)" as [0x66,0x8c,0x08], instead of [0x8c,0x08]. They mean the same thing, and GNU as has used the short sequence for ages. It would be pretty horrible if you had something like the following input, and at the end of processing through llvm to textual asm, and then GNU as, you ended up with only 2 bytes of output, which shouldn't be possible. (this is a contrived example, yes...)

foo:
  movw %cs, (%eax)
.if . - foo == 2
.byte 0
.endif

I suppose we might be able to emit a textual asm file that uses only ".byte"/".word"/etc directives...instead of textual instructions. Then we _could_ be certain of the size. (Although that may not actually be feasible when relocations are involved? And in any case, super-ugly and I doubt what any user would want to see...)

Why can't the asm streamer blindly print the entire if to the output?

The 'if' is a preprocessor directive and is only valid in the input (.S preprocessor assembly) and not the output (.s preprocessed assembly).

Is that a documented restriction?

I don't see a reason why we couldn't emit an ".if" directive into the output. However, I don't see how emitting the original conditions could really be viable, unless you're just passing the entire input textually through to the output without parsing it at all. Consider that _anything_ can go inside an .if/.endif. E.g. defining a macro, or starting a new section. You'd effectively need to fork the entire assembler state upon seeing such a condition, to assemble each path of the conditional separately, and then output both possibilities. And keep forking, on every conditional in the input. The combinatorics of that would be very unfortunate...

One alternative I see for supporting textual output would be to emit a _verification_ check for the value of every layout-dependent absolute expression which was evaluated during the compile. E.g., in the above example, emit something like:

foo:
  movw %cs, (%eax)
.Ltmp0:

/* Verification of layout assumptions: */
.if .Ltmp0 - foo != 3
.err
.endif

However, even given an implementation strategy that seems like it'd probably work, I'm not really sure supporting this for textual output is really that worthwhile?

Nirav: do you know if this comes up in inline asm in any real world project? If not, perhaps this feature could just be disabled when evaluating llvm inline asm expressions, where the ability to emit a .s file is critical.

But for a standalone assembler -- where this sort of use-case occurs rather frequently -- is it really important that you be able to re-emit textual asm?

It's documented that we output (.s) and I believe this is specifically so we are compatible with assemblers without sufficient preprocessor support. It may be reasonable to add a (.S) output but as it's been pointed out the textual semantics of the preprocessor are not suited for this and
error/warnings quality would almost certainly degrade.

All of the inputs I've seen are .S files; no inline assembly. They've been are limited to .data blocks where there's no ambiguity about sizes (This is what the current patch handles). The gnu assembler does a bit more and handles assembler-dependent preprocessor expressions when the intermediate artifact's sizes are explicitly known (i.e. data and instructions of known size), but it's not clear if the extra capabilities are ever used or needed (The closest I've found is a case where a .fill directive was used to do pad a block with nops but that's utterable in our assembly output currently). Regardless, gas's support rules out all of the tricky cases James mentioned so textual output is reasonable as a output artifact (at least as reasonable as what we have currently).

I have not found any documentation giving guarantees about the correspondence between output types, but it seem natural to me that direct object generation may be more permissive than compilation through assembly hence this patch. There already appear to be additional restrictions in the AsmStreamer (e.g. dwarf CUID) over the ObjectStreamer so this isn't a new thing.

That said, modulo the extra bookkeeping costs for textual assembly additional checks for incomplete assemblers, there's no real reason why MCAsmStreamer and MCObjectStreamer are separate structures and it would be good to eventually merge them.

In D45164#1064411, @niravd wrote:

I have not found any documentation giving guarantees about the correspondence between output types, but it seem natural to me that direct object generation may be more permissive than compilation through assembly hence this patch. There already appear to be additional restrictions in the AsmStreamer (e.g. dwarf CUID) over the ObjectStreamer so this isn't a new thing.

I'm also not that concerned about this difference.

That said, modulo the extra bookkeeping costs for textual assembly additional checks for incomplete assemblers, there's no real reason why MCAsmStreamer and MCObjectStreamer are separate structures and it would be good to eventually merge them.

Why? What would you replace the MCStreamer interface virtual dispatch with? Would MCStreamer become the main implementation, with every method checking if (emitTextualAssembly) OS << ".foo";? That doesn't seem like a clear win.

That said, modulo the extra bookkeeping costs for textual assembly additional checks for incomplete assemblers, there's no real reason why MCAsmStreamer and MCObjectStreamer are separate structures and it would be good to eventually merge them.

Why? What would you replace the MCStreamer interface virtual dispatch with? Would MCStreamer become the main implementation, with every method checking if (emitTextualAssembly) OS << ".foo";? That doesn't seem like a clear win.

Yes. If we're going to require the assembly and object generation must be the same, we're going to need to do equivalent bookkeeping and factoring out just the bookkeeping into a merged seems unreasonable given it depends on enough details from the various ObjectStreamer subclass. Textual output would effectively be a trace emitted during a truncated object generation.

If we're okay with object generation being more permissive than textual generation for assembly files then the only potential issue is changes exposed by inline asm; I believe we do expect compilation to always be able to generate textual assembly if we can generate an object for C compilation.
This could be resolved by disabling assembler-level information for inline assembly.

arichardson added a subscriber: arichardson.Apr 16 2018, 4:25 PM

If I've understood correctly, this will evaluate the expression if there is something simple and unrelaxable such as (Armv7a)

.thumb
start:
nop
end:
.if (end - start == 2)

But not if there may be relaxations involved:

.thumb
start:
ldr r0,=0x12345678 // Relaxable instruction that generates a constant pool.
end:
.if (end - start == 2) // expect error message here

If this is the case, please can there be a test that checks for the error message as I think it is important that we don't accidentally allow these expressions to be evaluated early if their result depends on a later layout pass.

Add tests to verify failure when layout-dependant cases happen.

Thanks for adding the test, it looks like it will cover my concern.

Disable assembler-information for parsing of inline assembly to maintain equivalence of object and assembly generation from llvm ir paths.

Herald added a subscriber: eraman. · View Herald TranscriptApr 19 2018, 10:57 AM

Ping. Just a recap out the state of this patch:

Assembler information is only enabled for compilation from assembly to object files (compilation from LLVM IR/ C will be equivalent between assembly and objects)

I've most of a follow up patch which merges the AsmSstreamer and ObjectStreamer, emitting assertion checks in the simplified assembly where decisions that may not be upheld by the eventual assembler are checked.

With this we can evaluated at parse time relative offset differences in sections with fixed sized values (data and non-relaxable instructions). The GNU assembler appears to to marginally better in that it only requires the bits between the offsets are known size, but appears non-essential and can be added afterwards.

FWIW I'm in favour of this approach, and of merging the Asm and ObjectStreamer as I've recently found a case where this would have been useful (deriving a MCSubtargetInfo when emitting constant pools). There were others with stronger objections though.

Since this is using information inside a single fragment when producing assembly I am OK with it.

Different assemblers will have different relaxations, but looking at offsets of the labels in the same fragment should be ok.

Ping? I think we've a positive consesus. Anyone want to LGTM?

I'm happy to LGTM. Apologies for the delay.

This revision is now accepted and ready to land.Apr 27 2018, 1:49 AM

Closed by commit rC331218: [MC] Change AsmParser to leverage Assembler during evaluation (authored by niravd). · Explain WhyApr 30 2018, 12:26 PM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: cfe-commits. · View Herald TranscriptApr 30 2018, 12:26 PM

MaskRay mentioned this in D153096: [MC] Fold A-B when A's fragment precedes B's fragment.Jun 20 2023, 9:05 PM

MaskRay mentioned this in rGfb294c0612a1: [MC] Fold A-B when A's fragment precedes B's fragment.Jun 22 2023, 12:24 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

MC/

MCAssembler.h

23 lines

MCObjectStreamer.h

3 lines

lib/

MC/

MCAssembler.cpp

43 lines

MCObjectStreamer.cpp

8 lines

Diff 140622

llvm/include/llvm/MC/MCAssembler.h

Show First 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	using VersionInfoType = struct {
unsigned Major;		unsigned Major;
unsigned Minor;		unsigned Minor;
unsigned Update;		unsigned Update;
};		};

private:		private:
MCContext &Context;		MCContext &Context;

MCAsmBackend &Backend;		std::unique_ptr<MCAsmBackend> Backend;

MCCodeEmitter &Emitter;		std::unique_ptr<MCCodeEmitter> Emitter;

MCObjectWriter &Writer;		std::unique_ptr<MCObjectWriter> Writer;

SectionListType Sections;		SectionListType Sections;

SymbolDataListType Symbols;		SymbolDataListType Symbols;

std::vector<IndirectSymbolData> IndirectSymbols;		std::vector<IndirectSymbolData> IndirectSymbols;

std::vector<DataRegionData> DataRegions;		std::vector<DataRegionData> DataRegions;
▲ Show 20 Lines • Show All 94 Lines • ▼ Show 20 Lines	public:
std::vector<std::pair<StringRef, const MCSymbol *>> Symvers;		std::vector<std::pair<StringRef, const MCSymbol *>> Symvers;

/// Construct a new assembler instance.		/// Construct a new assembler instance.
//		//
// FIXME: How are we going to parameterize this? Two obvious options are stay		// FIXME: How are we going to parameterize this? Two obvious options are stay
// concrete and require clients to pass in a target like object. The other		// concrete and require clients to pass in a target like object. The other
// option is to make this abstract, and have targets provide concrete		// option is to make this abstract, and have targets provide concrete
// implementations as we do with AsmParser.		// implementations as we do with AsmParser.
MCAssembler(MCContext &Context, MCAsmBackend &Backend,		MCAssembler(MCContext &Context, std::unique_ptr<MCAsmBackend> Backend,
MCCodeEmitter &Emitter, MCObjectWriter &Writer);		std::unique_ptr<MCCodeEmitter> Emitter,
		std::unique_ptr<MCObjectWriter> Writer);
MCAssembler(const MCAssembler &) = delete;		MCAssembler(const MCAssembler &) = delete;
MCAssembler &operator=(const MCAssembler &) = delete;		MCAssembler &operator=(const MCAssembler &) = delete;
~MCAssembler();		~MCAssembler();

/// Compute the effective fragment size assuming it is laid out at the given		/// Compute the effective fragment size assuming it is laid out at the given
/// \p SectionAddress and \p FragmentOffset.		/// \p SectionAddress and \p FragmentOffset.
uint64_t computeFragmentSize(const MCAsmLayout &Layout,		uint64_t computeFragmentSize(const MCAsmLayout &Layout,
const MCFragment &F) const;		const MCFragment &F) const;
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	public:
}		}

/// Reuse an assembler instance		/// Reuse an assembler instance
///		///
void reset();		void reset();

MCContext &getContext() const { return Context; }		MCContext &getContext() const { return Context; }

MCAsmBackend &getBackend() const { return Backend; }		MCAsmBackend *getBackendPtr() const { return Backend.get(); }

MCCodeEmitter &getEmitter() const { return Emitter; }		MCCodeEmitter *getEmitterPtr() const { return Emitter.get(); }

MCObjectWriter &getWriter() const { return Writer; }		MCObjectWriter *getWriterPtr() const { return Writer.get(); }

		MCAsmBackend &getBackend() const { return *Backend; }

		MCCodeEmitter &getEmitter() const { return *Emitter; }

		MCObjectWriter &getWriter() const { return *Writer; }

MCDwarfLineTableParams getDWARFLinetableParams() const { return LTParams; }		MCDwarfLineTableParams getDWARFLinetableParams() const { return LTParams; }
void setDWARFLinetableParams(MCDwarfLineTableParams P) { LTParams = P; }		void setDWARFLinetableParams(MCDwarfLineTableParams P) { LTParams = P; }

/// Finish - Do final processing and write the object to the output stream.		/// Finish - Do final processing and write the object to the output stream.
/// \p Writer is used for custom object writer (as the MCJIT does),		/// \p Writer is used for custom object writer (as the MCJIT does),
/// if not specified it is automatically created from backend.		/// if not specified it is automatically created from backend.
void Finish();		void Finish();
▲ Show 20 Lines • Show All 156 Lines • Show Last 20 Lines

llvm/include/llvm/MC/MCObjectStreamer.h

	Show All 28 Lines
	/// \brief Streaming object file generation interface.			/// \brief Streaming object file generation interface.
	///			///
	/// This class provides an implementation of the MCStreamer interface which is			/// This class provides an implementation of the MCStreamer interface which is
	/// suitable for use with the assembler backend. Specific object file formats			/// suitable for use with the assembler backend. Specific object file formats
	/// are expected to subclass this interface to implement directives specific			/// are expected to subclass this interface to implement directives specific
	/// to that file format or custom semantics expected by the object writer			/// to that file format or custom semantics expected by the object writer
	/// implementation.			/// implementation.
	class MCObjectStreamer : public MCStreamer {			class MCObjectStreamer : public MCStreamer {
	std::unique_ptr<MCObjectWriter> ObjectWriter;
	std::unique_ptr<MCAsmBackend> TAB;
	std::unique_ptr<MCCodeEmitter> Emitter;
	std::unique_ptr<MCAssembler> Assembler;			std::unique_ptr<MCAssembler> Assembler;
	MCSection::iterator CurInsertionPoint;			MCSection::iterator CurInsertionPoint;
	bool EmitEHFrame;			bool EmitEHFrame;
	bool EmitDebugFrame;			bool EmitDebugFrame;
	SmallVector<MCSymbol *, 2> PendingLabels;			SmallVector<MCSymbol *, 2> PendingLabels;

	virtual void EmitInstToData(const MCInst &Inst, const MCSubtargetInfo&) = 0;			virtual void EmitInstToData(const MCInst &Inst, const MCSubtargetInfo&) = 0;
	void EmitCFIStartProcImpl(MCDwarfFrameInfo &Frame) override;			void EmitCFIStartProcImpl(MCDwarfFrameInfo &Frame) override;
	▲ Show 20 Lines • Show All 143 Lines • Show Last 20 Lines

llvm/lib/MC/MCAssembler.cpp

Show First 20 Lines • Show All 77 Lines • ▼ Show 20 Lines

// FIXME FIXME FIXME: There are number of places in this file where we convert		// FIXME FIXME FIXME: There are number of places in this file where we convert
// what is a 64-bit assembler value used for computation into a value in the		// what is a 64-bit assembler value used for computation into a value in the
// object file, which may truncate it. We should detect that truncation where		// object file, which may truncate it. We should detect that truncation where
// invalid and report errors back.		// invalid and report errors back.

/* *** */		/* *** */

MCAssembler::MCAssembler(MCContext &Context, MCAsmBackend &Backend,		MCAssembler::MCAssembler(MCContext &Context,
MCCodeEmitter &Emitter, MCObjectWriter &Writer)		std::unique_ptr<MCAsmBackend> Backend,
: Context(Context), Backend(Backend), Emitter(Emitter), Writer(Writer),		std::unique_ptr<MCCodeEmitter> Emitter,
		std::unique_ptr<MCObjectWriter> Writer)
		: Context(Context), Backend(std::move(Backend)),
		Emitter(std::move(Emitter)), Writer(std::move(Writer)),
BundleAlignSize(0), RelaxAll(false), SubsectionsViaSymbols(false),		BundleAlignSize(0), RelaxAll(false), SubsectionsViaSymbols(false),
IncrementalLinkerCompatible(false), ELFHeaderEFlags(0) {		IncrementalLinkerCompatible(false), ELFHeaderEFlags(0) {
VersionInfo.Major = 0; // Major version == 0 for "none specified"		VersionInfo.Major = 0; // Major version == 0 for "none specified"
}		}

MCAssembler::~MCAssembler() = default;		MCAssembler::~MCAssembler() = default;

void MCAssembler::reset() {		void MCAssembler::reset() {
Sections.clear();		Sections.clear();
Symbols.clear();		Symbols.clear();
IndirectSymbols.clear();		IndirectSymbols.clear();
DataRegions.clear();		DataRegions.clear();
LinkerOptions.clear();		LinkerOptions.clear();
FileNames.clear();		FileNames.clear();
ThumbFuncs.clear();		ThumbFuncs.clear();
BundleAlignSize = 0;		BundleAlignSize = 0;
RelaxAll = false;		RelaxAll = false;
SubsectionsViaSymbols = false;		SubsectionsViaSymbols = false;
IncrementalLinkerCompatible = false;		IncrementalLinkerCompatible = false;
ELFHeaderEFlags = 0;		ELFHeaderEFlags = 0;
LOHContainer.reset();		LOHContainer.reset();
VersionInfo.Major = 0;		VersionInfo.Major = 0;

// reset objects owned by us		// reset objects owned by us
getBackend().reset();		if (getBackendPtr())
getEmitter().reset();		getBackendPtr()->reset();
getWriter().reset();		if (getEmitterPtr())
		getEmitterPtr()->reset();
		if (getWriterPtr())
		getWriterPtr()->reset();
getLOHContainer().reset();		getLOHContainer().reset();
}		}

bool MCAssembler::registerSection(MCSection &Section) {		bool MCAssembler::registerSection(MCSection &Section) {
if (Section.isRegistered())		if (Section.isRegistered())
return false;		return false;
Sections.push_back(&Section);		Sections.push_back(&Section);
Section.setIsRegistered(true);		Section.setIsRegistered(true);
▲ Show 20 Lines • Show All 86 Lines • ▼ Show 20 Lines	bool MCAssembler::evaluateFixup(const MCAsmLayout &Layout,
if (const MCSymbolRefExpr *RefB = Target.getSymB()) {		if (const MCSymbolRefExpr *RefB = Target.getSymB()) {
if (RefB->getKind() != MCSymbolRefExpr::VK_None) {		if (RefB->getKind() != MCSymbolRefExpr::VK_None) {
Ctx.reportError(Fixup.getLoc(),		Ctx.reportError(Fixup.getLoc(),
"unsupported subtraction of qualified symbol");		"unsupported subtraction of qualified symbol");
return true;		return true;
}		}
}		}

bool IsPCRel = Backend.getFixupKindInfo(		assert(getBackendPtr() && "Expected assembler backend");
Fixup.getKind()).Flags & MCFixupKindInfo::FKF_IsPCRel;		bool IsPCRel = getBackendPtr()->getFixupKindInfo(Fixup.getKind()).Flags &
		MCFixupKindInfo::FKF_IsPCRel;

bool IsResolved;		bool IsResolved;
if (IsPCRel) {		if (IsPCRel) {
if (Target.getSymB()) {		if (Target.getSymB()) {
IsResolved = false;		IsResolved = false;
} else if (!Target.getSymA()) {		} else if (!Target.getSymA()) {
IsResolved = false;		IsResolved = false;
} else {		} else {
Show All 18 Lines	if (Sym.isDefined())
Value += Layout.getSymbolOffset(Sym);		Value += Layout.getSymbolOffset(Sym);
}		}
if (const MCSymbolRefExpr *B = Target.getSymB()) {		if (const MCSymbolRefExpr *B = Target.getSymB()) {
const MCSymbol &Sym = B->getSymbol();		const MCSymbol &Sym = B->getSymbol();
if (Sym.isDefined())		if (Sym.isDefined())
Value -= Layout.getSymbolOffset(Sym);		Value -= Layout.getSymbolOffset(Sym);
}		}

bool ShouldAlignPC = Backend.getFixupKindInfo(Fixup.getKind()).Flags &		bool ShouldAlignPC = getBackend().getFixupKindInfo(Fixup.getKind()).Flags &
MCFixupKindInfo::FKF_IsAlignedDownTo32Bits;		MCFixupKindInfo::FKF_IsAlignedDownTo32Bits;
assert((ShouldAlignPC ? IsPCRel : true) &&		assert((ShouldAlignPC ? IsPCRel : true) &&
"FKF_IsAlignedDownTo32Bits is only allowed on PC-relative fixups!");		"FKF_IsAlignedDownTo32Bits is only allowed on PC-relative fixups!");

if (IsPCRel) {		if (IsPCRel) {
uint32_t Offset = Layout.getFragmentOffset(DF) + Fixup.getOffset();		uint32_t Offset = Layout.getFragmentOffset(DF) + Fixup.getOffset();

// A number of ARM fixups in Thumb mode require that the effective PC		// A number of ARM fixups in Thumb mode require that the effective PC
// address be determined as the 32-bit aligned version of the actual offset.		// address be determined as the 32-bit aligned version of the actual offset.
if (ShouldAlignPC) Offset &= ~0x3;		if (ShouldAlignPC) Offset &= ~0x3;
Value -= Offset;		Value -= Offset;
}		}

// Let the backend force a relocation if needed.		// Let the backend force a relocation if needed.
if (IsResolved && Backend.shouldForceRelocation(*this, Fixup, Target))		if (IsResolved && getBackend().shouldForceRelocation(*this, Fixup, Target))
IsResolved = false;		IsResolved = false;

return IsResolved;		return IsResolved;
}		}

uint64_t MCAssembler::computeFragmentSize(const MCAsmLayout &Layout,		uint64_t MCAssembler::computeFragmentSize(const MCAsmLayout &Layout,
const MCFragment &F) const {		const MCFragment &F) const {
		assert(getBackendPtr() && "Requires assembler backend");
switch (F.getKind()) {		switch (F.getKind()) {
case MCFragment::FT_Data:		case MCFragment::FT_Data:
return cast<MCDataFragment>(F).getContents().size();		return cast<MCDataFragment>(F).getContents().size();
case MCFragment::FT_Relaxable:		case MCFragment::FT_Relaxable:
return cast<MCRelaxableFragment>(F).getContents().size();		return cast<MCRelaxableFragment>(F).getContents().size();
case MCFragment::FT_CompactEncodedInst:		case MCFragment::FT_CompactEncodedInst:
return cast<MCCompactEncodedInstFragment>(F).getContents().size();		return cast<MCCompactEncodedInstFragment>(F).getContents().size();
case MCFragment::FT_Fill: {		case MCFragment::FT_Fill: {
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	void MCAssembler::registerSymbol(const MCSymbol &Symbol, bool *Created) {
if (New) {		if (New) {
Symbol.setIsRegistered(true);		Symbol.setIsRegistered(true);
Symbols.push_back(&Symbol);		Symbols.push_back(&Symbol);
}		}
}		}

void MCAssembler::writeFragmentPadding(const MCFragment &F, uint64_t FSize,		void MCAssembler::writeFragmentPadding(const MCFragment &F, uint64_t FSize,
MCObjectWriter *OW) const {		MCObjectWriter *OW) const {
		assert(getBackendPtr() && "Expected assembler backend");
// Should NOP padding be written out before this fragment?		// Should NOP padding be written out before this fragment?
unsigned BundlePadding = F.getBundlePadding();		unsigned BundlePadding = F.getBundlePadding();
if (BundlePadding > 0) {		if (BundlePadding > 0) {
assert(isBundlingEnabled() &&		assert(isBundlingEnabled() &&
"Writing bundle padding with disabled bundling");		"Writing bundle padding with disabled bundling");
assert(F.hasInstructions() &&		assert(F.hasInstructions() &&
"Writing bundle padding for a fragment without instructions");		"Writing bundle padding for a fragment without instructions");

Show All 17 Lines	if (!getBackend().writeNopData(BundlePadding, OW))
report_fatal_error("unable to write NOP sequence of " +		report_fatal_error("unable to write NOP sequence of " +
Twine(BundlePadding) + " bytes");		Twine(BundlePadding) + " bytes");
}		}
}		}

/// \brief Write the fragment \p F to the output file.		/// \brief Write the fragment \p F to the output file.
static void writeFragment(const MCAssembler &Asm, const MCAsmLayout &Layout,		static void writeFragment(const MCAssembler &Asm, const MCAsmLayout &Layout,
const MCFragment &F) {		const MCFragment &F) {
MCObjectWriter *OW = &Asm.getWriter();		MCObjectWriter *OW = Asm.getWriterPtr();
		assert(OW && "Need ObjectWriter to write fragment");

// FIXME: Embed in fragments instead?		// FIXME: Embed in fragments instead?
uint64_t FragmentSize = Asm.computeFragmentSize(Layout, F);		uint64_t FragmentSize = Asm.computeFragmentSize(Layout, F);

Asm.writeFragmentPadding(F, FragmentSize, OW);		Asm.writeFragmentPadding(F, FragmentSize, OW);

// This variable (and its dummy usage) is to participate in the assert at		// This variable (and its dummy usage) is to participate in the assert at
// the end of the function.		// the end of the function.
▲ Show 20 Lines • Show All 132 Lines • ▼ Show 20 Lines	static void writeFragment(const MCAssembler &Asm, const MCAsmLayout &Layout,
}		}

assert(OW->getStream().tell() - Start == FragmentSize &&		assert(OW->getStream().tell() - Start == FragmentSize &&
"The stream should advance by fragment size");		"The stream should advance by fragment size");
}		}

void MCAssembler::writeSectionData(const MCSection *Sec,		void MCAssembler::writeSectionData(const MCSection *Sec,
const MCAsmLayout &Layout) const {		const MCAsmLayout &Layout) const {
		assert(getBackendPtr() && "Expected assembler backend");

// Ignore virtual sections.		// Ignore virtual sections.
if (Sec->isVirtualSection()) {		if (Sec->isVirtualSection()) {
assert(Layout.getSectionFileSize(Sec) == 0 && "Invalid size for section!");		assert(Layout.getSectionFileSize(Sec) == 0 && "Invalid size for section!");

// Check that contents are only things legal inside a virtual section.		// Check that contents are only things legal inside a virtual section.
for (const MCFragment &F : *Sec) {		for (const MCFragment &F : *Sec) {
switch (F.getKind()) {		switch (F.getKind()) {
default: llvm_unreachable("Invalid fragment in virtual section!");		default: llvm_unreachable("Invalid fragment in virtual section!");
▲ Show 20 Lines • Show All 53 Lines • ▼ Show 20 Lines	if (!IsResolved) {
// writer of the relocation, and give it an opportunity to adjust the		// writer of the relocation, and give it an opportunity to adjust the
// fixup value if need be.		// fixup value if need be.
getWriter().recordRelocation(*this, Layout, &F, Fixup, Target, FixedValue);		getWriter().recordRelocation(*this, Layout, &F, Fixup, Target, FixedValue);
}		}
return std::make_tuple(Target, FixedValue, IsResolved);		return std::make_tuple(Target, FixedValue, IsResolved);
}		}

void MCAssembler::layout(MCAsmLayout &Layout) {		void MCAssembler::layout(MCAsmLayout &Layout) {
		assert(getBackendPtr() && "Expected assembler backend");
DEBUG_WITH_TYPE("mc-dump", {		DEBUG_WITH_TYPE("mc-dump", {
errs() << "assembler backend - pre-layout\n--\n";		errs() << "assembler backend - pre-layout\n--\n";
dump(); });		dump(); });

// Create dummy fragments and assign section ordinals.		// Create dummy fragments and assign section ordinals.
unsigned SectionIndex = 0;		unsigned SectionIndex = 0;
for (MCSection &Sec : *this) {		for (MCSection &Sec : *this) {
// Create dummy fragments to eliminate any empty sections, this simplifies		// Create dummy fragments to eliminate any empty sections, this simplifies
▲ Show 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	void MCAssembler::Finish() {
getWriter().writeObject(*this, Layout);		getWriter().writeObject(*this, Layout);

stats::ObjectBytes += OS.tell() - StartOffset;		stats::ObjectBytes += OS.tell() - StartOffset;
}		}

bool MCAssembler::fixupNeedsRelaxation(const MCFixup &Fixup,		bool MCAssembler::fixupNeedsRelaxation(const MCFixup &Fixup,
const MCRelaxableFragment *DF,		const MCRelaxableFragment *DF,
const MCAsmLayout &Layout) const {		const MCAsmLayout &Layout) const {
		assert(getBackendPtr() && "Expected assembler backend");
MCValue Target;		MCValue Target;
uint64_t Value;		uint64_t Value;
bool Resolved = evaluateFixup(Layout, Fixup, DF, Target, Value);		bool Resolved = evaluateFixup(Layout, Fixup, DF, Target, Value);
if (Target.getSymA() &&		if (Target.getSymA() &&
Target.getSymA()->getKind() == MCSymbolRefExpr::VK_X86_ABS8 &&		Target.getSymA()->getKind() == MCSymbolRefExpr::VK_X86_ABS8 &&
Fixup.getKind() == FK_Data_1)		Fixup.getKind() == FK_Data_1)
return false;		return false;
return getBackend().fixupNeedsRelaxationAdvanced(Fixup, Resolved, Value, DF,		return getBackend().fixupNeedsRelaxationAdvanced(Fixup, Resolved, Value, DF,
Layout);		Layout);
}		}

bool MCAssembler::fragmentNeedsRelaxation(const MCRelaxableFragment *F,		bool MCAssembler::fragmentNeedsRelaxation(const MCRelaxableFragment *F,
const MCAsmLayout &Layout) const {		const MCAsmLayout &Layout) const {
		assert(getBackendPtr() && "Expected assembler backend");
// If this inst doesn't ever need relaxation, ignore it. This occurs when we		// If this inst doesn't ever need relaxation, ignore it. This occurs when we
// are intentionally pushing out inst fragments, or because we relaxed a		// are intentionally pushing out inst fragments, or because we relaxed a
// previous instruction to one that doesn't need relaxation.		// previous instruction to one that doesn't need relaxation.
if (!getBackend().mayNeedRelaxation(F->getInst()))		if (!getBackend().mayNeedRelaxation(F->getInst()))
return false;		return false;

for (const MCFixup &Fixup : F->getFixups())		for (const MCFixup &Fixup : F->getFixups())
if (fixupNeedsRelaxation(Fixup, F, Layout))		if (fixupNeedsRelaxation(Fixup, F, Layout))
return true;		return true;

return false;		return false;
}		}

bool MCAssembler::relaxInstruction(MCAsmLayout &Layout,		bool MCAssembler::relaxInstruction(MCAsmLayout &Layout,
MCRelaxableFragment &F) {		MCRelaxableFragment &F) {
		assert(getEmitterPtr() &&
		"Expected CodeEmitter defined for relaxInstruction");
if (!fragmentNeedsRelaxation(&F, Layout))		if (!fragmentNeedsRelaxation(&F, Layout))
return false;		return false;

++stats::RelaxedInstructions;		++stats::RelaxedInstructions;

// FIXME-PERF: We could immediately lower out instructions if we can tell		// FIXME-PERF: We could immediately lower out instructions if we can tell
// they are fully resolved, to avoid retesting on later passes.		// they are fully resolved, to avoid retesting on later passes.

Show All 16 Lines	bool MCAssembler::relaxInstruction(MCAsmLayout &Layout,
F.getContents() = Code;		F.getContents() = Code;
F.getFixups() = Fixups;		F.getFixups() = Fixups;

return true;		return true;
}		}

bool MCAssembler::relaxPaddingFragment(MCAsmLayout &Layout,		bool MCAssembler::relaxPaddingFragment(MCAsmLayout &Layout,
MCPaddingFragment &PF) {		MCPaddingFragment &PF) {
		assert(getBackendPtr() && "Expected assembler backend");
uint64_t OldSize = PF.getSize();		uint64_t OldSize = PF.getSize();
if (!getBackend().relaxFragment(&PF, Layout))		if (!getBackend().relaxFragment(&PF, Layout))
return false;		return false;
uint64_t NewSize = PF.getSize();		uint64_t NewSize = PF.getSize();

++stats::PaddingFragmentsRelaxations;		++stats::PaddingFragmentsRelaxations;
stats::PaddingFragmentsBytes += NewSize;		stats::PaddingFragmentsBytes += NewSize;
stats::PaddingFragmentsBytes -= OldSize;		stats::PaddingFragmentsBytes -= OldSize;
▲ Show 20 Lines • Show All 128 Lines • ▼ Show 20 Lines	for (iterator it = begin(), ie = end(); it != ie; ++it) {
while (layoutSectionOnce(Layout, Sec))		while (layoutSectionOnce(Layout, Sec))
WasRelaxed = true;		WasRelaxed = true;
}		}

return WasRelaxed;		return WasRelaxed;
}		}

void MCAssembler::finishLayout(MCAsmLayout &Layout) {		void MCAssembler::finishLayout(MCAsmLayout &Layout) {
		assert(getBackendPtr() && "Expected assembler backend");
// The layout is done. Mark every fragment as valid.		// The layout is done. Mark every fragment as valid.
for (unsigned int i = 0, n = Layout.getSectionOrder().size(); i != n; ++i) {		for (unsigned int i = 0, n = Layout.getSectionOrder().size(); i != n; ++i) {
MCSection &Section = *Layout.getSectionOrder()[i];		MCSection &Section = *Layout.getSectionOrder()[i];
Layout.getFragmentOffset(&*Section.rbegin());		Layout.getFragmentOffset(&*Section.rbegin());
computeFragmentSize(Layout, *Section.rbegin());		computeFragmentSize(Layout, *Section.rbegin());
}		}
getBackend().finishLayout(*this, Layout);		getBackend().finishLayout(*this, Layout);
}		}

llvm/lib/MC/MCObjectStreamer.cpp

	Show All 21 Lines
	#include "llvm/Support/ErrorHandling.h"			#include "llvm/Support/ErrorHandling.h"
	#include "llvm/Support/SourceMgr.h"			#include "llvm/Support/SourceMgr.h"
	using namespace llvm;			using namespace llvm;

	MCObjectStreamer::MCObjectStreamer(MCContext &Context,			MCObjectStreamer::MCObjectStreamer(MCContext &Context,
	std::unique_ptr<MCAsmBackend> TAB,			std::unique_ptr<MCAsmBackend> TAB,
	raw_pwrite_stream &OS,			raw_pwrite_stream &OS,
	std::unique_ptr<MCCodeEmitter> Emitter)			std::unique_ptr<MCCodeEmitter> Emitter)
	: MCStreamer(Context), ObjectWriter(TAB->createObjectWriter(OS)),			: MCStreamer(Context),
	TAB(std::move(TAB)), Emitter(std::move(Emitter)),			Assembler(llvm::make_unique<MCAssembler>(Context, std::move(TAB),
	Assembler(llvm::make_unique<MCAssembler>(Context, *this->TAB,			std::move(Emitter),
	this->Emitter, ObjectWriter)),			TAB->createObjectWriter(OS))),
	EmitEHFrame(true), EmitDebugFrame(false) {}			EmitEHFrame(true), EmitDebugFrame(false) {}

	MCObjectStreamer::~MCObjectStreamer() {}			MCObjectStreamer::~MCObjectStreamer() {}

	void MCObjectStreamer::flushPendingLabels(MCFragment *F, uint64_t FOffset) {			void MCObjectStreamer::flushPendingLabels(MCFragment *F, uint64_t FOffset) {
	if (PendingLabels.empty())			if (PendingLabels.empty())
	return;			return;
	if (!F) {			if (!F) {
	▲ Show 20 Lines • Show All 604 Lines • Show Last 20 Lines