This is an archive of the discontinued LLVM Phabricator instance.

Would it be possible to emit a new Fragment type and leverage the existing NOP emission code in X86AsmBackend.cpp. The code here appears to be copied from X86MCInstLower. Which would mean we would now have 3 places that do almost the same thing.

In D82826#2121595, @craig.topper wrote:

Would it be possible to emit a new Fragment type and leverage the existing NOP emission code in X86AsmBackend.cpp. The code here appears to be copied from X86MCInstLower. Which would mean we would now have 3 places that do almost the same thing.

Thanks for the suggestion. Did not know there was a second copy already. Will try to reuse the code from X86AsmBackend.cpp.

I asked what the largest operand .nops supports https://sourceware.org/bugzilla/show_bug.cgi?id=25789 but did not get a clear answer. Maybe some reviewer can help me ask for a clarification...

llvm/test/MC/X86/x86-directive-nops.s
2	`#`
8	`# X32`

If NopSize is basically equivalent for NumBytes except for 2 cases, then I'd do:

if (!NopSize) {
  print_some_error();
  return;
}
unsigned NopSize = NumBytes;
switch (NumBytes) {
...
default:
  NopSize = 10;
  ...
}

So that you don't have to assign NopSize = NumBytes for each other case.

@craig.topper I tried to add a new Fragment type and use X86AsmBackend::writeNopData to insert NOP instructions. It however changed the outcome of the test cases I used, as X86AsmBackend::writeNopData only generatssingle-byte nop instructions if X86::FeatureNOPL feature bit is not true, while the maxLongNopLength in the current implementation (from X86MCInstLower) allows long nop instructions up to 10 bytes on a 64-bit processor. Which one is preferred? Also, which target triple should I use to test multiple-byte nop instructions? Thanks.

@MaskRay @nickdesaulniers Thanks for the comments. Will refactor the code once we decide which way to go with the implementation.

In D82826#2124581, @jcai19 wrote:

@craig.topper I tried to add a new Fragment type and use X86AsmBackend::writeNopData to insert NOP instructions. It however changed the outcome of the test cases I used, as X86AsmBackend::writeNopData only generatssingle-byte nop instructions if X86::FeatureNOPL feature bit is not true, while the maxLongNopLength in the current implementation (from X86MCInstLower) allows long nop instructions up to 10 bytes on a 64-bit processor. Which one is preferred? Also, which target triple should I use to test multiple-byte nop instructions? Thanks.

That seems like a bug in writeNopData. All real 64-bit CPUs should have FeatureNOPL. But I think llvm-mc and llc default to "generic" as a CPU which isn't a real CPU and doesn't have the feature. clang never uses the "generic". The default 64-bit CPU for clang is "x86-64" which does have the feature. I'll see what happens if we check for 64-bit mode explicitly in writeNopData and post a patch tonight or tomorrow.

Thanks! Checking if the CPU is 64-bit along FeatureNOPL seemed to work. Will verify more. Also I'd like to point out the difference of 32-bit CPUs.

writeNopsData should always use multibyte nops in 64-bit mode now.

is the 32-bit different the 2 byte case?

In D82826#2126190, @craig.topper wrote:

is the 32-bit different the 2 byte case?

Yes.

I have a concern that we haven't figured out the largest operand .nops supports. We should clarify this (https://sourceware.org/bugzilla/show_bug.cgi?id=25789 )

(1) This is a temporary implementation as it breaks the following tests:

LLVM :: MC/COFF/align-nops.s
LLVM :: MC/MachO/x86_32-optimal_nop.s
LLVM :: MC/X86/AlignedBundling/misaligned-bundle-group.s
LLVM :: MC/X86/AlignedBundling/misaligned-bundle.s
LLVM :: MC/X86/align-branch-bundle.s
LLVM :: MC/X86/align-branch-pad-max-prefix.s
LLVM :: MC/X86/x86_long_nop.s
LLVM :: MC/X86/x86_nop.s

The breakage was caused by checking X86::Mode32Bit and X86::Mode64Bit in the newly introduced X86AsmBackend::getMaximumNopSize() function. All the above tests would pass if I removed the two checks, but .nops calls would then emit byte-long nop instructions regardless of its second argument.

(2) Will address error messages in the next iteration.

I'm having trouble posting comments inline; it seems fabricator won't allow me to "save" comments.

Is it too painful to make the member of MCNopsFragment uint64_t? a la MCStreamer::emitFill

craig.topper added inline comments.Jul 1 2020, 5:31 PM

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp
1073	Mode32Bit is only relevant if FeatureNOPL is false. So I think it should be if (!STI.getFeatureBits()[X86::FeatureNOPL] && !STI.getFeatureBits()[X86::Mode64Bit]) return STI.getFeatureBits()[X86::Mode32Bit] ? 2 : 1;

In D82826#2126955, @nickdesaulniers wrote:

I'm having trouble posting comments inline; it seems fabricator won't allow me to "save" comments.

Is it too painful to make the member of MCNopsFragment uint64_t? a la MCStreamer::emitFill

I suppose it's easy to do but I'll have to explicitly cast them to signed integers to check if they are smaller than 0. That is probably more error-prone IMO. WDYT?

Refactor X86AsmBackend::getMaximumNopSize based on @craig.topper's comment.

Harbormaster failed remote builds in B62603: Diff 274963!Jul 1 2020, 6:23 PM

Harbormaster failed remote builds in B62615: Diff 274981!Jul 1 2020, 7:26 PM

Issuing multiple-byte NOP for 32-bit mode broke many tests and probably worth a separate patch itself. So this patch will keep the behavior on 32-bit mode unchange for now. This built and passed all the tests.

Harbormaster failed remote builds in B63095: Diff 275853!Jul 6 2020, 3:47 PM

reames requested changes to this revision.Jul 6 2020, 6:03 PM

reames added inline comments.

llvm/include/llvm/MC/MCFragment.h
358	Can you call this MaxNopLength or something?
llvm/lib/MC/MCAssembler.cpp
625	When parsing asm, you reject negative lengths. Should these simply be asserts?
633	Does this behaviour match existing gnu? I'd have expected the result of specifying a "too large" maximum size to simply clamp to the target's maximum. This is important as if the result is semantic, then the difference between "largest encodeable" and "largest profitable" becomes a thing the rest of the code has to care about. 15 byte nops are almost always legal they're just not fast.
644	This loop is duplicated from within emitNops. Can you pass in a MaxNopLength parameter instead?
llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp
1072	Rename this function to getMaximumProfitableNop() There's a difference between legality and profit here. As commented earlier, if that matters you'll have a harder task implementation wise.
llvm/test/MC/X86/align-branch-bundle.s
9	Having a test delta in a file without .nops is highly suspicious. I'd suggest splitting your patch into a trivial version which emits single byte nops, and an change which adds the multiple byte support. That would allow us to separate the directive mechanics from the interesting profit bits.

This revision now requires changes to proceed.Jul 6 2020, 6:03 PM

jcai19 marked 2 inline comments as done.Jul 7 2020, 1:43 PM

jcai19 added inline comments.

llvm/lib/MC/MCAssembler.cpp
633	Does this behaviour match existing gnu? Appears so. $ cat foo.s .nops 16, 15 $ gcc -c foo.s foo.s: Assembler messages: foo.s:1: Error: invalid single nop size: 15 (expect within [0, 11]) With the patch applied, $ llvm-mc -filetype=obj -triple=x86_64 foo.s foo.s:1:1: error: illegal NOP size 15. (expected within [0, 10]) .nops 16, 15 ^
llvm/test/MC/X86/align-branch-bundle.s
9	How about we also print out instruction bytes here. If 64-bit processors can generate a two-byte long nop instruction here, shouldn't we emit that instead of two single-byte nop? Thanks.

jcai19 marked 2 inline comments as done.Jul 7 2020, 2:39 PM

jcai19 added inline comments.

llvm/lib/MC/MCAssembler.cpp
644	There isn't any loop in emitNops. Do you by any chance refer to the loop in writeNopData? In that case, they are not duplicate as this loop will break total bytes into nop instructions no longer than specified by the second argument of .nops if provided, while the loop in writeNopData makes sure each instruction emitted is no longer than the maximum length allowed by the target.
llvm/test/MC/X86/align-branch-bundle.s
9	On second thought, I agree that splitting the patch is the better approach in case the multiple-byte support causes any regression. Will address this in the next iteration.

Address some of @reames's concerns, and rename variables to avoid confusion.

Harbormaster failed remote builds in B63306: Diff 276226!Jul 7 2020, 2:57 PM

aganea added inline comments.Jul 7 2020, 3:19 PM

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp
1072	Any reason for not reusing `maxLongNopLength()` rather than rewriting the same thing here? https://github.com/llvm/llvm-project/blob/b2eb1c5793d78d70c1223b098aefc87050f69a8c/llvm/lib/Target/X86/X86MCInstLower.cpp#L1085 That function could perhaps be moved to `llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h` ?

Fixed an assertion message.

Harbormaster failed remote builds in B63312: Diff 276235!Jul 7 2020, 3:29 PM

craig.topper added inline comments.Jul 7 2020, 3:33 PM

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp
1072	That function can't be moved as is. It uses X86Subtarget which isn't available to MC. It does something different than for 32-bit mode than what is currently in this patch as that causes additional test failures as discussed elsewhere in this review. That function also uses ProcIntelSLM instead of Feature7ByteNOP. And the FeatureFast flags being set assumes FeatureNOPL is set which is backwards of how it should be. I think the function here is closer to how it should be except for the 32-bit difference.

jcai19 marked an inline comment as done.Jul 7 2020, 3:36 PM

jcai19 added inline comments.

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp
1072	Any reason for not reusing maxLongNopLength() rather than rewriting the same thing here? https://github.com/llvm/llvm-project/blob/b2eb1c5793d78d70c1223b098aefc87050f69a8c/llvm/lib/Target/X86/X86MCInstLower.cpp#L1085 Yes I'm all for merging these two functions although there are some differences on both 32-bit and 64-bit mode that would break some unit tests, such as https://reviews.llvm.org/D82826?id=275853 on 64-bit mode. Maybe we can address that in a separate patch as previously discussed. That function could perhaps be moved to llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h ? SG. Will start to work on merging them once this patch is checked in.

jcai19 marked an inline comment as done.Jul 7 2020, 3:44 PM

jcai19 added inline comments.

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp
1072	@craig.topper Okay I missed your comment. Thanks for the clarification.

@reames @craig.topper Hi just want to double check if there are any additional issues I should address. Thanks.

I think I have a very old (I emphasized it again in another comment) comment which isn't addressed (https://reviews.llvm.org/D82826#2126406 )
We should align with GNU as on the largest operand .nops supports.

In D82826#2154511, @MaskRay wrote:

I think I have a very old (I emphasized it again in another comment) comment which isn't addressed (https://reviews.llvm.org/D82826#2126406 )
We should align with GNU as on the largest operand .nops supports.

I think it's probably better to address that in a separate patch as the this patch is focused on adding support to .nops directive, similar to this comment https://reviews.llvm.org/D82826?id=275853#inline-765965.

craig.topper added inline comments.Jul 22 2020, 5:43 PM

llvm/include/llvm/MC/MCFragment.h
358	Was this comment addressed? I'm not sure what the variable was called when @reames made this comment.
llvm/lib/MC/MCAssembler.cpp
628	Should we put Asm.getBackend().getMaximumNopSize() into a variable? We call it 3 times.
634	Should we clamp the NopLength if we hit the error case? reportError won't stop immediately. Or are we relying on writeNopData to not exceed the maximum size internally?
637	Maybe just use a if statement here for !NopLength
644	I think he was refering to writeNopData. I suppose we could add a limit parameter to writeNopData, but every target would need to be updated for it.
llvm/test/MC/X86/x86-directive-nops-errors.s
6	Please use X86 rather than X32. X32 gets confusing with gnux32 where 32-bit pointers are used on a 64-bit target.

jcai19 marked an inline comment as done.Jul 23 2020, 3:02 PM

jcai19 added inline comments.

llvm/include/llvm/MC/MCFragment.h
358	It was called NopLength. The name was confusing because I called it NopLength here but the same value MaxNopLength somewhere so @reames suggested me to rename it to MaxNopLength . While working on that I realized this value was meant to specify a soft cap on the size limit of a no-op instruction, i.e. the second argument (control) as https://sourceware.org/binutils/docs/as/Nops.html#Nops specified, so I kept the name to avoid the confusion with the function name getMaximumNopSize, which returns a "hard cap" LLVM can accept. Any suggestion on what a better name could be? Also I realized MaxNopLen in the newly introduced emitNops function should be renamed once we make a final decision on the naming. Will address other comments together in the next iteration. Thanks!

Renamed a variable name and addressed comments. @craig.topper Please let me know if it looks any better. And thanks for enabling multibyte-NOPs in 64-bit mode.

Harbormaster completed remote builds in B66597: Diff 282289.Jul 31 2020, 1:42 PM

LGTM

In D82826#2189118, @craig.topper wrote:

LGTM

Thank you!

This revision was not accepted when it landed; it landed in state Needs Review.Aug 3 2020, 11:51 AM

This revision was landed with ongoing or failed builds.

Closed by commit rGc6334db577e7: [X86] support .nops directive (authored by jcai19). · Explain Why

This revision was automatically updated to reflect the committed changes.

jcai19 added a commit: rGc6334db577e7: [X86] support .nops directive.

Revision Contents

Path

Size

llvm/

include/

llvm/

MC/

4 lines

26 lines

1 line

2 lines

lib/

MC/

45 lines

9 lines

10 lines

3 lines

Target/

X86/

AsmParser/

X86AsmParser.cpp

41 lines

MCTargetDesc/

X86AsmBackend.cpp

36 lines

test/

MC/

X86/

align-branch-bundle.s

1 line

align-branch-pad-max-prefix.s

4 lines

x86-directive-nops-errors.s

12 lines

x86-directive-nops.s

12 lines

x86_64-directive-nops.s

19 lines

Diff 275853

llvm/include/llvm/MC/MCAsmBackend.h

Show First 20 Lines • Show All 169 Lines • ▼ Show 20 Lines	public:
/// @}		/// @}

/// Returns the minimum size of a nop in bytes on this target. The assembler		/// Returns the minimum size of a nop in bytes on this target. The assembler
/// will use this to emit excess padding in situations where the padding		/// will use this to emit excess padding in situations where the padding
/// required for simple alignment would be less than the minimum nop size.		/// required for simple alignment would be less than the minimum nop size.
///		///
virtual unsigned getMinimumNopSize() const { return 1; }		virtual unsigned getMinimumNopSize() const { return 1; }

		/// Returns the maximum size of a nop in bytes on this target.
		///
		virtual unsigned getMaximumNopSize() const { return 0; }

/// Write an (optimal) nop sequence of Count bytes to the given output. If the		/// Write an (optimal) nop sequence of Count bytes to the given output. If the
/// target cannot generate such a sequence, it should return an error.		/// target cannot generate such a sequence, it should return an error.
///		///
/// \return - True on success.		/// \return - True on success.
virtual bool writeNopData(raw_ostream &OS, uint64_t Count) const = 0;		virtual bool writeNopData(raw_ostream &OS, uint64_t Count) const = 0;

/// Give backend an opportunity to finish layout after relaxation		/// Give backend an opportunity to finish layout after relaxation
virtual void finishLayout(MCAssembler const &Asm,		virtual void finishLayout(MCAssembler const &Asm,
Show All 20 Lines

llvm/include/llvm/MC/MCFragment.h

Show All 31 Lines	class MCFragment : public ilist_node_with_parent<MCFragment, MCSection> {
friend class MCAsmLayout;		friend class MCAsmLayout;

public:		public:
enum FragmentType : uint8_t {		enum FragmentType : uint8_t {
FT_Align,		FT_Align,
FT_Data,		FT_Data,
FT_CompactEncodedInst,		FT_CompactEncodedInst,
FT_Fill,		FT_Fill,
		FT_Nops,
FT_Relaxable,		FT_Relaxable,
FT_Org,		FT_Org,
FT_Dwarf,		FT_Dwarf,
FT_DwarfFrame,		FT_DwarfFrame,
FT_LEB,		FT_LEB,
FT_BoundaryAlign,		FT_BoundaryAlign,
FT_SymbolId,		FT_SymbolId,
FT_CVInlineLines,		FT_CVInlineLines,
▲ Show 20 Lines • Show All 297 Lines • ▼ Show 20 Lines	public:

SMLoc getLoc() const { return Loc; }		SMLoc getLoc() const { return Loc; }

static bool classof(const MCFragment *F) {		static bool classof(const MCFragment *F) {
return F->getKind() == MCFragment::FT_Fill;		return F->getKind() == MCFragment::FT_Fill;
}		}
};		};

		class MCNopsFragment : public MCFragment {
		/// The number of bytes to insert.
		int64_t Size;
		/// Maximum number of bytes of each instruction.
		int64_t NopLength;
		reamesUnsubmitted Not Done Reply Inline Actions Can you call this MaxNopLength or something? reames: Can you call this MaxNopLength or something?
		craig.topperUnsubmitted Not Done Reply Inline Actions Was this comment addressed? I'm not sure what the variable was called when @reames made this comment. craig.topper: Was this comment addressed? I'm not sure what the variable was called when @reames made this…
		jcai19AuthorUnsubmitted Done Reply Inline Actions It was called NopLength. The name was confusing because I called it NopLength here but the same value MaxNopLength somewhere so @reames suggested me to rename it to MaxNopLength . While working on that I realized this value was meant to specify a soft cap on the size limit of a no-op instruction, i.e. the second argument (control) as https://sourceware.org/binutils/docs/as/Nops.html#Nops specified, so I kept the name to avoid the confusion with the function name getMaximumNopSize, which returns a "hard cap" LLVM can accept. Any suggestion on what a better name could be? Also I realized MaxNopLen in the newly introduced emitNops function should be renamed once we make a final decision on the naming. Will address other comments together in the next iteration. Thanks! jcai19: It was called NopLength. The name was confusing because I called it NopLength here but the same…

		/// Source location of the directive that this fragment was created for.
		SMLoc Loc;

		public:
		MCNopsFragment(int64_t NumBytes, int64_t MaxNopLength, SMLoc L,
		MCSection *Sec = nullptr)
		: MCFragment(FT_Nops, false, Sec), Size(NumBytes),
		NopLength(MaxNopLength), Loc(L) {}

		int64_t getNumBytes() const { return Size; }
		int64_t getMaxNopLength() const { return NopLength; }

		SMLoc getLoc() const { return Loc; }

		static bool classof(const MCFragment *F) {
		return F->getKind() == MCFragment::FT_Nops;
		}
		};

class MCOrgFragment : public MCFragment {		class MCOrgFragment : public MCFragment {
/// Value to use for filling bytes.		/// Value to use for filling bytes.
int8_t Value;		int8_t Value;

/// The offset this fragment should start at.		/// The offset this fragment should start at.
const MCExpr *Offset;		const MCExpr *Offset;

/// Source location of the directive that this fragment was created for.		/// Source location of the directive that this fragment was created for.
▲ Show 20 Lines • Show All 203 Lines • Show Last 20 Lines

llvm/include/llvm/MC/MCObjectStreamer.h

Show First 20 Lines • Show All 173 Lines • ▼ Show 20 Lines	public:
bool emitRelocDirective(const MCExpr &Offset, StringRef Name,		bool emitRelocDirective(const MCExpr &Offset, StringRef Name,
const MCExpr *Expr, SMLoc Loc,		const MCExpr *Expr, SMLoc Loc,
const MCSubtargetInfo &STI) override;		const MCSubtargetInfo &STI) override;
using MCStreamer::emitFill;		using MCStreamer::emitFill;
void emitFill(const MCExpr &NumBytes, uint64_t FillValue,		void emitFill(const MCExpr &NumBytes, uint64_t FillValue,
SMLoc Loc = SMLoc()) override;		SMLoc Loc = SMLoc()) override;
void emitFill(const MCExpr &NumValues, int64_t Size, int64_t Expr,		void emitFill(const MCExpr &NumValues, int64_t Size, int64_t Expr,
SMLoc Loc = SMLoc()) override;		SMLoc Loc = SMLoc()) override;
		void emitNops(int64_t NumBytes, int64_t MaxNopLen, SMLoc Loc) override;
void emitFileDirective(StringRef Filename) override;		void emitFileDirective(StringRef Filename) override;

void emitAddrsig() override;		void emitAddrsig() override;
void emitAddrsigSym(const MCSymbol *Sym) override;		void emitAddrsigSym(const MCSymbol *Sym) override;

void finishImpl() override;		void finishImpl() override;

/// Emit the absolute difference between two symbols if possible.		/// Emit the absolute difference between two symbols if possible.
Show All 20 Lines

llvm/include/llvm/MC/MCStreamer.h

Show First 20 Lines • Show All 761 Lines • ▼ Show 20 Lines	public:
/// This is used to implement assembler directives such as .fill.		/// This is used to implement assembler directives such as .fill.
///		///
/// \param NumValues - The number of copies of \p Size bytes to emit.		/// \param NumValues - The number of copies of \p Size bytes to emit.
/// \param Size - The size (in bytes) of each repeated value.		/// \param Size - The size (in bytes) of each repeated value.
/// \param Expr - The expression from which \p Size bytes are used.		/// \param Expr - The expression from which \p Size bytes are used.
virtual void emitFill(const MCExpr &NumValues, int64_t Size, int64_t Expr,		virtual void emitFill(const MCExpr &NumValues, int64_t Size, int64_t Expr,
SMLoc Loc = SMLoc());		SMLoc Loc = SMLoc());

		virtual void emitNops(int64_t NumBytes, int64_t MaxNopLen, SMLoc Loc);

/// Emit NumBytes worth of zeros.		/// Emit NumBytes worth of zeros.
/// This function properly handles data in virtual sections.		/// This function properly handles data in virtual sections.
void emitZeros(uint64_t NumBytes);		void emitZeros(uint64_t NumBytes);

/// Emit some number of copies of \p Value until the byte alignment \p		/// Emit some number of copies of \p Value until the byte alignment \p
/// ByteAlignment is reached.		/// ByteAlignment is reached.
///		///
/// If the number of bytes need to emit for the alignment is not a multiple		/// If the number of bytes need to emit for the alignment is not a multiple
▲ Show 20 Lines • Show All 300 Lines • Show Last 20 Lines

llvm/lib/MC/MCAssembler.cpp

Show First 20 Lines • Show All 56 Lines • ▼ Show 20 Lines
STATISTIC(EmittedDataFragments,		STATISTIC(EmittedDataFragments,
"Number of emitted assembler fragments - data");		"Number of emitted assembler fragments - data");
STATISTIC(EmittedCompactEncodedInstFragments,		STATISTIC(EmittedCompactEncodedInstFragments,
"Number of emitted assembler fragments - compact encoded inst");		"Number of emitted assembler fragments - compact encoded inst");
STATISTIC(EmittedAlignFragments,		STATISTIC(EmittedAlignFragments,
"Number of emitted assembler fragments - align");		"Number of emitted assembler fragments - align");
STATISTIC(EmittedFillFragments,		STATISTIC(EmittedFillFragments,
"Number of emitted assembler fragments - fill");		"Number of emitted assembler fragments - fill");
STATISTIC(EmittedOrgFragments,		STATISTIC(EmittedNopsFragments, "Number of emitted assembler fragments - nops");
"Number of emitted assembler fragments - org");		STATISTIC(EmittedOrgFragments, "Number of emitted assembler fragments - org");
STATISTIC(evaluateFixup, "Number of evaluated fixups");		STATISTIC(evaluateFixup, "Number of evaluated fixups");
STATISTIC(FragmentLayouts, "Number of fragment layouts");		STATISTIC(FragmentLayouts, "Number of fragment layouts");
STATISTIC(ObjectBytes, "Number of emitted object file bytes");		STATISTIC(ObjectBytes, "Number of emitted object file bytes");
STATISTIC(RelaxationSteps, "Number of assembler layout and relaxation steps");		STATISTIC(RelaxationSteps, "Number of assembler layout and relaxation steps");
STATISTIC(RelaxedInstructions, "Number of relaxed instructions");		STATISTIC(RelaxedInstructions, "Number of relaxed instructions");

} // end namespace stats		} // end namespace stats
} // end anonymous namespace		} // end anonymous namespace
▲ Show 20 Lines • Show All 232 Lines • ▼ Show 20 Lines	case MCFragment::FT_Fill: {
int64_t Size = NumValues * FF.getValueSize();		int64_t Size = NumValues * FF.getValueSize();
if (Size < 0) {		if (Size < 0) {
getContext().reportError(FF.getLoc(), "invalid number of bytes");		getContext().reportError(FF.getLoc(), "invalid number of bytes");
return 0;		return 0;
}		}
return Size;		return Size;
}		}

		case MCFragment::FT_Nops:
		return cast<MCNopsFragment>(F).getNumBytes();

case MCFragment::FT_LEB:		case MCFragment::FT_LEB:
return cast<MCLEBFragment>(F).getContents().size();		return cast<MCLEBFragment>(F).getContents().size();

case MCFragment::FT_BoundaryAlign:		case MCFragment::FT_BoundaryAlign:
return cast<MCBoundaryAlignFragment>(F).getSize();		return cast<MCBoundaryAlignFragment>(F).getSize();

case MCFragment::FT_SymbolId:		case MCFragment::FT_SymbolId:
return 4;		return 4;
▲ Show 20 Lines • Show All 285 Lines • ▼ Show 20 Lines	case MCFragment::FT_Fill: {

// do remainder if needed.		// do remainder if needed.
unsigned TrailingCount = FragmentSize % ChunkSize;		unsigned TrailingCount = FragmentSize % ChunkSize;
if (TrailingCount)		if (TrailingCount)
OS.write(Data, TrailingCount);		OS.write(Data, TrailingCount);
break;		break;
}		}

		case MCFragment::FT_Nops: {
		++stats::EmittedNopsFragments;
		const MCNopsFragment &NF = cast<MCNopsFragment>(F);
		int64_t NumBytes = NF.getNumBytes();
		int64_t MaxNopLength = NF.getMaxNopLength();

		if (NumBytes < 0) {
		reamesUnsubmitted Not Done Reply Inline Actions When parsing asm, you reject negative lengths. Should these simply be asserts? reames: When parsing asm, you reject negative lengths. Should these simply be asserts?
		Asm.getContext().reportError(
		NF.getLoc(),
		"expected positive NOPs fragment size.");
		craig.topperUnsubmitted Not Done Reply Inline Actions Should we put Asm.getBackend().getMaximumNopSize() into a variable? We call it 3 times. craig.topper: Should we put Asm.getBackend().getMaximumNopSize() into a variable? We call it 3 times.
		break;
		}

		if (MaxNopLength < 0 \|\|
		MaxNopLength > Asm.getBackend().getMaximumNopSize()) {
		reamesUnsubmitted Not Done Reply Inline Actions Does this behaviour match existing gnu? I'd have expected the result of specifying a "too large" maximum size to simply clamp to the target's maximum. This is important as if the result is semantic, then the difference between "largest encodeable" and "largest profitable" becomes a thing the rest of the code has to care about. 15 byte nops are almost always legal they're just not fast. reames: Does this behaviour match existing gnu? I'd have expected the result of specifying a "too…
		jcai19AuthorUnsubmitted Done Reply Inline Actions Does this behaviour match existing gnu? Appears so. $ cat foo.s .nops 16, 15 $ gcc -c foo.s foo.s: Assembler messages: foo.s:1: Error: invalid single nop size: 15 (expect within [0, 11]) With the patch applied, $ llvm-mc -filetype=obj -triple=x86_64 foo.s foo.s:1:1: error: illegal NOP size 15. (expected within [0, 10]) .nops 16, 15 ^ jcai19: > Does this behaviour match existing gnu? Appears so. $ cat foo.s .nops 16, 15 $ gcc -c foo.
		Asm.getContext().reportError(
		craig.topperUnsubmitted Not Done Reply Inline Actions Should we clamp the NopLength if we hit the error case? reportError won't stop immediately. Or are we relying on writeNopData to not exceed the maximum size internally? craig.topper: Should we clamp the NopLength if we hit the error case? reportError won't stop immediately. Or…
		NF.getLoc(),
		"illegal NOP size " +
		std::to_string(MaxNopLength) + ". (expected within [0, " +
		craig.topperUnsubmitted Not Done Reply Inline Actions Maybe just use a if statement here for !NopLength craig.topper: Maybe just use a if statement here for !NopLength
		std::to_string(Asm.getBackend().getMaximumNopSize()) + "])");
		}

		// Use maximum value if the size of each NOP is not specified
		MaxNopLength = !MaxNopLength ? Asm.getBackend().getMaximumNopSize() : MaxNopLength;

		while (NumBytes) {
		reamesUnsubmitted Not Done Reply Inline Actions This loop is duplicated from within emitNops. Can you pass in a MaxNopLength parameter instead? reames: This loop is duplicated from within emitNops. Can you pass in a MaxNopLength parameter instead?
		jcai19AuthorUnsubmitted Done Reply Inline Actions There isn't any loop in emitNops. Do you by any chance refer to the loop in writeNopData? In that case, they are not duplicate as this loop will break total bytes into nop instructions no longer than specified by the second argument of .nops if provided, while the loop in writeNopData makes sure each instruction emitted is no longer than the maximum length allowed by the target. jcai19: There isn't any loop in emitNops. Do you by any chance refer to the loop in writeNopData? In…
		craig.topperUnsubmitted Not Done Reply Inline Actions I think he was refering to writeNopData. I suppose we could add a limit parameter to writeNopData, but every target would need to be updated for it. craig.topper: I think he was refering to writeNopData. I suppose we could add a limit parameter to…
		uint64_t NopsToEmit = (uint64_t)std::min(NumBytes, MaxNopLength);
		assert(NopsToEmit && "try to emit empty NOP instruction");
		if (!Asm.getBackend().writeNopData(OS, NopsToEmit)) {
		report_fatal_error("unable to write nop sequence of the remaining " +
		Twine(NumBytes) + " bytes");
		break;
		}
		NumBytes -= NopsToEmit;
		}
		break;
		}

case MCFragment::FT_LEB: {		case MCFragment::FT_LEB: {
const MCLEBFragment &LF = cast<MCLEBFragment>(F);		const MCLEBFragment &LF = cast<MCLEBFragment>(F);
OS << LF.getContents();		OS << LF.getContents();
break;		break;
}		}

case MCFragment::FT_BoundaryAlign: {		case MCFragment::FT_BoundaryAlign: {
if (!Asm.getBackend().writeNopData(OS, FragmentSize))		if (!Asm.getBackend().writeNopData(OS, FragmentSize))
▲ Show 20 Lines • Show All 586 Lines • Show Last 20 Lines

llvm/lib/MC/MCFragment.cpp

Show First 20 Lines • Show All 273 Lines • ▼ Show 20 Lines	case FT_Data:
delete cast<MCDataFragment>(this);		delete cast<MCDataFragment>(this);
return;		return;
case FT_CompactEncodedInst:		case FT_CompactEncodedInst:
delete cast<MCCompactEncodedInstFragment>(this);		delete cast<MCCompactEncodedInstFragment>(this);
return;		return;
case FT_Fill:		case FT_Fill:
delete cast<MCFillFragment>(this);		delete cast<MCFillFragment>(this);
return;		return;
		case FT_Nops:
		delete cast<MCNopsFragment>(this);
		return;
case FT_Relaxable:		case FT_Relaxable:
delete cast<MCRelaxableFragment>(this);		delete cast<MCRelaxableFragment>(this);
return;		return;
case FT_Org:		case FT_Org:
delete cast<MCOrgFragment>(this);		delete cast<MCOrgFragment>(this);
return;		return;
case FT_Dwarf:		case FT_Dwarf:
delete cast<MCDwarfLineAddrFragment>(this);		delete cast<MCDwarfLineAddrFragment>(this);
▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines	LLVM_DUMP_METHOD void MCFragment::dump() const {
}		}
case MCFragment::FT_Fill: {		case MCFragment::FT_Fill: {
const auto *FF = cast<MCFillFragment>(this);		const auto *FF = cast<MCFillFragment>(this);
OS << " Value:" << static_cast<unsigned>(FF->getValue())		OS << " Value:" << static_cast<unsigned>(FF->getValue())
<< " ValueSize:" << static_cast<unsigned>(FF->getValueSize())		<< " ValueSize:" << static_cast<unsigned>(FF->getValueSize())
<< " NumValues:" << FF->getNumValues();		<< " NumValues:" << FF->getNumValues();
break;		break;
}		}
		case MCFragment::FT_Nops: {
		const auto *NF = cast<MCNopsFragment>(this);
		OS << " NumBytes:" << NF->getNumBytes()
		<< " MaxNopLength:" << NF->getMaxNopLength();
		break;
		}
case MCFragment::FT_Relaxable: {		case MCFragment::FT_Relaxable: {
const auto *F = cast<MCRelaxableFragment>(this);		const auto *F = cast<MCRelaxableFragment>(this);
OS << "\n ";		OS << "\n ";
OS << " Inst:";		OS << " Inst:";
F->getInst().dump_pretty(OS);		F->getInst().dump_pretty(OS);
OS << " (" << F->getContents().size() << " bytes)";		OS << " (" << F->getContents().size() << " bytes)";
break;		break;
}		}
▲ Show 20 Lines • Show All 62 Lines • Show Last 20 Lines

llvm/lib/MC/MCObjectStreamer.cpp

Show First 20 Lines • Show All 736 Lines • ▼ Show 20 Lines	void MCObjectStreamer::emitFill(const MCExpr &NumValues, int64_t Size,
// Otherwise emit as fragment.		// Otherwise emit as fragment.
MCDataFragment *DF = getOrCreateDataFragment();		MCDataFragment *DF = getOrCreateDataFragment();
flushPendingLabels(DF, DF->getContents().size());		flushPendingLabels(DF, DF->getContents().size());

assert(getCurrentSectionOnly() && "need a section");		assert(getCurrentSectionOnly() && "need a section");
insert(new MCFillFragment(Expr, Size, NumValues, Loc));		insert(new MCFillFragment(Expr, Size, NumValues, Loc));
}		}

		void MCObjectStreamer::emitNops(int64_t NumBytes, int64_t MaxNopLen,
		SMLoc Loc) {
		// Emit an NOP fragment.
		MCDataFragment *DF = getOrCreateDataFragment();
		flushPendingLabels(DF, DF->getContents().size());

		assert(getCurrentSectionOnly() && "need a section");
		insert(new MCNopsFragment(NumBytes, MaxNopLen, Loc));
		}

void MCObjectStreamer::emitFileDirective(StringRef Filename) {		void MCObjectStreamer::emitFileDirective(StringRef Filename) {
getAssembler().addFileName(Filename);		getAssembler().addFileName(Filename);
}		}

void MCObjectStreamer::emitAddrsig() {		void MCObjectStreamer::emitAddrsig() {
getAssembler().getWriter().emitAddrsigSection();		getAssembler().getWriter().emitAddrsigSection();
}		}

Show All 21 Lines

llvm/lib/MC/MCStreamer.cpp

	Show First 20 Lines • Show All 196 Lines • ▼ Show 20 Lines
	}			}

	/// Emit NumBytes bytes worth of the value specified by FillValue.			/// Emit NumBytes bytes worth of the value specified by FillValue.
	/// This implements directives such as '.space'.			/// This implements directives such as '.space'.
	void MCStreamer::emitFill(uint64_t NumBytes, uint8_t FillValue) {			void MCStreamer::emitFill(uint64_t NumBytes, uint8_t FillValue) {
	emitFill(*MCConstantExpr::create(NumBytes, getContext()), FillValue);			emitFill(*MCConstantExpr::create(NumBytes, getContext()), FillValue);
	}			}

				void llvm::MCStreamer::emitNops(int64_t NumBytes, int64_t MaxNopLen,
				llvm::SMLoc) {}

	/// The implementation in this class just redirects to emitFill.			/// The implementation in this class just redirects to emitFill.
	void MCStreamer::emitZeros(uint64_t NumBytes) { emitFill(NumBytes, 0); }			void MCStreamer::emitZeros(uint64_t NumBytes) { emitFill(NumBytes, 0); }

	Expected<unsigned>			Expected<unsigned>
	MCStreamer::tryEmitDwarfFileDirective(unsigned FileNo, StringRef Directory,			MCStreamer::tryEmitDwarfFileDirective(unsigned FileNo, StringRef Directory,
	StringRef Filename,			StringRef Filename,
	Optional<MD5::MD5Result> Checksum,			Optional<MD5::MD5Result> Checksum,
	Optional<StringRef> Source,			Optional<StringRef> Source,
	▲ Show 20 Lines • Show All 956 Lines • Show Last 20 Lines

llvm/lib/Target/X86/AsmParser/X86AsmParser.cpp

Show First 20 Lines • Show All 904 Lines • ▼ Show 20 Lines	private:

bool ParseIntelMemoryOperandSize(unsigned &Size);		bool ParseIntelMemoryOperandSize(unsigned &Size);
std::unique_ptr<X86Operand>		std::unique_ptr<X86Operand>
CreateMemForMSInlineAsm(unsigned SegReg, const MCExpr *Disp, unsigned BaseReg,		CreateMemForMSInlineAsm(unsigned SegReg, const MCExpr *Disp, unsigned BaseReg,
unsigned IndexReg, unsigned Scale, SMLoc Start,		unsigned IndexReg, unsigned Scale, SMLoc Start,
SMLoc End, unsigned Size, StringRef Identifier,		SMLoc End, unsigned Size, StringRef Identifier,
const InlineAsmIdentifierInfo &Info);		const InlineAsmIdentifierInfo &Info);

		bool parseDirectiveNops(SMLoc L);
bool parseDirectiveEven(SMLoc L);		bool parseDirectiveEven(SMLoc L);
bool ParseDirectiveCode(StringRef IDVal, SMLoc L);		bool ParseDirectiveCode(StringRef IDVal, SMLoc L);

/// CodeView FPO data directives.		/// CodeView FPO data directives.
bool parseDirectiveFPOProc(SMLoc L);		bool parseDirectiveFPOProc(SMLoc L);
bool parseDirectiveFPOSetFrame(SMLoc L);		bool parseDirectiveFPOSetFrame(SMLoc L);
bool parseDirectiveFPOPushReg(SMLoc L);		bool parseDirectiveFPOPushReg(SMLoc L);
bool parseDirectiveFPOStackAlloc(SMLoc L);		bool parseDirectiveFPOStackAlloc(SMLoc L);
▲ Show 20 Lines • Show All 2,920 Lines • ▼ Show 20 Lines	if (getLexer().isNot(AsmToken::EndOfStatement)) {
if (Parser.getTok().getString() == "noprefix")		if (Parser.getTok().getString() == "noprefix")
Parser.Lex();		Parser.Lex();
else if (Parser.getTok().getString() == "prefix")		else if (Parser.getTok().getString() == "prefix")
return Error(DirectiveID.getLoc(), "'.intel_syntax prefix' is not "		return Error(DirectiveID.getLoc(), "'.intel_syntax prefix' is not "
"supported: registers must not have "		"supported: registers must not have "
"a '%' prefix in .intel_syntax");		"a '%' prefix in .intel_syntax");
}		}
return false;		return false;
} else if (IDVal == ".even")		} else if (IDVal == ".nops")
		return parseDirectiveNops(DirectiveID.getLoc());
		else if (IDVal == ".even")
return parseDirectiveEven(DirectiveID.getLoc());		return parseDirectiveEven(DirectiveID.getLoc());
else if (IDVal == ".cv_fpo_proc")		else if (IDVal == ".cv_fpo_proc")
return parseDirectiveFPOProc(DirectiveID.getLoc());		return parseDirectiveFPOProc(DirectiveID.getLoc());
else if (IDVal == ".cv_fpo_setframe")		else if (IDVal == ".cv_fpo_setframe")
return parseDirectiveFPOSetFrame(DirectiveID.getLoc());		return parseDirectiveFPOSetFrame(DirectiveID.getLoc());
else if (IDVal == ".cv_fpo_pushreg")		else if (IDVal == ".cv_fpo_pushreg")
return parseDirectiveFPOPushReg(DirectiveID.getLoc());		return parseDirectiveFPOPushReg(DirectiveID.getLoc());
else if (IDVal == ".cv_fpo_stackalloc")		else if (IDVal == ".cv_fpo_stackalloc")
Show All 13 Lines	bool X86AsmParser::ParseDirective(AsmToken DirectiveID) {
else if (IDVal == ".seh_savexmm")		else if (IDVal == ".seh_savexmm")
return parseDirectiveSEHSaveXMM(DirectiveID.getLoc());		return parseDirectiveSEHSaveXMM(DirectiveID.getLoc());
else if (IDVal == ".seh_pushframe")		else if (IDVal == ".seh_pushframe")
return parseDirectiveSEHPushFrame(DirectiveID.getLoc());		return parseDirectiveSEHPushFrame(DirectiveID.getLoc());

return true;		return true;
}		}

		/// parseDirectiveNops
		/// ::= .nops size[, control]
		bool X86AsmParser::parseDirectiveNops(SMLoc L) {
		int64_t NumBytes = 0, Control = 0;
		SMLoc NumBytesLoc, ControlLoc;
		const MCSubtargetInfo STI = getSTI();
		NumBytesLoc = getTok().getLoc();
		if (getParser().checkForValidSection() \|\|
		getParser().parseAbsoluteExpression(NumBytes))
		return true;

		if (parseOptionalToken(AsmToken::Comma)) {
		ControlLoc = getTok().getLoc();
		if (getParser().parseAbsoluteExpression(Control))
		return true;
		}
		if (getParser().parseToken(AsmToken::EndOfStatement,
		"unexpected token in '.nops' directive"))
		return true;

		if (NumBytes <= 0) {
		Error(NumBytesLoc, "'.nops' directive with non-positive size");
		return false;
		}

		if (Control < 0) {
		Error(ControlLoc, "'.nops' directive with negative NOP size");
		return false;
		}

		/// Emit nops
		getParser().getStreamer().emitNops(NumBytes, Control, L);

		return false;
		}

/// parseDirectiveEven		/// parseDirectiveEven
/// ::= .even		/// ::= .even
bool X86AsmParser::parseDirectiveEven(SMLoc L) {		bool X86AsmParser::parseDirectiveEven(SMLoc L) {
if (parseToken(AsmToken::EndOfStatement, "unexpected token in directive"))		if (parseToken(AsmToken::EndOfStatement, "unexpected token in directive"))
return false;		return false;

const MCSection *Section = getStreamer().getCurrentSectionOnly();		const MCSection *Section = getStreamer().getCurrentSectionOnly();
if (!Section) {		if (!Section) {
▲ Show 20 Lines • Show All 268 Lines • Show Last 20 Lines

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp

Show First 20 Lines • Show All 201 Lines • ▼ Show 20 Lines	public:
bool padInstructionViaPrefix(MCRelaxableFragment &RF, MCCodeEmitter &Emitter,		bool padInstructionViaPrefix(MCRelaxableFragment &RF, MCCodeEmitter &Emitter,
unsigned &RemainingSize) const;		unsigned &RemainingSize) const;

bool padInstructionEncoding(MCRelaxableFragment &RF, MCCodeEmitter &Emitter,		bool padInstructionEncoding(MCRelaxableFragment &RF, MCCodeEmitter &Emitter,
unsigned &RemainingSize) const;		unsigned &RemainingSize) const;

void finishLayout(MCAssembler const &Asm, MCAsmLayout &Layout) const override;		void finishLayout(MCAssembler const &Asm, MCAsmLayout &Layout) const override;

		unsigned getMaximumNopSize() const override;

bool writeNopData(raw_ostream &OS, uint64_t Count) const override;		bool writeNopData(raw_ostream &OS, uint64_t Count) const override;
};		};
} // end anonymous namespace		} // end anonymous namespace

static unsigned getRelaxedOpcodeBranch(const MCInst &Inst, bool Is16BitMode) {		static unsigned getRelaxedOpcodeBranch(const MCInst &Inst, bool Is16BitMode) {
unsigned Op = Inst.getOpcode();		unsigned Op = Inst.getOpcode();
switch (Op) {		switch (Op) {
default:		default:
▲ Show 20 Lines • Show All 844 Lines • ▼ Show 20 Lines	#endif
// The layout is done. Mark every fragment as valid.		// The layout is done. Mark every fragment as valid.
for (unsigned int i = 0, n = Layout.getSectionOrder().size(); i != n; ++i) {		for (unsigned int i = 0, n = Layout.getSectionOrder().size(); i != n; ++i) {
MCSection &Section = *Layout.getSectionOrder()[i];		MCSection &Section = *Layout.getSectionOrder()[i];
Layout.getFragmentOffset(&*Section.getFragmentList().rbegin());		Layout.getFragmentOffset(&*Section.getFragmentList().rbegin());
Asm.computeFragmentSize(Layout, *Section.getFragmentList().rbegin());		Asm.computeFragmentSize(Layout, *Section.getFragmentList().rbegin());
}		}
}		}

		unsigned X86AsmBackend::getMaximumNopSize() const {
		reamesUnsubmitted Not Done Reply Inline Actions Rename this function to getMaximumProfitableNop() There's a difference between legality and profit here. As commented earlier, if that matters you'll have a harder task implementation wise. reames: Rename this function to getMaximumProfitableNop() There's a difference between legality and…
		aganeaUnsubmitted Not Done Reply Inline Actions Any reason for not reusing `maxLongNopLength()` rather than rewriting the same thing here? https://github.com/llvm/llvm-project/blob/b2eb1c5793d78d70c1223b098aefc87050f69a8c/llvm/lib/Target/X86/X86MCInstLower.cpp#L1085 That function could perhaps be moved to `llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h` ? aganea: Any reason for not reusing `maxLongNopLength()` rather than rewriting the same thing here?
		craig.topperUnsubmitted Not Done Reply Inline Actions That function can't be moved as is. It uses X86Subtarget which isn't available to MC. It does something different than for 32-bit mode than what is currently in this patch as that causes additional test failures as discussed elsewhere in this review. That function also uses ProcIntelSLM instead of Feature7ByteNOP. And the FeatureFast flags being set assumes FeatureNOPL is set which is backwards of how it should be. I think the function here is closer to how it should be except for the 32-bit difference. craig.topper: That function can't be moved as is. It uses X86Subtarget which isn't available to MC. It does…
		jcai19AuthorUnsubmitted Done Reply Inline Actions @craig.topper Okay I missed your comment. Thanks for the clarification. jcai19: @craig.topper Okay I missed your comment. Thanks for the clarification.
		jcai19AuthorUnsubmitted Done Reply Inline Actions Any reason for not reusing maxLongNopLength() rather than rewriting the same thing here? https://github.com/llvm/llvm-project/blob/b2eb1c5793d78d70c1223b098aefc87050f69a8c/llvm/lib/Target/X86/X86MCInstLower.cpp#L1085 Yes I'm all for merging these two functions although there are some differences on both 32-bit and 64-bit mode that would break some unit tests, such as https://reviews.llvm.org/D82826?id=275853 on 64-bit mode. Maybe we can address that in a separate patch as previously discussed. That function could perhaps be moved to llvm/lib/Target/X86/MCTargetDesc/X86BaseInfo.h ? SG. Will start to work on merging them once this patch is checked in. jcai19: > Any reason for not reusing maxLongNopLength() rather than rewriting the same thing here?
		if (!STI.getFeatureBits()[X86::FeatureNOPL] &&
		craig.topperUnsubmitted Not Done Reply Inline Actions Mode32Bit is only relevant if FeatureNOPL is false. So I think it should be if (!STI.getFeatureBits()[X86::FeatureNOPL] && !STI.getFeatureBits()[X86::Mode64Bit]) return STI.getFeatureBits()[X86::Mode32Bit] ? 2 : 1; craig.topper: Mode32Bit is only relevant if FeatureNOPL is false. So I think it should be if (!STI.
		!STI.getFeatureBits()[X86::Mode64Bit])
		return 1;
		if (STI.getFeatureBits()[X86::FeatureFast7ByteNOP])
		return 7;
		if (STI.getFeatureBits()[X86::FeatureFast15ByteNOP])
		return 15;
		if (STI.getFeatureBits()[X86::FeatureFast11ByteNOP])
		return 11;
		// FIXME: handle 32-bit mode
		// 15-bytes is the longest single NOP instruction, but 10-bytes is
		// commonly the longest that can be efficiently decoded.
		return 10;
		}

/// Write a sequence of optimal nops to the output, covering \p Count		/// Write a sequence of optimal nops to the output, covering \p Count
/// bytes.		/// bytes.
/// \return - true on success, false on failure		/// \return - true on success, false on failure
bool X86AsmBackend::writeNopData(raw_ostream &OS, uint64_t Count) const {		bool X86AsmBackend::writeNopData(raw_ostream &OS, uint64_t Count) const {
static const char Nops[10][11] = {		static const char Nops[10][11] = {
// nop		// nop
"\x90",		"\x90",
// xchg %ax,%ax		// xchg %ax,%ax
Show All 11 Lines	static const char Nops[10][11] = {
// nopl 0L(%[re]ax,%[re]ax,1)		// nopl 0L(%[re]ax,%[re]ax,1)
"\x0f\x1f\x84\x00\x00\x00\x00\x00",		"\x0f\x1f\x84\x00\x00\x00\x00\x00",
// nopw 0L(%[re]ax,%[re]ax,1)		// nopw 0L(%[re]ax,%[re]ax,1)
"\x66\x0f\x1f\x84\x00\x00\x00\x00\x00",		"\x66\x0f\x1f\x84\x00\x00\x00\x00\x00",
// nopw %cs:0L(%[re]ax,%[re]ax,1)		// nopw %cs:0L(%[re]ax,%[re]ax,1)
"\x66\x2e\x0f\x1f\x84\x00\x00\x00\x00\x00",		"\x66\x2e\x0f\x1f\x84\x00\x00\x00\x00\x00",
};		};

// This CPU doesn't support long nops. If needed add more.		uint64_t MaxNopLength = (uint64_t)getMaximumNopSize();
// FIXME: We could generated something better than plain 0x90.
if (!STI.getFeatureBits()[X86::FeatureNOPL]) {
for (uint64_t i = 0; i < Count; ++i)
OS << '\x90';
return true;
}

// 15-bytes is the longest single NOP instruction, but 10-bytes is
// commonly the longest that can be efficiently decoded.
uint64_t MaxNopLength = 10;
if (STI.getFeatureBits()[X86::FeatureFast7ByteNOP])
MaxNopLength = 7;
else if (STI.getFeatureBits()[X86::FeatureFast15ByteNOP])
MaxNopLength = 15;
else if (STI.getFeatureBits()[X86::FeatureFast11ByteNOP])
MaxNopLength = 11;

// Emit as many MaxNopLength NOPs as needed, then emit a NOP of the remaining		// Emit as many MaxNopLength NOPs as needed, then emit a NOP of the remaining
// length.		// length.
do {		do {
const uint8_t ThisNopLength = (uint8_t) std::min(Count, MaxNopLength);		const uint8_t ThisNopLength = (uint8_t) std::min(Count, MaxNopLength);
const uint8_t Prefixes = ThisNopLength <= 10 ? 0 : ThisNopLength - 10;		const uint8_t Prefixes = ThisNopLength <= 10 ? 0 : ThisNopLength - 10;
for (uint8_t i = 0; i < Prefixes; i++)		for (uint8_t i = 0; i < Prefixes; i++)
OS << '\x66';		OS << '\x66';
▲ Show 20 Lines • Show All 478 Lines • Show Last 20 Lines

llvm/test/MC/X86/align-branch-bundle.s

	# RUN: llvm-mc -filetype=obj -triple x86_64-unknown-unknown --x86-align-branch-boundary=16 --x86-align-branch=fused+jcc --mc-relax-all %s \| llvm-objdump -d --no-show-raw-insn - \| FileCheck %s			# RUN: llvm-mc -filetype=obj -triple x86_64-unknown-unknown --x86-align-branch-boundary=16 --x86-align-branch=fused+jcc --mc-relax-all %s \| llvm-objdump -d --no-show-raw-insn - \| FileCheck %s

	# Check using option --x86-align-branch-boundary=16 --x86-align-branch=fused+jcc --mc-relax-all with bundle won't make code crazy			# Check using option --x86-align-branch-boundary=16 --x86-align-branch=fused+jcc --mc-relax-all with bundle won't make code crazy

	# CHECK: 0: pushq %rbp			# CHECK: 0: pushq %rbp
	# CHECK-NEXT: 1: testq $2, %rdx			# CHECK-NEXT: 1: testq $2, %rdx
	# CHECK-NEXT: 8: jne			# CHECK-NEXT: 8: jne
	# CHECK-NEXT: e: nop			# CHECK-NEXT: e: nop
	# CHECK-NEXT: f: nop
	# CHECK-NEXT: 10: jle			# CHECK-NEXT: 10: jle
				reamesUnsubmitted Not Done Reply Inline Actions Having a test delta in a file without .nops is highly suspicious. I'd suggest splitting your patch into a trivial version which emits single byte nops, and an change which adds the multiple byte support. That would allow us to separate the directive mechanics from the interesting profit bits. reames: Having a test delta in a file without .nops is highly suspicious. I'd suggest splitting your…
				jcai19AuthorUnsubmitted Done Reply Inline Actions How about we also print out instruction bytes here. If 64-bit processors can generate a two-byte long nop instruction here, shouldn't we emit that instead of two single-byte nop? Thanks. jcai19: How about we also print out instruction bytes here. If 64-bit processors can generate a two…
				jcai19AuthorUnsubmitted Done Reply Inline Actions On second thought, I agree that splitting the patch is the better approach in case the multiple-byte support causes any regression. Will address this in the next iteration. jcai19: On second thought, I agree that splitting the patch is the better approach in case the multiple…

	.text			.text
	.p2align 4			.p2align 4
	foo:			foo:
	push %rbp			push %rbp
	# Will be bundle-aligning to 8 byte boundaries			# Will be bundle-aligning to 8 byte boundaries
	.bundle_align_mode 3			.bundle_align_mode 3
	test $2, %rdx			test $2, %rdx
	jne foo			jne foo
	# This jle is 6 bytes long and should have started at 0xe, so two bytes			# This jle is 6 bytes long and should have started at 0xe, so two bytes
	# of nop padding are inserted instead and it starts at 0x10			# of nop padding are inserted instead and it starts at 0x10
	jle foo			jle foo

llvm/test/MC/X86/align-branch-pad-max-prefix.s

	# RUN: llvm-mc -filetype=obj -triple x86_64 --x86-align-branch-boundary=32 --x86-align-branch=jmp -x86-pad-max-prefix-size=5 %s \| llvm-objdump -d --no-show-raw-insn - \| FileCheck %s			# RUN: llvm-mc -filetype=obj -triple x86_64 --x86-align-branch-boundary=32 --x86-align-branch=jmp -x86-pad-max-prefix-size=5 %s \| llvm-objdump -d --no-show-raw-insn - \| FileCheck %s
	# Check instructions can be aligned correctly along with option -x86-pad-max-prefix-size=5			# Check instructions can be aligned correctly along with option -x86-pad-max-prefix-size=5

	.text			.text
	.p2align 5			.p2align 5
	.rept 24			.rept 24
	int3			int3
	.endr			.endr
	# We should not increase the length of this jmp to reduce the bytes of			# We should not increase the length of this jmp to reduce the bytes of
	# following nops, doing so would make the jmp misaligned.			# following nops, doing so would make the jmp misaligned.
	# CHECK: 18: jmp			# CHECK: 18: jmp
	jmp bar			jmp bar
	# CHECK: 1d: nop			# CHECK: 1d: nopl
	# CHECK: 1e: nop
	# CHECK: 1f: nop
	# CHECK: 20: int3			# CHECK: 20: int3
	.p2align 5			.p2align 5
	int3			int3

llvm/test/MC/X86/x86-directive-nops-errors.s

This file was added.

				# RUN: not llvm-mc -triple i386 %s -filetype=obj -o /dev/null 2>&1 \| FileCheck --check-prefix=X32 %s
				# RUN: not llvm-mc -triple=x86_64 %s -filetype=obj -o /dev/null 2>&1 \| FileCheck --check-prefix=X64 %s

				.nops 4, 3
				# X32: :[[@LINE-1]]:1: error: illegal NOP size 3.
				.nops 4, 4
				craig.topperUnsubmitted Not Done Reply Inline Actions Please use X86 rather than X32. X32 gets confusing with gnux32 where 32-bit pointers are used on a 64-bit target. craig.topper: Please use X86 rather than X32. X32 gets confusing with gnux32 where 32-bit pointers are used…
				# X32: :[[@LINE-1]]:1: error: illegal NOP size 4.
				.nops 4, 5
				# X32: :[[@LINE-1]]:1: error: illegal NOP size 5.
				.nops 16, 15
				# X32: :[[@LINE-1]]:1: error: illegal NOP size 15.
				# X64: :[[@LINE-2]]:1: error: illegal NOP size 15.

llvm/test/MC/X86/x86-directive-nops.s

This file was added.

				# RUN: llvm-mc -triple i386 %s -filetype=obj \| llvm-objdump -d - \| FileCheck %s

				MaskRayUnsubmitted Not Done Reply Inline Actions `#` MaskRay: `# `
				.nops 4
				# CHECK: 0: 90 nop
				# CHECK-NEXT: 1: 90 nop
				# CHECK-NEXT: 2: 90 nop
				# CHECK-NEXT: 3: 90 nop
				.nops 4, 1
				MaskRayUnsubmitted Not Done Reply Inline Actions `# X32` MaskRay: `# X32`
				# CHECK: 4: 90 nop
				# CHECK-NEXT: 5: 90 nop
				# CHECK-NEXT: 6: 90 nop
				# CHECK-NEXT: 7: 90 nop

llvm/test/MC/X86/x86_64-directive-nops.s

This file was added.

				# RUN: llvm-mc -triple=x86_64 %s -filetype=obj \| llvm-objdump -d - \| FileCheck %s

				.nops 4, 1
				# CHECK: 0: 90 nop
				# CHECK-NEXT: 1: 90 nop
				# CHECK-NEXT: 2: 90 nop
				# CHECK-NEXT: 3: 90 nop
				.nops 4, 2
				# CHECK-NEXT: 4: 66 90 nop
				# CHECK-NEXT: 6: 66 90 nop
				.nops 4, 3
				# CHECK-NEXT: 8: 0f 1f 00 nopl (%rax)
				# CHECK-NEXT: b: 90 nop
				.nops 4, 4
				# CHECK-NEXT: c: 0f 1f 40 00 nopl (%rax)
				.nops 4, 5
				# CHECK-NEXT: 10: 0f 1f 40 00 nopl (%rax)
				.nops 4
				# CHECK-NEXT: 14: 0f 1f 40 00 nopl (%rax)

This is an archive of the discontinued LLVM Phabricator instance.

[X86] support .nops directiveClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 275853

llvm/include/llvm/MC/MCAsmBackend.h

llvm/include/llvm/MC/MCFragment.h

llvm/include/llvm/MC/MCObjectStreamer.h

llvm/include/llvm/MC/MCStreamer.h

llvm/lib/MC/MCAssembler.cpp

llvm/lib/MC/MCFragment.cpp

llvm/lib/MC/MCObjectStreamer.cpp

llvm/lib/MC/MCStreamer.cpp

llvm/lib/Target/X86/AsmParser/X86AsmParser.cpp

llvm/lib/Target/X86/MCTargetDesc/X86AsmBackend.cpp

llvm/test/MC/X86/align-branch-bundle.s

llvm/test/MC/X86/align-branch-pad-max-prefix.s

llvm/test/MC/X86/x86-directive-nops-errors.s

llvm/test/MC/X86/x86-directive-nops.s

llvm/test/MC/X86/x86_64-directive-nops.s

[X86] support .nops directive
ClosedPublic