This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/lib/Target/X86/MCTargetDesc/
-
lib/
-
Target/
-
X86/
-
MCTargetDesc/
27/28
X86MCCodeEmitter.cpp

Differential D143471

[X86][MC][NFC] Refine code in X86MCCodeEmitter.cpp about opcode prefix
ClosedPublic

Authored by skan on Feb 7 2023, 12:30 AM.

Download Raw Diff

Details

Reviewers

pengfei
craig.topper
RKSimon

Commits

rGa22e8c9dadea: [X86][MC][NFC] Refine code in X86MCCodeEmitter.cpp about opcode prefix

Summary

Make code clearer by separating the logic of setting bits from the logic of how a prefix is encoded
Extract common code into functions to avoid code duplication
Return a enum rather a boolean to ehance scalability and uniform the behavior of functions

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

skan created this revision.Feb 7 2023, 12:30 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 7 2023, 12:30 AM

Herald added subscribers: pengfei, hiraditya. · View Herald Transcript

skan requested review of this revision.Feb 7 2023, 12:30 AM

Herald added a project: Restricted Project. · View Herald TranscriptFeb 7 2023, 12:30 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

skan added reviewers: pengfei, craig.topper, RKSimon.Feb 7 2023, 12:31 AM

craig.topper added inline comments.Feb 7 2023, 12:45 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
222	Woudln't this be better as `(W \| R \| X \| B) ? REX : None`?
224	Similar here.
283–284	clang-format
840–842	The ?: can just be combined into the return here. No need for a variable.

skan added inline comments.Feb 7 2023, 12:49 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
222	Good idea!
224	Will do.
840–842	Good point!

craig.topper added inline comments.Feb 7 2023, 12:56 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	I don't know that it makes sense to make returning the bool for `HasREX` worse by spreading it to more functions. None of the other prefixes are needed and I'm not sure there's sufficient evidence they ever will be. And if they were needed would what type of prefix be enough information? A `bool &HasREX` passed to emitOpcodePrefix and emitREXPrefix feels cleaner to me. I never liked the returned bool for this.

Address review comments

skan added inline comments.Feb 7 2023, 1:22 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	I know a little about it. When X86 introduces a bunch of new instructions, it usually extends the prefix. For example, when moved from ia32 to ia32e, REX was defined. Similarly, when we moved from SSE to AVX, VEX prefix was defined. And when we introduce new instructions, a new relocation may be needed. Here is the example for REX https://groups.google.com/g/x86-64-abi/c/n9AWHogmVY0 Linker can do relocation optimization based on the relocation. From my understanding, if no new relocation were not added for the new instructions, the optimization could be done in an incorrect way silently. Only a bool `HasREX` was defined here b/c the current interested instructions are only REX-encoded. But we should allow more possibility here. And returning a enum here is almost as cheap as bool.

skan added inline comments.Feb 7 2023, 1:30 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	Got your idea. Let me to pass the enum by reference rather than a return value.

craig.topper added inline comments.Feb 7 2023, 1:37 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	REX is older than the VEX, XOP, EVEX prefixes so it doesn’t seem common to need the new relocation. It was cheaper to compute I’d say we could determine if it has REX in the fixup code but we have to inspect multiple operands. If we need to know about a prefix for a future location that one might be cheaper to compute. This feels like we’re trying to solve a future problem that might never exist. Is it really worth it?
1278	*relocation

craig.topper added inline comments.Feb 7 2023, 1:46 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	It’s also hard to add new relocations anyway since you would need a new linker that understands it. Or a command line to opt into it.

skan added inline comments.Feb 7 2023, 2:00 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	I just gave a try. But passing the `enum` as argument increases the length of arglists for four functions: `emitREXPrefix`, `emitVEXOpcodePrefix`, `emitOpcodePrefix` and `emitPrefixImpl` longer, which made me nervous. I don't even know which order I should use for the parameters for the functions. I remembered that we once refined the code in this file to reduce the parameters, so I prefer the "return value" version. AFAICS, whether the prefix is REX, VEX, or EVEX should be enough information for the other code in `emitInstructiion`.

skan added inline comments.Feb 7 2023, 2:04 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	Yes, I was talking about that if we know more info about prefix, we can easily extend the relocation type in `X86MCCodeEmitter::emitMemModRMByte`.

Harbormaster completed remote builds in B212298: Diff 495406.Feb 7 2023, 2:08 AM

Yikarus added a subscriber: Yikarus.Feb 7 2023, 5:20 AM

Yikarus added inline comments.

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
782–783	Need rewrite the comment

Address review comments

skan added inline comments.Feb 7 2023, 6:31 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1278	I think it's worthy b/c we uniform the behavior of functions `emitPrefix`, and at the same time, there is almost no extra cost.

Harbormaster completed remote builds in B212372: Diff 495507.Feb 7 2023, 7:32 AM

craig.topper added inline comments.Feb 7 2023, 9:13 AM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1148	The names of these functions is confusing. getKind followed by setKind looks odd. I would naively expect setKind to set the thing that getKind returns. So getKind followed by setKind looks redundant.

Address review comments

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
1148	You're right. I re-designed the interface to `setLowerBound`, `determineOptimalKind`.

Harbormaster completed remote builds in B212510: Diff 495700.Feb 7 2023, 6:55 PM

pengfei added inline comments.Feb 8 2023, 7:24 PM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
784–786	Format.
866–883	Should be better to move below logic into `X86OpcodePrefix` ? Then we can simplify the function to: X86OpcodePrefix Prefix(*Ctx.getRegisterInfo(), TSFlags); Prefix.emit(OS); return Prefix.getKind();
871	Should be better to use `setEncoding`?

skan added inline comments.Feb 8 2023, 7:41 PM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
784–786	Will do
866–883	We can not b/c we still need to read the operands the instruction before the final emit. We shouldn't b/c the first principle mentioned in the summary: "Make code clearer by separating the logic of setting bits from the logic of how a prefix is encoded"
871	I think `setLowerBound` is more clear than `setEncoding` here b/c VEX and REX are determined at the last stage. e.g `Prefix.setEncoding(VEX2)` here is counterintuitive.

Clang format

skan marked 3 inline comments as done.Feb 8 2023, 7:45 PM

skan marked 4 inline comments as done.Feb 8 2023, 8:04 PM

Harbormaster completed remote builds in B212726: Diff 496006.Feb 8 2023, 8:38 PM

LGTM. Please wait for some days for other reviewers.

This revision is now accepted and ready to land.Feb 8 2023, 9:23 PM

craig.topper added inline comments.Feb 8 2023, 9:23 PM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
874	explicit*

craig.topper added inline comments.Feb 8 2023, 9:31 PM

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
140	I'm not sure if it makes sense to have this class understand MRI and MI. I kind of think it should only receive the register encoding. Especially when you look at funcions like setRR2 that now end up looking up the encoding twice because it nests calls to setR and setR2.

Thinking.... What if we took this further and had an "encodeable instruction" object that contains the prefix fields and the modrm fields that we build by walking the operands and format once. Then we would only have 1 switch on format instead of the 3 we have now. Then we use that to emit the prefix, the opcode, and the modrm byte.

Fix typo and format code

In D143471#4114625, @craig.topper wrote:

Thinking.... What if we took this further and had an "encodeable instruction" object that contains the prefix fields and the modrm fields that we build by walking the operands and format once. Then we would only have 1 switch on format instead of the 3 we have now. Then we use that to emit the prefix, the opcode, and the modrm byte.

This sounds good! Let me give a try.

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp
140	I agree the name `X86OpcodePrefix` doesn't sound like much to understand MI, we should have a "Helper" suffix. Encoding twice is not a big issue. We can easily extract a function "getX86RegEncoding" to avoid this. I am thinking your new proposal "encodeable instruction"

Harbormaster completed remote builds in B212740: Diff 496023.Feb 8 2023, 10:38 PM

Rename class name and avoid duplicated function call

In D143471#4114638, @skan wrote:

In D143471#4114625, @craig.topper wrote:

Thinking.... What if we took this further and had an "encodable instruction" object that contains the prefix fields and the modrm fields that we build by walking the operands and format once. Then we would only have 1 switch on format instead of the 3 we have now. Then we use that to emit the prefix, the opcode, and the modrm byte.

This sounds good! Let me give a try.

@craig.topper I think it's doable with some space cost. e.g, we need to reserve some bits in "encodable instruction" to represent the "REG", "MOD", "RM" and "SIB" and It's even true when we only would like to emitPrefix.
The code change for encodable instruction is quite a big and can share some common code of this patch. I suggest we land this first and possibly implement "encodable instruction" based on it.

Harbormaster completed remote builds in B212794: Diff 496107.Feb 9 2023, 6:56 AM

LGTM

This revision was landed with ongoing or failed builds.Feb 9 2023, 7:11 PM

Closed by commit rGa22e8c9dadea: [X86][MC][NFC] Refine code in X86MCCodeEmitter.cpp about opcode prefix (authored by skan). · Explain Why

This revision was automatically updated to reflect the committed changes.

skan added a commit: rGa22e8c9dadea: [X86][MC][NFC] Refine code in X86MCCodeEmitter.cpp about opcode prefix.

uabelho added a subscriber: uabelho.Feb 10 2023, 1:31 AM

Hi,

I noticed that if I run lit tests with ubsan built binaries with this patch there are many many tests failing like

../lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp:222:19: runtime error: left shift of negative value -1

In D143471#4117637, @uabelho wrote:
Hi,

I noticed that if I run lit tests with ubsan built binaries with this patch there are many many tests failing like
../lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp:222:19: runtime error: left shift of negative value -1

I think it's false positive?

~R << 7

R is an unsigned, so ~R is unsigned too. Although the value of ~R may be 0xffffffff. But I remember the left shift operation on unsigned is always well defined as long as the shift not exceed the size of unsigned.

In D143471#4118036, @skan wrote:
In D143471#4117637, @uabelho wrote:
Hi,

I noticed that if I run lit tests with ubsan built binaries with this patch there are many many tests failing like
../lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp:222:19: runtime error: left shift of negative value -1
I think it's false positive?
~R << 7
R is an unsigned, so ~R is unsigned too. Although the value of ~R may be 0xffffffff. But I remember the left shift operation on unsigned is always well defined as long as the shift not exceed the size of unsigned.

I remember the << operator is defined to shift signed int in C language. There's no relationship with the type of shifted value.

In D143471#4118152, @pengfei wrote:
In D143471#4118036, @skan wrote:
In D143471#4117637, @uabelho wrote:
Hi,

I noticed that if I run lit tests with ubsan built binaries with this patch there are many many tests failing like
../lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp:222:19: runtime error: left shift of negative value -1
I think it's false positive?
~R << 7
R is an unsigned, so ~R is unsigned too. Although the value of ~R may be 0xffffffff. But I remember the left shift operation on unsigned is always well defined as long as the shift not exceed the size of unsigned.
I remember the << operator is defined to shift signed int in C language. There's no relationship with the type of shifted value.

I am not an expert in latest C standard. But we are in C++

https://www.open-std.org/jtc1/sc22/wg21/docs/papers/2017/n4713.pdf

8.5.7 Shift operators [expr.shift]
1 The shift operators << and >> group left-to-right.
shift-expression:
additive-expression
shift-expression << additive-expression
shift-expression >> additive-expression
The operands shall be of integral or unscoped enumeration type and integral promotions are performed. The
type of the result is that of the promoted left operand. The behavior is undefined if the right operand is
negative, or greater than or equal to the length in bits of the promoted left operand.
2 The value of E1 << E2 is E1 left-shifted E2 bit positions; vacated bits are zero-filled. If E1 has an unsigned
type, the value of the result is E1 × 2
E2, reduced modulo one more than the maximum value representable in
the result type. Otherwise, if E1 has a signed type and non-negative value, and E1 × 2
E2 is representable
in the corresponding unsigned type of the result type, then that value, converted to the result type, is the
resulting value; otherwise, the behavior is undefined.

According to the rule, unsigned does not need to be promoted and the shift operation is still on a unsigned.

I think there is promotion to int due to the ~
https://stackoverflow.com/questions/32529080/should-bit-fields-less-than-int-in-size-be-the-subject-of-integral-promotion

In D143471#4117637, @uabelho wrote:
Hi,

I noticed that if I run lit tests with ubsan built binaries with this patch there are many many tests failing like
../lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp:222:19: runtime error: left shift of negative value -1

Hi, the LLDB sanitized bots are also failing with this error https://green.lab.llvm.org/green/view/LLDB/job/lldb-cmake-sanitized/3335/console

I think uabelho is right about the promotion. According to the standard: "The operand of ~ shall have integral or unscoped enumeration type; the result is the ones’ complement of its operand. Integral promotions are performed."

I just pushed d37a31cf237cb4f8a18b12c91a9204feca5900ef to fix the usbsan issue

In D143471#4118938, @craig.topper wrote:

I just pushed d37a31cf237cb4f8a18b12c91a9204feca5900ef to fix the usbsan issue

Thank you @craig.topper !

Revision Contents

Path

Size

llvm/

lib/

Target/

X86/

MCTargetDesc/

X86MCCodeEmitter.cpp

922 lines

Diff 496313

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp

Show All 31 Lines
#include <cstdlib>		#include <cstdlib>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "mccodeemitter"		#define DEBUG_TYPE "mccodeemitter"

namespace {		namespace {

		enum PrefixKind { None, REX, XOP, VEX2, VEX3, EVEX };

		static void emitByte(uint8_t C, raw_ostream &OS) { OS << static_cast<char>(C); }

		class X86OpcodePrefixHelper {
		// REX (1 byte)
		// +-----+ +------+
		// \| 40H \| \| WRXB \|
		// +-----+ +------+

		// XOP (3-byte)
		// +-----+ +--------------+ +-------------------+
		// \| 8Fh \| \| RXB \| m-mmmm \| \| W \| vvvv \| L \| pp \|
		// +-----+ +--------------+ +-------------------+

		// VEX2 (2 bytes)
		// +-----+ +-------------------+
		// \| C5h \| \| R \| vvvv \| L \| pp \|
		// +-----+ +-------------------+

		// VEX3 (3 bytes)
		// +-----+ +--------------+ +-------------------+
		// \| C4h \| \| RXB \| m-mmmm \| \| W \| vvvv \| L \| pp \|
		// +-----+ +--------------+ +-------------------+

		// VEX_R: opcode externsion equivalent to REX.R in
		// 1's complement (inverted) form
		//
		// 1: Same as REX_R=0 (must be 1 in 32-bit mode)
		// 0: Same as REX_R=1 (64 bit mode only)

		// VEX_X: equivalent to REX.X, only used when a
		// register is used for index in SIB Byte.
		//
		// 1: Same as REX.X=0 (must be 1 in 32-bit mode)
		// 0: Same as REX.X=1 (64-bit mode only)

		// VEX_B:
		// 1: Same as REX_B=0 (ignored in 32-bit mode)
		// 0: Same as REX_B=1 (64 bit mode only)

		// VEX_W: opcode specific (use like REX.W, or used for
		// opcode extension, or ignored, depending on the opcode byte)

		// VEX_5M (VEX m-mmmmm field):
		//
		// 0b00000: Reserved for future use
		// 0b00001: implied 0F leading opcode
		// 0b00010: implied 0F 38 leading opcode bytes
		// 0b00011: implied 0F 3A leading opcode bytes
		// 0b00100: Reserved for future use
		// 0b00101: VEX MAP5
		// 0b00110: VEX MAP6
		// 0b00111-0b11111: Reserved for future use
		// 0b01000: XOP map select - 08h instructions with imm byte
		// 0b01001: XOP map select - 09h instructions with no imm byte
		// 0b01010: XOP map select - 0Ah instructions with imm dword

		// VEX_4V (VEX vvvv field): a register specifier
		// (in 1's complement form) or 1111 if unused.

		// VEX_PP: opcode extension providing equivalent
		// functionality of a SIMD prefix
		// 0b00: None
		// 0b01: 66
		// 0b10: F3
		// 0b11: F2

		// EVEX (4 bytes)
		// +-----+ +--------------+ +-------------------+ +------------------------+
		// \| 62h \| \| RXBR' \| 00mm \| \| W \| vvvv \| 1 \| pp \| \| z \| L'L \| b \| v' \| aaa \|
		// +-----+ +--------------+ +-------------------+ +------------------------+

		// EVEX_L2/VEX_L (Vector Length):
		// L2 L
		// 0 0: scalar or 128-bit vector
		// 0 1: 256-bit vector
		// 1 0: 512-bit vector

		private:
		unsigned W : 1;
		unsigned R : 1;
		unsigned X : 1;
		unsigned B : 1;
		unsigned VEX_4V : 4;
		unsigned VEX_L : 1;
		unsigned VEX_PP : 2;
		unsigned VEX_5M : 5;
		unsigned EVEX_R2 : 1;
		unsigned EVEX_z : 1;
		unsigned EVEX_L2 : 1;
		unsigned EVEX_b : 1;
		unsigned EVEX_V2 : 1;
		unsigned EVEX_aaa : 3;
		PrefixKind Kind = None;
		const MCRegisterInfo &MRI;

		unsigned getRegEncoding(const MCInst &MI, unsigned OpNum) const {
		return MRI.getEncodingValue(MI.getOperand(OpNum).getReg());
		}

		craig.topperUnsubmitted Not Done Reply Inline Actions I'm not sure if it makes sense to have this class understand MRI and MI. I kind of think it should only receive the register encoding. Especially when you look at funcions like setRR2 that now end up looking up the encoding twice because it nests calls to setR and setR2. craig.topper: I'm not sure if it makes sense to have this class understand MRI and MI. I kind of think it…
		skanAuthorUnsubmitted Done Reply Inline Actions I agree the name `X86OpcodePrefix` doesn't sound like much to understand MI, we should have a "Helper" suffix. Encoding twice is not a big issue. We can easily extract a function "getX86RegEncoding" to avoid this. I am thinking your new proposal "encodeable instruction" skan: I agree the name `X86OpcodePrefix` doesn't sound like much to understand MI, we should have a…
		void setR(unsigned Encoding) { R = Encoding >> 3 & 1; }
		void setR2(unsigned Encoding) { EVEX_R2 = Encoding >> 4 & 1; }
		void set4V(unsigned Encoding) { VEX_4V = Encoding & 0xf; }
		void setV2(unsigned Encoding) { EVEX_V2 = Encoding >> 4 & 1; }

		public:
		void setW(bool V) { W = V; }
		void setR(const MCInst &MI, unsigned OpNum) {
		setR(getRegEncoding(MI, OpNum));
		}
		void setX(const MCInst &MI, unsigned OpNum, unsigned Shift = 3) {
		X = getRegEncoding(MI, OpNum) >> Shift & 1;
		}
		void setB(const MCInst &MI, unsigned OpNum) {
		B = getRegEncoding(MI, OpNum) >> 3 & 1;
		}
		void set4V(const MCInst &MI, unsigned OpNum) {
		set4V(getRegEncoding(MI, OpNum));
		}
		void setL(bool V) { VEX_L = V; }
		void setPP(unsigned V) { VEX_PP = V; }
		void set5M(unsigned V) { VEX_5M = V; }
		void setR2(const MCInst &MI, unsigned OpNum) {
		setR2(getRegEncoding(MI, OpNum));
		}
		void setRR2(const MCInst &MI, unsigned OpNum) {
		unsigned Encoding = getRegEncoding(MI, OpNum);
		setR(Encoding);
		setR2(Encoding);
		}
		void setZ(bool V) { EVEX_z = V; }
		void setL2(bool V) { EVEX_L2 = V; }
		void setEVEX_b(bool V) { EVEX_b = V; }
		void setV2(const MCInst &MI, unsigned OpNum) {
		setV2(getRegEncoding(MI, OpNum));
		}
		void set4VV2(const MCInst &MI, unsigned OpNum) {
		unsigned Encoding = getRegEncoding(MI, OpNum);
		set4V(Encoding);
		setV2(Encoding);
		}
		void setAAA(const MCInst &MI, unsigned OpNum) {
		EVEX_aaa = getRegEncoding(MI, OpNum);
		}

		X86OpcodePrefixHelper(const MCRegisterInfo &MRI)
		: W(0), R(0), X(0), B(0), VEX_4V(0), VEX_L(0), VEX_PP(0), VEX_5M(0),
		EVEX_R2(0), EVEX_z(0), EVEX_L2(0), EVEX_b(0), EVEX_V2(0), EVEX_aaa(0),
		MRI(MRI) {}

		void setLowerBound(PrefixKind K) { Kind = K; }

		PrefixKind determineOptimalKind() {
		switch (Kind) {
		case None:
		Kind = (W \| R \| X \| B) ? REX : None;
		break;
		case REX:
		case XOP:
		case VEX3:
		case EVEX:
		break;
		case VEX2:
		Kind = (W \| X \| B \| (VEX_5M != 1)) ? VEX3 : VEX2;
		break;
		}
		return Kind;
		}

		void emit(raw_ostream &OS) const {
		uint8_t FirstPayload =
		((~R) & 0x1) << 7 \| ((~X) & 0x1) << 6 \| ((~B) & 0x1) << 5;
		uint8_t LastPayload = ((~VEX_4V) & 0xf) << 3 \| VEX_L << 2 \| VEX_PP;
		switch (Kind) {
		case None:
		return;
		case REX:
		emitByte(0x40 \| W << 3 \| R << 2 \| X << 1 \| B, OS);
		return;
		case VEX2:
		emitByte(0xC5, OS);
		emitByte(~R << 7 \| LastPayload, OS);
		craig.topperUnsubmitted Done Reply Inline Actions Woudln't this be better as `(W \| R \| X \| B) ? REX : None`? craig.topper: Woudln't this be better as `(W \| R \| X \| B) ? REX : None`?
		skanAuthorUnsubmitted Done Reply Inline Actions Good idea! skan: Good idea!
		return;
		case VEX3:
		craig.topperUnsubmitted Done Reply Inline Actions Similar here. craig.topper: Similar here.
		skanAuthorUnsubmitted Done Reply Inline Actions Will do. skan: Will do.
		case XOP:
		emitByte(Kind == VEX3 ? 0xC4 : 0x8F, OS);
		emitByte(FirstPayload \| VEX_5M, OS);
		emitByte(W << 7 \| LastPayload, OS);
		return;
		case EVEX:
		assert(VEX_5M & 0x7 &&
		"More than 3 significant bits in VEX.m-mmmm fields for EVEX!");
		emitByte(0x62, OS);
		emitByte(FirstPayload \| ((~EVEX_R2) & 0x1) << 4 \| VEX_5M, OS);
		emitByte(W << 7 \| ((~VEX_4V) & 0xf) << 3 \| 1 << 2 \| VEX_PP, OS);
		emitByte(EVEX_z << 7 \| EVEX_L2 << 6 \| VEX_L << 5 \| EVEX_b << 4 \|
		((~EVEX_V2) & 0x1) << 3 \| EVEX_aaa,
		OS);
		return;
		}
		}
		};

class X86MCCodeEmitter : public MCCodeEmitter {		class X86MCCodeEmitter : public MCCodeEmitter {
const MCInstrInfo &MCII;		const MCInstrInfo &MCII;
MCContext &Ctx;		MCContext &Ctx;

public:		public:
X86MCCodeEmitter(const MCInstrInfo &mcii, MCContext &ctx)		X86MCCodeEmitter(const MCInstrInfo &mcii, MCContext &ctx)
: MCII(mcii), Ctx(ctx) {}		: MCII(mcii), Ctx(ctx) {}
X86MCCodeEmitter(const X86MCCodeEmitter &) = delete;		X86MCCodeEmitter(const X86MCCodeEmitter &) = delete;
X86MCCodeEmitter &operator=(const X86MCCodeEmitter &) = delete;		X86MCCodeEmitter &operator=(const X86MCCodeEmitter &) = delete;
~X86MCCodeEmitter() override = default;		~X86MCCodeEmitter() override = default;

void emitPrefix(const MCInst &MI, raw_ostream &OS,		void emitPrefix(const MCInst &MI, raw_ostream &OS,
const MCSubtargetInfo &STI) const override;		const MCSubtargetInfo &STI) const override;

void encodeInstruction(const MCInst &MI, raw_ostream &OS,		void encodeInstruction(const MCInst &MI, raw_ostream &OS,
SmallVectorImpl<MCFixup> &Fixups,		SmallVectorImpl<MCFixup> &Fixups,
const MCSubtargetInfo &STI) const override;		const MCSubtargetInfo &STI) const override;

private:		private:
unsigned getX86RegNum(const MCOperand &MO) const;		unsigned getX86RegNum(const MCOperand &MO) const;

unsigned getX86RegEncoding(const MCInst &MI, unsigned OpNum) const;		unsigned getX86RegEncoding(const MCInst &MI, unsigned OpNum) const;

/// \param MI a single low-level machine instruction.
/// \param OpNum the operand #.
/// \returns true if the OpNumth operand of MI require a bit to be set in
/// REX prefix.
bool isREXExtendedReg(const MCInst &MI, unsigned OpNum) const;

void emitImmediate(const MCOperand &Disp, SMLoc Loc, unsigned ImmSize,		void emitImmediate(const MCOperand &Disp, SMLoc Loc, unsigned ImmSize,
MCFixupKind FixupKind, uint64_t StartByte, raw_ostream &OS,		MCFixupKind FixupKind, uint64_t StartByte, raw_ostream &OS,
SmallVectorImpl<MCFixup> &Fixups, int ImmOffset = 0) const;		SmallVectorImpl<MCFixup> &Fixups, int ImmOffset = 0) const;

void emitRegModRMByte(const MCOperand &ModRMReg, unsigned RegOpcodeFld,		void emitRegModRMByte(const MCOperand &ModRMReg, unsigned RegOpcodeFld,
raw_ostream &OS) const;		raw_ostream &OS) const;

void emitSIBByte(unsigned SS, unsigned Index, unsigned Base,		void emitSIBByte(unsigned SS, unsigned Index, unsigned Base,
raw_ostream &OS) const;		raw_ostream &OS) const;

void emitMemModRMByte(const MCInst &MI, unsigned Op, unsigned RegOpcodeField,		void emitMemModRMByte(const MCInst &MI, unsigned Op, unsigned RegOpcodeField,
uint64_t TSFlags, bool HasREX, uint64_t StartByte,		uint64_t TSFlags, PrefixKind Kind, uint64_t StartByte,
raw_ostream &OS, SmallVectorImpl<MCFixup> &Fixups,		raw_ostream &OS, SmallVectorImpl<MCFixup> &Fixups,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
bool ForceSIB = false) const;		bool ForceSIB = false) const;

bool emitPrefixImpl(unsigned &CurOp, const MCInst &MI,		PrefixKind emitPrefixImpl(unsigned &CurOp, const MCInst &MI,
const MCSubtargetInfo &STI, raw_ostream &OS) const;		const MCSubtargetInfo &STI, raw_ostream &OS) const;
		craig.topperUnsubmitted Done Reply Inline Actions clang-format craig.topper: clang-format

void emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,		PrefixKind emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,
raw_ostream &OS) const;		raw_ostream &OS) const;

void emitSegmentOverridePrefix(unsigned SegOperand, const MCInst &MI,		void emitSegmentOverridePrefix(unsigned SegOperand, const MCInst &MI,
raw_ostream &OS) const;		raw_ostream &OS) const;

bool emitOpcodePrefix(int MemOperand, const MCInst &MI,		PrefixKind emitOpcodePrefix(int MemOperand, const MCInst &MI,
const MCSubtargetInfo &STI, raw_ostream &OS) const;		const MCSubtargetInfo &STI,
		raw_ostream &OS) const;

bool emitREXPrefix(int MemOperand, const MCInst &MI,		PrefixKind emitREXPrefix(int MemOperand, const MCInst &MI,
const MCSubtargetInfo &STI, raw_ostream &OS) const;		const MCSubtargetInfo &STI, raw_ostream &OS) const;
};		};

} // end anonymous namespace		} // end anonymous namespace

static uint8_t modRMByte(unsigned Mod, unsigned RegOpcode, unsigned RM) {		static uint8_t modRMByte(unsigned Mod, unsigned RegOpcode, unsigned RM) {
assert(Mod < 4 && RegOpcode < 8 && RM < 8 && "ModRM Fields out of range!");		assert(Mod < 4 && RegOpcode < 8 && RM < 8 && "ModRM Fields out of range!");
return RM \| (RegOpcode << 3) \| (Mod << 6);		return RM \| (RegOpcode << 3) \| (Mod << 6);
}		}

static void emitByte(uint8_t C, raw_ostream &OS) { OS << static_cast<char>(C); }

static void emitConstant(uint64_t Val, unsigned Size, raw_ostream &OS) {		static void emitConstant(uint64_t Val, unsigned Size, raw_ostream &OS) {
// Output the constant in little endian byte order.		// Output the constant in little endian byte order.
for (unsigned i = 0; i != Size; ++i) {		for (unsigned i = 0; i != Size; ++i) {
emitByte(Val & 255, OS);		emitByte(Val & 255, OS);
Val >>= 8;		Val >>= 8;
}		}
}		}

▲ Show 20 Lines • Show All 95 Lines • ▼ Show 20 Lines	unsigned X86MCCodeEmitter::getX86RegNum(const MCOperand &MO) const {
return Ctx.getRegisterInfo()->getEncodingValue(MO.getReg()) & 0x7;		return Ctx.getRegisterInfo()->getEncodingValue(MO.getReg()) & 0x7;
}		}

unsigned X86MCCodeEmitter::getX86RegEncoding(const MCInst &MI,		unsigned X86MCCodeEmitter::getX86RegEncoding(const MCInst &MI,
unsigned OpNum) const {		unsigned OpNum) const {
return Ctx.getRegisterInfo()->getEncodingValue(MI.getOperand(OpNum).getReg());		return Ctx.getRegisterInfo()->getEncodingValue(MI.getOperand(OpNum).getReg());
}		}

/// \param MI a single low-level machine instruction.
/// \param OpNum the operand #.
/// \returns true if the OpNumth operand of MI require a bit to be set in
/// REX prefix.
bool X86MCCodeEmitter::isREXExtendedReg(const MCInst &MI,
unsigned OpNum) const {
return (getX86RegEncoding(MI, OpNum) >> 3) & 1;
}

void X86MCCodeEmitter::emitImmediate(const MCOperand &DispOp, SMLoc Loc,		void X86MCCodeEmitter::emitImmediate(const MCOperand &DispOp, SMLoc Loc,
unsigned Size, MCFixupKind FixupKind,		unsigned Size, MCFixupKind FixupKind,
uint64_t StartByte, raw_ostream &OS,		uint64_t StartByte, raw_ostream &OS,
SmallVectorImpl<MCFixup> &Fixups,		SmallVectorImpl<MCFixup> &Fixups,
int ImmOffset) const {		int ImmOffset) const {
const MCExpr *Expr = nullptr;		const MCExpr *Expr = nullptr;
if (DispOp.isImm()) {		if (DispOp.isImm()) {
// If this is a simple integer displacement that doesn't require a		// If this is a simple integer displacement that doesn't require a
▲ Show 20 Lines • Show All 76 Lines • ▼ Show 20 Lines
void X86MCCodeEmitter::emitSIBByte(unsigned SS, unsigned Index, unsigned Base,		void X86MCCodeEmitter::emitSIBByte(unsigned SS, unsigned Index, unsigned Base,
raw_ostream &OS) const {		raw_ostream &OS) const {
// SIB byte is in the same format as the modRMByte.		// SIB byte is in the same format as the modRMByte.
emitByte(modRMByte(SS, Index, Base), OS);		emitByte(modRMByte(SS, Index, Base), OS);
}		}

void X86MCCodeEmitter::emitMemModRMByte(const MCInst &MI, unsigned Op,		void X86MCCodeEmitter::emitMemModRMByte(const MCInst &MI, unsigned Op,
unsigned RegOpcodeField,		unsigned RegOpcodeField,
uint64_t TSFlags, bool HasREX,		uint64_t TSFlags, PrefixKind Kind,
uint64_t StartByte, raw_ostream &OS,		uint64_t StartByte, raw_ostream &OS,
SmallVectorImpl<MCFixup> &Fixups,		SmallVectorImpl<MCFixup> &Fixups,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
bool ForceSIB) const {		bool ForceSIB) const {
const MCOperand &Disp = MI.getOperand(Op + X86::AddrDisp);		const MCOperand &Disp = MI.getOperand(Op + X86::AddrDisp);
const MCOperand &Base = MI.getOperand(Op + X86::AddrBaseReg);		const MCOperand &Base = MI.getOperand(Op + X86::AddrBaseReg);
const MCOperand &Scale = MI.getOperand(Op + X86::AddrScaleAmt);		const MCOperand &Scale = MI.getOperand(Op + X86::AddrScaleAmt);
const MCOperand &IndexReg = MI.getOperand(Op + X86::AddrIndexReg);		const MCOperand &IndexReg = MI.getOperand(Op + X86::AddrIndexReg);
Show All 19 Lines	unsigned FixupKind = [&]() {
// directly if the symbol ends up in the same linkage unit.		// directly if the symbol ends up in the same linkage unit.
switch (Opcode) {		switch (Opcode) {
default:		default:
return X86::reloc_riprel_4byte;		return X86::reloc_riprel_4byte;
case X86::MOV64rm:		case X86::MOV64rm:
// movq loads is a subset of reloc_riprel_4byte_relax_rex. It is a		// movq loads is a subset of reloc_riprel_4byte_relax_rex. It is a
// special case because COFF and Mach-O don't support ELF's more		// special case because COFF and Mach-O don't support ELF's more
// flexible R_X86_64_REX_GOTPCRELX relaxation.		// flexible R_X86_64_REX_GOTPCRELX relaxation.
assert(HasREX);		assert(Kind == REX);
return X86::reloc_riprel_4byte_movq_load;		return X86::reloc_riprel_4byte_movq_load;
case X86::ADC32rm:		case X86::ADC32rm:
case X86::ADD32rm:		case X86::ADD32rm:
case X86::AND32rm:		case X86::AND32rm:
case X86::CMP32rm:		case X86::CMP32rm:
case X86::MOV32rm:		case X86::MOV32rm:
case X86::OR32rm:		case X86::OR32rm:
case X86::SBB32rm:		case X86::SBB32rm:
case X86::SUB32rm:		case X86::SUB32rm:
case X86::TEST32mr:		case X86::TEST32mr:
case X86::XOR32rm:		case X86::XOR32rm:
case X86::CALL64m:		case X86::CALL64m:
case X86::JMP64m:		case X86::JMP64m:
case X86::TAILJMPm64:		case X86::TAILJMPm64:
case X86::TEST64mr:		case X86::TEST64mr:
case X86::ADC64rm:		case X86::ADC64rm:
case X86::ADD64rm:		case X86::ADD64rm:
case X86::AND64rm:		case X86::AND64rm:
case X86::CMP64rm:		case X86::CMP64rm:
case X86::OR64rm:		case X86::OR64rm:
case X86::SBB64rm:		case X86::SBB64rm:
case X86::SUB64rm:		case X86::SUB64rm:
case X86::XOR64rm:		case X86::XOR64rm:
return HasREX ? X86::reloc_riprel_4byte_relax_rex		return Kind == REX ? X86::reloc_riprel_4byte_relax_rex
: X86::reloc_riprel_4byte_relax;		: X86::reloc_riprel_4byte_relax;
}		}
}();		}();

// rip-relative addressing is actually relative to the next instruction.		// rip-relative addressing is actually relative to the next instruction.
// Since an immediate can follow the mod/rm byte for an instruction, this		// Since an immediate can follow the mod/rm byte for an instruction, this
// means that we need to bias the displacement field of the instruction with		// means that we need to bias the displacement field of the instruction with
// the size of the immediate field. If we have this case, add it into the		// the size of the immediate field. If we have this case, add it into the
// expression to emit.		// expression to emit.
▲ Show 20 Lines • Show All 194 Lines • ▼ Show 20 Lines	if (ForceDisp8)
emitImmediate(Disp, MI.getLoc(), 1, FK_Data_1, StartByte, OS, Fixups,		emitImmediate(Disp, MI.getLoc(), 1, FK_Data_1, StartByte, OS, Fixups,
ImmOffset);		ImmOffset);
else if (ForceDisp32)		else if (ForceDisp32)
emitImmediate(Disp, MI.getLoc(), 4, MCFixupKind(X86::reloc_signed_4byte),		emitImmediate(Disp, MI.getLoc(), 4, MCFixupKind(X86::reloc_signed_4byte),
StartByte, OS, Fixups);		StartByte, OS, Fixups);
}		}

/// Emit all instruction prefixes.		/// Emit all instruction prefixes.
///		///
/// \returns true if REX prefix is used, otherwise returns false.		/// \returns one of the REX, XOP, VEX2, VEX3, EVEX if any of them is used,
		YikarusUnsubmitted Done Reply Inline Actions Need rewrite the comment Yikarus: Need rewrite the comment
bool X86MCCodeEmitter::emitPrefixImpl(unsigned &CurOp, const MCInst &MI,		/// otherwise returns None.
		PrefixKind X86MCCodeEmitter::emitPrefixImpl(unsigned &CurOp, const MCInst &MI,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
		pengfeiUnsubmitted Done Reply Inline Actions Format. pengfei: Format.
		skanAuthorUnsubmitted Done Reply Inline Actions Will do skan: Will do
raw_ostream &OS) const {		raw_ostream &OS) const {
uint64_t TSFlags = MCII.get(MI.getOpcode()).TSFlags;		uint64_t TSFlags = MCII.get(MI.getOpcode()).TSFlags;
// Determine where the memory operand starts, if present.		// Determine where the memory operand starts, if present.
int MemoryOperand = X86II::getMemoryOperandNo(TSFlags);		int MemoryOperand = X86II::getMemoryOperandNo(TSFlags);
// Emit segment override opcode prefix as needed.		// Emit segment override opcode prefix as needed.
if (MemoryOperand != -1) {		if (MemoryOperand != -1) {
MemoryOperand += CurOp;		MemoryOperand += CurOp;
emitSegmentOverridePrefix(MemoryOperand + X86::AddrSegmentReg, MI, OS);		emitSegmentOverridePrefix(MemoryOperand + X86::AddrSegmentReg, MI, OS);
}		}
Show All 36 Lines	case X86II::RawFrmMemOffs: {
// Emit segment override opcode prefix as needed.		// Emit segment override opcode prefix as needed.
emitSegmentOverridePrefix(1, MI, OS);		emitSegmentOverridePrefix(1, MI, OS);
break;		break;
}		}
}		}

// REX prefix is optional, but if used must be immediately before the opcode		// REX prefix is optional, but if used must be immediately before the opcode
// Encoding type for this instruction.		// Encoding type for this instruction.
uint64_t Encoding = TSFlags & X86II::EncodingMask;		return (TSFlags & X86II::EncodingMask)
bool HasREX = false;		? emitVEXOpcodePrefix(MemoryOperand, MI, OS)
if (Encoding)		: emitOpcodePrefix(MemoryOperand, MI, STI, OS);
		craig.topperUnsubmitted Done Reply Inline Actions The ?: can just be combined into the return here. No need for a variable. craig.topper: The ?: can just be combined into the return here. No need for a variable.
		skanAuthorUnsubmitted Done Reply Inline Actions Good point! skan: Good point!
emitVEXOpcodePrefix(MemoryOperand, MI, OS);
else
HasREX = emitOpcodePrefix(MemoryOperand, MI, STI, OS);

return HasREX;
}		}

/// AVX instructions are encoded using a opcode prefix called VEX.		// AVX instructions are encoded using an encoding scheme that combines
void X86MCCodeEmitter::emitVEXOpcodePrefix(int MemOperand, const MCInst &MI,		// prefix bytes, opcode extension field, operand encoding fields, and vector
		// length encoding capability into a new prefix, referred to as VEX.

		// The majority of the AVX-512 family of instructions (operating on
		// 512/256/128-bit vector register operands) are encoded using a new prefix
		// (called EVEX).

		// XOP is a revised subset of what was originally intended as SSE5. It was
		// changed to be similar but not overlapping with AVX.

		/// Emit XOP, VEX2, VEX3 or EVEX prefix.
		/// \returns the used prefix.
		PrefixKind X86MCCodeEmitter::emitVEXOpcodePrefix(int MemOperand,
		const MCInst &MI,
raw_ostream &OS) const {		raw_ostream &OS) const {
const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());
uint64_t TSFlags = Desc.TSFlags;		uint64_t TSFlags = Desc.TSFlags;

assert(!(TSFlags & X86II::LOCK) && "Can't have LOCK VEX.");		assert(!(TSFlags & X86II::LOCK) && "Can't have LOCK VEX.");

uint64_t Encoding = TSFlags & X86II::EncodingMask;		X86OpcodePrefixHelper Prefix(*Ctx.getRegisterInfo());
		switch (TSFlags & X86II::EncodingMask) {
		default:
		break;
		case X86II::XOP:
		Prefix.setLowerBound(XOP);
		pengfeiUnsubmitted Done Reply Inline Actions Should be better to use `setEncoding`? pengfei: Should be better to use `setEncoding`?
		skanAuthorUnsubmitted Done Reply Inline Actions I think `setLowerBound` is more clear than `setEncoding` here b/c VEX and REX are determined at the last stage. e.g `Prefix.setEncoding(VEX2)` here is counterintuitive. skan: I think `setLowerBound` is more clear than `setEncoding` here b/c VEX and REX are determined at…
		break;
		case X86II::VEX:
		// VEX can be 2 byte or 3 byte, not determined yet if not explicit
		craig.topperUnsubmitted Done Reply Inline Actions explicit* craig.topper: explicit*
		Prefix.setLowerBound(MI.getFlags() & X86::IP_USE_VEX3 ? VEX3 : VEX2);
		break;
		case X86II::EVEX:
		Prefix.setLowerBound(EVEX);
		break;
		}

		Prefix.setW(TSFlags & X86II::VEX_W);

		pengfeiUnsubmitted Done Reply Inline Actions Should be better to move below logic into `X86OpcodePrefix` ? Then we can simplify the function to: X86OpcodePrefix Prefix(Ctx.getRegisterInfo(), TSFlags); Prefix.emit(OS); return Prefix.getKind(); pengfei:* Should be better to move below logic into `X86OpcodePrefix `? Then we can simplify the function…
		skanAuthorUnsubmitted Done Reply Inline Actions We can not b/c we still need to read the operands the instruction before the final emit. We shouldn't b/c the first principle mentioned in the summary: "Make code clearer by separating the logic of setting bits from the logic of how a prefix is encoded" skan: We can not b/c we still need to read the operands the instruction before the final emit. We…
bool HasEVEX_K = TSFlags & X86II::EVEX_K;		bool HasEVEX_K = TSFlags & X86II::EVEX_K;
bool HasVEX_4V = TSFlags & X86II::VEX_4V;		bool HasVEX_4V = TSFlags & X86II::VEX_4V;
bool HasEVEX_RC = TSFlags & X86II::EVEX_RC;		bool HasEVEX_RC = TSFlags & X86II::EVEX_RC;

// VEX_R: opcode externsion equivalent to REX.R in
// 1's complement (inverted) form
//
// 1: Same as REX_R=0 (must be 1 in 32-bit mode)
// 0: Same as REX_R=1 (64 bit mode only)
//
uint8_t VEX_R = 0x1;
uint8_t EVEX_R2 = 0x1;

// VEX_X: equivalent to REX.X, only used when a
// register is used for index in SIB Byte.
//
// 1: Same as REX.X=0 (must be 1 in 32-bit mode)
// 0: Same as REX.X=1 (64-bit mode only)
uint8_t VEX_X = 0x1;

// VEX_B:
//
// 1: Same as REX_B=0 (ignored in 32-bit mode)
// 0: Same as REX_B=1 (64 bit mode only)
//
uint8_t VEX_B = 0x1;

// VEX_W: opcode specific (use like REX.W, or used for
// opcode extension, or ignored, depending on the opcode byte)
uint8_t VEX_W = (TSFlags & X86II::VEX_W) ? 1 : 0;

// VEX_5M (VEX m-mmmmm field):
//
// 0b00000: Reserved for future use
// 0b00001: implied 0F leading opcode
// 0b00010: implied 0F 38 leading opcode bytes
// 0b00011: implied 0F 3A leading opcode bytes
// 0b00100: Reserved for future use
// 0b00101: VEX MAP5
// 0b00110: VEX MAP6
// 0b00111-0b11111: Reserved for future use
// 0b01000: XOP map select - 08h instructions with imm byte
// 0b01001: XOP map select - 09h instructions with no imm byte
// 0b01010: XOP map select - 0Ah instructions with imm dword
uint8_t VEX_5M;
switch (TSFlags & X86II::OpMapMask) {		switch (TSFlags & X86II::OpMapMask) {
default:		default:
llvm_unreachable("Invalid prefix!");		llvm_unreachable("Invalid prefix!");
case X86II::TB:		case X86II::TB:
VEX_5M = 0x1;		Prefix.set5M(0x1); // 0F
break; // 0F		break;
case X86II::T8:		case X86II::T8:
VEX_5M = 0x2;		Prefix.set5M(0x2); // 0F 38
break; // 0F 38		break;
case X86II::TA:		case X86II::TA:
VEX_5M = 0x3;		Prefix.set5M(0x3); // 0F 3A
break; // 0F 3A		break;
case X86II::XOP8:		case X86II::XOP8:
VEX_5M = 0x8;		Prefix.set5M(0x8);
break;		break;
case X86II::XOP9:		case X86II::XOP9:
VEX_5M = 0x9;		Prefix.set5M(0x9);
break;		break;
case X86II::XOPA:		case X86II::XOPA:
VEX_5M = 0xA;		Prefix.set5M(0xA);
break;		break;
case X86II::T_MAP5:		case X86II::T_MAP5:
VEX_5M = 0x5;		Prefix.set5M(0x5);
break;		break;
case X86II::T_MAP6:		case X86II::T_MAP6:
VEX_5M = 0x6;		Prefix.set5M(0x6);
break;		break;
}		}

// VEX_4V (VEX vvvv field): a register specifier		Prefix.setL(TSFlags & X86II::VEX_L);
// (in 1's complement form) or 1111 if unused.		Prefix.setL2(TSFlags & X86II::EVEX_L2);
uint8_t VEX_4V = 0xf;
uint8_t EVEX_V2 = 0x1;

// EVEX_L2/VEX_L (Vector Length):
//
// L2 L
// 0 0: scalar or 128-bit vector
// 0 1: 256-bit vector
// 1 0: 512-bit vector
//
uint8_t VEX_L = (TSFlags & X86II::VEX_L) ? 1 : 0;
uint8_t EVEX_L2 = (TSFlags & X86II::EVEX_L2) ? 1 : 0;

// VEX_PP: opcode extension providing equivalent
// functionality of a SIMD prefix
//
// 0b00: None
// 0b01: 66
// 0b10: F3
// 0b11: F2
//
uint8_t VEX_PP = 0;
switch (TSFlags & X86II::OpPrefixMask) {		switch (TSFlags & X86II::OpPrefixMask) {
case X86II::PD:		case X86II::PD:
VEX_PP = 0x1;		Prefix.setPP(0x1); // 66
break; // 66		break;
case X86II::XS:		case X86II::XS:
VEX_PP = 0x2;		Prefix.setPP(0x2); // F3
break; // F3		break;
case X86II::XD:		case X86II::XD:
VEX_PP = 0x3;		Prefix.setPP(0x3); // F2
break; // F2		break;
}		}

// EVEX_U		Prefix.setZ(HasEVEX_K && (TSFlags & X86II::EVEX_Z));
uint8_t EVEX_U = 1; // Always '1' so far		Prefix.setEVEX_b(TSFlags & X86II::EVEX_B);

// EVEX_z
uint8_t EVEX_z = (HasEVEX_K && (TSFlags & X86II::EVEX_Z)) ? 1 : 0;

// EVEX_b
uint8_t EVEX_b = (TSFlags & X86II::EVEX_B) ? 1 : 0;

// EVEX_rc
uint8_t EVEX_rc = 0;

// EVEX_aaa
uint8_t EVEX_aaa = 0;

bool EncodeRC = false;		bool EncodeRC = false;
		uint8_t EVEX_rc = 0;
// Classify VEX_B, VEX_4V, VEX_R, VEX_X
unsigned NumOps = Desc.getNumOperands();
unsigned CurOp = X86II::getOperandBias(Desc);		unsigned CurOp = X86II::getOperandBias(Desc);

switch (TSFlags & X86II::FormMask) {		switch (TSFlags & X86II::FormMask) {
default:		default:
llvm_unreachable("Unexpected form in emitVEXOpcodePrefix!");		llvm_unreachable("Unexpected form in emitVEXOpcodePrefix!");
case X86II::MRMDestMem4VOp3CC: {		case X86II::MRMDestMem4VOp3CC: {
// MemAddr, src1(ModR/M), src2(VEX_4V)		// MemAddr, src1(ModR/M), src2(VEX_4V)
unsigned BaseRegEnc = getX86RegEncoding(MI, MemOperand + X86::AddrBaseReg);		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
VEX_B = ~(BaseRegEnc >> 3) & 1;		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
unsigned IndexRegEnc =
getX86RegEncoding(MI, MemOperand + X86::AddrIndexReg);
VEX_X = ~(IndexRegEnc >> 3) & 1;

CurOp += X86::AddrNumOperands;		CurOp += X86::AddrNumOperands;
		Prefix.setR(MI, ++CurOp);
unsigned RegEnc = getX86RegEncoding(MI, ++CurOp);		Prefix.set4V(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;

unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
break;		break;
}		}
case X86II::MRM_C0:		case X86II::MRM_C0:
case X86II::RawFrm:		case X86II::RawFrm:
case X86II::PrefixByte:		case X86II::PrefixByte:
break;		break;
case X86II::MRMDestMemFSIB:		case X86II::MRMDestMemFSIB:
case X86II::MRMDestMem: {		case X86II::MRMDestMem: {
// MRMDestMem instructions forms:		// MRMDestMem instructions forms:
// MemAddr, src1(ModR/M)		// MemAddr, src1(ModR/M)
// MemAddr, src1(VEX_4V), src2(ModR/M)		// MemAddr, src1(VEX_4V), src2(ModR/M)
// MemAddr, src1(ModR/M), imm8		// MemAddr, src1(ModR/M), imm8
//		//
unsigned BaseRegEnc = getX86RegEncoding(MI, MemOperand + X86::AddrBaseReg);		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
VEX_B = ~(BaseRegEnc >> 3) & 1;		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
unsigned IndexRegEnc =
getX86RegEncoding(MI, MemOperand + X86::AddrIndexReg);
VEX_X = ~(IndexRegEnc >> 3) & 1;
if (!HasVEX_4V) // Only needed with VSIB which don't use VVVV.		if (!HasVEX_4V) // Only needed with VSIB which don't use VVVV.
EVEX_V2 = ~(IndexRegEnc >> 4) & 1;		Prefix.setV2(MI, MemOperand + X86::AddrIndexReg);

CurOp += X86::AddrNumOperands;		CurOp += X86::AddrNumOperands;

if (HasEVEX_K)		if (HasEVEX_K)
EVEX_aaa = getX86RegEncoding(MI, CurOp++);		Prefix.setAAA(MI, CurOp++);

if (HasVEX_4V) {		if (HasVEX_4V)
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.set4VV2(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
EVEX_V2 = ~(VRegEnc >> 4) & 1;
}

unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setRR2(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;
EVEX_R2 = ~(RegEnc >> 4) & 1;
break;		break;
}		}
case X86II::MRMSrcMemFSIB:		case X86II::MRMSrcMemFSIB:
case X86II::MRMSrcMem: {		case X86II::MRMSrcMem: {
// MRMSrcMem instructions forms:		// MRMSrcMem instructions forms:
// src1(ModR/M), MemAddr		// src1(ModR/M), MemAddr
// src1(ModR/M), src2(VEX_4V), MemAddr		// src1(ModR/M), src2(VEX_4V), MemAddr
// src1(ModR/M), MemAddr, imm8		// src1(ModR/M), MemAddr, imm8
// src1(ModR/M), MemAddr, src2(Imm[7:4])		// src1(ModR/M), MemAddr, src2(Imm[7:4])
//		//
// FMA4:		// FMA4:
// dst(ModR/M.reg), src1(VEX_4V), src2(ModR/M), src3(Imm[7:4])		// dst(ModR/M.reg), src1(VEX_4V), src2(ModR/M), src3(Imm[7:4])
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setRR2(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;
EVEX_R2 = ~(RegEnc >> 4) & 1;

if (HasEVEX_K)		if (HasEVEX_K)
EVEX_aaa = getX86RegEncoding(MI, CurOp++);		Prefix.setAAA(MI, CurOp++);

if (HasVEX_4V) {		if (HasVEX_4V)
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.set4VV2(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
EVEX_V2 = ~(VRegEnc >> 4) & 1;
}

unsigned BaseRegEnc = getX86RegEncoding(MI, MemOperand + X86::AddrBaseReg);		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
VEX_B = ~(BaseRegEnc >> 3) & 1;		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
unsigned IndexRegEnc =
getX86RegEncoding(MI, MemOperand + X86::AddrIndexReg);
VEX_X = ~(IndexRegEnc >> 3) & 1;
if (!HasVEX_4V) // Only needed with VSIB which don't use VVVV.		if (!HasVEX_4V) // Only needed with VSIB which don't use VVVV.
EVEX_V2 = ~(IndexRegEnc >> 4) & 1;		Prefix.setV2(MI, MemOperand + X86::AddrIndexReg);

break;		break;
}		}
case X86II::MRMSrcMem4VOp3: {		case X86II::MRMSrcMem4VOp3: {
// Instruction format for 4VOp3:		// Instruction format for 4VOp3:
// src1(ModR/M), MemAddr, src3(VEX_4V)		// src1(ModR/M), MemAddr, src3(VEX_4V)
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setR(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
unsigned BaseRegEnc = getX86RegEncoding(MI, MemOperand + X86::AddrBaseReg);		Prefix.set4V(MI, CurOp + X86::AddrNumOperands);
VEX_B = ~(BaseRegEnc >> 3) & 1;
unsigned IndexRegEnc =
getX86RegEncoding(MI, MemOperand + X86::AddrIndexReg);
VEX_X = ~(IndexRegEnc >> 3) & 1;

VEX_4V = ~getX86RegEncoding(MI, CurOp + X86::AddrNumOperands) & 0xf;
break;		break;
}		}
case X86II::MRMSrcMemOp4: {		case X86II::MRMSrcMemOp4: {
// dst(ModR/M.reg), src1(VEX_4V), src2(Imm[7:4]), src3(ModR/M),		// dst(ModR/M.reg), src1(VEX_4V), src2(Imm[7:4]), src3(ModR/M),
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setR(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;		Prefix.set4V(MI, CurOp++);
		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
VEX_4V = ~VRegEnc & 0xf;

unsigned BaseRegEnc = getX86RegEncoding(MI, MemOperand + X86::AddrBaseReg);
VEX_B = ~(BaseRegEnc >> 3) & 1;
unsigned IndexRegEnc =
getX86RegEncoding(MI, MemOperand + X86::AddrIndexReg);
VEX_X = ~(IndexRegEnc >> 3) & 1;
break;		break;
}		}
case X86II::MRM0m:		case X86II::MRM0m:
case X86II::MRM1m:		case X86II::MRM1m:
case X86II::MRM2m:		case X86II::MRM2m:
case X86II::MRM3m:		case X86II::MRM3m:
case X86II::MRM4m:		case X86II::MRM4m:
case X86II::MRM5m:		case X86II::MRM5m:
case X86II::MRM6m:		case X86II::MRM6m:
case X86II::MRM7m: {		case X86II::MRM7m: {
// MRM[0-9]m instructions forms:		// MRM[0-9]m instructions forms:
// MemAddr		// MemAddr
// src1(VEX_4V), MemAddr		// src1(VEX_4V), MemAddr
if (HasVEX_4V) {		if (HasVEX_4V)
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.set4VV2(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
EVEX_V2 = ~(VRegEnc >> 4) & 1;
}

if (HasEVEX_K)		if (HasEVEX_K)
EVEX_aaa = getX86RegEncoding(MI, CurOp++);		Prefix.setAAA(MI, CurOp++);

unsigned BaseRegEnc = getX86RegEncoding(MI, MemOperand + X86::AddrBaseReg);		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
VEX_B = ~(BaseRegEnc >> 3) & 1;		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
unsigned IndexRegEnc =
getX86RegEncoding(MI, MemOperand + X86::AddrIndexReg);
VEX_X = ~(IndexRegEnc >> 3) & 1;
if (!HasVEX_4V) // Only needed with VSIB which don't use VVVV.		if (!HasVEX_4V) // Only needed with VSIB which don't use VVVV.
EVEX_V2 = ~(IndexRegEnc >> 4) & 1;		Prefix.setV2(MI, MemOperand + X86::AddrIndexReg);

break;		break;
}		}
case X86II::MRMSrcReg: {		case X86II::MRMSrcReg: {
// MRMSrcReg instructions forms:		// MRMSrcReg instructions forms:
// dst(ModR/M), src1(VEX_4V), src2(ModR/M), src3(Imm[7:4])		// dst(ModR/M), src1(VEX_4V), src2(ModR/M), src3(Imm[7:4])
// dst(ModR/M), src1(ModR/M)		// dst(ModR/M), src1(ModR/M)
// dst(ModR/M), src1(ModR/M), imm8		// dst(ModR/M), src1(ModR/M), imm8
//		//
// FMA4:		// FMA4:
// dst(ModR/M.reg), src1(VEX_4V), src2(Imm[7:4]), src3(ModR/M),		// dst(ModR/M.reg), src1(VEX_4V), src2(Imm[7:4]), src3(ModR/M),
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setRR2(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;
EVEX_R2 = ~(RegEnc >> 4) & 1;

if (HasEVEX_K)		if (HasEVEX_K)
EVEX_aaa = getX86RegEncoding(MI, CurOp++);		Prefix.setAAA(MI, CurOp++);

if (HasVEX_4V) {		if (HasVEX_4V)
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.set4VV2(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
EVEX_V2 = ~(VRegEnc >> 4) & 1;
}

RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setB(MI, CurOp);
VEX_B = ~(RegEnc >> 3) & 1;		Prefix.setX(MI, CurOp, 4);
VEX_X = ~(RegEnc >> 4) & 1;		++CurOp;

if (EVEX_b) {		if (TSFlags & X86II::EVEX_B) {
if (HasEVEX_RC) {		if (HasEVEX_RC) {
		unsigned NumOps = Desc.getNumOperands();
unsigned RcOperand = NumOps - 1;		unsigned RcOperand = NumOps - 1;
assert(RcOperand >= CurOp);		assert(RcOperand >= CurOp);
EVEX_rc = MI.getOperand(RcOperand).getImm();		EVEX_rc = MI.getOperand(RcOperand).getImm();
assert(EVEX_rc <= 3 && "Invalid rounding control!");		assert(EVEX_rc <= 3 && "Invalid rounding control!");
}		}
EncodeRC = true;		EncodeRC = true;
}		}
break;		break;
}		}
case X86II::MRMSrcReg4VOp3: {		case X86II::MRMSrcReg4VOp3: {
// Instruction format for 4VOp3:		// Instruction format for 4VOp3:
// src1(ModR/M), src2(ModR/M), src3(VEX_4V)		// src1(ModR/M), src2(ModR/M), src3(VEX_4V)
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setR(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;		Prefix.setB(MI, CurOp++);
		Prefix.set4V(MI, CurOp++);
RegEnc = getX86RegEncoding(MI, CurOp++);
VEX_B = ~(RegEnc >> 3) & 1;

VEX_4V = ~getX86RegEncoding(MI, CurOp++) & 0xf;
break;		break;
}		}
case X86II::MRMSrcRegOp4: {		case X86II::MRMSrcRegOp4: {
// dst(ModR/M.reg), src1(VEX_4V), src2(Imm[7:4]), src3(ModR/M),		// dst(ModR/M.reg), src1(VEX_4V), src2(Imm[7:4]), src3(ModR/M),
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setR(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;		Prefix.set4V(MI, CurOp++);

unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;

// Skip second register source (encoded in Imm[7:4])		// Skip second register source (encoded in Imm[7:4])
++CurOp;		++CurOp;

RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setB(MI, CurOp);
VEX_B = ~(RegEnc >> 3) & 1;		Prefix.setX(MI, CurOp, 4);
VEX_X = ~(RegEnc >> 4) & 1;		++CurOp;
break;		break;
}		}
case X86II::MRMDestReg: {		case X86II::MRMDestReg: {
// MRMDestReg instructions forms:		// MRMDestReg instructions forms:
// dst(ModR/M), src(ModR/M)		// dst(ModR/M), src(ModR/M)
// dst(ModR/M), src(ModR/M), imm8		// dst(ModR/M), src(ModR/M), imm8
// dst(ModR/M), src1(VEX_4V), src2(ModR/M)		// dst(ModR/M), src1(VEX_4V), src2(ModR/M)
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setB(MI, CurOp);
VEX_B = ~(RegEnc >> 3) & 1;		Prefix.setX(MI, CurOp, 4);
VEX_X = ~(RegEnc >> 4) & 1;		++CurOp;

if (HasEVEX_K)		if (HasEVEX_K)
EVEX_aaa = getX86RegEncoding(MI, CurOp++);		Prefix.setAAA(MI, CurOp++);

if (HasVEX_4V) {		if (HasVEX_4V)
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.set4VV2(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
EVEX_V2 = ~(VRegEnc >> 4) & 1;
}

RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setRR2(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;		if (TSFlags & X86II::EVEX_B)
EVEX_R2 = ~(RegEnc >> 4) & 1;
if (EVEX_b)
EncodeRC = true;		EncodeRC = true;
break;		break;
}		}
case X86II::MRMr0: {		case X86II::MRMr0: {
// MRMr0 instructions forms:		// MRMr0 instructions forms:
// 11:rrr:000		// 11:rrr:000
// dst(ModR/M)		// dst(ModR/M)
unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setRR2(MI, CurOp++);
VEX_R = ~(RegEnc >> 3) & 1;
EVEX_R2 = ~(RegEnc >> 4) & 1;
break;		break;
}		}
case X86II::MRM0r:		case X86II::MRM0r:
case X86II::MRM1r:		case X86II::MRM1r:
case X86II::MRM2r:		case X86II::MRM2r:
case X86II::MRM3r:		case X86II::MRM3r:
case X86II::MRM4r:		case X86II::MRM4r:
case X86II::MRM5r:		case X86II::MRM5r:
case X86II::MRM6r:		case X86II::MRM6r:
case X86II::MRM7r: {		case X86II::MRM7r: {
// MRM0r-MRM7r instructions forms:		// MRM0r-MRM7r instructions forms:
// dst(VEX_4V), src(ModR/M), imm8		// dst(VEX_4V), src(ModR/M), imm8
if (HasVEX_4V) {		if (HasVEX_4V)
unsigned VRegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.set4VV2(MI, CurOp++);
VEX_4V = ~VRegEnc & 0xf;
EVEX_V2 = ~(VRegEnc >> 4) & 1;
}
if (HasEVEX_K)		if (HasEVEX_K)
EVEX_aaa = getX86RegEncoding(MI, CurOp++);		Prefix.setAAA(MI, CurOp++);

unsigned RegEnc = getX86RegEncoding(MI, CurOp++);		Prefix.setB(MI, CurOp);
VEX_B = ~(RegEnc >> 3) & 1;		Prefix.setX(MI, CurOp, 4);
VEX_X = ~(RegEnc >> 4) & 1;		++CurOp;
break;		break;
}		}
}		}
		if (EncodeRC) {
if (Encoding == X86II::VEX \|\| Encoding == X86II::XOP) {		Prefix.setL(EVEX_rc & 0x1);
// VEX opcode prefix can have 2 or 3 bytes		Prefix.setL2(EVEX_rc & 0x2);
//		}
// 3 bytes:		PrefixKind Kind = Prefix.determineOptimalKind();
		craig.topperUnsubmitted Done Reply Inline Actions The names of these functions is confusing. getKind followed by setKind looks odd. I would naively expect setKind to set the thing that getKind returns. So getKind followed by setKind looks redundant. craig.topper: The names of these functions is confusing. getKind followed by setKind looks odd. I would…
		skanAuthorUnsubmitted Done Reply Inline Actions You're right. I re-designed the interface to `setLowerBound`, `determineOptimalKind`. skan: You're right. I re-designed the interface to `setLowerBound`, `determineOptimalKind`.
// +-----+ +--------------+ +-------------------+		Prefix.emit(OS);
// \| C4h \| \| RXB \| m-mmmm \| \| W \| vvvv \| L \| pp \|		return Kind;
// +-----+ +--------------+ +-------------------+
// 2 bytes:
// +-----+ +-------------------+
// \| C5h \| \| R \| vvvv \| L \| pp \|
// +-----+ +-------------------+
//
// XOP uses a similar prefix:
// +-----+ +--------------+ +-------------------+
// \| 8Fh \| \| RXB \| m-mmmm \| \| W \| vvvv \| L \| pp \|
// +-----+ +--------------+ +-------------------+
uint8_t LastByte = VEX_PP \| (VEX_L << 2) \| (VEX_4V << 3);

// Can we use the 2 byte VEX prefix?
if (!(MI.getFlags() & X86::IP_USE_VEX3) && Encoding == X86II::VEX &&
VEX_B && VEX_X && !VEX_W && (VEX_5M == 1)) {
emitByte(0xC5, OS);
emitByte(LastByte \| (VEX_R << 7), OS);
return;
}

// 3 byte VEX prefix
emitByte(Encoding == X86II::XOP ? 0x8F : 0xC4, OS);
emitByte(VEX_R << 7 \| VEX_X << 6 \| VEX_B << 5 \| VEX_5M, OS);
emitByte(LastByte \| (VEX_W << 7), OS);
} else {
assert(Encoding == X86II::EVEX && "unknown encoding!");
// EVEX opcode prefix can have 4 bytes
//
// +-----+ +--------------+ +-------------------+ +------------------------+
// \| 62h \| \| RXBR' \| 0mmm \| \| W \| vvvv \| U \| pp \| \| z \| L'L \| b \| v' \| aaa \|
// +-----+ +--------------+ +-------------------+ +------------------------+
assert((VEX_5M & 0x7) == VEX_5M &&
"More than 3 significant bits in VEX.m-mmmm fields for EVEX!");

emitByte(0x62, OS);
emitByte((VEX_R << 7) \| (VEX_X << 6) \| (VEX_B << 5) \| (EVEX_R2 << 4) \|
VEX_5M,
OS);
emitByte((VEX_W << 7) \| (VEX_4V << 3) \| (EVEX_U << 2) \| VEX_PP, OS);
if (EncodeRC)
emitByte((EVEX_z << 7) \| (EVEX_rc << 5) \| (EVEX_b << 4) \| (EVEX_V2 << 3) \|
EVEX_aaa,
OS);
else
emitByte((EVEX_z << 7) \| (EVEX_L2 << 6) \| (VEX_L << 5) \| (EVEX_b << 4) \|
(EVEX_V2 << 3) \| EVEX_aaa,
OS);
}
}		}

/// Emit REX prefix which specifies		/// Emit REX prefix which specifies
/// 1) 64-bit instructions,		/// 1) 64-bit instructions,
/// 2) non-default operand size, and		/// 2) non-default operand size, and
/// 3) use of X86-64 extended registers.		/// 3) use of X86-64 extended registers.
///		///
/// \returns true if REX prefix is used, otherwise returns false.		/// \returns the used prefix (REX or None).
bool X86MCCodeEmitter::emitREXPrefix(int MemOperand, const MCInst &MI,		PrefixKind X86MCCodeEmitter::emitREXPrefix(int MemOperand, const MCInst &MI,
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
raw_ostream &OS) const {		raw_ostream &OS) const {
uint8_t REX = [&, MemOperand]() {		if (!STI.hasFeature(X86::Is64Bit))
uint8_t REX = 0;		return None;
		X86OpcodePrefixHelper Prefix(*Ctx.getRegisterInfo());
bool UsesHighByteReg = false;		bool UsesHighByteReg = false;

const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());
uint64_t TSFlags = Desc.TSFlags;		uint64_t TSFlags = Desc.TSFlags;
		Prefix.setW(TSFlags & X86II::REX_W);
if (TSFlags & X86II::REX_W)
REX \|= 1 << 3; // set REX.W

if (MI.getNumOperands() == 0)
return REX;

unsigned NumOps = MI.getNumOperands();		unsigned NumOps = MI.getNumOperands();
		if (!NumOps) {
		PrefixKind Kind = Prefix.determineOptimalKind();
		Prefix.emit(OS);
		return Kind;
		}
unsigned CurOp = X86II::getOperandBias(Desc);		unsigned CurOp = X86II::getOperandBias(Desc);

// If it accesses SPL, BPL, SIL, or DIL, then it requires a 0x40 REX prefix.
for (unsigned i = CurOp; i != NumOps; ++i) {		for (unsigned i = CurOp; i != NumOps; ++i) {
const MCOperand &MO = MI.getOperand(i);		const MCOperand &MO = MI.getOperand(i);
if (MO.isReg()) {		if (MO.isReg()) {
unsigned Reg = MO.getReg();		unsigned Reg = MO.getReg();
if (Reg == X86::AH \|\| Reg == X86::BH \|\| Reg == X86::CH \|\|		if (Reg == X86::AH \|\| Reg == X86::BH \|\| Reg == X86::CH \|\| Reg == X86::DH)
Reg == X86::DH)
UsesHighByteReg = true;		UsesHighByteReg = true;
		// If it accesses SPL, BPL, SIL, or DIL, then it requires a REX prefix.
if (X86II::isX86_64NonExtLowByteReg(Reg))		if (X86II::isX86_64NonExtLowByteReg(Reg))
// FIXME: The caller of determineREXPrefix slaps this prefix onto		Prefix.setLowerBound(REX);
// anything that returns non-zero.
REX \|= 0x40; // REX fixed encoding prefix
} else if (MO.isExpr() && STI.getTargetTriple().isX32()) {		} else if (MO.isExpr() && STI.getTargetTriple().isX32()) {
// GOTTPOFF and TLSDESC relocations require a REX prefix to allow		// GOTTPOFF and TLSDESC relocations require a REX prefix to allow
// linker optimizations: even if the instructions we see may not require		// linker optimizations: even if the instructions we see may not require
// any prefix, they may be replaced by instructions that do. This is		// any prefix, they may be replaced by instructions that do. This is
// handled as a special case here so that it also works for hand-written		// handled as a special case here so that it also works for hand-written
// assembly without the user needing to write REX, as with GNU as.		// assembly without the user needing to write REX, as with GNU as.
const auto *Ref = dyn_cast<MCSymbolRefExpr>(MO.getExpr());		const auto *Ref = dyn_cast<MCSymbolRefExpr>(MO.getExpr());
if (Ref && (Ref->getKind() == MCSymbolRefExpr::VK_GOTTPOFF \|\|		if (Ref && (Ref->getKind() == MCSymbolRefExpr::VK_GOTTPOFF \|\|
Ref->getKind() == MCSymbolRefExpr::VK_TLSDESC)) {		Ref->getKind() == MCSymbolRefExpr::VK_TLSDESC)) {
REX \|= 0x40; // REX fixed encoding prefix		Prefix.setLowerBound(REX);
}		}
}		}
}		}

switch (TSFlags & X86II::FormMask) {		switch (TSFlags & X86II::FormMask) {
case X86II::AddRegFrm:		case X86II::AddRegFrm:
REX \|= isREXExtendedReg(MI, CurOp++) << 0; // REX.B		Prefix.setB(MI, CurOp++);
break;		break;
case X86II::MRMSrcReg:		case X86II::MRMSrcReg:
case X86II::MRMSrcRegCC:		case X86II::MRMSrcRegCC:
REX \|= isREXExtendedReg(MI, CurOp++) << 2; // REX.R		Prefix.setR(MI, CurOp++);
REX \|= isREXExtendedReg(MI, CurOp++) << 0; // REX.B		Prefix.setB(MI, CurOp++);
break;		break;
case X86II::MRMSrcMem:		case X86II::MRMSrcMem:
case X86II::MRMSrcMemCC:		case X86II::MRMSrcMemCC:
REX \|= isREXExtendedReg(MI, CurOp++) << 2; // REX.R		Prefix.setR(MI, CurOp++);
REX \|= isREXExtendedReg(MI, MemOperand + X86::AddrBaseReg) << 0; // REX.B		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
REX \|= isREXExtendedReg(MI, MemOperand + X86::AddrIndexReg) << 1; // REX.X		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
CurOp += X86::AddrNumOperands;		CurOp += X86::AddrNumOperands;
break;		break;
case X86II::MRMDestReg:		case X86II::MRMDestReg:
REX \|= isREXExtendedReg(MI, CurOp++) << 0; // REX.B		Prefix.setB(MI, CurOp++);
REX \|= isREXExtendedReg(MI, CurOp++) << 2; // REX.R		Prefix.setR(MI, CurOp++);
break;		break;
case X86II::MRMDestMem:		case X86II::MRMDestMem:
REX \|= isREXExtendedReg(MI, MemOperand + X86::AddrBaseReg) << 0; // REX.B		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
REX \|= isREXExtendedReg(MI, MemOperand + X86::AddrIndexReg) << 1; // REX.X		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
CurOp += X86::AddrNumOperands;		CurOp += X86::AddrNumOperands;
REX \|= isREXExtendedReg(MI, CurOp++) << 2; // REX.R		Prefix.setR(MI, CurOp++);
break;		break;
case X86II::MRMXmCC:		case X86II::MRMXmCC:
case X86II::MRMXm:		case X86II::MRMXm:
case X86II::MRM0m:		case X86II::MRM0m:
case X86II::MRM1m:		case X86II::MRM1m:
case X86II::MRM2m:		case X86II::MRM2m:
case X86II::MRM3m:		case X86II::MRM3m:
case X86II::MRM4m:		case X86II::MRM4m:
case X86II::MRM5m:		case X86II::MRM5m:
case X86II::MRM6m:		case X86II::MRM6m:
case X86II::MRM7m:		case X86II::MRM7m:
REX \|= isREXExtendedReg(MI, MemOperand + X86::AddrBaseReg) << 0; // REX.B		Prefix.setB(MI, MemOperand + X86::AddrBaseReg);
REX \|= isREXExtendedReg(MI, MemOperand + X86::AddrIndexReg) << 1; // REX.X		Prefix.setX(MI, MemOperand + X86::AddrIndexReg);
break;		break;
case X86II::MRMXrCC:		case X86II::MRMXrCC:
case X86II::MRMXr:		case X86II::MRMXr:
case X86II::MRM0r:		case X86II::MRM0r:
case X86II::MRM1r:		case X86II::MRM1r:
case X86II::MRM2r:		case X86II::MRM2r:
case X86II::MRM3r:		case X86II::MRM3r:
case X86II::MRM4r:		case X86II::MRM4r:
case X86II::MRM5r:		case X86II::MRM5r:
case X86II::MRM6r:		case X86II::MRM6r:
case X86II::MRM7r:		case X86II::MRM7r:
REX \|= isREXExtendedReg(MI, CurOp++) << 0; // REX.B		Prefix.setB(MI, CurOp++);
break;		break;
case X86II::MRMr0:		case X86II::MRMr0:
REX \|= isREXExtendedReg(MI, CurOp++) << 2; // REX.R		Prefix.setR(MI, CurOp++);
break;		break;
case X86II::MRMDestMemFSIB:		case X86II::MRMDestMemFSIB:
llvm_unreachable("FSIB format never need REX prefix!");		llvm_unreachable("FSIB format never need REX prefix!");
}		}
if (REX && UsesHighByteReg)		PrefixKind Kind = Prefix.determineOptimalKind();
		if (Kind && UsesHighByteReg)
report_fatal_error(		report_fatal_error(
"Cannot encode high byte register in REX-prefixed instruction");		"Cannot encode high byte register in REX-prefixed instruction");
return REX;		Prefix.emit(OS);
}();		return Kind;

if (!REX)
return false;

emitByte(0x40 \| REX, OS);
return true;
}		}

/// Emit segment override opcode prefix as needed.		/// Emit segment override opcode prefix as needed.
void X86MCCodeEmitter::emitSegmentOverridePrefix(unsigned SegOperand,		void X86MCCodeEmitter::emitSegmentOverridePrefix(unsigned SegOperand,
const MCInst &MI,		const MCInst &MI,
raw_ostream &OS) const {		raw_ostream &OS) const {
// Check for explicit segment override on memory operand.		// Check for explicit segment override on memory operand.
if (unsigned Reg = MI.getOperand(SegOperand).getReg())		if (unsigned Reg = MI.getOperand(SegOperand).getReg())
emitByte(X86::getSegmentOverridePrefixForReg(Reg), OS);		emitByte(X86::getSegmentOverridePrefixForReg(Reg), OS);
}		}

/// Emit all instruction prefixes prior to the opcode.		/// Emit all instruction prefixes prior to the opcode.
///		///
/// \param MemOperand the operand # of the start of a memory operand if present.		/// \param MemOperand the operand # of the start of a memory operand if present.
/// If not present, it is -1.		/// If not present, it is -1.
///		///
/// \returns true if REX prefix is used, otherwise returns false.		/// \returns the used prefix (REX or None).
bool X86MCCodeEmitter::emitOpcodePrefix(int MemOperand, const MCInst &MI,		PrefixKind X86MCCodeEmitter::emitOpcodePrefix(int MemOperand, const MCInst &MI,
		craig.topperUnsubmitted Done Reply Inline Actions I don't know that it makes sense to make returning the bool for `HasREX` worse by spreading it to more functions. None of the other prefixes are needed and I'm not sure there's sufficient evidence they ever will be. And if they were needed would what type of prefix be enough information? A `bool &HasREX` passed to emitOpcodePrefix and emitREXPrefix feels cleaner to me. I never liked the returned bool for this. craig.topper: I don't know that it makes sense to make returning the bool for `HasREX` worse by spreading it…
		skanAuthorUnsubmitted Done Reply Inline Actions I know a little about it. When X86 introduces a bunch of new instructions, it usually extends the prefix. For example, when moved from ia32 to ia32e, REX was defined. Similarly, when we moved from SSE to AVX, VEX prefix was defined. And when we introduce new instructions, a new relocation may be needed. Here is the example for REX https://groups.google.com/g/x86-64-abi/c/n9AWHogmVY0 Linker can do relocation optimization based on the relocation. From my understanding, if no new relocation were not added for the new instructions, the optimization could be done in an incorrect way silently. Only a bool `HasREX` was defined here b/c the current interested instructions are only REX-encoded. But we should allow more possibility here. And returning a enum here is almost as cheap as bool. skan: I know a little about it. When X86 introduces a bunch of new instructions, it usually extends…
		skanAuthorUnsubmitted Done Reply Inline Actions Got your idea. Let me to pass the enum by reference rather than a return value. skan: Got your idea. Let me to pass the enum by reference rather than a return value.
		craig.topperUnsubmitted Done Reply Inline Actions REX is older than the VEX, XOP, EVEX prefixes so it doesn’t seem common to need the new relocation. It was cheaper to compute I’d say we could determine if it has REX in the fixup code but we have to inspect multiple operands. If we need to know about a prefix for a future location that one might be cheaper to compute. This feels like we’re trying to solve a future problem that might never exist. Is it really worth it? craig.topper: REX is older than the VEX, XOP, EVEX prefixes so it doesn’t seem common to need the new…
		craig.topperUnsubmitted Done Reply Inline Actions relocation craig.topper:* *relocation
		craig.topperUnsubmitted Done Reply Inline Actions It’s also hard to add new relocations anyway since you would need a new linker that understands it. Or a command line to opt into it. craig.topper: It’s also hard to add new relocations anyway since you would need a new linker that understands…
		skanAuthorUnsubmitted Done Reply Inline Actions Yes, I was talking about that if we know more info about prefix, we can easily extend the relocation type in `X86MCCodeEmitter::emitMemModRMByte`. skan: Yes, I was talking about that if we know more info about prefix, we can easily extend the…
		skanAuthorUnsubmitted Done Reply Inline Actions I think it's worthy b/c we uniform the behavior of functions `emitPrefix`, and at the same time, there is almost no extra cost. skan: I think it's worthy b/c we uniform the behavior of functions `emitPrefix`, and at the same…
		skanAuthorUnsubmitted Done Reply Inline Actions I just gave a try. But passing the `enum` as argument increases the length of arglists for four functions: `emitREXPrefix`, `emitVEXOpcodePrefix`, `emitOpcodePrefix` and `emitPrefixImpl` longer, which made me nervous. I don't even know which order I should use for the parameters for the functions. I remembered that we once refined the code in this file to reduce the parameters, so I prefer the "return value" version. AFAICS, whether the prefix is REX, VEX, or EVEX should be enough information for the other code in `emitInstructiion`. skan: I just gave a try. But passing the `enum` as argument increases the length of arglists for…
const MCSubtargetInfo &STI,		const MCSubtargetInfo &STI,
raw_ostream &OS) const {		raw_ostream &OS) const {
const MCInstrDesc &Desc = MCII.get(MI.getOpcode());		const MCInstrDesc &Desc = MCII.get(MI.getOpcode());
uint64_t TSFlags = Desc.TSFlags;		uint64_t TSFlags = Desc.TSFlags;

// Emit the operand size opcode prefix as needed.		// Emit the operand size opcode prefix as needed.
if ((TSFlags & X86II::OpSizeMask) ==		if ((TSFlags & X86II::OpSizeMask) ==
(STI.hasFeature(X86::Is16Bit) ? X86II::OpSize32 : X86II::OpSize16))		(STI.hasFeature(X86::Is16Bit) ? X86II::OpSize32 : X86II::OpSize16))
Show All 17 Lines	PrefixKind X86MCCodeEmitter::emitOpcodePrefix(int MemOperand, const MCInst &MI,
case X86II::XD: // F2		case X86II::XD: // F2
emitByte(0xF2, OS);		emitByte(0xF2, OS);
break;		break;
}		}

// Handle REX prefix.		// Handle REX prefix.
assert((STI.hasFeature(X86::Is64Bit) \|\| !(TSFlags & X86II::REX_W)) &&		assert((STI.hasFeature(X86::Is64Bit) \|\| !(TSFlags & X86II::REX_W)) &&
"REX.W requires 64bit mode.");		"REX.W requires 64bit mode.");
bool HasREX = STI.hasFeature(X86::Is64Bit)		PrefixKind Kind = emitREXPrefix(MemOperand, MI, STI, OS);
? emitREXPrefix(MemOperand, MI, STI, OS)
: false;

// 0x0F escape code must be emitted just before the opcode.		// 0x0F escape code must be emitted just before the opcode.
switch (TSFlags & X86II::OpMapMask) {		switch (TSFlags & X86II::OpMapMask) {
case X86II::TB: // Two-byte opcode map		case X86II::TB: // Two-byte opcode map
case X86II::T8: // 0F 38		case X86II::T8: // 0F 38
case X86II::TA: // 0F 3A		case X86II::TA: // 0F 3A
case X86II::ThreeDNow: // 0F 0F, second 0F emitted by caller.		case X86II::ThreeDNow: // 0F 0F, second 0F emitted by caller.
emitByte(0x0F, OS);		emitByte(0x0F, OS);
break;		break;
}		}

switch (TSFlags & X86II::OpMapMask) {		switch (TSFlags & X86II::OpMapMask) {
case X86II::T8: // 0F 38		case X86II::T8: // 0F 38
emitByte(0x38, OS);		emitByte(0x38, OS);
break;		break;
case X86II::TA: // 0F 3A		case X86II::TA: // 0F 3A
emitByte(0x3A, OS);		emitByte(0x3A, OS);
break;		break;
}		}

return HasREX;		return Kind;
}		}

void X86MCCodeEmitter::emitPrefix(const MCInst &MI, raw_ostream &OS,		void X86MCCodeEmitter::emitPrefix(const MCInst &MI, raw_ostream &OS,
const MCSubtargetInfo &STI) const {		const MCSubtargetInfo &STI) const {
unsigned Opcode = MI.getOpcode();		unsigned Opcode = MI.getOpcode();
const MCInstrDesc &Desc = MCII.get(Opcode);		const MCInstrDesc &Desc = MCII.get(Opcode);
uint64_t TSFlags = Desc.TSFlags;		uint64_t TSFlags = Desc.TSFlags;

Show All 17 Lines	void X86MCCodeEmitter::encodeInstruction(const MCInst &MI, raw_ostream &OS,
if (X86II::isPseudo(TSFlags))		if (X86II::isPseudo(TSFlags))
return;		return;

unsigned NumOps = Desc.getNumOperands();		unsigned NumOps = Desc.getNumOperands();
unsigned CurOp = X86II::getOperandBias(Desc);		unsigned CurOp = X86II::getOperandBias(Desc);

uint64_t StartByte = OS.tell();		uint64_t StartByte = OS.tell();

bool HasREX = emitPrefixImpl(CurOp, MI, STI, OS);		PrefixKind Kind = emitPrefixImpl(CurOp, MI, STI, OS);

// It uses the VEX.VVVV field?		// It uses the VEX.VVVV field?
bool HasVEX_4V = TSFlags & X86II::VEX_4V;		bool HasVEX_4V = TSFlags & X86II::VEX_4V;
bool HasVEX_I8Reg = (TSFlags & X86II::ImmMask) == X86II::Imm8Reg;		bool HasVEX_I8Reg = (TSFlags & X86II::ImmMask) == X86II::Imm8Reg;

// It uses the EVEX.aaa field?		// It uses the EVEX.aaa field?
bool HasEVEX_K = TSFlags & X86II::EVEX_K;		bool HasEVEX_K = TSFlags & X86II::EVEX_K;
bool HasEVEX_RC = TSFlags & X86II::EVEX_RC;		bool HasEVEX_RC = TSFlags & X86II::EVEX_RC;
▲ Show 20 Lines • Show All 82 Lines • ▼ Show 20 Lines	case X86II::MRMDestReg: {
CurOp = SrcRegNum + 1;		CurOp = SrcRegNum + 1;
break;		break;
}		}
case X86II::MRMDestMem4VOp3CC: {		case X86II::MRMDestMem4VOp3CC: {
unsigned CC = MI.getOperand(8).getImm();		unsigned CC = MI.getOperand(8).getImm();
emitByte(BaseOpcode + CC, OS);		emitByte(BaseOpcode + CC, OS);
unsigned SrcRegNum = CurOp + X86::AddrNumOperands;		unsigned SrcRegNum = CurOp + X86::AddrNumOperands;
emitMemModRMByte(MI, CurOp + 1, getX86RegNum(MI.getOperand(0)), TSFlags,		emitMemModRMByte(MI, CurOp + 1, getX86RegNum(MI.getOperand(0)), TSFlags,
HasREX, StartByte, OS, Fixups, STI, false);		Kind, StartByte, OS, Fixups, STI, false);
CurOp = SrcRegNum + 3; // skip reg, VEX_V4 and CC		CurOp = SrcRegNum + 3; // skip reg, VEX_V4 and CC
break;		break;
}		}
case X86II::MRMDestMemFSIB:		case X86II::MRMDestMemFSIB:
case X86II::MRMDestMem: {		case X86II::MRMDestMem: {
emitByte(BaseOpcode, OS);		emitByte(BaseOpcode, OS);
unsigned SrcRegNum = CurOp + X86::AddrNumOperands;		unsigned SrcRegNum = CurOp + X86::AddrNumOperands;

if (HasEVEX_K) // Skip writemask		if (HasEVEX_K) // Skip writemask
++SrcRegNum;		++SrcRegNum;

if (HasVEX_4V) // Skip 1st src (which is encoded in VEX_VVVV)		if (HasVEX_4V) // Skip 1st src (which is encoded in VEX_VVVV)
++SrcRegNum;		++SrcRegNum;

bool ForceSIB = (Form == X86II::MRMDestMemFSIB);		bool ForceSIB = (Form == X86II::MRMDestMemFSIB);
emitMemModRMByte(MI, CurOp, getX86RegNum(MI.getOperand(SrcRegNum)), TSFlags,		emitMemModRMByte(MI, CurOp, getX86RegNum(MI.getOperand(SrcRegNum)), TSFlags,
HasREX, StartByte, OS, Fixups, STI, ForceSIB);		Kind, StartByte, OS, Fixups, STI, ForceSIB);
CurOp = SrcRegNum + 1;		CurOp = SrcRegNum + 1;
break;		break;
}		}
case X86II::MRMSrcReg: {		case X86II::MRMSrcReg: {
emitByte(BaseOpcode, OS);		emitByte(BaseOpcode, OS);
unsigned SrcRegNum = CurOp + 1;		unsigned SrcRegNum = CurOp + 1;

if (HasEVEX_K) // Skip writemask		if (HasEVEX_K) // Skip writemask
▲ Show 20 Lines • Show All 58 Lines • ▼ Show 20 Lines	case X86II::MRMSrcMem: {

if (HasVEX_4V)		if (HasVEX_4V)
++FirstMemOp; // Skip the register source (which is encoded in VEX_VVVV).		++FirstMemOp; // Skip the register source (which is encoded in VEX_VVVV).

emitByte(BaseOpcode, OS);		emitByte(BaseOpcode, OS);

bool ForceSIB = (Form == X86II::MRMSrcMemFSIB);		bool ForceSIB = (Form == X86II::MRMSrcMemFSIB);
emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(CurOp)),		emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(CurOp)),
TSFlags, HasREX, StartByte, OS, Fixups, STI, ForceSIB);		TSFlags, Kind, StartByte, OS, Fixups, STI, ForceSIB);
CurOp = FirstMemOp + X86::AddrNumOperands;		CurOp = FirstMemOp + X86::AddrNumOperands;
if (HasVEX_I8Reg)		if (HasVEX_I8Reg)
I8RegNum = getX86RegEncoding(MI, CurOp++);		I8RegNum = getX86RegEncoding(MI, CurOp++);
break;		break;
}		}
case X86II::MRMSrcMem4VOp3: {		case X86II::MRMSrcMem4VOp3: {
unsigned FirstMemOp = CurOp + 1;		unsigned FirstMemOp = CurOp + 1;

emitByte(BaseOpcode, OS);		emitByte(BaseOpcode, OS);

emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(CurOp)),		emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(CurOp)),
TSFlags, HasREX, StartByte, OS, Fixups, STI);		TSFlags, Kind, StartByte, OS, Fixups, STI);
CurOp = FirstMemOp + X86::AddrNumOperands;		CurOp = FirstMemOp + X86::AddrNumOperands;
++CurOp; // Encoded in VEX.VVVV.		++CurOp; // Encoded in VEX.VVVV.
break;		break;
}		}
case X86II::MRMSrcMemOp4: {		case X86II::MRMSrcMemOp4: {
unsigned FirstMemOp = CurOp + 1;		unsigned FirstMemOp = CurOp + 1;

++FirstMemOp; // Skip the register source (which is encoded in VEX_VVVV).		++FirstMemOp; // Skip the register source (which is encoded in VEX_VVVV).

// Capture second register source (encoded in Imm[7:4])		// Capture second register source (encoded in Imm[7:4])
assert(HasVEX_I8Reg && "MRMSrcRegOp4 should imply VEX_I8Reg");		assert(HasVEX_I8Reg && "MRMSrcRegOp4 should imply VEX_I8Reg");
I8RegNum = getX86RegEncoding(MI, FirstMemOp++);		I8RegNum = getX86RegEncoding(MI, FirstMemOp++);

emitByte(BaseOpcode, OS);		emitByte(BaseOpcode, OS);

emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(CurOp)),		emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(CurOp)),
TSFlags, HasREX, StartByte, OS, Fixups, STI);		TSFlags, Kind, StartByte, OS, Fixups, STI);
CurOp = FirstMemOp + X86::AddrNumOperands;		CurOp = FirstMemOp + X86::AddrNumOperands;
break;		break;
}		}
case X86II::MRMSrcMemCC: {		case X86II::MRMSrcMemCC: {
unsigned RegOp = CurOp++;		unsigned RegOp = CurOp++;
unsigned FirstMemOp = CurOp;		unsigned FirstMemOp = CurOp;
CurOp = FirstMemOp + X86::AddrNumOperands;		CurOp = FirstMemOp + X86::AddrNumOperands;

unsigned CC = MI.getOperand(CurOp++).getImm();		unsigned CC = MI.getOperand(CurOp++).getImm();
emitByte(BaseOpcode + CC, OS);		emitByte(BaseOpcode + CC, OS);

emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(RegOp)),		emitMemModRMByte(MI, FirstMemOp, getX86RegNum(MI.getOperand(RegOp)),
TSFlags, HasREX, StartByte, OS, Fixups, STI);		TSFlags, Kind, StartByte, OS, Fixups, STI);
break;		break;
}		}

case X86II::MRMXrCC: {		case X86II::MRMXrCC: {
unsigned RegOp = CurOp++;		unsigned RegOp = CurOp++;

unsigned CC = MI.getOperand(CurOp++).getImm();		unsigned CC = MI.getOperand(CurOp++).getImm();
emitByte(BaseOpcode + CC, OS);		emitByte(BaseOpcode + CC, OS);
Show All 25 Lines	void X86MCCodeEmitter::encodeInstruction(const MCInst &MI, raw_ostream &OS,

case X86II::MRMXmCC: {		case X86II::MRMXmCC: {
unsigned FirstMemOp = CurOp;		unsigned FirstMemOp = CurOp;
CurOp = FirstMemOp + X86::AddrNumOperands;		CurOp = FirstMemOp + X86::AddrNumOperands;

unsigned CC = MI.getOperand(CurOp++).getImm();		unsigned CC = MI.getOperand(CurOp++).getImm();
emitByte(BaseOpcode + CC, OS);		emitByte(BaseOpcode + CC, OS);

emitMemModRMByte(MI, FirstMemOp, 0, TSFlags, HasREX, StartByte, OS, Fixups,		emitMemModRMByte(MI, FirstMemOp, 0, TSFlags, Kind, StartByte, OS, Fixups,
STI);		STI);
break;		break;
}		}

case X86II::MRMXm:		case X86II::MRMXm:
case X86II::MRM0m:		case X86II::MRM0m:
case X86II::MRM1m:		case X86II::MRM1m:
case X86II::MRM2m:		case X86II::MRM2m:
case X86II::MRM3m:		case X86II::MRM3m:
case X86II::MRM4m:		case X86II::MRM4m:
case X86II::MRM5m:		case X86II::MRM5m:
case X86II::MRM6m:		case X86II::MRM6m:
case X86II::MRM7m:		case X86II::MRM7m:
if (HasVEX_4V) // Skip the register dst (which is encoded in VEX_VVVV).		if (HasVEX_4V) // Skip the register dst (which is encoded in VEX_VVVV).
++CurOp;		++CurOp;
if (HasEVEX_K) // Skip writemask		if (HasEVEX_K) // Skip writemask
++CurOp;		++CurOp;
emitByte(BaseOpcode, OS);		emitByte(BaseOpcode, OS);
emitMemModRMByte(MI, CurOp,		emitMemModRMByte(MI, CurOp,
(Form == X86II::MRMXm) ? 0 : Form - X86II::MRM0m, TSFlags,		(Form == X86II::MRMXm) ? 0 : Form - X86II::MRM0m, TSFlags,
HasREX, StartByte, OS, Fixups, STI);		Kind, StartByte, OS, Fixups, STI);
CurOp += X86::AddrNumOperands;		CurOp += X86::AddrNumOperands;
break;		break;

case X86II::MRM0X:		case X86II::MRM0X:
case X86II::MRM1X:		case X86II::MRM1X:
case X86II::MRM2X:		case X86II::MRM2X:
case X86II::MRM3X:		case X86II::MRM3X:
case X86II::MRM4X:		case X86II::MRM4X:
▲ Show 20 Lines • Show All 119 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[X86][MC][NFC] Refine code in X86MCCodeEmitter.cpp about opcode prefixClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 496313

llvm/lib/Target/X86/MCTargetDesc/X86MCCodeEmitter.cpp

[X86][MC][NFC] Refine code in X86MCCodeEmitter.cpp about opcode prefix
ClosedPublic