This is an archive of the discontinued LLVM Phabricator instance.

This broke test/MC/WebAssembly/global-ctor-dtor.ll in the waterfall. I checked the output but don't have any idea why it's causing differences in relocation and function information (I'm not familiar with the linker). Do you know why?

I saw the build failures too, when I was trying to land some patches today. I feel I can't land until the tests pass, even if the failures weren't caused by me.

This commit is causing the function body to change from:

from: 024041818080800041004180808080001081808080000D000F0B00000B
to:   02C00041818080800041004180808080001081808080000D000F0B00000B

That corresponds to block = 0x02 <blocktype> where blocktype has changed from being 0x40 (which was correct) to 0xC000 which is incorrect - and is -0x40 in SLEB encoding.

There's something fishy in this code even to begin with - the old code had ExprType::Void = -0x40 when the value to be written out is +0x40, and yet ExprType::I32 = -0x01 was used to write out a value of +0x7F. Somehow the encoding was treating void differently!?!

In the new code, it's been changed to ExprType::Void = +0x40 (corresponding to the byte value that's correct) and ExprType::I32 = +0x7F (again the same as the byte that's to be written out).

We could just revert this commit, or maybe spend another half hour trying to understand how on earth the old code was working...

OK, I get it. The writer is writing out operands as SLEB128 still, not as uint8_t.

So the old code had Void = -0x40, which encodes in SLEB to 0x40, and had I32 = -0x01 which encodes in SLEB to 0x7F.

This patch changed the values to their actual byte values, but they'll be mangled when they go through the SLEB operand writer (WebAssemblyMCCodeEmitter, branch for Info.OperandType == WebAssembly::OPERAND_SIGNATURE).

In D43991#1025115, @aheejin wrote:

This broke test/MC/WebAssembly/global-ctor-dtor.ll in the waterfall. I checked the output but don't have any idea why it's causing differences in relocation and function information (I'm not familiar with the linker). Do you know why?

The relocs were changing because the operand went from 0x40 (correct value) to 0xC000 (SLEB encoding of 0x40). Because the length of the sequence changed, the addresses of all the relocs were shifted.

For now, I'll revert the bad commit; if you want to re-apply it, you'll have to apply this change with it, so that SLEB encoding isn't applied to the uint8_t bytes:

diff --git a/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCCodeEmitter.cpp b/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCCodeEmitter.cpp
index 77744e53d62..0d35806229a 100644
--- a/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCCodeEmitter.cpp
+++ b/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCCodeEmitter.cpp
@@ -93,7 +93,9 @@ void WebAssemblyMCCodeEmitter::encodeInstruction(
         } else if (Info.OperandType == WebAssembly::OPERAND_GLOBAL) {
           llvm_unreachable("wasm globals should only be accessed symbolicly");
         } else if (Info.OperandType == WebAssembly::OPERAND_SIGNATURE) {
-          encodeSLEB128(int64_t(MO.getImm()), OS);
+          assert(MO.getImm() > 0 && (MO.getImm() & ~0x3f) == 0x40 &&
+                 "Signature must be pre-encoded negative single-byte SLEB");
+          OS << uint8_t(MO.getImm());
         } else {
           encodeULEB128(uint64_t(MO.getImm()), OS);
         }

aheejin mentioned this in D44034: Reland "[WebAssembly] More uses of uint8_t for single byte values".Mar 2 2018, 11:18 AM

Thank you so much for looking into this! Resubmitted in D44034.

aheejin mentioned this in rL326614: Reland "[WebAssembly] More uses of uint8_t for single byte values".Mar 2 2018, 12:55 PM

aheejin mentioned this in D56092: [WebAssembly] made assembler parse block_type.Jan 1 2019, 11:05 PM

Revision Contents

Path

Size

llvm/

trunk/

lib/

Target/

WebAssembly/

MCTargetDesc/

WebAssemblyMCTargetDesc.h

26 lines

Diff 136679

llvm/trunk/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCTargetDesc.h

	Show First 20 Lines • Show All 152 Lines • ▼ Show 20 Lines
	static const unsigned LoadAddressOperandNo = 3;			static const unsigned LoadAddressOperandNo = 3;
	static const unsigned StoreAddressOperandNo = 2;			static const unsigned StoreAddressOperandNo = 2;

	/// The operand number of the load or store p2align in load/store instructions.			/// The operand number of the load or store p2align in load/store instructions.
	static const unsigned LoadP2AlignOperandNo = 1;			static const unsigned LoadP2AlignOperandNo = 1;
	static const unsigned StoreP2AlignOperandNo = 0;			static const unsigned StoreP2AlignOperandNo = 0;

	/// This is used to indicate block signatures.			/// This is used to indicate block signatures.
	enum class ExprType {			enum class ExprType : unsigned {
	Void = -0x40,			Void = 0x40,
	I32 = -0x01,			I32 = 0x7F,
	I64 = -0x02,			I64 = 0x7E,
	F32 = -0x03,			F32 = 0x7D,
	F64 = -0x04,			F64 = 0x7C,
	I8x16 = -0x05,			I8x16 = 0x7B,
	I16x8 = -0x06,			I16x8 = 0x7A,
	I32x4 = -0x07,			I32x4 = 0x79,
	F32x4 = -0x08,			F32x4 = 0x78,
	B8x16 = -0x09,			B8x16 = 0x77,
	B16x8 = -0x0a,			B16x8 = 0x76,
	B32x4 = -0x0b			B32x4 = 0x75
	};			};

	/// Instruction opcodes emitted via means other than CodeGen.			/// Instruction opcodes emitted via means other than CodeGen.
	static const unsigned Nop = 0x01;			static const unsigned Nop = 0x01;
	static const unsigned End = 0x0b;			static const unsigned End = 0x0b;

	wasm::ValType toValType(const MVT &Ty);			wasm::ValType toValType(const MVT &Ty);

	} // end namespace WebAssembly			} // end namespace WebAssembly
	} // end namespace llvm			} // end namespace llvm

	#endif			#endif

This is an archive of the discontinued LLVM Phabricator instance.

[WebAssembly] More uses of uint8_t for single byte valuesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 136679

llvm/trunk/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCTargetDesc.h

[WebAssembly] More uses of uint8_t for single byte values
ClosedPublic