Renames ExprType to the more apt BlockType and adds a variant for
multivalue blocks. Currently non-void blocks are only generated at the
end of functions where the block return type needs to agree with the
function return type, and that remains true for multivalue
blocks. That invariant means that the actual signature does not need
to be stored in the block signature MachineOperand because it can be
inferred by WebAssemblyMCInstLower from the return type of the
parent function. WebAssemblyMCInstLower continues to lower block
signature operands to immediates when possible but lowers multivalue
signatures to function type symbols. The AsmParser and Disassembler
are updated to handle multivalue block types as well.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
- Build Status
Buildable 39589 Build 39618: arc lint + arc unit
Event Timeline
llvm/tools/llvm-mc/llvm-mc.cpp | ||
---|---|---|
520 | This change to remove the context creation seems seperate. If so can you split it out? That was this change can stay wasm specific. |
llvm/tools/llvm-mc/llvm-mc.cpp | ||
---|---|---|
520 | It's not separate, unfortunately. In order to create a wasm symbol from the disassembler we need a bunch of target information that this target contains that the old one did not have. |
llvm/tools/llvm-mc/llvm-mc.cpp | ||
---|---|---|
520 | I am happy to reconsider the wisdom of creating a symbol in the disassembler, though. |
llvm/tools/llvm-mc/llvm-mc.cpp | ||
---|---|---|
520 | I was confused because the existing disassembly already needs a context and even creates one saying: // Set up the MCContext for creating symbols and MCExpr's.. So that approach seems justified and consistent with the existing code. I'm not clear why it wasn't done this way to begin with. |
llvm/tools/llvm-mc/llvm-mc.cpp | ||
---|---|---|
520 | Yeah it was surprising that the previous code didn't work. With the previous code, asking the context for a symbol returned a generic MCSymbol. With the new code it returns a MCSymbolWasm, which we need to be able to set the symbol type to wasm::WASM_SYMBOL_TYPE_FUNCTION. |
llvm/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCTargetDesc.h | ||
---|---|---|
135 | The invariant that only fallthrough-return blocks are allowed to be multivalue should probably be restated here. |
llvm/lib/Target/WebAssembly/MCTargetDesc/WebAssemblyMCTargetDesc.h | ||
---|---|---|
135 | Right, it's the last block and the last block within that block recursively. In fact this only happens with explicit returns inside the blocks (or blocks that are otherwise never exited) because blocks that could be fallthrough return blocks instead set their results to a local then have a local.get in the return position. I will expand this comment to reiterate this invariant. I only added one additional test for this case because the algorithm for determining when to set block types was not changed and is already tested in cfg-stackify.ll. |
Nice! Mostly LGTM.
Currently non-void blocks are only generated at the end of functions where the block return type needs to agree with the function return type, and that remains true for multivalue blocks. That invariant means that the actual signature does not need to be stored in the block signature MachineOperand because it can be inferred by WebAssemblyMCInstLower from the return type of the parent function.
I guess this is a tentative state before you implement the rest of the proposal in full, right? If other blocks are able to return multivalue, are we gonna change their operands to also take typeindex?
llvm/lib/Target/WebAssembly/Disassembler/WebAssemblyDisassembler.cpp | ||
---|---|---|
224 | What are septet values, and when is Val negative? All values of BlockType look unsigned. Maybe I'm missing something..? | |
227 | When does this happen? | |
239 | Disassembler is not going to print multivalue signatures then? Is this tentative or permanent? | |
llvm/lib/Target/WebAssembly/WebAssemblyMCInstLower.cpp | ||
203 | Nit: Do we need llvm:: here? |
llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp | ||
---|---|---|
1248 | It's preexisting, but I think we should add END_TRY here too. Not sure if I can generate a test case that ends with end_try returning something easily though.. |
llvm/lib/Target/WebAssembly/Disassembler/WebAssemblyDisassembler.cpp | ||
---|---|---|
224 | See https://webassembly.github.io/multi-value/core/binary/instructions.html#control-instructions for details on the encoding. By septet value I just mean a group of 7 bits. That's where the & 0x7f comes from. | |
227 | See wasm-error.txt for an example. Basically anytime you have a negative SLEB128 value here that occupies more than one byte, which is invalid according to the spec. | |
239 | I don't think there's any way to access the type section from the disassembler, so this is permanent unless there's a large overhaul. cc @aardappel for the details. | |
llvm/lib/Target/WebAssembly/WebAssemblyCFGStackify.cpp | ||
1248 | Sounds good. |
I don't think we have to change this, or if we do it could be considered a separate project. Up to now we have been ok not using the ability for blocks to produce values, so I don't see a pressing need to change that. When we do work on multivalue blocks, it will be more important to have them in Binaryen first.
llvm/lib/Target/WebAssembly/Disassembler/WebAssemblyDisassembler.cpp | ||
---|---|---|
233 | Is this necessary? A disassembler is really just a function that turns a range of bytes into a string, it having a "side effect" of creating symbols seems odd to me. |
What are septet values, and when is Val negative? All values of BlockType look unsigned. Maybe I'm missing something..?