In the WebAssembly target, we would like to allow alloca in two address
spaces. The alloca instruction already has an address space argument,
but the verifier asserts that the address space of an alloca is the
default alloca address space from the datalayout. This patch removes
this restriction. Targets that would like to impose additional
restrictions should do so via target-specific verification passes.
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Cc arsenm, as he added the address space restriction for alloca when he added address space support for alloca in https://github.com/llvm/llvm-project/commit/3c1fc768ed6cf2b01463df036ae6ae3b1f0de632.
Pre-merge checks show that https://github.com/llvm/llvm-project/blob/main/llvm/test/CodeGen/AMDGPU/invalid-alloca.ll fails; will need to follow up in a similar way.
The code looks fine, but explicitly requesting a review from @arsenm because I don't know the history of this restriction or how this affects other targets.
@wingo, have you investigated what would happen if we tried to use the normal stack address space in IR, but extracted reftype stack slots into a separate stack in the WebAssembly backend, perhaps specifically in WebAssemblyFrameLowering::emit{Prologue,Epilogue}?
I did envision this being useful, however there are a few points of concern:
- This does remove the safety check for the common case of creating an alloca without checking the datalayout's default alloca address space
- How is this going to be represented in codegen? In SelectionDAG the pointer type is immediately discarded and frame indexes don't have a tracked address space
I posted https://reviews.llvm.org/D101140 just for context -- based on this patch, and @pmatos's work on https://reviews.llvm.org/D95425.
Humm, good question. The issue as I understand it would be that letting reference-typed values slide all the way to frame lowering would be what to do with uses of the alloca at instruction-selection time. For i32 locals, isel lowersa (set i32:$dst (load frameindex:$idx)) to i32.load, but what would we do for externref values?
But if we took the same solution as in my downstream patch, maybe keeping the alloca in AS 0 might with, where the ElementType is our AS 1 externref representation. Though, I do wonder what risk we might be introducing with regards to optimizations / transformations thinking that the pointer has a size and can be written to linear memory.
Thanks for your time, arsenm.
Yeah I can see why you might want this. If we want to keep the footgun-protection, one way would be to allow a target to explicitly opt out of the alloca address-space check. Something like:
target datalayout = "A1*"
to indicate that this target uses AS 1 by default for alloca, but that it can use others. Then we'd adapt the Verifier check to see if this datalayout allows alloca in other address spaces. WDYT?
- How is this going to be represented in codegen? In SelectionDAG the pointer type is immediately discarded and frame indexes don't have a tracked address space
We override TargetLowering::getPointerTy(DataLayout&, unsigned AS) to return a different MVT for these address spaces. We then have custom lowering for ISD::LOAD / ISD::STORE for those special MVTs.
Just for context, on the wasm target, each function has two stack frames: one managed by the webassembly run-time, populated by named locals, and not addressable; and one in memory managed as on conventional targets, sharing an address space with data and heap. Clang produces IR where locals start life as alloca's as usual, and as usual the optimizer lifts most of them to SSA values in SROA. SSA values end up lowering to the "named" part of the stack. Those locals still left as allocas are put on linear memory. This patch is motivated by ongoing work to add support for a new kind of value that can't be written to linear memory. We are currently representing these values as opaque pointers in non-default address spaces, and have some special lowering hooks for these. We also use some special address spaces to denote restricted pointers to these values.
Instead of globally relaxing the restriction on alloca address spaces,
allow targets to choose whether to impose the restriction via the
datalayout.
It hadn't occurred to me to add a field to the datalayout purely for the verifier. That seems a bit strange, since it's not exactly a target property.
What is the plan to use this in codegen?
llvm/docs/LangRef.rst | ||
---|---|---|
9566 ↗ | (On Diff #339939) | c/designed/designated/? |
Yeah understood. If we want some targets to have the check and some to not have the check, do you know of a better place to express this? I thought the same place that the target's alloca address space is defined might not be unreasonable, but I am new here :)
They reach the MachineFrameInfo, but only ephemerally. At the beginning of isel, the static alloca's get collected and the MachineFrameInfo populated with abstract objects and frame indexes. Then when isel sees a load / store to these objects, the first load/store of the alloca causes it to be lifted out from the MachineFrameInfo and instead allocates it on the target-specific stack (as a WebAssembly local), and the object in the MachineFrameInfo is marked dead. Loads and stores to these objects lower to accesses to the parallel stack instead of to objects in MachineFrameInfo.
Obviously this doesn't allow for by-reference access to these allocations. This is OK for the WebAssembly target because these opaque types have source-level restrictions similar to the ones in place for ARM's SVE values -- see the ARM C Language Extensions (ACLE), §3.2.1 https://developer.arm.com/documentation/100987/0000.
I think we should have target specific IR verifiers, not pollute the datalayout. There are a number of other places such a verifier would be useful
Sure but this would be a much larger design effort. As I see it I have three options right now:
- Remove the restriction imposed on alloca with an explicitly specified address space
- Keep the restriction but allow datalayout to specify multiple valid alloca address spaces (this patch)
- Add a target-specific IR verifier somehow. It couldn't extend Verifier because Verifier is private; would have to be a separate pass. However surely in this case we'd remove the restriction on alloca address spaces in the core verifier.
I am leaning back towards (1), wdyt @arsenm ?
Go back to first version of patch, removing verifier restriction on alloca address spaces
llvm/docs/LangRef.rst | ||
---|---|---|
9574 ↗ | (On Diff #346178) | I would rephrase this instead of dropping the statement altogether. Without target knowledge, any generic code introducing an alloca should be using the alloca address space in the datalayout |
llvm/docs/LangRef.rst | ||
---|---|---|
9574 ↗ | (On Diff #346178) | Thanks for feedback; updated. I was unsure how precisely to say "alloca outside the stack address space may be meaningless", given that this is the langref; thoughts welcome if there is a better way to say this. |