This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/CodeGen/
-
llvm/
-
CodeGen/
-
ISDOpcodes.h
-
SelectionDAGISel.h
-
lib/CodeGen/SelectionDAG/
-
CodeGen/
-
SelectionDAG/
2/3
LegalizeFloatTypes.cpp
4/14
LegalizeIntegerTypes.cpp
-
LegalizeTypes.h
1
SelectionDAGBuilder.cpp
2
SelectionDAGDumper.cpp
7/18
SelectionDAGISel.cpp
-
test/CodeGen/X86/
-
CodeGen/
-
X86/
7/13
selectiondag-stackmap-legalize.ll

Differential D125680

Correctly legalise stackmap operands
ClosedPublic

Authored by vext01 on May 16 2022, 4:25 AM.

Download Raw Diff

Details

Reviewers

nikic
sepavloff
arsenm
reames
skatkov
dantrushin

Summary

Hi everyone,

This is my first real contribution to LLVM, so please be gentle!

For context, we are writing a JIT that will require LLVM's stackmap facility in order to reconstruct the native stack when deoptimising from specialised JITted code back into generic AOT-compiled code.

During my experimentation with stackmaps, I've identified a few problems that I hope to be able to fix. This diff makes a start at addressing the first of these issues, namely: https://github.com/llvm/llvm-project/issues/21657

In short, passing illegally-typed live variable operands to llvm.experimental.stackmap (at -O1 or above) will make LLVM crash by an assertion failure. The issue is that stackmap operands are emitted directly to target nodes in the selection DAG, which means their operands don't get legalised.

The diff below introduces a (non-target) stackmap DAG node, so that the stackmap node and its operands can take part in regular legalisations.

Although ninja check passes, I don't expect this to be ready to be merged just yet because:

I only legalise integer operands and constant operands for now. I expect more needs to be done to legalise floats (and are there any other types that require legalisation?)
GC statepoints and patchpoints have the same problem as stackmaps and will require the same treatment.
There are a couple of questions, marked in the diff with XXX. I'd appreciate it if someone could comment on those points.

So although incomplete, I'm raising this now to check that I'm headed in the right direction. Any and all feedback would be much appreciated (even if you just say "yes, this looks correct/on-track", that's useful).

(The diff is formatted with git clang-format version 11, as found on Debian. Hope that's OK)

Diff Detail

Event Timeline

vext01 created this revision.May 16 2022, 4:25 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 16 2022, 4:25 AM

Herald added subscribers: pengfei, hiraditya. · View Herald Transcript

vext01 requested review of this revision.May 16 2022, 4:25 AM

Herald added a project: Restricted Project. · View Herald TranscriptMay 16 2022, 4:25 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

vext01 added reviewers: nikic, sepavloff, arsenm.May 16 2022, 5:43 AM

Herald added a subscriber: wdng. · View Herald TranscriptMay 16 2022, 5:43 AM

I wasn't sure who to add as reviewers, so on the sugesstion of people in IRC, I've added a few people who have recently touched selection the selection DAG. Hope that's OK.

Harbormaster completed remote builds in B164621: Diff 429680.May 16 2022, 5:48 AM

Can you please upload the diff with context (-U99999)?

I'm not familiar with this intrinsic, and it looks like it isn't specified in LangRef :/ It doesn't look like the original author is still active. Adding @reames who might be familiar with JIT and deoptimization.

arsenm added inline comments.May 16 2022, 6:04 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp
494–495	I would move this above the included file (preferably in the same position as the code is defined in the enum)
llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2256–2257	The opcode assert is implied by the cast. There's also probably no point to asserting it's i64
2265–2266	Ditto
2279–2280	You seem to be fixing up some Constants to TargetConstant here. It would be better to just do this up front in SelectionDAGBuilder like it was before
llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	If you're checking debug output you need REQUIRES: asserts. However, I don't think checking the legalizer output is the most helpful thing here. Better to check the final output
10	Probably should add some cases with excessively wide types, vectors and FP

Updated diff with full context.

I'm not familiar with this intrinsic, and it looks like it isn't specified in LangRef.

There's a little info here:
https://llvm.org/docs/StackMaps.html

arsenm added inline comments.May 16 2022, 6:20 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2279–2280	These parameters probably should have been marked immarg in the intrinsic definition?

Responded to @arsenm's comments. Some outstanding questions.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2256–2257	If the assertion fails, do we not end up doing a bogus cast and invoking undefined behaviour which will probably lead to a crash much later? Wouldn't it be best to have the assertion crash early?
2279–2280	You seem to be fixing up some Constants to TargetConstant here. It would be better to just do this up front in SelectionDAGBuilder like it was before Wasn't that the very reason that the constants were not being legalised? Because they were already target constants? These parameters probably should have been marked immarg in the intrinsic definition? Should that be handled as a separate change, since I'm not actually touching the intrinsic's definition in this work? I don't know how the workflow works after phabricator? Do I get an opportunity to make things into neat, self-contained commits?
llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	At what level should I check? I wanted to check at the MIR level, but it doesn't show the types there.
10	Yep. I was only working on integer types for now, but I can add changes to ensure those work too.

arsenm added inline comments.May 16 2022, 6:44 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2256–2257	The point of cast<> is it does the type assertion for you
2279–2280	I assume these are the fixed operands at the start of the argument list, and not the variadic section. I'm also assuming legalization is only relevant for the variadic arguments. Changing the intrinsic would be a separate change. Phabricator lets you add parent/child revisions to track related changes
llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	The types are meaningless after selection, so that makes sense. I would default to codegen to the end. MIR is less stable and what you care about is that the types were legalized to and selected to something, not the types themselves

Respond to more of @arsenm's comments. Thanks.

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2256–2257	I had assumed that `cast<>`was an alias to `static_cast<>`, but now I see that LLVM defines it! I had no idea! (forgive me, I'm pretty much learning C++ on the fly here) So I agree that asserting the type of the `opCode` is redundant, but: There's also probably no point to asserting it's i64 How comes? Isn't it good to check the sanity of the DAG we fed in? If we remove the assertion I think the stackmap code would eventually crash somewhere a lot later in the pipeline, but wouldn't it be good to crash early?
2279–2280	I assume these are the fixed operands at the start of the argument list, and not the variadic section. I'm also assuming legalization is only relevant for the variadic arguments. Ah, do you mean we should emit the first two arguments directly to target nodes? Assuming so, that'd probably work and might be a little faster. By the way, did you have any comments on the XXXs below? Changing the intrinsic would be a separate change. Phabricator lets you add parent/child revisions to track related changes. OK. Let's do that as a separate change.

vext01 added inline comments.May 16 2022, 7:10 AM

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	I would default to codegen to the end Do you mean to match asm code for the target architecture? Isn't that also going to be fragile?

Harbormaster completed remote builds in B164629: Diff 429695.May 16 2022, 7:26 AM

arsenm added inline comments.May 16 2022, 2:03 PM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2279–2280	Yes, if they needed to be target constants to begin with they should start as targetconstants
2286	FrameIndex will also be materialized into a register, unlike TargetFrameIndex
llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	Yes. No, this is a target specific test anyway.

vext01 added inline comments.May 17 2022, 2:00 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2286	And is that correct?
llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	Stackmap doesn't emit any code that we could match, but we could match the raw bytes of the `.llvmbc` section, just it's very fiddly. If that's the only way, so be it.

vext01 added inline comments.May 17 2022, 2:02 AM

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
3	sorry, not `.llvmbc`, I meant the stackmap section.

reames added reviewers: skatkov, dantrushin.May 17 2022, 7:17 AM

General direction looks reasonable. I do want to warn you that to my knowledge, no one is using stackmap or patchpoint. I believe they've been effectively dead code for the last couple years. statepoint is used, and should provide a superset of the stackmap/patchpoint functionality.

Code structure wise, would it be possibly to split this patch? (Warning: I am no selectiondag expert!) Could we add the SDNode in an NFC patch which does not legalize, and then handle legalization in a separate patch? If we can, it might make some of the review (mapping argument orders, etc) more straight forward.

arsenm added inline comments.May 17 2022, 7:30 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2286	Well the current code is emitting TargetFrameIndexes into the argument list, so I assume this should also just go direct to TargetFrameIndex from the start

Hi Philip,

In D125680#3519150, @reames wrote:

General direction looks reasonable.

Good!

I do want to warn you that to my knowledge, no one is using stackmap or patchpoint. I believe they've been effectively dead code for the last couple years. statepoint is used, and should provide a superset of the stackmap/patchpoint functionality.

This makes me a little nervous. For our system to succeed we need them to work 100%!

Code structure wise, would it be possibly to split this patch? (Warning: I am no selectiondag expert!) Could we add the SDNode in an NFC patch which does not legalize, and then handle legalization in a separate patch? If we can, it might make some of the review (mapping argument orders, etc) more straight forward.

We could split the patch, but I don't see much benefit. It will mean picking the diff apart later, just to have an intermediate "still crashing" state.

I don't know what the others think...

I don't particularly care about splitting the patch. You can't really break this down very far, and the individual changes aren't really individually testable

Revised diff addressing many comments, most notably:

Emit target nodes at DAG-construction time if we know they are already legal.
Test raw bytes of stackmap section.

Still to do:

i128 live vars causes a crash.
floats
vectors
make llvm.experimental.patchpoint and llvm.experimental.gc.statepoint legalise properly too.

Harbormaster completed remote builds in B165118: Diff 430370.May 18 2022, 7:06 AM

In D125680#3522332, @vext01 wrote:

make llvm.experimental.patchpoint and llvm.experimental.gc.statepoint legalise properly too.

Please do not touch statepoint in this patch. If you do not really need it, leave it alone at all.

Sorry, bodged the last diff upload.

Please do not touch statepoint in this patch. If you do not really need it, leave it alone at all.

Just to clarify why I've proposed this:

llvm.experimental.stackmap, llvm.experimental.patchpoint and llvm.experimental.gc.statepoint used to all share the same DAG building code via a function called addStackMapLiveVars() in SelectionDAGBuilder.cpp.

This function incorrectly emits target nodes, thus erroneously side-stepping the legalisation stage.

My change (so-far) detaches only llvm.experimental.stackmap from this function.

Ideally, all three facilities would use a common function that emits target-independent nodes when legalisation is required, but if upstream is against that, we should at the very least add a big flashing neon message to the effect of:

// XXX: this is incorrect because it emits target nodes, meaning that operands do not get legalised.

Are we on the same page?

In D125680#3522346, @vext01 wrote:

Please do not touch statepoint in this patch. If you do not really need it, leave it alone at all.

Just to clarify why I've proposed this:

llvm.experimental.stackmap, llvm.experimental.patchpoint and llvm.experimental.gc.statepoint used to all share the same DAG building code via a function called addStackMapLiveVars() in SelectionDAGBuilder.cpp.

No, Statepoint does not use this function.

This function incorrectly emits target nodes, thus erroneously side-stepping the legalisation stage.

My change (so-far) detaches only llvm.experimental.stackmap from this function.

This is enough for a _single_ patch
You can (and better) do patchpoint in a separate patch.
Statepoint is much more complicated and is in active productionuse today. It is too easy to break, so we will need to negotiate first if you'll want to change it

Ideally, all three facilities would use a common function that emits target-independent nodes when legalisation is required, but if upstream is against that, we should at the very least add a big flashing neon message to the effect of:
// XXX: this is incorrect because it emits target nodes, meaning that operands do not get legalised.

For statepoint deopt arguments are always legal right now (as well as gc live pointers), so they do not need any special processing.

Are we on the same page?

As far as I know nobody cares about stackmap/patchpoint. Neither do we.
But as I said, STATEPOINT is very important for us in its current form.

Harbormaster completed remote builds in B165119: Diff 430371.May 18 2022, 8:27 AM

Statepoint is much more complicated and is in active productionuse today. It is too easy to break, so we will need to negotiate first if you'll want to change it

We don't need statepoint, and if what you say is true, that it doesn't use addStackMapLiveVars(), then I have no need to touch statepoint and everyone is happy :)

vext01 mentioned this in D126069: Test stackmap support for i128.May 20 2022, 7:42 AM

t.p.northover mentioned this in rGc5e5cf125838: Test stackmap support for i128.May 23 2022, 3:56 AM

Here is the latest version of this diff.

Also in the meantime I've extended the stackmap tests to give us a reasonable degree of confidence that this change is not breaking things.

Added some i128 tests in D126069 (merged)
Added some float tests in D126204 (merged)
Added runtime stackmap tests in D126552 (needs review)

I had planned to include legalisation tests for things like vectors, structs and integers greater than 64-bits, but since those types already don't work properly with stackmaps (i.e. when already legal), I think that should be a separate change.

For more info on those issues, see:

I hope this diff is nearing completion, but D126552 should go in first (I'm struggling to find reviewers for that one, any takers?).

Herald added a subscriber: jsji. · View Herald TranscriptJun 8 2022, 5:30 AM

Harbormaster completed remote builds in B168528: Diff 435111.Jun 8 2022, 6:52 AM

vext01 mentioned this in D126552: [test-suite] Towards runtime tests for stackmaps.Jun 17 2022, 3:50 AM

Here's the same diff, just formatted correctly.

Since we are now no longer waiting on D126552, would it be OK for this to go into main please?

(I now have commit access, so with permission, I can do that myself)

Harbormaster completed remote builds in B170486: Diff 437863.Jun 17 2022, 7:36 AM

In general looks OK to me (modulo few style nits).
But I'm not an ISEL type legalization expert. Would be nice to have LGTM from someone else as well.

llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
2920	Nit: could you have it in single line as cases above (if it fits in 80 chars)?
llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
1729	Nit: same comment for a single line
4671	Nit: and here too
4686	Nit: extra empty line
llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
9422	Use existing `Ops` instead of introducing new vector?
llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp
490	Single line again

vext01 added inline comments.Jun 21 2022, 1:39 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
2920	I think `git clang-format` wanted it this way, but I will check.

I've just realised that no test covers the case where a i128 constant is used as a stackmap live variable.

Before any attempt to legalise stackmap operands, a stackmap call would be directly translated to a target node and any constant i128 operand becomes a TargetConstant (also of type i128). LLVM would then crash if the value of the i128 exceeds what can be expressed by an i64 (see this bug).

This change causes LLVM to try to legalise the constant i128, which comes with some other caveats. When legalising a constant i128, LLVM is going to want to expand the type into smaller chunks (in the case of a constant i128, two smaller i64s). This is problematic because the stackmap format has no way to express such an expansion -- it is assumed that a constant cannot be larger than 64-bits ([see LargeConstant here](https://llvm.org/docs/StackMaps.html#stack-map-format).

A similar issue will arise for non-constant i128s, which may be split across registers during legalisation. Again, the stackmap format has no way to express that: it is assumed that if something lives in registers, it cannot span more than a single register.

I don't want to change the stackmap format right now, so I'm going to have to try and find an imperfect solution in the interim. I hope that's OK.

Sorry this is taking so long.

Made changes requested by @dantrushin, fixed (and tested) the i128 issue, and tidied up quite a bit.

I was able to revert changes I had made to some of the existing assertions by having the stackmap legalisation routines replace the node themselves.

Hoping this is ready now. I have a branch to do the same for llvm.experimental.patchpoint(), which I'd like to raise a diff for, but this diff needs to go in first.

llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp
2920	Yep. If I fold the lines, then `git clang-format` just undoes it. Should I override it in this instance, or?

Harbormaster completed remote builds in B171308: Diff 439000.Jun 22 2022, 8:11 AM

arsenm added inline comments.Jun 23 2022, 6:12 AM

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
84	For a general test, a non-register argument may be more helpful
85	Is it worth testing vectors?

vext01 added inline comments.Jun 23 2022, 6:34 AM

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
85	We spoke on IRC, but for the others, vector support for stackmaps is currently broken, so I've made no attempt to address them for now. https://github.com/llvm/llvm-project/issues/55613

To ease reviewing, here's what changed since the last version:

diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
index 64d92e60f5de..7ed2808600cf 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
@@ -5515,8 +5515,12 @@ SDValue DAGTypeLegalizer::ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo) {
         DAG.getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
     NewOps.push_back(DAG.getTargetConstant(CN->getZExtValue(), DL, Ty));
   } else {
-    // FIXME: https://github.com/llvm/llvm-project/issues/26431
-    DAG.getContext()->emitError("Can't expand stackmap operand");
+    // FIXME: There are a couple of problems with expanding non-constants for
+    // stackmaps:
+    //  - https://github.com/llvm/llvm-project/issues/26431
+    //  - https://github.com/llvm/llvm-project/issues/55957
+    DAG.getContext()->emitError(
+        "expanding this stackmap operand is unimplemented");
   }
 
   // Copy remaining operands.
diff --git a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
index a93ffac1e1bb..80d95e3b66a5 100644
--- a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
+++ b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
@@ -29,7 +29,7 @@
 ;     CHECK-NEXT:   .long {{.*}}
 ;     CHECK-NEXT:   .short {{.*}}
 ;     NumLocations
-;     CHECK-NEXT:   .short 5
+;     CHECK-NEXT:   .short 7
 ;     Location[NumLocations]
 ;       Location[0]
 ;         CHECK-NEXT: .byte   1
@@ -48,7 +48,7 @@
 ;       Location[2]
 ;         CHECK-NEXT: .byte   1
 ;         CHECK-NEXT: .byte   0
-;         CHECK-NEXT: .short  16
+;         CHECK-NEXT: .short  1
 ;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
@@ -60,26 +60,51 @@
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
 ;       Location[4]
+;         CHECK-NEXT: .byte   1
+;         CHECK-NEXT: .byte   0
+;         CHECK-NEXT: .short  16
+;         CHECK-NEXT: .short  {{.*}}
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .long   0
+;       Location[5]
 ;         CHECK-NEXT: .byte   4
 ;         CHECK-NEXT: .byte   0
 ;         CHECK-NEXT: .short  8
 ;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   66
+;       Location[4]
+;         CHECK-NEXT: .byte   1
+;         CHECK-NEXT: .byte   0
+;         CHECK-NEXT: .short  4
+;         CHECK-NEXT: .short  {{.*}}
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .long   0
+
+@p32 = external global i8 addrspace(270)*
 
 declare void @llvm.experimental.stackmap(i64, i32, ...)
 
 define dso_local i32 @main(i32 %argc, i8** %argv) {
 entry:
-  %intreg = icmp eq i32 %argc, 5
+  %i1reg = icmp eq i32 %argc, 5
+  %i7reg = zext i1 %i1reg to i7
+  %i128reg = zext i1 %i1reg to i128
   %halfreg = sitofp i32 %argc to half
+  %ptr32 = load i8 addrspace(270)*, i8 addrspace(270)** @p32
   call void (i64, i32, ...) @llvm.experimental.stackmap(
     i64 0,
     i32 0,
-    i1 %intreg,
+    i1 %i1reg,
     i7 22,
+    i7 %i7reg,
     half 1.0,
     half %halfreg,
-    i128 66)
+    i128 66,
+    ; FIXME: test non-constant i128 once these are fixed:
+    ;  - https://github.com/llvm/llvm-project/issues/26431
+    ;  - https://github.com/llvm/llvm-project/issues/55957
+    ;i128 %i128reg
+    i8 addrspace(270)* %ptr32)
   ret i32 0
 }

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
84	I've added the missing non-constant i7, and I would have done the same for `i128`, but that's already broken in llvm before my change and my change doesn't fix it either. That can be another diff to fix that. Also added a 32-bit address space pointer into the test.

Harbormaster completed remote builds in B171617: Diff 439406.Jun 23 2022, 8:54 AM

Updated the diff to test structs and added some more comments.

Note that structs and vectors were broken before my change, and remain broken for now.

Diff to last version:

diff --git a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
index 80d95e3b66a5..32242ac1239b 100644
--- a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
+++ b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
@@ -29,7 +29,7 @@
 ;     CHECK-NEXT:   .long {{.*}}
 ;     CHECK-NEXT:   .short {{.*}}
 ;     NumLocations
-;     CHECK-NEXT:   .short 7
+;     CHECK-NEXT:   .short 11
 ;     Location[NumLocations]
 ;       Location[0]
 ;         CHECK-NEXT: .byte   1
@@ -80,18 +80,50 @@
 ;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
+;       Location[5]
+;         CHECK-NEXT: .byte   4
+;         CHECK-NEXT: .byte   0
+;         CHECK-NEXT: .short  8
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .long   0
+;       Location[6]
+;         CHECK-NEXT: .byte   1
+;         CHECK-NEXT: .byte   0
+;         CHECK-NEXT: .short  4
+;         CHECK-NEXT: .short  {{.*}}
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .long   0
+;       Location[7]
+;         CHECK-NEXT: .byte   4
+;         CHECK-NEXT: .byte   0
+;         CHECK-NEXT: .short  8
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .long   0
+;       Location[6]
+;         CHECK-NEXT: .byte   1
+;         CHECK-NEXT: .byte   0
+;         CHECK-NEXT: .short  1
+;         CHECK-NEXT: .short  {{.*}}
+;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .long   0
 
 @p32 = external global i8 addrspace(270)*
 
+%struct1 = type {i32, i64}
+%struct2 = type {i1, i1, i1}
+
 declare void @llvm.experimental.stackmap(i64, i32, ...)
 
 define dso_local i32 @main(i32 %argc, i8** %argv) {
 entry:
   %i1reg = icmp eq i32 %argc, 5
   %i7reg = zext i1 %i1reg to i7
-  %i128reg = zext i1 %i1reg to i128
   %halfreg = sitofp i32 %argc to half
   %ptr32 = load i8 addrspace(270)*, i8 addrspace(270)** @p32
+  %structreg1 = insertvalue %struct1 zeroinitializer, i32 %argc, 0
+  %structreg2 = insertvalue %struct2 zeroinitializer, i1 %i1reg, 0
   call void (i64, i32, ...) @llvm.experimental.stackmap(
     i64 0,
     i32 0,
@@ -101,10 +133,20 @@ entry:
     half 1.0,
     half %halfreg,
     i128 66,
+    ; FIXME: fix and test vectors. At the moment even legally sized vectors
+    ; are broken:
+    ; https://github.com/llvm/llvm-project/issues/55613
+    ;
     ; FIXME: test non-constant i128 once these are fixed:
     ;  - https://github.com/llvm/llvm-project/issues/26431
     ;  - https://github.com/llvm/llvm-project/issues/55957
-    ;i128 %i128reg
-    i8 addrspace(270)* %ptr32)
+    i8 addrspace(270)* %ptr32,
+    ; FIXME: The stackmap record generated for structs is incorrect:
+    ;  - https://github.com/llvm/llvm-project/issues/55649
+    ;  - https://github.com/llvm/llvm-project/issues/55957
+    %struct1 zeroinitializer,
+    %struct1 %structreg1,
+    %struct2 zeroinitializer,
+    %struct2 %structreg2)
   ret i32 0
 }

Let's move this forward. Can we put this into main?

Harbormaster completed remote builds in B171818: Diff 439691.Jun 24 2022, 4:28 AM

arsenm added inline comments.Jun 27 2022, 6:12 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2314	I'm a bit surprised this can unconditionally zero extend. I'd expect ANY_EXTEND or to have to consider some ABI property
5512–5519	I'd rather not add a special case to make this work only for constants
5524–5525	My skimming of the issue says this is an issue with the lowering from IR, so why do you need to error here? In any case I think emitError should be reserved for cases where reasonable user code ran into a truly unhandleable case. It would be better to just let it fall through and hit the ordinary legalize fatal error
llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2274–2277	You're still fixing this into TargetConstant at selection time instead of upfront when lowering from the IR

vext01 added inline comments.Jun 27 2022, 7:04 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
2314	Perhaps not some ABI property, as a call to `llvm.experimental.stackmap` isn't really a call, so there is no ABI to speak of. I'll look into `ANY_EXTEND`. I'm not familiar.
llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2274–2277	This code is selecting the stackmap live variables. The (non-frameindex) live variables can't be emitted to target constants at DAG-build time or they won't get legalized and that's the problem that this change is trying to address. Correct me if I'm wrong though.

OK, on IRC @arsenm and I agreed that for the cases that we can't yet handle, we should see the same legaliser error that we see in-tree now.o

This change implements that, albeit slightly clumsily with a goto. If you simply don't expand the problematic operand, as I first tried, the legaliser gets stuck in an infinite loop. Note also that goto default is not valid in C/C++, so had to add another label.

We do still have to special-case constants I'm afraid.

Also fixed the EXTEND_ANY thing and fixed some incorrectly numbered comments in the test.

Diff to previous:

diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp b/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
index 7ed2808600cf..44e6a9e5a6a6 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
@@ -2311,7 +2311,10 @@ SDValue DAGTypeLegalizer::PromoteIntOp_SET_ROUNDING(SDNode *N) {
 SDValue DAGTypeLegalizer::PromoteIntOp_STACKMAP(SDNode *N, unsigned OpNo) {
   assert(OpNo > 1); // Because the first two arguments are guaranteed legal.
   SmallVector<SDValue> NewOps(N->ops().begin(), N->ops().end());
-  NewOps[OpNo] = ZExtPromotedInteger(N->getOperand(OpNo));
+  SDValue Operand = N->getOperand(OpNo);
+  EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), Operand.getValueType());
+  NewOps[OpNo] =
+      DAG.getNode(ISD::ANY_EXTEND, SDLoc(N), NVT, N->getOperand(OpNo));
   return SDValue(DAG.UpdateNodeOperands(N, NewOps), 0);
 }
 
@@ -4631,6 +4634,7 @@ bool DAGTypeLegalizer::ExpandIntegerOperand(SDNode *N, unsigned OpNo) {
     return false;
 
   switch (N->getOpcode()) {
+  fail:
   default:
   #ifndef NDEBUG
     dbgs() << "ExpandIntegerOperand Op #" << OpNo << ": ";
@@ -4665,7 +4669,11 @@ bool DAGTypeLegalizer::ExpandIntegerOperand(SDNode *N, unsigned OpNo) {
 
   case ISD::ATOMIC_STORE:      Res = ExpandIntOp_ATOMIC_STORE(N); break;
   case ISD::STACKMAP:
-    Res = ExpandIntOp_STACKMAP(N, OpNo);
+    Optional<SDValue> MaybeRes = ExpandIntOp_STACKMAP(N, OpNo);
+    if (MaybeRes.hasValue())
+      Res = MaybeRes.getValue();
+    else
+      goto fail;
     break;
   }
 
@@ -5496,7 +5504,8 @@ SDValue DAGTypeLegalizer::PromoteIntOp_CONCAT_VECTORS(SDNode *N) {
   return DAG.getBuildVector(N->getValueType(0), dl, NewOps);
 }
 
-SDValue DAGTypeLegalizer::ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo) {
+Optional<SDValue> DAGTypeLegalizer::ExpandIntOp_STACKMAP(SDNode *N,
+                                                         unsigned OpNo) {
   assert(OpNo > 1);
 
   SDValue Op = N->getOperand(OpNo);
@@ -5508,19 +5517,21 @@ SDValue DAGTypeLegalizer::ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo) {
     NewOps.push_back(N->getOperand(I));
 
   if (Op->getOpcode() == ISD::Constant) {
-    // FIXME: https://github.com/llvm/llvm-project/issues/55609
     ConstantSDNode *CN = cast<ConstantSDNode>(Op);
     EVT Ty = Op.getValueType();
-    NewOps.push_back(
-        DAG.getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
-    NewOps.push_back(DAG.getTargetConstant(CN->getZExtValue(), DL, Ty));
+    if (CN->getConstantIntValue()->getValue().getActiveBits() < 64) {
+      NewOps.push_back(
+          DAG.getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
+      NewOps.push_back(DAG.getTargetConstant(CN->getZExtValue(), DL, Ty));
+    } else {
+      // FIXME: https://github.com/llvm/llvm-project/issues/55609
+      return Optional<SDValue>();
+    }
   } else {
-    // FIXME: There are a couple of problems with expanding non-constants for
-    // stackmaps:
+    // FIXME: Non-constant operands are not yet handled:
     //  - https://github.com/llvm/llvm-project/issues/26431
     //  - https://github.com/llvm/llvm-project/issues/55957
-    DAG.getContext()->emitError(
-        "expanding this stackmap operand is unimplemented");
+    return Optional<SDValue>();
   }
 
   // Copy remaining operands.
diff --git a/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h b/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
index 2807b7f5ae68..bc8257c9ec4c 100644
--- a/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
+++ b/llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h
@@ -494,7 +494,7 @@ private:
   SDValue ExpandIntOp_RETURNADDR(SDNode *N);
   SDValue ExpandIntOp_ATOMIC_STORE(SDNode *N);
   SDValue ExpandIntOp_SPLAT_VECTOR(SDNode *N);
-  SDValue ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo);
+  Optional<SDValue> ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo);
 
   void IntegerExpandSetCCOperands(SDValue &NewLHS, SDValue &NewRHS,
                                   ISD::CondCode &CCCode, const SDLoc &dl);
diff --git a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
index 32242ac1239b..bc624be5318e 100644
--- a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
+++ b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
@@ -73,35 +73,35 @@
 ;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   66
-;       Location[4]
+;       Location[6]
 ;         CHECK-NEXT: .byte   1
 ;         CHECK-NEXT: .byte   0
 ;         CHECK-NEXT: .short  4
 ;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
-;       Location[5]
+;       Location[7]
 ;         CHECK-NEXT: .byte   4
 ;         CHECK-NEXT: .byte   0
 ;         CHECK-NEXT: .short  8
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
-;       Location[6]
+;       Location[8]
 ;         CHECK-NEXT: .byte   1
 ;         CHECK-NEXT: .byte   0
 ;         CHECK-NEXT: .short  4
 ;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
-;       Location[7]
+;       Location[9]
 ;         CHECK-NEXT: .byte   4
 ;         CHECK-NEXT: .byte   0
 ;         CHECK-NEXT: .short  8
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
-;       Location[6]
+;       Location[10]
 ;         CHECK-NEXT: .byte   1
 ;         CHECK-NEXT: .byte   0
 ;         CHECK-NEXT: .short  1

LGTY?

Harbormaster completed remote builds in B172466: Diff 440603.Jun 28 2022, 8:00 AM

In D125680#3615526, @vext01 wrote:

Diff to previous:

FYI Phabricator gives you this if you look at the history tab

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
2274–2277	This is a pretty weird behavior, but does the type actually matter? Could you just unconditionally use 64-bit constants?

vext01 added inline comments.Jul 4 2022, 7:00 AM

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

2274–2277

Sorry for the delay. I was away at the end of last week.

I've just tried to implement your suggestion, it has two undesirable effects:

It changes the way large constant structs which get split into e.g. mergevalues(constant, constant) are codegenned. It appears that the constituent parts of the struct get allocated to registers and not constants.
For FastISel, some some small constants get reported as long constants in the stackmap record.
For FastISel It causes large constants to be emitted in a different order than in SelectionDAGISel, which means that the test files which test both backends cannot succeed.

This is too much scary breakage, so I suggest we emit constants as we did before.

Here is the diff (to previous) that I was working on before I abandoned it:

diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
index b9816bb34bc9..497c4c77880a 100644
--- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
@@ -9449,6 +9449,10 @@ void SelectionDAGBuilder::visitStackmap(const CallInst &CI) {
       const TargetLowering &TLI = DAG.getTargetLoweringInfo();
       Ops.push_back(DAG.getTargetFrameIndex(
           FI->getIndex(), TLI.getFrameIndexTy(DAG.getDataLayout())));
+    } else if (ConstantSDNode *C = dyn_cast<ConstantSDNode>(Op)) {
+		// Directly emit a 64-bit target constant.
+		Ops.push_back(DAG.getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
+		Ops.push_back(DAG.getTargetConstant(C->getZExtValue(), DL, MVT::i64));
     } else {
       // Otherwise emit a target independent node to be legalised.
       Ops.push_back(getValue(CI.getArgOperand(I)));
diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
index aff4e9a94fef..936f48f34870 100644
--- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
+++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp
@@ -2269,15 +2269,7 @@ void SelectionDAGISel::Select_STACKMAP(SDNode *N) {
     // FrameIndex nodes should have been directly emitted to TargetFrameIndex
     // nodes at DAG-construction time.
     assert(OpNode->getOpcode() != ISD::FrameIndex);
-
-    if (OpNode->getOpcode() == ISD::Constant) {
-      Ops.push_back(
-          CurDAG->getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
-      O = CurDAG->getTargetConstant(
-          cast<ConstantSDNode>(OpNode)->getZExtValue(), DL, It->getValueType());
-    } else {
-      O = *It;
-    }
+    O = *It;
     Ops.push_back(O);
   }
 
diff --git a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
index bc624be5318e..a2d629fce00c 100644
--- a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
+++ b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll
@@ -81,10 +81,10 @@
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
 ;       Location[7]
-;         CHECK-NEXT: .byte   4
+;         CHECK-NEXT: .byte   1
 ;         CHECK-NEXT: .byte   0
-;         CHECK-NEXT: .short  8
-;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .short  4
+;         CHECK-NEXT: .short  {{.*}}
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
 ;       Location[8]
@@ -95,10 +95,10 @@
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
 ;       Location[9]
-;         CHECK-NEXT: .byte   4
+;         CHECK-NEXT: .byte   1
 ;         CHECK-NEXT: .byte   0
-;         CHECK-NEXT: .short  8
-;         CHECK-NEXT: .short  0
+;         CHECK-NEXT: .short  1
+;         CHECK-NEXT: .short  2
 ;         CHECK-NEXT: .short  0
 ;         CHECK-NEXT: .long   0
 ;       Location[10]
diff --git a/llvm/test/CodeGen/X86/stackmap-fast-isel.ll b/llvm/test/CodeGen/X86/stackmap-fast-isel.ll
index dd25065f3063..558029cdf1c5 100644
--- a/llvm/test/CodeGen/X86/stackmap-fast-isel.ll
+++ b/llvm/test/CodeGen/X86/stackmap-fast-isel.ll
@@ -29,8 +29,8 @@
 ; CHECK-NEXT:   .quad 4
 
 ; Large Constants
-; CHECK-NEXT:   .quad   2147483648
 ; CHECK-NEXT:   .quad   4294967295
+; CHECK-NEXT:   .quad   2147483648
 ; CHECK-NEXT:   .quad   4294967296
 
 ; Callsites
@@ -46,14 +46,14 @@
 ; CHECK-NEXT:   .short  8
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .short  0
-; CHECK-NEXT:   .long   -1
+; CHECK-NEXT:   .long   65535
 ; SmallConstant
 ; CHECK-NEXT:   .byte   4
 ; CHECK-NEXT:   .byte   0
 ; CHECK-NEXT:   .short  8
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .short  0
-; CHECK-NEXT:   .long   -1
+; CHECK-NEXT:   .long   65535
 ; SmallConstant
 ; CHECK-NEXT:   .byte   4
 ; CHECK-NEXT:   .byte   0
@@ -76,19 +76,19 @@
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .long   2147483647
 ; SmallConstant
-; CHECK-NEXT:   .byte   4
+; CHECK-NEXT:   .byte   5
 ; CHECK-NEXT:   .byte   0
 ; CHECK-NEXT:   .short  8
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .short  0
-; CHECK-NEXT:   .long   -1
+; CHECK-NEXT:   .long   0
 ; SmallConstant
-; CHECK-NEXT:   .byte   4
+; CHECK-NEXT:   .byte   5
 ; CHECK-NEXT:   .byte   0
 ; CHECK-NEXT:   .short  8
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .short  0
-; CHECK-NEXT:   .long   -1
+; CHECK-NEXT:   .long   0
 ; SmallConstant
 ; CHECK-NEXT:   .byte   4
 ; CHECK-NEXT:   .byte   0
@@ -102,14 +102,14 @@
 ; CHECK-NEXT:   .short  8
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .short  0
-; CHECK-NEXT:   .long   0
+; CHECK-NEXT:   .long   1
 ; LargeConstant at index 1
 ; CHECK-NEXT:   .byte   5
 ; CHECK-NEXT:   .byte   0
 ; CHECK-NEXT:   .short  8
 ; CHECK-NEXT:   .short  0
 ; CHECK-NEXT:   .short  0
-; CHECK-NEXT:   .long   1
+; CHECK-NEXT:   .long   0
 ; LargeConstant at index 2
 ; CHECK-NEXT:   .byte   5
 ; CHECK-NEXT:   .byte   0

arsenm added inline comments.Jul 5 2022, 8:34 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
4673–4674	Don't see why you are introducing this goto
5501	Don't see why you're using Optional. Just use SDValue() like everything else?

vext01 added inline comments.Jul 5 2022, 9:14 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
4673–4674	It's because it's the only path to the error state contained in the `default` branch of the switch. In an earlier conversation we said that we wanted to trigger the default "can't legalise this" error if we encounter a case we cannot handle.
5501	If we return `SDValue()` then we are signalling that the handler has already replaced the node with a new node with the operand in question legalised. If we do that, then the legaliser will get stuck in an infinite loop trying to legalise the operand that we cannot handle, since we never actually replace the operand. I introduced the optional so that we could signal the error state and break out of the legaliser loop (i.e. the goto you questioned).

arsenm added inline comments.Jul 5 2022, 10:15 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp
4673–4674	But there's no actual failure case here so I don't understand what this is doing. This is always producing something
5521–5522	Just leave the node as-is? It will fail to legalize later

Removed clunky error handling.

vext01 added inline comments.Jul 6 2022, 2:52 AM

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

5521–5522

Sorry, there was some confusion at my end.

Leaving the node as-is (i.e. return SDValue(N, 0);) is the case that will cause an infinite loop in the legaliser. Returning a "null" value (i.e. return SDValue()) is fine though.

If I put a print at the top of this function:

errs() << __func__ << ": " << OpNo << ": "; N->dump();

And update the return SDValue() to return SDValue(N, 0);, and then use an input that covers one of those cases, e.g.:

declare void @llvm.experimental.stackmap(i64, i32, ...)

define void @f() {
  call void (i64, i32, ...) @llvm.experimental.stackmap(i64 0, i32 0, i128 9223372036854775808)
  ret void
}

Then we see the function called indefinitely, trying to legalise the same operand forever:

ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808>
...

This is due to the special casing in DAGTypeLegalizer::ExpandIntegerOperand():

Res = ExpandIntOp_STACKMAP(N, OpNo);
...
    // If the result is null, the sub-method took care of registering results etc.                                                                                                                                                            
    if (!Res.getNode()) return false;                                                                                                                                                                                                         
                                                                                                                                                                                                                                              
    // If the result is N, the sub-method updated N in place.  Tell the legalizer                                                                                                                                                             
    // core about this.                                                                                                                                                                                                                       
    if (Res.getNode() == N)                                                                                                                                                                                                                   
      return true;

Returning SDValue() hits the return false, whereas return SDValue(N, 0) hits return true.

The return value is whether or not the operand needs another round of legalisation. At the call site of ExpandIntegerOperand(), there is:

NeedsReanalyzing = ExpandIntegerOperand(N, i);

So we need to return false to:

not loop forever
bail out with the right error.

So I believe return SDValue() to be correct. The latest diff reflects this.

Harbormaster completed remote builds in B173829: Diff 442478.Jul 6 2022, 3:50 AM

Small tweak.

Harbormaster completed remote builds in B173875: Diff 442540.Jul 6 2022, 6:38 AM

arsenm accepted this revision.Jul 6 2022, 6:44 AM

This revision is now accepted and ready to land.Jul 6 2022, 6:44 AM

vext01 mentioned this in rGed8ef65f3d93: [stackmaps] Start legalizing live variable operands.Jul 6 2022, 9:02 AM

vext01 mentioned this in D129268: Legalise patchpoint arguments..Jul 7 2022, 3:15 AM

vext01 mentioned this in rG2e62a26fd77d: [stackmaps] Legalise patchpoint arguments..Jul 15 2022, 4:02 AM

vext01 closed this revision.Jul 18 2022, 8:27 AM

This landed in ed8ef65

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

ISDOpcodes.h

5 lines

SelectionDAGISel.h

1 line

lib/

CodeGen/

SelectionDAG/

LegalizeFloatTypes.cpp

17 lines

LegalizeIntegerTypes.cpp

57 lines

LegalizeTypes.h

3 lines

SelectionDAGBuilder.cpp

56 lines

SelectionDAGDumper.cpp

2 lines

SelectionDAGISel.cpp

50 lines

test/

CodeGen/

X86/

selectiondag-stackmap-legalize.ll

152 lines

Diff 442540

llvm/include/llvm/CodeGen/ISDOpcodes.h

	Show First 20 Lines • Show All 1,279 Lines • ▼ Show 20 Lines
	VECREDUCE_AND,			VECREDUCE_AND,
	VECREDUCE_OR,			VECREDUCE_OR,
	VECREDUCE_XOR,			VECREDUCE_XOR,
	VECREDUCE_SMAX,			VECREDUCE_SMAX,
	VECREDUCE_SMIN,			VECREDUCE_SMIN,
	VECREDUCE_UMAX,			VECREDUCE_UMAX,
	VECREDUCE_UMIN,			VECREDUCE_UMIN,

				// The `llvm.experimental.stackmap` intrinsic.
				// Operands: input chain, glue, <id>, <numShadowBytes>, [live0[, live1...]]
				// Outputs: output chain, glue
				STACKMAP,

	// Vector Predication			// Vector Predication
	#define BEGIN_REGISTER_VP_SDNODE(VPSDID, ...) VPSDID,			#define BEGIN_REGISTER_VP_SDNODE(VPSDID, ...) VPSDID,
	#include "llvm/IR/VPIntrinsics.def"			#include "llvm/IR/VPIntrinsics.def"

	/// BUILTIN_OP_END - This must be the last enum value in this list.			/// BUILTIN_OP_END - This must be the last enum value in this list.
	/// The target-specific pre-isel opcode values start here.			/// The target-specific pre-isel opcode values start here.
	BUILTIN_OP_END			BUILTIN_OP_END
	};			};
	▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/SelectionDAGISel.h

Show First 20 Lines • Show All 315 Lines • ▼ Show 20 Lines	private:
void Select_INLINEASM(SDNode *N);		void Select_INLINEASM(SDNode *N);
void Select_READ_REGISTER(SDNode *Op);		void Select_READ_REGISTER(SDNode *Op);
void Select_WRITE_REGISTER(SDNode *Op);		void Select_WRITE_REGISTER(SDNode *Op);
void Select_UNDEF(SDNode *N);		void Select_UNDEF(SDNode *N);
void CannotYetSelect(SDNode *N);		void CannotYetSelect(SDNode *N);

void Select_FREEZE(SDNode *N);		void Select_FREEZE(SDNode *N);
void Select_ARITH_FENCE(SDNode *N);		void Select_ARITH_FENCE(SDNode *N);
		void Select_STACKMAP(SDNode *N);

private:		private:
void DoInstructionSelection();		void DoInstructionSelection();
SDNode MorphNode(SDNode Node, unsigned TargetOpc, SDVTList VTList,		SDNode MorphNode(SDNode Node, unsigned TargetOpc, SDVTList VTList,
ArrayRef<SDValue> Ops, unsigned EmitNodeInfo);		ArrayRef<SDValue> Ops, unsigned EmitNodeInfo);

/// Prepares the landing pad to take incoming values or do other EH		/// Prepares the landing pad to take incoming values or do other EH
/// personality specific tasks. Returns true if the block should be		/// personality specific tasks. Returns true if the block should be
Show All 40 Lines

llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp

Show First 20 Lines • Show All 2,909 Lines • ▼ Show 20 Lines	bool DAGTypeLegalizer::SoftPromoteHalfOperand(SDNode *N, unsigned OpNo) {
case ISD::FP_TO_SINT_SAT:		case ISD::FP_TO_SINT_SAT:
case ISD::FP_TO_UINT_SAT:		case ISD::FP_TO_UINT_SAT:
Res = SoftPromoteHalfOp_FP_TO_XINT_SAT(N); break;		Res = SoftPromoteHalfOp_FP_TO_XINT_SAT(N); break;
case ISD::STRICT_FP_EXTEND:		case ISD::STRICT_FP_EXTEND:
case ISD::FP_EXTEND: Res = SoftPromoteHalfOp_FP_EXTEND(N); break;		case ISD::FP_EXTEND: Res = SoftPromoteHalfOp_FP_EXTEND(N); break;
case ISD::SELECT_CC: Res = SoftPromoteHalfOp_SELECT_CC(N, OpNo); break;		case ISD::SELECT_CC: Res = SoftPromoteHalfOp_SELECT_CC(N, OpNo); break;
case ISD::SETCC: Res = SoftPromoteHalfOp_SETCC(N); break;		case ISD::SETCC: Res = SoftPromoteHalfOp_SETCC(N); break;
case ISD::STORE: Res = SoftPromoteHalfOp_STORE(N, OpNo); break;		case ISD::STORE: Res = SoftPromoteHalfOp_STORE(N, OpNo); break;
		case ISD::STACKMAP:
		Res = SoftPromoteHalfOp_STACKMAP(N, OpNo);
		break;
		dantrushinUnsubmitted Not Done Reply Inline Actions Nit: could you have it in single line as cases above (if it fits in 80 chars)? dantrushin: Nit: could you have it in single line as cases above (if it fits in 80 chars)?
		vext01AuthorUnsubmitted Done Reply Inline Actions I think `git clang-format` wanted it this way, but I will check. vext01: I think `git clang-format` wanted it this way, but I will check.
		vext01AuthorUnsubmitted Done Reply Inline Actions Yep. If I fold the lines, then `git clang-format` just undoes it. Should I override it in this instance, or? vext01: Yep. If I fold the lines, then `git clang-format` just undoes it. Should I override it in this…
}		}

if (!Res.getNode())		if (!Res.getNode())
return false;		return false;

assert(Res.getNode() != N && "Expected a new node!");		assert(Res.getNode() != N && "Expected a new node!");

assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&		assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&
▲ Show 20 Lines • Show All 111 Lines • ▼ Show 20 Lines	SDValue DAGTypeLegalizer::SoftPromoteHalfOp_STORE(SDNode *N, unsigned OpNo) {
SDValue Val = ST->getValue();		SDValue Val = ST->getValue();
SDLoc dl(N);		SDLoc dl(N);

assert(!ST->isTruncatingStore() && "Unexpected truncating store.");		assert(!ST->isTruncatingStore() && "Unexpected truncating store.");
SDValue Promoted = GetSoftPromotedHalf(Val);		SDValue Promoted = GetSoftPromotedHalf(Val);
return DAG.getStore(ST->getChain(), dl, Promoted, ST->getBasePtr(),		return DAG.getStore(ST->getChain(), dl, Promoted, ST->getBasePtr(),
ST->getMemOperand());		ST->getMemOperand());
}		}

		SDValue DAGTypeLegalizer::SoftPromoteHalfOp_STACKMAP(SDNode *N, unsigned OpNo) {
		assert(OpNo > 1); // Because the first two arguments are guaranteed legal.
		SmallVector<SDValue> NewOps(N->ops().begin(), N->ops().end());
		SDValue Op = N->getOperand(OpNo);
		NewOps[OpNo] = GetSoftPromotedHalf(Op);
		SDValue NewNode =
		DAG.getNode(N->getOpcode(), SDLoc(N), N->getVTList(), NewOps);

		for (unsigned ResNum = 0; ResNum < N->getNumValues(); ResNum++)
		ReplaceValueWith(SDValue(N, ResNum), NewNode.getValue(ResNum));

		return SDValue(); // Signal that we replaced the node ourselves.
		}

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

Show All 13 Lines
// computation in two identical registers of a smaller type. For example,		// computation in two identical registers of a smaller type. For example,
// implementing i64 arithmetic in two i32 registers (often needed on 32-bit		// implementing i64 arithmetic in two i32 registers (often needed on 32-bit
// targets).		// targets).
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "LegalizeTypes.h"		#include "LegalizeTypes.h"
#include "llvm/Analysis/TargetLibraryInfo.h"		#include "llvm/Analysis/TargetLibraryInfo.h"
		#include "llvm/CodeGen/StackMaps.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/KnownBits.h"		#include "llvm/Support/KnownBits.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "legalize-types"		#define DEBUG_TYPE "legalize-types"
▲ Show 20 Lines • Show All 1,688 Lines • ▼ Show 20 Lines	bool DAGTypeLegalizer::PromoteIntegerOperand(SDNode *N, unsigned OpNo) {
case ISD::VP_REDUCE_SMAX:		case ISD::VP_REDUCE_SMAX:
case ISD::VP_REDUCE_SMIN:		case ISD::VP_REDUCE_SMIN:
case ISD::VP_REDUCE_UMAX:		case ISD::VP_REDUCE_UMAX:
case ISD::VP_REDUCE_UMIN:		case ISD::VP_REDUCE_UMIN:
Res = PromoteIntOp_VP_REDUCE(N, OpNo);		Res = PromoteIntOp_VP_REDUCE(N, OpNo);
break;		break;

case ISD::SET_ROUNDING: Res = PromoteIntOp_SET_ROUNDING(N); break;		case ISD::SET_ROUNDING: Res = PromoteIntOp_SET_ROUNDING(N); break;
		case ISD::STACKMAP:
		Res = PromoteIntOp_STACKMAP(N, OpNo);
		break;
		dantrushinUnsubmitted Not Done Reply Inline Actions Nit: same comment for a single line dantrushin: Nit: same comment for a single line
}		}

// If the result is null, the sub-method took care of registering results etc.		// If the result is null, the sub-method took care of registering results etc.
if (!Res.getNode()) return false;		if (!Res.getNode()) return false;

// If the result is N, the sub-method updated N in place. Tell the legalizer		// If the result is N, the sub-method updated N in place. Tell the legalizer
// core about this.		// core about this.
if (Res.getNode() == N)		if (Res.getNode() == N)
▲ Show 20 Lines • Show All 565 Lines • ▼ Show 20 Lines	SDValue DAGTypeLegalizer::PromoteIntOp_VP_REDUCE(SDNode *N, unsigned OpNo) {
return DAG.getNode(ISD::TRUNCATE, DL, VT, Reduce);		return DAG.getNode(ISD::TRUNCATE, DL, VT, Reduce);
}		}

SDValue DAGTypeLegalizer::PromoteIntOp_SET_ROUNDING(SDNode *N) {		SDValue DAGTypeLegalizer::PromoteIntOp_SET_ROUNDING(SDNode *N) {
SDValue Op = ZExtPromotedInteger(N->getOperand(1));		SDValue Op = ZExtPromotedInteger(N->getOperand(1));
return SDValue(DAG.UpdateNodeOperands(N, N->getOperand(0), Op), 0);		return SDValue(DAG.UpdateNodeOperands(N, N->getOperand(0), Op), 0);
}		}

		SDValue DAGTypeLegalizer::PromoteIntOp_STACKMAP(SDNode *N, unsigned OpNo) {
		assert(OpNo > 1); // Because the first two arguments are guaranteed legal.
		SmallVector<SDValue> NewOps(N->ops().begin(), N->ops().end());
		SDValue Operand = N->getOperand(OpNo);
		arsenmUnsubmitted Not Done Reply Inline Actions I'm a bit surprised this can unconditionally zero extend. I'd expect ANY_EXTEND or to have to consider some ABI property arsenm: I'm a bit surprised this can unconditionally zero extend. I'd expect ANY_EXTEND or to have to…
		vext01AuthorUnsubmitted Done Reply Inline Actions Perhaps not some ABI property, as a call to `llvm.experimental.stackmap` isn't really a call, so there is no ABI to speak of. I'll look into `ANY_EXTEND`. I'm not familiar. vext01: Perhaps not some ABI property, as a call to `llvm.experimental.stackmap` isn't really a call…
		EVT NVT = TLI.getTypeToTransformTo(*DAG.getContext(), Operand.getValueType());
		NewOps[OpNo] = DAG.getNode(ISD::ANY_EXTEND, SDLoc(N), NVT, Operand);
		return SDValue(DAG.UpdateNodeOperands(N, NewOps), 0);
		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Integer Result Expansion		// Integer Result Expansion
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

/// ExpandIntegerResult - This method is called when the specified result of the		/// ExpandIntegerResult - This method is called when the specified result of the
/// specified node is found to need expansion. At this point, the node may also		/// specified node is found to need expansion. At this point, the node may also
/// have invalid operands or may have other results that need promotion, we just		/// have invalid operands or may have other results that need promotion, we just
/// know that (at least) one result needs expansion.		/// know that (at least) one result needs expansion.
▲ Show 20 Lines • Show All 2,333 Lines • ▼ Show 20 Lines	bool DAGTypeLegalizer::ExpandIntegerOperand(SDNode *N, unsigned OpNo) {
case ISD::SRA:		case ISD::SRA:
case ISD::SRL:		case ISD::SRL:
case ISD::ROTL:		case ISD::ROTL:
case ISD::ROTR: Res = ExpandIntOp_Shift(N); break;		case ISD::ROTR: Res = ExpandIntOp_Shift(N); break;
case ISD::RETURNADDR:		case ISD::RETURNADDR:
case ISD::FRAMEADDR: Res = ExpandIntOp_RETURNADDR(N); break;		case ISD::FRAMEADDR: Res = ExpandIntOp_RETURNADDR(N); break;

case ISD::ATOMIC_STORE: Res = ExpandIntOp_ATOMIC_STORE(N); break;		case ISD::ATOMIC_STORE: Res = ExpandIntOp_ATOMIC_STORE(N); break;
		case ISD::STACKMAP:
		Res = ExpandIntOp_STACKMAP(N, OpNo);
		break;
		dantrushinUnsubmitted Not Done Reply Inline Actions Nit: and here too dantrushin: Nit: and here too
}		}

// If the result is null, the sub-method took care of registering results etc.		// If the result is null, the sub-method took care of registering results etc.
		arsenmUnsubmitted Not Done Reply Inline Actions Don't see why you are introducing this goto arsenm: Don't see why you are introducing this goto
		vext01AuthorUnsubmitted Done Reply Inline Actions It's because it's the only path to the error state contained in the `default` branch of the switch. In an earlier conversation we said that we wanted to trigger the default "can't legalise this" error if we encounter a case we cannot handle. vext01: It's because it's the only path to the error state contained in the `default` branch of the…
		arsenmUnsubmitted Not Done Reply Inline Actions But there's no actual failure case here so I don't understand what this is doing. This is always producing something arsenm: But there's no actual failure case here so I don't understand what this is doing. This is…
if (!Res.getNode()) return false;		if (!Res.getNode()) return false;

// If the result is N, the sub-method updated N in place. Tell the legalizer		// If the result is N, the sub-method updated N in place. Tell the legalizer
// core about this.		// core about this.
if (Res.getNode() == N)		if (Res.getNode() == N)
return true;		return true;

assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&		assert(Res.getValueType() == N->getValueType(0) && N->getNumValues() == 1 &&
"Invalid operand expansion");		"Invalid operand expansion");

ReplaceValueWith(SDValue(N, 0), Res);		ReplaceValueWith(SDValue(N, 0), Res);
return false;		return false;
		dantrushinUnsubmitted Not Done Reply Inline Actions Nit: extra empty line dantrushin: Nit: extra empty line
}		}

/// IntegerExpandSetCCOperands - Expand the operands of a comparison. This code		/// IntegerExpandSetCCOperands - Expand the operands of a comparison. This code
/// is shared among BR_CC, SELECT_CC, and SETCC handlers.		/// is shared among BR_CC, SELECT_CC, and SETCC handlers.
void DAGTypeLegalizer::IntegerExpandSetCCOperands(SDValue &NewLHS,		void DAGTypeLegalizer::IntegerExpandSetCCOperands(SDValue &NewLHS,
SDValue &NewRHS,		SDValue &NewRHS,
ISD::CondCode &CCCode,		ISD::CondCode &CCCode,
const SDLoc &dl) {		const SDLoc &dl) {
▲ Show 20 Lines • Show All 797 Lines • ▼ Show 20 Lines	for (unsigned i=0; i<NumElem; ++i) {
DAG.getVectorIdxConstant(i, dl));		DAG.getVectorIdxConstant(i, dl));
SDValue Tr = DAG.getNode(ISD::TRUNCATE, dl, RetSclrTy, Ex);		SDValue Tr = DAG.getNode(ISD::TRUNCATE, dl, RetSclrTy, Ex);
NewOps.push_back(Tr);		NewOps.push_back(Tr);
}		}
}		}

return DAG.getBuildVector(N->getValueType(0), dl, NewOps);		return DAG.getBuildVector(N->getValueType(0), dl, NewOps);
}		}

		SDValue DAGTypeLegalizer::ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo) {
		arsenmUnsubmitted Not Done Reply Inline Actions Don't see why you're using Optional. Just use SDValue() like everything else? arsenm: Don't see why you're using Optional. Just use SDValue() like everything else?
		vext01AuthorUnsubmitted Done Reply Inline Actions If we return `SDValue()` then we are signalling that the handler has already replaced the node with a new node with the operand in question legalised. If we do that, then the legaliser will get stuck in an infinite loop trying to legalise the operand that we cannot handle, since we never actually replace the operand. I introduced the optional so that we could signal the error state and break out of the legaliser loop (i.e. the goto you questioned). vext01: If we return `SDValue()` then we are signalling that the handler has already replaced the node…
		assert(OpNo > 1);

		SDValue Op = N->getOperand(OpNo);
		SDLoc DL = SDLoc(N);
		SmallVector<SDValue> NewOps;

		// Copy operands before the one being expanded.
		for (unsigned I = 0; I < OpNo; I++)
		NewOps.push_back(N->getOperand(I));

		if (Op->getOpcode() == ISD::Constant) {
		ConstantSDNode *CN = cast<ConstantSDNode>(Op);
		EVT Ty = Op.getValueType();
		if (CN->getConstantIntValue()->getValue().getActiveBits() < 64) {
		NewOps.push_back(
		DAG.getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
		NewOps.push_back(DAG.getTargetConstant(CN->getZExtValue(), DL, Ty));
		} else {
		arsenmUnsubmitted Not Done Reply Inline Actions I'd rather not add a special case to make this work only for constants arsenm: I'd rather not add a special case to make this work only for constants
		// FIXME: https://github.com/llvm/llvm-project/issues/55609
		return SDValue();
		}
		arsenmUnsubmitted Not Done Reply Inline Actions Just leave the node as-is? It will fail to legalize later arsenm: Just leave the node as-is? It will fail to legalize later
		vext01AuthorUnsubmitted Done Reply Inline Actions Sorry, there was some confusion at my end. Leaving the node as-is (i.e. `return SDValue(N, 0);`) is the case that will cause an infinite loop in the legaliser. Returning a "null" value (i.e. `return SDValue()`) is fine though. If I put a print at the top of this function: errs() << __func__ << ": " << OpNo << ": "; N->dump(); And update the `return SDValue()` to `return SDValue(N, 0);`, and then use an input that covers one of those cases, e.g.: declare void @llvm.experimental.stackmap(i64, i32, ...) define void @f() { call void (i64, i32, ...) @llvm.experimental.stackmap(i64 0, i32 0, i128 9223372036854775808) ret void } Then we see the function called indefinitely, trying to legalise the same operand forever: ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ExpandIntOp_STACKMAP: 4: t8: ch,glue = stackmap t3, t3:1, TargetConstant:i64<0>, TargetConstant:i32<0>, Constant:i128<9223372036854775808> ... This is due to the special casing in `DAGTypeLegalizer::ExpandIntegerOperand()`: Res = ExpandIntOp_STACKMAP(N, OpNo); ... // If the result is null, the sub-method took care of registering results etc. if (!Res.getNode()) return false; // If the result is N, the sub-method updated N in place. Tell the legalizer // core about this. if (Res.getNode() == N) return true; Returning `SDValue()` hits the `return false`, whereas `return SDValue(N, 0)` hits `return true`. The return value is whether or not the operand needs another round of legalisation. At the call site of `ExpandIntegerOperand()`, there is: NeedsReanalyzing = ExpandIntegerOperand(N, i); So we need to return `false` to: not loop forever bail out with the right error. So I believe `return SDValue()` to be correct. The latest diff reflects this. vext01: Sorry, there was some confusion at my end. Leaving the node as-is (i.e. `return SDValue(N, 0)…
		} else {
		// FIXME: Non-constant operands are not yet handled:
		// - https://github.com/llvm/llvm-project/issues/26431
		arsenmUnsubmitted Not Done Reply Inline Actions My skimming of the issue says this is an issue with the lowering from IR, so why do you need to error here? In any case I think emitError should be reserved for cases where reasonable user code ran into a truly unhandleable case. It would be better to just let it fall through and hit the ordinary legalize fatal error arsenm: My skimming of the issue says this is an issue with the lowering from IR, so why do you need to…
		// - https://github.com/llvm/llvm-project/issues/55957
		return SDValue();
		}

		// Copy remaining operands.
		for (unsigned I = OpNo + 1; I < N->getNumOperands(); I++)
		NewOps.push_back(N->getOperand(I));

		SDValue NewNode = DAG.getNode(N->getOpcode(), DL, N->getVTList(), NewOps);

		for (unsigned ResNum = 0; ResNum < N->getNumValues(); ResNum++)
		ReplaceValueWith(SDValue(N, ResNum), NewNode.getValue(ResNum));

		return SDValue(); // Signal that we have replaced the node already.
		}

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

Show First 20 Lines • Show All 396 Lines • ▼ Show 20 Lines	private:
SDValue PromoteIntOp_ADDSUBCARRY(SDNode *N, unsigned OpNo);		SDValue PromoteIntOp_ADDSUBCARRY(SDNode *N, unsigned OpNo);
SDValue PromoteIntOp_FRAMERETURNADDR(SDNode *N);		SDValue PromoteIntOp_FRAMERETURNADDR(SDNode *N);
SDValue PromoteIntOp_PREFETCH(SDNode *N, unsigned OpNo);		SDValue PromoteIntOp_PREFETCH(SDNode *N, unsigned OpNo);
SDValue PromoteIntOp_FIX(SDNode *N);		SDValue PromoteIntOp_FIX(SDNode *N);
SDValue PromoteIntOp_FPOWI(SDNode *N);		SDValue PromoteIntOp_FPOWI(SDNode *N);
SDValue PromoteIntOp_VECREDUCE(SDNode *N);		SDValue PromoteIntOp_VECREDUCE(SDNode *N);
SDValue PromoteIntOp_VP_REDUCE(SDNode *N, unsigned OpNo);		SDValue PromoteIntOp_VP_REDUCE(SDNode *N, unsigned OpNo);
SDValue PromoteIntOp_SET_ROUNDING(SDNode *N);		SDValue PromoteIntOp_SET_ROUNDING(SDNode *N);
		SDValue PromoteIntOp_STACKMAP(SDNode *N, unsigned OpNo);

void PromoteSetCCOperands(SDValue &LHS,SDValue &RHS, ISD::CondCode Code);		void PromoteSetCCOperands(SDValue &LHS,SDValue &RHS, ISD::CondCode Code);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Integer Expansion Support: LegalizeIntegerTypes.cpp		// Integer Expansion Support: LegalizeIntegerTypes.cpp
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Given a processed operand Op which was expanded into two integers of half		/// Given a processed operand Op which was expanded into two integers of half
▲ Show 20 Lines • Show All 75 Lines • ▼ Show 20 Lines	private:
SDValue ExpandIntOp_Shift(SDNode *N);		SDValue ExpandIntOp_Shift(SDNode *N);
SDValue ExpandIntOp_SINT_TO_FP(SDNode *N);		SDValue ExpandIntOp_SINT_TO_FP(SDNode *N);
SDValue ExpandIntOp_STORE(StoreSDNode *N, unsigned OpNo);		SDValue ExpandIntOp_STORE(StoreSDNode *N, unsigned OpNo);
SDValue ExpandIntOp_TRUNCATE(SDNode *N);		SDValue ExpandIntOp_TRUNCATE(SDNode *N);
SDValue ExpandIntOp_UINT_TO_FP(SDNode *N);		SDValue ExpandIntOp_UINT_TO_FP(SDNode *N);
SDValue ExpandIntOp_RETURNADDR(SDNode *N);		SDValue ExpandIntOp_RETURNADDR(SDNode *N);
SDValue ExpandIntOp_ATOMIC_STORE(SDNode *N);		SDValue ExpandIntOp_ATOMIC_STORE(SDNode *N);
SDValue ExpandIntOp_SPLAT_VECTOR(SDNode *N);		SDValue ExpandIntOp_SPLAT_VECTOR(SDNode *N);
		SDValue ExpandIntOp_STACKMAP(SDNode *N, unsigned OpNo);

void IntegerExpandSetCCOperands(SDValue &NewLHS, SDValue &NewRHS,		void IntegerExpandSetCCOperands(SDValue &NewLHS, SDValue &NewRHS,
ISD::CondCode &CCCode, const SDLoc &dl);		ISD::CondCode &CCCode, const SDLoc &dl);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Float to Integer Conversion Support: LegalizeFloatTypes.cpp		// Float to Integer Conversion Support: LegalizeFloatTypes.cpp
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

▲ Show 20 Lines • Show All 232 Lines • ▼ Show 20 Lines	private:
SDValue SoftPromoteHalfOp_BITCAST(SDNode *N);		SDValue SoftPromoteHalfOp_BITCAST(SDNode *N);
SDValue SoftPromoteHalfOp_FCOPYSIGN(SDNode *N, unsigned OpNo);		SDValue SoftPromoteHalfOp_FCOPYSIGN(SDNode *N, unsigned OpNo);
SDValue SoftPromoteHalfOp_FP_EXTEND(SDNode *N);		SDValue SoftPromoteHalfOp_FP_EXTEND(SDNode *N);
SDValue SoftPromoteHalfOp_FP_TO_XINT(SDNode *N);		SDValue SoftPromoteHalfOp_FP_TO_XINT(SDNode *N);
SDValue SoftPromoteHalfOp_FP_TO_XINT_SAT(SDNode *N);		SDValue SoftPromoteHalfOp_FP_TO_XINT_SAT(SDNode *N);
SDValue SoftPromoteHalfOp_SETCC(SDNode *N);		SDValue SoftPromoteHalfOp_SETCC(SDNode *N);
SDValue SoftPromoteHalfOp_SELECT_CC(SDNode *N, unsigned OpNo);		SDValue SoftPromoteHalfOp_SELECT_CC(SDNode *N, unsigned OpNo);
SDValue SoftPromoteHalfOp_STORE(SDNode *N, unsigned OpNo);		SDValue SoftPromoteHalfOp_STORE(SDNode *N, unsigned OpNo);
		SDValue SoftPromoteHalfOp_STACKMAP(SDNode *N, unsigned OpNo);

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Scalarization Support: LegalizeVectorTypes.cpp		// Scalarization Support: LegalizeVectorTypes.cpp
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

/// Given a processed one-element vector Op which was scalarized to its		/// Given a processed one-element vector Op which was scalarized to its
/// element type, this returns the element. For example, if Op is a v1i32,		/// element type, this returns the element. For example, if Op is a v1i32,
/// Op = < i32 val >, this method returns val, an i32.		/// Op = < i32 val >, this method returns val, an i32.
▲ Show 20 Lines • Show All 357 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 9,386 Lines • ▼ Show 20 Lines	if (ConstantSDNode *C = dyn_cast<ConstantSDNode>(OpVal)) {
const TargetLowering &TLI = Builder.DAG.getTargetLoweringInfo();		const TargetLowering &TLI = Builder.DAG.getTargetLoweringInfo();
Ops.push_back(Builder.DAG.getTargetFrameIndex(		Ops.push_back(Builder.DAG.getTargetFrameIndex(
FI->getIndex(), TLI.getFrameIndexTy(Builder.DAG.getDataLayout())));		FI->getIndex(), TLI.getFrameIndexTy(Builder.DAG.getDataLayout())));
} else		} else
Ops.push_back(OpVal);		Ops.push_back(OpVal);
}		}
}		}

/// Lower llvm.experimental.stackmap directly to its target opcode.		/// Lower llvm.experimental.stackmap.
void SelectionDAGBuilder::visitStackmap(const CallInst &CI) {		void SelectionDAGBuilder::visitStackmap(const CallInst &CI) {
// void @llvm.experimental.stackmap(i32 <id>, i32 <numShadowBytes>,		// void @llvm.experimental.stackmap(i64 <id>, i32 <numShadowBytes>,
// [live variables...])		// [live variables...])

assert(CI.getType()->isVoidTy() && "Stackmap cannot return a value.");		assert(CI.getType()->isVoidTy() && "Stackmap cannot return a value.");

SDValue Chain, InFlag, Callee, NullPtr;		SDValue Chain, InFlag, Callee, NullPtr;
SmallVector<SDValue, 32> Ops;		SmallVector<SDValue, 32> Ops;

SDLoc DL = getCurSDLoc();		SDLoc DL = getCurSDLoc();
Callee = getValue(CI.getCalledOperand());		Callee = getValue(CI.getCalledOperand());
NullPtr = DAG.getIntPtrConstant(0, DL, true);		NullPtr = DAG.getIntPtrConstant(0, DL, true);

// The stackmap intrinsic only records the live variables (the arguments		// The stackmap intrinsic only records the live variables (the arguments
// passed to it) and emits NOPS (if requested). Unlike the patchpoint		// passed to it) and emits NOPS (if requested). Unlike the patchpoint
// intrinsic, this won't be lowered to a function call. This means we don't		// intrinsic, this won't be lowered to a function call. This means we don't
// have to worry about calling conventions and target specific lowering code.		// have to worry about calling conventions and target specific lowering code.
// Instead we perform the call lowering right here.		// Instead we perform the call lowering right here.
//		//
// chain, flag = CALLSEQ_START(chain, 0, 0)		// chain, flag = CALLSEQ_START(chain, 0, 0)
// chain, flag = STACKMAP(id, nbytes, ..., chain, flag)		// chain, flag = STACKMAP(id, nbytes, ..., chain, flag)
// chain, flag = CALLSEQ_END(chain, 0, 0, flag)		// chain, flag = CALLSEQ_END(chain, 0, 0, flag)
//		//
Chain = DAG.getCALLSEQ_START(getRoot(), 0, 0, DL);		Chain = DAG.getCALLSEQ_START(getRoot(), 0, 0, DL);
InFlag = Chain.getValue(1);		InFlag = Chain.getValue(1);

// Add the <id> and <numBytes> constants.		// Add the STACKMAP operands, starting with DAG house-keeping.
		dantrushinUnsubmitted Not Done Reply Inline Actions Use existing `Ops` instead of introducing new vector? dantrushin: Use existing `Ops` instead of introducing new vector?
SDValue IDVal = getValue(CI.getOperand(PatchPointOpers::IDPos));
Ops.push_back(DAG.getTargetConstant(
cast<ConstantSDNode>(IDVal)->getZExtValue(), DL, MVT::i64));
SDValue NBytesVal = getValue(CI.getOperand(PatchPointOpers::NBytesPos));
Ops.push_back(DAG.getTargetConstant(
cast<ConstantSDNode>(NBytesVal)->getZExtValue(), DL,
MVT::i32));

// Push live variables for the stack map.
addStackMapLiveVars(CI, 2, DL, Ops, *this);

// We are not pushing any register mask info here on the operands list,
// because the stackmap doesn't clobber anything.

// Push the chain and the glue flag.
Ops.push_back(Chain);		Ops.push_back(Chain);
Ops.push_back(InFlag);		Ops.push_back(InFlag);

		// Add the <id>, <numShadowBytes> operands.
		//
		// These do not require legalisation, and can be emitted directly to target
		// constant nodes.
		SDValue ID = getValue(CI.getArgOperand(0));
		assert(ID.getValueType() == MVT::i64);
		SDValue IDConst = DAG.getTargetConstant(
		cast<ConstantSDNode>(ID)->getZExtValue(), DL, ID.getValueType());
		Ops.push_back(IDConst);

		SDValue Shad = getValue(CI.getArgOperand(1));
		assert(Shad.getValueType() == MVT::i32);
		SDValue ShadConst = DAG.getTargetConstant(
		cast<ConstantSDNode>(Shad)->getZExtValue(), DL, Shad.getValueType());
		Ops.push_back(ShadConst);

		// Add the live variables.
		for (unsigned I = 2; I < CI.arg_size(); I++) {
		SDValue Op = getValue(CI.getArgOperand(I));

		// Things on the stack are pointer-typed, meaning that they are already
		// legal and can be emitted directly to target nodes.
		if (FrameIndexSDNode *FI = dyn_cast<FrameIndexSDNode>(Op)) {
		const TargetLowering &TLI = DAG.getTargetLoweringInfo();
		Ops.push_back(DAG.getTargetFrameIndex(
		FI->getIndex(), TLI.getFrameIndexTy(DAG.getDataLayout())));
		} else {
		// Otherwise emit a target independent node to be legalised.
		Ops.push_back(getValue(CI.getArgOperand(I)));
		}
		}

// Create the STACKMAP node.		// Create the STACKMAP node.
SDVTList NodeTys = DAG.getVTList(MVT::Other, MVT::Glue);		SDVTList NodeTys = DAG.getVTList(MVT::Other, MVT::Glue);
SDNode *SM = DAG.getMachineNode(TargetOpcode::STACKMAP, DL, NodeTys, Ops);		Chain = DAG.getNode(ISD::STACKMAP, DL, NodeTys, Ops);
Chain = SDValue(SM, 0);
InFlag = Chain.getValue(1);		InFlag = Chain.getValue(1);

Chain = DAG.getCALLSEQ_END(Chain, NullPtr, NullPtr, InFlag, DL);		Chain = DAG.getCALLSEQ_END(Chain, NullPtr, NullPtr, InFlag, DL);

// Stackmaps don't generate values, so nothing goes into the NodeMap.		// Stackmaps don't generate values, so nothing goes into the NodeMap.

// Set the root to the target-lowered call chain.		// Set the root to the target-lowered call chain.
DAG.setRoot(Chain);		DAG.setRoot(Chain);
▲ Show 20 Lines • Show All 1,973 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

Show First 20 Lines • Show All 480 Lines • ▼ Show 20 Lines	#endif
case ISD::VECREDUCE_OR: return "vecreduce_or";		case ISD::VECREDUCE_OR: return "vecreduce_or";
case ISD::VECREDUCE_XOR: return "vecreduce_xor";		case ISD::VECREDUCE_XOR: return "vecreduce_xor";
case ISD::VECREDUCE_SMAX: return "vecreduce_smax";		case ISD::VECREDUCE_SMAX: return "vecreduce_smax";
case ISD::VECREDUCE_SMIN: return "vecreduce_smin";		case ISD::VECREDUCE_SMIN: return "vecreduce_smin";
case ISD::VECREDUCE_UMAX: return "vecreduce_umax";		case ISD::VECREDUCE_UMAX: return "vecreduce_umax";
case ISD::VECREDUCE_UMIN: return "vecreduce_umin";		case ISD::VECREDUCE_UMIN: return "vecreduce_umin";
case ISD::VECREDUCE_FMAX: return "vecreduce_fmax";		case ISD::VECREDUCE_FMAX: return "vecreduce_fmax";
case ISD::VECREDUCE_FMIN: return "vecreduce_fmin";		case ISD::VECREDUCE_FMIN: return "vecreduce_fmin";
		case ISD::STACKMAP:
		return "stackmap";
		dantrushinUnsubmitted Not Done Reply Inline Actions Single line again dantrushin: Single line again

// Vector Predication		// Vector Predication
#define BEGIN_REGISTER_VP_SDNODE(SDID, LEGALARG, NAME, ...) \		#define BEGIN_REGISTER_VP_SDNODE(SDID, LEGALARG, NAME, ...) \
case ISD::SDID: \		case ISD::SDID: \
return #NAME;		return #NAME;
		arsenmUnsubmitted Not Done Reply Inline Actions I would move this above the included file (preferably in the same position as the code is defined in the enum) arsenm: I would move this above the included file (preferably in the same position as the code is…
#include "llvm/IR/VPIntrinsics.def"		#include "llvm/IR/VPIntrinsics.def"
}		}
}		}

const char *SDNode::getIndexedModeName(ISD::MemIndexedMode AM) {		const char *SDNode::getIndexedModeName(ISD::MemIndexedMode AM) {
switch (AM) {		switch (AM) {
default: return "";		default: return "";
case ISD::PRE_INC: return "<pre-inc>";		case ISD::PRE_INC: return "<pre-inc>";
▲ Show 20 Lines • Show All 565 Lines • Show Last 20 Lines

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

Show First 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
#include "llvm/CodeGen/MachineMemOperand.h"		#include "llvm/CodeGen/MachineMemOperand.h"
#include "llvm/CodeGen/MachineModuleInfo.h"		#include "llvm/CodeGen/MachineModuleInfo.h"
#include "llvm/CodeGen/MachineOperand.h"		#include "llvm/CodeGen/MachineOperand.h"
#include "llvm/CodeGen/MachinePassRegistry.h"		#include "llvm/CodeGen/MachinePassRegistry.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/CodeGen/SchedulerRegistry.h"		#include "llvm/CodeGen/SchedulerRegistry.h"
#include "llvm/CodeGen/SelectionDAG.h"		#include "llvm/CodeGen/SelectionDAG.h"
#include "llvm/CodeGen/SelectionDAGNodes.h"		#include "llvm/CodeGen/SelectionDAGNodes.h"
		#include "llvm/CodeGen/StackMaps.h"
#include "llvm/CodeGen/StackProtector.h"		#include "llvm/CodeGen/StackProtector.h"
#include "llvm/CodeGen/SwiftErrorValueTracking.h"		#include "llvm/CodeGen/SwiftErrorValueTracking.h"
#include "llvm/CodeGen/TargetInstrInfo.h"		#include "llvm/CodeGen/TargetInstrInfo.h"
#include "llvm/CodeGen/TargetLowering.h"		#include "llvm/CodeGen/TargetLowering.h"
#include "llvm/CodeGen/TargetRegisterInfo.h"		#include "llvm/CodeGen/TargetRegisterInfo.h"
#include "llvm/CodeGen/TargetSubtargetInfo.h"		#include "llvm/CodeGen/TargetSubtargetInfo.h"
#include "llvm/CodeGen/ValueTypes.h"		#include "llvm/CodeGen/ValueTypes.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
▲ Show 20 Lines • Show All 2,174 Lines • ▼ Show 20 Lines	CurDAG->SelectNodeTo(N, TargetOpcode::COPY, N->getValueType(0),
N->getOperand(0));		N->getOperand(0));
}		}

void SelectionDAGISel::Select_ARITH_FENCE(SDNode *N) {		void SelectionDAGISel::Select_ARITH_FENCE(SDNode *N) {
CurDAG->SelectNodeTo(N, TargetOpcode::ARITH_FENCE, N->getValueType(0),		CurDAG->SelectNodeTo(N, TargetOpcode::ARITH_FENCE, N->getValueType(0),
N->getOperand(0));		N->getOperand(0));
}		}

		void SelectionDAGISel::Select_STACKMAP(SDNode *N) {
		std::vector<SDValue> Ops;
		auto *It = N->op_begin();
		SDLoc DL(N);

		// Stash the chain and glue operands so we can move them to the end.
		SDValue Chain = *It++;
		SDValue InFlag = *It++;

		// <id> operand.
		SDValue ID = *It++;
		assert(ID.getValueType() == MVT::i64);
		Ops.push_back(ID);
		arsenmUnsubmitted Not Done Reply Inline Actions The opcode assert is implied by the cast. There's also probably no point to asserting it's i64 arsenm: The opcode assert is implied by the cast. There's also probably no point to asserting it's i64
		vext01AuthorUnsubmitted Done Reply Inline Actions If the assertion fails, do we not end up doing a bogus cast and invoking undefined behaviour which will probably lead to a crash much later? Wouldn't it be best to have the assertion crash early? vext01: If the assertion fails, do we not end up doing a bogus cast and invoking undefined behaviour…
		arsenmUnsubmitted Not Done Reply Inline Actions The point of cast<> is it does the type assertion for you arsenm: The point of cast<> is it does the type assertion for you
		vext01AuthorUnsubmitted Done Reply Inline Actions I had assumed that `cast<>`was an alias to `static_cast<>`, but now I see that LLVM defines it! I had no idea! (forgive me, I'm pretty much learning C++ on the fly here) So I agree that asserting the type of the `opCode` is redundant, but: There's also probably no point to asserting it's i64 How comes? Isn't it good to check the sanity of the DAG we fed in? If we remove the assertion I think the stackmap code would eventually crash somewhere a lot later in the pipeline, but wouldn't it be good to crash early? vext01: I had assumed that `cast<>`was an alias to `static_cast<>`, but now I see that LLVM defines it!

		// <numShadowBytes> operand.
		SDValue Shad = *It++;
		assert(Shad.getValueType() == MVT::i32);
		Ops.push_back(Shad);

		// Live variable operands.
		for (; It != N->op_end(); It++) {
		SDNode *OpNode = It->getNode();
		arsenmUnsubmitted Not Done Reply Inline Actions Ditto arsenm: Ditto
		SDValue O;

		// FrameIndex nodes should have been directly emitted to TargetFrameIndex
		// nodes at DAG-construction time.
		assert(OpNode->getOpcode() != ISD::FrameIndex);

		if (OpNode->getOpcode() == ISD::Constant) {
		Ops.push_back(
		CurDAG->getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64));
		O = CurDAG->getTargetConstant(
		cast<ConstantSDNode>(OpNode)->getZExtValue(), DL, It->getValueType());
		arsenmUnsubmitted Not Done Reply Inline Actions You're still fixing this into TargetConstant at selection time instead of upfront when lowering from the IR arsenm: You're still fixing this into TargetConstant at selection time instead of upfront when lowering…
		vext01AuthorUnsubmitted Done Reply Inline Actions This code is selecting the stackmap live variables. The (non-frameindex) live variables can't be emitted to target constants at DAG-build time or they won't get legalized and that's the problem that this change is trying to address. Correct me if I'm wrong though. vext01: This code is selecting the stackmap live variables. The (non-frameindex) live variables can't…
		arsenmUnsubmitted Not Done Reply Inline Actions This is a pretty weird behavior, but does the type actually matter? Could you just unconditionally use 64-bit constants? arsenm: This is a pretty weird behavior, but does the type actually matter? Could you just…
		vext01AuthorUnsubmitted Done Reply Inline Actions Sorry for the delay. I was away at the end of last week. I've just tried to implement your suggestion, it has two undesirable effects: It changes the way large constant structs which get split into e.g. `mergevalues(constant, constant)` are codegenned. It appears that the constituent parts of the struct get allocated to registers and not constants. For FastISel, some some small constants get reported as long constants in the stackmap record. For FastISel It causes large constants to be emitted in a different order than in `SelectionDAGISel`, which means that the test files which test both backends cannot succeed. This is too much scary breakage, so I suggest we emit constants as we did before. Here is the diff (to previous) that I was working on before I abandoned it: diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp index b9816bb34bc9..497c4c77880a 100644 --- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp +++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp @@ -9449,6 +9449,10 @@ void SelectionDAGBuilder::visitStackmap(const CallInst &CI) { const TargetLowering &TLI = DAG.getTargetLoweringInfo(); Ops.push_back(DAG.getTargetFrameIndex( FI->getIndex(), TLI.getFrameIndexTy(DAG.getDataLayout()))); + } else if (ConstantSDNode C = dyn_cast<ConstantSDNode>(Op)) { + // Directly emit a 64-bit target constant. + Ops.push_back(DAG.getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64)); + Ops.push_back(DAG.getTargetConstant(C->getZExtValue(), DL, MVT::i64)); } else { // Otherwise emit a target independent node to be legalised. Ops.push_back(getValue(CI.getArgOperand(I))); diff --git a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp index aff4e9a94fef..936f48f34870 100644 --- a/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp +++ b/llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp @@ -2269,15 +2269,7 @@ void SelectionDAGISel::Select_STACKMAP(SDNode N) { // FrameIndex nodes should have been directly emitted to TargetFrameIndex // nodes at DAG-construction time. assert(OpNode->getOpcode() != ISD::FrameIndex); - - if (OpNode->getOpcode() == ISD::Constant) { - Ops.push_back( - CurDAG->getTargetConstant(StackMaps::ConstantOp, DL, MVT::i64)); - O = CurDAG->getTargetConstant( - cast<ConstantSDNode>(OpNode)->getZExtValue(), DL, It->getValueType()); - } else { - O = It; - } + O = It; Ops.push_back(O); } diff --git a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll index bc624be5318e..a2d629fce00c 100644 --- a/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll +++ b/llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll @@ -81,10 +81,10 @@ ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .long 0 ; Location[7] -; CHECK-NEXT: .byte 4 +; CHECK-NEXT: .byte 1 ; CHECK-NEXT: .byte 0 -; CHECK-NEXT: .short 8 -; CHECK-NEXT: .short 0 +; CHECK-NEXT: .short 4 +; CHECK-NEXT: .short {{.}} ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .long 0 ; Location[8] @@ -95,10 +95,10 @@ ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .long 0 ; Location[9] -; CHECK-NEXT: .byte 4 +; CHECK-NEXT: .byte 1 ; CHECK-NEXT: .byte 0 -; CHECK-NEXT: .short 8 -; CHECK-NEXT: .short 0 +; CHECK-NEXT: .short 1 +; CHECK-NEXT: .short 2 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .long 0 ; Location[10] diff --git a/llvm/test/CodeGen/X86/stackmap-fast-isel.ll b/llvm/test/CodeGen/X86/stackmap-fast-isel.ll index dd25065f3063..558029cdf1c5 100644 --- a/llvm/test/CodeGen/X86/stackmap-fast-isel.ll +++ b/llvm/test/CodeGen/X86/stackmap-fast-isel.ll @@ -29,8 +29,8 @@ ; CHECK-NEXT: .quad 4 ; Large Constants -; CHECK-NEXT: .quad 2147483648 ; CHECK-NEXT: .quad 4294967295 +; CHECK-NEXT: .quad 2147483648 ; CHECK-NEXT: .quad 4294967296 ; Callsites @@ -46,14 +46,14 @@ ; CHECK-NEXT: .short 8 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .short 0 -; CHECK-NEXT: .long -1 +; CHECK-NEXT: .long 65535 ; SmallConstant ; CHECK-NEXT: .byte 4 ; CHECK-NEXT: .byte 0 ; CHECK-NEXT: .short 8 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .short 0 -; CHECK-NEXT: .long -1 +; CHECK-NEXT: .long 65535 ; SmallConstant ; CHECK-NEXT: .byte 4 ; CHECK-NEXT: .byte 0 @@ -76,19 +76,19 @@ ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .long 2147483647 ; SmallConstant -; CHECK-NEXT: .byte 4 +; CHECK-NEXT: .byte 5 ; CHECK-NEXT: .byte 0 ; CHECK-NEXT: .short 8 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .short 0 -; CHECK-NEXT: .long -1 +; CHECK-NEXT: .long 0 ; SmallConstant -; CHECK-NEXT: .byte 4 +; CHECK-NEXT: .byte 5 ; CHECK-NEXT: .byte 0 ; CHECK-NEXT: .short 8 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .short 0 -; CHECK-NEXT: .long -1 +; CHECK-NEXT: .long 0 ; SmallConstant ; CHECK-NEXT: .byte 4 ; CHECK-NEXT: .byte 0 @@ -102,14 +102,14 @@ ; CHECK-NEXT: .short 8 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .short 0 -; CHECK-NEXT: .long 0 +; CHECK-NEXT: .long 1 ; LargeConstant at index 1 ; CHECK-NEXT: .byte 5 ; CHECK-NEXT: .byte 0 ; CHECK-NEXT: .short 8 ; CHECK-NEXT: .short 0 ; CHECK-NEXT: .short 0 -; CHECK-NEXT: .long 1 +; CHECK-NEXT: .long 0 ; LargeConstant at index 2 ; CHECK-NEXT: .byte 5 ; CHECK-NEXT: .byte 0 vext01:* Sorry for the delay. I was away at the end of last week. I've just tried to implement your…
		} else {
		O = *It;
		}
		arsenmUnsubmitted Not Done Reply Inline Actions You seem to be fixing up some Constants to TargetConstant here. It would be better to just do this up front in SelectionDAGBuilder like it was before arsenm: You seem to be fixing up some Constants to TargetConstant here. It would be better to just do…
		arsenmUnsubmitted Not Done Reply Inline Actions These parameters probably should have been marked immarg in the intrinsic definition? arsenm: These parameters probably should have been marked immarg in the intrinsic definition?
		vext01AuthorUnsubmitted Done Reply Inline Actions You seem to be fixing up some Constants to TargetConstant here. It would be better to just do this up front in SelectionDAGBuilder like it was before Wasn't that the very reason that the constants were not being legalised? Because they were already target constants? These parameters probably should have been marked immarg in the intrinsic definition? Should that be handled as a separate change, since I'm not actually touching the intrinsic's definition in this work? I don't know how the workflow works after phabricator? Do I get an opportunity to make things into neat, self-contained commits? vext01: > You seem to be fixing up some Constants to TargetConstant here. It would be better to just do…
		arsenmUnsubmitted Not Done Reply Inline Actions I assume these are the fixed operands at the start of the argument list, and not the variadic section. I'm also assuming legalization is only relevant for the variadic arguments. Changing the intrinsic would be a separate change. Phabricator lets you add parent/child revisions to track related changes arsenm: I assume these are the fixed operands at the start of the argument list, and not the variadic…
		vext01AuthorUnsubmitted Done Reply Inline Actions I assume these are the fixed operands at the start of the argument list, and not the variadic section. I'm also assuming legalization is only relevant for the variadic arguments. Ah, do you mean we should emit the first two arguments directly to target nodes? Assuming so, that'd probably work and might be a little faster. By the way, did you have any comments on the XXXs below? Changing the intrinsic would be a separate change. Phabricator lets you add parent/child revisions to track related changes. OK. Let's do that as a separate change. vext01: > I assume these are the fixed operands at the start of the argument list, and not the variadic…
		arsenmUnsubmitted Not Done Reply Inline Actions Yes, if they needed to be target constants to begin with they should start as targetconstants arsenm: Yes, if they needed to be target constants to begin with they should start as targetconstants
		Ops.push_back(O);
		}

		Ops.push_back(Chain);
		Ops.push_back(InFlag);

		arsenmUnsubmitted Not Done Reply Inline Actions FrameIndex will also be materialized into a register, unlike TargetFrameIndex arsenm: FrameIndex will also be materialized into a register, unlike TargetFrameIndex
		vext01AuthorUnsubmitted Done Reply Inline Actions And is that correct? vext01: And is that correct?
		arsenmUnsubmitted Not Done Reply Inline Actions Well the current code is emitting TargetFrameIndexes into the argument list, so I assume this should also just go direct to TargetFrameIndex from the start arsenm: Well the current code is emitting TargetFrameIndexes into the argument list, so I assume this…
		SDVTList NodeTys = CurDAG->getVTList(MVT::Other, MVT::Glue);
		CurDAG->SelectNodeTo(N, TargetOpcode::STACKMAP, NodeTys, Ops);
		}

/// GetVBR - decode a vbr encoding whose top bit is set.		/// GetVBR - decode a vbr encoding whose top bit is set.
LLVM_ATTRIBUTE_ALWAYS_INLINE static uint64_t		LLVM_ATTRIBUTE_ALWAYS_INLINE static uint64_t
GetVBR(uint64_t Val, const unsigned char *MatcherTable, unsigned &Idx) {		GetVBR(uint64_t Val, const unsigned char *MatcherTable, unsigned &Idx) {
assert(Val >= 128 && "Not a VBR");		assert(Val >= 128 && "Not a VBR");
Val &= 127; // Remove first vbr bit.		Val &= 127; // Remove first vbr bit.

unsigned Shift = 7;		unsigned Shift = 7;
uint64_t NextBits;		uint64_t NextBits;
▲ Show 20 Lines • Show All 538 Lines • ▼ Show 20 Lines	case ISD::UNDEF:
Select_UNDEF(NodeToMatch);		Select_UNDEF(NodeToMatch);
return;		return;
case ISD::FREEZE:		case ISD::FREEZE:
Select_FREEZE(NodeToMatch);		Select_FREEZE(NodeToMatch);
return;		return;
case ISD::ARITH_FENCE:		case ISD::ARITH_FENCE:
Select_ARITH_FENCE(NodeToMatch);		Select_ARITH_FENCE(NodeToMatch);
return;		return;
		case ISD::STACKMAP:
		Select_STACKMAP(NodeToMatch);
		return;
}		}

assert(!NodeToMatch->isMachineOpcode() && "Node already selected!");		assert(!NodeToMatch->isMachineOpcode() && "Node already selected!");

// Set up the node stack with NodeToMatch as the only node on the stack.		// Set up the node stack with NodeToMatch as the only node on the stack.
SmallVector<SDValue, 8> NodeStack;		SmallVector<SDValue, 8> NodeStack;
SDValue N = SDValue(NodeToMatch, 0);		SDValue N = SDValue(NodeToMatch, 0);
NodeStack.push_back(N);		NodeStack.push_back(N);
▲ Show 20 Lines • Show All 905 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll

This file was added.

				; RUN: llc < %s -mtriple=x86_64-apple-darwin -mcpu=corei7-avx -enable-patchpoint-liveness=false \| FileCheck %s

				; CHECK-LABEL: .section __LLVM_STACKMAPS,__llvm_stackmaps
				arsenmUnsubmitted Not Done Reply Inline Actions If you're checking debug output you need REQUIRES: asserts. However, I don't think checking the legalizer output is the most helpful thing here. Better to check the final output arsenm: If you're checking debug output you need REQUIRES: asserts. However, I don't think checking the…
				vext01AuthorUnsubmitted Done Reply Inline Actions At what level should I check? I wanted to check at the MIR level, but it doesn't show the types there. vext01: At what level should I check? I wanted to check at the MIR level, but it doesn't show the…
				arsenmUnsubmitted Not Done Reply Inline Actions The types are meaningless after selection, so that makes sense. I would default to codegen to the end. MIR is less stable and what you care about is that the types were legalized to and selected to something, not the types themselves arsenm: The types are meaningless after selection, so that makes sense. I would default to codegen to…
				vext01AuthorUnsubmitted Done Reply Inline Actions I would default to codegen to the end Do you mean to match asm code for the target architecture? Isn't that also going to be fragile? vext01: > I would default to codegen to the end Do you mean to match asm code for the target…
				arsenmUnsubmitted Not Done Reply Inline Actions Yes. No, this is a target specific test anyway. arsenm: Yes. No, this is a target specific test anyway.
				vext01AuthorUnsubmitted Done Reply Inline Actions Stackmap doesn't emit any code that we could match, but we could match the raw bytes of the `.llvmbc` section, just it's very fiddly. If that's the only way, so be it. vext01: Stackmap doesn't emit any code that we could match, but we could match the raw bytes of the `.
				vext01AuthorUnsubmitted Done Reply Inline Actions sorry, not `.llvmbc`, I meant the stackmap section. vext01: sorry, not `.llvmbc`, I meant the stackmap section.
				; CHECK-NEXT: __LLVM_StackMaps:

				; Header
				; CHECK-NEXT: .byte 3
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 0

				arsenmUnsubmitted Not Done Reply Inline Actions Probably should add some cases with excessively wide types, vectors and FP arsenm: Probably should add some cases with excessively wide types, vectors and FP
				vext01AuthorUnsubmitted Done Reply Inline Actions Yep. I was only working on integer types for now, but I can add changes to ensure those work too. vext01: Yep. I was only working on integer types for now, but I can add changes to ensure those work…
				; NumFunctions
				; CHECK-NEXT: .long 1
				; NumConstants
				; CHECK-NEXT: .long 0
				; NumRecords
				; CHECK-NEXT: .long 1

				; StackSizeRecord[NumFunctions]
				; StackSizeRecord[0]
				; CHECK-NEXT: .quad _main
				; CHECK-NEXT: .quad 24
				; CHECK-NEXT: .quad 1

				; Constants[NumConstants] (empty)

				; StkMapRecord[NumRecords]
				; StkMapRecord[0]
				; CHECK-NEXT: .quad 0
				; CHECK-NEXT: .long {{.*}}
				; CHECK-NEXT: .short {{.*}}
				; NumLocations
				; CHECK-NEXT: .short 11
				; Location[NumLocations]
				; Location[0]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 1
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[1]
				; CHECK-NEXT: .byte 4
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 8
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 22
				; Location[2]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 1
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[3]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 16
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[4]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 16
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[5]
				; CHECK-NEXT: .byte 4
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 8
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 66
				; Location[6]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 4
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[7]
				; CHECK-NEXT: .byte 4
				arsenmUnsubmitted Not Done Reply Inline Actions For a general test, a non-register argument may be more helpful arsenm: For a general test, a non-register argument may be more helpful
				vext01AuthorUnsubmitted Done Reply Inline Actions I've added the missing non-constant i7, and I would have done the same for `i128`, but that's already broken in llvm before my change and my change doesn't fix it either. That can be another diff to fix that. Also added a 32-bit address space pointer into the test. vext01: I've added the missing non-constant i7, and I would have done the same for `i128`, but that's…
				; CHECK-NEXT: .byte 0
				arsenmUnsubmitted Not Done Reply Inline Actions Is it worth testing vectors? arsenm: Is it worth testing vectors?
				vext01AuthorUnsubmitted Done Reply Inline Actions We spoke on IRC, but for the others, vector support for stackmaps is currently broken, so I've made no attempt to address them for now. https://github.com/llvm/llvm-project/issues/55613 vext01: We spoke on IRC, but for the others, vector support for stackmaps is currently broken, so I've…
				; CHECK-NEXT: .short 8
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[8]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 4
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[9]
				; CHECK-NEXT: .byte 4
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 8
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0
				; Location[10]
				; CHECK-NEXT: .byte 1
				; CHECK-NEXT: .byte 0
				; CHECK-NEXT: .short 1
				; CHECK-NEXT: .short {{.*}}
				; CHECK-NEXT: .short 0
				; CHECK-NEXT: .long 0

				@p32 = external global i8 addrspace(270)*

				%struct1 = type {i32, i64}
				%struct2 = type {i1, i1, i1}

				declare void @llvm.experimental.stackmap(i64, i32, ...)

				define dso_local i32 @main(i32 %argc, i8** %argv) {
				entry:
				%i1reg = icmp eq i32 %argc, 5
				%i7reg = zext i1 %i1reg to i7
				%halfreg = sitofp i32 %argc to half
				%ptr32 = load i8 addrspace(270), i8 addrspace(270)* @p32
				%structreg1 = insertvalue %struct1 zeroinitializer, i32 %argc, 0
				%structreg2 = insertvalue %struct2 zeroinitializer, i1 %i1reg, 0
				call void (i64, i32, ...) @llvm.experimental.stackmap(
				i64 0,
				i32 0,
				i1 %i1reg,
				i7 22,
				i7 %i7reg,
				half 1.0,
				half %halfreg,
				i128 66,
				; FIXME: fix and test vectors. At the moment even legally sized vectors
				; are broken:
				; https://github.com/llvm/llvm-project/issues/55613
				;
				; FIXME: test non-constant i128 once these are fixed:
				; - https://github.com/llvm/llvm-project/issues/26431
				; - https://github.com/llvm/llvm-project/issues/55957
				i8 addrspace(270)* %ptr32,
				; FIXME: The stackmap record generated for structs is incorrect:
				; - https://github.com/llvm/llvm-project/issues/55649
				; - https://github.com/llvm/llvm-project/issues/55957
				%struct1 zeroinitializer,
				%struct1 %structreg1,
				%struct2 zeroinitializer,
				%struct2 %structreg2)
				ret i32 0
				}

This is an archive of the discontinued LLVM Phabricator instance.

Correctly legalise stackmap operandsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 442540

llvm/include/llvm/CodeGen/ISDOpcodes.h

llvm/include/llvm/CodeGen/SelectionDAGISel.h

llvm/lib/CodeGen/SelectionDAG/LegalizeFloatTypes.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeIntegerTypes.cpp

llvm/lib/CodeGen/SelectionDAG/LegalizeTypes.h

llvm/lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGDumper.cpp

llvm/lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp

llvm/test/CodeGen/X86/selectiondag-stackmap-legalize.ll

Correctly legalise stackmap operands
ClosedPublic