This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/
-
llvm/
-
CodeGen/GlobalISel/
-
GlobalISel/
-
ConstantFoldingMIRBuilder.h
-
LegalizationArtifactCombiner.h
-
LegalizerInfo.h
-
Utils.h
-
MC/
-
MCInstrDesc.h
-
Support/
-
TargetOpcodes.def
-
Target/
15/16
GenericOpcodes.td
5/6
Target.td
-
lib/
-
CodeGen/
-
GlobalISel/
-
CSEMIRBuilder.cpp
25/32
LegalizerHelper.cpp
-
LegalizerInfo.cpp
-
Utils.cpp
2/2
MachineVerifier.cpp
-
Target/
-
AArch64/
-
AArch64LegalizerInfo.cpp
-
AMDGPU/
-
AMDGPULegalizerInfo.cpp
-
ARM/
5/5
ARMLegalizerInfo.cpp
-
Mips/
-
MipsLegalizerInfo.cpp
-
X86/
-
X86LegalizerInfo.cpp
-
test/
-
CodeGen/
-
AArch64/GlobalISel/
-
GlobalISel/
-
irtranslator-extends.ll
-
legalize-div.mir
6/6
legalize-ext.mir
-
legalize-gep.mir
-
legalize-itofp.mir
-
legalize-rem.mir
3/3
legalize-sext.mir
-
legalize-shift.mir
3/3
legalize-undef.mir
-
legalizer-info-validation.mir
-
AMDGPU/GlobalISel/
-
GlobalISel/
1/1
artifact-combiner-sext.mir
-
combine-ext-legalizer.mir
-
legalize-ashr.mir
-
legalize-extract-vector-elt.mir
-
legalize-sext.mir
-
legalize-sextload-flat.mir
-
ARM/GlobalISel/
-
GlobalISel/
-
arm-legalize-divmod.mir
-
arm-legalize-exts.mir
-
Mips/GlobalISel/legalizer/
-
GlobalISel/
-
legalizer/
-
add.mir
-
constants.mir
-
mul.mir
-
rem_and_div.mir
-
sub.mir
-
X86/GlobalISel/
-
GlobalISel/
-
legalize-ext-x86-64.mir
-
x86_64-legalize-sitofp.mir
-
MachineVerifier/
-
test_g_sext_inreg.mir
-
unittests/CodeGen/GlobalISel/
-
CodeGen/
-
GlobalISel/
-
LegalizerHelperTest.cpp
2/2
PatternMatchTest.cpp

Differential D61289

[globalisel] Add G_SEXT_INREG
ClosedPublic

Authored by dsanders on Apr 29 2019, 5:27 PM.

Download Raw Diff

Details

Reviewers

bogner
aditya_nandakumar
volkan
aemerson
paquette
arsenm

Commits

rGe9a57c2b23c2: [globalisel] Add G_SEXT_INREG
rL368487: [globalisel] Add G_SEXT_INREG

Summary

Targets often have instructions that can sign-extend certain cases faster
than the equivalent shift-left/arithmetic-shift-right. Such cases can be
identified by matching a shift-left/shift-right pair but there are some
issues with this in the context of combines. For example, suppose you can
sign-extend 8-bit up to 32-bit with a target extend instruction.

%1:_(s32) = G_SHL %0:_(s32), i32 24 # (I've inlined the G_CONSTANT for brevity)
%2:_(s32) = G_ASHR %1:_(s32), i32 24
%3:_(s32) = G_ASHR %2:_(s32), i32 1

would reasonably combine to:

%1:_(s32) = G_SHL %0:_(s32), i32 24
%2:_(s32) = G_ASHR %1:_(s32), i32 25

which no longer matches the special case. If your shifts and extend are
equal cost, this would break even as a pair of shifts but if your shift is
more expensive than the extend then it's cheaper as:

%2:_(s32) = G_SEXT_INREG %0:_(s32), i32 8
%3:_(s32) = G_ASHR %2:_(s32), i32 1

It's possible to match the shift-pair in ISel and emit an extend and ashr.
However, this is far from the only way to break this shift pair and make
it hard to match the extends. Another example is that with the right
known-zeros, this:

%1:_(s32) = G_SHL %0:_(s32), i32 24
%2:_(s32) = G_ASHR %1:_(s32), i32 24
%3:_(s32) = G_MUL %2:_(s32), i32 2

can become:

%1:_(s32) = G_SHL %0:_(s32), i32 24
%2:_(s32) = G_ASHR %1:_(s32), i32 23

All upstream targets have been configured to lower it to the current
G_SHL,G_ASHR pair but will likely want to make it legal in some cases to
handle their faster cases.

To follow-up: Provide a way to legalize based on the constant. At the
moment, I'm thinking that the best way to achieve this is to provide the
MI in LegalityQuery but that opens the door to breaking core principles
of the legalizer (legality is not context sensitive). That said, it's
worth noting that looking at other instructions and acting on that
information doesn't violate this principle in itself. It's only a
violation if, at the end of legalization, a pass that checks legality
without being able to see the context would say an instruction might not be
legal. That's a fairly subtle distinction so to give a concrete example,
saying %2 in:

%1 = G_CONSTANT 16
%2 = G_SEXT_INREG %0, %1

is legal is in violation of that principle if the legality of %2 depends
on %1 being constant and/or being 16. However, legalizing to either:

%2 = G_SEXT_INREG %0, 16

or:

%1 = G_CONSTANT 16
%2:_(s32) = G_SHL %0, %1
%3:_(s32) = G_ASHR %2, %1

depending on whether %1 is constant and 16 does not violate that principle
since both outputs are genuinely legal.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 34310
Build 34309: arc lint + arc unit

Event Timeline

dsanders created this revision.Apr 29 2019, 5:27 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 29 2019, 5:27 PM

Herald added subscribers: Petar.Avramovic, atanasyan, jrtc27 and 8 others. · View Herald Transcript

Harbormaster completed remote builds in B31125: Diff 197230.Apr 29 2019, 5:31 PM

dsanders marked an inline comment as done.Apr 29 2019, 5:31 PM

dsanders added inline comments.

llvm/test/CodeGen/AMDGPU/GlobalISel/artifact-combiner-sext.mir
64–68	@arsenm: This test currently doesn't complete legalization because the pre-existing G_TRUNC causes the legalizer to stop processing since it's illegal and it's unable to legalize it. This blocks the lowering of the G_SEXT_INREG which appears later in the work list.

dsanders added a child revision: D61290: globalisel][aarch64] Make G_SEXT_INREG legal from all bit sizes to s32/s64.Apr 29 2019, 6:38 PM

Is the intention to now use this instead of the current system of relying on the cleanup of legalization artifacts?

llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
119 ↗	(On Diff #197230)	I think the SrcOp changes should be split to a separate patch
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
715–718	This wrapping is really weird looking. There should probably be a dedicated buildTrunc which would help some
1704–1705	This should be illegal and caught by the verifier? This shouldn't need to check for illegal cases
llvm/lib/CodeGen/MachineVerifier.cpp
1328	Should check for matching vector elements

arsenm added inline comments.Apr 30 2019, 12:36 AM

llvm/test/CodeGen/AArch64/GlobalISel/verify-g_sext_inreg.mir
1–39 ↗	(On Diff #197230)	This test file should go in test/MachineVerifier

rovka added inline comments.Apr 30 2019, 5:01 AM

llvm/include/llvm/Target/GenericOpcodes.td
40	This comment really needs to do a better job explaining the difference between G_SEXT and G_SEXT_INREG. It only covers the mechanical differences (i.e. that you have an immediate operand), but it says nothing about why this different opcode exists or where it would come from. Is the fact that the IRTranslator never creates such instructions relevant? Should we mention that it is only a legalization artifact? Targets can already say that G_SEXT for certain bitwidths is legal, why don't we just allow them to say which bitwidths should be lowered (instead of adding a new opcode)?
43	Nitpick: since you're adding support for imm everywhere else, it would be nice if we could say "imm:$sz" instead of "unknown:$sz" here.
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
706	Typo: is has.
730	Missing period.
llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir
64	Is the order really irrelevant for all of these? If so, maybe commit just the change from CHECK to CHECK-DAG separately. Personally, I wouldn't mind keeping the CHECK lines so we can see what actually changed with this patch. Ditto for the other tests.

dsanders mentioned this in D61321: [globalisel] Allow SrcOp to convert an APInt and render it as an immediate operand (MO.isImm() == true).Apr 30 2019, 9:43 AM

dsanders added a parent revision: D61321: [globalisel] Allow SrcOp to convert an APInt and render it as an immediate operand (MO.isImm() == true).Apr 30 2019, 9:43 AM

In D61289#1483719, @arsenm wrote:

Is the intention to now use this instead of the current system of relying on the cleanup of legalization artifacts?

This changes the output of the legalization artifact pass but doesn't change the system itself. If a target can make use of the G_SEXT_INREG then it can mark it legal but if it can't then it can lower it and end up in the same place as before (except that the G_CONSTANT is positioned slightly differently).

llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
119 ↗	(On Diff #197230)	Done, it's D61321
llvm/include/llvm/Target/GenericOpcodes.td
40	This comment really needs to do a better job explaining the difference between G_SEXT and G_SEXT_INREG. It only covers the mechanical differences (i.e. that you have an immediate operand), but it says nothing about why this different opcode exists or where it would come from. Ok I can add to that Is the fact that the IRTranslator never creates such instructions relevant? No, who creates it is irrelevant to the operation of the instruction. There's no guarantee that the legalizer won't receive them as input. In the case of the IRTranslator, the IRTranslator could create it if it wanted but it's a simple 1:1 converter (for the most part) and chooses not to at the moment as there's no LLVM-IR equivalent. Target-specific passes are also free to create them. Should we mention that it is only a legalization artifact? It's (currently only) created by code that deals with legalization artifacts but it's not a legalization artifact itself. Targets can already say that G_SEXT for certain bitwidths is legal, why don't we just allow them to say which bitwidths should be lowered (instead of adding a new opcode)? It becomes important when you start optimizing with GlobalISel. Suppose that ARM's SXTB instruction has a latency of 1 and and LSL/ASR have a latency of 2 and that this includes forwarding paths in the hardware (if any). Having the signextend as a single atom in the MIR becomes useful for emitting the most efficient code since given code like: int foo(char a) { return (int)a << 2; } it's cheaper to emit: sxtb r0, r1 lsl r0, r0, #2 // 3 cycles than: lsl r0, r1, #16 asr r0, r0, #16 lsl r0, r0, #2 // 6 cycles even if you can exploit known-bits to emit: lsl r0, r1, #16 asr r0, r0, #14 // 4 cycles it would still be better to use the sxtb. The latter example also illustrates that optimization can make it hard to recognise sign-extension. It gets harder if you also reduce the strength of instructions (maybe lsl r0, r0, #1 is faster as add r0, r0, r0) and there's plenty of ways to make things even more difficult. Essentially, the more mangling the optimizer does while ignorant of the desirable code, the harder it is to select the optimal code at the end.
43	I agree. I've added an untyped_imm which for all functional purposes w.r.t the type constraint system is the same as unknown but at least documents that we expect an immediate.
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
1704–1705	I've turned it into an assert rather than remove it since it's easier to debug if the debugger stops where it happens and this is the first use of a real immediate in GlobalISel so the chances of misuse are higher than normal.
llvm/lib/CodeGen/MachineVerifier.cpp
1325	I just spotted this one too. This should be getScalarSizeInBits()
llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir
64	The legalizer doesn't provide any guarantees on the order beyond that defs will precede uses. By changing to CHECK-DAG we make the test robust against future changes to the legalizer too. For this patch, the only thing that changed in many cases was the placement of the G_CONSTANT used in the sign-extending shifts

Expanded on the use of G_SEXT_INREG
Typos, nits, and other minor changes

Harbormaster completed remote builds in B31197: Diff 197439.Apr 30 2019, 2:16 PM

rovka added inline comments.May 2 2019, 5:16 AM

llvm/include/llvm/Target/GenericOpcodes.td
40	Sorry, but I still don't get it. I understand why you're trying to avoid the shifts, what I don't understand is why adding this new node is the best solution. For one thing, the name is not very descriptive. I guess you just copied it from SelectionDAG, where you can actually constrain the source to match the destination. We don't do that here, so it's just confusing (I mean it sounds as if a legal G_SEXT would be going through memory or something). Secondly, it looks like what we need is just a way to tell the artifact combiner "don't turn sext into shifts on this target, for these sizes". Why don't we just use G_SEXT's legality for that? I.e. actually use the regular legality actions on G_SEXT directly instead of G_SEXT_INREG, and tell the combiner to not mess with G_SEXT with legal sizes. With G_SEXT_INREG as proposed in this patch, it looks like you're just moving the type legality problem into a value-of-immediate legality problem for which we need new infrastructure. I'm probably missing something, so please bear with me :)
llvm/lib/Target/ARM/ARMLegalizerInfo.cpp
87	Testcase?
llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir
64	I looked in more detail and I agree that the order isn't that important. I still think this is an independent change that you can commit before this patch. Keeping it here makes it a bit difficult to spot the tests that are actually relevant for G_SEXT_INREG.

dsanders marked 3 inline comments as done.May 2 2019, 2:38 PM

dsanders added inline comments.

llvm/include/llvm/Target/GenericOpcodes.td
40	Sorry, but I still don't get it. I understand why you're trying to avoid the shifts, what I don't understand is why adding this new node is the best solution. For one thing, the name is not very descriptive. I guess you just copied it from SelectionDAG, where you can actually constrain the source to match the destination. We don't do that here, so it's just confusing (I mean it sounds as if a legal G_SEXT would be going through memory or something). We actually do constrain the source and destination types. The constraint is specified here via type0:$dst and type0:$src where the use of the same type-index specifies a type matching constraint. It's tested on line 36 of llvm/test/MachineVerifier/test_g_sext_inreg.mir which is emitted when the types are not equal. We don't really need the message on line 38 as it's triggered by the subset of mismatches where they aren't even the same kind of type but it's somewhat useful to report how the types are different as well as that they're different. For the naming part of this, I couldn't think of a better name and sticking to SelectionDAG's name had some slight benefits in the sense that someone who knows the distinction in SelectionDAG would also know the distinction here as it's the same. The difference is that G_SEXT makes the container bigger and the newly-created bits are copies of the previous sign bit. With G_SEXT_INREG, the container remains the same size and a specified bit (Size-1) is replicated to all the bits to its left. As for where you'd use each one, G_SEXT_INREG is useful for cases where you don't want to handle the smaller type. For example, most upstream targets have legal operations for s32 and s64 and widen s1-s31 to s32 as well as s33-s63 to s64. However, they still have to support sign extension from say, s7 to s32 if the input IR had that. One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more memory than G_SEXT_INREG (which can add up if you do it a lot, e.g. for code heavily using short or char) but aside from that, it also means that all the register allocation code has to support s7. Similarly, spill/reload/move has to support s7, frame lowering has to support it. Instruction selection has to support it too which is a problem for imported SelectionDAG patterns as they can't describe s7 unless there's a register class with an i7 type which isn't possible as it isn't one of the MVT types. There's probably more but the point is that being able to eliminate some types simplifies the backend. You might think that this sounds like type legalization (and I'd be inclined to agree w.r.t the effect at least but I'd still call it operation legalization as the possible removal of types is a side-effect) but the key difference from SelectionDAG is that GlobalISel itself doesn't mandate it or treat it separately from operation legalization. If a target works in s8 a lot but doesn't really have s8 operations or registers, it can choose to make s8 operations legal anyway and trade more complexity in the backend for (hopefully) better code quality. Secondly, it looks like what we need is just a way to tell the artifact combiner "don't turn sext into shifts on this target, for these sizes". Why don't we just use G_SEXT's legality for that? I.e. actually use the regular legality actions on G_SEXT directly instead of G_SEXT_INREG, and tell the combiner to not mess with G_SEXT with legal sizes. With G_SEXT_INREG as proposed in this patch, it looks like you're just moving the type legality problem into a value-of-immediate legality problem for which we need new infrastructure. We want to eliminate the smaller types _and_ have a sign-extension operation which are mutually exclusive demands at the moment. It's not just about legalization though, it's more about the handling of optimization and instruction selection in all passes from the legalizer onwards (including target specific passes). In the previous example, I showed that hanging on to the knowledge that we had a sign-extension led to the optimal code. How do we hang on to that knowledge for as long as it's useful and only let go of that knowledge when it's beneficial to do so? Suppose we lowered our sign-extend to a lsl, ashr pair. There is (or rather, will be) lots of combines that know how to transform various shifts into other forms (not all of them shifts). Some use known-bits analysis to prove they're valid, some are much simpler. There's also lots of lowerings that do likewise and lots of other optimizations with various effects on shifts. Each and every one can potentially permanently remove our knowledge that we have a sign-extend operation and force us to use the slower code because we can't reconstruct the desired operation later. So how do we get our sign-extend past the combiners and other optimizers that only want to do their job? One answer to this is we teach every single one how to recognize a sign-extending-shift-pair and ask the target if it wants us to leave it alone. This gets impractical really quickly. Even assuming we can teach hundreds of optimizations to recognize dozens of conventional sign-extension patterns and all the target specific patterns in a reasonable way, we'd still be burning large amounts of compile-time checking for all the possible ways a sign-extend can be accomplished just to prevent undesirable optimizations from happening. A better answer is to form a higher-level 'composite' operation to smuggle it past all the combiners and optimizers we don't want to happen. This is what G_SEXT_INREG does. In this approach, it's cheap to determine that the undesirable combines/optimizations shouldn't happen because the opcode isn't the one they want. The downside is that any optimization you do want to happen needs to be taught to recognize the new opcode as well. This is much more managable than the alternative of teaching everything to reject everything they shouldn't change as a list of things they should do grows much slower than the list of things not to do. To put this in another context that doesn't have the legalizer baggage, consider byte swapping and let's pretend there's no intrinsic so we can only emit a byte swap instruction if we actually recognize a byteswap in the code. It's usually a pretty big win to emit a byte-swap instruction so we want to find as many as possible. Unfortunately, there are lots of ways to write byte swapping code and it's difficult to recognize even without optimizations getting in the way. The chances of still being able to recognize a byteswap after the optimizers have picked at the code are fairly low. Some of the masks, shifts, and ors may have been mangled or disappeared entirely. So when we do find one, we want to make sure it gets to the instruction selector in tact. Much like I described above, we form a composite operation from the masks, shifts, and ors the moment we see a byte-swap pattern (ideally before legalization) and smuggle the byte swap operation through to the instruction selector. If we didn't do that, the dozens of patterns to match would become hundreds by the time we reach isel. Another context that has the same principles behind it is bit-rotation. With G_SEXT_INREG as proposed in this patch, it looks like you're just moving the type legality problem into a value-of-immediate legality problem for which we need new infrastructure. I disagree with this summary as it's too focused on the legalizer. I believe I'm solving an information-preservation problem for the compilation pipeline by preventing the decomposition of a higher-level operation into lower-level components in cases where that information is still useful to optimal codegen I also disagree w.r.t to the legalizer but I might be being picky here. I would say I'm removing a requirement that all targets with sign-extension instructions which outperform shift-pairs make G_SEXT legal for all possible source types that benefit from that instruction (for AArch64, this means every type from s1 up to s63). In the context of the legalizer, this means being able to promote the source type without changing the operation and thereby making the operation specified by the opcode and immediate rather than the opcode and types. In terms of implementation, this does turn an operation legality problem from being about types to being about value-of-immediate which is pretty close to the way you stated it and is what makes me think I might be being picky. I do think there is a small distinction between the two though as the value-of-immediate bit falls out of excluding the types from the definition of the operation. I'm probably missing something, so please bear with me :) No worries :-)
llvm/lib/Target/ARM/ARMLegalizerInfo.cpp
87	This is just to maintain the status quo for ARM. It will be whatever test cases you already had for the lowering of G_SEXT into G_LSL/G_ASHR which appears to be just llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-divmod.mir
llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir
64	Sure. I can commit it separately. Keeping it here makes it a bit difficult to spot the tests that are actually relevant for G_SEXT_INREG. It's inclusion in this patch indicates that the test was affected by the addition of G_SEXT_INREG. It takes a different code path to the same end result and slightly peturbs the order in the process. FWIW, I think that makes it relevant to G_SEXT_INREG but I don't mind committing the status-quo tests separately. The two tests that test something other than the maintenance of the status quo are: llvm/unittests/CodeGen/GlobalISel/LegalizerHelperTest.cpp llvm/unittests/CodeGen/GlobalISel/PatternMatchTest.cpp D61290 is the patch that makes G_SEXT_INREG legal for a target and changes the code for that target.

Perfect. What about G_ZEXT_INREG since it is more efficient to narrow scalar G_ZEXT_INREG then bitwise instruction with some bit mask?

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	NarrowScalar is good candidate for separate patch. Reason is that artifact combiner currently can not handle test that would use G_SEXT_INREG since it will have chained artifacts. G_SEXT will be G_UNMERGE-d, G_UNMERGE has to wait for: G_SEXT to be combined into G_SEXT_INREG, and then for G_SEXT_INREG to be narrowScalared into sequence of instructions that will end with G_MERGE_VALUES. Only at this point G_UNMERGE can be combined.
708	What is the idea behind this block, we generate two instructions with types larger then NarrowTy? G_TRUNC and G_SEXT that are generated have to be narrowScalared and then we have a few more merge/unmerge combines to do. Also LegalizerHelper does not know how to narrow scalar G_TRUNC and G_SEXT at the moment.
756	This loop works for `NarrowTy.getScalarSizeInBits() >= SizeInBits` as well. `NarrowTy.getScalarSizeInBits()` -> `NarrowSize`
767	Considering efficiency it might be better to create only one G_ASHR that will have sign of "extension point SEXT_INREG" and copy it to remaining registers that hold higher bits.
770	`getOperand(0).getReg()` -> `getReg(0)`

Thanks for the explanations. I think you have some good points. Overall, it still looks to me like we're complicating things for backend writers and introducing a lot of subtle distinctions to keep in mind. It would be useful to hear what other people think about this.

llvm/include/llvm/Target/GenericOpcodes.td
40	We actually do constrain the source and destination types. The constraint is specified here via type0:$dst and type0:$src where the use of the same type-index specifies a type matching constraint. It's tested on line 36 of llvm/test/MachineVerifier/test_g_sext_inreg.mir which is emitted when the types are not equal. We don't really need the message on line 38 as it's triggered by the subset of mismatches where they aren't even the same kind of type but it's somewhat useful to report how the types are different as well as that they're different. Sorry, I had seen all that, but I thought the DAG constraint referred to the actual register as well, not just the type. I guess the name is at least not worse then, and I can't think of a better one either. One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more memory than G_SEXT_INREG (which can add up if you do it a lot, e.g. for code heavily using short or char). Fair enough, but if that's the only good argument then this is premature optimization. [...] it also means that all the register allocation code has to support s7. Similarly, spill/reload/move has to support s7, frame lowering has to support it. You mean register bank selection? Otherwise, doing register allocation before instruction select is kind of a big difference from what happens in the upstream targets. Anyway, any passes would only have to support such types for G_TRUNC and G_SEXT, not for everything (just like any other legal operation). If we introduce G_SEXT_INREG, they now also have to support G_SEXT_INREG in addition to G_SEXT, since you can't guarantee that after the legalization we won't have any G_SEXT left. Instruction selection has to support it too which is a problem for imported SelectionDAG patterns as they can't describe s7 unless there's a register class with an i7 type which isn't possible as it isn't one of the MVT types. This is a good point, but then again if a target wants to keep those instructions unlowered, then they are legal and should be selected somehow. We should teach TableGen to handle such situations rather than hang on to whatever limitations SelectionDAG had. There's probably more but the point is that being able to eliminate some types simplifies the backend. You might think that this sounds like type legalization (and I'd be inclined to agree w.r.t the effect at least but I'd still call it operation legalization as the possible removal of types is a side-effect) but the key difference from SelectionDAG is that GlobalISel itself doesn't mandate it or treat it separately from operation legalization. If a target works in s8 a lot but doesn't really have s8 operations or registers, it can choose to make s8 operations legal anyway and trade more complexity in the backend for (hopefully) better code quality. It's a trade-off, fewer types is a simplification, but more opcodes isn't. Having to always keep in mind both G_SEXT and G_SEXT_INREG is going to be a burden for maintainers. In a sense, this is worse than type legalization, because after type legalization you were certain any funny types were gone, but now they may or may not have all been eaten up by G_SEXT_INREG and G_SEXTLOAD, so you may or may not need to worry about G_SEXT, depending on whether or not you left any legal producer for a certain type. It seems easier to shoot yourself in the foot. Suppose we lowered our sign-extend to a lsl, ashr pair. That means we're not interested in it, otherwise we could mark it as legal and select it to whatever better sequence we know in the instruction select or some other smarter pass. A better answer is to form a higher-level 'composite' operation to smuggle it past all the combiners and optimizers we don't want to happen. This is what G_SEXT_INREG does. I'm still not convinced. This could be true for any other operation that can be lowered. You wouldn't propose adding G_SREM_FOR_REAL versus G_SREM just because some targets don't want to lower it, right? They'd just have to mark it as legal or custom. I disagree with this summary as it's too focused on the legalizer. I believe I'm solving an information-preservation problem for the compilation pipeline by preventing the decomposition of a higher-level operation into lower-level components in cases where that information is still useful to optimal codegen You could also solve it by keeping the type, which would be a more honest representation imo. I also disagree w.r.t to the legalizer but I might be being picky here. I would say I'm removing a requirement that all targets with sign-extension instructions which outperform shift-pairs make G_SEXT legal for all possible source types that benefit from that instruction (for AArch64, this means every type from s1 up to s63). In the context of the legalizer, this means being able to promote the source type without changing the operation and thereby making the operation specified by the opcode and immediate rather than the opcode and types. In terms of implementation, this does turn an operation legality problem from being about types to being about value-of-immediate which is pretty close to the way you stated it and is what makes me think I might be being picky. I do think there is a small distinction between the two though as the value-of-immediate bit falls out of excluding the types from the definition of the operation. So, you're only removing the requirement because types are implicitly illegal (unless the target says otherwise), whereas immediate values are implicitly legal. This means that types force you to think about whether or not something should be legal, whereas immediates will just sneak through. At any rate this is something that hasn't really been discussed.
llvm/include/llvm/Target/Target.td
839	"has no" ... ?
840	How about "is only used for clarity"?
llvm/lib/Target/ARM/ARMLegalizerInfo.cpp
87	Not really, there are tests for standalone G_SEXT in llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-exts.mir. The tests in divmod are testing divmod, and only incidentally the extensions. Since G_SEXT_INREG is an independent opcpde and, as you said in a previous comment, not just a legalization artifact, it should have a standalone test (without any combines). Otherwise, if we changed our handling of legalization artifacts again in the future, we'd leave G_SEXT_INREG uncovered. Anyway, I can add it myself as a follow-up if this gets committed.
llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir
64	It's inclusion in this patch indicates that the test was affected by the addition of G_SEXT_INREG. It takes a different code path to the same end result and slightly peturbs the order in the process. FWIW, I think that makes it relevant to G_SEXT_INREG but I don't mind committing the status-quo tests separately. You're contradicting yourself a bit. If the order is relevant now, then it will be relevant for future changes as well, so you shouldn't switch to CHECK-DAG.

In D61289#1493275, @rovka wrote:

Thanks for the explanations. I think you have some good points. Overall, it still looks to me like we're complicating things for backend writers and introducing a lot of subtle distinctions to keep in mind. It would be useful to hear what other people think about this.

Thanks for drilling into this Diana. There's some good points raised about how necessary this is. After some consideration, I think on balance this is useful enough to move forward with.

m2c: While technically it may be true that we can generate correct code without the use of SEXT_IN_REG, I think the use cases warrants inclusion because of how critical extends are to the way the legalizer works. Given that sooner or later optimizations are going to break the extension-through-shifts idiom, and that there's no easy way to recover, I think it will become necessary to have this in order to have competitive performant code. There is additional complexity, but that complexity can be opted out of for targets that don't want to deal with this. If they do, then a supported route is available.

In D61289#1493205, @Petar.Avramovic wrote:

Perfect. What about G_ZEXT_INREG since it is more efficient to narrow scalar G_ZEXT_INREG then bitwise instruction with some bit mask?

Some of the arguments work for a G_ZEXT_INREG but I don't think it's as widely applicable. It would essentially be a G_AND with the constraint that the mask be a contiguous series of bits starting at position 0 and I'm not aware of a target that can improve performance by preserving that constraint. Most targets I know would select G_ZEXT_INREG to an AND using either an inline or materialized immediate at which point we haven't really gained anything by protecting it against harmful 'optimization'. On MIPS for example, andi r1, r2, #0xff and andi r1, r2, #0xf0 have the same performance. I can see some cases where certain immediates have better performance such as 0xffffffff on a 64-bit and for a target that can access a zero-extended 32-bit subregister in other instructions, or 0xffff and zero-extended 16-bit subregs. Mips isn't one of those targets since it sign-extends all operands and results to infinite bits but I think ARM and X86 can benefit for some sizes, I don't know the performance characteristics compared to an AND though, it could be the same cycle-count/code-size/etc. either way. Do you (or others) know of targets that would benefit from G_ZEXT_INREG?

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	There's a test for narrowScalar of G_SEXT_INREG in this patch. Could you elaborate on what kind of test you're looking for? I think you're looking for a test with neighbouring instructions which are also narrowScalar'd
708	It's not possible to eliminate types larger than NarrowTy with a single narrowScalar. narrowScalar's job is to replace this instruction with one or more that work on NarrowTy sized types and some legalization artifacts (G_SEXT and G_TRUNC in this case) which are required to maintain type correctness. Those legalization artifacts are either combined away or further legalized according to their legalization rules. This block aims to optimize the case where all but one of the narrowed components is a wholly a copy of the sign bit. It expects the G_TRUNC and G_SEXT to be either combined away by the artifact combiner or further legalized. For example: narrowing `%0:_(s32) = G_SEXT %1:_(s32), 8` down to s16 only requires us to preserve the component containing bits 0-15. We can forget about bits 16-31 and reconstruct them with the G_SEXT. This will help chains of instructions to be deleted as they are dead.
710	This statement is redundant though. It's already SizeInBits because we just read it from there

dsanders added inline comments.May 9 2019, 4:21 PM

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
756	It does but it potentially generates more instructions to do it. Consider `%0:_(s32) = G_SEXT %1:_(s32), 4` narrowScalar'd to s8. This method will break it into four components and will emit one G_SEXT_INREG and one G_ASHR. The other method will emit one G_SEXT_INREG and one G_SEXT which may be eliminated entirely or at worst, lower to one G_ASHR
767	It already does that. We save the register in FullExtensionReg and re-use it if we see any further full extensions. The test for it is in LegalizerHelperTest.cpp on line 821 (the last two operands of the G_MERGE_VALUES are both [[T6]]

Most targets I know would select G_ZEXT_INREG to an AND using either an inline or materialized immediate at which point we haven't really gained anything by protecting it against harmful 'optimization'.

My only consideration was that is faster to narrow scalar G_ZEXT_INREG, then to narrow scalar G_AND and G_CONSTANT. On the other hand AND has simple narrow scalar unlike G_SHL and G_ASHR so it is not that big performance/code size improvement compared to G_SEXT_INREG. Also as sign and zero extend are most of the time mentioned together, I thought that we could add G_ZEXT_INREG alongside with G_SEXT_INREG.

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	Yes. For example with this test on aarch64, store is narrowScalared: define void @func(i8 %x, i128* %p) { entry: %conv = sext i8 %x to i128 store i128 %conv, i128* %p ret void } Legalizer is not able to produce something like this %2:_(s32) = COPY $w0 %1:_(p0) = COPY $x1 %11:_(s64) = G_CONSTANT i64 63 %12:_(s64) = G_SEXT_INREG %2, 8 %13:_(s64) = G_ASHR %12, %11(s64) G_STORE %12(s64), %1(p0) :: (store 8 into %ir.p, align 16) %7:_(s64) = G_CONSTANT i64 8 %6:_(p0) = G_GEP %1, %7(s64) G_STORE %13(s64), %6(p0) :: (store 8 into %ir.p + 8, align 16) RET_ReallyLR which I assume is a desired output.
708	I did not consider situation when G_SEXT(G_SEXT_INREG) could combine away. Currently G_SEXT can only combine into G_TRUNC. The use of this G_SEXT is something legal or G_MERGE that is waiting for G_UNMERGE. We know this since uses are legalized before defs (G_SEXT is def in this context). Yet again if use is legal, then G_SEXT_INREG was legal or something else like lower but not narrow scalar. Are we talking about artifact combiner, or something that is not in tree? Could you provide a test where G_SEXT and G_TRUNC produced with this narrowScalar combine away?
756	Could we then add a possibility for targets to chose how they want to narrow scalar that is to have: narrowScalar - always creates G_UNMERGE+...+G_MERGE narrowScalarExt - creates G_TRUNC+...+G_{S\|Z\|ANY}EXT

In D61289#1493275, @rovka wrote:

Thanks for the explanations. I think you have some good points. Overall, it still looks to me like we're complicating things for backend writers and introducing a lot of subtle distinctions to keep in mind. It would be useful to hear what other people think about this.

It's a optional and small though and I think the wins make it worth the cost for the targets that benefit from it (I expect this to be quite a few targets since 11 in-tree targets had at least some support for the SelectionDAG equivalent*). I'm expecting that to be most targets since most elected to support ISD::SIGN_EXTEND_INREG rather than letting SelectionDAG expand it. Targets that don't benefit from it or don't want to deal with it yet can just use .lower() to opt out of it.

*For targets outside my knowledge this is based on matches for SIGN_EXTEND_INREG that aren't calls to make SelectionDAG expand it.

llvm/include/llvm/Target/GenericOpcodes.td
40	One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more memory than G_SEXT_INREG (which can add up if you do it a lot, e.g. for code heavily using short or char). Fair enough, but if that's the only good argument then this is premature optimization. Yep, it's a nice bonus rather than the motivation. [...] it also means that all the register allocation code has to support s7. Similarly, spill/reload/move has to support s7, frame lowering has to support it. You mean register bank selection? Otherwise, doing register allocation before instruction select is kind of a big difference from what happens in the upstream targets. Both unless there's a pass that strips out the s7 types. InstructionSelection doesn't normally change the types so any that get there will also pass through to the next pass. RegisterAllocation would be the pass that assigns 32-bit registers to the 7-bit type. Anyway, any passes would only have to support such types for G_TRUNC and G_SEXT, not for everything (just like any other legal operation). It depends on the pass. Some can limit their support to G_TRUNC/G_SEXT but others don't really work with the operation. For example, Frame lowering needs to know how to promote the s7 to s8 both for storage allocation and for emitting code to load/store from/to a given frame index. Register Allocation needs to know which registers are appropriate for an s7. Also, the list of types can potentially be large. For a 64-bit target, we have to ensure those later passes can deal with s1 through s64 unless a pass like the legalizer limits it. If we introduce G_SEXT_INREG, they now also have to support G_SEXT_INREG in addition to G_SEXT, since you can't guarantee that after the legalization we won't have any G_SEXT left. That's partly true. Targets that don't have `getActionDefinitionBuilder(G_SEXT_INREG).lower()` in their legalizer have to support both to whatever degree the legalization rules say are legal. There's no `lower()` for G_SEXT yet but it would be a G_ANYEXT followed by G_SEXT_INREG (which could further lower to a shift pair) which would allow us to limit support to G_ANYEXT (trivial) and G_SEXT_INREG. Those that do have `getActionDefinitionBuilder(G_SEXT_INREG).lower()` only need to support G_SEXT as G_SEXT_INREG will be lowered to G_SHL/G_ASHR by the legalizer and produce the same legal MIR we have today. Instruction selection has to support it too which is a problem for imported SelectionDAG patterns as they can't describe s7 unless there's a register class with an i7 type which isn't possible as it isn't one of the MVT types. This is a good point, but then again if a target wants to keep those instructions unlowered, then they are legal and should be selected somehow. We should teach TableGen to handle such situations rather than hang on to whatever limitations SelectionDAG had. I agree. It's not feasible in the SelectionDAG patterns we import (it requires changing the type inferencing engine in a big way). For now, C++ is the only way but hopefully we'll have GlobalISel rules at some point There's probably more but the point is that being able to eliminate some types simplifies the backend. You might think that this sounds like type legalization (and I'd be inclined to agree w.r.t the effect at least but I'd still call it operation legalization as the possible removal of types is a side effect) but the key difference from SelectionDAG is that GlobalISel itself doesn't mandate it or treat it separately from operation legalization. If a target works in s8 a lot but doesn't really have s8 operations or registers, it can choose to make s8 operations legal anyway and trade more complexity in the backend for (hopefully) better code quality. It's a trade-off, fewer types is a simplification, but more opcodes isn't. Having to always keep in mind both G_SEXT and G_SEXT_INREG is going to be a burden for maintainers. I agree it's a trade-off and the balance of it will differ between different targets. Where each target falls on that scale will depend on how much of a win selecting a specialized sign-extend instruction is compared to not doing it. Another factor that's important is how difficult it is to recognize a sign-extend after the optimizer has worked on the IR which ranges from easy (the optimizer didn't do much), to tricky (the optimizer mangled the pattern but it's still just about recognizable if you cover all the possibilities), to impossible (it's different code that happens to sign-extend as well). For the targets I'm interested in G_SEXT_INREG is well worth the price and I believe that other targets will land on the same end of the scale once they're optimizing more heavily. In a sense, this is worse than type legalization, because after type legalization you were certain any funny types were gone, but now they may or may not have all been eaten up by G_SEXT_INREG and G_SEXTLOAD, so you may or may not need to worry about G_SEXT, depending on whether or not you left any legal producer for a certain type. It seems easier to shoot yourself in the foot. It's fairly simple to catch the gaps in a legalization ruleset that aims to eliminate certain types. If any operation involving a given type is legal then G_ANYEXT and G_TRUNC for that type must also be legal. If either of those two are marked unsupported for a given type then the legalizer will fail on the IR and you can tell something is missing from the ruleset. Aside from that, being consistent with .clampScalar() and similar is the way to ensure certain types get eliminated. Suppose we lowered our sign-extend to a lsl, ashr pair. That means we're not interested in it, otherwise we could mark it as legal and select it to whatever better sequence we know in the instruction select or some other smarter pass. Not necessarily. It happens whenever the artifact combiner combines (sext (trunc x)). The reason this happens is because after merging those artifacts together, there's no way to represent the sign-extension other than the G_SHL/G_ASHR pair (or G_SEXT_INREG) but the operation is still required to happen. If the artifact combiner didn't do this then it would be impossible to eliminate types. A better answer is to form a higher-level 'composite' operation to smuggle it past all the combiners and optimizers we don't want to happen. This is what G_SEXT_INREG does. I'm still not convinced. This could be true for any other operation that can be lowered. You wouldn't propose adding G_SREM_FOR_REAL versus G_SREM just because some targets don't want to lower it, right? They'd just have to mark it as legal or custom. It is true for any other operation that can be lowered and is important for performance. Whether I'd propose it for the G_* namespace upstream largely depends on whether I think it has general applicability to other targets. I've been meaning to propose widening multiplies for a while (e.g. 32x32 -> 64) as many targets have these but I wouldn't propose a G_FFT because only DSP's have a potential use for one (and even then, why not use G_INTRINSIC). For situations where only one or a couple backends would be interested, I recommend legalizing or combining to what we've been calling `target-specific generic instructions` (which is a _terrible_ name) which is essentially a target-pseudo that uses the type[0-9] constraints. The effect is very much like a generic instruction, but is target specific. I disagree with this summary as it's too focused on the legalizer. I believe I'm solving an information-preservation problem for the compilation pipeline by preventing the decomposition of a higher-level operation into lower-level components in cases where that information is still useful to optimal codegen You could also solve it by keeping the type, which would be a more honest representation imo. I disagree that that's a more honest representation. It starts off honest but it becomes a lie as the compilation pipeline progresses. At the IR Translator stage, I agree that keeping the type is the better representation but once I'm shaping the MIR to suit my target (the legalizer being one of the bigger passes that does that but they all do it to some degree) that opinion changes. It takes on more target specific impurities until we get to a purely target dependent representation. Swapping out G_SEXT on non-existant types for a G_SEXT_INREG or G_SHL/G_ASHR pair is changing honesty w.r.t target independence to honesty w.r.t the target. In the same way, I swap out generic opcodes for target-specific-generic-opcodes and target-instructions as the pipeline progresses. This one just has more common ground with other targets than most. I also disagree w.r.t to the legalizer but I might be being picky here. I would say I'm removing a requirement that all targets with sign-extension instructions which outperform shift-pairs make G_SEXT legal for all possible source types that benefit from that instruction (for AArch64, this means every type from s1 up to s63). In the context of the legalizer, this means being able to promote the source type without changing the operation and thereby making the operation specified by the opcode and immediate rather than the opcode and types. In terms of implementation, this does turn an operation legality problem from being about types to being about value-of-immediate which is pretty close to the way you stated it and is what makes me think I might be being picky. I do think there is a small distinction between the two though as the value-of-immediate bit falls out of excluding the types from the definition of the operation. So, you're only removing the requirement because types are implicitly illegal (unless the target says otherwise), whereas immediate values are implicitly legal. This means that types force you to think about whether or not something should be legal, whereas immediates will just sneak through. At any rate this is something that hasn't really been discussed. Could you elaborate on why you think immediates are implicitly legal? Their legality is specified by the legalization rules. AArch64 happens to support all of them in D61290 so it says legal without even looking at the immediate but LegalizerHelperTest.cpp in D61290 has an example where only certain immediates are legal. I think you have to think about what is legal either way and define your legalization rules accordingly to consume any input and produce a legal output. If your target doesn't have fast sign-extend instructions then use `.lower()`, if you only have ones for s8 and s16 to s32 then use `.legalForTypeWithImm({s32, 8}, {s32, 16}).clampScalar(0, s32, s32).lower()`, if you can handle any sX to s32 then use `.legalFor({s32}).clampScalar(0, s32, s32).lower()` I would say I'm removing the requirement because: Backends should be able to constrain the types they have to support. They shouldn't have to support all types smaller than register width just to handle G_SEXT. Backends shouldn't be required to have G_SEXT legal for every type combination smaller than register width. Changing value sizes isn't the only way to get a sign-extension (e.g. (x << 24) >> 24 in C/C++) It simplifies the optimizers because they can easily identify sign-extends cases which may cause them to harm rather than improve the code It allows the compiler to produce better code It allows the MIR to more accurately reflect the target
llvm/include/llvm/Target/Target.td
839	Umm, I'm not sure what happened there. It was supposed to have 'special behaviour' after that. Fixed it
840	That sounds better to me. Done
llvm/lib/Target/ARM/ARMLegalizerInfo.cpp
87	Ah ok, I didn't find that one because it only tests a legal G_SEXT and not one that's legalized. I've added a test that checks it gets lowered to the shift pair
llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir
64	It's inclusion in this patch indicates that the test was affected by the addition of G_SEXT_INREG. It takes a different code path to the same end result and slightly peturbs the order in the process. FWIW, I think that makes it relevant to G_SEXT_INREG but I don't mind committing the status-quo tests separately. You're contradicting yourself a bit. If the order is relevant now, then it will be relevant for future changes as well, so you shouldn't switch to CHECK-DAG. The test is relevant because it checks that the lowering path produces the same instructions as before (.lower() must cause the intermediate G_SEXT_INREG to lower to the same G_SHL/G_ASHR pair as before). The instruction order (specifically the placement of the G_CONSTANT argument to the G_SHL/G_ASHR pair) is not important because it has no bearing on the correctness of the output. Using CHECK-DAG lets us ignore the unimportant bit (instruction order) while still checking the important bit (opcode, arguments, etc.).

Additional test and small nits

Harbormaster completed remote builds in B31765: Diff 199089.May 10 2019, 2:32 PM

dsanders marked 2 inline comments as done.May 10 2019, 2:56 PM

dsanders added inline comments.

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	Ah ok. The issue there is that G_STORE hasn't implemented narrowScalar yet. I don't think that we should prevent any opcode from getting a narrowScalar implementation until all opcodes have one. I think we should be implementing them as we need them and gradually build up the full set over time.
708	It does look like the artifact combiner is missing some combines, for the `NarrowTy.getScalarSizeInBits() < SizeInBits` case the resulting: %6:_(s128) = G_MERGE_VALUES %0(s64), %12(s64) %7:_(s32) = G_TRUNC %6(s128) ought to be: %7:_(s32) = G_TRUNC %0(s64) and for the `NarrowTy.getScalarSizeInBits() >= SizeInBits` there's a: %10:_(s64) = G_SEXT_INREG %9, 32 %6:_(s128) = G_SEXT %10(s64) %7:_(s32) = G_TRUNC %6(s128) which firstly ought to be: %10:_(s64) = G_SEXT_INREG %9, 32 %7:_(s32) = G_TRUNC %10(s64) and secondly: %10:_(s64) = G_SEXT_INREG %9, 32 %7:_(s32) = %9(s32) the former of those two isn't really related to this patch (the G_SEXT_INREG isn't involved in the combine) but the latter is.
756	Assuming we fix the missing artifact combines, what would be the case where G_UNMERGE/G_MERGE is the better option?

Petar.Avramovic added inline comments.May 10 2019, 6:16 PM

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	The problem (if we use narrow scalar with G_MERGE/G_UNMERGE) is the order in which we attempt to combine artifacts in Legalizer. D61787 and narrow scalar for G_ANYEXT(with G_MERGE/G_UNMERGE) will be able to give output like this.
708	Sorry, I still don't understand. In this fragment %9 should be s64 not s32. An IR function that results in situation where it is better to narrow scalar with SEXT+TRUNC where they combine with something would be helpful. Maybe it is possible to perform other combines before we make G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT? That is to not create G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT and solve everything in combiner with for example some pattern that combines multiple artifacts instead of only two.
756	%0:_(s128) = G_ANYEXT ..... %1:_(s128) = G_SEXT_INREG %0, 8 %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) At the moment most(if not all?) of narrow scalars make sequence of instructions that starts with G_UNMERGE and ends with G_MERGE. In case we emit G_SEXT as end of sequence it won't be able to combine with G_UNMERGE_VALUES. We then have to perform narrow scalar of this G_SEXT (that will end with G_MERGE) then combine this with %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) Same for G_TRUNC at the start. Not to mention that this complicates order in which we attempt to combine artifacts. And if G_SEXT_INREG is to be narrow scalared I would expect that it is surrounded with G_MERGE and G_UNMERGE, on mips at least. Considering llvm test-suite and mips I would say that it is always better to emit G_UNMERGE/G_MERGE. I cannot think of example where G_SEXT would be able to combine with something. And for general answer I would say that it is better to emit G_UNMERGE/G_MERGE if def of G_SEXT_INREG is used in G_UNMERGE_VALUES. This is problematic since it requires legalizer to decide how to narrow scalar based on surrounding instructions.

rovka added inline comments.May 13 2019, 5:36 AM

llvm/include/llvm/Target/GenericOpcodes.td
40	Could you elaborate on why you think immediates are implicitly legal? Their legality is specified by the legalization rules. Sorry about being unclear, let me give it another try. AArch64 happens to support all of them in D61290 so it says legal without even looking at the immediate [...] That's exactly what I meant, that it says legal without even looking at the immediate. I haven't looked at D61290 in too much detail, so excuse me if I'm misunderstanding again. Suppose you had a G_OP with Type0 and Type1, if you say .legalFor({s32}) and just that, you'll get an error because you're not covering Type1 with your rules. From your comments I got the impression that for a G_OP2 with Type0 and Imm1, if you say .legalFor({s32}) then that's enough and all values of Imm1 will just be legal. If that's not the case, then I take this comment back.
llvm/include/llvm/Target/Target.td
840	That sounds better to me. Done "That" = the old version? I don't mind either way, just making sure you didn't miss it by mistake.
llvm/lib/Target/ARM/ARMLegalizerInfo.cpp
87	Cool, thanks!

Petar.Avramovic mentioned this in D61787: [GlobalISel Legalizer] Improve artifact combiner.May 15 2019, 6:12 AM

Sorry for the slow reply. I had to work on other things for a couple days.

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	I don't understand this. What is the issue with the order artifacts are combined?
708	Sorry, I still don't understand. In this fragment %9 should be s64 not s32. You're right, that step isn't quite right. It should be `%7:_(s32) = G_TRUNC %9(s64)`. The point was to consume the input rather than the output as the lower 32-bits are unchanged by the G_SEXT_INREG. Climbing the dependency chain like this allows the uses of %7 to start sooner. An IR function that results in situation where it is better to narrow scalar with SEXT+TRUNC where they combine with something would be helpful. Any IR where many operations are too large for your target would provide good examples. Consider: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %3:_(s64) = G_SEXT_INREG %2:_(s64), 16 and that both have narrowScalar(/typeidx=/0, s32). The legalization would proceed to: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %4:_(s32), %5:_(s32) = G_UNMERGE_VALUES %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_MERGE_VALUES %6:_(s32), %5:_(s32) and then to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %2:_(s64) = G_MERGE_VALUES %7:_(s32), %8:_(s32) %4:_(s32), %5:_(s32) = G_UNMERGE_VALUES %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_MERGE_VALUES %6:_(s32), %5:_(s32) then the artifact combiner would fold the middle merge/unmerge to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_MERGE_VALUES %6:_(s32), %8:_(s32) Notice that we still have the `%8:_(s32) = G_ADD %10:_(s32), %12:_(s32)` at this point even though we're about to overwrite it. We're not going to be able to improve on this until a post-legalize combiner. Now consider the same case with this optimization: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %3:_(s64) = G_SEXT_INREG %2:_(s64), 16 becomes: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %4:_(s32) = G_TRUNC %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) then: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %2:_(s64) = G_MERGE_VALUES %7:_(s32), %8:_(s32) %4:_(s32) = G_TRUNC %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) which the artifact combiner (should) simplify to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) and then to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) and then to: %9:_(s32) = G_TRUNC %0:_(s64) %11:_(s32) = G_TRUNC %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) which is simpler. The second G_ADD was correctly recognized as being dead and removed. As a result fewer instructions were emitted by the legalizer and subsequent passes have less work to do. Maybe it is possible to perform other combines before we make G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT? That is to not create G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT and solve everything in combiner with for example some pattern that combines multiple artifacts instead of only two. It's not entirely clear what you're suggesting here. I'm particularly confused by the 'that would benefit from narrow scalar' bit since it's not a choice to narrow scalar or not. An operation is either too wide for the target and must be narrowScalar'd or it's not. If your suggestion is to try to form G_SEXT_INREG instructions in a combine pass after the legalizer then I'd say that's too late in the pipeline. There's no guarantee that the combine to form a G_SEXT_INREG will run before a combine that makes them unrecognizable. Ideally we want to make a best effort to form them in a pre-legalizer combiner in addition to the legalizer.
756	You're assuming that we _don't_ fix the missing artifact combines there. Given: %0:_(s128) = G_ANYEXT ..... %1:_(s128) = G_SEXT_INREG %0, 8 %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) ... = ... %2:_(s64) ... = ... %3:_(s64) then assuming you are doing narrowScalar(/typeidx=/0, s64) on the G_SEXT_INREG, we should get: %0:_(s128) = G_ANYEXT ..... %5:_(s64) = G_TRUNC %0:_(s128) %1:_(s64) = G_SEXT_INREG %0, 8 %4:_(s128) = G_SEXT %1 %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) ... = ... %2:_(s64) ... = ... %3:_(s64) then the artifact combiner should give: %0:_(s128) = G_ANYEXT ..... %5:_(s64) = G_TRUNC %0:_(s128) %1:_(s64) = G_SEXT_INREG %0, 8 %6:_(s64) = G_ASHR %1, i64 63 ... = ... %1:_(s64) ... = ... %6:_(s64) then if we assume that G_ANYEXT was from anything smaller than s64 (which should be the case for MIPS): %5:_(s64) = G_ANYEXT ..... %1:_(s64) = G_SEXT_INREG %0, 8 %6:_(s64) = G_ASHR %1, i64 63 ... = ... %1:_(s64) ... = ... %6:_(s64) I cannot think of example where G_SEXT would be able to combine with something. There's plenty of cases where that should be able to combine. The common cases for targets that expect the input MIR to contain s8, s16, s32, s64 operations and need to legalize to only s32 and s64 operations are: If X == 2Y: %1:_(sX) = G_SEXT %0:_(sY) %2:_(sY), %3:_(sY) = G_UNMERGE_VALUES %1 ... = ... %2:_(sY) ... = ... %3:_(sY) to %3:_(sY) = G_ASHR %0:_(sY), iY (Y-1) ... = ... %0:_(sY) ... = ... %3:_(sY) If X == 4Y: %1:_(sX) = G_SEXT %0:_(sY) %2:_(sY), %3:_(sY), %4:_(sY), %5:_(sY) = G_UNMERGE_VALUES %1 ... = ... %2:_(sY) ... = ... %3:_(sY) ... = ... %4:_(sY) ... = ... %5:_(sY) to %3:_(sY) = G_ASHR %0:_(sY), iY (Y-1) ... = ... %0:_(sY) ... = ... %3:_(sY) ... = ... %3:_(sY) ... = ... %3:_(sY) For G_ZEXT, replace the `G_ASHR` with `G_CONSTANT iY 0`. For G_ANYEXT use IMPLICIT_DEF instead

Small comment fix

Harbormaster completed remote builds in B32270: Diff 200627.May 21 2019, 7:08 PM

dsanders added inline comments.May 21 2019, 7:13 PM

llvm/include/llvm/Target/GenericOpcodes.td
40	Ah ok, you're just looking for the rule verifier to check that each target looked at it like it does for type indices. Sure, I can add that. For AArch64 we'd say 1-63 is legal (i.e. every valid immediate for an s64 G_SEXT_INREG) because the SBFMX instruction handles them all but that still counts as looking at it and the verifier would still reject it if we didn't at least look.
llvm/include/llvm/Target/Target.td
840	I meant 'is only used for clarity' sounds better. I made the change in my working copy

Thanks for the explanation. Summary of inline discussion so far:
NarrowScalar G_SEXT_INREG cannot have decent (able to combine away artifacts) mir test with this patch alone since there are problems with artifact combiner.
Approach 1: narrowScalar G_SEXT_INREG with G_UNMERGE/G_MERGE + D61787. Able to perform all combines introduced so far and generates desired output.
Approach 2: narrowScalar G_SEXT_INREG with G_TRUNC/G_SEXT + TODO: introduce more artifacts combines and fix artifact combiner to support them.
I expect that both approaches should have similar performance and generate equivalent output.
Are there there cases where Approach 2 generates better code, as I am now convinced that Approach 2 will have similar (if not better ?) performance compared to Approach 1?

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
698	Artifact might have to wait for another instruction to be legalized (with e.g. narrowScalar) and only then perform combine. For more detail look at D61787, it covers artifact combines available at the moment.
708	The point was to consume the input rather than the output as the lower 32-bits are unchanged by the G_SEXT_INREG. Climbing the dependency chain like this allows the uses of %7 to start sooner. Something like look through copy, but here it is look through G_SEXT_INREG depending on immediate (operand 2) ? Assuming Artifact combiner is able to perform all mentioned combines, it gets same output as narrowScalar using G_UNMERGE/G_MERGE + D61787. High bits in `%3:_(s64) = G_MERGE_VALUES %6:_(s32), %5:_(s32)` should be sign bit of %6 `%3:_(s64) = G_SEXT_INREG %2:_(s64), 16` narrowScalars to: %4:_(s32), %5:_(s32) = G_UNMERGE_VALUES %2:_(s64) %8:_(s32) = G_CONSTANT i32 31 %6:_(s32) = G_SEXT_INREG %4:_, 16 %7:_(s32) = G_ASHR %6:_, %8:_(s32) %3:_(s64) = G_MERGE_VALUES %6:_(s32), %7:_(s32) also `%3:_(s64) = G_SEXT %6:_(s32)` has to be narrowScalared/combined away by G_UNMERGE when available. I run mentioned example and got following results: Setup for test is: applying: D61289, D61289 and D61787, in MipsLegalizerInfo changing mips G_SEXT_INREG to: getActionDefinitionsBuilder(G_SEXT_INREG) .legalForTypeWithImm({{s32,8},{s32,16}}) .maxScalar(0, s32) .lower(); and changing G_SEXT_INREG to do narrow scalar with G_UNMERGE/G_MERGE always. running following test with -march=mipsel -global-isel -O0 -stop-after=legalizer define i64 @f(i64 signext %a, i64 signext %b) { entry: %add = add i64 %a, %b %conv = trunc i64 %add to i16 %conv1 = sext i16 %conv to i64 ret i64 %conv1 } gives following result: %2:_(s32) = COPY $a0 %3:_(s32) = COPY $a1 %4:_(s32) = COPY $a2 %5:_(s32) = COPY $a3 %25:_(s32) = G_CONSTANT i32 0 %22:_(s32) = G_ADD %2, %4 %26:_(s32) = G_CONSTANT i32 1 %27:_(s32) = COPY %25(s32) %23:_(s32) = G_AND %27, %26 %16:_(s32) = G_ADD %22, %23 %24:_(s32) = G_ICMP intpred(ult), %16(s32), %2 %20:_(s32) = G_ADD %3, %5 %28:_(s32) = COPY %24(s32) %21:_(s32) = G_AND %28, %26 %18:_(s32) = G_ADD %20, %21 %32:_(s32) = G_CONSTANT i32 31 %33:_(s32) = G_SEXT_INREG %16, 16 %34:_(s32) = G_ASHR %33, %32(s32) $v0 = COPY %33(s32) $v1 = COPY %34(s32) RetRA implicit $v0, implicit $v1 there are few places for improvement: first, lets remove dead instructions (this could take place after main do/while loop in Legalizer.cpp) %2:_(s32) = COPY $a0 %4:_(s32) = COPY $a2 %25:_(s32) = G_CONSTANT i32 0 %22:_(s32) = G_ADD %2, %4 %26:_(s32) = G_CONSTANT i32 1 %27:_(s32) = COPY %25(s32) %23:_(s32) = G_AND %27, %26 %16:_(s32) = G_ADD %22, %23 %32:_(s32) = G_CONSTANT i32 31 %33:_(s32) = G_SEXT_INREG %16, 16 %34:_(s32) = G_ASHR %33, %32(s32) $v0 = COPY %33(s32) $v1 = COPY %34(s32) RetRA implicit $v0, implicit $v1 fragment %25:_(s32) = G_CONSTANT i32 0 %22:_(s32) = G_ADD %2, %4 %26:_(s32) = G_CONSTANT i32 1 %27:_(s32) = COPY %25(s32) %23:_(s32) = G_AND %27, %26 %16:_(s32) = G_ADD %22, %23 comes from lower() of %15:_(s1) = G_CONSTANT i1 false %16:_(s32), %17:_(s1) = G_UADDE %11:_, %13:_, %15:_ from narrowScalar of G_ADD. If we used %16:_(s32), %17:_(s1) = G_UADDO %11:_, %13:_ for low bits, we would get %2:_(s32) = COPY $a0 %4:_(s32) = COPY $a2 %16:_(s32) = G_ADD %2, %4 %32:_(s32) = G_CONSTANT i32 31 %33:_(s32) = G_SEXT_INREG %16, 16 %34:_(s32) = G_ASHR %33, %32(s32) $v0 = COPY %33(s32) $v1 = COPY %34(s32) RetRA implicit $v0, implicit $v1 This should be equivalent to mentioned desired output when G_TRUNC/G_SEXT is used in narrow scalar. It's not entirely clear what you're suggesting here %10:_(s64) = G_SEXT_INREG %9, 32 %6:_(s128) = G_SEXT %10(s64) %7:_(s32) = G_TRUNC %6(s128) Was most likely generated from something like: %11:_(s32) = G_TRUNC %9(s64) %10:_(s64) = G_SEXT %11:_(s32) %6:_(s128) = G_SEXT %10(s64) %7:_(s32) = G_TRUNC %6(s128) I meant that we might be able to figure out that these 4 instructions are equivalent to `%7:_(s32) = G_TRUNC %9(s64)` (combine more then 2 artifact in artifact combiner) instead of combining first two into into G_SEXT_INREG. that would benefit from narrow scalar Is there a test that produces better/smaller code when when G_SEXT_INREG is narrow scalar-ed with G_TRUNC/G_SEXT instead of G_UNMERGE/G_MERGE? From discussion so far I am convinced that both approaches generate/should generate same output.
756	Ah, didn't think of G_UNMERGE + G_SEXT combine. Then we perform work from "the for loop from G_SEXT_INREG narrow scalar" inside combine and it is pretty much same thing. Both narrowScalar approaches have similar overall performance and generate same output.

Add verification that immediate indices are used with the same caveats as the type index checks (custom predicates are assumed to check, etc)

Herald added a subscriber: jfb. · View Herald TranscriptJun 17 2019, 5:35 PM

Harbormaster completed remote builds in B33519: Diff 205223.Jun 17 2019, 5:37 PM

I'm running into a related problem attempting to implement narrowScalar for G_SEXT. AMDGPU is allergic to 64-bit shifts, so I want to implement

%1:_(s32) = G_TRUNC %0
%2:_(s64).= G_SEXT %1

%1:_(s32) = G_TRUNC %0
%2:_(s32) = G_ASHR %1, 31
%3:_:(s64) = G_MERGE_VALUES %1, %2

Since the 64-bit shift is possible, the artifact combiner produces the undesirable 64-bit shift combination. Worse, this combination ends up infinitely looping if the required shift requires legalizatiion

I think it would be useful to have generic UBFE/SBFE instructions, which could take the place of sext_inreg. However, those would allow variable offset/width, so it wouldn't bee quite the same.

In D61289#1563253, @arsenm wrote:

I think it would be useful to have generic UBFE/SBFE instructions, which could take the place of sext_inreg. However, those would allow variable offset/width, so it wouldn't bee quite the same.

Not sure what the benefit would be there. IIRC we don't really have a need for zext_in_reg and the legalizer isn't going to produce variable shifts in most cases.

I think this has been reviewed enough now.

This revision is now accepted and ready to land.Jul 2 2019, 1:32 PM

Sorry for the slow reply, I've been split between quite a few tasks recently.

@rovka: I think I resolved the last issue you had which was that the verifier didn't ensure the immediate had been checked. Could you confirm that?

In D61289#1562830, @arsenm wrote:
I'm running into a related problem attempting to implement narrowScalar for G_SEXT. AMDGPU is allergic to 64-bit shifts, so I want to implement
%1:_(s32) = G_TRUNC %0
%2:_(s64).= G_SEXT %1
As
%1:_(s32) = G_TRUNC %0
%2:_(s32) = G_ASHR %1, 31
%3:_:(s64) = G_MERGE_VALUES %1, %2
Since the 64-bit shift is possible, the artifact combiner produces the undesirable 64-bit shift combination. Worse, this combination ends up infinitely looping if the required shift requires legalizatiion

With this patch, that will input turn to:

%1:_(s64) = G_ANYEXT %0 // Skipped if %0 is s64
%2:_(s64) = G_SEXT_INREG %0, 32

If you then narrowScalar the G_SEXT_INREG (and I've just noticed my last update lost this part of the patch, I'll fix that in a moment), you'd get:

%1:_(s64) = G_ANYEXT %0 // Skipped if %0 is s64
%3:_(s32), %4:_(s32) = G_UNMERGE_VALUES %1:_(s64)
%5:_(s32) = G_ASHR %4, 32
%2:_(s64) = G_MERGE_VALUES %3:_(s32), %5:_(s32)

assuming we have a full set of artifact combines, then we'd get one of the following: For %0 is s64:

%3:_(s32), %4:_(s32) = G_UNMERGE_VALUES %1:_(s64)
%5:_(s32) = G_ASHR %4, 32
%2:_(s64) = G_MERGE_VALUES %3:_(s32), %5:_(s32)

for %0 is <s64 and >s32:

%1:_(s64) = G_ANYEXT %0 // Skipped if %0 is s64
%3:_(s32), %4:_(s32) = G_UNMERGE_VALUES %1:_(s64)
%5:_(s32) = G_ASHR %4, 32
%2:_(s64) = G_MERGE_VALUES %3:_(s32), %5:_(s32)

for %0 is s32:

%5:_(s32) = G_ASHR %0, 32 // %0 is s32
%2:_(s64) = G_MERGE_VALUES %0:_(s32), %5:_(s32)

or this for %0 <s32:

%1:_(s32) = G_ANYEXT %0 // %0 is <s32
%5:_(s32) = G_ASHR %1, 32
%2:_(s64) = G_MERGE_VALUES %1:_(s32), %5:_(s32)

which all look like they'd do what you want.

In D61289#1563253, @arsenm wrote:

I think it would be useful to have generic UBFE/SBFE instructions, which could take the place of sext_inreg. However, those would allow variable offset/width, so it wouldn't bee quite the same.

That's a good point, G_SEXT_INREG %0, 16 is really just SBFE %0, 0, 16. The snag is that, as you hint at, including variable offset and width would require us to support context sensitive legality to cover targets that have sext instructions but don't have a full signed-bit-field-extract since the immediates would have to be represented with G_CONSTANT. We could potentially fix that by supporting instructions that allow both MO.isReg() and MO.isImm() operands somehow (currently the opcode statically indicates one or the other is permitted but not both). Another option would be to have a static SBFE that only allows immediates and a dynamic one that only allows registers. In the worst case for that approach, we'd need four but I'm not sure if we'd really need them all. Do any targets allow both the position and width to be specified as registers?

In D61289#1568751, @dsanders wrote:

. Do any targets allow both the position and width to be specified as registers?

Both the offset and width can be registers on AMDGPU (for the SALU version they are both packed in one register)

dsanders marked an inline comment as done.Jul 3 2019, 10:48 AM

dsanders added inline comments.

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
708	The point was to consume the input rather than the output as the lower 32-bits are unchanged by the G_SEXT_INREG. Climbing the dependency chain like this allows the uses of %7 to start sooner. Something like look through copy, but here it is look through G_SEXT_INREG depending on immediate (operand 2) ? I suppose it can be viewed that way as both end up simplifying the MIR. I think of them as separate things as this is a pattern-match-and-replace whereas look through copy is something that's done to help the pattern-match part of that by ignoring instructions that aren't relevant to whether the pattern should match or not.

In D61289#1568788, @arsenm wrote:

In D61289#1568751, @dsanders wrote:

Do any targets allow both the position and width to be specified as registers?

Both the offset and width can be registers on AMDGPU (for the SALU version they are both packed in one register)

In that case we'd need all four combinations if we did it by opcode. It would probably be better to go the other route to achieve a bitfield extract. That has some nasty side effects to sort out (e.g. what happens to the type indices if one of the operands is an immediate) but it should be possible.

I think we should land this patch while we think of that as it will be fairly simple to convert G_SEXT_INREG to the more flexible G_SFBE in future. It's just a matter of checking for the 0 immediate when looking for G_SEXT_INREG and adding it when we build a G_SBFE

In D61289#1568835, @dsanders wrote:

In D61289#1568788, @arsenm wrote:

In D61289#1568751, @dsanders wrote:

Do any targets allow both the position and width to be specified as registers?

Both the offset and width can be registers on AMDGPU (for the SALU version they are both packed in one register)

In that case we'd need all four combinations if we did it by opcode. It would probably be better to go the other route to achieve a bitfield extract. That has some nasty side effects to sort out (e.g. what happens to the type indices if one of the operands is an immediate) but it should be possible.

I think we should land this patch while we think of that as it will be fairly simple to convert G_SEXT_INREG to the more flexible G_SFBE in future. It's just a matter of checking for the 0 immediate when looking for G_SEXT_INREG and adding it when we build a G_SBFE

I would rather have G_SBFE/G_UBFE that accept arbitrary registers, and a separate G_SEXT_INREG. I wouldn't get any benefit from additional static versions

Bring back the code that went missing on my last update

Harbormaster completed remote builds in B34310: Diff 207854.Jul 3 2019, 11:46 AM

arsenm added inline comments.Jul 3 2019, 12:24 PM

llvm/include/llvm/Target/GenericOpcodes.td
46	Are tablegen emitter changes needed for this? I was trying to add an immediate operand in D64054, which seemed to not work correctly

arsenm added inline comments.Jul 3 2019, 12:25 PM

llvm/unittests/CodeGen/GlobalISel/PatternMatchTest.cpp
276	It's a gtestism that these should be swapped to get the correct expected/actual error message
284	Ditto

EXPECT_EQ argument order

llvm/include/llvm/Target/GenericOpcodes.td
46	You mean the untyped_imm_0? It's needed to drive the verifier that Diana requested and tells it which immediate checks it needs to look for

Harbormaster completed remote builds in B34313: Diff 207863.Jul 3 2019, 12:32 PM

arsenm added inline comments.Jul 3 2019, 12:38 PM

llvm/include/llvm/Target/GenericOpcodes.td
46	Yes. As far as I can tell the emitter will try looking for G_CONSTANT defined register for this. I agree there should be a special immediate operand for this, but I don't think this will work in a tablegen pattern as-is. The current form has a ValueType leaf, so matching the immediate wouldn't be quite the same.

dsanders marked an inline comment as done.Jul 3 2019, 12:45 PM

dsanders added inline comments.

llvm/include/llvm/Target/GenericOpcodes.td
46	I'm not sure I'm following. InOperandList in a GenericInstruction specifies the type constraints and uses special TypeOperand subclasses. They don't factor into tablegen patterns except in so far as requiring that types match.

arsenm added inline comments.Jul 3 2019, 12:58 PM

llvm/include/llvm/Target/GenericOpcodes.td
46	I mean I believe there's no way to define a node that will be capable of matching this instruction

Rebase and update to match changes in D61321

dsanders marked an inline comment as done.Aug 2 2019, 9:53 PM

dsanders added inline comments.

llvm/test/CodeGen/AArch64/GlobalISel/legalize-sext.mir
12–13	@paquette @aemerson: I ought to draw attention to this. My code appears to be doing the right thing for lower() of G_SEXT_INREG but you appear to have a custom legalization that promotes one of the two constants to s64. Is that intentional? If so, is it also intentional that it only does it for G_ASHR and not G_SHL too?

Harbormaster completed remote builds in B36064: Diff 213174.Aug 2 2019, 9:54 PM

aemerson added inline comments.Aug 5 2019, 10:39 AM

llvm/test/CodeGen/AArch64/GlobalISel/legalize-sext.mir
12–13	Yes this is intentional. In order to re-use the existing imported patterns for ashr & lshr we promote the shift amount to i64. For G_SHL we have some custom selection code to deal with non-64b immediates so it's not necessary.
llvm/test/CodeGen/AArch64/GlobalISel/legalize-undef.mir
29	Did constant and impdef order change? If so we can just re-run the test update script.

dsanders marked 2 inline comments as done.Aug 5 2019, 12:08 PM

dsanders added inline comments.

llvm/test/CodeGen/AArch64/GlobalISel/legalize-sext.mir
12–13	That's ok then. Thanks
llvm/test/CodeGen/AArch64/GlobalISel/legalize-undef.mir
29	Did constant and impdef order change? Yes. Should instruction order matter for these tests? Legalization doesn't give any guarantees on the output order and using CHECK-DAG makes the test robust against that

arsenm added inline comments.Aug 5 2019, 12:11 PM

llvm/test/CodeGen/AArch64/GlobalISel/legalize-undef.mir
29	It would, but as it's autogenerated It doesn't/shouldn't uses -DAG checks

dsanders mentioned this in rL368063: [globalisel] Allow SrcOp to convert an APInt and render it as an immediate….Aug 6 2019, 10:19 AM

dsanders mentioned this in rGd9934d4939af: [globalisel] Allow SrcOp to convert an APInt and render it as an immediate….

The next update fixes the last couple nits that were left open after the LGTM. I'm planning to commit this tomorrow

llvm/include/llvm/Target/GenericOpcodes.td
46	Ah I see. Yes, we'll need to map sext_inreg to G_SEXT_INREG and add some special handling to convert the MVT operand to an immediate.
llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp
770	I tried this change but it resulted in errors.

Fix the last few nits

Harbormaster completed remote builds in B36390: Diff 214060.Aug 7 2019, 8:28 PM

Closed by commit rL368487: [globalisel] Add G_SEXT_INREG (authored by dsanders). · Explain WhyAug 9 2019, 2:10 PM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: kristina. · View Herald TranscriptAug 9 2019, 2:10 PM

Revision Contents

Path

Size

llvm/

include/

llvm/

CodeGen/

GlobalISel/

ConstantFoldingMIRBuilder.h

11 lines

LegalizationArtifactCombiner.h

18 lines

LegalizerInfo.h

57 lines

Utils.h

3 lines

MC/

MCInstrDesc.h

16 lines

Support/

TargetOpcodes.def

1 line

Target/

GenericOpcodes.td

14 lines

Target.td

7 lines

lib/

CodeGen/

GlobalISel/

11 lines

119 lines

40 lines

17 lines

17 lines

Target/

AArch64/

AArch64LegalizerInfo.cpp

2 lines

AMDGPU/

AMDGPULegalizerInfo.cpp

2 lines

ARM/

ARMLegalizerInfo.cpp

2 lines

Mips/

MipsLegalizerInfo.cpp

2 lines

X86/

X86LegalizerInfo.cpp

1 line

test/

CodeGen/

AArch64/

GlobalISel/

irtranslator-extends.ll

30 lines

40 lines

142 lines

16 lines

42 lines

100 lines

16 lines

50 lines

10 lines

legalizer-info-validation.mir

722 lines

AMDGPU/

GlobalISel/

artifact-combiner-sext.mir

87 lines

combine-ext-legalizer.mir

16 lines

legalize-ashr.mir

150 lines

legalize-extract-vector-elt.mir

80 lines

legalize-sext.mir

40 lines

legalize-sextload-flat.mir

76 lines

ARM/

GlobalISel/

arm-legalize-divmod.mir

56 lines

arm-legalize-exts.mir

27 lines

Mips/

GlobalISel/

legalizer/

44 lines

28 lines

44 lines

232 lines

60 lines

X86/

GlobalISel/

legalize-ext-x86-64.mir

18 lines

x86_64-legalize-sitofp.mir

88 lines

MachineVerifier/

test_g_sext_inreg.mir

53 lines

unittests/

CodeGen/

GlobalISel/

LegalizerHelperTest.cpp

128 lines

PatternMatchTest.cpp

16 lines

Diff 207854

llvm/include/llvm/CodeGen/GlobalISel/ConstantFoldingMIRBuilder.h

Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	case TargetOpcode::G_SREM: {
const DstOp &Dst = DstOps[0];		const DstOp &Dst = DstOps[0];
const SrcOp &Src0 = SrcOps[0];		const SrcOp &Src0 = SrcOps[0];
const SrcOp &Src1 = SrcOps[1];		const SrcOp &Src1 = SrcOps[1];
if (auto MaybeCst =		if (auto MaybeCst =
ConstantFoldBinOp(Opc, Src0.getReg(), Src1.getReg(), *getMRI()))		ConstantFoldBinOp(Opc, Src0.getReg(), Src1.getReg(), *getMRI()))
return buildConstant(Dst, MaybeCst->getSExtValue());		return buildConstant(Dst, MaybeCst->getSExtValue());
break;		break;
}		}
		case TargetOpcode::G_SEXT_INREG: {
		assert(DstOps.size() == 1 && "Invalid dst ops");
		assert(SrcOps.size() == 2 && "Invalid src ops");
		const DstOp &Dst = DstOps[0];
		const SrcOp &Src0 = SrcOps[0];
		const SrcOp &Src1 = SrcOps[1];
		if (auto MaybeCst = ConstantFoldExtOp(
		Opc, Src0.getReg(), Src1.getImm().getSExtValue(), *getMRI()))
		return buildConstant(Dst, MaybeCst->getSExtValue());
		break;
		}
}		}
return MachineIRBuilder::buildInstr(Opc, DstOps, SrcOps);		return MachineIRBuilder::buildInstr(Opc, DstOps, SrcOps);
}		}
};		};
} // namespace llvm		} // namespace llvm

llvm/include/llvm/CodeGen/GlobalISel/LegalizationArtifactCombiner.h

Show First 20 Lines • Show All 98 Lines • ▼ Show 20 Lines	bool tryCombineSExt(MachineInstr &MI,

if (MI.getOpcode() != TargetOpcode::G_SEXT)		if (MI.getOpcode() != TargetOpcode::G_SEXT)
return false;		return false;

Builder.setInstr(MI);		Builder.setInstr(MI);
unsigned DstReg = MI.getOperand(0).getReg();		unsigned DstReg = MI.getOperand(0).getReg();
unsigned SrcReg = lookThroughCopyInstrs(MI.getOperand(1).getReg());		unsigned SrcReg = lookThroughCopyInstrs(MI.getOperand(1).getReg());

// sext(trunc x) - > ashr (shl (aext/copy/trunc x), c), c		// sext(trunc x) - > (sext_inreg (aext/copy/trunc x), c)
unsigned TruncSrc;		unsigned TruncSrc;
if (mi_match(SrcReg, MRI, m_GTrunc(m_Reg(TruncSrc)))) {		if (mi_match(SrcReg, MRI, m_GTrunc(m_Reg(TruncSrc)))) {
LLT DstTy = MRI.getType(DstReg);		LLT DstTy = MRI.getType(DstReg);
// Guess on the RHS shift amount type, which should be re-legalized if		if (isInstUnsupported({TargetOpcode::G_SEXT_INREG, {DstTy}}))
// applicable.
if (isInstUnsupported({TargetOpcode::G_SHL, {DstTy, DstTy}}) \|\|
isInstUnsupported({TargetOpcode::G_ASHR, {DstTy, DstTy}}) \|\|
isConstantUnsupported(DstTy))
return false;		return false;
LLVM_DEBUG(dbgs() << ".. Combine MI: " << MI;);		LLVM_DEBUG(dbgs() << ".. Combine MI: " << MI;);
LLT SrcTy = MRI.getType(SrcReg);		LLT SrcTy = MRI.getType(SrcReg);
unsigned ShAmt = DstTy.getScalarSizeInBits() - SrcTy.getScalarSizeInBits();		unsigned SizeInBits = SrcTy.getScalarSizeInBits();
auto MIBShAmt = Builder.buildConstant(DstTy, ShAmt);		Builder.buildInstr(
auto MIBShl = Builder.buildInstr(		TargetOpcode::G_SEXT_INREG, {DstReg},
TargetOpcode::G_SHL, {DstTy},		{Builder.buildAnyExtOrTrunc(DstTy, TruncSrc), APInt(64, SizeInBits)});
{Builder.buildAnyExtOrTrunc(DstTy, TruncSrc), MIBShAmt});
Builder.buildInstr(TargetOpcode::G_ASHR, {DstReg}, {MIBShl, MIBShAmt});
markInstAndDefDead(MI, *MRI.getVRegDef(SrcReg), DeadInsts);		markInstAndDefDead(MI, *MRI.getVRegDef(SrcReg), DeadInsts);
return true;		return true;
}		}
return tryFoldImplicitDef(MI, DeadInsts);		return tryFoldImplicitDef(MI, DeadInsts);
}		}

/// Try to fold G_[ASZ]EXT (G_IMPLICIT_DEF).		/// Try to fold G_[ASZ]EXT (G_IMPLICIT_DEF).
bool tryFoldImplicitDef(MachineInstr &MI,		bool tryFoldImplicitDef(MachineInstr &MI,
▲ Show 20 Lines • Show All 319 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h

Show First 20 Lines • Show All 325 Lines • ▼ Show 20 Lines	#ifndef NDEBUG
/// or perform an action upon (or both)) the type index I. The uncertainty		/// or perform an action upon (or both)) the type index I. The uncertainty
/// comes from free-form rules executing user-provided lambda functions. We		/// comes from free-form rules executing user-provided lambda functions. We
/// conservatively assume such rules do the right thing and cover all type		/// conservatively assume such rules do the right thing and cover all type
/// indices. The bitset is intentionally 1 bit wider than it absolutely needs		/// indices. The bitset is intentionally 1 bit wider than it absolutely needs
/// to be to distinguish such cases from the cases where all type indices are		/// to be to distinguish such cases from the cases where all type indices are
/// individually handled.		/// individually handled.
SmallBitVector TypeIdxsCovered{MCOI::OPERAND_LAST_GENERIC -		SmallBitVector TypeIdxsCovered{MCOI::OPERAND_LAST_GENERIC -
MCOI::OPERAND_FIRST_GENERIC + 2};		MCOI::OPERAND_FIRST_GENERIC + 2};
		SmallBitVector ImmIdxsCovered{MCOI::OPERAND_LAST_GENERIC_IMM -
		MCOI::OPERAND_FIRST_GENERIC_IMM + 2};
#endif		#endif

unsigned typeIdx(unsigned TypeIdx) {		unsigned typeIdx(unsigned TypeIdx) {
assert(TypeIdx <=		assert(TypeIdx <=
(MCOI::OPERAND_LAST_GENERIC - MCOI::OPERAND_FIRST_GENERIC) &&		(MCOI::OPERAND_LAST_GENERIC - MCOI::OPERAND_FIRST_GENERIC) &&
"Type Index is out of bounds");		"Type Index is out of bounds");
#ifndef NDEBUG		#ifndef NDEBUG
TypeIdxsCovered.set(TypeIdx);		TypeIdxsCovered.set(TypeIdx);
#endif		#endif
return TypeIdx;		return TypeIdx;
}		}
void markAllTypeIdxsAsCovered() {
		unsigned immIdx(unsigned ImmIdx) {
		assert(ImmIdx <= (MCOI::OPERAND_LAST_GENERIC_IMM -
		MCOI::OPERAND_FIRST_GENERIC_IMM) &&
		"Imm Index is out of bounds");
		#ifndef NDEBUG
		ImmIdxsCovered.set(ImmIdx);
		#endif
		return ImmIdx;
		}

		void markAllIdxsAsCovered() {
#ifndef NDEBUG		#ifndef NDEBUG
TypeIdxsCovered.set();		TypeIdxsCovered.set();
		ImmIdxsCovered.set();
#endif		#endif
}		}

void add(const LegalizeRule &Rule) {		void add(const LegalizeRule &Rule) {
assert(AliasOf == 0 &&		assert(AliasOf == 0 &&
"RuleSet is aliased, change the representative opcode instead");		"RuleSet is aliased, change the representative opcode instead");
Rules.push_back(Rule);		Rules.push_back(Rule);
}		}
▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines	#endif
/// Action should be an action that requires mutation.		/// Action should be an action that requires mutation.
LegalizeRuleSet &actionFor(LegalizeAction Action,		LegalizeRuleSet &actionFor(LegalizeAction Action,
std::initializer_list<std::pair<LLT, LLT>> Types,		std::initializer_list<std::pair<LLT, LLT>> Types,
LegalizeMutation Mutation) {		LegalizeMutation Mutation) {
using namespace LegalityPredicates;		using namespace LegalityPredicates;
return actionIf(Action, typePairInSet(typeIdx(0), typeIdx(1), Types),		return actionIf(Action, typePairInSet(typeIdx(0), typeIdx(1), Types),
Mutation);		Mutation);
}		}
		/// Use the given action when type index 0 is any type in the given list and
		/// imm index 0 is anything. Action should not be an action that requires
		/// mutation.
		LegalizeRuleSet &actionForTypeWithAnyImm(LegalizeAction Action,
		std::initializer_list<LLT> Types) {
		using namespace LegalityPredicates;
		immIdx(0); // Inform verifier imm idx 0 is handled.
		return actionIf(Action, typeInSet(typeIdx(0), Types));
		}
/// Use the given action when type indexes 0 and 1 are both in the given list.		/// Use the given action when type indexes 0 and 1 are both in the given list.
/// That is, the type pair is in the cartesian product of the list.		/// That is, the type pair is in the cartesian product of the list.
/// Action should not be an action that requires mutation.		/// Action should not be an action that requires mutation.
LegalizeRuleSet &actionForCartesianProduct(LegalizeAction Action,		LegalizeRuleSet &actionForCartesianProduct(LegalizeAction Action,
std::initializer_list<LLT> Types) {		std::initializer_list<LLT> Types) {
using namespace LegalityPredicates;		using namespace LegalityPredicates;
return actionIf(Action, all(typeInSet(typeIdx(0), Types),		return actionIf(Action, all(typeInSet(typeIdx(0), Types),
typeInSet(typeIdx(1), Types)));		typeInSet(typeIdx(1), Types)));
Show All 35 Lines	void aliasTo(unsigned Opcode) {
AliasOf = Opcode;		AliasOf = Opcode;
}		}
unsigned getAlias() const { return AliasOf; }		unsigned getAlias() const { return AliasOf; }

/// The instruction is legal if predicate is true.		/// The instruction is legal if predicate is true.
LegalizeRuleSet &legalIf(LegalityPredicate Predicate) {		LegalizeRuleSet &legalIf(LegalityPredicate Predicate) {
// We have no choice but conservatively assume that the free-form		// We have no choice but conservatively assume that the free-form
// user-provided Predicate properly handles all type indices:		// user-provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Legal, Predicate);		return actionIf(LegalizeAction::Legal, Predicate);
}		}
/// The instruction is legal when type index 0 is any type in the given list.		/// The instruction is legal when type index 0 is any type in the given list.
LegalizeRuleSet &legalFor(std::initializer_list<LLT> Types) {		LegalizeRuleSet &legalFor(std::initializer_list<LLT> Types) {
return actionFor(LegalizeAction::Legal, Types);		return actionFor(LegalizeAction::Legal, Types);
}		}
/// The instruction is legal when type indexes 0 and 1 is any type pair in the		/// The instruction is legal when type indexes 0 and 1 is any type pair in the
/// given list.		/// given list.
LegalizeRuleSet &legalFor(std::initializer_list<std::pair<LLT, LLT>> Types) {		LegalizeRuleSet &legalFor(std::initializer_list<std::pair<LLT, LLT>> Types) {
return actionFor(LegalizeAction::Legal, Types);		return actionFor(LegalizeAction::Legal, Types);
}		}
		/// The instruction is legal when type index 0 is any type in the given list
		/// and imm index 0 is anything.
		LegalizeRuleSet &legalForTypeWithAnyImm(std::initializer_list<LLT> Types) {
		markAllIdxsAsCovered();
		return actionForTypeWithAnyImm(LegalizeAction::Legal, Types);
		}
/// The instruction is legal when type indexes 0 and 1 along with the memory		/// The instruction is legal when type indexes 0 and 1 along with the memory
/// size and minimum alignment is any type and size tuple in the given list.		/// size and minimum alignment is any type and size tuple in the given list.
LegalizeRuleSet &legalForTypesWithMemDesc(		LegalizeRuleSet &legalForTypesWithMemDesc(
std::initializer_list<LegalityPredicates::TypePairAndMemDesc>		std::initializer_list<LegalityPredicates::TypePairAndMemDesc>
TypesAndMemDesc) {		TypesAndMemDesc) {
return actionIf(LegalizeAction::Legal,		return actionIf(LegalizeAction::Legal,
LegalityPredicates::typePairAndMemDescInSet(		LegalityPredicates::typePairAndMemDescInSet(
typeIdx(0), typeIdx(1), /MMOIdx/ 0, TypesAndMemDesc));		typeIdx(0), typeIdx(1), /MMOIdx/ 0, TypesAndMemDesc));
Show All 15 Lines	LegalizeRuleSet &legalForCartesianProduct(std::initializer_list<LLT> Types0,
std::initializer_list<LLT> Types1,		std::initializer_list<LLT> Types1,
std::initializer_list<LLT> Types2) {		std::initializer_list<LLT> Types2) {
return actionForCartesianProduct(LegalizeAction::Legal, Types0, Types1,		return actionForCartesianProduct(LegalizeAction::Legal, Types0, Types1,
Types2);		Types2);
}		}

LegalizeRuleSet &alwaysLegal() {		LegalizeRuleSet &alwaysLegal() {
using namespace LegalizeMutations;		using namespace LegalizeMutations;
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Legal, always);		return actionIf(LegalizeAction::Legal, always);
}		}

/// The instruction is lowered.		/// The instruction is lowered.
LegalizeRuleSet &lower() {		LegalizeRuleSet &lower() {
using namespace LegalizeMutations;		using namespace LegalizeMutations;
// We have no choice but conservatively assume that predicate-less lowering		// We have no choice but conservatively assume that predicate-less lowering
// properly handles all type indices by design:		// properly handles all type indices by design:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Lower, always);		return actionIf(LegalizeAction::Lower, always);
}		}
/// The instruction is lowered if predicate is true. Keep type index 0 as the		/// The instruction is lowered if predicate is true. Keep type index 0 as the
/// same type.		/// same type.
LegalizeRuleSet &lowerIf(LegalityPredicate Predicate) {		LegalizeRuleSet &lowerIf(LegalityPredicate Predicate) {
using namespace LegalizeMutations;		using namespace LegalizeMutations;
// We have no choice but conservatively assume that lowering with a		// We have no choice but conservatively assume that lowering with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Lower, Predicate);		return actionIf(LegalizeAction::Lower, Predicate);
}		}
/// The instruction is lowered if predicate is true.		/// The instruction is lowered if predicate is true.
LegalizeRuleSet &lowerIf(LegalityPredicate Predicate,		LegalizeRuleSet &lowerIf(LegalityPredicate Predicate,
LegalizeMutation Mutation) {		LegalizeMutation Mutation) {
// We have no choice but conservatively assume that lowering with a		// We have no choice but conservatively assume that lowering with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Lower, Predicate, Mutation);		return actionIf(LegalizeAction::Lower, Predicate, Mutation);
}		}
/// The instruction is lowered when type index 0 is any type in the given		/// The instruction is lowered when type index 0 is any type in the given
/// list. Keep type index 0 as the same type.		/// list. Keep type index 0 as the same type.
LegalizeRuleSet &lowerFor(std::initializer_list<LLT> Types) {		LegalizeRuleSet &lowerFor(std::initializer_list<LLT> Types) {
return actionFor(LegalizeAction::Lower, Types,		return actionFor(LegalizeAction::Lower, Types,
LegalizeMutations::changeTo(0, 0));		LegalizeMutations::changeTo(0, 0));
}		}
Show All 31 Lines	LegalizeRuleSet &lowerForCartesianProduct(std::initializer_list<LLT> Types0,
return actionForCartesianProduct(LegalizeAction::Lower, Types0, Types1,		return actionForCartesianProduct(LegalizeAction::Lower, Types0, Types1,
Types2);		Types2);
}		}

/// Like legalIf, but for the Libcall action.		/// Like legalIf, but for the Libcall action.
LegalizeRuleSet &libcallIf(LegalityPredicate Predicate) {		LegalizeRuleSet &libcallIf(LegalityPredicate Predicate) {
// We have no choice but conservatively assume that a libcall with a		// We have no choice but conservatively assume that a libcall with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Libcall, Predicate);		return actionIf(LegalizeAction::Libcall, Predicate);
}		}
LegalizeRuleSet &libcallFor(std::initializer_list<LLT> Types) {		LegalizeRuleSet &libcallFor(std::initializer_list<LLT> Types) {
return actionFor(LegalizeAction::Libcall, Types);		return actionFor(LegalizeAction::Libcall, Types);
}		}
LegalizeRuleSet &		LegalizeRuleSet &
libcallFor(std::initializer_list<std::pair<LLT, LLT>> Types) {		libcallFor(std::initializer_list<std::pair<LLT, LLT>> Types) {
return actionFor(LegalizeAction::Libcall, Types);		return actionFor(LegalizeAction::Libcall, Types);
Show All 9 Lines	public:
}		}

/// Widen the scalar to the one selected by the mutation if the predicate is		/// Widen the scalar to the one selected by the mutation if the predicate is
/// true.		/// true.
LegalizeRuleSet &widenScalarIf(LegalityPredicate Predicate,		LegalizeRuleSet &widenScalarIf(LegalityPredicate Predicate,
LegalizeMutation Mutation) {		LegalizeMutation Mutation) {
// We have no choice but conservatively assume that an action with a		// We have no choice but conservatively assume that an action with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::WidenScalar, Predicate, Mutation);		return actionIf(LegalizeAction::WidenScalar, Predicate, Mutation);
}		}
/// Narrow the scalar to the one selected by the mutation if the predicate is		/// Narrow the scalar to the one selected by the mutation if the predicate is
/// true.		/// true.
LegalizeRuleSet &narrowScalarIf(LegalityPredicate Predicate,		LegalizeRuleSet &narrowScalarIf(LegalityPredicate Predicate,
LegalizeMutation Mutation) {		LegalizeMutation Mutation) {
// We have no choice but conservatively assume that an action with a		// We have no choice but conservatively assume that an action with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::NarrowScalar, Predicate, Mutation);		return actionIf(LegalizeAction::NarrowScalar, Predicate, Mutation);
}		}

/// Add more elements to reach the type selected by the mutation if the		/// Add more elements to reach the type selected by the mutation if the
/// predicate is true.		/// predicate is true.
LegalizeRuleSet &moreElementsIf(LegalityPredicate Predicate,		LegalizeRuleSet &moreElementsIf(LegalityPredicate Predicate,
LegalizeMutation Mutation) {		LegalizeMutation Mutation) {
// We have no choice but conservatively assume that an action with a		// We have no choice but conservatively assume that an action with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::MoreElements, Predicate, Mutation);		return actionIf(LegalizeAction::MoreElements, Predicate, Mutation);
}		}
/// Remove elements to reach the type selected by the mutation if the		/// Remove elements to reach the type selected by the mutation if the
/// predicate is true.		/// predicate is true.
LegalizeRuleSet &fewerElementsIf(LegalityPredicate Predicate,		LegalizeRuleSet &fewerElementsIf(LegalityPredicate Predicate,
LegalizeMutation Mutation) {		LegalizeMutation Mutation) {
// We have no choice but conservatively assume that an action with a		// We have no choice but conservatively assume that an action with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::FewerElements, Predicate, Mutation);		return actionIf(LegalizeAction::FewerElements, Predicate, Mutation);
}		}

/// The instruction is unsupported.		/// The instruction is unsupported.
LegalizeRuleSet &unsupported() {		LegalizeRuleSet &unsupported() {
return actionIf(LegalizeAction::Unsupported, always);		return actionIf(LegalizeAction::Unsupported, always);
}		}
LegalizeRuleSet &unsupportedIf(LegalityPredicate Predicate) {		LegalizeRuleSet &unsupportedIf(LegalityPredicate Predicate) {
return actionIf(LegalizeAction::Unsupported, Predicate);		return actionIf(LegalizeAction::Unsupported, Predicate);
}		}
LegalizeRuleSet &unsupportedIfMemSizeNotPow2() {		LegalizeRuleSet &unsupportedIfMemSizeNotPow2() {
return actionIf(LegalizeAction::Unsupported,		return actionIf(LegalizeAction::Unsupported,
LegalityPredicates::memSizeInBytesNotPow2(0));		LegalityPredicates::memSizeInBytesNotPow2(0));
}		}

LegalizeRuleSet &customIf(LegalityPredicate Predicate) {		LegalizeRuleSet &customIf(LegalityPredicate Predicate) {
// We have no choice but conservatively assume that a custom action with a		// We have no choice but conservatively assume that a custom action with a
// free-form user provided Predicate properly handles all type indices:		// free-form user provided Predicate properly handles all type indices:
markAllTypeIdxsAsCovered();		markAllIdxsAsCovered();
return actionIf(LegalizeAction::Custom, Predicate);		return actionIf(LegalizeAction::Custom, Predicate);
}		}
LegalizeRuleSet &customFor(std::initializer_list<LLT> Types) {		LegalizeRuleSet &customFor(std::initializer_list<LLT> Types) {
return actionFor(LegalizeAction::Custom, Types);		return actionFor(LegalizeAction::Custom, Types);
}		}
LegalizeRuleSet &customForCartesianProduct(std::initializer_list<LLT> Types) {		LegalizeRuleSet &customForCartesianProduct(std::initializer_list<LLT> Types) {
return actionForCartesianProduct(LegalizeAction::Custom, Types);		return actionForCartesianProduct(LegalizeAction::Custom, Types);
}		}
▲ Show 20 Lines • Show All 214 Lines • ▼ Show 20 Lines	LegalizeRuleSet &fallback() {
add({always, LegalizeAction::UseLegacyRules});		add({always, LegalizeAction::UseLegacyRules});
return *this;		return *this;
}		}

/// Check if there is no type index which is obviously not handled by the		/// Check if there is no type index which is obviously not handled by the
/// LegalizeRuleSet in any way at all.		/// LegalizeRuleSet in any way at all.
/// \pre Type indices of the opcode form a dense [0, \p NumTypeIdxs) set.		/// \pre Type indices of the opcode form a dense [0, \p NumTypeIdxs) set.
bool verifyTypeIdxsCoverage(unsigned NumTypeIdxs) const;		bool verifyTypeIdxsCoverage(unsigned NumTypeIdxs) const;
		/// Check if there is no imm index which is obviously not handled by the
		/// LegalizeRuleSet in any way at all.
		/// \pre Type indices of the opcode form a dense [0, \p NumTypeIdxs) set.
		bool verifyImmIdxsCoverage(unsigned NumImmIdxs) const;

/// Apply the ruleset to the given LegalityQuery.		/// Apply the ruleset to the given LegalityQuery.
LegalizeActionStep apply(const LegalityQuery &Query) const;		LegalizeActionStep apply(const LegalityQuery &Query) const;
};		};

class LegalizerInfo {		class LegalizerInfo {
public:		public:
LegalizerInfo();		LegalizerInfo();
▲ Show 20 Lines • Show All 421 Lines • Show Last 20 Lines

llvm/include/llvm/CodeGen/GlobalISel/Utils.h

	Show First 20 Lines • Show All 137 Lines • ▼ Show 20 Lines

	/// Modify analysis usage so it preserves passes required for the SelectionDAG			/// Modify analysis usage so it preserves passes required for the SelectionDAG
	/// fallback.			/// fallback.
	void getSelectionDAGFallbackAnalysisUsage(AnalysisUsage &AU);			void getSelectionDAGFallbackAnalysisUsage(AnalysisUsage &AU);

	Optional<APInt> ConstantFoldBinOp(unsigned Opcode, const unsigned Op1,			Optional<APInt> ConstantFoldBinOp(unsigned Opcode, const unsigned Op1,
	const unsigned Op2,			const unsigned Op2,
	const MachineRegisterInfo &MRI);			const MachineRegisterInfo &MRI);

				Optional<APInt> ConstantFoldExtOp(unsigned Opcode, const unsigned Op1,
				uint64_t Imm, const MachineRegisterInfo &MRI);
	} // End namespace llvm.			} // End namespace llvm.
	#endif			#endif

llvm/include/llvm/MC/MCInstrDesc.h

Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	enum OperandType {
OPERAND_GENERIC_0 = 6,		OPERAND_GENERIC_0 = 6,
OPERAND_GENERIC_1 = 7,		OPERAND_GENERIC_1 = 7,
OPERAND_GENERIC_2 = 8,		OPERAND_GENERIC_2 = 8,
OPERAND_GENERIC_3 = 9,		OPERAND_GENERIC_3 = 9,
OPERAND_GENERIC_4 = 10,		OPERAND_GENERIC_4 = 10,
OPERAND_GENERIC_5 = 11,		OPERAND_GENERIC_5 = 11,
OPERAND_LAST_GENERIC = 11,		OPERAND_LAST_GENERIC = 11,

OPERAND_FIRST_TARGET = 12,		OPERAND_FIRST_GENERIC_IMM = 12,
		OPERAND_GENERIC_IMM_0 = 12,
		OPERAND_LAST_GENERIC_IMM = 12,

		OPERAND_FIRST_TARGET = 13,
};		};

}		}

/// This holds information about one operand of a machine instruction,		/// This holds information about one operand of a machine instruction,
/// indicating the register class for register operands, etc.		/// indicating the register class for register operands, etc.
class MCOperandInfo {		class MCOperandInfo {
public:		public:
Show All 30 Lines	bool isGenericType() const {
return OperandType >= MCOI::OPERAND_FIRST_GENERIC &&		return OperandType >= MCOI::OPERAND_FIRST_GENERIC &&
OperandType <= MCOI::OPERAND_LAST_GENERIC;		OperandType <= MCOI::OPERAND_LAST_GENERIC;
}		}

unsigned getGenericTypeIndex() const {		unsigned getGenericTypeIndex() const {
assert(isGenericType() && "non-generic types don't have an index");		assert(isGenericType() && "non-generic types don't have an index");
return OperandType - MCOI::OPERAND_FIRST_GENERIC;		return OperandType - MCOI::OPERAND_FIRST_GENERIC;
}		}

		bool isGenericImm() const {
		return OperandType >= MCOI::OPERAND_FIRST_GENERIC_IMM &&
		OperandType <= MCOI::OPERAND_LAST_GENERIC_IMM;
		}

		unsigned getGenericImmIndex() const {
		assert(isGenericImm() && "non-generic immediates don't have an index");
		return OperandType - MCOI::OPERAND_FIRST_GENERIC_IMM;
		}
};		};

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Machine Instruction Flags and Description		// Machine Instruction Flags and Description
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

namespace MCID {		namespace MCID {
/// These should be considered private to the implementation of the		/// These should be considered private to the implementation of the
▲ Show 20 Lines • Show All 489 Lines • Show Last 20 Lines

llvm/include/llvm/Support/TargetOpcodes.def

	Show First 20 Lines • Show All 345 Lines • ▼ Show 20 Lines
	/// Generic va_start instruction. Stores to its one pointer operand.			/// Generic va_start instruction. Stores to its one pointer operand.
	HANDLE_TARGET_OPCODE(G_VASTART)			HANDLE_TARGET_OPCODE(G_VASTART)

	/// Generic va_start instruction. Stores to its one pointer operand.			/// Generic va_start instruction. Stores to its one pointer operand.
	HANDLE_TARGET_OPCODE(G_VAARG)			HANDLE_TARGET_OPCODE(G_VAARG)

	// Generic sign extend			// Generic sign extend
	HANDLE_TARGET_OPCODE(G_SEXT)			HANDLE_TARGET_OPCODE(G_SEXT)
				HANDLE_TARGET_OPCODE(G_SEXT_INREG)

	// Generic zero extend			// Generic zero extend
	HANDLE_TARGET_OPCODE(G_ZEXT)			HANDLE_TARGET_OPCODE(G_ZEXT)

	// Generic left-shift			// Generic left-shift
	HANDLE_TARGET_OPCODE(G_SHL)			HANDLE_TARGET_OPCODE(G_SHL)

	// Generic logical right-shift			// Generic logical right-shift
	▲ Show 20 Lines • Show All 199 Lines • Show Last 20 Lines

llvm/include/llvm/Target/GenericOpcodes.td

	Show All 27 Lines
	// Sign extend the underlying scalar type of an operation, copying the sign bit			// Sign extend the underlying scalar type of an operation, copying the sign bit
	// into the newly-created space.			// into the newly-created space.
	def G_SEXT : GenericInstruction {			def G_SEXT : GenericInstruction {
	let OutOperandList = (outs type0:$dst);			let OutOperandList = (outs type0:$dst);
	let InOperandList = (ins type1:$src);			let InOperandList = (ins type1:$src);
	let hasSideEffects = 0;			let hasSideEffects = 0;
	}			}

				// Sign extend the a value from an arbitrary bit position, copying the sign bit
				// into all bits above it. This is equivalent to a shl + ashr pair with an
				// appropriate shift amount. $sz is an immediate (MachineOperand::isImm()
				// returns true) to allow targets to have some bitwidths legal and others
				// lowered. This opcode is particularly useful if the target has sign-extension
				rovkaUnsubmitted Done Reply Inline Actions This comment really needs to do a better job explaining the difference between G_SEXT and G_SEXT_INREG. It only covers the mechanical differences (i.e. that you have an immediate operand), but it says nothing about why this different opcode exists or where it would come from. Is the fact that the IRTranslator never creates such instructions relevant? Should we mention that it is only a legalization artifact? Targets can already say that G_SEXT for certain bitwidths is legal, why don't we just allow them to say which bitwidths should be lowered (instead of adding a new opcode)? rovka: This comment really needs to do a better job explaining the difference between G_SEXT and…
				dsandersAuthorUnsubmitted Done Reply Inline Actions This comment really needs to do a better job explaining the difference between G_SEXT and G_SEXT_INREG. It only covers the mechanical differences (i.e. that you have an immediate operand), but it says nothing about why this different opcode exists or where it would come from. Ok I can add to that Is the fact that the IRTranslator never creates such instructions relevant? No, who creates it is irrelevant to the operation of the instruction. There's no guarantee that the legalizer won't receive them as input. In the case of the IRTranslator, the IRTranslator could create it if it wanted but it's a simple 1:1 converter (for the most part) and chooses not to at the moment as there's no LLVM-IR equivalent. Target-specific passes are also free to create them. Should we mention that it is only a legalization artifact? It's (currently only) created by code that deals with legalization artifacts but it's not a legalization artifact itself. Targets can already say that G_SEXT for certain bitwidths is legal, why don't we just allow them to say which bitwidths should be lowered (instead of adding a new opcode)? It becomes important when you start optimizing with GlobalISel. Suppose that ARM's SXTB instruction has a latency of 1 and and LSL/ASR have a latency of 2 and that this includes forwarding paths in the hardware (if any). Having the signextend as a single atom in the MIR becomes useful for emitting the most efficient code since given code like: int foo(char a) { return (int)a << 2; } it's cheaper to emit: sxtb r0, r1 lsl r0, r0, #2 // 3 cycles than: lsl r0, r1, #16 asr r0, r0, #16 lsl r0, r0, #2 // 6 cycles even if you can exploit known-bits to emit: lsl r0, r1, #16 asr r0, r0, #14 // 4 cycles it would still be better to use the sxtb. The latter example also illustrates that optimization can make it hard to recognise sign-extension. It gets harder if you also reduce the strength of instructions (maybe lsl r0, r0, #1 is faster as add r0, r0, r0) and there's plenty of ways to make things even more difficult. Essentially, the more mangling the optimizer does while ignorant of the desirable code, the harder it is to select the optimal code at the end. dsanders: > This comment really needs to do a better job explaining the difference between G_SEXT and…
				rovkaUnsubmitted Done Reply Inline Actions Sorry, but I still don't get it. I understand why you're trying to avoid the shifts, what I don't understand is why adding this new node is the best solution. For one thing, the name is not very descriptive. I guess you just copied it from SelectionDAG, where you can actually constrain the source to match the destination. We don't do that here, so it's just confusing (I mean it sounds as if a legal G_SEXT would be going through memory or something). Secondly, it looks like what we need is just a way to tell the artifact combiner "don't turn sext into shifts on this target, for these sizes". Why don't we just use G_SEXT's legality for that? I.e. actually use the regular legality actions on G_SEXT directly instead of G_SEXT_INREG, and tell the combiner to not mess with G_SEXT with legal sizes. With G_SEXT_INREG as proposed in this patch, it looks like you're just moving the type legality problem into a value-of-immediate legality problem for which we need new infrastructure. I'm probably missing something, so please bear with me :) rovka: Sorry, but I still don't get it. I understand why you're trying to avoid the shifts, what I…
				dsandersAuthorUnsubmitted Done Reply Inline Actions Sorry, but I still don't get it. I understand why you're trying to avoid the shifts, what I don't understand is why adding this new node is the best solution. For one thing, the name is not very descriptive. I guess you just copied it from SelectionDAG, where you can actually constrain the source to match the destination. We don't do that here, so it's just confusing (I mean it sounds as if a legal G_SEXT would be going through memory or something). We actually do constrain the source and destination types. The constraint is specified here via type0:$dst and type0:$src where the use of the same type-index specifies a type matching constraint. It's tested on line 36 of llvm/test/MachineVerifier/test_g_sext_inreg.mir which is emitted when the types are not equal. We don't really need the message on line 38 as it's triggered by the subset of mismatches where they aren't even the same kind of type but it's somewhat useful to report how the types are different as well as that they're different. For the naming part of this, I couldn't think of a better name and sticking to SelectionDAG's name had some slight benefits in the sense that someone who knows the distinction in SelectionDAG would also know the distinction here as it's the same. The difference is that G_SEXT makes the container bigger and the newly-created bits are copies of the previous sign bit. With G_SEXT_INREG, the container remains the same size and a specified bit (Size-1) is replicated to all the bits to its left. As for where you'd use each one, G_SEXT_INREG is useful for cases where you don't want to handle the smaller type. For example, most upstream targets have legal operations for s32 and s64 and widen s1-s31 to s32 as well as s33-s63 to s64. However, they still have to support sign extension from say, s7 to s32 if the input IR had that. One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more memory than G_SEXT_INREG (which can add up if you do it a lot, e.g. for code heavily using short or char) but aside from that, it also means that all the register allocation code has to support s7. Similarly, spill/reload/move has to support s7, frame lowering has to support it. Instruction selection has to support it too which is a problem for imported SelectionDAG patterns as they can't describe s7 unless there's a register class with an i7 type which isn't possible as it isn't one of the MVT types. There's probably more but the point is that being able to eliminate some types simplifies the backend. You might think that this sounds like type legalization (and I'd be inclined to agree w.r.t the effect at least but I'd still call it operation legalization as the possible removal of types is a side-effect) but the key difference from SelectionDAG is that GlobalISel itself doesn't mandate it or treat it separately from operation legalization. If a target works in s8 a lot but doesn't really have s8 operations or registers, it can choose to make s8 operations legal anyway and trade more complexity in the backend for (hopefully) better code quality. Secondly, it looks like what we need is just a way to tell the artifact combiner "don't turn sext into shifts on this target, for these sizes". Why don't we just use G_SEXT's legality for that? I.e. actually use the regular legality actions on G_SEXT directly instead of G_SEXT_INREG, and tell the combiner to not mess with G_SEXT with legal sizes. With G_SEXT_INREG as proposed in this patch, it looks like you're just moving the type legality problem into a value-of-immediate legality problem for which we need new infrastructure. We want to eliminate the smaller types _and_ have a sign-extension operation which are mutually exclusive demands at the moment. It's not just about legalization though, it's more about the handling of optimization and instruction selection in all passes from the legalizer onwards (including target specific passes). In the previous example, I showed that hanging on to the knowledge that we had a sign-extension led to the optimal code. How do we hang on to that knowledge for as long as it's useful and only let go of that knowledge when it's beneficial to do so? Suppose we lowered our sign-extend to a lsl, ashr pair. There is (or rather, will be) lots of combines that know how to transform various shifts into other forms (not all of them shifts). Some use known-bits analysis to prove they're valid, some are much simpler. There's also lots of lowerings that do likewise and lots of other optimizations with various effects on shifts. Each and every one can potentially permanently remove our knowledge that we have a sign-extend operation and force us to use the slower code because we can't reconstruct the desired operation later. So how do we get our sign-extend past the combiners and other optimizers that only want to do their job? One answer to this is we teach every single one how to recognize a sign-extending-shift-pair and ask the target if it wants us to leave it alone. This gets impractical really quickly. Even assuming we can teach hundreds of optimizations to recognize dozens of conventional sign-extension patterns and all the target specific patterns in a reasonable way, we'd still be burning large amounts of compile-time checking for all the possible ways a sign-extend can be accomplished just to prevent undesirable optimizations from happening. A better answer is to form a higher-level 'composite' operation to smuggle it past all the combiners and optimizers we don't want to happen. This is what G_SEXT_INREG does. In this approach, it's cheap to determine that the undesirable combines/optimizations shouldn't happen because the opcode isn't the one they want. The downside is that any optimization you do want to happen needs to be taught to recognize the new opcode as well. This is much more managable than the alternative of teaching everything to reject everything they shouldn't change as a list of things they should do grows much slower than the list of things not to do. To put this in another context that doesn't have the legalizer baggage, consider byte swapping and let's pretend there's no intrinsic so we can only emit a byte swap instruction if we actually recognize a byteswap in the code. It's usually a pretty big win to emit a byte-swap instruction so we want to find as many as possible. Unfortunately, there are lots of ways to write byte swapping code and it's difficult to recognize even without optimizations getting in the way. The chances of still being able to recognize a byteswap after the optimizers have picked at the code are fairly low. Some of the masks, shifts, and ors may have been mangled or disappeared entirely. So when we do find one, we want to make sure it gets to the instruction selector in tact. Much like I described above, we form a composite operation from the masks, shifts, and ors the moment we see a byte-swap pattern (ideally before legalization) and smuggle the byte swap operation through to the instruction selector. If we didn't do that, the dozens of patterns to match would become hundreds by the time we reach isel. Another context that has the same principles behind it is bit-rotation. With G_SEXT_INREG as proposed in this patch, it looks like you're just moving the type legality problem into a value-of-immediate legality problem for which we need new infrastructure. I disagree with this summary as it's too focused on the legalizer. I believe I'm solving an information-preservation problem for the compilation pipeline by preventing the decomposition of a higher-level operation into lower-level components in cases where that information is still useful to optimal codegen I also disagree w.r.t to the legalizer but I might be being picky here. I would say I'm removing a requirement that all targets with sign-extension instructions which outperform shift-pairs make G_SEXT legal for all possible source types that benefit from that instruction (for AArch64, this means every type from s1 up to s63). In the context of the legalizer, this means being able to promote the source type without changing the operation and thereby making the operation specified by the opcode and immediate rather than the opcode and types. In terms of implementation, this does turn an operation legality problem from being about types to being about value-of-immediate which is pretty close to the way you stated it and is what makes me think I might be being picky. I do think there is a small distinction between the two though as the value-of-immediate bit falls out of excluding the types from the definition of the operation. I'm probably missing something, so please bear with me :) No worries :-) dsanders: > Sorry, but I still don't get it. I understand why you're trying to avoid the shifts, what I…
				rovkaUnsubmitted Done Reply Inline Actions We actually do constrain the source and destination types. The constraint is specified here via type0:$dst and type0:$src where the use of the same type-index specifies a type matching constraint. It's tested on line 36 of llvm/test/MachineVerifier/test_g_sext_inreg.mir which is emitted when the types are not equal. We don't really need the message on line 38 as it's triggered by the subset of mismatches where they aren't even the same kind of type but it's somewhat useful to report how the types are different as well as that they're different. Sorry, I had seen all that, but I thought the DAG constraint referred to the actual register as well, not just the type. I guess the name is at least not worse then, and I can't think of a better one either. One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more memory than G_SEXT_INREG (which can add up if you do it a lot, e.g. for code heavily using short or char). Fair enough, but if that's the only good argument then this is premature optimization. [...] it also means that all the register allocation code has to support s7. Similarly, spill/reload/move has to support s7, frame lowering has to support it. You mean register bank selection? Otherwise, doing register allocation before instruction select is kind of a big difference from what happens in the upstream targets. Anyway, any passes would only have to support such types for G_TRUNC and G_SEXT, not for everything (just like any other legal operation). If we introduce G_SEXT_INREG, they now also have to support G_SEXT_INREG in addition to G_SEXT, since you can't guarantee that after the legalization we won't have any G_SEXT left. Instruction selection has to support it too which is a problem for imported SelectionDAG patterns as they can't describe s7 unless there's a register class with an i7 type which isn't possible as it isn't one of the MVT types. This is a good point, but then again if a target wants to keep those instructions unlowered, then they are legal and should be selected somehow. We should teach TableGen to handle such situations rather than hang on to whatever limitations SelectionDAG had. There's probably more but the point is that being able to eliminate some types simplifies the backend. You might think that this sounds like type legalization (and I'd be inclined to agree w.r.t the effect at least but I'd still call it operation legalization as the possible removal of types is a side-effect) but the key difference from SelectionDAG is that GlobalISel itself doesn't mandate it or treat it separately from operation legalization. If a target works in s8 a lot but doesn't really have s8 operations or registers, it can choose to make s8 operations legal anyway and trade more complexity in the backend for (hopefully) better code quality. It's a trade-off, fewer types is a simplification, but more opcodes isn't. Having to always keep in mind both G_SEXT and G_SEXT_INREG is going to be a burden for maintainers. In a sense, this is worse than type legalization, because after type legalization you were certain any funny types were gone, but now they may or may not have all been eaten up by G_SEXT_INREG and G_SEXTLOAD, so you may or may not need to worry about G_SEXT, depending on whether or not you left any legal producer for a certain type. It seems easier to shoot yourself in the foot. Suppose we lowered our sign-extend to a lsl, ashr pair. That means we're not interested in it, otherwise we could mark it as legal and select it to whatever better sequence we know in the instruction select or some other smarter pass. A better answer is to form a higher-level 'composite' operation to smuggle it past all the combiners and optimizers we don't want to happen. This is what G_SEXT_INREG does. I'm still not convinced. This could be true for any other operation that can be lowered. You wouldn't propose adding G_SREM_FOR_REAL versus G_SREM just because some targets don't want to lower it, right? They'd just have to mark it as legal or custom. I disagree with this summary as it's too focused on the legalizer. I believe I'm solving an information-preservation problem for the compilation pipeline by preventing the decomposition of a higher-level operation into lower-level components in cases where that information is still useful to optimal codegen You could also solve it by keeping the type, which would be a more honest representation imo. I also disagree w.r.t to the legalizer but I might be being picky here. I would say I'm removing a requirement that all targets with sign-extension instructions which outperform shift-pairs make G_SEXT legal for all possible source types that benefit from that instruction (for AArch64, this means every type from s1 up to s63). In the context of the legalizer, this means being able to promote the source type without changing the operation and thereby making the operation specified by the opcode and immediate rather than the opcode and types. In terms of implementation, this does turn an operation legality problem from being about types to being about value-of-immediate which is pretty close to the way you stated it and is what makes me think I might be being picky. I do think there is a small distinction between the two though as the value-of-immediate bit falls out of excluding the types from the definition of the operation. So, you're only removing the requirement because types are implicitly illegal (unless the target says otherwise), whereas immediate values are implicitly legal. This means that types force you to think about whether or not something should be legal, whereas immediates will just sneak through. At any rate this is something that hasn't really been discussed. rovka: > We actually do constrain the source and destination types. The constraint is specified here…
				dsandersAuthorUnsubmitted Done Reply Inline Actions One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more memory than G_SEXT_INREG (which can add up if you do it a lot, e.g. for code heavily using short or char). Fair enough, but if that's the only good argument then this is premature optimization. Yep, it's a nice bonus rather than the motivation. [...] it also means that all the register allocation code has to support s7. Similarly, spill/reload/move has to support s7, frame lowering has to support it. You mean register bank selection? Otherwise, doing register allocation before instruction select is kind of a big difference from what happens in the upstream targets. Both unless there's a pass that strips out the s7 types. InstructionSelection doesn't normally change the types so any that get there will also pass through to the next pass. RegisterAllocation would be the pass that assigns 32-bit registers to the 7-bit type. Anyway, any passes would only have to support such types for G_TRUNC and G_SEXT, not for everything (just like any other legal operation). It depends on the pass. Some can limit their support to G_TRUNC/G_SEXT but others don't really work with the operation. For example, Frame lowering needs to know how to promote the s7 to s8 both for storage allocation and for emitting code to load/store from/to a given frame index. Register Allocation needs to know which registers are appropriate for an s7. Also, the list of types can potentially be large. For a 64-bit target, we have to ensure those later passes can deal with s1 through s64 unless a pass like the legalizer limits it. If we introduce G_SEXT_INREG, they now also have to support G_SEXT_INREG in addition to G_SEXT, since you can't guarantee that after the legalization we won't have any G_SEXT left. That's partly true. Targets that don't have `getActionDefinitionBuilder(G_SEXT_INREG).lower()` in their legalizer have to support both to whatever degree the legalization rules say are legal. There's no `lower()` for G_SEXT yet but it would be a G_ANYEXT followed by G_SEXT_INREG (which could further lower to a shift pair) which would allow us to limit support to G_ANYEXT (trivial) and G_SEXT_INREG. Those that do have `getActionDefinitionBuilder(G_SEXT_INREG).lower()` only need to support G_SEXT as G_SEXT_INREG will be lowered to G_SHL/G_ASHR by the legalizer and produce the same legal MIR we have today. Instruction selection has to support it too which is a problem for imported SelectionDAG patterns as they can't describe s7 unless there's a register class with an i7 type which isn't possible as it isn't one of the MVT types. This is a good point, but then again if a target wants to keep those instructions unlowered, then they are legal and should be selected somehow. We should teach TableGen to handle such situations rather than hang on to whatever limitations SelectionDAG had. I agree. It's not feasible in the SelectionDAG patterns we import (it requires changing the type inferencing engine in a big way). For now, C++ is the only way but hopefully we'll have GlobalISel rules at some point There's probably more but the point is that being able to eliminate some types simplifies the backend. You might think that this sounds like type legalization (and I'd be inclined to agree w.r.t the effect at least but I'd still call it operation legalization as the possible removal of types is a side effect) but the key difference from SelectionDAG is that GlobalISel itself doesn't mandate it or treat it separately from operation legalization. If a target works in s8 a lot but doesn't really have s8 operations or registers, it can choose to make s8 operations legal anyway and trade more complexity in the backend for (hopefully) better code quality. It's a trade-off, fewer types is a simplification, but more opcodes isn't. Having to always keep in mind both G_SEXT and G_SEXT_INREG is going to be a burden for maintainers. I agree it's a trade-off and the balance of it will differ between different targets. Where each target falls on that scale will depend on how much of a win selecting a specialized sign-extend instruction is compared to not doing it. Another factor that's important is how difficult it is to recognize a sign-extend after the optimizer has worked on the IR which ranges from easy (the optimizer didn't do much), to tricky (the optimizer mangled the pattern but it's still just about recognizable if you cover all the possibilities), to impossible (it's different code that happens to sign-extend as well). For the targets I'm interested in G_SEXT_INREG is well worth the price and I believe that other targets will land on the same end of the scale once they're optimizing more heavily. In a sense, this is worse than type legalization, because after type legalization you were certain any funny types were gone, but now they may or may not have all been eaten up by G_SEXT_INREG and G_SEXTLOAD, so you may or may not need to worry about G_SEXT, depending on whether or not you left any legal producer for a certain type. It seems easier to shoot yourself in the foot. It's fairly simple to catch the gaps in a legalization ruleset that aims to eliminate certain types. If any operation involving a given type is legal then G_ANYEXT and G_TRUNC for that type must also be legal. If either of those two are marked unsupported for a given type then the legalizer will fail on the IR and you can tell something is missing from the ruleset. Aside from that, being consistent with .clampScalar() and similar is the way to ensure certain types get eliminated. Suppose we lowered our sign-extend to a lsl, ashr pair. That means we're not interested in it, otherwise we could mark it as legal and select it to whatever better sequence we know in the instruction select or some other smarter pass. Not necessarily. It happens whenever the artifact combiner combines (sext (trunc x)). The reason this happens is because after merging those artifacts together, there's no way to represent the sign-extension other than the G_SHL/G_ASHR pair (or G_SEXT_INREG) but the operation is still required to happen. If the artifact combiner didn't do this then it would be impossible to eliminate types. A better answer is to form a higher-level 'composite' operation to smuggle it past all the combiners and optimizers we don't want to happen. This is what G_SEXT_INREG does. I'm still not convinced. This could be true for any other operation that can be lowered. You wouldn't propose adding G_SREM_FOR_REAL versus G_SREM just because some targets don't want to lower it, right? They'd just have to mark it as legal or custom. It is true for any other operation that can be lowered and is important for performance. Whether I'd propose it for the G_* namespace upstream largely depends on whether I think it has general applicability to other targets. I've been meaning to propose widening multiplies for a while (e.g. 32x32 -> 64) as many targets have these but I wouldn't propose a G_FFT because only DSP's have a potential use for one (and even then, why not use G_INTRINSIC). For situations where only one or a couple backends would be interested, I recommend legalizing or combining to what we've been calling `target-specific generic instructions` (which is a _terrible_ name) which is essentially a target-pseudo that uses the type[0-9] constraints. The effect is very much like a generic instruction, but is target specific. I disagree with this summary as it's too focused on the legalizer. I believe I'm solving an information-preservation problem for the compilation pipeline by preventing the decomposition of a higher-level operation into lower-level components in cases where that information is still useful to optimal codegen You could also solve it by keeping the type, which would be a more honest representation imo. I disagree that that's a more honest representation. It starts off honest but it becomes a lie as the compilation pipeline progresses. At the IR Translator stage, I agree that keeping the type is the better representation but once I'm shaping the MIR to suit my target (the legalizer being one of the bigger passes that does that but they all do it to some degree) that opinion changes. It takes on more target specific impurities until we get to a purely target dependent representation. Swapping out G_SEXT on non-existant types for a G_SEXT_INREG or G_SHL/G_ASHR pair is changing honesty w.r.t target independence to honesty w.r.t the target. In the same way, I swap out generic opcodes for target-specific-generic-opcodes and target-instructions as the pipeline progresses. This one just has more common ground with other targets than most. I also disagree w.r.t to the legalizer but I might be being picky here. I would say I'm removing a requirement that all targets with sign-extension instructions which outperform shift-pairs make G_SEXT legal for all possible source types that benefit from that instruction (for AArch64, this means every type from s1 up to s63). In the context of the legalizer, this means being able to promote the source type without changing the operation and thereby making the operation specified by the opcode and immediate rather than the opcode and types. In terms of implementation, this does turn an operation legality problem from being about types to being about value-of-immediate which is pretty close to the way you stated it and is what makes me think I might be being picky. I do think there is a small distinction between the two though as the value-of-immediate bit falls out of excluding the types from the definition of the operation. So, you're only removing the requirement because types are implicitly illegal (unless the target says otherwise), whereas immediate values are implicitly legal. This means that types force you to think about whether or not something should be legal, whereas immediates will just sneak through. At any rate this is something that hasn't really been discussed. Could you elaborate on why you think immediates are implicitly legal? Their legality is specified by the legalization rules. AArch64 happens to support all of them in D61290 so it says legal without even looking at the immediate but LegalizerHelperTest.cpp in D61290 has an example where only certain immediates are legal. I think you have to think about what is legal either way and define your legalization rules accordingly to consume any input and produce a legal output. If your target doesn't have fast sign-extend instructions then use `.lower()`, if you only have ones for s8 and s16 to s32 then use `.legalForTypeWithImm({s32, 8}, {s32, 16}).clampScalar(0, s32, s32).lower()`, if you can handle any sX to s32 then use `.legalFor({s32}).clampScalar(0, s32, s32).lower()` I would say I'm removing the requirement because: Backends should be able to constrain the types they have to support. They shouldn't have to support all types smaller than register width just to handle G_SEXT. Backends shouldn't be required to have G_SEXT legal for every type combination smaller than register width. Changing value sizes isn't the only way to get a sign-extension (e.g. (x << 24) >> 24 in C/C++) It simplifies the optimizers because they can easily identify sign-extends cases which may cause them to harm rather than improve the code It allows the compiler to produce better code It allows the MIR to more accurately reflect the target dsanders: > > One way to achieve that is to use s32 -> G_TRUNC -> s7 -> G_SEXT -> s32. This costs more…
				rovkaUnsubmitted Done Reply Inline Actions Could you elaborate on why you think immediates are implicitly legal? Their legality is specified by the legalization rules. Sorry about being unclear, let me give it another try. AArch64 happens to support all of them in D61290 so it says legal without even looking at the immediate [...] That's exactly what I meant, that it says legal without even looking at the immediate. I haven't looked at D61290 in too much detail, so excuse me if I'm misunderstanding again. Suppose you had a G_OP with Type0 and Type1, if you say .legalFor({s32}) and just that, you'll get an error because you're not covering Type1 with your rules. From your comments I got the impression that for a G_OP2 with Type0 and Imm1, if you say .legalFor({s32}) then that's enough and all values of Imm1 will just be legal. If that's not the case, then I take this comment back. rovka: > Could you elaborate on why you think immediates are implicitly legal? Their legality is…
				dsandersAuthorUnsubmitted Done Reply Inline Actions Ah ok, you're just looking for the rule verifier to check that each target looked at it like it does for type indices. Sure, I can add that. For AArch64 we'd say 1-63 is legal (i.e. every valid immediate for an s64 G_SEXT_INREG) because the SBFMX instruction handles them all but that still counts as looking at it and the verifier would still reject it if we didn't at least look. dsanders: Ah ok, you're just looking for the rule verifier to check that each target looked at it like it…
				// instructions that are cheaper than the constituent shifts as the optimizer is
				// able to make decisions on whether it's better to hang on to the G_SEXT_INREG
				// or to lower it and optimize the individual shifts.
				rovkaUnsubmitted Done Reply Inline Actions Nitpick: since you're adding support for imm everywhere else, it would be nice if we could say "imm:$sz" instead of "unknown:$sz" here. rovka: Nitpick: since you're adding support for imm everywhere else, it would be nice if we could say…
				dsandersAuthorUnsubmitted Done Reply Inline Actions I agree. I've added an untyped_imm which for all functional purposes w.r.t the type constraint system is the same as unknown but at least documents that we expect an immediate. dsanders: I agree. I've added an untyped_imm which for all functional purposes w.r.t the type constraint…
				def G_SEXT_INREG : GenericInstruction {
				let OutOperandList = (outs type0:$dst);
				let InOperandList = (ins type0:$src, untyped_imm_0:$sz);
				arsenmUnsubmitted Done Reply Inline Actions Are tablegen emitter changes needed for this? I was trying to add an immediate operand in D64054, which seemed to not work correctly arsenm: Are tablegen emitter changes needed for this? I was trying to add an immediate operand in…
				dsandersAuthorUnsubmitted Done Reply Inline Actions You mean the untyped_imm_0? It's needed to drive the verifier that Diana requested and tells it which immediate checks it needs to look for dsanders: You mean the untyped_imm_0? It's needed to drive the verifier that Diana requested and tells it…
				arsenmUnsubmitted Done Reply Inline Actions Yes. As far as I can tell the emitter will try looking for G_CONSTANT defined register for this. I agree there should be a special immediate operand for this, but I don't think this will work in a tablegen pattern as-is. The current form has a ValueType leaf, so matching the immediate wouldn't be quite the same. arsenm: Yes. As far as I can tell the emitter will try looking for G_CONSTANT defined register for this.
				dsandersAuthorUnsubmitted Done Reply Inline Actions I'm not sure I'm following. InOperandList in a GenericInstruction specifies the type constraints and uses special TypeOperand subclasses. They don't factor into tablegen patterns except in so far as requiring that types match. dsanders: I'm not sure I'm following. InOperandList in a GenericInstruction specifies the type…
				arsenmUnsubmitted Not Done Reply Inline Actions I mean I believe there's no way to define a node that will be capable of matching this instruction arsenm: I mean I believe there's no way to define a node that will be capable of matching this…
				dsandersAuthorUnsubmitted Done Reply Inline Actions Ah I see. Yes, we'll need to map sext_inreg to G_SEXT_INREG and add some special handling to convert the MVT operand to an immediate. dsanders: Ah I see. Yes, we'll need to map sext_inreg to G_SEXT_INREG and add some special handling to…
				let hasSideEffects = 0;
				}

	// Zero extend the underlying scalar type of an operation, putting zero bits			// Zero extend the underlying scalar type of an operation, putting zero bits
	// into the newly-created space.			// into the newly-created space.
	def G_ZEXT : GenericInstruction {			def G_ZEXT : GenericInstruction {
	let OutOperandList = (outs type0:$dst);			let OutOperandList = (outs type0:$dst);
	let InOperandList = (ins type1:$src);			let InOperandList = (ins type1:$src);
	let hasSideEffects = 0;			let hasSideEffects = 0;
	}			}

	▲ Show 20 Lines • Show All 798 Lines • Show Last 20 Lines

llvm/include/llvm/Target/Target.td

	Show First 20 Lines • Show All 811 Lines • ▼ Show 20 Lines
	}			}

	// Register operands for generic instructions don't have an MVT, but do have			// Register operands for generic instructions don't have an MVT, but do have
	// constraints linking the operands (e.g. all operands of a G_ADD must			// constraints linking the operands (e.g. all operands of a G_ADD must
	// have the same LLT).			// have the same LLT).
	class TypedOperand<string Ty> : Operand<untyped> {			class TypedOperand<string Ty> : Operand<untyped> {
	let OperandType = Ty;			let OperandType = Ty;
	bit IsPointer = 0;			bit IsPointer = 0;
				bit IsImmediate = 0;
	}			}

	def type0 : TypedOperand<"OPERAND_GENERIC_0">;			def type0 : TypedOperand<"OPERAND_GENERIC_0">;
	def type1 : TypedOperand<"OPERAND_GENERIC_1">;			def type1 : TypedOperand<"OPERAND_GENERIC_1">;
	def type2 : TypedOperand<"OPERAND_GENERIC_2">;			def type2 : TypedOperand<"OPERAND_GENERIC_2">;
	def type3 : TypedOperand<"OPERAND_GENERIC_3">;			def type3 : TypedOperand<"OPERAND_GENERIC_3">;
	def type4 : TypedOperand<"OPERAND_GENERIC_4">;			def type4 : TypedOperand<"OPERAND_GENERIC_4">;
	def type5 : TypedOperand<"OPERAND_GENERIC_5">;			def type5 : TypedOperand<"OPERAND_GENERIC_5">;

	let IsPointer = 1 in {			let IsPointer = 1 in {
	def ptype0 : TypedOperand<"OPERAND_GENERIC_0">;			def ptype0 : TypedOperand<"OPERAND_GENERIC_0">;
	def ptype1 : TypedOperand<"OPERAND_GENERIC_1">;			def ptype1 : TypedOperand<"OPERAND_GENERIC_1">;
	def ptype2 : TypedOperand<"OPERAND_GENERIC_2">;			def ptype2 : TypedOperand<"OPERAND_GENERIC_2">;
	def ptype3 : TypedOperand<"OPERAND_GENERIC_3">;			def ptype3 : TypedOperand<"OPERAND_GENERIC_3">;
	def ptype4 : TypedOperand<"OPERAND_GENERIC_4">;			def ptype4 : TypedOperand<"OPERAND_GENERIC_4">;
	def ptype5 : TypedOperand<"OPERAND_GENERIC_5">;			def ptype5 : TypedOperand<"OPERAND_GENERIC_5">;
	}			}

				// untyped_imm is for operands where isImm() will be true. It currently has no
				rovkaUnsubmitted Done Reply Inline Actions "has no" ... ? rovka: "has no" ... ?
				dsandersAuthorUnsubmitted Done Reply Inline Actions Umm, I'm not sure what happened there. It was supposed to have 'special behaviour' after that. Fixed it dsanders: Umm, I'm not sure what happened there. It was supposed to have 'special behaviour' after that.
				// special behaviour and is only used for clarity.
				rovkaUnsubmitted Done Reply Inline Actions How about "is only used for clarity"? rovka: How about "is only used for clarity"?
				dsandersAuthorUnsubmitted Done Reply Inline Actions That sounds better to me. Done dsanders: That sounds better to me. Done
				rovkaUnsubmitted Not Done Reply Inline Actions That sounds better to me. Done "That" = the old version? I don't mind either way, just making sure you didn't miss it by mistake. rovka: > That sounds better to me. Done "That" = the old version? I don't mind either way, just…
				dsandersAuthorUnsubmitted Done Reply Inline Actions I meant 'is only used for clarity' sounds better. I made the change in my working copy dsanders: I meant 'is only used for clarity' sounds better. I made the change in my working copy
				def untyped_imm_0 : TypedOperand<"OPERAND_GENERIC_IMM_0"> {
				let IsImmediate = 1;
				}

	/// zero_reg definition - Special node to stand for the zero register.			/// zero_reg definition - Special node to stand for the zero register.
	///			///
	def zero_reg;			def zero_reg;

	/// All operands which the MC layer classifies as predicates should inherit from			/// All operands which the MC layer classifies as predicates should inherit from
	/// this class in some manner. This is already handled for the most commonly			/// this class in some manner. This is already handled for the most commonly
	/// used PredicateOperand, but may be useful in other circumstances.			/// used PredicateOperand, but may be useful in other circumstances.
	class PredicateOp;			class PredicateOp;
	▲ Show 20 Lines • Show All 734 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/CSEMIRBuilder.cpp

Show First 20 Lines • Show All 156 Lines • ▼ Show 20 Lines	case TargetOpcode::G_SREM: {
// Try to constant fold these.		// Try to constant fold these.
assert(SrcOps.size() == 2 && "Invalid sources");		assert(SrcOps.size() == 2 && "Invalid sources");
assert(DstOps.size() == 1 && "Invalid dsts");		assert(DstOps.size() == 1 && "Invalid dsts");
if (Optional<APInt> Cst = ConstantFoldBinOp(Opc, SrcOps[0].getReg(),		if (Optional<APInt> Cst = ConstantFoldBinOp(Opc, SrcOps[0].getReg(),
SrcOps[1].getReg(), *getMRI()))		SrcOps[1].getReg(), *getMRI()))
return buildConstant(DstOps[0], Cst->getSExtValue());		return buildConstant(DstOps[0], Cst->getSExtValue());
break;		break;
}		}
		case TargetOpcode::G_SEXT_INREG: {
		assert(DstOps.size() == 1 && "Invalid dst ops");
		assert(SrcOps.size() == 2 && "Invalid src ops");
		const DstOp &Dst = DstOps[0];
		const SrcOp &Src0 = SrcOps[0];
		const SrcOp &Src1 = SrcOps[1];
		if (auto MaybeCst = ConstantFoldExtOp(
		Opc, Src0.getReg(), Src1.getImm().getSExtValue(), *getMRI()))
		return buildConstant(Dst, MaybeCst->getSExtValue());
		break;
		}
}		}
bool CanCopy = checkCopyToDefsPossible(DstOps);		bool CanCopy = checkCopyToDefsPossible(DstOps);
if (!canPerformCSEForOpc(Opc))		if (!canPerformCSEForOpc(Opc))
return MachineIRBuilder::buildInstr(Opc, DstOps, SrcOps, Flag);		return MachineIRBuilder::buildInstr(Opc, DstOps, SrcOps, Flag);
// If we can CSE this instruction, but involves generating copies to multiple		// If we can CSE this instruction, but involves generating copies to multiple
// regs, give up. This frequently happens to UNMERGEs.		// regs, give up. This frequently happens to UNMERGEs.
if (!CanCopy) {		if (!CanCopy) {
auto MIB = MachineIRBuilder::buildInstr(Opc, DstOps, SrcOps, Flag);		auto MIB = MachineIRBuilder::buildInstr(Opc, DstOps, SrcOps, Flag);
▲ Show 20 Lines • Show All 72 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp

Show First 20 Lines • Show All 689 Lines • ▼ Show 20 Lines	LegalizerHelper::LegalizeResult LegalizerHelper::narrowScalar(MachineInstr &MI,
case TargetOpcode::G_PTRTOINT:		case TargetOpcode::G_PTRTOINT:
if (TypeIdx != 0)		if (TypeIdx != 0)
return UnableToLegalize;		return UnableToLegalize;

Observer.changingInstr(MI);		Observer.changingInstr(MI);
narrowScalarDst(MI, NarrowTy, 0, TargetOpcode::G_ZEXT);		narrowScalarDst(MI, NarrowTy, 0, TargetOpcode::G_ZEXT);
Observer.changedInstr(MI);		Observer.changedInstr(MI);
return Legalized;		return Legalized;
		case TargetOpcode::G_SEXT_INREG: {
		Petar.AvramovicUnsubmitted Not Done Reply Inline Actions NarrowScalar is good candidate for separate patch. Reason is that artifact combiner currently can not handle test that would use G_SEXT_INREG since it will have chained artifacts. G_SEXT will be G_UNMERGE-d, G_UNMERGE has to wait for: G_SEXT to be combined into G_SEXT_INREG, and then for G_SEXT_INREG to be narrowScalared into sequence of instructions that will end with G_MERGE_VALUES. Only at this point G_UNMERGE can be combined. Petar.Avramovic: NarrowScalar is good candidate for separate patch. Reason is that artifact combiner currently…
		dsandersAuthorUnsubmitted Done Reply Inline Actions There's a test for narrowScalar of G_SEXT_INREG in this patch. Could you elaborate on what kind of test you're looking for? I think you're looking for a test with neighbouring instructions which are also narrowScalar'd dsanders: There's a test for narrowScalar of G_SEXT_INREG in this patch. Could you elaborate on what kind…
		Petar.AvramovicUnsubmitted Not Done Reply Inline Actions Yes. For example with this test on aarch64, store is narrowScalared: define void @func(i8 %x, i128* %p) { entry: %conv = sext i8 %x to i128 store i128 %conv, i128* %p ret void } Legalizer is not able to produce something like this %2:_(s32) = COPY $w0 %1:_(p0) = COPY $x1 %11:_(s64) = G_CONSTANT i64 63 %12:_(s64) = G_SEXT_INREG %2, 8 %13:_(s64) = G_ASHR %12, %11(s64) G_STORE %12(s64), %1(p0) :: (store 8 into %ir.p, align 16) %7:_(s64) = G_CONSTANT i64 8 %6:_(p0) = G_GEP %1, %7(s64) G_STORE %13(s64), %6(p0) :: (store 8 into %ir.p + 8, align 16) RET_ReallyLR which I assume is a desired output. Petar.Avramovic: Yes. For example with this test on aarch64, store is narrowScalared: ``` define void @func…
		dsandersAuthorUnsubmitted Done Reply Inline Actions Ah ok. The issue there is that G_STORE hasn't implemented narrowScalar yet. I don't think that we should prevent any opcode from getting a narrowScalar implementation until all opcodes have one. I think we should be implementing them as we need them and gradually build up the full set over time. dsanders: Ah ok. The issue there is that G_STORE hasn't implemented narrowScalar yet. I don't think that…
		Petar.AvramovicUnsubmitted Not Done Reply Inline Actions The problem (if we use narrow scalar with G_MERGE/G_UNMERGE) is the order in which we attempt to combine artifacts in Legalizer. D61787 and narrow scalar for G_ANYEXT(with G_MERGE/G_UNMERGE) will be able to give output like this. Petar.Avramovic: The problem (if we use narrow scalar with G_MERGE/G_UNMERGE) is the order in which we attempt…
		dsandersAuthorUnsubmitted Done Reply Inline Actions I don't understand this. What is the issue with the order artifacts are combined? dsanders: I don't understand this. What is the issue with the order artifacts are combined?
		Petar.AvramovicUnsubmitted Done Reply Inline Actions Artifact might have to wait for another instruction to be legalized (with e.g. narrowScalar) and only then perform combine. For more detail look at D61787, it covers artifact combines available at the moment. Petar.Avramovic: Artifact might have to wait for another instruction to be legalized (with e.g. narrowScalar)…
		if (TypeIdx != 0)
		return UnableToLegalize;

		if (!MI.getOperand(2).isImm())
		return UnableToLegalize;
		int64_t SizeInBits = MI.getOperand(2).getImm();

		// So long as the new type has more bits than the bits we're extending we
		rovkaUnsubmitted Done Reply Inline Actions Typo: is has. rovka: Typo: is has.
		// don't need to break it apart.
		if (NarrowTy.getScalarSizeInBits() >= SizeInBits) {
		Petar.AvramovicUnsubmitted Not Done Reply Inline Actions What is the idea behind this block, we generate two instructions with types larger then NarrowTy? G_TRUNC and G_SEXT that are generated have to be narrowScalared and then we have a few more merge/unmerge combines to do. Also LegalizerHelper does not know how to narrow scalar G_TRUNC and G_SEXT at the moment. Petar.Avramovic: What is the idea behind this block, we generate two instructions with types larger then…
		dsandersAuthorUnsubmitted Done Reply Inline Actions It's not possible to eliminate types larger than NarrowTy with a single narrowScalar. narrowScalar's job is to replace this instruction with one or more that work on NarrowTy sized types and some legalization artifacts (G_SEXT and G_TRUNC in this case) which are required to maintain type correctness. Those legalization artifacts are either combined away or further legalized according to their legalization rules. This block aims to optimize the case where all but one of the narrowed components is a wholly a copy of the sign bit. It expects the G_TRUNC and G_SEXT to be either combined away by the artifact combiner or further legalized. For example: narrowing `%0:_(s32) = G_SEXT %1:_(s32), 8` down to s16 only requires us to preserve the component containing bits 0-15. We can forget about bits 16-31 and reconstruct them with the G_SEXT. This will help chains of instructions to be deleted as they are dead. dsanders: It's not possible to eliminate types larger than NarrowTy with a single narrowScalar.
		Petar.AvramovicUnsubmitted Not Done Reply Inline Actions I did not consider situation when G_SEXT(G_SEXT_INREG) could combine away. Currently G_SEXT can only combine into G_TRUNC. The use of this G_SEXT is something legal or G_MERGE that is waiting for G_UNMERGE. We know this since uses are legalized before defs (G_SEXT is def in this context). Yet again if use is legal, then G_SEXT_INREG was legal or something else like lower but not narrow scalar. Are we talking about artifact combiner, or something that is not in tree? Could you provide a test where G_SEXT and G_TRUNC produced with this narrowScalar combine away? Petar.Avramovic: I did not consider situation when G_SEXT(G_SEXT_INREG) could combine away. Currently G_SEXT…
		dsandersAuthorUnsubmitted Done Reply Inline Actions It does look like the artifact combiner is missing some combines, for the `NarrowTy.getScalarSizeInBits() < SizeInBits` case the resulting: %6:_(s128) = G_MERGE_VALUES %0(s64), %12(s64) %7:_(s32) = G_TRUNC %6(s128) ought to be: %7:_(s32) = G_TRUNC %0(s64) and for the `NarrowTy.getScalarSizeInBits() >= SizeInBits` there's a: %10:_(s64) = G_SEXT_INREG %9, 32 %6:_(s128) = G_SEXT %10(s64) %7:_(s32) = G_TRUNC %6(s128) which firstly ought to be: %10:_(s64) = G_SEXT_INREG %9, 32 %7:_(s32) = G_TRUNC %10(s64) and secondly: %10:_(s64) = G_SEXT_INREG %9, 32 %7:_(s32) = %9(s32) the former of those two isn't really related to this patch (the G_SEXT_INREG isn't involved in the combine) but the latter is. dsanders: It does look like the artifact combiner is missing some combines, for the `NarrowTy.
		Petar.AvramovicUnsubmitted Not Done Reply Inline Actions Sorry, I still don't understand. In this fragment %9 should be s64 not s32. An IR function that results in situation where it is better to narrow scalar with SEXT+TRUNC where they combine with something would be helpful. Maybe it is possible to perform other combines before we make G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT? That is to not create G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT and solve everything in combiner with for example some pattern that combines multiple artifacts instead of only two. Petar.Avramovic: Sorry, I still don't understand. In this fragment %9 should be s64 not s32. An IR function that…
		dsandersAuthorUnsubmitted Done Reply Inline Actions Sorry, I still don't understand. In this fragment %9 should be s64 not s32. You're right, that step isn't quite right. It should be `%7:_(s32) = G_TRUNC %9(s64)`. The point was to consume the input rather than the output as the lower 32-bits are unchanged by the G_SEXT_INREG. Climbing the dependency chain like this allows the uses of %7 to start sooner. An IR function that results in situation where it is better to narrow scalar with SEXT+TRUNC where they combine with something would be helpful. Any IR where many operations are too large for your target would provide good examples. Consider: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %3:_(s64) = G_SEXT_INREG %2:_(s64), 16 and that both have narrowScalar(/typeidx=/0, s32). The legalization would proceed to: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %4:_(s32), %5:_(s32) = G_UNMERGE_VALUES %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_MERGE_VALUES %6:_(s32), %5:_(s32) and then to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %2:_(s64) = G_MERGE_VALUES %7:_(s32), %8:_(s32) %4:_(s32), %5:_(s32) = G_UNMERGE_VALUES %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_MERGE_VALUES %6:_(s32), %5:_(s32) then the artifact combiner would fold the middle merge/unmerge to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_MERGE_VALUES %6:_(s32), %8:_(s32) Notice that we still have the `%8:_(s32) = G_ADD %10:_(s32), %12:_(s32)` at this point even though we're about to overwrite it. We're not going to be able to improve on this until a post-legalize combiner. Now consider the same case with this optimization: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %3:_(s64) = G_SEXT_INREG %2:_(s64), 16 becomes: %2:_(s64) = G_ADD %0:_(s64), %1:_(s64) %4:_(s32) = G_TRUNC %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) then: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %2:_(s64) = G_MERGE_VALUES %7:_(s32), %8:_(s32) %4:_(s32) = G_TRUNC %2:_(s64) %6:_(s32) = G_SEXT_INREG %4:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) which the artifact combiner (should) simplify to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %8:_(s32) = G_ADD %10:_(s32), %12:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) and then to: %9:_(s32), %10:_(s32) = G_UNMERGE_VALUES %0:_(s64) %11:_(s32), %12:_(s32) = G_UNMERGE_VALUES %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) and then to: %9:_(s32) = G_TRUNC %0:_(s64) %11:_(s32) = G_TRUNC %1:_(s64) %7:_(s32) = G_ADD %9:_(s32), %11:_(s32) %6:_(s32) = G_SEXT_INREG %7:_(s32), 16 %3:_(s64) = G_SEXT %6:_(s32) which is simpler. The second G_ADD was correctly recognized as being dead and removed. As a result fewer instructions were emitted by the legalizer and subsequent passes have less work to do. Maybe it is possible to perform other combines before we make G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT? That is to not create G_SEXT_INREG that would benefit from narrow scalar with G_TRUNC/G_SEXT and solve everything in combiner with for example some pattern that combines multiple artifacts instead of only two. It's not entirely clear what you're suggesting here. I'm particularly confused by the 'that would benefit from narrow scalar' bit since it's not a choice to narrow scalar or not. An operation is either too wide for the target and must be narrowScalar'd or it's not. If your suggestion is to try to form G_SEXT_INREG instructions in a combine pass after the legalizer then I'd say that's too late in the pipeline. There's no guarantee that the combine to form a G_SEXT_INREG will run before a combine that makes them unrecognizable. Ideally we want to make a best effort to form them in a pre-legalizer combiner in addition to the legalizer. dsanders: > Sorry, I still don't understand. In this fragment %9 should be s64 not s32. You're right…
		Petar.AvramovicUnsubmitted Done Reply Inline Actions The point was to consume the input rather than the output as the lower 32-bits are unchanged by the G_SEXT_INREG. Climbing the dependency chain like this allows the uses of %7 to start sooner. Something like look through copy, but here it is look through G_SEXT_INREG depending on immediate (operand 2) ? Assuming Artifact combiner is able to perform all mentioned combines, it gets same output as narrowScalar using G_UNMERGE/G_MERGE + D61787. High bits in `%3:_(s64) = G_MERGE_VALUES %6:_(s32), %5:_(s32)` should be sign bit of %6 `%3:_(s64) = G_SEXT_INREG %2:_(s64), 16` narrowScalars to: %4:_(s32), %5:_(s32) = G_UNMERGE_VALUES %2:_(s64) %8:_(s32) = G_CONSTANT i32 31 %6:_(s32) = G_SEXT_INREG %4:_, 16 %7:_(s32) = G_ASHR %6:_, %8:_(s32) %3:_(s64) = G_MERGE_VALUES %6:_(s32), %7:_(s32) also `%3:_(s64) = G_SEXT %6:_(s32)` has to be narrowScalared/combined away by G_UNMERGE when available. I run mentioned example and got following results: Setup for test is: applying: D61289, D61289 and D61787, in MipsLegalizerInfo changing mips G_SEXT_INREG to: getActionDefinitionsBuilder(G_SEXT_INREG) .legalForTypeWithImm({{s32,8},{s32,16}}) .maxScalar(0, s32) .lower(); and changing G_SEXT_INREG to do narrow scalar with G_UNMERGE/G_MERGE always. running following test with -march=mipsel -global-isel -O0 -stop-after=legalizer define i64 @f(i64 signext %a, i64 signext %b) { entry: %add = add i64 %a, %b %conv = trunc i64 %add to i16 %conv1 = sext i16 %conv to i64 ret i64 %conv1 } gives following result: %2:_(s32) = COPY $a0 %3:_(s32) = COPY $a1 %4:_(s32) = COPY $a2 %5:_(s32) = COPY $a3 %25:_(s32) = G_CONSTANT i32 0 %22:_(s32) = G_ADD %2, %4 %26:_(s32) = G_CONSTANT i32 1 %27:_(s32) = COPY %25(s32) %23:_(s32) = G_AND %27, %26 %16:_(s32) = G_ADD %22, %23 %24:_(s32) = G_ICMP intpred(ult), %16(s32), %2 %20:_(s32) = G_ADD %3, %5 %28:_(s32) = COPY %24(s32) %21:_(s32) = G_AND %28, %26 %18:_(s32) = G_ADD %20, %21 %32:_(s32) = G_CONSTANT i32 31 %33:_(s32) = G_SEXT_INREG %16, 16 %34:_(s32) = G_ASHR %33, %32(s32) $v0 = COPY %33(s32) $v1 = COPY %34(s32) RetRA implicit $v0, implicit $v1 there are few places for improvement: first, lets remove dead instructions (this could take place after main do/while loop in Legalizer.cpp) %2:_(s32) = COPY $a0 %4:_(s32) = COPY $a2 %25:_(s32) = G_CONSTANT i32 0 %22:_(s32) = G_ADD %2, %4 %26:_(s32) = G_CONSTANT i32 1 %27:_(s32) = COPY %25(s32) %23:_(s32) = G_AND %27, %26 %16:_(s32) = G_ADD %22, %23 %32:_(s32) = G_CONSTANT i32 31 %33:_(s32) = G_SEXT_INREG %16, 16 %34:_(s32) = G_ASHR %33, %32(s32) $v0 = COPY %33(s32) $v1 = COPY %34(s32) RetRA implicit $v0, implicit $v1 fragment %25:_(s32) = G_CONSTANT i32 0 %22:_(s32) = G_ADD %2, %4 %26:_(s32) = G_CONSTANT i32 1 %27:_(s32) = COPY %25(s32) %23:_(s32) = G_AND %27, %26 %16:_(s32) = G_ADD %22, %23 comes from lower() of %15:_(s1) = G_CONSTANT i1 false %16:_(s32), %17:_(s1) = G_UADDE %11:_, %13:_, %15:_ from narrowScalar of G_ADD. If we used %16:_(s32), %17:_(s1) = G_UADDO %11:_, %13:_ for low bits, we would get %2:_(s32) = COPY $a0 %4:_(s32) = COPY $a2 %16:_(s32) = G_ADD %2, %4 %32:_(s32) = G_CONSTANT i32 31 %33:_(s32) = G_SEXT_INREG %16, 16 %34:_(s32) = G_ASHR %33, %32(s32) $v0 = COPY %33(s32) $v1 = COPY %34(s32) RetRA implicit $v0, implicit $v1 This should be equivalent to mentioned desired output when G_TRUNC/G_SEXT is used in narrow scalar. It's not entirely clear what you're suggesting here %10:_(s64) = G_SEXT_INREG %9, 32 %6:_(s128) = G_SEXT %10(s64) %7:_(s32) = G_TRUNC %6(s128) Was most likely generated from something like: %11:_(s32) = G_TRUNC %9(s64) %10:_(s64) = G_SEXT %11:_(s32) %6:_(s128) = G_SEXT %10(s64) %7:_(s32) = G_TRUNC %6(s128) I meant that we might be able to figure out that these 4 instructions are equivalent to `%7:_(s32) = G_TRUNC %9(s64)` (combine more then 2 artifact in artifact combiner) instead of combining first two into into G_SEXT_INREG. that would benefit from narrow scalar Is there a test that produces better/smaller code when when G_SEXT_INREG is narrow scalar-ed with G_TRUNC/G_SEXT instead of G_UNMERGE/G_MERGE? From discussion so far I am convinced that both approaches generate/should generate same output. Petar.Avramovic: > The point was to consume the input rather than the output as the lower 32-bits are unchanged…
		dsandersAuthorUnsubmitted Done Reply Inline Actions The point was to consume the input rather than the output as the lower 32-bits are unchanged by the G_SEXT_INREG. Climbing the dependency chain like this allows the uses of %7 to start sooner. Something like look through copy, but here it is look through G_SEXT_INREG depending on immediate (operand 2) ? I suppose it can be viewed that way as both end up simplifying the MIR. I think of them as separate things as this is a pattern-match-and-replace whereas look through copy is something that's done to help the pattern-match part of that by ignoring instructions that aren't relevant to whether the pattern should match or not. dsanders: > > The point was to consume the input rather than the output as the lower 32-bits are…
		Observer.changingInstr(MI);
		// We don't lose any non-extension bits by truncating the src and
		dsandersAuthorUnsubmitted Not Done Reply Inline Actions This statement is redundant though. It's already SizeInBits because we just read it from there dsanders: This statement is redundant though. It's already SizeInBits because we just read it from there
		// sign-extending the dst.
		MachineOperand &MO1 = MI.getOperand(1);
		auto TruncMIB = MIRBuilder.buildTrunc(NarrowTy, MO1.getReg());
		MO1.setReg(TruncMIB->getOperand(0).getReg());

		MachineOperand &MO2 = MI.getOperand(0);
		unsigned DstExt = MRI.createGenericVirtualRegister(NarrowTy);
		MIRBuilder.setInsertPt(MIRBuilder.getMBB(), ++MIRBuilder.getInsertPt());
		arsenmUnsubmitted Done Reply Inline Actions This wrapping is really weird looking. There should probably be a dedicated buildTrunc which would help some arsenm: This wrapping is really weird looking. There should probably be a dedicated buildTrunc which…
		MIRBuilder.buildInstr(TargetOpcode::G_SEXT, {MO2.getReg()}, {DstExt});
		MO2.setReg(DstExt);
		Observer.changedInstr(MI);
		return Legalized;
		}

		// Break it apart. Components below the extension point are unmodified. The
		// component containing the extension point becomes a narrower SEXT_INREG.
		// Components above it are ashr'd from the component containing the
		// extension point.
		if (SizeOp0 % NarrowSize != 0)
		return UnableToLegalize;
		rovkaUnsubmitted Done Reply Inline Actions Missing period. rovka: Missing period.
		int NumParts = SizeOp0 / NarrowSize;

		// List the registers where the destination will be scattered.
		SmallVector<unsigned, 2> DstRegs;
		// List the registers where the source will be split.
		SmallVector<unsigned, 2> SrcRegs;

		// Create all the temporary registers.
		for (int i = 0; i < NumParts; ++i) {
		unsigned SrcReg = MRI.createGenericVirtualRegister(NarrowTy);

		SrcRegs.push_back(SrcReg);
		}

		// Explode the big arguments into smaller chunks.
		MIRBuilder.buildUnmerge(SrcRegs, MI.getOperand(1).getReg());

		unsigned AshrCstReg =
		MIRBuilder.buildConstant(NarrowTy, NarrowTy.getScalarSizeInBits() - 1)
		->getOperand(0)
		.getReg();
		unsigned FullExtensionReg = 0;
		unsigned PartialExtensionReg = 0;

		// Do the operation on each small part.
		for (int i = 0; i < NumParts; ++i) {
		Petar.AvramovicUnsubmitted Done Reply Inline Actions This loop works for `NarrowTy.getScalarSizeInBits() >= SizeInBits` as well. `NarrowTy.getScalarSizeInBits()` -> `NarrowSize` Petar.Avramovic: This loop works for `NarrowTy.getScalarSizeInBits() >= SizeInBits` as well. `NarrowTy.
		dsandersAuthorUnsubmitted Done Reply Inline Actions It does but it potentially generates more instructions to do it. Consider `%0:_(s32) = G_SEXT %1:_(s32), 4` narrowScalar'd to s8. This method will break it into four components and will emit one G_SEXT_INREG and one G_ASHR. The other method will emit one G_SEXT_INREG and one G_SEXT which may be eliminated entirely or at worst, lower to one G_ASHR dsanders: It does but it potentially generates more instructions to do it. Consider `%0:_(s32) = G_SEXT…
		Petar.AvramovicUnsubmitted Done Reply Inline Actions Could we then add a possibility for targets to chose how they want to narrow scalar that is to have: narrowScalar - always creates G_UNMERGE+...+G_MERGE narrowScalarExt - creates G_TRUNC+...+G_{S\|Z\|ANY}EXT Petar.Avramovic: Could we then add a possibility for targets to chose how they want to narrow scalar that is to…
		dsandersAuthorUnsubmitted Done Reply Inline Actions Assuming we fix the missing artifact combines, what would be the case where G_UNMERGE/G_MERGE is the better option? dsanders: Assuming we fix the missing artifact combines, what would be the case where G_UNMERGE/G_MERGE…
		Petar.AvramovicUnsubmitted Done Reply Inline Actions %0:_(s128) = G_ANYEXT ..... %1:_(s128) = G_SEXT_INREG %0, 8 %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) At the moment most(if not all?) of narrow scalars make sequence of instructions that starts with G_UNMERGE and ends with G_MERGE. In case we emit G_SEXT as end of sequence it won't be able to combine with G_UNMERGE_VALUES. We then have to perform narrow scalar of this G_SEXT (that will end with G_MERGE) then combine this with %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) Same for G_TRUNC at the start. Not to mention that this complicates order in which we attempt to combine artifacts. And if G_SEXT_INREG is to be narrow scalared I would expect that it is surrounded with G_MERGE and G_UNMERGE, on mips at least. Considering llvm test-suite and mips I would say that it is always better to emit G_UNMERGE/G_MERGE. I cannot think of example where G_SEXT would be able to combine with something. And for general answer I would say that it is better to emit G_UNMERGE/G_MERGE if def of G_SEXT_INREG is used in G_UNMERGE_VALUES. This is problematic since it requires legalizer to decide how to narrow scalar based on surrounding instructions. Petar.Avramovic: %0:_(s128) = G_ANYEXT ..... %1:_(s128) = G_SEXT_INREG %0, 8 %2:_(s64), %3:_(s64) =…
		dsandersAuthorUnsubmitted Done Reply Inline Actions You're assuming that we _don't_ fix the missing artifact combines there. Given: %0:_(s128) = G_ANYEXT ..... %1:_(s128) = G_SEXT_INREG %0, 8 %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) ... = ... %2:_(s64) ... = ... %3:_(s64) then assuming you are doing narrowScalar(/typeidx=/0, s64) on the G_SEXT_INREG, we should get: %0:_(s128) = G_ANYEXT ..... %5:_(s64) = G_TRUNC %0:_(s128) %1:_(s64) = G_SEXT_INREG %0, 8 %4:_(s128) = G_SEXT %1 %2:_(s64), %3:_(s64) = G_UNMERGE_VALUES %1:_(s128) ... = ... %2:_(s64) ... = ... %3:_(s64) then the artifact combiner should give: %0:_(s128) = G_ANYEXT ..... %5:_(s64) = G_TRUNC %0:_(s128) %1:_(s64) = G_SEXT_INREG %0, 8 %6:_(s64) = G_ASHR %1, i64 63 ... = ... %1:_(s64) ... = ... %6:_(s64) then if we assume that G_ANYEXT was from anything smaller than s64 (which should be the case for MIPS): %5:_(s64) = G_ANYEXT ..... %1:_(s64) = G_SEXT_INREG %0, 8 %6:_(s64) = G_ASHR %1, i64 63 ... = ... %1:_(s64) ... = ... %6:_(s64) I cannot think of example where G_SEXT would be able to combine with something. There's plenty of cases where that should be able to combine. The common cases for targets that expect the input MIR to contain s8, s16, s32, s64 operations and need to legalize to only s32 and s64 operations are: If X == 2Y: %1:_(sX) = G_SEXT %0:_(sY) %2:_(sY), %3:_(sY) = G_UNMERGE_VALUES %1 ... = ... %2:_(sY) ... = ... %3:_(sY) to %3:_(sY) = G_ASHR %0:_(sY), iY (Y-1) ... = ... %0:_(sY) ... = ... %3:_(sY) If X == 4Y: %1:_(sX) = G_SEXT %0:_(sY) %2:_(sY), %3:_(sY), %4:_(sY), %5:_(sY) = G_UNMERGE_VALUES %1 ... = ... %2:_(sY) ... = ... %3:_(sY) ... = ... %4:_(sY) ... = ... %5:_(sY) to %3:_(sY) = G_ASHR %0:_(sY), iY (Y-1) ... = ... %0:_(sY) ... = ... %3:_(sY) ... = ... %3:_(sY) ... = ... %3:_(sY) For G_ZEXT, replace the `G_ASHR` with `G_CONSTANT iY 0`. For G_ANYEXT use IMPLICIT_DEF instead dsanders: You're assuming that we _don't_ fix the missing artifact combines there. Given: %0:_(s128) =…
		Petar.AvramovicUnsubmitted Done Reply Inline Actions Ah, didn't think of G_UNMERGE + G_SEXT combine. Then we perform work from "the for loop from G_SEXT_INREG narrow scalar" inside combine and it is pretty much same thing. Both narrowScalar approaches have similar overall performance and generate same output. Petar.Avramovic: Ah, didn't think of G_UNMERGE + G_SEXT combine. Then we perform work from "the for loop from…
		if ((i + 1) * NarrowTy.getScalarSizeInBits() < SizeInBits)
		DstRegs.push_back(SrcRegs[i]);
		else if (i * NarrowTy.getScalarSizeInBits() > SizeInBits) {
		assert(PartialExtensionReg &&
		"Expected to visit partial extension before full");
		if (FullExtensionReg) {
		DstRegs.push_back(FullExtensionReg);
		continue;
		}
		DstRegs.push_back(MIRBuilder
		.buildInstr(TargetOpcode::G_ASHR, {NarrowTy},
		Petar.AvramovicUnsubmitted Done Reply Inline Actions Considering efficiency it might be better to create only one G_ASHR that will have sign of "extension point SEXT_INREG" and copy it to remaining registers that hold higher bits. Petar.Avramovic: Considering efficiency it might be better to create only one G_ASHR that will have sign of…
		dsandersAuthorUnsubmitted Done Reply Inline Actions It already does that. We save the register in FullExtensionReg and re-use it if we see any further full extensions. The test for it is in LegalizerHelperTest.cpp on line 821 (the last two operands of the G_MERGE_VALUES are both [[T6]] dsanders: It already does that. We save the register in FullExtensionReg and re-use it if we see any…
		{PartialExtensionReg, AshrCstReg})
		->getOperand(0)
		.getReg());
		Petar.AvramovicUnsubmitted Done Reply Inline Actions `getOperand(0).getReg()` -> `getReg(0)` Petar.Avramovic: `getOperand(0).getReg()` -> `getReg(0)`
		dsandersAuthorUnsubmitted Done Reply Inline Actions I tried this change but it resulted in errors. dsanders: I tried this change but it resulted in errors.
		FullExtensionReg = DstRegs.back();
		} else {
		DstRegs.push_back(
		MIRBuilder
		.buildInstr(
		TargetOpcode::G_SEXT_INREG, {NarrowTy},
		{SrcRegs[i],
		APInt(NarrowTy.getScalarSizeInBits(),
		SizeInBits % NarrowTy.getScalarSizeInBits())})
		->getOperand(0)
		.getReg());
		PartialExtensionReg = DstRegs.back();
		}
		}

		// Gather the destination registers into the final destination.
		unsigned DstReg = MI.getOperand(0).getReg();
		MIRBuilder.buildMerge(DstReg, DstRegs);
		MI.eraseFromParent();
		return Legalized;
		}
}		}
}		}

void LegalizerHelper::widenScalarSrc(MachineInstr &MI, LLT WideTy,		void LegalizerHelper::widenScalarSrc(MachineInstr &MI, LLT WideTy,
unsigned OpIdx, unsigned ExtOpcode) {		unsigned OpIdx, unsigned ExtOpcode) {
MachineOperand &MO = MI.getOperand(OpIdx);		MachineOperand &MO = MI.getOperand(OpIdx);
auto ExtB = MIRBuilder.buildInstr(ExtOpcode, {WideTy}, {MO.getReg()});		auto ExtB = MIRBuilder.buildInstr(ExtOpcode, {WideTy}, {MO.getReg()});
MO.setReg(ExtB->getOperand(0).getReg());		MO.setReg(ExtB->getOperand(0).getReg());
▲ Show 20 Lines • Show All 644 Lines • ▼ Show 20 Lines	LegalizerHelper::widenScalar(MachineInstr &MI, unsigned TypeIdx, LLT WideTy) {
case TargetOpcode::G_PTRTOINT:		case TargetOpcode::G_PTRTOINT:
if (TypeIdx != 0)		if (TypeIdx != 0)
return UnableToLegalize;		return UnableToLegalize;

Observer.changingInstr(MI);		Observer.changingInstr(MI);
widenScalarDst(MI, WideTy, 0);		widenScalarDst(MI, WideTy, 0);
Observer.changedInstr(MI);		Observer.changedInstr(MI);
return Legalized;		return Legalized;
		case TargetOpcode::G_SEXT_INREG:
		if (TypeIdx != 0)
		return UnableToLegalize;

		Observer.changingInstr(MI);
		widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
		widenScalarDst(MI, WideTy, 0, TargetOpcode::G_TRUNC);
		Observer.changedInstr(MI);
		return Legalized;
}		}
}		}

LegalizerHelper::LegalizeResult		LegalizerHelper::LegalizeResult
LegalizerHelper::lower(MachineInstr &MI, unsigned TypeIdx, LLT Ty) {		LegalizerHelper::lower(MachineInstr &MI, unsigned TypeIdx, LLT Ty) {
using namespace TargetOpcode;		using namespace TargetOpcode;
MIRBuilder.setInstr(MI);		MIRBuilder.setInstr(MI);

▲ Show 20 Lines • Show All 225 Lines • ▼ Show 20 Lines	case G_USUBE: {
MIRBuilder.buildSub(Res, TmpRes, ZExtBorrowIn);		MIRBuilder.buildSub(Res, TmpRes, ZExtBorrowIn);
MIRBuilder.buildICmp(CmpInst::ICMP_EQ, LHS_EQ_RHS, LHS, RHS);		MIRBuilder.buildICmp(CmpInst::ICMP_EQ, LHS_EQ_RHS, LHS, RHS);
MIRBuilder.buildICmp(CmpInst::ICMP_ULT, LHS_ULT_RHS, LHS, RHS);		MIRBuilder.buildICmp(CmpInst::ICMP_ULT, LHS_ULT_RHS, LHS, RHS);
MIRBuilder.buildSelect(BorrowOut, LHS_EQ_RHS, BorrowIn, LHS_ULT_RHS);		MIRBuilder.buildSelect(BorrowOut, LHS_EQ_RHS, BorrowIn, LHS_ULT_RHS);

MI.eraseFromParent();		MI.eraseFromParent();
return Legalized;		return Legalized;
}		}

		case TargetOpcode::G_SEXT_INREG: {
		assert(MI.getOperand(2).isImm() && "Expected immediate");
		int64_t SizeInBits = MI.getOperand(2).getImm();
		arsenmUnsubmitted Done Reply Inline Actions This should be illegal and caught by the verifier? This shouldn't need to check for illegal cases arsenm: This should be illegal and caught by the verifier? This shouldn't need to check for illegal…
		dsandersAuthorUnsubmitted Done Reply Inline Actions I've turned it into an assert rather than remove it since it's easier to debug if the debugger stops where it happens and this is the first use of a real immediate in GlobalISel so the chances of misuse are higher than normal. dsanders: I've turned it into an assert rather than remove it since it's easier to debug if the debugger…

		unsigned DstReg = MI.getOperand(0).getReg();
		unsigned SrcReg = MI.getOperand(1).getReg();
		LLT DstTy = MRI.getType(DstReg);
		unsigned TmpRes = MRI.createGenericVirtualRegister(DstTy);

		auto MIBSz = MIRBuilder.buildConstant(DstTy, DstTy.getScalarSizeInBits() - SizeInBits);
		MIRBuilder.buildInstr(TargetOpcode::G_SHL, {TmpRes}, {SrcReg, MIBSz->getOperand(0).getReg()});
		MIRBuilder.buildInstr(TargetOpcode::G_ASHR, {DstReg}, {TmpRes, MIBSz->getOperand(0).getReg()});
		MI.eraseFromParent();
		return Legalized;
		}
}		}
}		}

LegalizerHelper::LegalizeResult LegalizerHelper::fewerElementsVectorImplicitDef(		LegalizerHelper::LegalizeResult LegalizerHelper::fewerElementsVectorImplicitDef(
MachineInstr &MI, unsigned TypeIdx, LLT NarrowTy) {		MachineInstr &MI, unsigned TypeIdx, LLT NarrowTy) {
SmallVector<unsigned, 2> DstRegs;		SmallVector<unsigned, 2> DstRegs;

unsigned NarrowSize = NarrowTy.getSizeInBits();		unsigned NarrowSize = NarrowTy.getSizeInBits();
▲ Show 20 Lines • Show All 1,364 Lines • Show Last 20 Lines

llvm/lib/CodeGen/GlobalISel/LegalizerInfo.cpp

Show First 20 Lines • Show All 209 Lines • ▼ Show 20 Lines	#ifndef NDEBUG
}		}
const int64_t FirstUncovered = TypeIdxsCovered.find_first_unset();		const int64_t FirstUncovered = TypeIdxsCovered.find_first_unset();
if (FirstUncovered < 0) {		if (FirstUncovered < 0) {
LLVM_DEBUG(dbgs() << ".. type index coverage check SKIPPED:"		LLVM_DEBUG(dbgs() << ".. type index coverage check SKIPPED:"
" user-defined predicate detected\n");		" user-defined predicate detected\n");
return true;		return true;
}		}
const bool AllCovered = (FirstUncovered >= NumTypeIdxs);		const bool AllCovered = (FirstUncovered >= NumTypeIdxs);
		if (NumTypeIdxs > 0)
LLVM_DEBUG(dbgs() << ".. the first uncovered type index: " << FirstUncovered		LLVM_DEBUG(dbgs() << ".. the first uncovered type index: " << FirstUncovered
<< ", " << (AllCovered ? "OK" : "FAIL") << "\n");		<< ", " << (AllCovered ? "OK" : "FAIL") << "\n");
return AllCovered;		return AllCovered;
#else		#else
return true;		return true;
#endif		#endif
}		}

		bool LegalizeRuleSet::verifyImmIdxsCoverage(unsigned NumImmIdxs) const {
		#ifndef NDEBUG
		if (Rules.empty()) {
		LLVM_DEBUG(
		dbgs() << ".. imm index coverage check SKIPPED: no rules defined\n");
		return true;
		}
		const int64_t FirstUncovered = ImmIdxsCovered.find_first_unset();
		if (FirstUncovered < 0) {
		LLVM_DEBUG(dbgs() << ".. imm index coverage check SKIPPED:"
		" user-defined predicate detected\n");
		return true;
		}
		const bool AllCovered = (FirstUncovered >= NumImmIdxs);
		LLVM_DEBUG(dbgs() << ".. the first uncovered imm index: " << FirstUncovered
		<< ", " << (AllCovered ? "OK" : "FAIL") << "\n");
		return AllCovered;
		#else
		return true;
		#endif
		}

LegalizerInfo::LegalizerInfo() : TablesInitialized(false) {		LegalizerInfo::LegalizerInfo() : TablesInitialized(false) {
// Set defaults.		// Set defaults.
// FIXME: these two (G_ANYEXT and G_TRUNC?) can be legalized to the		// FIXME: these two (G_ANYEXT and G_TRUNC?) can be legalized to the
// fundamental load/store Jakob proposed. Once loads & stores are supported.		// fundamental load/store Jakob proposed. Once loads & stores are supported.
setScalarAction(TargetOpcode::G_ANYEXT, 1, {{1, Legal}});		setScalarAction(TargetOpcode::G_ANYEXT, 1, {{1, Legal}});
setScalarAction(TargetOpcode::G_ZEXT, 1, {{1, Legal}});		setScalarAction(TargetOpcode::G_ZEXT, 1, {{1, Legal}});
setScalarAction(TargetOpcode::G_SEXT, 1, {{1, Legal}});		setScalarAction(TargetOpcode::G_SEXT, 1, {{1, Legal}});
setScalarAction(TargetOpcode::G_TRUNC, 0, {{1, Legal}});		setScalarAction(TargetOpcode::G_TRUNC, 0, {{1, Legal}});
▲ Show 20 Lines • Show All 148 Lines • ▼ Show 20 Lines
}		}

unsigned LegalizerInfo::getActionDefinitionsIdx(unsigned Opcode) const {		unsigned LegalizerInfo::getActionDefinitionsIdx(unsigned Opcode) const {
unsigned OpcodeIdx = getOpcodeIdxForOpcode(Opcode);		unsigned OpcodeIdx = getOpcodeIdxForOpcode(Opcode);
if (unsigned Alias = RulesForOpcode[OpcodeIdx].getAlias()) {		if (unsigned Alias = RulesForOpcode[OpcodeIdx].getAlias()) {
LLVM_DEBUG(dbgs() << ".. opcode " << Opcode << " is aliased to " << Alias		LLVM_DEBUG(dbgs() << ".. opcode " << Opcode << " is aliased to " << Alias
<< "\n");		<< "\n");
OpcodeIdx = getOpcodeIdxForOpcode(Alias);		OpcodeIdx = getOpcodeIdxForOpcode(Alias);
LLVM_DEBUG(dbgs() << ".. opcode " << Alias << " is aliased to "
<< RulesForOpcode[OpcodeIdx].getAlias() << "\n");
assert(RulesForOpcode[OpcodeIdx].getAlias() == 0 && "Cannot chain aliases");		assert(RulesForOpcode[OpcodeIdx].getAlias() == 0 && "Cannot chain aliases");
}		}

return OpcodeIdx;		return OpcodeIdx;
}		}

const LegalizeRuleSet &		const LegalizeRuleSet &
LegalizerInfo::getActionDefinitions(unsigned Opcode) const {		LegalizerInfo::getActionDefinitions(unsigned Opcode) const {
▲ Show 20 Lines • Show All 267 Lines • ▼ Show 20 Lines	for (unsigned Opcode = FirstOp; Opcode <= LastOp; ++Opcode) {
const MCInstrDesc &MCID = MII.get(Opcode);		const MCInstrDesc &MCID = MII.get(Opcode);
const unsigned NumTypeIdxs = std::accumulate(		const unsigned NumTypeIdxs = std::accumulate(
MCID.opInfo_begin(), MCID.opInfo_end(), 0U,		MCID.opInfo_begin(), MCID.opInfo_end(), 0U,
[](unsigned Acc, const MCOperandInfo &OpInfo) {		[](unsigned Acc, const MCOperandInfo &OpInfo) {
return OpInfo.isGenericType()		return OpInfo.isGenericType()
? std::max(OpInfo.getGenericTypeIndex() + 1U, Acc)		? std::max(OpInfo.getGenericTypeIndex() + 1U, Acc)
: Acc;		: Acc;
});		});
		const unsigned NumImmIdxs = std::accumulate(
		MCID.opInfo_begin(), MCID.opInfo_end(), 0U,
		[](unsigned Acc, const MCOperandInfo &OpInfo) {
		return OpInfo.isGenericImm()
		? std::max(OpInfo.getGenericImmIndex() + 1U, Acc)
		: Acc;
		});
LLVM_DEBUG(dbgs() << MII.getName(Opcode) << " (opcode " << Opcode		LLVM_DEBUG(dbgs() << MII.getName(Opcode) << " (opcode " << Opcode
<< "): " << NumTypeIdxs << " type ind"		<< "): " << NumTypeIdxs << " type ind"
<< (NumTypeIdxs == 1 ? "ex" : "ices") << "\n");		<< (NumTypeIdxs == 1 ? "ex" : "ices") << ", "
		<< NumImmIdxs << " imm ind"
		<< (NumImmIdxs == 1 ? "ex" : "ices") << "\n");
const LegalizeRuleSet &RuleSet = getActionDefinitions(Opcode);		const LegalizeRuleSet &RuleSet = getActionDefinitions(Opcode);
if (!RuleSet.verifyTypeIdxsCoverage(NumTypeIdxs))		if (!RuleSet.verifyTypeIdxsCoverage(NumTypeIdxs))
FailedOpcodes.push_back(Opcode);		FailedOpcodes.push_back(Opcode);
		else if (!RuleSet.verifyImmIdxsCoverage(NumImmIdxs))
		FailedOpcodes.push_back(Opcode);
}		}
if (!FailedOpcodes.empty()) {		if (!FailedOpcodes.empty()) {
errs() << "The following opcodes have ill-defined legalization rules:";		errs() << "The following opcodes have ill-defined legalization rules:";
for (unsigned Opcode : FailedOpcodes)		for (unsigned Opcode : FailedOpcodes)
errs() << " " << MII.getName(Opcode);		errs() << " " << MII.getName(Opcode);
errs() << "\n";		errs() << "\n";

report_fatal_error("ill-defined LegalizerInfo"		report_fatal_error("ill-defined LegalizerInfo"
Show All 23 Lines

llvm/lib/CodeGen/GlobalISel/Utils.cpp

Show First 20 Lines • Show All 355 Lines • ▼ Show 20 Lines	case TargetOpcode::G_SREM:
if (!C2.getBoolValue())		if (!C2.getBoolValue())
break;		break;
return C1.srem(C2);		return C1.srem(C2);
}		}
}		}
return None;		return None;
}		}

		Optional<APInt> llvm::ConstantFoldExtOp(unsigned Opcode, const unsigned Op1,
		uint64_t Imm,
		const MachineRegisterInfo &MRI) {
		auto MaybeOp1Cst = getConstantVRegVal(Op1, MRI);
		if (MaybeOp1Cst) {
		LLT Ty = MRI.getType(Op1);
		APInt C1(Ty.getSizeInBits(), *MaybeOp1Cst, true);
		switch (Opcode) {
		default:
		break;
		case TargetOpcode::G_SEXT_INREG:
		return C1.trunc(Imm).sext(C1.getBitWidth());
		}
		}
		return None;
		}

void llvm::getSelectionDAGFallbackAnalysisUsage(AnalysisUsage &AU) {		void llvm::getSelectionDAGFallbackAnalysisUsage(AnalysisUsage &AU) {
AU.addPreserved<StackProtector>();		AU.addPreserved<StackProtector>();
}		}

llvm/lib/CodeGen/MachineVerifier.cpp

Show First 20 Lines • Show All 1,306 Lines • ▼ Show 20 Lines	case TargetOpcode::G_INSERT: {
if (DstSize <= SrcSize)		if (DstSize <= SrcSize)
report("inserted size must be smaller than total register", MI);		report("inserted size must be smaller than total register", MI);

if (SrcSize + OffsetOp.getImm() > DstSize)		if (SrcSize + OffsetOp.getImm() > DstSize)
report("insert writes past end of register", MI);		report("insert writes past end of register", MI);

break;		break;
}		}
		case TargetOpcode::G_SEXT_INREG: {
		if (!MI->getOperand(2).isImm()) {
		report("G_SEXT_INREG expects an immediate operand #2", MI);
		break;
		}

		LLT DstTy = MRI->getType(MI->getOperand(0).getReg());
		LLT SrcTy = MRI->getType(MI->getOperand(1).getReg());
		verifyVectorElementMatch(DstTy, SrcTy, MI);

		int64_t Imm = MI->getOperand(2).getImm();
		dsandersAuthorUnsubmitted Done Reply Inline Actions I just spotted this one too. This should be getScalarSizeInBits() dsanders: I just spotted this one too. This should be getScalarSizeInBits()
		if (Imm <= 0)
		report("G_SEXT_INREG size must be >= 1", MI);
		if (Imm >= SrcTy.getScalarSizeInBits())
		arsenmUnsubmitted Done Reply Inline Actions Should check for matching vector elements arsenm: Should check for matching vector elements
		report("G_SEXT_INREG size must be less than source bit width", MI);
		break;
		}
default:		default:
break;		break;
}		}
}		}

void MachineVerifier::visitMachineInstrBefore(const MachineInstr *MI) {		void MachineVerifier::visitMachineInstrBefore(const MachineInstr *MI) {
const MCInstrDesc &MCID = MI->getDesc();		const MCInstrDesc &MCID = MI->getDesc();
if (MI->getNumOperands() < MCID.getNumOperands()) {		if (MI->getNumOperands() < MCID.getNumOperands()) {
▲ Show 20 Lines • Show All 1,360 Lines • Show Last 20 Lines

llvm/lib/Target/AArch64/AArch64LegalizerInfo.cpp

Show First 20 Lines • Show All 341 Lines • ▼ Show 20 Lines	getActionDefinitionsBuilder({G_ZEXT, G_SEXT, G_ANYEXT})
if (SrcSize < 8 \|\| !isPowerOf2_32(SrcSize))		if (SrcSize < 8 \|\| !isPowerOf2_32(SrcSize))
return false;		return false;

return true;		return true;
});		});

getActionDefinitionsBuilder(G_TRUNC).alwaysLegal();		getActionDefinitionsBuilder(G_TRUNC).alwaysLegal();

		getActionDefinitionsBuilder(G_SEXT_INREG).lower();

// FP conversions		// FP conversions
getActionDefinitionsBuilder(G_FPTRUNC).legalFor(		getActionDefinitionsBuilder(G_FPTRUNC).legalFor(
{{s16, s32}, {s16, s64}, {s32, s64}, {v4s16, v4s32}, {v2s32, v2s64}});		{{s16, s32}, {s16, s64}, {s32, s64}, {v4s16, v4s32}, {v2s32, v2s64}});
getActionDefinitionsBuilder(G_FPEXT).legalFor(		getActionDefinitionsBuilder(G_FPEXT).legalFor(
{{s32, s16}, {s64, s16}, {s64, s32}, {v4s32, v4s16}, {v2s64, v2s32}});		{{s32, s16}, {s64, s16}, {s64, s32}, {v4s32, v4s16}, {v2s64, v2s32}});

// Conversions		// Conversions
getActionDefinitionsBuilder({G_FPTOSI, G_FPTOUI})		getActionDefinitionsBuilder({G_FPTOSI, G_FPTOUI})
▲ Show 20 Lines • Show All 331 Lines • Show Last 20 Lines

llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp

Show First 20 Lines • Show All 654 Lines • ▼ Show 20 Lines	getActionDefinitionsBuilder(Op)
LitTy.getSizeInBits() % 16 == 0 &&		LitTy.getSizeInBits() % 16 == 0 &&
BigTy.getSizeInBits() <= 512;		BigTy.getSizeInBits() <= 512;
})		})
// Any vectors left are the wrong size. Scalarize them.		// Any vectors left are the wrong size. Scalarize them.
.scalarize(0)		.scalarize(0)
.scalarize(1);		.scalarize(1);
}		}

		getActionDefinitionsBuilder(G_SEXT_INREG).lower();

computeTables();		computeTables();
verify(*ST.getInstrInfo());		verify(*ST.getInstrInfo());
}		}

bool AMDGPULegalizerInfo::legalizeCustom(MachineInstr &MI,		bool AMDGPULegalizerInfo::legalizeCustom(MachineInstr &MI,
MachineRegisterInfo &MRI,		MachineRegisterInfo &MRI,
MachineIRBuilder &MIRBuilder,		MachineIRBuilder &MIRBuilder,
GISelChangeObserver &Observer) const {		GISelChangeObserver &Observer) const {
▲ Show 20 Lines • Show All 159 Lines • Show Last 20 Lines

llvm/lib/Target/ARM/ARMLegalizerInfo.cpp

Show First 20 Lines • Show All 78 Lines • ▼ Show 20 Lines	if (ST.isThumb1Only()) {
computeTables();		computeTables();
verify(*ST.getInstrInfo());		verify(*ST.getInstrInfo());
return;		return;
}		}

getActionDefinitionsBuilder({G_SEXT, G_ZEXT, G_ANYEXT})		getActionDefinitionsBuilder({G_SEXT, G_ZEXT, G_ANYEXT})
.legalForCartesianProduct({s32}, {s1, s8, s16});		.legalForCartesianProduct({s32}, {s1, s8, s16});

		getActionDefinitionsBuilder(G_SEXT_INREG).lower();
		rovkaUnsubmitted Done Reply Inline Actions Testcase? rovka: Testcase?
		dsandersAuthorUnsubmitted Done Reply Inline Actions This is just to maintain the status quo for ARM. It will be whatever test cases you already had for the lowering of G_SEXT into G_LSL/G_ASHR which appears to be just llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-divmod.mir dsanders: This is just to maintain the status quo for ARM. It will be whatever test cases you already had…
		rovkaUnsubmitted Done Reply Inline Actions Not really, there are tests for standalone G_SEXT in llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-exts.mir. The tests in divmod are testing divmod, and only incidentally the extensions. Since G_SEXT_INREG is an independent opcpde and, as you said in a previous comment, not just a legalization artifact, it should have a standalone test (without any combines). Otherwise, if we changed our handling of legalization artifacts again in the future, we'd leave G_SEXT_INREG uncovered. Anyway, I can add it myself as a follow-up if this gets committed. rovka: Not really, there are tests for standalone G_SEXT in llvm/test/CodeGen/ARM/GlobalISel/arm…
		dsandersAuthorUnsubmitted Done Reply Inline Actions Ah ok, I didn't find that one because it only tests a legal G_SEXT and not one that's legalized. I've added a test that checks it gets lowered to the shift pair dsanders: Ah ok, I didn't find that one because it only tests a legal G_SEXT and not one that's legalized.
		rovkaUnsubmitted Done Reply Inline Actions Cool, thanks! rovka: Cool, thanks!

getActionDefinitionsBuilder({G_ADD, G_SUB, G_MUL, G_AND, G_OR, G_XOR})		getActionDefinitionsBuilder({G_ADD, G_SUB, G_MUL, G_AND, G_OR, G_XOR})
.legalFor({s32})		.legalFor({s32})
.minScalar(0, s32);		.minScalar(0, s32);

getActionDefinitionsBuilder({G_ASHR, G_LSHR, G_SHL})		getActionDefinitionsBuilder({G_ASHR, G_LSHR, G_SHL})
.legalFor({{s32, s32}})		.legalFor({{s32, s32}})
.clampScalar(1, s32, s32);		.clampScalar(1, s32, s32);

▲ Show 20 Lines • Show All 367 Lines • Show Last 20 Lines

llvm/lib/Target/Mips/MipsLegalizerInfo.cpp

Show First 20 Lines • Show All 91 Lines • ▼ Show 20 Lines	MipsLegalizerInfo::MipsLegalizerInfo(const MipsSubtarget &ST) {

// FP instructions		// FP instructions
getActionDefinitionsBuilder(G_FCONSTANT)		getActionDefinitionsBuilder(G_FCONSTANT)
.legalFor({s32, s64});		.legalFor({s32, s64});

getActionDefinitionsBuilder({G_FADD, G_FSUB, G_FMUL, G_FDIV})		getActionDefinitionsBuilder({G_FADD, G_FSUB, G_FMUL, G_FDIV})
.legalFor({s32, s64});		.legalFor({s32, s64});

		getActionDefinitionsBuilder(G_SEXT_INREG).lower();

computeTables();		computeTables();
verify(*ST.getInstrInfo());		verify(*ST.getInstrInfo());
}		}

bool MipsLegalizerInfo::legalizeCustom(MachineInstr &MI,		bool MipsLegalizerInfo::legalizeCustom(MachineInstr &MI,
MachineRegisterInfo &MRI,		MachineRegisterInfo &MRI,
MachineIRBuilder &MIRBuilder,		MachineIRBuilder &MIRBuilder,
GISelChangeObserver &Observer) const {		GISelChangeObserver &Observer) const {

using namespace TargetOpcode;		using namespace TargetOpcode;

MIRBuilder.setInstr(MI);		MIRBuilder.setInstr(MI);

return false;		return false;
}		}

llvm/lib/Target/X86/X86LegalizerInfo.cpp

Show First 20 Lines • Show All 152 Lines • ▼ Show 20 Lines	void X86LegalizerInfo::setLegalizerInfo32bit() {

// Extensions		// Extensions
for (auto Ty : {s8, s16, s32}) {		for (auto Ty : {s8, s16, s32}) {
setAction({G_ZEXT, Ty}, Legal);		setAction({G_ZEXT, Ty}, Legal);
setAction({G_SEXT, Ty}, Legal);		setAction({G_SEXT, Ty}, Legal);
setAction({G_ANYEXT, Ty}, Legal);		setAction({G_ANYEXT, Ty}, Legal);
}		}
setAction({G_ANYEXT, s128}, Legal);		setAction({G_ANYEXT, s128}, Legal);
		getActionDefinitionsBuilder(G_SEXT_INREG).lower();

// Comparison		// Comparison
setAction({G_ICMP, s1}, Legal);		setAction({G_ICMP, s1}, Legal);

for (auto Ty : {s8, s16, s32, p0})		for (auto Ty : {s8, s16, s32, p0})
setAction({G_ICMP, 1, Ty}, Legal);		setAction({G_ICMP, 1, Ty}, Legal);

// Merge/Unmerge		// Merge/Unmerge
▲ Show 20 Lines • Show All 343 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-extends.ll

This file was added.

				; RUN: llc -O0 -mtriple=aarch64-apple-ios -global-isel -stop-after=irtranslator %s -o - \| FileCheck %s

				; Test that extends correctly translate to G_[ZS]EXT. The translator will never
				; emit a G_SEXT_INREG.

				define i32 @test_zext(i32 %a) {
				; CHECK-LABEL: name: test_zext
				; CHECK: %0:_(s32) = COPY $w0
				; CHECK: %1:_(s8) = G_TRUNC %0(s32)
				; CHECK: %2:_(s16) = G_ZEXT %1(s8)
				; CHECK: %3:_(s32) = G_ZEXT %2(s16)
				; CHECK: $w0 = COPY %3(s32)
				%tmp0 = trunc i32 %a to i8
				%tmp1 = zext i8 %tmp0 to i16
				%tmp2 = zext i16 %tmp1 to i32
				ret i32 %tmp2
				}

				define i32 @test_sext(i32 %a) {
				; CHECK-LABEL: name: test_sext
				; CHECK: %0:_(s32) = COPY $w0
				; CHECK: %1:_(s8) = G_TRUNC %0(s32)
				; CHECK: %2:_(s16) = G_SEXT %1(s8)
				; CHECK: %3:_(s32) = G_SEXT %2(s16)
				; CHECK: $w0 = COPY %3(s32)
				%tmp0 = trunc i32 %a to i8
				%tmp1 = sext i8 %tmp0 to i16
				%tmp2 = sext i16 %tmp1 to i32
				ret i32 %tmp2
				}

llvm/test/CodeGen/AArch64/GlobalISel/legalize-div.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s			# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s
	---			---
	name: test_div			name: test_div
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_div			; CHECK-LABEL: name: test_div
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC1]], [[C]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC1]], [[C]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; CHECK: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]			; CHECK-DAG: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]
	; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)			; CHECK-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)
	; CHECK: $w0 = COPY [[COPY2]](s32)			; CHECK-DAG: $w0 = COPY [[COPY2]](s32)
	; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; CHECK-DAG: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[TRUNC2]], [[C2]]			; CHECK-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[TRUNC2]], [[C2]]
	; CHECK: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC3]], [[C2]]			; CHECK-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC3]], [[C2]]
	; CHECK: [[UDIV:%[0-9]+]]:_(s32) = G_UDIV [[AND]], [[AND1]]			; CHECK-DAG: [[UDIV:%[0-9]+]]:_(s32) = G_UDIV [[AND]], [[AND1]]
	; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[UDIV]](s32)			; CHECK-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[UDIV]](s32)
	; CHECK: $w0 = COPY [[COPY3]](s32)			; CHECK-DAG: $w0 = COPY [[COPY3]](s32)
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s8) = G_TRUNC %0(s64)			%2:_(s8) = G_TRUNC %0(s64)
	%3:_(s8) = G_TRUNC %1(s64)			%3:_(s8) = G_TRUNC %1(s64)
	%4:_(s8) = G_SDIV %2, %3			%4:_(s8) = G_SDIV %2, %3
	%6:_(s32) = G_ANYEXT %4(s8)			%6:_(s32) = G_ANYEXT %4(s8)
	$w0 = COPY %6(s32)			$w0 = COPY %6(s32)
	%5:_(s8) = G_UDIV %2, %3			%5:_(s8) = G_UDIV %2, %3
	Show All 33 Lines

llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s			# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s
	---			---
	name: test_ext			name: test_ext
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_ext			; CHECK-LABEL: name: test_ext
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC]](s32)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC1]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC1]](s32)
	; CHECK: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC2]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC2]](s32)
	; CHECK: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC3]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC3]](s32)
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)
	; CHECK: $x0 = COPY [[COPY1]](s64)			; CHECK-DAG: $x0 = COPY [[COPY1]](s64)
	; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 255			; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 255
	; CHECK: [[COPY2:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)			; CHECK-DAG: [[COPY2:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)
	; CHECK: [[AND:%[0-9]+]]:_(s64) = G_AND [[COPY2]], [[C]]			; CHECK-DAG: [[AND:%[0-9]+]]:_(s64) = G_AND [[COPY2]], [[C]]
	; CHECK: $x0 = COPY [[AND]](s64)			; CHECK-DAG: $x0 = COPY [[AND]](s64)
	; CHECK: [[COPY3:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)			; CHECK-DAG: [[COPY3:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)
	; CHECK: $x0 = COPY [[COPY3]](s64)			; CHECK-DAG: $x0 = COPY [[COPY3]](s64)
	; CHECK: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 32			; CHECK-DAG: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 32
	; CHECK: [[COPY4:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)			; CHECK-DAG: [[COPY4:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY4]], [[C1]]			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY4]], [[C1]]
	; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[C1]]			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[C1]]
	; CHECK: $x0 = COPY [[ASHR]](s64)			; CHECK-DAG: $x0 = COPY [[ASHR]](s64)
	; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 31			; CHECK-DAG: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 31
	; CHECK: [[TRUNC4:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC4:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC4]], [[C2]]			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC4]], [[C2]]
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C2]]			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C2]]
	; CHECK: $w0 = COPY [[ASHR1]](s32)			; CHECK-DAG: $w0 = COPY [[ASHR1]](s32)
	; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; CHECK-DAG: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[TRUNC5:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC5:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC5]], [[C3]]			; CHECK-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC5]], [[C3]]
	; CHECK: $w0 = COPY [[AND1]](s32)			; CHECK-DAG: $w0 = COPY [[AND1]](s32)
	; CHECK: [[TRUNC6:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC6:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC6]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC6]](s32)
	; CHECK: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 1			; CHECK-DAG: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
	; CHECK: [[TRUNC7:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC7:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[AND2:%[0-9]+]]:_(s32) = G_AND [[TRUNC7]], [[C4]]			; CHECK-DAG: [[AND2:%[0-9]+]]:_(s32) = G_AND [[TRUNC7]], [[C4]]
	; CHECK: $w0 = COPY [[AND2]](s32)			; CHECK-DAG: $w0 = COPY [[AND2]](s32)
	; CHECK: [[TRUNC8:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC8:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC8]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC8]](s32)
	; CHECK: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[C5:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[TRUNC9:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC9:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[TRUNC9]], [[C5]]			; CHECK-DAG: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[TRUNC9]], [[C5]]
	; CHECK: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C5]]			; CHECK-DAG: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C5]]
	; CHECK: $w0 = COPY [[ASHR2]](s32)			; CHECK-DAG: $w0 = COPY [[ASHR2]](s32)
	; CHECK: [[TRUNC10:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC10:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[AND3:%[0-9]+]]:_(s32) = G_AND [[TRUNC10]], [[C4]]			; CHECK-DAG: [[AND3:%[0-9]+]]:_(s32) = G_AND [[TRUNC10]], [[C4]]
	; CHECK: $w0 = COPY [[AND3]](s32)			; CHECK-DAG: $w0 = COPY [[AND3]](s32)
	; CHECK: [[TRUNC11:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC11:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC11]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC11]](s32)
	; CHECK: [[TRUNC12:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC12:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: $w0 = COPY [[TRUNC12]](s32)			; CHECK-DAG: $w0 = COPY [[TRUNC12]](s32)
	; CHECK: [[FPEXT:%[0-9]+]]:_(s64) = G_FPEXT [[TRUNC12]](s32)			; CHECK-DAG: [[FPEXT:%[0-9]+]]:_(s64) = G_FPEXT [[TRUNC12]](s32)
	; CHECK: $x0 = COPY [[FPEXT]](s64)			; CHECK-DAG: $x0 = COPY [[FPEXT]](s64)
	; CHECK: [[C7:%[0-9]+]]:_(s32) = G_CONSTANT i32 0			; CHECK-DAG: [[C7:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
	; CHECK: $w0 = COPY [[C7]](s32)			; CHECK-DAG: $w0 = COPY [[C7]](s32)
	; CHECK: [[DEF:%[0-9]+]]:_(s32) = G_IMPLICIT_DEF			; CHECK-DAG: [[DEF:%[0-9]+]]:_(s32) = G_IMPLICIT_DEF
	; CHECK: $w0 = COPY [[DEF]](s32)			; CHECK-DAG: $w0 = COPY [[DEF]](s32)
				rovkaUnsubmitted Done Reply Inline Actions Is the order really irrelevant for all of these? If so, maybe commit just the change from CHECK to CHECK-DAG separately. Personally, I wouldn't mind keeping the CHECK lines so we can see what actually changed with this patch. Ditto for the other tests. rovka: Is the order really irrelevant for all of these? If so, maybe commit just the change from CHECK…
				dsandersAuthorUnsubmitted Done Reply Inline Actions The legalizer doesn't provide any guarantees on the order beyond that defs will precede uses. By changing to CHECK-DAG we make the test robust against future changes to the legalizer too. For this patch, the only thing that changed in many cases was the placement of the G_CONSTANT used in the sign-extending shifts dsanders: The legalizer doesn't provide any guarantees on the order beyond that defs will precede uses.
				rovkaUnsubmitted Done Reply Inline Actions I looked in more detail and I agree that the order isn't that important. I still think this is an independent change that you can commit before this patch. Keeping it here makes it a bit difficult to spot the tests that are actually relevant for G_SEXT_INREG. rovka: I looked in more detail and I agree that the order isn't that important. I still think this is…
				dsandersAuthorUnsubmitted Done Reply Inline Actions Sure. I can commit it separately. Keeping it here makes it a bit difficult to spot the tests that are actually relevant for G_SEXT_INREG. It's inclusion in this patch indicates that the test was affected by the addition of G_SEXT_INREG. It takes a different code path to the same end result and slightly peturbs the order in the process. FWIW, I think that makes it relevant to G_SEXT_INREG but I don't mind committing the status-quo tests separately. The two tests that test something other than the maintenance of the status quo are: llvm/unittests/CodeGen/GlobalISel/LegalizerHelperTest.cpp llvm/unittests/CodeGen/GlobalISel/PatternMatchTest.cpp D61290 is the patch that makes G_SEXT_INREG legal for a target and changes the code for that target. dsanders: Sure. I can commit it separately. > Keeping it here makes it a bit difficult to spot the tests…
				rovkaUnsubmitted Done Reply Inline Actions It's inclusion in this patch indicates that the test was affected by the addition of G_SEXT_INREG. It takes a different code path to the same end result and slightly peturbs the order in the process. FWIW, I think that makes it relevant to G_SEXT_INREG but I don't mind committing the status-quo tests separately. You're contradicting yourself a bit. If the order is relevant now, then it will be relevant for future changes as well, so you shouldn't switch to CHECK-DAG. rovka: > It's inclusion in this patch indicates that the test was affected by the addition of…
				dsandersAuthorUnsubmitted Done Reply Inline Actions It's inclusion in this patch indicates that the test was affected by the addition of G_SEXT_INREG. It takes a different code path to the same end result and slightly peturbs the order in the process. FWIW, I think that makes it relevant to G_SEXT_INREG but I don't mind committing the status-quo tests separately. You're contradicting yourself a bit. If the order is relevant now, then it will be relevant for future changes as well, so you shouldn't switch to CHECK-DAG. The test is relevant because it checks that the lowering path produces the same instructions as before (.lower() must cause the intermediate G_SEXT_INREG to lower to the same G_SHL/G_ASHR pair as before). The instruction order (specifically the placement of the G_CONSTANT argument to the G_SHL/G_ASHR pair) is not important because it has no bearing on the correctness of the output. Using CHECK-DAG lets us ignore the unimportant bit (instruction order) while still checking the important bit (opcode, arguments, etc.). dsanders: >> It's inclusion in this patch indicates that the test was affected by the addition of…
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s1) = G_TRUNC %0(s64)			%1:_(s1) = G_TRUNC %0(s64)
	%19:_(s32) = G_ANYEXT %1(s1)			%19:_(s32) = G_ANYEXT %1(s1)
	$w0 = COPY %19(s32)			$w0 = COPY %19(s32)
	%2:_(s8) = G_TRUNC %0(s64)			%2:_(s8) = G_TRUNC %0(s64)
	%20:_(s32) = G_ANYEXT %2(s8)			%20:_(s32) = G_ANYEXT %2(s8)
	$w0 = COPY %20(s32)			$w0 = COPY %20(s32)
	%3:_(s16) = G_TRUNC %0(s64)			%3:_(s16) = G_TRUNC %0(s64)
	▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	...			...
	---			---
	name: test_anyext_anyext			name: test_anyext_anyext
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $w0			liveins: $w0

	; CHECK-LABEL: name: test_anyext_anyext			; CHECK-LABEL: name: test_anyext_anyext
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: $w0 = COPY [[COPY1]](s32)			; CHECK-DAG: $w0 = COPY [[COPY1]](s32)
	%0:_(s32) = COPY $w0			%0:_(s32) = COPY $w0
	%1:_(s1) = G_TRUNC %0(s32)			%1:_(s1) = G_TRUNC %0(s32)
	%2:_(s8) = G_ANYEXT %1(s1)			%2:_(s8) = G_ANYEXT %1(s1)
	%3:_(s32) = G_ANYEXT %2(s8)			%3:_(s32) = G_ANYEXT %2(s8)
	$w0 = COPY %3(s32)			$w0 = COPY %3(s32)

	...			...
	---			---
	name: test_anyext_sext			name: test_anyext_sext
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $w0			liveins: $w0

	; CHECK-LABEL: name: test_anyext_sext			; CHECK-LABEL: name: test_anyext_sext
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 31			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 31
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]]			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]]
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]]			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]]
	; CHECK: $w0 = COPY [[ASHR]](s32)			; CHECK-DAG: $w0 = COPY [[ASHR]](s32)
	%0:_(s32) = COPY $w0			%0:_(s32) = COPY $w0
	%1:_(s1) = G_TRUNC %0(s32)			%1:_(s1) = G_TRUNC %0(s32)
	%2:_(s8) = G_SEXT %1(s1)			%2:_(s8) = G_SEXT %1(s1)
	%3:_(s32) = G_ANYEXT %2(s8)			%3:_(s32) = G_ANYEXT %2(s8)
	$w0 = COPY %3(s32)			$w0 = COPY %3(s32)

	...			...
	---			---
	name: test_anyext_zext			name: test_anyext_zext
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $w0			liveins: $w0

	; CHECK-LABEL: name: test_anyext_zext			; CHECK-LABEL: name: test_anyext_zext
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 1			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY1]], [[C]]			; CHECK-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY1]], [[C]]
	; CHECK: $w0 = COPY [[AND]](s32)			; CHECK-DAG: $w0 = COPY [[AND]](s32)
	%0:_(s32) = COPY $w0			%0:_(s32) = COPY $w0
	%1:_(s1) = G_TRUNC %0(s32)			%1:_(s1) = G_TRUNC %0(s32)
	%2:_(s8) = G_ZEXT %1(s1)			%2:_(s8) = G_ZEXT %1(s1)
	%3:_(s32) = G_ANYEXT %2(s8)			%3:_(s32) = G_ANYEXT %2(s8)
	$w0 = COPY %3(s32)			$w0 = COPY %3(s32)

	...			...
	---			---
	▲ Show 20 Lines • Show All 194 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/GlobalISel/legalize-gep.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s			# RUN: llc -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s
	---			---
	name: test_gep_small			name: test_gep_small
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_gep_small			; CHECK-LABEL: name: test_gep_small
	; CHECK: [[COPY:%[0-9]+]]:_(p0) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(p0) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 56			; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 56
	; CHECK: [[COPY2:%[0-9]+]]:_(s64) = COPY [[COPY1]](s64)			; CHECK-DAG: [[COPY2:%[0-9]+]]:_(s64) = COPY [[COPY1]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY2]], [[C]]			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY2]], [[C]]
	; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[C]]			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[C]]
	; CHECK: [[GEP:%[0-9]+]]:_(p0) = G_GEP [[COPY]], [[ASHR]](s64)			; CHECK-DAG: [[GEP:%[0-9]+]]:_(p0) = G_GEP [[COPY]], [[ASHR]](s64)
	; CHECK: $x0 = COPY [[GEP]](p0)			; CHECK-DAG: $x0 = COPY [[GEP]](p0)
	%0:_(p0) = COPY $x0			%0:_(p0) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s8) = G_TRUNC %1(s64)			%2:_(s8) = G_TRUNC %1(s64)
	%3:_(p0) = G_GEP %0, %2(s8)			%3:_(p0) = G_GEP %0, %2(s8)
	$x0 = COPY %3(p0)			$x0 = COPY %3(p0)

	...			...

llvm/test/CodeGen/AArch64/GlobalISel/legalize-itofp.mir

	Show First 20 Lines • Show All 141 Lines • ▼ Show 20 Lines


	---			---
	name: test_sitofp_s32_s1			name: test_sitofp_s32_s1
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $w0			liveins: $w0
	; CHECK-LABEL: name: test_sitofp_s32_s1			; CHECK-LABEL: name: test_sitofp_s32_s1
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 31			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 31
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)			; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)
	; CHECK: $w0 = COPY [[SITOFP]](s32)			; CHECK-DAG: $w0 = COPY [[SITOFP]](s32)
	%0:_(s32) = COPY $w0			%0:_(s32) = COPY $w0
	%1:_(s1) = G_TRUNC %0			%1:_(s1) = G_TRUNC %0
	%2:_(s32) = G_SITOFP %1			%2:_(s32) = G_SITOFP %1
	$w0 = COPY %2			$w0 = COPY %2
	...			...

	---			---
	name: test_uitofp_s32_s1			name: test_uitofp_s32_s1
	Show All 14 Lines
	...			...

	---			---
	name: test_sitofp_s64_s8			name: test_sitofp_s64_s8
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $w0			liveins: $w0
	; CHECK-LABEL: name: test_sitofp_s64_s8			; CHECK-LABEL: name: test_sitofp_s64_s8
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: [[SITOFP:%[0-9]+]]:_(s64) = G_SITOFP [[ASHR]](s32)			; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s64) = G_SITOFP [[ASHR]](s32)
	; CHECK: $x0 = COPY [[SITOFP]](s64)			; CHECK-DAG: $x0 = COPY [[SITOFP]](s64)
	%0:_(s32) = COPY $w0			%0:_(s32) = COPY $w0
	%1:_(s8) = G_TRUNC %0			%1:_(s8) = G_TRUNC %0
	%2:_(s64) = G_SITOFP %1			%2:_(s64) = G_SITOFP %1
	$x0 = COPY %2			$x0 = COPY %2
	...			...

	---			---
	name: test_uitofp_s64_s8			name: test_uitofp_s64_s8
	▲ Show 20 Lines • Show All 42 Lines • ▼ Show 20 Lines
	...			...

	---			---
	name: test_sitofp_s32_s16			name: test_sitofp_s32_s16
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $w0			liveins: $w0
	; CHECK-LABEL: name: test_sitofp_s32_s16			; CHECK-LABEL: name: test_sitofp_s32_s16
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $w0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)			; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)
	; CHECK: $w0 = COPY [[SITOFP]](s32)			; CHECK-DAG: $w0 = COPY [[SITOFP]](s32)
	%0:_(s32) = COPY $w0			%0:_(s32) = COPY $w0
	%1:_(s16) = G_TRUNC %0			%1:_(s16) = G_TRUNC %0
	%2:_(s32) = G_SITOFP %1			%2:_(s32) = G_SITOFP %1
	$w0 = COPY %2			$w0 = COPY %2
	...			...

	---			---
	name: test_uitofp_s32_s16			name: test_uitofp_s32_s16
	Show All 15 Lines

llvm/test/CodeGen/AArch64/GlobalISel/legalize-rem.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s			# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s
	---			---
	name: test_urem_64			name: test_urem_64
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_urem_64			; CHECK-LABEL: name: test_urem_64
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: [[UDIV:%[0-9]+]]:_(s64) = G_UDIV [[COPY]], [[COPY1]]			; CHECK-DAG: [[UDIV:%[0-9]+]]:_(s64) = G_UDIV [[COPY]], [[COPY1]]
	; CHECK: [[MUL:%[0-9]+]]:_(s64) = G_MUL [[UDIV]], [[COPY1]]			; CHECK-DAG: [[MUL:%[0-9]+]]:_(s64) = G_MUL [[UDIV]], [[COPY1]]
	; CHECK: [[SUB:%[0-9]+]]:_(s64) = G_SUB [[COPY]], [[MUL]]			; CHECK-DAG: [[SUB:%[0-9]+]]:_(s64) = G_SUB [[COPY]], [[MUL]]
	; CHECK: $x0 = COPY [[SUB]](s64)			; CHECK-DAG: $x0 = COPY [[SUB]](s64)
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s64) = G_UREM %0, %1			%2:_(s64) = G_UREM %0, %1
	$x0 = COPY %2(s64)			$x0 = COPY %2(s64)

	...			...
	---			---
	name: test_srem_32			name: test_srem_32
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_srem_32			; CHECK-LABEL: name: test_srem_32
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[TRUNC]], [[TRUNC1]]			; CHECK-DAG: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[TRUNC]], [[TRUNC1]]
	; CHECK: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[SDIV]], [[TRUNC1]]			; CHECK-DAG: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[SDIV]], [[TRUNC1]]
	; CHECK: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[TRUNC]], [[MUL]]			; CHECK-DAG: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[TRUNC]], [[MUL]]
	; CHECK: $w0 = COPY [[SUB]](s32)			; CHECK-DAG: $w0 = COPY [[SUB]](s32)
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s32) = G_TRUNC %0(s64)			%2:_(s32) = G_TRUNC %0(s64)
	%3:_(s32) = G_TRUNC %1(s64)			%3:_(s32) = G_TRUNC %1(s64)
	%4:_(s32) = G_SREM %2, %3			%4:_(s32) = G_SREM %2, %3
	$w0 = COPY %4(s32)			$w0 = COPY %4(s32)

	...			...
	---			---
	name: test_srem_8			name: test_srem_8
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_srem_8			; CHECK-LABEL: name: test_srem_8
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC]], [[C]]			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC]], [[C]]
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]]			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]]
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC1]], [[C]]			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC1]], [[C]]
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]]			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]]
	; CHECK: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]			; CHECK-DAG: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]
	; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)			; CHECK-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)
	; CHECK: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[COPY2]], [[TRUNC2]]			; CHECK-DAG: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[COPY2]], [[TRUNC2]]
	; CHECK: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[MUL]](s32)			; CHECK-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[MUL]](s32)
	; CHECK: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[TRUNC3]], [[COPY3]]			; CHECK-DAG: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[TRUNC3]], [[COPY3]]
	; CHECK: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)			; CHECK-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)
	; CHECK: $w0 = COPY [[COPY4]](s32)			; CHECK-DAG: $w0 = COPY [[COPY4]](s32)
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s8) = G_TRUNC %0(s64)			%2:_(s8) = G_TRUNC %0(s64)
	%3:_(s8) = G_TRUNC %1(s64)			%3:_(s8) = G_TRUNC %1(s64)
	%4:_(s8) = G_SREM %2, %3			%4:_(s8) = G_SREM %2, %3
	%5:_(s32) = G_ANYEXT %4(s8)			%5:_(s32) = G_ANYEXT %4(s8)
	$w0 = COPY %5(s32)			$w0 = COPY %5(s32)

	...			...
	---			---
	name: test_frem			name: test_frem
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_frem			; CHECK-LABEL: name: test_frem
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: ADJCALLSTACKDOWN 0, 0, implicit-def $sp, implicit $sp			; CHECK-DAG: ADJCALLSTACKDOWN 0, 0, implicit-def $sp, implicit $sp
	; CHECK: $d0 = COPY [[COPY]](s64)			; CHECK-DAG: $d0 = COPY [[COPY]](s64)
	; CHECK: $d1 = COPY [[COPY1]](s64)			; CHECK-DAG: $d1 = COPY [[COPY1]](s64)
	; CHECK: BL &fmod, csr_aarch64_aapcs, implicit-def $lr, implicit $sp, implicit $d0, implicit $d1, implicit-def $d0			; CHECK-DAG: BL &fmod, csr_aarch64_aapcs, implicit-def $lr, implicit $sp, implicit $d0, implicit $d1, implicit-def $d0
	; CHECK: [[COPY2:%[0-9]+]]:_(s64) = COPY $d0			; CHECK-DAG: [[COPY2:%[0-9]+]]:_(s64) = COPY $d0
	; CHECK: ADJCALLSTACKUP 0, 0, implicit-def $sp, implicit $sp			; CHECK-DAG: ADJCALLSTACKUP 0, 0, implicit-def $sp, implicit $sp
	; CHECK: $x0 = COPY [[COPY2]](s64)			; CHECK-DAG: $x0 = COPY [[COPY2]](s64)
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: ADJCALLSTACKDOWN 0, 0, implicit-def $sp, implicit $sp			; CHECK-DAG: ADJCALLSTACKDOWN 0, 0, implicit-def $sp, implicit $sp
	; CHECK: $s0 = COPY [[TRUNC]](s32)			; CHECK-DAG: $s0 = COPY [[TRUNC]](s32)
	; CHECK: $s1 = COPY [[TRUNC1]](s32)			; CHECK-DAG: $s1 = COPY [[TRUNC1]](s32)
	; CHECK: BL &fmodf, csr_aarch64_aapcs, implicit-def $lr, implicit $sp, implicit $s0, implicit $s1, implicit-def $s0			; CHECK-DAG: BL &fmodf, csr_aarch64_aapcs, implicit-def $lr, implicit $sp, implicit $s0, implicit $s1, implicit-def $s0
	; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY $s0			; CHECK-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY $s0
	; CHECK: ADJCALLSTACKUP 0, 0, implicit-def $sp, implicit $sp			; CHECK-DAG: ADJCALLSTACKUP 0, 0, implicit-def $sp, implicit $sp
	; CHECK: $w0 = COPY [[COPY3]](s32)			; CHECK-DAG: $w0 = COPY [[COPY3]](s32)
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s64) = G_FREM %0, %1			%2:_(s64) = G_FREM %0, %1
	$x0 = COPY %2(s64)			$x0 = COPY %2(s64)
	%3:_(s32) = G_TRUNC %0(s64)			%3:_(s32) = G_TRUNC %0(s64)
	%4:_(s32) = G_TRUNC %1(s64)			%4:_(s32) = G_TRUNC %1(s64)
	%5:_(s32) = G_FREM %3, %4			%5:_(s32) = G_FREM %3, %4
	$w0 = COPY %5(s32)			$w0 = COPY %5(s32)

	...			...

llvm/test/CodeGen/AArch64/GlobalISel/legalize-sext.mir

This file was added.

				# RUN: llc -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s
				---
				name: test_sext_inreg
				body: \|
				bb.0.entry:
				liveins: $w0, $w1
				; CHECK-LABEL: name: test_sext_inreg
				; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $w1
				; CHECK-DAG: [[I25:%[0-9]+]]:_(s32) = G_CONSTANT i32 25
				; CHECK-DAG: [[SEXT1:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[I25]]
				; CHECK-DAG: [[SEXT2:%[0-9]+]]:_(s32) = G_ASHR [[SEXT1]], [[I25]]
				; CHECK-DAG: $w0 = COPY [[SEXT2]](s32)
				%0:_(s32) = COPY $w1
				dsandersAuthorUnsubmitted Done Reply Inline Actions @paquette @aemerson: I ought to draw attention to this. My code appears to be doing the right thing for lower() of G_SEXT_INREG but you appear to have a custom legalization that promotes one of the two constants to s64. Is that intentional? If so, is it also intentional that it only does it for G_ASHR and not G_SHL too? dsanders: @paquette @aemerson: I ought to draw attention to this. My code appears to be doing the right…
				aemersonUnsubmitted Done Reply Inline Actions Yes this is intentional. In order to re-use the existing imported patterns for ashr & lshr we promote the shift amount to i64. For G_SHL we have some custom selection code to deal with non-64b immediates so it's not necessary. aemerson: Yes this is intentional. In order to re-use the existing imported patterns for ashr & lshr we…
				dsandersAuthorUnsubmitted Done Reply Inline Actions That's ok then. Thanks dsanders: That's ok then. Thanks
				%2:_(s32) = G_SEXT_INREG %0(s32), 7
				$w0 = COPY %2(s32)
				...

llvm/test/CodeGen/AArch64/GlobalISel/legalize-shift.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s			# RUN: llc -O0 -march=aarch64 -run-pass=legalizer %s -o - \| FileCheck %s
	---			---
	name: test_shift			name: test_shift
	body: \|			body: \|
	bb.0.entry:			bb.0.entry:
	; CHECK-LABEL: name: test_shift			; CHECK-LABEL: name: test_shift
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[TRUNC]], [[C]]			; CHECK-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[TRUNC]], [[C]]
	; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC1]], [[C1]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC1]], [[C1]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)
	; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)			; CHECK-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
	; CHECK: $w0 = COPY [[COPY2]](s32)			; CHECK-DAG: $w0 = COPY [[COPY2]](s32)
	; CHECK: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC2]], [[C]]			; CHECK-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC2]], [[C]]
	; CHECK: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[AND2:%[0-9]+]]:_(s32) = G_AND [[TRUNC3]], [[C]]			; CHECK-DAG: [[AND2:%[0-9]+]]:_(s32) = G_AND [[TRUNC3]], [[C]]
	; CHECK: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[AND2]], [[AND1]](s32)			; CHECK-DAG: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[AND2]], [[AND1]](s32)
	; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)			; CHECK-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
	; CHECK: $w0 = COPY [[COPY3]](s32)			; CHECK-DAG: $w0 = COPY [[COPY3]](s32)
	; CHECK: [[TRUNC4:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK-DAG: [[TRUNC4:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[AND3:%[0-9]+]]:_(s32) = G_AND [[TRUNC4]], [[C]]			; CHECK-DAG: [[AND3:%[0-9]+]]:_(s32) = G_AND [[TRUNC4]], [[C]]
	; CHECK: [[TRUNC5:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK-DAG: [[TRUNC5:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC5]], [[AND3]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC5]], [[AND3]](s32)
	; CHECK: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SHL1]](s32)			; CHECK-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SHL1]](s32)
	; CHECK: $w0 = COPY [[COPY4]](s32)			; CHECK-DAG: $w0 = COPY [[COPY4]](s32)
	%0:_(s64) = COPY $x0			%0:_(s64) = COPY $x0
	%1:_(s64) = COPY $x1			%1:_(s64) = COPY $x1
	%2:_(s8) = G_TRUNC %0(s64)			%2:_(s8) = G_TRUNC %0(s64)
	%3:_(s8) = G_TRUNC %1(s64)			%3:_(s8) = G_TRUNC %1(s64)
	%4:_(s8) = G_ASHR %2, %3			%4:_(s8) = G_ASHR %2, %3
	%7:_(s32) = G_ANYEXT %4(s8)			%7:_(s32) = G_ANYEXT %4(s8)
	$w0 = COPY %7(s32)			$w0 = COPY %7(s32)
	%5:_(s8) = G_LSHR %2, %3			%5:_(s8) = G_LSHR %2, %3
	▲ Show 20 Lines • Show All 190 Lines • Show Last 20 Lines

llvm/test/CodeGen/AArch64/GlobalISel/legalize-undef.mir

	Show All 18 Lines

	---			---
	name: test_implicit_def_s3			name: test_implicit_def_s3
	body: \|			body: \|
	bb.0:			bb.0:
	liveins:			liveins:

	; CHECK-LABEL: name: test_implicit_def_s3			; CHECK-LABEL: name: test_implicit_def_s3
	; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 61			; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 61
	; CHECK: [[DEF:%[0-9]+]]:_(s64) = G_IMPLICIT_DEF			; CHECK-DAG: [[DEF:%[0-9]+]]:_(s64) = G_IMPLICIT_DEF
	; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[DEF]], [[C]](s64)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[DEF]], [[C]](s64)
				aemersonUnsubmitted Done Reply Inline Actions Did constant and impdef order change? If so we can just re-run the test update script. aemerson: Did constant and impdef order change? If so we can just re-run the test update script.
				dsandersAuthorUnsubmitted Done Reply Inline Actions Did constant and impdef order change? Yes. Should instruction order matter for these tests? Legalization doesn't give any guarantees on the output order and using CHECK-DAG makes the test robust against that dsanders: > Did constant and impdef order change? Yes. Should instruction order matter for these tests?
				arsenmUnsubmitted Done Reply Inline Actions It would, but as it's autogenerated It doesn't/shouldn't uses -DAG checks arsenm: It would, but as it's autogenerated It doesn't/shouldn't uses -DAG checks
	; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[C]](s64)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[C]](s64)
	; CHECK: $x0 = COPY [[ASHR]](s64)			; CHECK-DAG: $x0 = COPY [[ASHR]](s64)
	%0:_(s3) = G_IMPLICIT_DEF			%0:_(s3) = G_IMPLICIT_DEF
	%1:_(s64) = G_SEXT %0			%1:_(s64) = G_SEXT %0
	$x0 = COPY %1(s64)			$x0 = COPY %1(s64)
	...			...

	# FIXME: s2 not correctly handled			# FIXME: s2 not correctly handled

	---			---
	Show All 30 Lines

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

	# RUN: llc -mtriple=aarch64-- -run-pass=legalizer %s \			# RUN: llc -mtriple=aarch64-- -run-pass=legalizer %s \
	# RUN: -mcpu=cortex-a75 -o - 2>&1 \| FileCheck %s --check-prefixes=CHECK			# RUN: -mcpu=cortex-a75 -o - 2>&1 \| FileCheck %s --check-prefixes=CHECK

	# RUN: llc -mtriple=aarch64-- -run-pass=legalizer %s -debug-only=legalizer-info \			# RUN: llc -mtriple=aarch64-- -run-pass=legalizer %s -debug-only=legalizer-info \
	# RUN: -mcpu=cortex-a75 -o - 2>&1 \| FileCheck %s --check-prefixes=CHECK,DEBUG			# RUN: -mcpu=cortex-a75 -o - 2>&1 \| FileCheck %s --check-prefixes=CHECK,DEBUG

	# REQUIRES: asserts			# REQUIRES: asserts

	# The main purpose of this test is to make sure we don't over-relax			# The main purpose of this test is to make sure we don't over-relax
	# LegalizerInfo validation and lose its ability to catch bugs.			# LegalizerInfo validation and lose its ability to catch bugs.
	#			#
	# Watch out for every "SKIPPED: user-defined predicate detected" in the			# Watch out for every "SKIPPED: user-defined predicate detected" in the
	# check-lines below and keep each and every one of them justified.			# check-lines below and keep each and every one of them justified.


	# DEBUG: G_ADD (opcode [[ADD_OPC:[0-9]+]]): 1 type index			# DEBUG: G_ADD (opcode [[ADD_OPC:[0-9]+]]): 1 type index, 0 imm indices
	# DEBUG-NEXT: .. the first uncovered type index: 1, OK			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_SUB (opcode [[SUB_OPC:[0-9]+]]): 1 type index			# DEBUG-NEXT: G_SUB (opcode [[SUB_OPC:[0-9]+]]): 1 type index, 0 imm indices
	# DEBUG-NEXT: .. opcode [[SUB_OPC]] is aliased to [[ADD_OPC]]			# DEBUG-NEXT: .. opcode [[SUB_OPC]] is aliased to [[ADD_OPC]]
	# DEBUG-NEXT: .. opcode [[ADD_OPC]] is aliased to 0
	# DEBUG-NEXT: .. the first uncovered type index: 1, OK			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_MUL (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_MUL (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
	#			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
	# DEBUG-NEXT: G_SDIV (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_UDIV (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_SREM (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_UREM (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_AND (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_OR (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_XOR (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_IMPLICIT_DEF (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_PHI (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FRAME_INDEX (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_GLOBAL_VALUE (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_EXTRACT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_UNMERGE_VALUES (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_INSERT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_MERGE_VALUES (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_BUILD_VECTOR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_BUILD_VECTOR_TRUNC (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_CONCAT_VECTORS (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_PTRTOINT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_INTTOPTR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_BITCAST (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_INTRINSIC_TRUNC (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_INTRINSIC_ROUND (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_LOAD (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_SEXTLOAD (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ZEXTLOAD (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_STORE (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMIC_CMPXCHG_WITH_SUCCESS (opcode {{[0-9]+}}): 3 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMIC_CMPXCHG (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_XCHG (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_ADD (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_SUB (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_AND (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_NAND (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_ATOMICRMW_OR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_XOR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_MAX (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_MIN (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_UMAX (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ATOMICRMW_UMIN (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_BRCOND (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_BRINDIRECT (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_INTRINSIC (opcode {{[0-9]+}}): 0 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_INTRINSIC_W_SIDE_EFFECTS (opcode {{[0-9]+}}): 0 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_ANYEXT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_TRUNC (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_CONSTANT (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FCONSTANT (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_VASTART (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_VAARG (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SEXT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ZEXT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_SHL (opcode {{[0-9]+}}): 2 type indices
	# DEBUG:.. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_LSHR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ASHR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_ICMP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_FCMP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SELECT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_UADDO (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_UADDE (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_USUBO (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_USUBE (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SADDO (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SADDE (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_SSUBO (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SSUBE (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_UMULO (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SMULO (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_UMULH (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_SMULH (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FADD (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FSUB (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FMUL (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FMA (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_FDIV (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FREM (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FPOW (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FEXP (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FEXP2 (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FLOG (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FLOG2 (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FLOG10 (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FNEG (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_FPEXT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_FPTRUNC (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_FPTOSI (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_FPTOUI (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_SITOFP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_UITOFP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_FABS (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_FCANONICALIZE (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_GEP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK
	#
	# DEBUG-NEXT: G_PTR_MASK (opcode {{[0-9]+}}): 1 type index
	# DEBUG: .. the first uncovered type index: 1, OK
	#
	# DEBUG-NEXT: G_BR (opcode {{[0-9]+}}): 0 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#
	# DEBUG-NEXT: G_INSERT_VECTOR_ELT (opcode {{[0-9]+}}): 3 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_EXTRACT_VECTOR_ELT (opcode {{[0-9]+}}): 3 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_SHUFFLE_VECTOR (opcode {{[0-9]+}}): 3 type indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#
	# DEBUG-NEXT: G_CTTZ (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined
	#			#
	# DEBUG-NEXT: G_CTTZ_ZERO_UNDEF (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_SDIV (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_CTLZ (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_UDIV (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. the first uncovered type index: 2, OK			# DEBUG-NEXT: .. opcode 39 is aliased to 38
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_CTLZ_ZERO_UNDEF (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_SREM (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_CTPOP (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_UREM (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: no rules defined			# DEBUG-NEXT: .. opcode 41 is aliased to 40
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_BSWAP (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_AND (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG-NEXT: .. opcode 42 is aliased to 35
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_FCEIL (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_OR (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected			# DEBUG-NEXT: .. opcode 43 is aliased to 35
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_FCOS (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_XOR (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG-NEXT: .. opcode 44 is aliased to 35
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_FSIN (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_IMPLICIT_DEF (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
	#			#
	# DEBUG-NEXT: G_FSQRT (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_PHI (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_FFLOOR (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_FRAME_INDEX (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_FRINT (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_GLOBAL_VALUE (opcode {{[0-9]+}}): 1 type index, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected			# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
	#			#
	# DEBUG-NEXT: G_FNEARBYINT (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_EXTRACT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
	# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected			# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				#
				# DEBUG-NEXT: G_UNMERGE_VALUES (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				#
				# DEBUG-NEXT: G_INSERT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				#
				# DEBUG-NEXT: G_MERGE_VALUES (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				#
				# DEBUG-NEXT: G_BUILD_VECTOR (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				#
				# DEBUG-NEXT: G_BUILD_VECTOR_TRUNC (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				#
				# DEBUG-NEXT: G_CONCAT_VECTORS (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				#
				# DEBUG-NEXT: G_PTRTOINT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				#
				# DEBUG-NEXT: G_INTTOPTR (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				#
				# DEBUG-NEXT: G_BITCAST (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				#
				# DEBUG-NEXT: G_INTRINSIC_TRUNC (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_INTRINSIC_ROUND (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_LOAD (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_SEXTLOAD (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ZEXTLOAD (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_STORE (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMIC_CMPXCHG_WITH_SUCCESS (opcode {{[0-9]+}}): 3 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMIC_CMPXCHG (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_XCHG (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_ADD (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_SUB (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_AND (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_NAND (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_ATOMICRMW_OR (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_XOR (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_MAX (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_MIN (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_UMAX (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ATOMICRMW_UMIN (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_BRCOND (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_BRINDIRECT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_INTRINSIC (opcode {{[0-9]+}}): 0 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_INTRINSIC_W_SIDE_EFFECTS (opcode {{[0-9]+}}): 0 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_ANYEXT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_TRUNC (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_CONSTANT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FCONSTANT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_VASTART (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_VAARG (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SEXT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_SEXT_INREG (opcode {{[0-9]+}}): 1 type index, 1 imm index
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ZEXT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_SHL (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_LSHR (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ASHR (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_ICMP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FCMP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SELECT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_UADDO (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_UADDE (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_USUBO (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_USUBE (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SADDO (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SADDE (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_SSUBO (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SSUBE (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_UMULO (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SMULO (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_UMULH (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SMULH (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FADD (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FSUB (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FMUL (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FMA (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FDIV (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FREM (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FPOW (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FEXP (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FEXP2 (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FLOG (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FLOG2 (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FLOG10 (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FNEG (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FPEXT (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FPTRUNC (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FPTOSI (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FPTOUI (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_SITOFP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_UITOFP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FABS (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FCANONICALIZE (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_GEP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_PTR_MASK (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_BR (opcode {{[0-9]+}}): 0 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_INSERT_VECTOR_ELT (opcode {{[0-9]+}}): 3 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_EXTRACT_VECTOR_ELT (opcode {{[0-9]+}}): 3 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_SHUFFLE_VECTOR (opcode {{[0-9]+}}): 3 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_CTTZ (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_CTTZ_ZERO_UNDEF (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_CTLZ (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 2, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_CTLZ_ZERO_UNDEF (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_CTPOP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: no rules defined
				# DEBUG-NEXT: G_BSWAP (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FCEIL (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FCOS (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FSIN (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. the first uncovered type index: 1, OK
				# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
				# DEBUG-NEXT: G_FSQRT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FFLOOR (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FRINT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: G_FNEARBYINT (opcode {{[0-9]+}}): 1 type index, 0 imm indices
				# DEBUG-NEXT: .. opcode {{[0-9]+}} is aliased to {{[0-9]+}}
				# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
				# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected

	# CHECK-NOT: ill-defined			# CHECK-NOT: ill-defined

	---			---
	name: dummy			name: dummy
	body: \|			body: \|
	bb.0:			bb.0:
	...			...

llvm/test/CodeGen/AMDGPU/GlobalISel/artifact-combiner-sext.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -mtriple=amdgcn-mesa-mesa3d -mcpu=tahiti -run-pass=legalizer -global-isel-abort=0 %s -o - \| FileCheck %s			# RUN: llc -O0 -mtriple=amdgcn-mesa-mesa3d -mcpu=tahiti -run-pass=legalizer -global-isel-abort=0 %s -o - \| FileCheck %s
	# FIXME: Remove -global-isel-abort=0 when G_TRUNC legality handled			# FIXME: Remove -global-isel-abort=0 when G_TRUNC legality handled

	---			---
	name: test_sext_trunc_v2s32_to_v2s16_to_v2s32			name: test_sext_trunc_v2s32_to_v2s16_to_v2s32
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; CHECK-LABEL: name: test_sext_trunc_v2s32_to_v2s16_to_v2s32			; CHECK-LABEL: name: test_sext_trunc_v2s32_to_v2s16_to_v2s32
	; CHECK: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY [[COPY]](<2 x s32>)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(<2 x s32>) = COPY [[COPY]](<2 x s32>)
	; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)			; CHECK-DAG: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<2 x s32>)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C]](s32)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)			; CHECK-DAG: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)
	; CHECK: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)			; CHECK-DAG: $vgpr0_vgpr1 = COPY [[BUILD_VECTOR]](<2 x s32>)
	%0:_(<2 x s32>) = COPY $vgpr0_vgpr1			%0:_(<2 x s32>) = COPY $vgpr0_vgpr1
	%1:_(<2 x s16>) = G_TRUNC %0			%1:_(<2 x s16>) = G_TRUNC %0
	%2:_(<2 x s32>) = G_SEXT %1			%2:_(<2 x s32>) = G_SEXT %1
	$vgpr0_vgpr1 = COPY %2			$vgpr0_vgpr1 = COPY %2
	...			...

	---			---
	name: test_sext_trunc_v2s32_to_v2s16_to_v2s64			name: test_sext_trunc_v2s32_to_v2s16_to_v2s64
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; CHECK-LABEL: name: test_sext_trunc_v2s32_to_v2s16_to_v2s64			; CHECK-LABEL: name: test_sext_trunc_v2s32_to_v2s16_to_v2s64
	; CHECK: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
	; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 48			; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 48
	; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)			; CHECK-DAG: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
	; CHECK: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[UV]](s32)			; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[UV]](s32)
	; CHECK: [[ANYEXT1:%[0-9]+]]:_(s64) = G_ANYEXT [[UV1]](s32)			; CHECK-DAG: [[ANYEXT1:%[0-9]+]]:_(s64) = G_ANYEXT [[UV1]](s32)
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT]], [[TRUNC]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT]], [[TRUNC]](s32)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[SHL1:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT1]], [[TRUNC1]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT1]], [[TRUNC1]](s32)
	; CHECK: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC2]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC2]](s32)
	; CHECK: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s64) = G_ASHR [[SHL1]], [[TRUNC3]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s64) = G_ASHR [[SHL1]], [[TRUNC3]](s32)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[ASHR]](s64), [[ASHR1]](s64)			; CHECK-DAG: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s64>) = G_BUILD_VECTOR [[ASHR]](s64), [[ASHR1]](s64)
	; CHECK: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)			; CHECK-DAG: $vgpr0_vgpr1_vgpr2_vgpr3 = COPY [[BUILD_VECTOR]](<2 x s64>)
	%0:_(<2 x s32>) = COPY $vgpr0_vgpr1			%0:_(<2 x s32>) = COPY $vgpr0_vgpr1
	%1:_(<2 x s16>) = G_TRUNC %0			%1:_(<2 x s16>) = G_TRUNC %0
	%2:_(<2 x s64>) = G_SEXT %1			%2:_(<2 x s64>) = G_SEXT %1
	$vgpr0_vgpr1_vgpr2_vgpr3 = COPY %2			$vgpr0_vgpr1_vgpr2_vgpr3 = COPY %2
	...			...

	---			---
	name: test_sext_trunc_v2s32_to_v2s8_to_v2s16			name: test_sext_trunc_v2s32_to_v2s8_to_v2s16
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; CHECK-LABEL: name: test_sext_trunc_v2s32_to_v2s8_to_v2s16			; CHECK-LABEL: name: test_sext_trunc_v2s32_to_v2s8_to_v2s16
	; CHECK: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $vgpr0_vgpr1
	; CHECK: [[C:%[0-9]+]]:_(s16) = G_CONSTANT i16 8			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(<2 x s16>) = G_TRUNC [[COPY]](<2 x s32>)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s16>) = G_BUILD_VECTOR [[C]](s16), [[C]](s16)			; The G_SEXT_INREG doesn't lower here because G_TRUNC is both illegal and
	; CHECK: [[TRUNC:%[0-9]+]]:_(<2 x s16>) = G_TRUNC [[COPY]](<2 x s32>)			; unable to legalize. This prevents further legalization.
	; CHECK: [[SHL:%[0-9]+]]:_(<2 x s16>) = G_SHL [[TRUNC]], [[BUILD_VECTOR]](<2 x s16>)			; CHECK-DAG: [[SEXT_INREG:%[0-9]+]]:_(<2 x s16>) = G_SEXT_INREG [[TRUNC]], 8
	; CHECK: [[ASHR:%[0-9]+]]:_(<2 x s16>) = G_ASHR [[SHL]], [[BUILD_VECTOR]](<2 x s16>)			; CHECK-DAG: $vgpr0 = COPY [[SEXT_INREG]](<2 x s16>)
				dsandersAuthorUnsubmitted Done Reply Inline Actions @arsenm: This test currently doesn't complete legalization because the pre-existing G_TRUNC causes the legalizer to stop processing since it's illegal and it's unable to legalize it. This blocks the lowering of the G_SEXT_INREG which appears later in the work list. dsanders: @arsenm: This test currently doesn't complete legalization because the pre-existing G_TRUNC…
	; CHECK: $vgpr0 = COPY [[ASHR]](<2 x s16>)
	%0:_(<2 x s32>) = COPY $vgpr0_vgpr1			%0:_(<2 x s32>) = COPY $vgpr0_vgpr1
	%1:_(<2 x s8>) = G_TRUNC %0			%1:_(<2 x s8>) = G_TRUNC %0
	%2:_(<2 x s16>) = G_SEXT %1			%2:_(<2 x s16>) = G_SEXT %1
	$vgpr0 = COPY %2			$vgpr0 = COPY %2
	...			...

	---			---
	name: test_sext_trunc_v3s32_to_v3s16_to_v3s32			name: test_sext_trunc_v3s32_to_v3s16_to_v3s32
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1_vgpr2			liveins: $vgpr0_vgpr1_vgpr2

	; CHECK-LABEL: name: test_sext_trunc_v3s32_to_v3s16_to_v3s32			; CHECK-LABEL: name: test_sext_trunc_v3s32_to_v3s16_to_v3s32
	; CHECK: [[COPY:%[0-9]+]]:_(<3 x s32>) = COPY $vgpr0_vgpr1_vgpr2			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<3 x s32>) = COPY $vgpr0_vgpr1_vgpr2
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[COPY1:%[0-9]+]]:_(<3 x s32>) = COPY [[COPY]](<3 x s32>)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(<3 x s32>) = COPY [[COPY]](<3 x s32>)
	; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32), [[UV2:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<3 x s32>)			; CHECK-DAG: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32), [[UV2:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY1]](<3 x s32>)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C]](s32)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C]](s32)
	; CHECK: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[UV2]], [[C]](s32)			; CHECK-DAG: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[UV2]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; CHECK: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)			; CHECK-DAG: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<3 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32), [[ASHR2]](s32)			; CHECK-DAG: [[BUILD_VECTOR:%[0-9]+]]:_(<3 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32), [[ASHR2]](s32)
	; CHECK: $vgpr0_vgpr1_vgpr2 = COPY [[BUILD_VECTOR]](<3 x s32>)			; CHECK-DAG: $vgpr0_vgpr1_vgpr2 = COPY [[BUILD_VECTOR]](<3 x s32>)
	%0:_(<3 x s32>) = COPY $vgpr0_vgpr1_vgpr2			%0:_(<3 x s32>) = COPY $vgpr0_vgpr1_vgpr2
	%1:_(<3 x s16>) = G_TRUNC %0			%1:_(<3 x s16>) = G_TRUNC %0
	%2:_(<3 x s32>) = G_SEXT %1			%2:_(<3 x s32>) = G_SEXT %1
	$vgpr0_vgpr1_vgpr2 = COPY %2			$vgpr0_vgpr1_vgpr2 = COPY %2
	...			...

llvm/test/CodeGen/AMDGPU/GlobalISel/combine-ext-legalizer.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -mtriple=amdgcn-mesa-mesa3d -mcpu=fiji -O0 -run-pass=legalizer %s -o - \| FileCheck %s			# RUN: llc -mtriple=amdgcn-mesa-mesa3d -mcpu=fiji -O0 -run-pass=legalizer %s -o - \| FileCheck %s

	---			---
	name: test_sext_trunc_i64_i32_i64			name: test_sext_trunc_i64_i32_i64
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; CHECK-LABEL: name: test_sext_trunc_i64_i32_i64			; CHECK-LABEL: name: test_sext_trunc_i64_i32_i64
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s64) = COPY $vgpr0_vgpr1
	; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 32			; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 32
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s64) = COPY [[COPY]](s64)
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY1]], [[TRUNC]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY1]], [[TRUNC]](s32)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC1]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC1]](s32)
	; CHECK: $vgpr0_vgpr1 = COPY [[ASHR]](s64)			; CHECK-DAG: $vgpr0_vgpr1 = COPY [[ASHR]](s64)
	%0:_(s64) = COPY $vgpr0_vgpr1			%0:_(s64) = COPY $vgpr0_vgpr1
	%1:_(s32) = G_TRUNC %0			%1:_(s32) = G_TRUNC %0
	%2:_(s64) = G_SEXT %1			%2:_(s64) = G_SEXT %1
	$vgpr0_vgpr1 = COPY %2			$vgpr0_vgpr1 = COPY %2
	...			...

	---			---
	name: test_zext_trunc_i64_i32_i64			name: test_zext_trunc_i64_i32_i64
	Show All 15 Lines

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-ashr.mir

	Show First 20 Lines • Show All 118 Lines • ▼ Show 20 Lines

	---			---
	name: test_ashr_s16_s32			name: test_ashr_s16_s32
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0, $vgpr1			liveins: $vgpr0, $vgpr1

	; SI-LABEL: name: test_ashr_s16_s32			; SI-LABEL: name: test_ashr_s16_s32
	; SI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; SI-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; SI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; SI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; SI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; SI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; SI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; SI-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; SI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)			; SI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
	; SI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; SI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; SI: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[COPY1]](s32)			; SI-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[COPY1]](s32)
	; SI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)			; SI-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
	; SI: $vgpr0 = COPY [[COPY3]](s32)			; SI-DAG: $vgpr0 = COPY [[COPY3]](s32)
	; VI-LABEL: name: test_ashr_s16_s32			; VI-LABEL: name: test_ashr_s16_s32
	; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)			; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[TRUNC]], [[COPY1]](s32)			; VI: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[TRUNC]], [[COPY1]](s32)
	; VI: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR]](s16)			; VI: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR]](s16)
	; VI: $vgpr0 = COPY [[ANYEXT]](s32)			; VI: $vgpr0 = COPY [[ANYEXT]](s32)
	; GFX9-LABEL: name: test_ashr_s16_s32			; GFX9-LABEL: name: test_ashr_s16_s32
	Show All 13 Lines

	---			---
	name: test_ashr_s16_s16			name: test_ashr_s16_s16
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0, $vgpr1			liveins: $vgpr0, $vgpr1

	; SI-LABEL: name: test_ashr_s16_s16			; SI-LABEL: name: test_ashr_s16_s16
	; SI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; SI-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; SI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; SI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; SI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535			; SI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
	; SI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; SI-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; SI: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; SI-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; SI: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; SI-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; SI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; SI-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; SI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C1]](s32)			; SI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C1]](s32)
	; SI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; SI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; SI: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)			; SI-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)
	; SI: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)			; SI-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
	; SI: $vgpr0 = COPY [[COPY4]](s32)			; SI-DAG: $vgpr0 = COPY [[COPY4]](s32)
	; VI-LABEL: name: test_ashr_s16_s16			; VI-LABEL: name: test_ashr_s16_s16
	; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)			; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
	; VI: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)			; VI: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[COPY1]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[TRUNC]], [[TRUNC1]](s16)			; VI: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[TRUNC]], [[TRUNC1]](s16)
	; VI: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR]](s16)			; VI: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR]](s16)
	; VI: $vgpr0 = COPY [[ANYEXT]](s32)			; VI: $vgpr0 = COPY [[ANYEXT]](s32)
	Show All 16 Lines

	---			---
	name: test_ashr_s16_i8			name: test_ashr_s16_i8
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0, $vgpr1			liveins: $vgpr0, $vgpr1

	; SI-LABEL: name: test_ashr_s16_i8			; SI-LABEL: name: test_ashr_s16_i8
	; SI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; SI-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; SI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; SI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; SI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; SI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; SI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; SI-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; SI: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; SI-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; SI: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; SI-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; SI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; SI-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; SI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C1]](s32)			; SI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C1]](s32)
	; SI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; SI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; SI: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)			; SI-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)
	; SI: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)			; SI-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
	; SI: $vgpr0 = COPY [[COPY4]](s32)			; SI-DAG: $vgpr0 = COPY [[COPY4]](s32)
	; VI-LABEL: name: test_ashr_s16_i8			; VI-LABEL: name: test_ashr_s16_i8
	; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)			; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; VI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; VI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; VI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[C]](s32)			; VI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[C]](s32)
	; VI: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[COPY3]]			; VI: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[COPY3]]
	Show All 24 Lines

	---			---
	name: test_ashr_i8_i8			name: test_ashr_i8_i8
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0, $vgpr1			liveins: $vgpr0, $vgpr1

	; SI-LABEL: name: test_ashr_i8_i8			; SI-LABEL: name: test_ashr_i8_i8
	; SI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; SI-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; SI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; SI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; SI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; SI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; SI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; SI-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; SI: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; SI-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; SI: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; SI-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; SI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; SI-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; SI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C1]](s32)			; SI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C1]](s32)
	; SI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; SI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; SI: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)			; SI-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]](s32)
	; SI: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)			; SI-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)
	; SI: $vgpr0 = COPY [[COPY4]](s32)			; SI-DAG: $vgpr0 = COPY [[COPY4]](s32)
	; VI-LABEL: name: test_ashr_i8_i8			; VI-LABEL: name: test_ashr_i8_i8
	; VI: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; VI-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; VI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; VI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; VI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; VI-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; VI: [[COPY3:%[0-9]+]]:_(s32) = COPY [[C]](s32)			; VI-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[C]](s32)
	; VI: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[COPY3]]			; VI-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[COPY3]]
	; VI: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[AND]](s32)			; VI-DAG: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[AND]](s32)
	; VI: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 8			; VI-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 8
	; VI: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[C1]](s32)			; VI-DAG: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[C1]](s32)
	; VI: [[TRUNC2:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)			; VI-DAG: [[TRUNC2:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
	; VI: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC2]], [[TRUNC1]](s16)			; VI-DAG: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC2]], [[TRUNC1]](s16)
	; VI: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[TRUNC1]](s16)			; VI-DAG: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[TRUNC1]](s16)
	; VI: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[ASHR]], [[TRUNC]](s16)			; VI-DAG: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[ASHR]], [[TRUNC]](s16)
	; VI: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR1]](s16)			; VI-DAG: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR1]](s16)
	; VI: $vgpr0 = COPY [[ANYEXT]](s32)			; VI-DAG: $vgpr0 = COPY [[ANYEXT]](s32)
	; GFX9-LABEL: name: test_ashr_i8_i8			; GFX9-LABEL: name: test_ashr_i8_i8
	; GFX9: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; GFX9-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; GFX9: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1			; GFX9-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $vgpr1
	; GFX9: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; GFX9-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; GFX9: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; GFX9-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; GFX9: [[COPY3:%[0-9]+]]:_(s32) = COPY [[C]](s32)			; GFX9-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[C]](s32)
	; GFX9: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[COPY3]]			; GFX9-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[COPY3]]
	; GFX9: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[AND]](s32)			; GFX9-DAG: [[TRUNC:%[0-9]+]]:_(s16) = G_TRUNC [[AND]](s32)
	; GFX9: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 8			; GFX9-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 8
	; GFX9: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[C1]](s32)			; GFX9-DAG: [[TRUNC1:%[0-9]+]]:_(s16) = G_TRUNC [[C1]](s32)
	; GFX9: [[TRUNC2:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)			; GFX9-DAG: [[TRUNC2:%[0-9]+]]:_(s16) = G_TRUNC [[COPY]](s32)
	; GFX9: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC2]], [[TRUNC1]](s16)			; GFX9-DAG: [[SHL:%[0-9]+]]:_(s16) = G_SHL [[TRUNC2]], [[TRUNC1]](s16)
	; GFX9: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[TRUNC1]](s16)			; GFX9-DAG: [[ASHR:%[0-9]+]]:_(s16) = G_ASHR [[SHL]], [[TRUNC1]](s16)
	; GFX9: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[ASHR]], [[TRUNC]](s16)			; GFX9-DAG: [[ASHR1:%[0-9]+]]:_(s16) = G_ASHR [[ASHR]], [[TRUNC]](s16)
	; GFX9: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR1]](s16)			; GFX9-DAG: [[ANYEXT:%[0-9]+]]:_(s32) = G_ANYEXT [[ASHR1]](s16)
	; GFX9: $vgpr0 = COPY [[ANYEXT]](s32)			; GFX9-DAG: $vgpr0 = COPY [[ANYEXT]](s32)
	%0:_(s32) = COPY $vgpr0			%0:_(s32) = COPY $vgpr0
	%1:_(s32) = COPY $vgpr1			%1:_(s32) = COPY $vgpr1
	%2:_(s8) = G_TRUNC %0			%2:_(s8) = G_TRUNC %0
	%3:_(s8) = G_TRUNC %1			%3:_(s8) = G_TRUNC %1
	%4:_(s8) = G_ASHR %2, %3			%4:_(s8) = G_ASHR %2, %3
	%5:_(s32) = G_ANYEXT %4			%5:_(s32) = G_ANYEXT %4
	$vgpr0 = COPY %5			$vgpr0 = COPY %5
	...			...
	▲ Show 20 Lines • Show All 1,202 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-extract-vector-elt.mir

	Show First 20 Lines • Show All 181 Lines • ▼ Show 20 Lines

	---			---
	name: extract_vector_elt_0_v2i8_i32			name: extract_vector_elt_0_v2i8_i32

	body: \|			body: \|
	bb.0:			bb.0:

	; CHECK-LABEL: name: extract_vector_elt_0_v2i8_i32			; CHECK-LABEL: name: extract_vector_elt_0_v2i8_i32
	; CHECK: [[DEF:%[0-9]+]]:_(<2 x s32>) = G_IMPLICIT_DEF			; CHECK-DAG: [[DEF:%[0-9]+]]:_(<2 x s32>) = G_IMPLICIT_DEF
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
	; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY [[DEF]](<2 x s32>)			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY [[DEF]](<2 x s32>)
	; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)			; CHECK-DAG: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C1]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C1]](s32)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C1]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C1]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C1]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C1]](s32)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)			; CHECK-DAG: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)
	; CHECK: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[BUILD_VECTOR]](<2 x s32>), [[C]](s32)			; CHECK-DAG: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[BUILD_VECTOR]](<2 x s32>), [[C]](s32)
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[EVEC]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[EVEC]](s32)
	; CHECK: $vgpr0 = COPY [[COPY1]](s32)			; CHECK-DAG: $vgpr0 = COPY [[COPY1]](s32)
	%0:_(<2 x s8>) = G_IMPLICIT_DEF			%0:_(<2 x s8>) = G_IMPLICIT_DEF
	%1:_(s32) = G_CONSTANT i32 0			%1:_(s32) = G_CONSTANT i32 0
	%2:_(s8) = G_EXTRACT_VECTOR_ELT %0, %1			%2:_(s8) = G_EXTRACT_VECTOR_ELT %0, %1
	%3:_(s32) = G_ANYEXT %2			%3:_(s32) = G_ANYEXT %2
	$vgpr0 = COPY %3			$vgpr0 = COPY %3
	...			...

	---			---
	Show All 17 Lines

	---			---
	name: extract_vector_elt_0_v2i1_i32			name: extract_vector_elt_0_v2i1_i32

	body: \|			body: \|
	bb.0:			bb.0:

	; CHECK-LABEL: name: extract_vector_elt_0_v2i1_i32			; CHECK-LABEL: name: extract_vector_elt_0_v2i1_i32
	; CHECK: [[DEF:%[0-9]+]]:_(<2 x s32>) = G_IMPLICIT_DEF			; CHECK-DAG: [[DEF:%[0-9]+]]:_(<2 x s32>) = G_IMPLICIT_DEF
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
	; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 31			; CHECK-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 31
	; CHECK: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY [[DEF]](<2 x s32>)			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY [[DEF]](<2 x s32>)
	; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)			; CHECK-DAG: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C1]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C1]](s32)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C1]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C1]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C1]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C1]](s32)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)			; CHECK-DAG: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)
	; CHECK: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[BUILD_VECTOR]](<2 x s32>), [[C]](s32)			; CHECK-DAG: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[BUILD_VECTOR]](<2 x s32>), [[C]](s32)
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[EVEC]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[EVEC]](s32)
	; CHECK: $vgpr0 = COPY [[COPY1]](s32)			; CHECK-DAG: $vgpr0 = COPY [[COPY1]](s32)
	%0:_(<2 x s1>) = G_IMPLICIT_DEF			%0:_(<2 x s1>) = G_IMPLICIT_DEF
	%1:_(s32) = G_CONSTANT i32 0			%1:_(s32) = G_CONSTANT i32 0
	%2:_(s1) = G_EXTRACT_VECTOR_ELT %0, %1			%2:_(s1) = G_EXTRACT_VECTOR_ELT %0, %1
	%3:_(s32) = G_ANYEXT %2			%3:_(s32) = G_ANYEXT %2
	$vgpr0 = COPY %3			$vgpr0 = COPY %3
	...			...

	---			---
	name: extract_vector_elt_0_v2i1_i1			name: extract_vector_elt_0_v2i1_i1

	body: \|			body: \|
	bb.0:			bb.0:

	; CHECK-LABEL: name: extract_vector_elt_0_v2i1_i1			; CHECK-LABEL: name: extract_vector_elt_0_v2i1_i1
	; CHECK: [[DEF:%[0-9]+]]:_(<2 x s32>) = G_IMPLICIT_DEF			; CHECK-DAG: [[DEF:%[0-9]+]]:_(<2 x s32>) = G_IMPLICIT_DEF
	; CHECK: [[C:%[0-9]+]]:_(s1) = G_CONSTANT i1 false			; CHECK-DAG: [[C:%[0-9]+]]:_(s1) = G_CONSTANT i1 false
	; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 31			; CHECK-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 31
	; CHECK: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY [[DEF]](<2 x s32>)			; CHECK-DAG: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY [[DEF]](<2 x s32>)
	; CHECK: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)			; CHECK-DAG: [[UV:%[0-9]+]]:_(s32), [[UV1:%[0-9]+]]:_(s32) = G_UNMERGE_VALUES [[COPY]](<2 x s32>)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C1]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[UV]], [[C1]](s32)
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C1]](s32)			; CHECK-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[UV1]], [[C1]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C1]](s32)			; CHECK-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C1]](s32)
	; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)			; CHECK-DAG: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[ASHR]](s32), [[ASHR1]](s32)
	; CHECK: [[SEXT:%[0-9]+]]:_(s32) = G_SEXT [[C]](s1)			; CHECK-DAG: [[SEXT:%[0-9]+]]:_(s32) = G_SEXT [[C]](s1)
	; CHECK: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[BUILD_VECTOR]](<2 x s32>), [[SEXT]](s32)			; CHECK-DAG: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[BUILD_VECTOR]](<2 x s32>), [[SEXT]](s32)
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[EVEC]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[EVEC]](s32)
	; CHECK: $vgpr0 = COPY [[COPY1]](s32)			; CHECK-DAG: $vgpr0 = COPY [[COPY1]](s32)
	%0:_(<2 x s1>) = G_IMPLICIT_DEF			%0:_(<2 x s1>) = G_IMPLICIT_DEF
	%1:_(s1) = G_CONSTANT i1 false			%1:_(s1) = G_CONSTANT i1 false
	%2:_(s1) = G_EXTRACT_VECTOR_ELT %0, %1			%2:_(s1) = G_EXTRACT_VECTOR_ELT %0, %1
	%3:_(s32) = G_ANYEXT %2			%3:_(s32) = G_ANYEXT %2
	$vgpr0 = COPY %3			$vgpr0 = COPY %3
	...			...

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sext.mir

	Show All 17 Lines

	---			---
	name: test_sext_s16_to_s64			name: test_sext_s16_to_s64
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0			liveins: $vgpr0

	; CHECK-LABEL: name: test_sext_s16_to_s64			; CHECK-LABEL: name: test_sext_s16_to_s64
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 48			; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 48
	; CHECK: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[COPY]](s32)			; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[COPY]](s32)
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT]], [[TRUNC]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT]], [[TRUNC]](s32)
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)			; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[C]](s64)
	; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC1]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC1]](s32)
	; CHECK: $vgpr0_vgpr1 = COPY [[ASHR]](s64)			; CHECK-DAG: $vgpr0_vgpr1 = COPY [[ASHR]](s64)
	%0:_(s32) = COPY $vgpr0			%0:_(s32) = COPY $vgpr0
	%1:_(s16) = G_TRUNC %0			%1:_(s16) = G_TRUNC %0
	%2:_(s64) = G_SEXT %1			%2:_(s64) = G_SEXT %1
	$vgpr0_vgpr1 = COPY %2			$vgpr0_vgpr1 = COPY %2
	...			...

	---			---
	name: test_sext_s16_to_s32			name: test_sext_s16_to_s32
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0			liveins: $vgpr0

	; CHECK-LABEL: name: test_sext_s16_to_s32			; CHECK-LABEL: name: test_sext_s16_to_s32
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: $vgpr0 = COPY [[ASHR]](s32)			; CHECK-DAG: $vgpr0 = COPY [[ASHR]](s32)
	%0:_(s32) = COPY $vgpr0			%0:_(s32) = COPY $vgpr0
	%1:_(s16) = G_TRUNC %0			%1:_(s16) = G_TRUNC %0
	%2:_(s32) = G_SEXT %1			%2:_(s32) = G_SEXT %1
	$vgpr0 = COPY %2			$vgpr0 = COPY %2
	...			...

	---			---
	name: test_sext_i1_to_s32			name: test_sext_i1_to_s32
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0			liveins: $vgpr0

	; CHECK-LABEL: name: test_sext_i1_to_s32			; CHECK-LABEL: name: test_sext_i1_to_s32
	; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0			; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $vgpr0
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 31			; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 31
	; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; CHECK: $vgpr0 = COPY [[ASHR]](s32)			; CHECK-DAG: $vgpr0 = COPY [[ASHR]](s32)
	%0:_(s32) = COPY $vgpr0			%0:_(s32) = COPY $vgpr0
	%1:_(s1) = G_TRUNC %0			%1:_(s1) = G_TRUNC %0
	%2:_(s32) = G_SEXT %1			%2:_(s32) = G_SEXT %1
	$vgpr0 = COPY %2			$vgpr0 = COPY %2
	...			...

	---			---
	name: test_sext_v2s16_to_v2s32			name: test_sext_v2s16_to_v2s32
	▲ Show 20 Lines • Show All 114 Lines • Show Last 20 Lines

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sextload-flat.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -mtriple=amdgcn-mesa-mesa3d -mcpu=fiji -run-pass=legalizer -o - %s \| FileCheck %s -check-prefixes=GCN,SI			# RUN: llc -mtriple=amdgcn-mesa-mesa3d -mcpu=fiji -run-pass=legalizer -o - %s \| FileCheck %s -check-prefixes=GCN,SI
	# RUN: llc -mtriple=amdgcn-mesa-mesa3d -mcpu=tahiti -run-pass=legalizer -o - %s \| FileCheck %s -check-prefixes=GCN,VI			# RUN: llc -mtriple=amdgcn-mesa-mesa3d -mcpu=tahiti -run-pass=legalizer -o - %s \| FileCheck %s -check-prefixes=GCN,VI
	---			---
	name: test_sextload_flat_i32_i8			name: test_sextload_flat_i32_i8
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; SI-LABEL: name: test_sextload_flat_i32_i8			; SI-LABEL: name: test_sextload_flat_i32_i8
	; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 1)			; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 1)
	; SI: $vgpr0 = COPY [[SEXTLOAD]](s32)			; SI: $vgpr0 = COPY [[SEXTLOAD]](s32)
	; VI-LABEL: name: test_sextload_flat_i32_i8			; VI-LABEL: name: test_sextload_flat_i32_i8
	; VI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; VI-DAG: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; VI: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 1)			; VI-DAG: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 1)
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; VI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)			; VI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)
	; VI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; VI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; VI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; VI: $vgpr0 = COPY [[ASHR]](s32)			; VI-DAG: $vgpr0 = COPY [[ASHR]](s32)
	%0:_(p0) = COPY $vgpr0_vgpr1			%0:_(p0) = COPY $vgpr0_vgpr1
	%1:_(s32) = G_SEXTLOAD %0 :: (load 1, addrspace 0)			%1:_(s32) = G_SEXTLOAD %0 :: (load 1, addrspace 0)
	$vgpr0 = COPY %1			$vgpr0 = COPY %1
	...			...
	---			---
	name: test_sextload_flat_i32_i16			name: test_sextload_flat_i32_i16
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; SI-LABEL: name: test_sextload_flat_i32_i16			; SI-LABEL: name: test_sextload_flat_i32_i16
	; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 2)			; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 2)
	; SI: $vgpr0 = COPY [[SEXTLOAD]](s32)			; SI: $vgpr0 = COPY [[SEXTLOAD]](s32)
	; VI-LABEL: name: test_sextload_flat_i32_i16			; VI-LABEL: name: test_sextload_flat_i32_i16
	; VI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; VI-DAG: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; VI: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 2)			; VI-DAG: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 2)
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; VI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)			; VI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)
	; VI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; VI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; VI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; VI: $vgpr0 = COPY [[ASHR]](s32)			; VI-DAG: $vgpr0 = COPY [[ASHR]](s32)
	%0:_(p0) = COPY $vgpr0_vgpr1			%0:_(p0) = COPY $vgpr0_vgpr1
	%1:_(s32) = G_SEXTLOAD %0 :: (load 2, addrspace 0)			%1:_(s32) = G_SEXTLOAD %0 :: (load 2, addrspace 0)
	$vgpr0 = COPY %1			$vgpr0 = COPY %1
	...			...
	---			---
	name: test_sextload_flat_i31_i8			name: test_sextload_flat_i31_i8
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; SI-LABEL: name: test_sextload_flat_i31_i8			; SI-LABEL: name: test_sextload_flat_i31_i8
	; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 1)			; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 1)
	; SI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[SEXTLOAD]](s32)			; SI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[SEXTLOAD]](s32)
	; SI: $vgpr0 = COPY [[COPY1]](s32)			; SI: $vgpr0 = COPY [[COPY1]](s32)
	; VI-LABEL: name: test_sextload_flat_i31_i8			; VI-LABEL: name: test_sextload_flat_i31_i8
	; VI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; VI-DAG: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; VI: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 1)			; VI-DAG: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 1)
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; VI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)			; VI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)
	; VI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; VI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; VI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; VI: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ASHR]](s32)			; VI-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ASHR]](s32)
	; VI: $vgpr0 = COPY [[COPY2]](s32)			; VI-DAG: $vgpr0 = COPY [[COPY2]](s32)
	%0:_(p0) = COPY $vgpr0_vgpr1			%0:_(p0) = COPY $vgpr0_vgpr1
	%1:_(s31) = G_SEXTLOAD %0 :: (load 1, addrspace 0)			%1:_(s31) = G_SEXTLOAD %0 :: (load 1, addrspace 0)
	%2:_(s32) = G_ANYEXT %1			%2:_(s32) = G_ANYEXT %1
	$vgpr0 = COPY %2			$vgpr0 = COPY %2
	...			...
	---			---
	name: test_sextload_flat_i64_i8			name: test_sextload_flat_i64_i8
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; SI-LABEL: name: test_sextload_flat_i64_i8			; SI-LABEL: name: test_sextload_flat_i64_i8
	; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 1)			; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 1)
	; SI: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[SEXTLOAD]](s32)			; SI: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[SEXTLOAD]](s32)
	; SI: $vgpr0_vgpr1 = COPY [[SEXT]](s64)			; SI: $vgpr0_vgpr1 = COPY [[SEXT]](s64)
	; VI-LABEL: name: test_sextload_flat_i64_i8			; VI-LABEL: name: test_sextload_flat_i64_i8
	; VI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; VI-DAG: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; VI: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 1)			; VI-DAG: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 1)
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; VI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)			; VI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)
	; VI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; VI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; VI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; VI: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[ASHR]](s32)			; VI-DAG: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[ASHR]](s32)
	; VI: $vgpr0_vgpr1 = COPY [[SEXT]](s64)			; VI-DAG: $vgpr0_vgpr1 = COPY [[SEXT]](s64)
	%0:_(p0) = COPY $vgpr0_vgpr1			%0:_(p0) = COPY $vgpr0_vgpr1
	%1:_(s64) = G_SEXTLOAD %0 :: (load 1, addrspace 0)			%1:_(s64) = G_SEXTLOAD %0 :: (load 1, addrspace 0)
	$vgpr0_vgpr1 = COPY %1			$vgpr0_vgpr1 = COPY %1
	...			...
	---			---
	name: test_sextload_flat_i64_i16			name: test_sextload_flat_i64_i16
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $vgpr0_vgpr1			liveins: $vgpr0_vgpr1

	; SI-LABEL: name: test_sextload_flat_i64_i16			; SI-LABEL: name: test_sextload_flat_i64_i16
	; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; SI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 2)			; SI: [[SEXTLOAD:%[0-9]+]]:_(s32) = G_SEXTLOAD [[COPY]](p0) :: (load 2)
	; SI: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[SEXTLOAD]](s32)			; SI: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[SEXTLOAD]](s32)
	; SI: $vgpr0_vgpr1 = COPY [[SEXT]](s64)			; SI: $vgpr0_vgpr1 = COPY [[SEXT]](s64)
	; VI-LABEL: name: test_sextload_flat_i64_i16			; VI-LABEL: name: test_sextload_flat_i64_i16
	; VI: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1			; VI-DAG: [[COPY:%[0-9]+]]:_(p0) = COPY $vgpr0_vgpr1
	; VI: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 2)			; VI-DAG: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[COPY]](p0) :: (load 2)
	; VI: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; VI-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; VI: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)			; VI-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[LOAD]](s32)
	; VI: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)			; VI-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[C]](s32)
	; VI: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; VI-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; VI: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[ASHR]](s32)			; VI-DAG: [[SEXT:%[0-9]+]]:_(s64) = G_SEXT [[ASHR]](s32)
	; VI: $vgpr0_vgpr1 = COPY [[SEXT]](s64)			; VI-DAG: $vgpr0_vgpr1 = COPY [[SEXT]](s64)
	%0:_(p0) = COPY $vgpr0_vgpr1			%0:_(p0) = COPY $vgpr0_vgpr1
	%1:_(s64) = G_SEXTLOAD %0 :: (load 2, addrspace 0)			%1:_(s64) = G_SEXTLOAD %0 :: (load 2, addrspace 0)
	$vgpr0_vgpr1 = COPY %1			$vgpr0_vgpr1 = COPY %1
	...			...
	---			---
	name: test_sextload_flat_i64_i32			name: test_sextload_flat_i64_i32
	body: \|			body: \|
	bb.0:			bb.0:
	Show All 16 Lines

llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-divmod.mir

	Show First 20 Lines • Show All 116 Lines • ▼ Show 20 Lines
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $r0, $r1			liveins: $r0, $r1

	; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; The G_TRUNC will combine with the extensions introduced by the legalizer,			; The G_TRUNC will combine with the extensions introduced by the legalizer,
	; leading to the following complicated sequences.			; leading to the following complicated sequences.
	; CHECK: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]			; CHECK-DAG: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]
	; CHECK: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]			; CHECK-DAG: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]
	; CHECK: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]			; CHECK-DAG: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]
	; CHECK: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]			; CHECK-DAG: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]
	; CHECK: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]			; CHECK-DAG: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]
	; CHECK: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]			; CHECK-DAG: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]
	%0(s32) = COPY $r0			%0(s32) = COPY $r0
	%1(s16) = G_TRUNC %0(s32)			%1(s16) = G_TRUNC %0(s32)
	%2(s32) = COPY $r1			%2(s32) = COPY $r1
	%3(s16) = G_TRUNC %2(s32)			%3(s16) = G_TRUNC %2(s32)
	; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]			; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]
	; SOFT-NOT: G_SDIV			; SOFT-NOT: G_SDIV
	; SOFT: ADJCALLSTACKDOWN			; SOFT: ADJCALLSTACKDOWN
	; SOFT-DAG: $r0 = COPY [[X32]]			; SOFT-DAG: $r0 = COPY [[X32]]
	▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $r0, $r1			liveins: $r0, $r1

	; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; The G_TRUNC will combine with the extensions introduced by the legalizer,			; The G_TRUNC will combine with the extensions introduced by the legalizer,
	; leading to the following complicated sequences.			; leading to the following complicated sequences.
	; CHECK: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]			; CHECK-DAG: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]
	; CHECK: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]			; CHECK-DAG: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]
	; CHECK: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]			; CHECK-DAG: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]
	; CHECK: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]			; CHECK-DAG: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]
	; CHECK: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]			; CHECK-DAG: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]
	; CHECK: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]			; CHECK-DAG: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]
	%0(s32) = COPY $r0			%0(s32) = COPY $r0
	%1(s8) = G_TRUNC %0(s32)			%1(s8) = G_TRUNC %0(s32)
	%2(s32) = COPY $r1			%2(s32) = COPY $r1
	%3(s8) = G_TRUNC %2(s32)			%3(s8) = G_TRUNC %2(s32)
	; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]			; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]
	; SOFT-NOT: G_SDIV			; SOFT-NOT: G_SDIV
	; SOFT: ADJCALLSTACKDOWN			; SOFT: ADJCALLSTACKDOWN
	; SOFT-DAG: $r0 = COPY [[X32]]			; SOFT-DAG: $r0 = COPY [[X32]]
	▲ Show 20 Lines • Show All 163 Lines • ▼ Show 20 Lines
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $r0, $r1			liveins: $r0, $r1

	; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; The G_TRUNC will combine with the extensions introduced by the legalizer,			; The G_TRUNC will combine with the extensions introduced by the legalizer,
	; leading to the following complicated sequences.			; leading to the following complicated sequences.
	; CHECK: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK-DAG: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]			; CHECK-DAG: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]
	; CHECK: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]			; CHECK-DAG: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]
	; CHECK: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]			; CHECK-DAG: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]
	; CHECK: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]			; CHECK-DAG: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]
	; CHECK: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]			; CHECK-DAG: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]
	; CHECK: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]			; CHECK-DAG: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]
	%0(s32) = COPY $r0			%0(s32) = COPY $r0
	%1(s16) = G_TRUNC %0(s32)			%1(s16) = G_TRUNC %0(s32)
	%2(s32) = COPY $r1			%2(s32) = COPY $r1
	%3(s16) = G_TRUNC %2(s32)			%3(s16) = G_TRUNC %2(s32)
	; HWDIV: [[Q32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]			; HWDIV: [[Q32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]
	; HWDIV: [[P32:%[0-9]+]]:_(s32) = G_MUL [[Q32]], [[Y32]]			; HWDIV: [[P32:%[0-9]+]]:_(s32) = G_MUL [[Q32]], [[Y32]]
	; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SUB [[X32]], [[P32]]			; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SUB [[X32]], [[P32]]
	; SOFT-NOT: G_SREM			; SOFT-NOT: G_SREM
	▲ Show 20 Lines • Show All 89 Lines • ▼ Show 20 Lines
	body: \|			body: \|
	bb.0:			bb.0:
	liveins: $r0, $r1			liveins: $r0, $r1

	; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0			; CHECK-DAG: [[R0:%[0-9]+]]:_(s32) = COPY $r0
	; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1			; CHECK-DAG: [[R1:%[0-9]+]]:_(s32) = COPY $r1
	; The G_TRUNC will combine with the extensions introduced by the legalizer,			; The G_TRUNC will combine with the extensions introduced by the legalizer,
	; leading to the following complicated sequences.			; leading to the following complicated sequences.
	; CHECK: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK-DAG: [[BITS:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; CHECK: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]			; CHECK-DAG: [[X:%[0-9]+]]:_(s32) = COPY [[R0]]
	; CHECK: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]			; CHECK-DAG: [[SHIFTEDX:%[0-9]+]]:_(s32) = G_SHL [[X]], [[BITS]]
	; CHECK: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]			; CHECK-DAG: [[X32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDX]], [[BITS]]
	; CHECK: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]			; CHECK-DAG: [[Y:%[0-9]+]]:_(s32) = COPY [[R1]]
	; CHECK: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]			; CHECK-DAG: [[SHIFTEDY:%[0-9]+]]:_(s32) = G_SHL [[Y]], [[BITS]]
	; CHECK: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]			; CHECK-DAG: [[Y32:%[0-9]+]]:_(s32) = G_ASHR [[SHIFTEDY]], [[BITS]]
	%0(s32) = COPY $r0			%0(s32) = COPY $r0
	%1(s8) = G_TRUNC %0(s32)			%1(s8) = G_TRUNC %0(s32)
	%2(s32) = COPY $r1			%2(s32) = COPY $r1
	%3(s8) = G_TRUNC %2(s32)			%3(s8) = G_TRUNC %2(s32)
	; HWDIV: [[Q32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]			; HWDIV: [[Q32:%[0-9]+]]:_(s32) = G_SDIV [[X32]], [[Y32]]
	; HWDIV: [[P32:%[0-9]+]]:_(s32) = G_MUL [[Q32]], [[Y32]]			; HWDIV: [[P32:%[0-9]+]]:_(s32) = G_MUL [[Q32]], [[Y32]]
	; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SUB [[X32]], [[P32]]			; HWDIV: [[R32:%[0-9]+]]:_(s32) = G_SUB [[X32]], [[P32]]
	; SOFT-NOT: G_SREM			; SOFT-NOT: G_SREM
	▲ Show 20 Lines • Show All 74 Lines • Show Last 20 Lines

llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-exts.mir

# RUN: llc -mtriple arm-- -run-pass=legalizer %s -o - \| FileCheck %s		# RUN: llc -mtriple arm-- -run-pass=legalizer %s -o - \| FileCheck %s
# RUN: llc -mtriple thumb-- -mattr=+v6t2 -run-pass=legalizer %s -o - \| FileCheck %s		# RUN: llc -mtriple thumb-- -mattr=+v6t2 -run-pass=legalizer %s -o - \| FileCheck %s
--- \|		--- \|
define void @test_zext_s16() { ret void }		define void @test_zext_s16() { ret void }
define void @test_sext_s8() { ret void }		define void @test_sext_s8() { ret void }
		define void @test_sext_inreg_s8() { ret void }
define void @test_anyext_s1() { ret void }		define void @test_anyext_s1() { ret void }
...		...
---		---
name: test_zext_s16		name: test_zext_s16
# CHECK-LABEL: name: test_zext_s16		# CHECK-LABEL: name: test_zext_s16
legalized: false		legalized: false
# CHECK: legalized: true		# CHECK: legalized: true
regBankSelected: false		regBankSelected: false
Show All 35 Lines	bb.0:
%1(s8) = G_LOAD %0(p0) :: (load 1)		%1(s8) = G_LOAD %0(p0) :: (load 1)
%2(s32) = G_SEXT %1		%2(s32) = G_SEXT %1
; G_SEXT with s8 is legal, so we should find it unchanged in the output		; G_SEXT with s8 is legal, so we should find it unchanged in the output
; CHECK: {{%[0-9]+}}:_(s32) = G_SEXT {{%[0-9]+}}		; CHECK: {{%[0-9]+}}:_(s32) = G_SEXT {{%[0-9]+}}
$r0 = COPY %2(s32)		$r0 = COPY %2(s32)
BX_RET 14, $noreg, implicit $r0		BX_RET 14, $noreg, implicit $r0
...		...
---		---
		name: test_sext_inreg_s8
		# CHECK-LABEL: name: test_sext_inreg_s8
		legalized: false
		# CHECK: legalized: true
		regBankSelected: false
		selected: false
		tracksRegLiveness: true
		registers:
		- { id: 0, class: _ }
		- { id: 1, class: _ }
		- { id: 2, class: _ }
		body: \|
		bb.0:
		liveins: $r0

		%0(p0) = COPY $r0
		%1(s32) = G_LOAD %0(p0) :: (load 4)
		%2(s32) = G_SEXT_INREG %1, 8
		; G_SEXT_INREG should be lowered to a shift pair
		; CHECK: [[T1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
		; CHECK: [[T2:%[0-9]+]]:_(s32) = G_SHL {{%[0-9]+}}, [[T1]]
		; CHECK: {{%[0-9]+}}:_(s32) = G_ASHR [[T2]], [[T1]]
		$r0 = COPY %2(s32)
		BX_RET 14, $noreg, implicit $r0
		...
		---
name: test_anyext_s1		name: test_anyext_s1
# CHECK-LABEL: name: test_anyext_s1		# CHECK-LABEL: name: test_anyext_s1
legalized: false		legalized: false
# CHECK: legalized: true		# CHECK: legalized: true
regBankSelected: false		regBankSelected: false
selected: false		selected: false
tracksRegLiveness: true		tracksRegLiveness: true
registers:		registers:
Show All 15 Lines

llvm/test/CodeGen/Mips/GlobalISel/legalizer/add.mir

	Show All 40 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: add_i8_sext			; MIPS32-LABEL: name: add_i8_sext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY2]], [[COPY3]]
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_ADD %1, %0			%4:_(s8) = G_ADD %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: add_i16_sext			; MIPS32-LABEL: name: add_i16_sext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY2]], [[COPY3]]
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[ADD]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_ADD %1, %0			%4:_(s16) = G_ADD %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/test/CodeGen/Mips/GlobalISel/legalizer/constants.mir

	Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	...			...
	---			---
	name: signed_i16			name: signed_i16
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	; MIPS32-LABEL: name: signed_i16			; MIPS32-LABEL: name: signed_i16
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -32768			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -32768
	; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY [[C]](s32)			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY [[C]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[C1]]			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[C1]]
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]]			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]]
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%0:_(s16) = G_CONSTANT i16 -32768			%0:_(s16) = G_CONSTANT i16 -32768
	%1:_(s32) = G_SEXT %0(s16)			%1:_(s32) = G_SEXT %0(s16)
	$v0 = COPY %1(s32)			$v0 = COPY %1(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: signed_i8			name: signed_i8
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	; MIPS32-LABEL: name: signed_i8			; MIPS32-LABEL: name: signed_i8
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -128			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -128
	; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY [[C]](s32)			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY [[C]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[C1]]			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY]], [[C1]]
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]]			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]]
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%0:_(s8) = G_CONSTANT i8 -128			%0:_(s8) = G_CONSTANT i8 -128
	%1:_(s32) = G_SEXT %0(s8)			%1:_(s32) = G_SEXT %0(s8)
	$v0 = COPY %1(s32)			$v0 = COPY %1(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: unsigned_i16			name: unsigned_i16
	▲ Show 20 Lines • Show All 74 Lines • Show Last 20 Lines

llvm/test/CodeGen/Mips/GlobalISel/legalizer/mul.mir

	Show First 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: mul_i8_sext			; MIPS32-LABEL: name: mul_i8_sext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[COPY2]], [[COPY3]]
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[MUL]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[MUL]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_MUL %1, %0			%4:_(s8) = G_MUL %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: mul_i16_sext			; MIPS32-LABEL: name: mul_i16_sext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[MUL:%[0-9]+]]:_(s32) = G_MUL [[COPY2]], [[COPY3]]
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[MUL]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[MUL]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_MUL %1, %0			%4:_(s16) = G_MUL %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 304 Lines • Show Last 20 Lines

llvm/test/CodeGen/Mips/GlobalISel/legalizer/rem_and_div.mir

	Show All 24 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: sdiv_i8			; MIPS32-LABEL: name: sdiv_i8
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)			; MIPS32-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)
	; MIPS32: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; MIPS32-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; MIPS32: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]			; MIPS32-DAG: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)
	; MIPS32: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)			; MIPS32-DAG: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR2]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR2]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_SDIV %1, %0			%4:_(s8) = G_SDIV %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: sdiv_i16			name: sdiv_i16
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: sdiv_i16			; MIPS32-LABEL: name: sdiv_i16
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)			; MIPS32-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)
	; MIPS32: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; MIPS32-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; MIPS32: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]			; MIPS32-DAG: [[SDIV:%[0-9]+]]:_(s32) = G_SDIV [[ASHR]], [[ASHR1]]
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SDIV]](s32)
	; MIPS32: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)			; MIPS32-DAG: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR2]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR2]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_SDIV %1, %0			%4:_(s16) = G_SDIV %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: srem_i8			; MIPS32-LABEL: name: srem_i8
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)			; MIPS32-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)
	; MIPS32: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; MIPS32-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; MIPS32: [[SREM:%[0-9]+]]:_(s32) = G_SREM [[ASHR]], [[ASHR1]]			; MIPS32-DAG: [[SREM:%[0-9]+]]:_(s32) = G_SREM [[ASHR]], [[ASHR1]]
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SREM]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SREM]](s32)
	; MIPS32: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)			; MIPS32-DAG: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR2]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR2]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_SREM %1, %0			%4:_(s8) = G_SREM %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: srem_i16			name: srem_i16
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: srem_i16			; MIPS32-LABEL: name: srem_i16
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY2]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)			; MIPS32-DAG: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY3]], [[C]](s32)
	; MIPS32: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)			; MIPS32-DAG: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[SHL1]], [[C]](s32)
	; MIPS32: [[SREM:%[0-9]+]]:_(s32) = G_SREM [[ASHR]], [[ASHR1]]			; MIPS32-DAG: [[SREM:%[0-9]+]]:_(s32) = G_SREM [[ASHR]], [[ASHR1]]
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SREM]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SREM]](s32)
	; MIPS32: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)			; MIPS32-DAG: [[ASHR2:%[0-9]+]]:_(s32) = G_ASHR [[SHL2]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR2]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR2]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_SREM %1, %0			%4:_(s16) = G_SREM %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: udiv_i8			; MIPS32-LABEL: name: udiv_i8
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; MIPS32-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]			; MIPS32-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
	; MIPS32: [[UDIV:%[0-9]+]]:_(s32) = G_UDIV [[AND]], [[AND1]]			; MIPS32-DAG: [[UDIV:%[0-9]+]]:_(s32) = G_UDIV [[AND]], [[AND1]]
	; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UDIV]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UDIV]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_UDIV %1, %0			%4:_(s8) = G_UDIV %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: udiv_i16			name: udiv_i16
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: udiv_i16			; MIPS32-LABEL: name: udiv_i16
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; MIPS32-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]			; MIPS32-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
	; MIPS32: [[UDIV:%[0-9]+]]:_(s32) = G_UDIV [[AND]], [[AND1]]			; MIPS32-DAG: [[UDIV:%[0-9]+]]:_(s32) = G_UDIV [[AND]], [[AND1]]
	; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UDIV]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UDIV]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_UDIV %1, %0			%4:_(s16) = G_UDIV %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: urem_i8			; MIPS32-LABEL: name: urem_i8
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; MIPS32-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]			; MIPS32-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
	; MIPS32: [[UREM:%[0-9]+]]:_(s32) = G_UREM [[AND]], [[AND1]]			; MIPS32-DAG: [[UREM:%[0-9]+]]:_(s32) = G_UREM [[AND]], [[AND1]]
	; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UREM]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UREM]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_UREM %1, %0			%4:_(s8) = G_UREM %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: urem_i16			name: urem_i16
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: urem_i16			; MIPS32-LABEL: name: urem_i16
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 65535
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]			; MIPS32-DAG: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY2]], [[C]]
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]			; MIPS32-DAG: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
	; MIPS32: [[UREM:%[0-9]+]]:_(s32) = G_UREM [[AND]], [[AND1]]			; MIPS32-DAG: [[UREM:%[0-9]+]]:_(s32) = G_UREM [[AND]], [[AND1]]
	; MIPS32: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UREM]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[UREM]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C1]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C1]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_UREM %1, %0			%4:_(s16) = G_UREM %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 63 Lines • Show Last 20 Lines

llvm/test/CodeGen/Mips/GlobalISel/legalizer/sub.mir

	Show All 39 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: sub_i8_sext			; MIPS32-LABEL: name: sub_i8_sext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[COPY2]], [[COPY3]]
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_SUB %1, %0			%4:_(s8) = G_SUB %1, %0
	%5:_(s32) = G_SEXT %4(s8)			%5:_(s32) = G_SEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	Show All 34 Lines
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: sub_i8_aext			; MIPS32-LABEL: name: sub_i8_aext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[COPY2]], [[COPY3]]
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)
	; MIPS32: $v0 = COPY [[COPY4]](s32)			; MIPS32-DAG: $v0 = COPY [[COPY4]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s8) = G_TRUNC %2(s32)			%0:_(s8) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s8) = G_TRUNC %3(s32)			%1:_(s8) = G_TRUNC %3(s32)
	%4:_(s8) = G_SUB %1, %0			%4:_(s8) = G_SUB %1, %0
	%5:_(s32) = G_ANYEXT %4(s8)			%5:_(s32) = G_ANYEXT %4(s8)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0

	...			...
	---			---
	name: sub_i16_sext			name: sub_i16_sext
	alignment: 2			alignment: 2
	tracksRegLiveness: true			tracksRegLiveness: true
	body: \|			body: \|
	bb.1.entry:			bb.1.entry:
	liveins: $a0, $a1			liveins: $a0, $a1

	; MIPS32-LABEL: name: sub_i16_sext			; MIPS32-LABEL: name: sub_i16_sext
	; MIPS32: liveins: $a0, $a1			; MIPS32: liveins: $a0, $a1
	; MIPS32: [[COPY:%[0-9]+]]:_(s32) = COPY $a0			; MIPS32-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $a0
	; MIPS32: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1			; MIPS32-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY $a1
	; MIPS32: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)			; MIPS32-DAG: [[COPY2:%[0-9]+]]:_(s32) = COPY [[COPY1]](s32)
	; MIPS32: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)			; MIPS32-DAG: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
	; MIPS32: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[COPY2]], [[COPY3]]			; MIPS32-DAG: [[SUB:%[0-9]+]]:_(s32) = G_SUB [[COPY2]], [[COPY3]]
	; MIPS32: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; MIPS32-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; MIPS32: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)			; MIPS32-DAG: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SUB]](s32)
	; MIPS32: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)			; MIPS32-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY4]], [[C]](s32)
	; MIPS32: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)			; MIPS32-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]](s32)
	; MIPS32: $v0 = COPY [[ASHR]](s32)			; MIPS32-DAG: $v0 = COPY [[ASHR]](s32)
	; MIPS32: RetRA implicit $v0			; MIPS32: RetRA implicit $v0
	%2:_(s32) = COPY $a0			%2:_(s32) = COPY $a0
	%0:_(s16) = G_TRUNC %2(s32)			%0:_(s16) = G_TRUNC %2(s32)
	%3:_(s32) = COPY $a1			%3:_(s32) = COPY $a1
	%1:_(s16) = G_TRUNC %3(s32)			%1:_(s16) = G_TRUNC %3(s32)
	%4:_(s16) = G_SUB %1, %0			%4:_(s16) = G_SUB %1, %0
	%5:_(s32) = G_SEXT %4(s16)			%5:_(s32) = G_SEXT %4(s16)
	$v0 = COPY %5(s32)			$v0 = COPY %5(s32)
	RetRA implicit $v0			RetRA implicit $v0
	▲ Show 20 Lines • Show All 181 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/GlobalISel/legalize-ext-x86-64.mir

Show First 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	registers:
- { id: 0, class: _ }		- { id: 0, class: _ }
- { id: 1, class: _ }		- { id: 1, class: _ }
- { id: 2, class: _ }		- { id: 2, class: _ }
body: \|		body: \|
bb.1 (%ir-block.0):		bb.1 (%ir-block.0):
liveins: $edi		liveins: $edi

; CHECK-LABEL: name: test_sext_i1		; CHECK-LABEL: name: test_sext_i1
; CHECK: [[COPY:%[0-9]+]]:_(s8) = COPY $dil		; CHECK-DAG: [[COPY:%[0-9]+]]:_(s8) = COPY $dil
; CHECK: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 63		; CHECK-DAG: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 63
; CHECK: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[COPY]](s8)		; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[COPY]](s8)
; CHECK: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s64)		; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s64)
; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT]], [[TRUNC]](s8)		; CHECK-DAG: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[ANYEXT]], [[TRUNC]](s8)
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s64)		; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s64)
; CHECK: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC1]](s8)		; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s64) = G_ASHR [[SHL]], [[TRUNC1]](s8)
; CHECK: $rax = COPY [[ASHR]](s64)		; CHECK-DAG: $rax = COPY [[ASHR]](s64)
; CHECK: RET 0, implicit $rax		; CHECK: RET 0, implicit $rax
%0(s8) = COPY $dil		%0(s8) = COPY $dil
%1(s1) = G_TRUNC %0(s8)		%1(s1) = G_TRUNC %0(s8)
%2(s64) = G_SEXT %1(s1)		%2(s64) = G_SEXT %1(s1)
$rax = COPY %2(s64)		$rax = COPY %2(s64)
RET 0, implicit $rax		RET 0, implicit $rax

...		...
---		---
▲ Show 20 Lines • Show All 259 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/GlobalISel/x86_64-legalize-sitofp.mir

Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines	registers:
- { id: 2, class: _ }		- { id: 2, class: _ }
- { id: 3, class: _ }		- { id: 3, class: _ }
body: \|		body: \|
bb.1.entry:		bb.1.entry:
liveins: $edi		liveins: $edi

; CHECK-LABEL: name: int8_to_float		; CHECK-LABEL: name: int8_to_float
; CHECK: liveins: $edi		; CHECK: liveins: $edi
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $edi		; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $edi
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24		; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)		; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
; CHECK: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)		; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)		; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)
; CHECK: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)		; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)
; CHECK: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s32)		; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s32)
; CHECK: $xmm0 = COPY [[ANYEXT]](s128)		; CHECK-DAG: $xmm0 = COPY [[ANYEXT]](s128)
; CHECK: RET 0, implicit $xmm0		; CHECK: RET 0, implicit $xmm0
%1:_(s32) = COPY $edi		%1:_(s32) = COPY $edi
%0:_(s8) = G_TRUNC %1(s32)		%0:_(s8) = G_TRUNC %1(s32)
%2:_(s32) = G_SITOFP %0(s8)		%2:_(s32) = G_SITOFP %0(s8)
%3:_(s128) = G_ANYEXT %2(s32)		%3:_(s128) = G_ANYEXT %2(s32)
$xmm0 = COPY %3(s128)		$xmm0 = COPY %3(s128)
RET 0, implicit $xmm0		RET 0, implicit $xmm0

...		...
---		---
name: int16_to_float		name: int16_to_float
alignment: 4		alignment: 4
tracksRegLiveness: true		tracksRegLiveness: true
registers:		registers:
- { id: 0, class: _ }		- { id: 0, class: _ }
- { id: 1, class: _ }		- { id: 1, class: _ }
- { id: 2, class: _ }		- { id: 2, class: _ }
- { id: 3, class: _ }		- { id: 3, class: _ }
body: \|		body: \|
bb.1.entry:		bb.1.entry:
liveins: $edi		liveins: $edi

; CHECK-LABEL: name: int16_to_float		; CHECK-LABEL: name: int16_to_float
; CHECK: liveins: $edi		; CHECK: liveins: $edi
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $edi		; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $edi
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16		; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)		; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
; CHECK: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)		; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)		; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)
; CHECK: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)		; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s32) = G_SITOFP [[ASHR]](s32)
; CHECK: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s32)		; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s32)
; CHECK: $xmm0 = COPY [[ANYEXT]](s128)		; CHECK-DAG: $xmm0 = COPY [[ANYEXT]](s128)
; CHECK: RET 0, implicit $xmm0		; CHECK: RET 0, implicit $xmm0
%1:_(s32) = COPY $edi		%1:_(s32) = COPY $edi
%0:_(s16) = G_TRUNC %1(s32)		%0:_(s16) = G_TRUNC %1(s32)
%2:_(s32) = G_SITOFP %0(s16)		%2:_(s32) = G_SITOFP %0(s16)
%3:_(s128) = G_ANYEXT %2(s32)		%3:_(s128) = G_ANYEXT %2(s32)
$xmm0 = COPY %3(s128)		$xmm0 = COPY %3(s128)
RET 0, implicit $xmm0		RET 0, implicit $xmm0

...		...
▲ Show 20 Lines • Show All 59 Lines • ▼ Show 20 Lines	registers:
- { id: 2, class: _ }		- { id: 2, class: _ }
- { id: 3, class: _ }		- { id: 3, class: _ }
body: \|		body: \|
bb.1.entry:		bb.1.entry:
liveins: $edi		liveins: $edi

; CHECK-LABEL: name: int8_to_double		; CHECK-LABEL: name: int8_to_double
; CHECK: liveins: $edi		; CHECK: liveins: $edi
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $edi		; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $edi
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24		; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)		; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
; CHECK: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)		; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)		; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)
; CHECK: [[SITOFP:%[0-9]+]]:_(s64) = G_SITOFP [[ASHR]](s32)		; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s64) = G_SITOFP [[ASHR]](s32)
; CHECK: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s64)		; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s64)
; CHECK: $xmm0 = COPY [[ANYEXT]](s128)		; CHECK-DAG: $xmm0 = COPY [[ANYEXT]](s128)
; CHECK: RET 0, implicit $xmm0		; CHECK: RET 0, implicit $xmm0
%1:_(s32) = COPY $edi		%1:_(s32) = COPY $edi
%0:_(s8) = G_TRUNC %1(s32)		%0:_(s8) = G_TRUNC %1(s32)
%2:_(s64) = G_SITOFP %0(s8)		%2:_(s64) = G_SITOFP %0(s8)
%3:_(s128) = G_ANYEXT %2(s64)		%3:_(s128) = G_ANYEXT %2(s64)
$xmm0 = COPY %3(s128)		$xmm0 = COPY %3(s128)
RET 0, implicit $xmm0		RET 0, implicit $xmm0

...		...
---		---
name: int16_to_double		name: int16_to_double
alignment: 4		alignment: 4
tracksRegLiveness: true		tracksRegLiveness: true
registers:		registers:
- { id: 0, class: _ }		- { id: 0, class: _ }
- { id: 1, class: _ }		- { id: 1, class: _ }
- { id: 2, class: _ }		- { id: 2, class: _ }
- { id: 3, class: _ }		- { id: 3, class: _ }
body: \|		body: \|
bb.1.entry:		bb.1.entry:
liveins: $edi		liveins: $edi

; CHECK-LABEL: name: int16_to_double		; CHECK-LABEL: name: int16_to_double
; CHECK: liveins: $edi		; CHECK: liveins: $edi
; CHECK: [[COPY:%[0-9]+]]:_(s32) = COPY $edi		; CHECK-DAG: [[COPY:%[0-9]+]]:_(s32) = COPY $edi
; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16		; CHECK-DAG: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)		; CHECK-DAG: [[COPY1:%[0-9]+]]:_(s32) = COPY [[COPY]](s32)
; CHECK: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)		; CHECK-DAG: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[COPY1]], [[TRUNC]](s8)
; CHECK: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)		; CHECK-DAG: [[TRUNC1:%[0-9]+]]:_(s8) = G_TRUNC [[C]](s32)
; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)		; CHECK-DAG: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[TRUNC1]](s8)
; CHECK: [[SITOFP:%[0-9]+]]:_(s64) = G_SITOFP [[ASHR]](s32)		; CHECK-DAG: [[SITOFP:%[0-9]+]]:_(s64) = G_SITOFP [[ASHR]](s32)
; CHECK: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s64)		; CHECK-DAG: [[ANYEXT:%[0-9]+]]:_(s128) = G_ANYEXT [[SITOFP]](s64)
; CHECK: $xmm0 = COPY [[ANYEXT]](s128)		; CHECK-DAG: $xmm0 = COPY [[ANYEXT]](s128)
; CHECK: RET 0, implicit $xmm0		; CHECK: RET 0, implicit $xmm0
%1:_(s32) = COPY $edi		%1:_(s32) = COPY $edi
%0:_(s16) = G_TRUNC %1(s32)		%0:_(s16) = G_TRUNC %1(s32)
%2:_(s64) = G_SITOFP %0(s16)		%2:_(s64) = G_SITOFP %0(s16)
%3:_(s128) = G_ANYEXT %2(s64)		%3:_(s128) = G_ANYEXT %2(s64)
$xmm0 = COPY %3(s128)		$xmm0 = COPY %3(s128)
RET 0, implicit $xmm0		RET 0, implicit $xmm0

...		...
▲ Show 20 Lines • Show All 52 Lines • Show Last 20 Lines

llvm/test/MachineVerifier/test_g_sext_inreg.mir

This file was added.

				# RUN: not llc -verify-machineinstrs -run-pass none -o /dev/null %s 2>&1 \| FileCheck %s

				--- \|

				target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128"
				target triple = "aarch64--"
				define void @test() { ret void }

				...

				---
				name: test
				registers:
				- { id: 0, class: gpr }
				- { id: 1, class: gpr }
				- { id: 2, class: gpr }
				- { id: 3, class: gpr }
				- { id: 4, class: gpr }
				- { id: 5, class: gpr }
				- { id: 6, class: gpr }
				- { id: 7, class: gpr }
				body: \|
				bb.0:
				liveins: $x0
				%0(s64) = COPY $x0
				%1(<4 x s16>) = COPY $x0

				; CHECK: * Bad machine code: G_SEXT_INREG expects an immediate operand #2 *
				; CHECK: instruction: %2:gpr(s64) = G_SEXT_INREG
				%2(s64) = G_SEXT_INREG %0, %0

				; CHECK: * Bad machine code: G_SEXT_INREG expects an immediate operand #2 *
				; CHECK: instruction: %3:gpr(s64) = G_SEXT_INREG
				%3(s64) = G_SEXT_INREG %0, i8 8

				; CHECK: * Bad machine code: Type mismatch in generic instruction *
				; CHECK: instruction: %4:gpr(<2 x s32>) = G_SEXT_INREG
				; CHECK: * Bad machine code: operand types must be all-vector or all-scalar *
				; CHECK: instruction: %4:gpr(<2 x s32>) = G_SEXT_INREG
				%4(<2 x s32>) = G_SEXT_INREG %0, 8

				; CHECK: * Bad machine code: operand types must preserve number of vector elements *
				; CHECK: instruction: %5:gpr(<2 x s32>) = G_SEXT_INREG
				%5(<2 x s32>) = G_SEXT_INREG %1, 8

				; CHECK: * Bad machine code: G_SEXT_INREG size must be >= 1 *
				; CHECK: instruction: %6:gpr(s64) = G_SEXT_INREG
				%6(s64) = G_SEXT_INREG %0, 0

				; CHECK: * Bad machine code: G_SEXT_INREG size must be less than source bit width *
				; CHECK: instruction: %7:gpr(s64) = G_SEXT_INREG
				%7(s64) = G_SEXT_INREG %0, 128
				...

llvm/unittests/CodeGen/GlobalISel/LegalizerHelperTest.cpp

Show First 20 Lines • Show All 723 Lines • ▼ Show 20 Lines	TEST_F(GISelMITest, FewerElementsPhi) {
CHECK: [[INSERT0:%[0-9]+]]:_(<5 x s32>) = G_INSERT [[REBUILD_VAL_IMPDEF]]:_, [[PHI0]]:_(<2 x s32>), 0		CHECK: [[INSERT0:%[0-9]+]]:_(<5 x s32>) = G_INSERT [[REBUILD_VAL_IMPDEF]]:_, [[PHI0]]:_(<2 x s32>), 0
CHECK: [[INSERT1:%[0-9]+]]:_(<5 x s32>) = G_INSERT [[INSERT0]]:_, [[PHI1]]:_(<2 x s32>), 64		CHECK: [[INSERT1:%[0-9]+]]:_(<5 x s32>) = G_INSERT [[INSERT0]]:_, [[PHI1]]:_(<2 x s32>), 64
CHECK: [[INSERT2:%[0-9]+]]:_(<5 x s32>) = G_INSERT [[INSERT1]]:_, [[PHI2]]:_(s32), 128		CHECK: [[INSERT2:%[0-9]+]]:_(<5 x s32>) = G_INSERT [[INSERT1]]:_, [[PHI2]]:_(s32), 128
CHECK: [[USE_OP:%[0-9]+]]:_(<5 x s32>) = G_AND [[INSERT2]]:_, [[INSERT2]]:_		CHECK: [[USE_OP:%[0-9]+]]:_(<5 x s32>) = G_AND [[INSERT2]]:_, [[INSERT2]]:_
)";		)";

EXPECT_TRUE(CheckMachineFunction(MF, CheckStr)) << MF;		EXPECT_TRUE(CheckMachineFunction(MF, CheckStr)) << MF;
}		}

		TEST_F(GISelMITest, WidenSEXTINREG) {
		if (!TM)
		return;

		// Declare your legalization info
		DefineLegalizerInfo(A, {
		getActionDefinitionsBuilder(G_SEXT_INREG).legalForTypeWithAnyImm({s64});
		});
		// Build Instr
		auto MIB = B.buildInstr(
		TargetOpcode::G_SEXT_INREG, {LLT::scalar(32)},
		{B.buildInstr(TargetOpcode::G_TRUNC, {LLT::scalar(32)}, {Copies[0]}),
		APInt(32, 8)});
		AInfo Info(MF->getSubtarget());
		DummyGISelObserver Observer;
		LegalizerHelper Helper(*MF, Info, Observer, B);
		// Perform Legalization
		ASSERT_TRUE(Helper.widenScalar(*MIB, 0, LLT::scalar(64)) ==
		LegalizerHelper::LegalizeResult::Legalized);

		auto CheckStr = R"(
		CHECK: [[T0:%[0-9]+]]:_(s32) = G_TRUNC
		CHECK: [[T1:%[0-9]+]]:_(s64) = G_ANYEXT [[T0]]:_(s32)
		CHECK: [[T2:%[0-9]+]]:_(s64) = G_SEXT_INREG [[T1]]:_, 8
		CHECK: [[T3:%[0-9]+]]:_(s32) = G_TRUNC [[T2]]:_(s64)
		)";

		// Check
		ASSERT_TRUE(CheckMachineFunction(*MF, CheckStr));
		}

		TEST_F(GISelMITest, NarrowSEXTINREG) {
		if (!TM)
		return;

		// Declare your legalization info, these aren't actually relevant to the test.
		DefineLegalizerInfo(A, {
		getActionDefinitionsBuilder(G_SEXT_INREG).legalForTypeWithAnyImm({s64});
		});
		// Build Instr
		auto MIB = B.buildInstr(
		TargetOpcode::G_SEXT_INREG, {LLT::scalar(16)},
		{B.buildInstr(TargetOpcode::G_TRUNC, {LLT::scalar(16)}, {Copies[0]}),
		APInt(16, 8)});
		MIB->getParent()->dump();
		AInfo Info(MF->getSubtarget());
		DummyGISelObserver Observer;
		LegalizerHelper Helper(*MF, Info, Observer, B);
		// Perform Legalization
		ASSERT_TRUE(Helper.narrowScalar(*MIB, 0, LLT::scalar(10)) ==
		LegalizerHelper::LegalizeResult::Legalized);
		MIB->getParent()->dump();

		auto CheckStr = R"(
		CHECK: [[T0:%[0-9]+]]:_(s16) = G_TRUNC
		CHECK: [[T1:%[0-9]+]]:_(s10) = G_TRUNC [[T0]]:_(s16)
		CHECK: [[T2:%[0-9]+]]:_(s10) = G_SEXT_INREG [[T1]]:_, 8
		CHECK: [[T3:%[0-9]+]]:_(s16) = G_SEXT [[T2]]:_(s10)
		)";

		// Check
		ASSERT_TRUE(CheckMachineFunction(*MF, CheckStr));
		}

		TEST_F(GISelMITest, NarrowSEXTINREG2) {
		if (!TM)
		return;

		// Declare your legalization info, these aren't actually relevant to the test.
		DefineLegalizerInfo(
		A, { getActionDefinitionsBuilder(G_SEXT_INREG).legalForTypeWithAnyImm({s64}); });
		// Build Instr
		auto MIB = B.buildInstr(
		TargetOpcode::G_SEXT_INREG, {LLT::scalar(32)},
		{B.buildInstr(TargetOpcode::G_TRUNC, {LLT::scalar(32)}, {Copies[0]}),
		APInt(32, 9)});
		AInfo Info(MF->getSubtarget());
		DummyGISelObserver Observer;
		LegalizerHelper Helper(*MF, Info, Observer, B);
		// Perform Legalization
		ASSERT_TRUE(Helper.narrowScalar(*MIB, 0, LLT::scalar(8)) ==
		LegalizerHelper::LegalizeResult::Legalized);
		MF->dump();

		auto CheckStr = R"(
		CHECK: [[T0:%[0-9]+]]:_(s32) = G_TRUNC
		CHECK: [[T1:%[0-9]+]]:_(s8), [[T2:%[0-9]+]]:_(s8), [[T3:%[0-9]+]]:_(s8), [[T4:%[0-9]+]]:_(s8) = G_UNMERGE_VALUES [[T0]]:_(s32)
		CHECK: [[CST2:%[0-9]+]]:_(s8) = G_CONSTANT i8 7
		CHECK: [[T5:%[0-9]+]]:_(s8) = G_SEXT_INREG [[T2]]:_, 1
		CHECK: [[T6:%[0-9]+]]:_(s8) = G_ASHR [[T5]]:_, [[CST2]]:_
		CHECK: [[T7:%[0-9]+]]:_(s32) = G_MERGE_VALUES [[T1]]:_(s8), [[T5]]:_(s8), [[T6]]:_(s8), [[T6]]:_(s8)
		)";

		// Check
		ASSERT_TRUE(CheckMachineFunction(*MF, CheckStr));
		}

		TEST_F(GISelMITest, LowerSEXTINREG) {
		if (!TM)
		return;

		// Declare your legalization info, these aren't actually relevant to the test.
		DefineLegalizerInfo(
		A, { getActionDefinitionsBuilder(G_SEXT_INREG).legalForTypeWithAnyImm({s64}); });
		// Build Instr
		auto MIB = B.buildInstr(
		TargetOpcode::G_SEXT_INREG, {LLT::scalar(32)},
		{B.buildInstr(TargetOpcode::G_TRUNC, {LLT::scalar(32)}, {Copies[0]}),
		APInt(32, 8)});
		AInfo Info(MF->getSubtarget());
		DummyGISelObserver Observer;
		LegalizerHelper Helper(*MF, Info, Observer, B);
		// Perform Legalization
		ASSERT_TRUE(Helper.lower(*MIB, 0, LLT()) ==
		LegalizerHelper::LegalizeResult::Legalized);
		MF->dump();

		auto CheckStr = R"(
		CHECK: [[T1:%[0-9]+]]:_(s32) = G_TRUNC
		CHECK: [[CST:%[0-9]+]]:_(s32) = G_CONSTANT i32 24
		CHECK: [[T2:%[0-9]+]]:_(s32) = G_SHL [[T1]]:_, [[CST]]:_
		CHECK: [[T3:%[0-9]+]]:_(s32) = G_ASHR [[T2]]:_, [[CST]]:_
		)";

		// Check
		ASSERT_TRUE(CheckMachineFunction(*MF, CheckStr));
		}
} // namespace		} // namespace

llvm/unittests/CodeGen/GlobalISel/PatternMatchTest.cpp

Show First 20 Lines • Show All 260 Lines • ▼ Show 20 Lines	TEST(PatternMatchInstr, MatchBinaryOp) {
CFB1.setInsertPt(*EntryMBB, EntryMBB->end());		CFB1.setInsertPt(*EntryMBB, EntryMBB->end());
auto MIBCSub =		auto MIBCSub =
CFB1.buildInstr(TargetOpcode::G_SUB, {s32},		CFB1.buildInstr(TargetOpcode::G_SUB, {s32},
{CFB1.buildConstant(s32, 1), CFB1.buildConstant(s32, 1)});		{CFB1.buildConstant(s32, 1), CFB1.buildConstant(s32, 1)});
// This should be a constant now.		// This should be a constant now.
match = mi_match(MIBCSub->getOperand(0).getReg(), MRI, m_ICst(Cst));		match = mi_match(MIBCSub->getOperand(0).getReg(), MRI, m_ICst(Cst));
EXPECT_TRUE(match);		EXPECT_TRUE(match);
EXPECT_EQ(Cst, 0);		EXPECT_EQ(Cst, 0);

		auto MIBCSext1 =
		CFB1.buildInstr(TargetOpcode::G_SEXT_INREG, {s32},
		{CFB1.buildConstant(s32, 0x01), APInt(32, 8)});
		// This should be a constant now.
		match = mi_match(MIBCSext1->getOperand(0).getReg(), MRI, m_ICst(Cst));
		EXPECT_TRUE(match);
		EXPECT_EQ(Cst, 1);
		arsenmUnsubmitted Done Reply Inline Actions It's a gtestism that these should be swapped to get the correct expected/actual error message arsenm: It's a gtestism that these should be swapped to get the correct expected/actual error message

		auto MIBCSext2 = CFB1.buildInstr(
		TargetOpcode::G_SEXT_INREG, {s32},
		{CFB1.buildConstant(s32, 0x80), APInt(32, 8)});
		// This should be a constant now.
		match = mi_match(MIBCSext2->getOperand(0).getReg(), MRI, m_ICst(Cst));
		EXPECT_TRUE(match);
		EXPECT_EQ(Cst, -0x80);
		arsenmUnsubmitted Done Reply Inline Actions Ditto arsenm: Ditto
}		}

TEST(PatternMatchInstr, MatchFPUnaryOp) {		TEST(PatternMatchInstr, MatchFPUnaryOp) {
LLVMContext Context;		LLVMContext Context;
std::unique_ptr<LLVMTargetMachine> TM = createTargetMachine();		std::unique_ptr<LLVMTargetMachine> TM = createTargetMachine();
if (!TM)		if (!TM)
return;		return;
auto ModuleMMIPair = createDummyModule(Context, *TM, "");		auto ModuleMMIPair = createDummyModule(Context, *TM, "");
▲ Show 20 Lines • Show All 217 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[globalisel] Add G_SEXT_INREGClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 207854

llvm/include/llvm/CodeGen/GlobalISel/ConstantFoldingMIRBuilder.h

llvm/include/llvm/CodeGen/GlobalISel/LegalizationArtifactCombiner.h

llvm/include/llvm/CodeGen/GlobalISel/LegalizerInfo.h

llvm/include/llvm/CodeGen/GlobalISel/Utils.h

llvm/include/llvm/MC/MCInstrDesc.h

llvm/include/llvm/Support/TargetOpcodes.def

llvm/include/llvm/Target/GenericOpcodes.td

llvm/include/llvm/Target/Target.td

llvm/lib/CodeGen/GlobalISel/CSEMIRBuilder.cpp

llvm/lib/CodeGen/GlobalISel/LegalizerHelper.cpp

llvm/lib/CodeGen/GlobalISel/LegalizerInfo.cpp

llvm/lib/CodeGen/GlobalISel/Utils.cpp

llvm/lib/CodeGen/MachineVerifier.cpp

llvm/lib/Target/AArch64/AArch64LegalizerInfo.cpp

llvm/lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp

llvm/lib/Target/ARM/ARMLegalizerInfo.cpp

llvm/lib/Target/Mips/MipsLegalizerInfo.cpp

llvm/lib/Target/X86/X86LegalizerInfo.cpp

llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-extends.ll

llvm/test/CodeGen/AArch64/GlobalISel/legalize-div.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-ext.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-gep.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-itofp.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-rem.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-sext.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-shift.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalize-undef.mir

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/artifact-combiner-sext.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/combine-ext-legalizer.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-ashr.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-extract-vector-elt.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sext.mir

llvm/test/CodeGen/AMDGPU/GlobalISel/legalize-sextload-flat.mir

llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-divmod.mir

llvm/test/CodeGen/ARM/GlobalISel/arm-legalize-exts.mir

llvm/test/CodeGen/Mips/GlobalISel/legalizer/add.mir

llvm/test/CodeGen/Mips/GlobalISel/legalizer/constants.mir

llvm/test/CodeGen/Mips/GlobalISel/legalizer/mul.mir

llvm/test/CodeGen/Mips/GlobalISel/legalizer/rem_and_div.mir

llvm/test/CodeGen/Mips/GlobalISel/legalizer/sub.mir

llvm/test/CodeGen/X86/GlobalISel/legalize-ext-x86-64.mir

llvm/test/CodeGen/X86/GlobalISel/x86_64-legalize-sitofp.mir

llvm/test/MachineVerifier/test_g_sext_inreg.mir

llvm/unittests/CodeGen/GlobalISel/LegalizerHelperTest.cpp

llvm/unittests/CodeGen/GlobalISel/PatternMatchTest.cpp

[globalisel] Add G_SEXT_INREG
ClosedPublic