This is an archive of the discontinued LLVM Phabricator instance.

Differential D44704

[GlobalISel][X86][ARM] Relaxing type constraints on G_SHL and friends
Accepted · Public

Authored by rtereshin on Mar 20 2018, 2:40 PM.

Details

Reviewers

qcolombet
dsanders
aditya_nandakumar
bogner
volkan
rovka
aemerson
ab
t.p.northover
javed.absar

Summary

This is WIP.

G_SHL, G_ASHR, and G_LSHR opcodes have a single type index at the moment, forcing all the operands to have the same LLT.
This is consistent with the corresponding LLVM IR opcodes, but not so much with the apparent target requirements and pre-existing *.td-defined selection rules / instruction patterns.

For instance, most (if not all) AArch64 patterns fall into one of the following categories:

1. Scalar shifts, with the second source operand ("number of bits to shift") of type i64, regardless of the type of the main operands (i32 or i64 most of the time).
2. Vector shifts, with the second source operand of type i32 (both immediate and variable), regardless of the type of the main operands (both the number of vector lanes and the size of the vector elements).

In other words, the type of the second source operand is not fixed (it could be i64 or i32), it could be larger or smaller than the type of the main operands, and it could be scalar even if the rest of the instruction operates on vectors.

The existing instruction selection artifacts include (but are not limited to):
a) patterns, like the following:

multiclass SIMDVectorLShiftLongBySizeBHSPats<SDPatternOperator ext> {
  def : Pat<(AArch64vshl (v8i16 (ext (v8i8 V64:$Rn))), (i32 8)),
            (SHLLv8i8 V64:$Rn)>;

b) immediate predicates, like GIPFP_I64_Predicate_imm0_31

X86 patterns, on the other hand, fall into one of the following categories:

1. Scalar shifts with the second source operand typed as i8 (as before, regardless of the types of the rest of the operands).
2. Vector shifts following the LLVM IR scheme, with all the operands having the same type.

A typical pattern from (1) looks like this:

// x << (32 - y) >> (32 - y)
def : Pat<(srl (shl GR32:$src, (i8 (trunc (sub 32, GR32:$lz)))),
               (i8 (trunc (sub 32, GR32:$lz)))),
          (BZHI32rr GR32:$src, GR32:$lz)>;
def : Pat<(srl (shl (loadi32 addr:$src), (i8 (trunc (sub 32, GR32:$lz)))),
               (i8 (trunc (sub 32, GR32:$lz)))),
          (BZHI32rm addr:$src, GR32:$lz)>;

Mips appears to have the second source operand typed as i32 or i64, independent of the rest of the types, as usual.

This discrepancy creates the following issues:

1. The non-optimized and mildly optimized (pre- https://reviews.llvm.org/D44700 patch) Tablegen'erated InstructionSelect MatchTable contains rules that cannot possibly match, forcing targets to implement the selection of shifts by hand in C++.
2. The aggressively optimized MatchTable (post- https://reviews.llvm.org/D44700 patch) contains rules that can and will actually match, but then execute renderers that expect the values to have different types (from the original SelectionDAG ISel patterns), resulting in miscompiles.
3. Testgen (https://reviews.llvm.org/D43962) generates test cases that don't represent the actual contents of the MatchTable.

(2) and (3) are due to the fact that the aggressive optimizations and the test generation exploit the type constraints, as defined by the Tablegen'erated MCInstrDesc, to reduce the number of type checks performed during selection and to properly handle partially optimized match tables, respectively.
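To picture how a shared type index lets the matcher skip type checks, here is a minimal sketch. It is a toy model, not the actual GlobalISel Emitter logic: `operandsNeedingTypeCheck` is a hypothetical helper, and the vectors of type indices simply mirror the `type0`/`type1` operand annotations of a generic opcode.

```cpp
#include <cassert>
#include <cstddef>
#include <set>
#include <vector>

// Hypothetical model: each operand of a generic opcode carries a type index,
// and operands sharing an index are constrained to have the same LLT.
// If the matcher trusts those constraints, only the first operand seen for
// each type index needs an explicit type check.
std::vector<std::size_t>
operandsNeedingTypeCheck(const std::vector<int> &TypeIdx) {
  std::set<int> Seen;
  std::vector<std::size_t> Checks;
  for (std::size_t I = 0; I < TypeIdx.size(); ++I)
    if (Seen.insert(TypeIdx[I]).second) // first operand with this index
      Checks.push_back(I);
  return Checks;
}
```

With the single-index definition (dst, src1 and src2 all `type0`) only operand 0 is checked, so a rule imported from a pattern with an i64 shift amount can match an instruction whose amount is really i32. With the relaxed definition (src2 is `type1`) operand 2 gets its own check.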

Issue (1) for x86 is mentioned in the following commit message: https://github.com/llvm-mirror/llvm/commit/5b113a2c3b054e1d894ab9e44a6a08e1d0cd7ff3 (git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@327499 91177308-0d34-0410-b5e6-96231b3b80d8, https://reviews.llvm.org/D44395), which adds the hand-written C++ for selecting shifts that can be seen here: https://github.com/llvm-mirror/llvm/blob/5b113a2c3b054e1d894ab9e44a6a08e1d0cd7ff3/lib/Target/X86/X86InstructionSelector.cpp#L1405-L1482

AArch64 ended up having manually written C++ for selecting all the shifts and G_GEP for GPR RegBank, and the majority of binary ops for FPR RegBank.

I see 2 ways of solving the problem:

1. Change the GlobalISel Emitter (the Tablegen backend) so that it intelligently adapts the patterns being imported and rewrites them to work with the existing G_* shifts.
2. Relax the type constraints for the G_* shifts and allow the second source operand to have an independent type.

Given the diversity between targets, (1) will have to be target-specific, and in any case it will most likely end up being quite complicated and fragile. Also, vector shifts with a scalar shift amount (AArch64) won't be possible; selecting an efficient opcode for vector shifts will therefore only be possible if the vector operand holding the shift amounts has the same vreg for every vector element, which is, again, fragile and will probably require an additional combine to happen more often. It makes more sense, IMO, to allow such mixed shifts on the MIR level explicitly.

Also, shifts aren't regular arithmetic/logical binary ops anyway: they are neither commutative nor associative, their second source operand is always an unsigned integer regardless of whether the rest of the operands are signed or unsigned, and the corresponding LLVM IR opcodes have special rules regarding poison values WRT that operand. Therefore they require special handling across the selector anyway.
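A minimal sketch of why the unsigned nature of the shift amount matters once its type is independent and may need widening. The helper names and the 5-bit amount type are illustrative assumptions, not LLVM APIs; the helpers only model what G_ZEXT and G_SEXT do to the bits.

```cpp
#include <cassert>
#include <cstdint>

// Zero-extend the low Bits bits of V (models G_ZEXT from an s<Bits> value).
// Assumes 1 <= Bits <= 64.
std::uint64_t zextFrom(std::uint64_t V, unsigned Bits) {
  return Bits >= 64 ? V : (V & ((std::uint64_t(1) << Bits) - 1));
}

// Sign-extend the low Bits bits of V (models G_SEXT from an s<Bits> value).
// Assumes 1 <= Bits <= 64.
std::uint64_t sextFrom(std::uint64_t V, unsigned Bits) {
  if (Bits >= 64)
    return V;
  const std::uint64_t Low = zextFrom(V, Bits);
  const std::uint64_t SignBit = std::uint64_t(1) << (Bits - 1);
  return (Low ^ SignBit) - SignBit; // wraps modulo 2^64 if the sign bit is set
}

// A shift amount is only meaningful if it is smaller than the value width.
bool isInRangeShiftAmount(std::uint64_t Amount, unsigned ValueBits) {
  return Amount < ValueBits;
}
```

For a 32-bit value shifted by a 5-bit amount holding 20, zero extension keeps the amount at 20, while sign extension turns it into a huge out-of-range number, because bit 4 is read as a sign bit.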

So this patch is to track progress on implementing the solution (2) at the moment, and get it reviewed as soon as it's done.

Diff Detail

Repository: rL LLVM

Event Timeline

rtereshin created this revision. Mar 20 2018, 2:40 PM

Herald added subscribers: kristof.beyls, nhaehnle, arsenm. Mar 20 2018, 2:40 PM

The GenericOpcodes.td change makes sense to me, but I'm surprised that only AMDGPU's LegalizerInfo is changing, and only for SHL. I'd have expected most (probably all) GlobalISel targets to need to change all three shifts.

Also, could you add a test case?

nhaehnle removed a subscriber: nhaehnle. Apr 2 2018, 12:18 AM

In D44704#1049848, @dsanders wrote:

The GenericOpcodes.td change makes sense to me, but I'm surprised that only AMDGPU's LegalizerInfo is changing, and only for SHL. I'd have expected most (probably all) GlobalISel targets to need to change all three shifts.

Also, could you add a test case?

AMDGPU only legalizes G_SHL, none of the others. And w/o the setAction({G_SHL, 1, S32}, Legal); already existing tests fail (CodeGen/AMDGPU/GlobalISel/legalize-shl.mir and CodeGen/AMDGPU/GlobalISel/regbankselect-shl.mir), so there is no need to add another one. As for the selector, this is AMDGPU's entire selector:

case TargetOpcode::G_ADD:
  return selectG_ADD(I);
case TargetOpcode::G_CONSTANT:
  return selectG_CONSTANT(I);
case TargetOpcode::G_GEP:
  return selectG_GEP(I);
case TargetOpcode::G_LOAD:
  return selectG_LOAD(I);
case TargetOpcode::G_STORE:
  return selectG_STORE(I);

Every other target legalizes shifts by using the new-style legalization action builders, and those have a couple of rather dangerous properties:

If actions are built for multiple opcodes simultaneously, they are actually directly defined for a representative only, making the rest aliases, which is fine, but nothing checks that all the aliases agree in the number of type indices.
While actions are built for the representative, nothing checks that they are properly defined for every type index, implicitly making any type legal, with no action required, for any type index not mentioned.

This is why none of the other already existing tests fail, though there are plenty that test legalization of shifts.
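The second property can be modelled in a few lines. This is a toy stand-in, assuming a rule set is just a map from type index to the type names declared legal; `ToyRuleSet` and its members are hypothetical and are not the real LegalizeRuleSet API.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Toy stand-in for a legalization rule set (NOT the real LegalizeRuleSet).
struct ToyRuleSet {
  unsigned NumTypeIndices = 0;                      // what the opcode declares
  std::map<unsigned, std::set<std::string>> Legal;  // what the rules mention

  // Mirrors the pitfall: a type index that no rule mentions accepts any type.
  bool isLegal(const std::vector<std::string> &Types) const {
    for (unsigned Idx = 0; Idx < Types.size(); ++Idx) {
      auto It = Legal.find(Idx);
      if (It == Legal.end())
        continue; // never mentioned: implicitly legal, no action required
      if (!It->second.count(Types[Idx]))
        return false;
    }
    return true;
  }

  // Validation sketch: report every declared type index without any rule.
  std::vector<unsigned> uncoveredTypeIndices() const {
    std::vector<unsigned> Missing;
    for (unsigned Idx = 0; Idx < NumTypeIndices; ++Idx)
      if (!Legal.count(Idx))
        Missing.push_back(Idx);
    return Missing;
  }
};
```

A shift with two type indices whose rules only mention index 0 accepts any shift amount type in this model, which matches why the existing tests keep passing; the validation function flags index 1 as uncovered.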

Herald added a subscriber: chrib. Apr 27 2018, 11:32 AM

Here's a fine example of the issue mentioned earlier:

This is how we legalize G_INTTOPTR for AArch64 right now:

getActionDefinitionsBuilder(G_INTTOPTR)
    .unsupportedIf([&](const LegalityQuery &Query) {
      return Query.Types[0].getSizeInBits() != Query.Types[1].getSizeInBits();
    })
    .legalFor({s64, p0});

(https://github.com/llvm-mirror/llvm/blob/27c28519031032a11a917fa4dbf05417fbe78740/lib/Target/AArch64/AArch64LegalizerInfo.cpp#L215-L219)

Who can spot a problem?

In fact, 2 problems at once. I'm working on a simple validation for LegalizerInfo that would catch this.

SPOILERS below:

A tip: the following MIR:

--- |
  target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
  target triple = "aarch64--"
  
  define i64 @broken(<4 x i16> %v) {
    %res = bitcast <4 x i16> %v to i64
    ret i64 %res
  }

...
---
name:            broken
alignment:       2
tracksRegLiveness: true
registers:       
  - { id: 0, class: _ }
  - { id: 1, class: _ }
body:             |
  bb.1 (%ir-block.0):
    liveins: $d0
  
    %0:_(<4 x s16>) = COPY $d0
    %1:_(s64) = G_INTTOPTR %0(<4 x s16>)
    $x0 = COPY %1(s64)
    RET_ReallyLR implicit $x0

...

will be successfully legalized by

./bin/llc -run-pass=legalizer -verify-machineinstrs -simplify-mir <INPUT>.mir -o -

with the following output:

--- |
  ; ModuleID = 'out.mir'
  source_filename = "out.mir"
  target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
  target triple = "aarch64--"
  
  define i64 @broken(<4 x i16> %v) {
    %res = bitcast <4 x i16> %v to i64
    ret i64 %res
  }

...
---
name:            broken
alignment:       2
legalized:       true
tracksRegLiveness: true
registers:       
  - { id: 0, class: _ }
  - { id: 1, class: _ }
frameInfo:       
  maxCallFrameSize: 0
body:             |
  bb.0 (%ir-block.0):
    liveins: $d0
  
    %0:_(<4 x s16>) = COPY $d0
    %1:_(s64) = G_INTTOPTR %0(<4 x s16>)
    $x0 = COPY %1(s64)
    RET_ReallyLR implicit $x0

...

It's worth mentioning that:

%1:_(s64) = G_INTTOPTR %0(<4 x s16>)

isn't a valid G_INTTOPTR. The result should be a pointer type and the input should be a scalar. However, your point stands even with that fixed.

The first problem is that it should be using the pair version of legalFor() like so:

.legalFor({{s64, p0}});

I see your point that it's too easy to mix these two overloads up at the moment:

LegalizeRuleSet &legalFor(std::initializer_list<LLT> Types);
LegalizeRuleSet &legalFor(std::initializer_list<std::pair<LLT, LLT>> Types);

A partial fix would be to rename the latter to legalForPair() or similar, which would make it easier to spot, but even then we'd still want something to make sure we don't mix up legalFor() and legalForPair().
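The mix-up is easy to reproduce outside of LLVM. The sketch below assumes a toy `ToyLLT` class in place of llvm::LLT and two free functions with the same parameter shapes as the legalFor() overloads quoted above; the names and the `Overload` enum are hypothetical.

```cpp
#include <cassert>
#include <initializer_list>
#include <utility>

// Toy stand-in for llvm::LLT (hypothetical; only what the demo needs).
class ToyLLT {
public:
  enum Kind { Scalar, Pointer };
  ToyLLT(Kind TheKind, unsigned Bits) : K(TheKind), SizeInBits(Bits) {}
  Kind getKind() const { return K; }
  unsigned getSizeInBits() const { return SizeInBits; }

private:
  Kind K;
  unsigned SizeInBits;
};

enum class Overload { TypeList, PairList };

// Same parameter shapes as the two legalFor() overloads.
Overload legalFor(std::initializer_list<ToyLLT>) { return Overload::TypeList; }
Overload legalFor(std::initializer_list<std::pair<ToyLLT, ToyLLT>>) {
  return Overload::PairList;
}
```

With `s64` and `p0` defined as `ToyLLT` values, `legalFor({s64, p0})` silently selects the list-of-types overload (two types for type index 0), while `legalFor({{s64, p0}})` selects the list-of-pairs overload (one pair covering type indices 0 and 1). Both spellings compile, which is what makes the mistake hard to spot.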

The next problem is that the types are backwards. It should be:

.legalFor({{p0, s64}});

Nailed it.

isn't a valid G_INTTOPTR. The result should be a pointer type and the input should be a scalar.

Well, it should be because we like it to be; MachineVerifier doesn't check that.
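For illustration only, the operand check being discussed could look like the sketch below. `ToyType` and `isValidIntToPtr` are hypothetical names in a toy type model; this is not MachineVerifier code, and per the comment above MachineVerifier did not perform such a check at the time.

```cpp
#include <cassert>

// Toy type model (hypothetical): just enough to tell scalars, pointers and
// vectors apart.
struct ToyType {
  enum Kind { Scalar, Pointer, Vector };
  Kind K;
  unsigned SizeInBits;
};

// Sketch of the rule stated above: the result of G_INTTOPTR should be a
// pointer and its source should be a scalar.
bool isValidIntToPtr(const ToyType &Dst, const ToyType &Src) {
  return Dst.K == ToyType::Pointer && Src.K == ToyType::Scalar;
}
```

Under this model, the `%1:_(s64) = G_INTTOPTR %0(<4 x s16>)` instruction from the MIR above would be rejected, and a `p0` result with an `s64` source would be accepted.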

This is what I eventually came up with to reduce the number of such mistakes in the future:
first: https://reviews.llvm.org/D46338
second: https://reviews.llvm.org/D46339

PTAL

rtereshin added a parent revision: D46413: [GlobalISel][Legalizer] Widening the second src op of shifts bug fix. May 4 2018, 2:55 PM

rtereshin added a child revision: D44700: [GlobalISel] Improving InstructionSelect's performance by reducing MatchTable.

rtereshin edited the summary of this revision. May 7 2018, 11:13 AM

rtereshin added reviewers: aemerson, ab, t.p.northover.

Herald added a reviewer: javed.absar. May 7 2018, 11:13 AM

Herald added subscribers: arichardson, sdardis.

rtereshin edited the summary of this revision. May 7 2018, 11:16 AM

This is pretty much finished, PTAL

rtereshin removed a child revision: D44700: [GlobalISel] Improving InstructionSelect's performance by reducing MatchTable. May 16 2018, 10:03 PM

rtereshin added a parent revision: D46339: [GlobalISel][Legalizer] LegalizerInfo verifier: Follow Up. May 16 2018, 10:06 PM

I'd like to get this in. LGTM but needs one issue addressed.

lib/Target/AArch64/AArch64InstructionSelector.cpp
1092	Even though this may be true, I'd rather we not use unreachable here. If for some reason Tablegen fails to select the user we want to have SDAG try it.

This revision is now accepted and ready to land. Aug 28 2018, 4:56 PM

Herald added subscribers: atanasyan, jvesely. Aug 28 2018, 4:56 PM

aemerson added inline comments. Aug 28 2018, 4:58 PM

lib/Target/AArch64/AArch64InstructionSelector.cpp
1092	s/select the user/select, for the user

This is already done in another patch. Abandon?

Herald added a project: Restricted Project. Jul 24 2019, 3:24 PM

Herald added a subscriber: Petar.Avramovic.

Revision Contents

Path                                                             Size
include/llvm/CodeGen/GlobalISel/LegalizationArtifactCombiner.h   4 lines
include/llvm/Target/GenericOpcodes.td                            6 lines
lib/CodeGen/GlobalISel/LegalizerHelper.cpp                       29 lines
lib/Target/AArch64/AArch64InstrFormats.td                        7 lines
lib/Target/AArch64/AArch64InstrInfo.td                           3 lines
lib/Target/AArch64/AArch64InstructionSelector.cpp                10 lines
lib/Target/AArch64/AArch64LegalizerInfo.cpp                      24 lines
lib/Target/AArch64/AArch64RegisterBankInfo.cpp                   4 lines
lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp                        1 line
lib/Target/ARM/ARMLegalizerInfo.cpp                              2 lines
lib/Target/X86/X86LegalizerInfo.cpp                              20 lines
test/CodeGen/AArch64/GlobalISel/legalize-shift.mir               137 lines
test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir    12 lines
test/CodeGen/AArch64/GlobalISel/regbankselect-default.mir        2 lines
test/CodeGen/AArch64/GlobalISel/regbankselect-shift.mir          113 lines
test/CodeGen/AArch64/GlobalISel/select-shift.mir                 118 lines

Diff 146275

include/llvm/CodeGen/GlobalISel/LegalizationArtifactCombiner.h

@@ 81 lines not shown @@ bool tryCombineSExt(MachineInstr &MI,
                      SmallVectorImpl<MachineInstr *> &DeadInsts) {

    if (MI.getOpcode() != TargetOpcode::G_SEXT)
      return false;
    if (MachineInstr *DefMI = getOpcodeDef(TargetOpcode::G_TRUNC,
                                           MI.getOperand(1).getReg(), MRI)) {
      unsigned DstReg = MI.getOperand(0).getReg();
      LLT DstTy = MRI.getType(DstReg);
-     if (isInstUnsupported({TargetOpcode::G_SHL, {DstTy}}) ||
-         isInstUnsupported({TargetOpcode::G_ASHR, {DstTy}}) ||
+     if (isInstUnsupported({TargetOpcode::G_SHL, {DstTy, DstTy}}) ||
+         isInstUnsupported({TargetOpcode::G_ASHR, {DstTy, DstTy}}) ||
          isInstUnsupported({TargetOpcode::G_CONSTANT, {DstTy}}))
        return false;
      DEBUG(dbgs() << ".. Combine MI: " << MI;);
      Builder.setInstr(MI);
      unsigned SExtSrc = MI.getOperand(1).getReg();
      LLT SExtSrcTy = MRI.getType(SExtSrc);
      unsigned SizeDiff = DstTy.getSizeInBits() - SExtSrcTy.getSizeInBits();
      auto SizeDiffMIB = Builder.buildConstant(DstTy, SizeDiff);
@@ 183 lines not shown @@

include/llvm/Target/GenericOpcodes.td

@@ 207 lines not shown @@ def G_XOR : GenericInstruction {
    let InOperandList = (ins type0:$src1, type0:$src2);
    let hasSideEffects = 0;
    let isCommutable = 1;
  }

  // Generic left-shift.
  def G_SHL : GenericInstruction {
    let OutOperandList = (outs type0:$dst);
-   let InOperandList = (ins type0:$src1, type0:$src2);
+   let InOperandList = (ins type0:$src1, type1:$src2);
    let hasSideEffects = 0;
  }

  // Generic logical right-shift.
  def G_LSHR : GenericInstruction {
    let OutOperandList = (outs type0:$dst);
-   let InOperandList = (ins type0:$src1, type0:$src2);
+   let InOperandList = (ins type0:$src1, type1:$src2);
    let hasSideEffects = 0;
  }

  // Generic arithmetic right-shift.
  def G_ASHR : GenericInstruction {
    let OutOperandList = (outs type0:$dst);
-   let InOperandList = (ins type0:$src1, type0:$src2);
+   let InOperandList = (ins type0:$src1, type1:$src2);
    let hasSideEffects = 0;
  }

  // Generic integer comparison.
  def G_ICMP : GenericInstruction {
    let OutOperandList = (outs type0:$dst);
    let InOperandList = (ins unknown:$tst, type1:$src1, type1:$src2);
    let hasSideEffects = 0;
@@ 422 lines not shown @@

lib/CodeGen/GlobalISel/LegalizerHelper.cpp

@@ 625 lines not shown @@ case TargetOpcode::G_SUB:
      // original type.
      widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
      widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ANYEXT);
      widenScalarDst(MI, WideTy);
      MIRBuilder.recordInsertion(&MI);
      return Legalized;

    case TargetOpcode::G_SHL:
-     widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
      // The "number of bits to shift" operand must preserve its value as an
      // unsigned integer:
-     widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
-     widenScalarDst(MI, WideTy);
+     if (TypeIdx == 1)
+       widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
+     else {
+       widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ANYEXT);
+       widenScalarDst(MI, WideTy);
+     }
      MIRBuilder.recordInsertion(&MI);
      return Legalized;

    case TargetOpcode::G_SDIV:
    case TargetOpcode::G_SREM:
      widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_SEXT);
      widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_SEXT);
      widenScalarDst(MI, WideTy);
      MIRBuilder.recordInsertion(&MI);
      return Legalized;

    case TargetOpcode::G_ASHR:
-     widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_SEXT);
      // The "number of bits to shift" operand must preserve its value as an
      // unsigned integer:
-     widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
-     widenScalarDst(MI, WideTy);
+     if (TypeIdx == 1)
+       widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
+     else {
+       widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_SEXT);
+       widenScalarDst(MI, WideTy);
+     }
      MIRBuilder.recordInsertion(&MI);
      return Legalized;

    case TargetOpcode::G_UDIV:
    case TargetOpcode::G_UREM:
-   case TargetOpcode::G_LSHR:
      widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ZEXT);
      widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
      widenScalarDst(MI, WideTy);
      MIRBuilder.recordInsertion(&MI);
      return Legalized;

+   case TargetOpcode::G_LSHR:
+     if (TypeIdx == 1)
+       widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ZEXT);
+     else {
+       widenScalarSrc(MI, WideTy, 1, TargetOpcode::G_ZEXT);
+       widenScalarDst(MI, WideTy);
+     }
+     MIRBuilder.recordInsertion(&MI);
+     return Legalized;
+
    case TargetOpcode::G_SELECT:
      if (TypeIdx != 0)
        return UnableToLegalize;
      // Perform operation at larger width (any extension is fine here, high bits
      // don't affect the result) and then truncate the result back to the
      // original type.
      widenScalarSrc(MI, WideTy, 2, TargetOpcode::G_ANYEXT);
      widenScalarSrc(MI, WideTy, 3, TargetOpcode::G_ANYEXT);
@@ 361 lines not shown @@

lib/Target/AArch64/AArch64InstrFormats.td

@@ 1,689 lines not shown @@ multiclass Shift<bits<2> shift_type, string asm, SDNode OpNode> {
    def Wr : BaseShift<shift_type, GPR32, asm> {
      let Inst{31} = 0;
    }

    def Xr : BaseShift<shift_type, GPR64, asm, OpNode> {
      let Inst{31} = 1;
    }

-   def : Pat<(i32 (OpNode GPR32:$Rn, i64:$Rm)),
+   def : Pat<(i32 (OpNode GPR32:$Rn, GPR64:$Rm)),
              (!cast<Instruction>(NAME # "Wr") GPR32:$Rn,
-                (EXTRACT_SUBREG i64:$Rm, sub_32))>;
+                (EXTRACT_SUBREG GPR64:$Rm, sub_32))>;

    def : Pat<(i32 (OpNode GPR32:$Rn, (i64 (zext GPR32:$Rm)))),
              (!cast<Instruction>(NAME # "Wr") GPR32:$Rn, GPR32:$Rm)>;

    def : Pat<(i32 (OpNode GPR32:$Rn, (i64 (anyext GPR32:$Rm)))),
              (!cast<Instruction>(NAME # "Wr") GPR32:$Rn, GPR32:$Rm)>;

    def : Pat<(i32 (OpNode GPR32:$Rn, (i64 (sext GPR32:$Rm)))),
              (!cast<Instruction>(NAME # "Wr") GPR32:$Rn, GPR32:$Rm)>;
+
+   def : Pat<(i32 (OpNode GPR32:$Rn, GPR32:$Rm)),
+             (!cast<Instruction>(NAME # "Wr") GPR32:$Rn, GPR32:$Rm)>;
  }

  class ShiftAlias<string asm, Instruction inst, RegisterClass regtype>
      : InstAlias<asm#"\t$dst, $src1, $src2",
                  (inst regtype:$dst, regtype:$src1, regtype:$src2), 0>;

  class BaseMulAccum<bit isSub, bits<3> opc, RegisterClass multype,
                     RegisterClass addtype, string asm,
@@ 8,491 lines not shown @@

lib/Target/AArch64/AArch64InstrInfo.td

@@ 313 lines not shown @@
  def AArch64saddv : SDNode<"AArch64ISD::SADDV", SDT_AArch64UnaryVec>;
  def AArch64uaddv : SDNode<"AArch64ISD::UADDV", SDT_AArch64UnaryVec>;
  def AArch64sminv : SDNode<"AArch64ISD::SMINV", SDT_AArch64UnaryVec>;
  def AArch64uminv : SDNode<"AArch64ISD::UMINV", SDT_AArch64UnaryVec>;
  def AArch64smaxv : SDNode<"AArch64ISD::SMAXV", SDT_AArch64UnaryVec>;
  def AArch64umaxv : SDNode<"AArch64ISD::UMAXV", SDT_AArch64UnaryVec>;

+ def : GINodeEquiv<G_SHL, AArch64vshl>;
+
  //===----------------------------------------------------------------------===//

  //===----------------------------------------------------------------------===//

  // AArch64 Instruction Predicate Definitions.
  // We could compute these on a per-module basis but doing so requires accessing
  // the Function object through the <Target>Subtarget and objections were raised
  // to that (see post-commit review comments for r301750).
@@ 5,997 lines not shown @@

lib/Target/AArch64/AArch64InstructionSelector.cpp

@@ 1,078 lines not shown @@ case TargetOpcode::G_UMULH: {
      unsigned NewOpc = I.getOpcode() == TargetOpcode::G_SMULH ? AArch64::SMULHrr
                                                               : AArch64::UMULHrr;
      I.setDesc(TII.get(NewOpc));

      // Now that we selected an opcode, we need to constrain the register
      // operands to use appropriate classes.
      return constrainSelectedInstRegOperands(I, TII, TRI, RBI);
    }
+   case TargetOpcode::G_SHL:
+     llvm_unreachable("G_SHL is expected to be tablegen-selectable");
+   case TargetOpcode::G_LSHR:
+     llvm_unreachable("G_LSHR is expected to be tablegen-selectable");
+   case TargetOpcode::G_ASHR:
+     llvm_unreachable("G_ASHR is expected to be tablegen-selectable");

  Inline comment (aemerson): Even though this may be true, I'd rather we not use unreachable here. If for some reason Tablegen fails to select the user we want to have SDAG try it.
  Inline comment (aemerson): s/select the user/select, for the user

    case TargetOpcode::G_FADD:
    case TargetOpcode::G_FSUB:
    case TargetOpcode::G_FMUL:
    case TargetOpcode::G_FDIV:

    case TargetOpcode::G_OR:
-   case TargetOpcode::G_SHL:
-   case TargetOpcode::G_LSHR:
-   case TargetOpcode::G_ASHR:
    case TargetOpcode::G_GEP: {
      // Reject the various things we don't support yet.
      if (unsupportedBinOp(I, RBI, MRI, TRI))
        return false;

      const unsigned OpSize = Ty.getSizeInBits();

      const unsigned DefReg = I.getOperand(0).getReg();
@@ 556 lines not shown @@

lib/Target/AArch64/AArch64LegalizerInfo.cpp

Show All 18 Lines
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/CodeGen/TargetOpcodes.h"		#include "llvm/CodeGen/TargetOpcodes.h"
#include "llvm/CodeGen/ValueTypes.h"		#include "llvm/CodeGen/ValueTypes.h"
#include "llvm/IR/DerivedTypes.h"		#include "llvm/IR/DerivedTypes.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"

using namespace llvm;		using namespace llvm;
using namespace LegalizeActions;		using namespace LegalizeActions;
		using namespace LegalizeMutations;
using namespace LegalityPredicates;		using namespace LegalityPredicates;

AArch64LegalizerInfo::AArch64LegalizerInfo(const AArch64Subtarget &ST) {		AArch64LegalizerInfo::AArch64LegalizerInfo(const AArch64Subtarget &ST) {
using namespace TargetOpcode;		using namespace TargetOpcode;
const LLT p0 = LLT::pointer(0, 64);		const LLT p0 = LLT::pointer(0, 64);
const LLT s1 = LLT::scalar(1);		const LLT s1 = LLT::scalar(1);
const LLT s8 = LLT::scalar(8);		const LLT s8 = LLT::scalar(8);
const LLT s16 = LLT::scalar(16);		const LLT s16 = LLT::scalar(16);
Show All 22 Lines	getActionDefinitionsBuilder(G_PHI)
.clampScalar(0, s16, s64)		.clampScalar(0, s16, s64)
.widenScalarToNextPow2(0);		.widenScalarToNextPow2(0);

getActionDefinitionsBuilder(G_BSWAP)		getActionDefinitionsBuilder(G_BSWAP)
.legalFor({s32, s64})		.legalFor({s32, s64})
.clampScalar(0, s16, s64)		.clampScalar(0, s16, s64)
.widenScalarToNextPow2(0);		.widenScalarToNextPow2(0);

getActionDefinitionsBuilder({G_ADD, G_SUB, G_MUL, G_AND, G_OR, G_XOR, G_SHL})		getActionDefinitionsBuilder({G_ADD, G_SUB, G_MUL, G_AND, G_OR, G_XOR})
.legalFor({s32, s64, v2s32, v4s32, v2s64})		.legalFor({s32, s64, v2s32, v4s32, v2s64})
.clampScalar(0, s32, s64)		.clampScalar(0, s32, s64)
.widenScalarToNextPow2(0)		.widenScalarToNextPow2(0)
.clampNumElements(0, v2s32, v4s32)		.clampNumElements(0, v2s32, v4s32)
.clampNumElements(0, v2s64, v2s64)		.clampNumElements(0, v2s64, v2s64)
.moreElementsToNextPow2(0);		.moreElementsToNextPow2(0);

getActionDefinitionsBuilder(G_GEP)		getActionDefinitionsBuilder(G_GEP)
.legalFor({{p0, s64}})		.legalFor({{p0, s64}})
.clampScalar(1, s64, s64);		.clampScalar(1, s64, s64);

getActionDefinitionsBuilder(G_PTR_MASK).legalFor({p0});		getActionDefinitionsBuilder(G_PTR_MASK).legalFor({p0});

getActionDefinitionsBuilder({G_LSHR, G_ASHR, G_SDIV, G_UDIV})		getActionDefinitionsBuilder({G_SDIV, G_UDIV})
.legalFor({s32, s64})		.legalFor({s32, s64})
.clampScalar(0, s32, s64)		.clampScalar(0, s32, s64)
.widenScalarToNextPow2(0);		.widenScalarToNextPow2(0);

		getActionDefinitionsBuilder({G_SHL, G_LSHR, G_ASHR})
		.legalFor({{s32, s32}, {s32, s64}, {s64, s64}, {v4s32, s32}})
		.clampScalar(0, s32, s64)
		.widenScalarToNextPow2(0)
		.maxScalar(1, s64)
		.widenScalarIf(
		[s64](const LegalityQuery &Query) {
		return Query.Types[0] != Query.Types[1] && Query.Types[1] != s64;
		},
		[s64](const LegalityQuery &Query) {
		return ((Query.Types[0].getSizeInBits() <
		Query.Types[1].getSizeInBits())
		? changeTo(1, s64)
		: changeTo(1, 0))(Query);
		});

getActionDefinitionsBuilder({G_SREM, G_UREM})		getActionDefinitionsBuilder({G_SREM, G_UREM})
.lowerFor({s1, s8, s16, s32, s64});		.lowerFor({s1, s8, s16, s32, s64});

getActionDefinitionsBuilder({G_SMULO, G_UMULO})		getActionDefinitionsBuilder({G_SMULO, G_UMULO})
.lowerFor({{s64, s1}});		.lowerFor({{s64, s1}});

getActionDefinitionsBuilder({G_SMULH, G_UMULH}).legalFor({s32, s64});		getActionDefinitionsBuilder({G_SMULH, G_UMULH}).legalFor({s32, s64});

▲ Show 20 Lines • Show All 115 Lines • ▼ Show 20 Lines	AArch64LegalizerInfo::AArch64LegalizerInfo(const AArch64Subtarget &ST) {
getActionDefinitionsBuilder(G_FCMP)		getActionDefinitionsBuilder(G_FCMP)
.legalFor({{s32, s32}, {s32, s64}})		.legalFor({{s32, s32}, {s32, s64}})
.clampScalar(0, s32, s32)		.clampScalar(0, s32, s32)
.clampScalar(1, s32, s64)		.clampScalar(1, s32, s64)
.widenScalarToNextPow2(1);		.widenScalarToNextPow2(1);

// Extensions		// Extensions
getActionDefinitionsBuilder({G_ZEXT, G_SEXT, G_ANYEXT})		getActionDefinitionsBuilder({G_ZEXT, G_SEXT, G_ANYEXT})
.legalForCartesianProduct({s8, s16, s32, s64}, {s1, s8, s16, s32});		.legalForCartesianProduct({s8, s16, s32, s64}, {s1, s8, s16, s32})
		.legalFor({{v4s32, v4s16}});

// FP conversions		// FP conversions
getActionDefinitionsBuilder(G_FPTRUNC).legalFor(		getActionDefinitionsBuilder(G_FPTRUNC).legalFor(
{{s16, s32}, {s16, s64}, {s32, s64}});		{{s16, s32}, {s16, s64}, {s32, s64}});
getActionDefinitionsBuilder(G_FPEXT).legalFor(		getActionDefinitionsBuilder(G_FPEXT).legalFor(
{{s32, s16}, {s64, s16}, {s64, s32}});		{{s32, s16}, {s64, s16}, {s64, s32}});

// Conversions		// Conversions
[228 lines not shown]
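
For readers following the new G_SHL/G_LSHR/G_ASHR rule set in AArch64LegalizerInfo.cpp, the sketch below models which pair of scalar types the rules would settle on. It is an illustration only, not code from the patch: `ShiftTypes`, `nextPow2`, and `legalizeScalarShift` are hypothetical names, types are reduced to bit widths, the vector entry `{v4s32, s32}` is ignored, and it assumes the legalizer applies the first matching rule and then re-queries until the types are legal.

```cpp
#include <cassert>

// Hypothetical stand-in for the two type indices of a scalar shift:
// index 0 is the shifted value / result, index 1 is the shift amount.
struct ShiftTypes {
  unsigned ValueBits;
  unsigned AmountBits;
};

// Smallest power of two that is >= Bits (assumes Bits >= 1).
unsigned nextPow2(unsigned Bits) {
  unsigned P = 1;
  while (P < Bits)
    P <<= 1;
  return P;
}

// Models only the resulting types, not the instructions the legalizer emits.
ShiftTypes legalizeScalarShift(ShiftTypes T) {
  for (;;) {
    const unsigned V = T.ValueBits, A = T.AmountBits;
    // .legalFor({{s32, s32}, {s32, s64}, {s64, s64}, ...})
    if ((V == 32 && (A == 32 || A == 64)) || (V == 64 && A == 64))
      return T;
    // .clampScalar(0, s32, s64)
    if (V < 32) { T.ValueBits = 32; continue; }
    if (V > 64) { T.ValueBits = 64; continue; }
    // .widenScalarToNextPow2(0)
    if (nextPow2(V) != V) { T.ValueBits = nextPow2(V); continue; }
    // .maxScalar(1, s64)
    if (A > 64) { T.AmountBits = 64; continue; }
    // .widenScalarIf(Types[0] != Types[1] && Types[1] != s64, ...)
    if (V != A && A != 64) {
      // Amount wider than the value: go to s64; otherwise match the value.
      T.AmountBits = (V < A) ? 64 : V;
      continue;
    }
    return T; // No rule applies; a real legalizer would report a failure.
  }
}
```

Under these assumptions an s8 shift by an s8 amount ends up as `{s32, s32}`, and an s64 shift by an s8 amount ends up as `{s64, s64}`, which agrees with the expectations in legalize-shift.mir.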

lib/Target/AArch64/AArch64RegisterBankInfo.cpp

[463 lines not shown]	AArch64RegisterBankInfo::getInstrMapping(const MachineInstr &MI) const {
case TargetOpcode::G_GEP:		case TargetOpcode::G_GEP:
case TargetOpcode::G_MUL:		case TargetOpcode::G_MUL:
case TargetOpcode::G_SDIV:		case TargetOpcode::G_SDIV:
case TargetOpcode::G_UDIV:		case TargetOpcode::G_UDIV:
// Bitwise ops.		// Bitwise ops.
case TargetOpcode::G_AND:		case TargetOpcode::G_AND:
case TargetOpcode::G_OR:		case TargetOpcode::G_OR:
case TargetOpcode::G_XOR:		case TargetOpcode::G_XOR:
// Shifts.
case TargetOpcode::G_SHL:
case TargetOpcode::G_LSHR:
case TargetOpcode::G_ASHR:
// Floating point ops.		// Floating point ops.
case TargetOpcode::G_FADD:		case TargetOpcode::G_FADD:
case TargetOpcode::G_FSUB:		case TargetOpcode::G_FSUB:
case TargetOpcode::G_FMUL:		case TargetOpcode::G_FMUL:
case TargetOpcode::G_FDIV:		case TargetOpcode::G_FDIV:
return getSameKindOfOperandsMapping(MI);		return getSameKindOfOperandsMapping(MI);
case TargetOpcode::G_FPEXT: {		case TargetOpcode::G_FPEXT: {
LLT DstTy = MRI.getType(MI.getOperand(0).getReg());		LLT DstTy = MRI.getType(MI.getOperand(0).getReg());
[168 lines not shown]

lib/Target/AMDGPU/AMDGPULegalizerInfo.cpp

[128 lines not shown]	.legalIf([=, &ST](const LegalityQuery &Query) {
});		});



setAction({G_SELECT, S32}, Legal);		setAction({G_SELECT, S32}, Legal);
setAction({G_SELECT, 1, S1}, Legal);		setAction({G_SELECT, 1, S1}, Legal);

setAction({G_SHL, S32}, Legal);		setAction({G_SHL, S32}, Legal);
		setAction({G_SHL, 1, S32}, Legal);


// FIXME: When RegBankSelect inserts copies, it will only create new		// FIXME: When RegBankSelect inserts copies, it will only create new
// registers with scalar types. This means we can end up with		// registers with scalar types. This means we can end up with
// G_LOAD/G_STORE/G_GEP instruction with scalar types for their pointer		// G_LOAD/G_STORE/G_GEP instruction with scalar types for their pointer
// operands. In assert builds, the instruction selector will assert		// operands. In assert builds, the instruction selector will assert
// if it sees a generic instruction which isn't legal, so we need to		// if it sees a generic instruction which isn't legal, so we need to
// tell it that scalar types are legal for pointer operands		// tell it that scalar types are legal for pointer operands
[55 lines not shown]

lib/Target/ARM/ARMLegalizerInfo.cpp

[101 lines not shown]	ARMLegalizerInfo::ARMLegalizerInfo(const ARMSubtarget &ST) {
}		}

getActionDefinitionsBuilder({G_SEXT, G_ZEXT, G_ANYEXT})		getActionDefinitionsBuilder({G_SEXT, G_ZEXT, G_ANYEXT})
.legalForCartesianProduct({s32}, {s1, s8, s16});		.legalForCartesianProduct({s32}, {s1, s8, s16});

getActionDefinitionsBuilder(G_INTTOPTR).legalFor({{p0, s32}});		getActionDefinitionsBuilder(G_INTTOPTR).legalFor({{p0, s32}});
getActionDefinitionsBuilder(G_PTRTOINT).legalFor({{s32, p0}});		getActionDefinitionsBuilder(G_PTRTOINT).legalFor({{s32, p0}});

getActionDefinitionsBuilder({G_ASHR, G_LSHR, G_SHL}).legalFor({s32});		getActionDefinitionsBuilder({G_ASHR, G_LSHR, G_SHL}).legalFor({{s32, s32}});

getActionDefinitionsBuilder(G_GEP).legalFor({{p0, s32}});		getActionDefinitionsBuilder(G_GEP).legalFor({{p0, s32}});

getActionDefinitionsBuilder(G_SELECT).legalForCartesianProduct({s32, p0},		getActionDefinitionsBuilder(G_SELECT).legalForCartesianProduct({s32, p0},
{s1});		{s1});

getActionDefinitionsBuilder(G_BRCOND).legalFor({s1});		getActionDefinitionsBuilder(G_BRCOND).legalFor({s1});

[304 lines not shown]

lib/Target/X86/X86LegalizerInfo.cpp

[126 lines not shown]	void X86LegalizerInfo::setLegalizerInfo32bit() {

if (!Subtarget.is64Bit()) {		if (!Subtarget.is64Bit()) {
getActionDefinitionsBuilder(G_PTRTOINT)		getActionDefinitionsBuilder(G_PTRTOINT)
.legalForCartesianProduct({s1, s8, s16, s32}, {p0})		.legalForCartesianProduct({s1, s8, s16, s32}, {p0})
.maxScalar(0, s32)		.maxScalar(0, s32)
.widenScalarToNextPow2(0, /*Min*/ 8);		.widenScalarToNextPow2(0, /*Min*/ 8);
getActionDefinitionsBuilder(G_INTTOPTR).legalFor({{p0, s32}});		getActionDefinitionsBuilder(G_INTTOPTR).legalFor({{p0, s32}});

// Shifts and SDIV		// Shifts
getActionDefinitionsBuilder({G_SHL, G_LSHR, G_ASHR, G_SDIV})		getActionDefinitionsBuilder({G_SHL, G_LSHR, G_ASHR})
		.legalFor({{s8, s8}, {s16, s16}, {s32, s32}})
		.clampScalar(0, s8, s32)
		.clampScalar(1, s8, s32);

		// SDIV
		getActionDefinitionsBuilder(G_SDIV)
.legalFor({s8, s16, s32})		.legalFor({s8, s16, s32})
.clampScalar(0, s8, s32);		.clampScalar(0, s8, s32);
}		}

// Control-flow		// Control-flow
setAction({G_BRCOND, s1}, Legal);		setAction({G_BRCOND, s1}, Legal);

// Constants		// Constants
[65 lines not shown]	void X86LegalizerInfo::setLegalizerInfo64bit() {
// Extensions		// Extensions
for (unsigned extOp : {G_ZEXT, G_SEXT, G_ANYEXT}) {		for (unsigned extOp : {G_ZEXT, G_SEXT, G_ANYEXT}) {
setAction({extOp, s64}, Legal);		setAction({extOp, s64}, Legal);
}		}

// Comparison		// Comparison
setAction({G_ICMP, 1, s64}, Legal);		setAction({G_ICMP, 1, s64}, Legal);

// Shifts and SDIV		// Shifts
getActionDefinitionsBuilder({G_SHL, G_LSHR, G_ASHR, G_SDIV})		getActionDefinitionsBuilder({G_SHL, G_LSHR, G_ASHR})
		.legalFor({{s8, s8}, {s16, s16}, {s32, s32}, {s64, s64}})
		.clampScalar(0, s8, s64)
		.clampScalar(1, s8, s64);

		// SDIV
		getActionDefinitionsBuilder(G_SDIV)
.legalFor({s8, s16, s32, s64})		.legalFor({s8, s16, s32, s64})
.clampScalar(0, s8, s64);		.clampScalar(0, s8, s64);

// Merge/Unmerge		// Merge/Unmerge
setAction({G_MERGE_VALUES, s128}, Legal);		setAction({G_MERGE_VALUES, s128}, Legal);
setAction({G_UNMERGE_VALUES, 1, s128}, Legal);		setAction({G_UNMERGE_VALUES, 1, s128}, Legal);
setAction({G_MERGE_VALUES, 1, s128}, Legal);		setAction({G_MERGE_VALUES, 1, s128}, Legal);
setAction({G_UNMERGE_VALUES, s128}, Legal);		setAction({G_UNMERGE_VALUES, s128}, Legal);
[247 lines not shown]

test/CodeGen/AArch64/GlobalISel/legalize-shift.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -run-pass=legalizer %s -o - | FileCheck %s			# RUN: llc -O0 -run-pass=legalizer %s -o - | FileCheck %s

	--- |			--- |
	target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128"
	target triple = "aarch64--"			target triple = "aarch64--"
	define void @test_shift() {			define void @test_shift() {
	entry:			entry:
	ret void			ret void
	}			}
				define void @test_vector_shift() {
				entry:
				ret void
				}
	...			...

	---			---
	name: test_shift			name: test_shift
	registers:			registers:
	- { id: 0, class: _ }			- { id: 0, class: _ }
	- { id: 1, class: _ }			- { id: 1, class: _ }
	- { id: 2, class: _ }			- { id: 2, class: _ }
	- { id: 3, class: _ }			- { id: 3, class: _ }
	- { id: 4, class: _ }			- { id: 4, class: _ }
	- { id: 5, class: _ }			- { id: 5, class: _ }
	- { id: 6, class: _ }			- { id: 6, class: _ }
				- { id: 7, class: _ }
				- { id: 8, class: _ }
				- { id: 9, class: _ }
				- { id: 10, class: _ }
				- { id: 11, class: _ }
				tracksRegLiveness: true
	body: |			body: |
	bb.0.entry:			bb.0.entry:
	liveins: $x0, $x1, $x2, $x3			liveins: $x0, $w1, $w2

	; CHECK-LABEL: name: test_shift			; CHECK-LABEL: name: test_shift
				; CHECK-NOT: failedISel: true
				; CHECK: liveins: $x0, $w1, $w2
	; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0			; CHECK: [[COPY:%[0-9]+]]:_(s64) = COPY $x0
	; CHECK: [[COPY1:%[0-9]+]]:_(s64) = COPY $x1			; CHECK: [[COPY1:%[0-9]+]]:_(s32) = COPY $w1
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 24			; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY $w2
	; CHECK: [[TRUNC:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[TRUNC]], [[C]]			; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[COPY2]](s32)
	; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[SHL]], [[C]]			; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[COPY3]], [[C]]
				; CHECK: [[ASHR:%[0-9]+]]:_(s32) = G_ASHR [[COPY1]], [[AND]](s32)
				; CHECK: $w1 = COPY [[ASHR]](s32)
	; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[TRUNC1:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK: [[COPY4:%[0-9]+]]:_(s32) = COPY [[COPY2]](s32)
	; CHECK: [[AND:%[0-9]+]]:_(s32) = G_AND [[TRUNC1]], [[C1]]			; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[COPY4]], [[C1]]
	; CHECK: [[ASHR1:%[0-9]+]]:_(s32) = G_ASHR [[ASHR]], [[AND]]			; CHECK: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[AND1]], [[COPY]](s64)
	; CHECK: [[COPY2:%[0-9]+]]:_(s32) = COPY [[ASHR1]](s32)			; CHECK: [[COPY5:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)
	; CHECK: $w0 = COPY [[COPY2]](s32)			; CHECK: $w1 = COPY [[COPY5]](s32)
	; CHECK: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; CHECK: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 255
	; CHECK: [[TRUNC2:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)			; CHECK: [[ANYEXT:%[0-9]+]]:_(s64) = G_ANYEXT [[COPY2]](s32)
	; CHECK: [[AND1:%[0-9]+]]:_(s32) = G_AND [[TRUNC2]], [[C2]]			; CHECK: [[AND2:%[0-9]+]]:_(s64) = G_AND [[ANYEXT]], [[C2]]
				; CHECK: [[SHL:%[0-9]+]]:_(s64) = G_SHL [[COPY]], [[AND2]](s64)
				; CHECK: $x0 = COPY [[SHL]](s64)
				; CHECK: [[COPY6:%[0-9]+]]:_(s32) = COPY [[COPY2]](s32)
				; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[COPY6]], [[COPY1]](s32)
				; CHECK: [[COPY7:%[0-9]+]]:_(s32) = COPY [[SHL1]](s32)
				; CHECK: $w1 = COPY [[COPY7]](s32)
				; CHECK: [[COPY8:%[0-9]+]]:_(s32) = COPY [[COPY2]](s32)
	; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255			; CHECK: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[TRUNC3:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)			; CHECK: [[COPY9:%[0-9]+]]:_(s32) = COPY [[COPY2]](s32)
	; CHECK: [[AND2:%[0-9]+]]:_(s32) = G_AND [[TRUNC3]], [[C3]]			; CHECK: [[AND3:%[0-9]+]]:_(s32) = G_AND [[COPY9]], [[C3]]
	; CHECK: [[LSHR:%[0-9]+]]:_(s32) = G_LSHR [[AND1]], [[AND2]]			; CHECK: [[SHL2:%[0-9]+]]:_(s32) = G_SHL [[COPY8]], [[AND3]](s32)
	; CHECK: [[COPY3:%[0-9]+]]:_(s32) = COPY [[LSHR]](s32)			; CHECK: [[COPY10:%[0-9]+]]:_(s32) = COPY [[SHL2]](s32)
	; CHECK: $w0 = COPY [[COPY3]](s32)			; CHECK: $w1 = COPY [[COPY10]](s32)
	; CHECK: [[TRUNC4:%[0-9]+]]:_(s32) = G_TRUNC [[COPY]](s64)
	; CHECK: [[C4:%[0-9]+]]:_(s32) = G_CONSTANT i32 255
	; CHECK: [[TRUNC5:%[0-9]+]]:_(s32) = G_TRUNC [[COPY1]](s64)
	; CHECK: [[AND3:%[0-9]+]]:_(s32) = G_AND [[TRUNC5]], [[C4]]
	; CHECK: [[SHL1:%[0-9]+]]:_(s32) = G_SHL [[TRUNC4]], [[AND3]]
	; CHECK: [[COPY4:%[0-9]+]]:_(s32) = COPY [[SHL1]](s32)
	; CHECK: $w0 = COPY [[COPY4]](s32)
	%0(s64) = COPY $x0			%0(s64) = COPY $x0
	%1(s64) = COPY $x1			%1(s32) = COPY $w1
	%2(s8) = G_TRUNC %0			%2(s32) = COPY $w2
	%3(s8) = G_TRUNC %1			%3(s8) = G_TRUNC %2

	%4(s8) = G_ASHR %2, %3			%4(s32) = G_ASHR %1, %3(s8)
	%7:_(s32) = G_ANYEXT %4			$w1 = COPY %4
	$w0 = COPY %7
				%5(s8) = G_LSHR %3, %0(s64)
				%6(s32) = G_ANYEXT %5
	%5(s8) = G_LSHR %2, %3			$w1 = COPY %6
	%8:_(s32) = G_ANYEXT %5
	$w0 = COPY %8			%7(s64) = G_SHL %0, %3(s8)
				$x0 = COPY %7
	%6(s8) = G_SHL %2, %3
	%9:_(s32) = G_ANYEXT %6			%8(s8) = G_SHL %3, %1(s32)
	$w0 = COPY %9			%9(s32) = G_ANYEXT %8
				$w1 = COPY %9

				%10(s8) = G_SHL %3, %3(s8)
				%11(s32) = G_ANYEXT %10
				$w1 = COPY %11
				...

				---
				name: test_vector_shift
				registers:
				- { id: 0, class: _ }
				- { id: 1, class: _ }
				- { id: 2, class: _ }
				- { id: 3, class: _ }
				- { id: 4, class: _ }
				- { id: 5, class: _ }
				- { id: 6, class: _ }
				tracksRegLiveness: true
				body: |
				bb.0.entry:
				liveins: $q0, $d0

				; CHECK-LABEL: name: test_vector_shift
				; CHECK-NOT: failedISel: true
				; CHECK: liveins: $q0, $d0
				; CHECK: [[COPY:%[0-9]+]]:_(<4 x s32>) = COPY $q0
				; CHECK: [[COPY1:%[0-9]+]]:_(<4 x s16>) = COPY $d0
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 3
				; CHECK: [[SHL:%[0-9]+]]:_(<4 x s32>) = G_SHL [[COPY]], [[C]](s32)
				; CHECK: $q1 = COPY [[SHL]](<4 x s32>)
				; CHECK: [[SEXT:%[0-9]+]]:_(<4 x s32>) = G_SEXT [[COPY1]](<4 x s16>)
				; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
				; CHECK: [[SHL1:%[0-9]+]]:_(<4 x s32>) = G_SHL [[SEXT]], [[C1]](s32)
				; CHECK: $q2 = COPY [[SHL1]](<4 x s32>)
				%0(<4 x s32>) = COPY $q0
				%5(<4 x s16>) = COPY $d0

				%1(s32) = G_CONSTANT i32 3
				%2(<4 x s32>) = G_SHL %0, %1(s32)
				$q1 = COPY %2

				%6(<4 x s32>) = G_SEXT %5(<4 x s16>)
				%3(s32) = G_CONSTANT i32 16
				%4(<4 x s32>) = G_SHL %6, %3(s32)
				$q2 = COPY %4
	...			...
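
The test_shift expectations above widen s8 operands in two different ways: shift amounts and the value operand of G_LSHR are masked with G_AND 255, while the value operand of G_SHL is only copied. A minimal sketch of why that is sufficient, using plain 32-bit integers. The helper names are hypothetical, and shift amounts are assumed to be below 8, since larger amounts are not expected to give a meaningful s8 result.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of an any-extension from 8 to 32 bits: the low 8 bits
// carry the value, the upper 24 bits are unspecified (supplied by the caller).
uint32_t anyExt8To32(uint8_t Low, uint32_t HighGarbage) {
  return (HighGarbage << 8) | Low;
}

// Zero-extension written the way the CHECK lines show it: AND with 255.
uint32_t maskLow8(uint32_t Wide) { return Wide & 0xFFu; }

// 8-bit left shift done in 32 bits. Garbage above bit 7 of the value only
// moves further up, so the value does not need a mask; the amount does.
uint8_t shl8Via32(uint32_t WideValue, uint32_t WideAmount) {
  return static_cast<uint8_t>(WideValue << maskLow8(WideAmount));
}

// 8-bit logical right shift done in 32 bits. Garbage above bit 7 would be
// shifted down into the result, so the value must be masked as well.
uint8_t lshr8Via32(uint32_t WideValue, uint32_t WideAmount) {
  return static_cast<uint8_t>(maskLow8(WideValue) >> maskLow8(WideAmount));
}
```

Dropping the mask on the G_LSHR value would let the unspecified upper bits leak into the low byte, which is the case the extra G_AND in the expectations guards against.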

test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

	[159 lines not shown]
	# DEBUG: .. the first uncovered type index: 2, OK			# DEBUG: .. the first uncovered type index: 2, OK
	#			#
	# DEBUG-NEXT: G_SEXT (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_SEXT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK			# DEBUG: .. the first uncovered type index: 2, OK
	#			#
	# DEBUG-NEXT: G_ZEXT (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_ZEXT (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK			# DEBUG: .. the first uncovered type index: 2, OK
	#			#
	# DEBUG-NEXT: G_SHL (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_SHL (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#			#
	# DEBUG-NEXT: G_LSHR (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_LSHR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#			#
	# DEBUG-NEXT: G_ASHR (opcode {{[0-9]+}}): 1 type index			# DEBUG-NEXT: G_ASHR (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 1, OK			# DEBUG: .. type index coverage check SKIPPED: user-defined predicate detected
	#			#
	# DEBUG-NEXT: G_ICMP (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_ICMP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK			# DEBUG: .. the first uncovered type index: 2, OK
	#			#
	# DEBUG-NEXT: G_FCMP (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_FCMP (opcode {{[0-9]+}}): 2 type indices
	# DEBUG: .. the first uncovered type index: 2, OK			# DEBUG: .. the first uncovered type index: 2, OK
	#			#
	# DEBUG-NEXT: G_SELECT (opcode {{[0-9]+}}): 2 type indices			# DEBUG-NEXT: G_SELECT (opcode {{[0-9]+}}): 2 type indices
	[111 lines not shown]

test/CodeGen/AArch64/GlobalISel/regbankselect-default.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -O0 -mtriple arm64-- -run-pass=regbankselect %s -o - | FileCheck %s			# RUN: llc -O0 -mtriple arm64-- -run-pass=regbankselect -disable-gisel-legality-check %s -o - | FileCheck %s

	# Check the default mappings for various instructions.			# Check the default mappings for various instructions.

	--- |			--- |
	target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128"			target datalayout = "e-m:o-i64:64-i128:128-n32:64-S128"

	define void @test_add_s32() { ret void }			define void @test_add_s32() { ret void }
	define void @test_add_v4s32() { ret void }			define void @test_add_v4s32() { ret void }
	[798 lines not shown]

test/CodeGen/AArch64/GlobalISel/regbankselect-shift.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -O0 -run-pass=regbankselect %s -o - | FileCheck %s
				--- |
				target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
				target triple = "aarch64--"
				define void @test_shift() {
				entry:
				ret void
				}
				define void @test_vector_shift() {
				entry:
				ret void
				}
				...

				---
				name: test_shift
				legalized: true
				tracksRegLiveness: true
				body: |
				bb.0.entry:
				liveins: $x0, $w1, $w2

				; CHECK-LABEL: name: test_shift
				; CHECK-NOT: failedISel: true
				; CHECK: liveins: $x0, $w1, $w2
				; CHECK: [[COPY:%[0-9]+]]:gpr(s64) = COPY $x0
				; CHECK: [[COPY1:%[0-9]+]]:gpr(s32) = COPY $w1
				; CHECK: [[COPY2:%[0-9]+]]:gpr(s32) = COPY $w2
				; CHECK: [[C:%[0-9]+]]:gpr(s32) = G_CONSTANT i32 255
				; CHECK: [[AND:%[0-9]+]]:gpr(s32) = G_AND [[COPY2]], [[C]]
				; CHECK: [[ASHR:%[0-9]+]]:gpr(s32) = G_ASHR [[COPY1]], [[AND]](s32)
				; CHECK: $w1 = COPY [[ASHR]](s32)
				; CHECK: [[C1:%[0-9]+]]:gpr(s32) = G_CONSTANT i32 255
				; CHECK: [[AND1:%[0-9]+]]:gpr(s32) = G_AND [[COPY2]], [[C1]]
				; CHECK: [[LSHR:%[0-9]+]]:gpr(s32) = G_LSHR [[AND1]], [[COPY]](s64)
				; CHECK: $w1 = COPY [[LSHR]](s32)
				; CHECK: [[C2:%[0-9]+]]:gpr(s64) = G_CONSTANT i64 255
				; CHECK: [[ANYEXT:%[0-9]+]]:gpr(s64) = G_ANYEXT [[COPY2]](s32)
				; CHECK: [[AND2:%[0-9]+]]:gpr(s64) = G_AND [[ANYEXT]], [[C2]]
				; CHECK: [[SHL:%[0-9]+]]:gpr(s64) = G_SHL [[COPY]], [[AND2]](s64)
				; CHECK: $x0 = COPY [[SHL]](s64)
				; CHECK: [[COPY3:%[0-9]+]]:gpr(s32) = COPY [[COPY2]](s32)
				; CHECK: [[SHL1:%[0-9]+]]:gpr(s32) = G_SHL [[COPY3]], [[COPY1]](s32)
				; CHECK: $w1 = COPY [[SHL1]](s32)
				; CHECK: [[C3:%[0-9]+]]:gpr(s32) = G_CONSTANT i32 255
				; CHECK: [[COPY4:%[0-9]+]]:gpr(s32) = COPY [[COPY2]](s32)
				; CHECK: [[AND3:%[0-9]+]]:gpr(s32) = G_AND [[COPY4]], [[C3]]
				; CHECK: [[SHL2:%[0-9]+]]:gpr(s32) = G_SHL [[COPY2]], [[AND3]](s32)
				; CHECK: $w1 = COPY [[SHL2]](s32)
				%0:_(s64) = COPY $x0
				%1:_(s32) = COPY $w1
				%2:_(s32) = COPY $w2

				%21:_(s32) = G_CONSTANT i32 255
				%20:_(s32) = G_AND %2, %21
				%4:_(s32) = G_ASHR %1, %20(s32)
				$w1 = COPY %4(s32)

				%23:_(s32) = G_CONSTANT i32 255
				%18:_(s32) = G_AND %2, %23
				%19:_(s32) = G_LSHR %18, %0(s64)
				$w1 = COPY %19(s32)

				%25:_(s64) = G_CONSTANT i64 255
				%24:_(s64) = G_ANYEXT %2(s32)
				%17:_(s64) = G_AND %24, %25
				%7:_(s64) = G_SHL %0, %17(s64)
				$x0 = COPY %7(s64)

				%15:_(s32) = COPY %2(s32)
				%16:_(s32) = G_SHL %15, %1(s32)
				$w1 = COPY %16(s32)

				%27:_(s32) = G_CONSTANT i32 255
				%28:_(s32) = COPY %2(s32)
				%14:_(s32) = G_AND %28, %27
				%13:_(s32) = G_SHL %2, %14(s32)
				$w1 = COPY %13(s32)
				...

				---
				name: test_vector_shift
				legalized: true
				tracksRegLiveness: true
				body: |
				bb.0.entry:
				liveins: $q0, $d0

				; CHECK-LABEL: name: test_vector_shift
				; CHECK-NOT: failedISel: true
				; CHECK: liveins: $q0, $d0
				; CHECK: [[COPY:%[0-9]+]]:fpr(<4 x s32>) = COPY $q0
				; CHECK: [[COPY1:%[0-9]+]]:fpr(<4 x s16>) = COPY $d0
				; CHECK: [[C:%[0-9]+]]:gpr(s32) = G_CONSTANT i32 3
				; CHECK: [[SHL:%[0-9]+]]:fpr(<4 x s32>) = G_SHL [[COPY]], [[C]](s32)
				; CHECK: $q1 = COPY [[SHL]](<4 x s32>)
				; CHECK: [[SEXT:%[0-9]+]]:fpr(<4 x s32>) = G_SEXT [[COPY1]](<4 x s16>)
				; CHECK: [[C1:%[0-9]+]]:gpr(s32) = G_CONSTANT i32 16
				; CHECK: [[SHL1:%[0-9]+]]:fpr(<4 x s32>) = G_SHL [[SEXT]], [[C1]](s32)
				; CHECK: $q2 = COPY [[SHL1]](<4 x s32>)
				%0:_(<4 x s32>) = COPY $q0
				%5:_(<4 x s16>) = COPY $d0

				%1:_(s32) = G_CONSTANT i32 3
				%2:_(<4 x s32>) = G_SHL %0, %1(s32)
				$q1 = COPY %2(<4 x s32>)

				%6:_(<4 x s32>) = G_SEXT %5(<4 x s16>)
				%3:_(s32) = G_CONSTANT i32 16
				%4:_(<4 x s32>) = G_SHL %6, %3(s32)
				$q2 = COPY %4(<4 x s32>)
				...

test/CodeGen/AArch64/GlobalISel/select-shift.mir

This file was added.

				# RUN: llc -O0 -run-pass=instruction-select %s -o - | FileCheck %s
				# RUN: llc -O0 -start-before=instruction-select %s -o - | FileCheck %s --check-prefix=ASM

				--- |
				target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
				target triple = "aarch64--"
				define void @test_shift() {
				entry:
				ret void
				}
				define void @test_vector_shift() {
				entry:
				ret void
				}
				...

				---
				name: test_shift
				legalized: true
				regBankSelected: true
				tracksRegLiveness: true
				body: |
				bb.0.entry:
				liveins: $x0, $w1, $w2

				; CHECK-LABEL: name: test_shift
				; CHECK-NOT: failedISel: true
				; CHECK: liveins: $x0, $w1, $w2
				; CHECK: [[COPY:%[0-9]+]]:gpr64 = COPY $x0
				; CHECK: [[COPY1:%[0-9]+]]:gpr32 = COPY $w1
				; CHECK: [[COPY2:%[0-9]+]]:gpr32 = COPY $w2
				; CHECK: [[MOVi32imm:%[0-9]+]]:gpr32 = MOVi32imm 255
				; CHECK: [[ANDWrr:%[0-9]+]]:gpr32 = ANDWrr [[COPY2]], [[MOVi32imm]]
				; CHECK: [[ASRVWr:%[0-9]+]]:gpr32 = ASRVWr [[COPY1]], [[ANDWrr]]
				; CHECK: $w1 = COPY [[ASRVWr]]
				; CHECK: [[MOVi32imm1:%[0-9]+]]:gpr32 = MOVi32imm 255
				; CHECK: [[ANDWrr1:%[0-9]+]]:gpr32 = ANDWrr [[COPY2]], [[MOVi32imm1]]
				; CHECK: [[COPY3:%[0-9]+]]:gpr32 = COPY [[COPY]].sub_32
				; CHECK: [[LSRVWr:%[0-9]+]]:gpr32 = LSRVWr [[ANDWrr1]], [[COPY3]]
				; CHECK: $w1 = COPY [[LSRVWr]]
				; CHECK: [[MOVi64imm:%[0-9]+]]:gpr64 = MOVi64imm 255
				; CHECK: [[SUBREG_TO_REG:%[0-9]+]]:gpr64all = SUBREG_TO_REG 0, [[COPY2]], %subreg.sub_32
				; CHECK: [[COPY4:%[0-9]+]]:gpr64 = COPY [[SUBREG_TO_REG]]
				; CHECK: [[ANDXrr:%[0-9]+]]:gpr64 = ANDXrr [[COPY4]], [[MOVi64imm]]
				; CHECK: [[LSLVXr:%[0-9]+]]:gpr64 = LSLVXr [[COPY]], [[ANDXrr]]
				; CHECK: $x0 = COPY [[LSLVXr]]
				; CHECK: [[LSLVWr:%[0-9]+]]:gpr32 = LSLVWr [[COPY2]], [[COPY1]]
				; CHECK: $w1 = COPY [[LSLVWr]]
				; CHECK: [[MOVi32imm2:%[0-9]+]]:gpr32 = MOVi32imm 255
				; CHECK: [[ANDWrr2:%[0-9]+]]:gpr32 = ANDWrr [[COPY2]], [[MOVi32imm2]]
				; CHECK: [[LSLVWr1:%[0-9]+]]:gpr32 = LSLVWr [[COPY2]], [[ANDWrr2]]
				; CHECK: $w1 = COPY [[LSLVWr1]]
				%0:gpr(s64) = COPY $x0
				%1:gpr(s32) = COPY $w1
				%2:gpr(s32) = COPY $w2

				%3:gpr(s32) = G_CONSTANT i32 255
				%4:gpr(s32) = G_AND %2, %3
				%5:gpr(s32) = G_ASHR %1, %4(s32)
				$w1 = COPY %5(s32)

				%6:gpr(s32) = G_CONSTANT i32 255
				%7:gpr(s32) = G_AND %2, %6
				%8:gpr(s32) = G_LSHR %7, %0(s64)
				$w1 = COPY %8(s32)

				%9:gpr(s64) = G_CONSTANT i64 255
				%10:gpr(s64) = G_ANYEXT %2(s32)
				%11:gpr(s64) = G_AND %10, %9
				%12:gpr(s64) = G_SHL %0, %11(s64)
				$x0 = COPY %12(s64)

				%13:gpr(s32) = COPY %2(s32)
				%14:gpr(s32) = G_SHL %13, %1(s32)
				$w1 = COPY %14(s32)

				%15:gpr(s32) = G_CONSTANT i32 255
				%16:gpr(s32) = COPY %2(s32)
				%17:gpr(s32) = G_AND %16, %15
				%18:gpr(s32) = G_SHL %2, %17(s32)
				$w1 = COPY %18(s32)
				...

				---
				name: test_vector_shift
				legalized: true
				regBankSelected: true
				tracksRegLiveness: true
				body: |
				bb.0.entry:
				liveins: $q0, $d6

				; CHECK-LABEL: name: test_vector_shift
				; CHECK-NOT: failedISel: true
				; CHECK: liveins: $q0, $d6
				; CHECK: [[COPY:%[0-9]+]]:fpr128 = COPY $q0
				; CHECK: [[COPY1:%[0-9]+]]:fpr64 = COPY $d6
				; CHECK: [[SHLv4i32_shift:%[0-9]+]]:fpr128 = SHLv4i32_shift [[COPY]], 3
				; CHECK: $q1 = COPY [[SHLv4i32_shift]]
				; CHECK: [[SHLLv4i16_:%[0-9]+]]:fpr128 = SHLLv4i16 [[COPY1]]
				; CHECK: $q2 = COPY [[SHLLv4i16_]]

				; ASM-LABEL: test_vector_shift:
				; ASM: shl v1.4s, v0.4s, #3
				; ASM-NEXT: shll v2.4s, v6.4h, #16

				%0:fpr(<4 x s32>) = COPY $q0
				%1:fpr(<4 x s16>) = COPY $d6

				%2:gpr(s32) = G_CONSTANT i32 3
				%3:fpr(<4 x s32>) = G_SHL %0, %2(s32)
				$q1 = COPY %3(<4 x s32>)

				%4:fpr(<4 x s32>) = G_SEXT %1(<4 x s16>)
				%5:gpr(s32) = G_CONSTANT i32 16
				%6:fpr(<4 x s32>) = G_SHL %4, %5(s32)
				$q2 = COPY %6(<4 x s32>)
				...


Diff 146275
