Details

Reviewers

aemerson
paquette

Commits

rG5cd63e9ec2a3: [AArch64][GlobalISel] Legalize bswap <2 x i16>

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

jroelofs created this revision. Jul 13 2021, 1:10 PM

Herald added subscribers: danielkiss, hiraditya, kristof.beyls, rovka. Jul 13 2021, 1:10 PM

jroelofs requested review of this revision. Jul 13 2021, 1:10 PM

Herald added a project: Restricted Project. Jul 13 2021, 1:10 PM

Herald added a subscriber: llvm-commits.

jroelofs added a parent revision: D105860: [AArch64] Implement floating point subreg copies. Jul 13 2021, 1:11 PM

marked as "WIP" because there are a few extraneous copies:

$ cat bswap.ll
; RUN: llc -mtriple=arm64-apple-ios < %s | FileCheck %s

define void @test1(<2 x i16>* %p) {
  %in = load <2 x i16>, <2 x i16>* %p
  %out = call <2 x i16> @llvm.bswap.v2i16(<2 x i16> %in)
  store <2 x i16> %out, <2 x i16>* %p
  ret void
}

declare <2 x i16> @llvm.bswap.v2i16(<2 x i16>) nounwind readnone

GISel:

$ ./bin/llc -global-isel=1 -global-isel-abort=1 bswap.ll -mtriple=arm64-apple-ios -o -
	.section	__TEXT,__text,regular,pure_instructions
	.globl	_test1                          ; -- Begin function test1
	.p2align	2
_test1:                                 ; @test1
	.cfi_startproc
; %bb.0:
	ldr	h0, [x0]
	ldr	h1, [x0, #2]
	mov.h	v0[1], v1[0]
	fmov	w8, s0
	mov.s	v0[0], w8
	rev32.8b	v0, v0
	ushr.2s	v0, v0, #16
	fmov	s0, s0
	mov	h1, v0[1]
	str	h0, [x0]
	str	h1, [x0, #2]
	ret
	.cfi_endproc
                                        ; -- End function
.subsections_via_symbols

SDAG:

$ ./bin/llc -global-isel=0 -global-isel-abort=1 bswap.ll -mtriple=arm64-apple-ios -o -
	.section	__TEXT,__text,regular,pure_instructions
	.globl	_test1                          ; -- Begin function test1
	.p2align	2
_test1:                                 ; @test1
	.cfi_startproc
; %bb.0:
	ld1.h	{ v0 }[0], [x0]
	add	x8, x0, #2                      ; =2
	ld1.h	{ v0 }[2], [x8]
	rev32.8b	v0, v0
	ushr.2s	v0, v0, #16
	mov.s	w8, v0[1]
	fmov	w9, s0
	strh	w9, [x0]
	strh	w8, [x0, #2]
	ret
	.cfi_endproc
                                        ; -- End function
.subsections_via_symbols
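
For readers following the two listings, here is a minimal sketch of why `rev32.8b` followed by `ushr.2s #16` byte-swaps an i16. It is a plain C++ model, not LLVM code, and it assumes the lane-per-element layout seen in the SDAG output above, where each i16 sits in the low half of its own 32-bit lane (the `ld1.h` loads into halfword lanes 0 and 2). The type `V8B` and the functions `rev32_8b`, `ushr_2s`, `bswapWidenedV2I16`, and `checkExample` are hypothetical names introduced for illustration only. The sketch does not model the packed <2 x s16> layout that GlobalISel keeps in `s0`.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

// Hypothetical model of one 64-bit NEON register as eight bytes,
// lowest-numbered byte first (little-endian lane order).
using V8B = std::array<uint8_t, 8>;

// Models `rev32.8b`: reverse the byte order inside each 32-bit word.
V8B rev32_8b(const V8B &In) {
  V8B Out{};
  for (int Word = 0; Word < 2; ++Word)
    for (int Byte = 0; Byte < 4; ++Byte)
      Out[Word * 4 + Byte] = In[Word * 4 + (3 - Byte)];
  return Out;
}

// Models `ushr.2s ..., #Amt`: logical right shift of each 32-bit lane.
// Assumes Amt < 32.
V8B ushr_2s(const V8B &In, unsigned Amt) {
  V8B Out{};
  for (int Lane = 0; Lane < 2; ++Lane) {
    uint32_t V = 0;
    for (int Byte = 0; Byte < 4; ++Byte)
      V |= uint32_t(In[Lane * 4 + Byte]) << (8 * Byte);
    V >>= Amt;
    for (int Byte = 0; Byte < 4; ++Byte)
      Out[Lane * 4 + Byte] = uint8_t(V >> (8 * Byte));
  }
  return Out;
}

// Byte-swaps two i16 values, each held in the low half of its own
// 32-bit lane (the layout assumed in the lead-in).
std::array<uint16_t, 2> bswapWidenedV2I16(uint16_t A, uint16_t B) {
  V8B Reg = {uint8_t(A), uint8_t(A >> 8), 0, 0,
             uint8_t(B), uint8_t(B >> 8), 0, 0};
  V8B Res = ushr_2s(rev32_8b(Reg), 16);
  return {uint16_t(Res[0] | (Res[1] << 8)),
          uint16_t(Res[4] | (Res[5] << 8))};
}

// Small self-check of the model on one input pair.
inline void checkExample() {
  auto R = bswapWidenedV2I16(0x1234, 0xABCD);
  assert(R[0] == 0x3412 && R[1] == 0xCDAB);
}
```

In this model `rev32` moves the two payload bytes of each lane into the top half of the lane in swapped order, and the shift by 16 brings them back down to the low half, which is why the pair of instructions acts as a per-lane 16-bit byte swap.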

paquette added inline comments. Jul 13 2021, 1:23 PM

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp
1024	Can you use G_REV32 here instead? (see: AArch64InstrGISel.td) I think it's preferable to keep the results of legalization as generic as possible.
1029	Is it possible to use a generic instruction here?

Harbormaster completed remote builds in B113835: Diff 358404. Jul 13 2021, 2:56 PM

Closer.

Use more generic opcodes during legalization.

fixup test checks

Harbormaster completed remote builds in B113897: Diff 358487. Jul 13 2021, 7:36 PM

paquette added inline comments. Jul 13 2021, 9:07 PM

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3999 ↗	(On Diff #358487)	This looks similar to `getSubRegForClass`? Maybe it's possible to share some code there?

jroelofs mentioned this in D105860: [AArch64] Implement floating point subreg copies. Jul 14 2021, 10:06 AM

paquette added inline comments. Jul 14 2021, 11:37 AM

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3999 ↗	(On Diff #358487)	Also, it'd be nice to have a specific test for G_UNMERGE_VALUES selection. I think that this part + that test could be moved into an independent patch.

jroelofs marked an inline comment as done. Jul 14 2021, 12:47 PM

jroelofs added inline comments.

llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp
3999 ↗	(On Diff #358487)	good idea
3999 ↗	(On Diff #358487)	https://reviews.llvm.org/D106007

Getting closer:

SDAG:

$ ./bin/llc -global-isel=0 -global-isel-abort=1 bswap.ll -mtriple=arm64-apple-ios -o -
	.section	__TEXT,__text,regular,pure_instructions
	.globl	_test1                          ; -- Begin function test1
	.p2align	2
_test1:                                 ; @test1
	.cfi_startproc
; %bb.0:
	ld1.h	{ v0 }[0], [x0]
	add	x8, x0, #2                      ; =2
	ld1.h	{ v0 }[2], [x8]
	rev32.8b	v0, v0
	ushr.2s	v0, v0, #16
	mov.s	w8, v0[1]
	fmov	w9, s0
	strh	w9, [x0]
	strh	w8, [x0, #2]
	ret
	.cfi_endproc
                                        ; -- End function
.subsections_via_symbols

GISel:

$ ./bin/llc -global-isel=1 -global-isel-abort=1 bswap.ll -mtriple=arm64-apple-ios -o -
	.section	__TEXT,__text,regular,pure_instructions
	.globl	_test1                          ; -- Begin function test1
	.p2align	2
_test1:                                 ; @test1
	.cfi_startproc
; %bb.0:
	ldr	h0, [x0]
	ldr	h1, [x0, #2]
	mov.h	v0[1], v1[0]
	mov.s	v0[0], v0[0]
	rev32.8b	v0, v0
	ushr.2s	v0, v0, #16
	mov	h1, v0[1]
	str	h0, [x0]
	str	h1, [x0, #2]
	ret
	.cfi_endproc
                                        ; -- End function
.subsections_via_symbols

jroelofs edited parent revisions, added: D106007: [AArch64] Fix selection of G_UNMERGE <2 x s16>; removed: D105860: [AArch64] Implement floating point subreg copies. Jul 14 2021, 1:11 PM

Harbormaster completed remote builds in B114067: Diff 358712. Jul 14 2021, 2:19 PM

SDAG:

$ ./bin/llc -global-isel=0 -global-isel-abort=1 bswap.ll -mtriple=arm64-apple-ios -o -
	.section	__TEXT,__text,regular,pure_instructions
	.globl	_test1                          ; -- Begin function test1
	.p2align	2
_test1:                                 ; @test1
	.cfi_startproc
; %bb.0:
	ld1.h	{ v0 }[0], [x0]
	add	x8, x0, #2                      ; =2
	ld1.h	{ v0 }[2], [x8]
	rev32.8b	v0, v0
	ushr.2s	v0, v0, #16
	mov.s	w8, v0[1]
	fmov	w9, s0
	strh	w9, [x0]
	strh	w8, [x0, #2]
	ret
	.cfi_endproc
                                        ; -- End function
.subsections_via_symbols

GISel:

$ ./bin/llc -global-isel=1 -global-isel-abort=1 bswap.ll -mtriple=arm64-apple-ios -o -
	.section	__TEXT,__text,regular,pure_instructions
	.globl	_test1                          ; -- Begin function test1
	.p2align	2
_test1:                                 ; @test1
	.cfi_startproc
; %bb.0:
	ldr	h0, [x0]
	ldr	h1, [x0, #2]
	mov.h	v0[1], v1[0]
	rev32.8b	v0, v0
	ushr.2s	v0, v0, #16
	mov	h1, v0[1]
	str	h0, [x0]
	str	h1, [x0, #2]
	ret
	.cfi_endproc
                                        ; -- End function
.subsections_via_symbols

Harbormaster completed remote builds in B114117: Diff 358783. Jul 14 2021, 4:58 PM

paquette added inline comments. Jul 15 2021, 10:02 AM

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp
1016	Do you think you could add an explanation somewhere for why the v2s16 case is special?

Aside from the comment, I think this looks pretty good. The remaining codegen differences between SDAG and GISel seem unrelated to the bswap.

This revision is now accepted and ready to land. Jul 15 2021, 1:58 PM

Explain why <2 x half> is weird, and why we're not directly selecting the instructions we want during legalization.

jroelofs marked an inline comment as done. Jul 16 2021, 5:34 PM

Harbormaster completed remote builds in B114641: Diff 359504. Jul 16 2021, 5:34 PM

Thanks!

This revision was landed with ongoing or failed builds. Jul 17 2021, 4:46 PM

Closed by commit rG5cd63e9ec2a3: [AArch64][GlobalISel] Legalize bswap <2 x i16> (authored by jroelofs).

This revision was automatically updated to reflect the committed changes.

jroelofs added a commit: rG5cd63e9ec2a3: [AArch64][GlobalISel] Legalize bswap <2 x i16>.

jroelofs added a reverting change: rG9237eda30407: Revert "[AArch64][GlobalISel] Legalize bswap <2 x i16>". Sep 1 2021, 4:51 PM

Diff 359588

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.h

[... 29 lines not shown ...] public:
   AArch64LegalizerInfo(const AArch64Subtarget &ST);

   bool legalizeCustom(LegalizerHelper &Helper, MachineInstr &MI) const override;

   bool legalizeIntrinsic(LegalizerHelper &Helper,
                          MachineInstr &MI) const override;

 private:
+  bool legalizeBSwap(MachineInstr &MI, MachineRegisterInfo &MRI,
+                     MachineIRBuilder &MIRBuilder) const;
   bool legalizeVaArg(MachineInstr &MI, MachineRegisterInfo &MRI,
                      MachineIRBuilder &MIRBuilder) const;
   bool legalizeLoadStore(MachineInstr &MI, MachineRegisterInfo &MRI,
                          MachineIRBuilder &MIRBuilder,
                          GISelChangeObserver &Observer) const;
   bool legalizeShlAshrLshr(MachineInstr &MI, MachineRegisterInfo &MRI,
                            MachineIRBuilder &MIRBuilder,
                            GISelChangeObserver &Observer) const;
[... 18 lines not shown ...]

llvm/lib/Target/AArch64/GISel/AArch64LegalizerInfo.cpp

[... 97 lines not shown ...] AArch64LegalizerInfo::AArch64LegalizerInfo(const AArch64Subtarget &ST)
   getActionDefinitionsBuilder(G_PHI).legalFor({p0, s16, s32, s64})
       .legalFor(PackedVectorAllTypeList)
       .clampScalar(0, s16, s64)
       .widenScalarToNextPow2(0);

   getActionDefinitionsBuilder(G_BSWAP)
       .legalFor({s32, s64, v4s32, v2s32, v2s64})
       .clampScalar(0, s32, s64)
-      .widenScalarToNextPow2(0);
+      .widenScalarToNextPow2(0)
+      .customIf(typeIs(0, v2s16)); // custom lower as G_REV32 + G_LSHR

   getActionDefinitionsBuilder({G_ADD, G_SUB, G_MUL, G_AND, G_OR, G_XOR})
       .legalFor({s32, s64, v2s32, v4s32, v4s16, v8s16, v16s8, v8s8})
       .scalarizeIf(
           [=](const LegalityQuery &Query) {
             return Query.Opcode == G_MUL && Query.Types[0] == v2s64;
           },
           0)
[... 671 lines not shown ...] bool AArch64LegalizerInfo::legalizeCustom(LegalizerHelper &Helper,
   default:
     // No idea what to do.
     return false;
   case TargetOpcode::G_VAARG:
     return legalizeVaArg(MI, MRI, MIRBuilder);
   case TargetOpcode::G_LOAD:
   case TargetOpcode::G_STORE:
     return legalizeLoadStore(MI, MRI, MIRBuilder, Observer);
+  case TargetOpcode::G_BSWAP:
+    return legalizeBSwap(MI, MRI, MIRBuilder);
   case TargetOpcode::G_SHL:
   case TargetOpcode::G_ASHR:
   case TargetOpcode::G_LSHR:
     return legalizeShlAshrLshr(MI, MRI, MIRBuilder, Observer);
   case TargetOpcode::G_GLOBAL_VALUE:
     return legalizeSmallCMGlobalValue(MI, MRI, MIRBuilder, Observer);
   case TargetOpcode::G_TRUNC:
     return legalizeVectorTrunc(MI, Helper);
[... 194 lines not shown ...] bool AArch64LegalizerInfo::legalizeLoadStore(
   } else {
     auto NewLoad = MIRBuilder.buildLoad(NewTy, MI.getOperand(1), MMO);
     MIRBuilder.buildBitcast(ValReg, NewLoad);
   }
   MI.eraseFromParent();
   return true;
 }

+bool AArch64LegalizerInfo::legalizeBSwap(MachineInstr &MI,
+                                         MachineRegisterInfo &MRI,
+                                         MachineIRBuilder &MIRBuilder) const {
+  assert(MI.getOpcode() == TargetOpcode::G_BSWAP);
+
+  // The <2 x half> case needs special lowering because there isn't an
+  // instruction that does that directly. Instead, we widen to <8 x i8>
+  // and emit a G_REV32 followed by a G_LSHR knowing that instruction selection
+  // will later match them as:
+  //
+  // rev32.8b v0, v0
+  // ushr.2s v0, v0, #16
+  //
+  // We could emit those here directly, but it seems better to keep things as
+  // generic as possible through legalization, and avoid committing layering
+  // violations by legalizing & selecting here at the same time.
+
+  Register ValReg = MI.getOperand(1).getReg();
+  assert(LLT::fixed_vector(2, 16) == MRI.getType(ValReg));
+  const LLT v2s32 = LLT::fixed_vector(2, 32);
+  const LLT v8s8 = LLT::fixed_vector(8, 8);
+  const LLT s32 = LLT::scalar(32);
+
+  auto Undef = MIRBuilder.buildUndef(v8s8);
+  auto Insert =
+      MIRBuilder
+          .buildInstr(TargetOpcode::INSERT_SUBREG, {v8s8}, {Undef, ValReg})
+          .addImm(AArch64::ssub);
+  auto Rev32 = MIRBuilder.buildInstr(AArch64::G_REV32, {v8s8}, {Insert});
+  auto Bitcast = MIRBuilder.buildBitcast(v2s32, Rev32);
+  auto Amt = MIRBuilder.buildConstant(v2s32, 16);
+  auto UShr =
+      MIRBuilder.buildInstr(TargetOpcode::G_LSHR, {v2s32}, {Bitcast, Amt});
+  auto Zero = MIRBuilder.buildConstant(s32, 0);
+  auto Extract = MIRBuilder.buildExtractVectorElement(s32, UShr, Zero);
+  MIRBuilder.buildBitcast({MI.getOperand(0).getReg()}, Extract);
+  MI.eraseFromParent();
+  return true;
+}
+
 bool AArch64LegalizerInfo::legalizeVaArg(MachineInstr &MI,
                                          MachineRegisterInfo &MRI,
                                          MachineIRBuilder &MIRBuilder) const {
   MachineFunction &MF = MIRBuilder.getMF();
   Align Alignment(MI.getOperand(2).getImm());
   Register Dst = MI.getOperand(0).getReg();
   Register ListPtr = MI.getOperand(1).getReg();

[... 181 lines not shown ...]

llvm/test/CodeGen/AArch64/GlobalISel/legalize-bswap.mir

 # NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
 # RUN: llc -march=aarch64 -run-pass=legalizer %s -o - -verify-machineinstrs | FileCheck %s
 --- |
   target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
   target triple = "aarch64"

   declare i16 @llvm.bswap.i16(i16) #0

   define i16 @bswap_s16(i16 %a) { ret i16 0 }

+  define <2 x i16> @bswap_2xi16(<2 x i16> %a) { ret <2 x i16> <i16 0, i16 0> }
+
   attributes #0 = { nounwind readnone speculatable willreturn }

 ...
 ---
 name: bswap_s16
 alignment: 4
 tracksRegLiveness: true
 liveins:
[... 18 lines not shown ...] bb.1:
     %1:_(s32) = COPY $w0
     %0:_(s16) = G_TRUNC %1(s32)
     %2:_(s16) = G_BSWAP %0
     %3:_(s32) = G_ANYEXT %2(s16)
     $w0 = COPY %3(s32)
     RET_ReallyLR implicit $w0

 ...
+---
+name: bswap_2xi16
+alignment: 4
+tracksRegLiveness: true
+registers:
+  - { id: 0, class: _ }
+  - { id: 1, class: _ }
+liveins:
+  - { reg: '$s0' }
+frameInfo:
+  maxAlignment: 1
+machineFunctionInfo: {}
+body: |
+  bb.1:
+    liveins: $s0
+
+    ; CHECK-LABEL: name: bswap_2xi16
+    ; CHECK: liveins: $s0
+    ; CHECK: [[COPY:%[0-9]+]]:_(<2 x s16>) = COPY $s0
+    ; CHECK: [[DEF:%[0-9]+]]:_(<8 x s8>) = G_IMPLICIT_DEF
+    ; CHECK: [[INSERT_SUBREG:%[0-9]+]]:_(<8 x s8>) = INSERT_SUBREG [[DEF]](<8 x s8>), [[COPY]](<2 x s16>), %subreg.ssub
+    ; CHECK: [[REV32_:%[0-9]+]]:_(<8 x s8>) = G_REV32 [[INSERT_SUBREG]]
+    ; CHECK: [[BITCAST:%[0-9]+]]:_(<2 x s32>) = G_BITCAST [[REV32_]](<8 x s8>)
+    ; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
+    ; CHECK: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[C]](s32), [[C]](s32)
+    ; CHECK: [[LSHR:%[0-9]+]]:_(<2 x s32>) = G_LSHR [[BITCAST]], [[BUILD_VECTOR]](<2 x s32>)
+    ; CHECK: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
+    ; CHECK: [[EVEC:%[0-9]+]]:_(s32) = G_EXTRACT_VECTOR_ELT [[LSHR]](<2 x s32>), [[C1]](s64)
+    ; CHECK: [[BITCAST1:%[0-9]+]]:_(<2 x s16>) = G_BITCAST [[EVEC]](s32)
+    ; CHECK: $s0 = COPY [[BITCAST1]](<2 x s16>)
+    ; CHECK: RET_ReallyLR
+    %0:_(<2 x s16>) = COPY $s0
+    %1:_(<2 x s16>) = G_BSWAP %0
+    $s0 = COPY %1(<2 x s16>)
+    RET_ReallyLR
+
+...

llvm/test/CodeGen/AArch64/GlobalISel/legalizer-info-validation.mir

[... 549 lines not shown ...]
 # DEBUG-NEXT: .. the first uncovered imm index: 0, OK
 # DEBUG-NEXT: G_CTLZ_ZERO_UNDEF (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
 # DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: G_CTPOP (opcode {{[0-9]+}}): 2 type indices, 0 imm indices
 # DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: G_BSWAP (opcode {{[0-9]+}}): 1 type index, 0 imm indices
-# DEBUG-NEXT: .. the first uncovered type index: 1, OK
-# DEBUG-NEXT: .. the first uncovered imm index: 0, OK
+# DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
+# DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: G_BITREVERSE (opcode {{[0-9]+}}): 1 type index, 0 imm indices
 # DEBUG-NEXT: .. the first uncovered type index: 1, OK
 # DEBUG-NEXT: .. the first uncovered imm index: 0, OK
 # DEBUG-NEXT: G_FCEIL (opcode {{[0-9]+}}): 1 type index, 0 imm indices
 # DEBUG-NEXT: .. type index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: .. imm index coverage check SKIPPED: user-defined predicate detected
 # DEBUG-NEXT: G_FCOS (opcode {{[0-9]+}}): 1 type index, 0 imm indices
 # DEBUG-NEXT: .. the first uncovered type index: 1, OK
[... 137 lines not shown ...]

This is an archive of the discontinued LLVM Phabricator instance.

[AArch64][GlobalISel] Legalize bswap <2 x i16>
Closed, Public

