This is an archive of the discontinued LLVM Phabricator instance.

Differential D150179

[GlobalISel] Hide hoisted constants behind G_BITCAST to prevent folding.
AbandonedPublic

Authored by aemerson on May 9 2023, 12:32 AM.

Details

Reviewers

paquette
arsenm
foad
tschuett
Pierre-vh

Summary

The constant hoisting pass tries to hoist large constants into predecessor blocks and
also generates remat instructions in terms of the hoisted constants. The aim is to
prevent codegen from rematerializing expensive constants multiple times. To reuse
this optimization in GlobalISel, we preserve the no-op bitcasts that are used to
anchor constants to the predecessor blocks.

SelectionDAG achieves this by having the OpaqueConstant node, which is just a
normal constant with an opaque flag set. I've opted to avoid introducing a new
constant generic instruction here.

However, once we've run all the combiners we can eliminate these G_BITCASTs so that
selection can see the constants. For AArch64 it makes sense to do this in the
post-legalizer combiner, but *after* the actual combiner loop runs, since we don't
want the bitcasts eliminated before any constant folding runs.
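
To illustrate the ordering described above, here is a minimal, self-contained sketch. It is a toy model, not LLVM code: `Inst`, `Function`, `foldAdds` and `stripBarriers` are hypothetical names invented for this example, and every value is assumed to have the same type so that each bitcast is a no-op. The folder only recognizes a constant when the register is defined directly by a `Constant`, so a `Bitcast` in between hides it. The late cleanup then forwards the bitcast source to its users and erases the bitcast.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Toy stand-ins for generic MIR. None of these types are LLVM's.
enum class Opcode { Constant, Bitcast, Add };

struct Inst {
  Opcode Op;
  int Dst;          // virtual register defined by this instruction
  int Src0 = -1;    // first source register (unused for Constant)
  int Src1 = -1;    // second source register (Add only)
  int64_t Imm = 0;  // immediate value (Constant only)
  bool Erased = false;
};

using Function = std::vector<Inst>;

// Return the live instruction defining Reg, or nullptr.
const Inst *getDef(const Function &F, int Reg) {
  for (const Inst &I : F)
    if (!I.Erased && I.Dst == Reg)
      return &I;
  return nullptr;
}

// A register counts as constant only if defined directly by a Constant.
// A Bitcast in between hides the value from the folder.
bool getConstant(const Function &F, int Reg, int64_t &Out) {
  const Inst *D = getDef(F, Reg);
  if (!D || D->Op != Opcode::Constant)
    return false;
  Out = D->Imm;
  return true;
}

// Stand-in for the combiner loop: fold Add of two visible constants.
bool foldAdds(Function &F) {
  bool Changed = false;
  for (Inst &I : F) {
    if (I.Erased || I.Op != Opcode::Add)
      continue;
    int64_t L, R;
    if (!getConstant(F, I.Src0, L) || !getConstant(F, I.Src1, R))
      continue;
    I.Op = Opcode::Constant;
    I.Imm = L + R;
    I.Src0 = I.Src1 = -1;
    Changed = true;
  }
  return Changed;
}

// Late cleanup, run after folding: forward each bitcast's source to its
// users and erase the bitcast.
bool stripBarriers(Function &F) {
  bool Changed = false;
  for (Inst &I : F) {
    if (I.Erased || I.Op != Opcode::Bitcast)
      continue;
    for (Inst &U : F) {
      if (U.Src0 == I.Dst)
        U.Src0 = I.Src0;
      if (U.Src1 == I.Dst)
        U.Src1 = I.Src0;
    }
    I.Erased = true;
    Changed = true;
  }
  return Changed;
}

// %0 = Constant 100000; %1 = Bitcast %0; %2 = Constant 1; %3 = Add %1, %2
Function makeExample() {
  return {{Opcode::Constant, 0, -1, -1, 100000},
          {Opcode::Bitcast, 1, 0},
          {Opcode::Constant, 2, -1, -1, 1},
          {Opcode::Add, 3, 1, 2}};
}
```

Calling `foldAdds` on `makeExample()` leaves the add alone because `%1` is hidden behind the bitcast. Calling `stripBarriers` afterwards rewrites the add to use `%0` directly, which is the form selection would then see.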

This change by itself yields only very minor improvements in -Os CTMark overall. What
it does allow is better optimization when future combines are added that rely on
expensive constants remaining unfolded.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aemerson created this revision. May 9 2023, 12:32 AM

Herald added a project: Restricted Project. May 9 2023, 12:32 AM

Herald added subscribers: StephenFan, hiraditya, kristof.beyls.

aemerson requested review of this revision. May 9 2023, 12:32 AM

Herald added a subscriber: wdng. May 9 2023, 12:32 AM

Seems like this is another one of those DAG workaround passes we ideally wouldn't need to keep running

llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-hoisted-constants.ll
Line 30: Also add some vector cases?

Harbormaster completed remote builds in B230807: Diff 520612. May 9 2023, 1:21 AM

In D150179#4328938, @arsenm wrote:

> Seems like this is another one of those DAG workaround passes we ideally wouldn't need to keep running

I thought so too, but reading its documentation more I think the problem it's solving is actually agnostic to selectors. That is, the core issue isn't that DAG can only see basic blocks, but one of balancing CSE & register pressure. We could stop running this IR pass (which does exist as IR due to SDAG's nature), but we'd just need to implement something similar in the GISel pipeline anyway.

llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-hoisted-constants.ll
Line 30: Good point, constant hoisting doesn't touch vector constants so we can avoid this bitcast trickery for those.
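
The scalar versus vector distinction can be sketched as a small decision function. This is a simplified model of the check that Diff 520750 adds to `translateBitCast`, not the real code: `OperandKind`, `Lowering` and `lowerBitCast` are hypothetical names, and the real translator tests `isa<ConstantInt>` on the operand rather than an enum.

```cpp
#include <cassert>

// Hypothetical classification of the bitcast operand, for illustration only.
enum class OperandKind { ScalarIntConstant, VectorConstant, NonConstant };

// What the translator does with an IR bitcast in this toy model.
enum class Lowering { ReuseSourceVReg, EmitBitcast };

Lowering lowerBitCast(bool SameLowLevelType, OperandKind Kind) {
  // A real type change always needs a G_BITCAST.
  if (!SameLowLevelType)
    return Lowering::EmitBitcast;
  // A no-op bitcast of a scalar integer constant was probably created by
  // constant hoisting, so keep it as a barrier against folding.
  if (Kind == OperandKind::ScalarIntConstant)
    return Lowering::EmitBitcast;
  // Everything else, including vector constants, reuses the source vreg.
  return Lowering::ReuseSourceVReg;
}
```

In this model a no-op bitcast of `<2 x i32>` constants falls through to the copy path, which matches the `test_vector` checks in the new test, where no G_BITCAST appears.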

Didn't notice the test failures before. One of them is the verifier complaining about the types being the same for G_BITCAST, so I've relaxed that restriction.
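
For reference, the relaxed verifier rule can be modeled as below. This is a sketch under simplifying assumptions: `ToyLLT` is a hypothetical type carrying only a pointer flag and a bit width (the real `LLT` also tracks address spaces and vector shapes), and the `RequireTypeChange` parameter exists only to contrast the old and new behavior. The error strings are the ones in the MachineVerifier diff.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Toy low-level type: a pointer flag and a size. Not LLVM's LLT.
struct ToyLLT {
  bool IsPointer;
  unsigned SizeInBits;
  bool operator==(const ToyLLT &O) const {
    return IsPointer == O.IsPointer && SizeInBits == O.SizeInBits;
  }
};

// Collect the verifier complaints for a bitcast from Src to Dst.
// RequireTypeChange = true models the rule before this revision,
// false models the relaxed rule.
std::vector<std::string> verifyBitcast(ToyLLT Src, ToyLLT Dst,
                                       bool RequireTypeChange) {
  std::vector<std::string> Errors;
  if (Src.IsPointer != Dst.IsPointer)
    Errors.push_back("bitcast cannot convert between pointers and other types");
  if (Src.SizeInBits != Dst.SizeInBits)
    Errors.push_back("bitcast sizes must match");
  if (RequireTypeChange && Src == Dst)
    Errors.push_back("bitcast must change the type");
  return Errors;
}
```

With `RequireTypeChange` set to false, an s32 to s32 bitcast produces no errors, while the pointer and size checks still fire as before.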

Harbormaster completed remote builds in B230908: Diff 520750. May 9 2023, 11:50 AM

Just wondering: if we have to add some "special" bitcast that needs to be removed before ISel anyway (+ relax G_BITCAST check rules on top of it), couldn't we just add a dedicated opcode instead?
I'm thinking of something like a G_OPAQUE opcode that serves as an optimization hint that an instruction should be considered opaque and no combiner should attempt to see through it. It could just be removed right before ISel.

I'm wondering if we'll see more use cases for this "hack" later and eventually end up with a use-case that cannot use G_BITCAST (e.g. pointer type?), so we end up doing a similar hack but on another opcode for a different type, etc

For instance, a G_OPAQUE instruction like that could be used to prevent infinite combine loop between the target-independent and target combine rules. If the target combine wants to create a pattern that isn't considered optimal by the generic combiner, it could use that opaque instruction to prevent further optimizations.
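
A toy model of the combine-loop scenario described above, purely as an illustration. G_OPAQUE does not exist in this revision, and the two rewrite rules (a generic `mul x, 2` to `shl x, 1` and a target rule that undoes it) are assumptions chosen only because they cancel each other out. `AmountOpaque` stands in for wrapping the constant operand in the hypothetical barrier.

```cpp
#include <cassert>

enum class Op { Mul, Shl };

// One binary operation with a constant right-hand operand.
struct Expr {
  Op Opcode;
  int Amount;         // the constant operand
  bool AmountOpaque;  // true if hidden behind the hypothetical barrier
};

// Generic rule: mul x, 2 -> shl x, 1, but only if the constant is visible.
bool genericCombine(Expr &E) {
  if (E.Opcode == Op::Mul && E.Amount == 2 && !E.AmountOpaque) {
    E = {Op::Shl, 1, false};
    return true;
  }
  return false;
}

// Target rule that prefers the multiply: shl x, 1 -> mul x, 2.
// With UseBarrier it hides the new constant from the generic rule.
bool targetCombine(Expr &E, bool UseBarrier) {
  if (E.Opcode == Op::Shl && E.Amount == 1) {
    E = {Op::Mul, 2, UseBarrier};
    return true;
  }
  return false;
}

// Apply both rules until nothing changes or MaxIters is reached.
// Returns the number of rewrites performed.
int runToFixpoint(Expr E, bool UseBarrier, int MaxIters) {
  int Rewrites = 0;
  for (int I = 0; I < MaxIters; ++I) {
    bool G = genericCombine(E);
    bool T = targetCombine(E, UseBarrier);
    Rewrites += G + T;
    if (!G && !T)
      break;
  }
  return Rewrites;
}
```

Starting from `{Op::Mul, 2, false}`, the run without the barrier keeps rewriting until the iteration cap is hit, while the run with the barrier settles after one round trip.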

In D150179#4349172, @Pierre-vh wrote:

> Just wondering: if we have to add some "special" bitcast that needs to be removed before ISel anyway (+ relax G_BITCAST check rules on top of it), couldn't we just add a dedicated opcode instead?
> I'm thinking of something like a G_OPAQUE opcode that serves as an optimization hint that an instruction should be considered opaque and no combiner should attempt to see through it. It could just be removed right before ISel.
>
> I'm wondering if we'll see more use cases for this "hack" later and eventually end up with a use-case that cannot use G_BITCAST (e.g. pointer type?), so we end up doing a similar hack but on another opcode for a different type, etc
>
> For instance, a G_OPAQUE instruction like that could be used to prevent infinite combine loop between the target-independent and target combine rules. If the target combine wants to create a pattern that isn't considered optimal by the generic combiner, it could use that opaque instruction to prevent further optimizations.

Yeah, I think that's ok too.

In D150179#4357513, @aemerson wrote:

>> For instance, a G_OPAQUE instruction like that could be used to prevent infinite combine loop between the target-independent and target combine rules. If the target combine wants to create a pattern that isn't considered optimal by the generic combiner, it could use that opaque instruction to prevent further optimizations.
>
> Yeah, I think that's ok too.

Isn't this what the DAG does? There are opaque constants

In D150179#4363870, @arsenm wrote:

>> In D150179#4357513, @aemerson wrote:
>>
>>> For instance, a G_OPAQUE instruction like that could be used to prevent infinite combine loop between the target-independent and target combine rules. If the target combine wants to create a pattern that isn't considered optimal by the generic combiner, it could use that opaque instruction to prevent further optimizations.
>>
>> Yeah, I think that's ok too.
>
> Isn't this what the DAG does? There are opaque constants

Pretty much, except Pierre’s suggestion is more of a dedicated barrier instruction (sort of like freeze), whereas OpaqueConstants are just normal constants with a flag bit set. I think a dedicated barrier is better since it’s messy to have two different constant representations in the MIR.

I've implemented the separate instruction approach in D151945

Revision Contents

Path	Size
llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp	7 lines
llvm/lib/CodeGen/MachineVerifier.cpp	3 lines
llvm/lib/Target/AArch64/GISel/AArch64PostLegalizerCombiner.cpp	23 lines
llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-bitcast.ll	2 lines
llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-hoisted-constants.ll	65 lines
llvm/test/CodeGen/AArch64/GlobalISel/localizer-arm64-tti.ll	22 lines
llvm/test/MachineVerifier/test_g_bitcast.mir	2 lines

Diff 520750

llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp

@@ ... @@ bool IRTranslator::translateCopy(const User &U, const Value &V,
   }
   return true;
 }

 bool IRTranslator::translateBitCast(const User &U,
                                     MachineIRBuilder &MIRBuilder) {
   // If we're bitcasting to the source type, we can reuse the source vreg.
   if (getLLTForType(U.getOperand(0)->getType(), DL) ==
-      getLLTForType(U.getType(), DL))
+      getLLTForType(U.getType(), DL)) {
+    // If the source is a ConstantInt then it was probably created by
+    // ConstantHoisting and we should leave it alone.
+    if (isa<ConstantInt>(U.getOperand(0)))
+      return translateCast(TargetOpcode::G_BITCAST, U, MIRBuilder);
     return translateCopy(U, *U.getOperand(0), MIRBuilder);
+  }

   return translateCast(TargetOpcode::G_BITCAST, U, MIRBuilder);
 }

 bool IRTranslator::translateCast(unsigned Opcode, const User &U,
                                  MachineIRBuilder &MIRBuilder) {
   Register Op = getOrCreateVReg(*U.getOperand(0));
   Register Res = getOrCreateVReg(U);
@@ ... @@

llvm/lib/CodeGen/MachineVerifier.cpp

@@ ... @@ if (!DstTy.isValid() || !SrcTy.isValid())
       break;

     if (SrcTy.isPointer() != DstTy.isPointer())
       report("bitcast cannot convert between pointers and other types", MI);

     if (SrcTy.getSizeInBits() != DstTy.getSizeInBits())
       report("bitcast sizes must match", MI);
-
-    if (SrcTy == DstTy)
-      report("bitcast must change the type", MI);

     break;
   }
   case TargetOpcode::G_INTTOPTR:
   case TargetOpcode::G_PTRTOINT:
   case TargetOpcode::G_ADDRSPACE_CAST: {
     LLT DstTy = MRI->getType(MI->getOperand(0).getReg());
     LLT SrcTy = MRI->getType(MI->getOperand(1).getReg());
     if (!DstTy.isValid() || !SrcTy.isValid())
@@ ... @@

llvm/lib/Target/AArch64/GISel/AArch64PostLegalizerCombiner.cpp

@@ ... @@
 /// AArch64PostLegalizerLowering.
 ///
 /// Combines which don't rely on instruction legality should go in the
 /// AArch64PreLegalizerCombiner.
 ///
 //===----------------------------------------------------------------------===//

 #include "AArch64TargetMachine.h"
+#include "llvm/ADT/STLExtras.h"
 #include "llvm/CodeGen/GlobalISel/CSEInfo.h"
 #include "llvm/CodeGen/GlobalISel/Combiner.h"
 #include "llvm/CodeGen/GlobalISel/CombinerHelper.h"
 #include "llvm/CodeGen/GlobalISel/CombinerInfo.h"
 #include "llvm/CodeGen/GlobalISel/GISelChangeObserver.h"
 #include "llvm/CodeGen/GlobalISel/GISelKnownBits.h"
 #include "llvm/CodeGen/GlobalISel/GenericMachineInstrs.h"
 #include "llvm/CodeGen/GlobalISel/MIPatternMatch.h"
@@ ... @@ bool AArch64PostLegalizerCombiner::runOnMachineFunction(MachineFunction &MF) {
   MachineDominatorTree *MDT =
       IsOptNone ? nullptr : &getAnalysis<MachineDominatorTree>();
   AArch64PostLegalizerCombinerInfo PCInfo(EnableOpt, F.hasOptSize(),
                                           F.hasMinSize(), KB, MDT);
   GISelCSEAnalysisWrapper &Wrapper =
       getAnalysis<GISelCSEAnalysisWrapperPass>().getCSEWrapper();
   auto *CSEInfo = &Wrapper.get(TPC->getCSEConfig());
   Combiner C(PCInfo, TPC);
-  return C.combineMachineInstrs(MF, CSEInfo);
+  bool Changed = C.combineMachineInstrs(MF, CSEInfo);
+
+  auto &MRI = MF.getRegInfo();
+  // Eliminate any no-op bitcasts that we created to preserve hoisted constants.
+  // We wait until all the combiners have run otherwise the constants may get
+  // re-folded with others during the main combiner loop.
+  for (auto &MBB : MF) {
+    for (auto &MI : make_early_inc_range(MBB)) {
+      if (MI.getOpcode() == TargetOpcode::G_BITCAST) {
+        Register Dst = MI.getOperand(0).getReg();
+        Register Src = MI.getOperand(1).getReg();
+        if (MRI.getType(Dst) != MRI.getType(Src))
+          continue;
+        MRI.replaceRegWith(Dst, Src);
+        MI.eraseFromParent();
+        Changed = true;
+      }
+    }
+  }
+
+  return Changed;
 }

 char AArch64PostLegalizerCombiner::ID = 0;
 INITIALIZE_PASS_BEGIN(AArch64PostLegalizerCombiner, DEBUG_TYPE,
                       "Combine AArch64 MachineInstrs after legalization", false,
                       false)
 INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)
 INITIALIZE_PASS_DEPENDENCY(GISelKnownBitsAnalysis)
@@ ... @@

llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-bitcast.ll

@@ ... @@ define i32 @test_bitcast_invalid_vreg() {
   %tmp11 = add i32 23, 24
   %tmp12 = add i32 25, 26
   %tmp13 = add i32 27, 28
   %tmp14 = add i32 29, 30
   %tmp15 = add i32 30, 30

 ; At this point we mapped 46 values. The 'i32 100' constant will grow the map.
 ; CHECK: %46:_(s32) = G_CONSTANT i32 100
-; CHECK: $w0 = COPY %46(s32)
+; CHECK: $w0 = COPY %47(s32)
   %res = bitcast i32 100 to i32
   ret i32 %res
 }

llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-hoisted-constants.ll

This file was added.

+; NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 2
+; RUN: llc -mtriple=aarch64-apple-ios -global-isel -stop-after=irtranslator %s -o - | FileCheck %s --check-prefix=TRANSLATED
+; RUN: llc -mtriple=aarch64-apple-ios -global-isel -stop-after=aarch64-postlegalizer-combiner %s -o - | FileCheck %s --check-prefix=PRESELECTION
+
+; Check we don't elide no-op bitcasts of constants since they're used by constant
+; hoisting to prevent constant folding/propagation.
+
+define i32 @test(i32 %a) {
+  ; TRANSLATED-LABEL: name: test
+  ; TRANSLATED: bb.1.entry:
+  ; TRANSLATED-NEXT: liveins: $w0
+  ; TRANSLATED-NEXT: {{ $}}
+  ; TRANSLATED-NEXT: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
+  ; TRANSLATED-NEXT: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 100000
+  ; TRANSLATED-NEXT: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[C]](s32)
+  ; TRANSLATED-NEXT: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY]], [[BITCAST]]
+  ; TRANSLATED-NEXT: $w0 = COPY [[ADD]](s32)
+  ; TRANSLATED-NEXT: RET_ReallyLR implicit $w0
+  ; PRESELECTION-LABEL: name: test
+  ; PRESELECTION: bb.1.entry:
+  ; PRESELECTION-NEXT: liveins: $w0
+  ; PRESELECTION-NEXT: {{ $}}
+  ; PRESELECTION-NEXT: [[COPY:%[0-9]+]]:_(s32) = COPY $w0
+  ; PRESELECTION-NEXT: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 100000
+  ; PRESELECTION-NEXT: [[ADD:%[0-9]+]]:_(s32) = G_ADD [[COPY]], [[C]]
+  ; PRESELECTION-NEXT: $w0 = COPY [[ADD]](s32)
+  ; PRESELECTION-NEXT: RET_ReallyLR implicit $w0
+entry:
+  %hc = bitcast i32 100000 to i32
+  br label %cont
+cont:
+  %add = add i32 %a, %hc
+  ret i32 %add
+}
+
+
+define <2 x i32> @test_vector(<2 x i32> %a) {
+  ; TRANSLATED-LABEL: name: test_vector
+  ; TRANSLATED: bb.1.entry:
+  ; TRANSLATED-NEXT: liveins: $d0
+  ; TRANSLATED-NEXT: {{ $}}
+  ; TRANSLATED-NEXT: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $d0
+  ; TRANSLATED-NEXT: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 100000
+  ; TRANSLATED-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[C]](s32), [[C]](s32)
+  ; TRANSLATED-NEXT: [[ADD:%[0-9]+]]:_(<2 x s32>) = G_ADD [[COPY]], [[BUILD_VECTOR]]
+  ; TRANSLATED-NEXT: $d0 = COPY [[ADD]](<2 x s32>)
+  ; TRANSLATED-NEXT: RET_ReallyLR implicit $d0
+  ; PRESELECTION-LABEL: name: test_vector
+  ; PRESELECTION: bb.1.entry:
+  ; PRESELECTION-NEXT: liveins: $d0
+  ; PRESELECTION-NEXT: {{ $}}
+  ; PRESELECTION-NEXT: [[COPY:%[0-9]+]]:_(<2 x s32>) = COPY $d0
+  ; PRESELECTION-NEXT: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 100000
+  ; PRESELECTION-NEXT: [[BUILD_VECTOR:%[0-9]+]]:_(<2 x s32>) = G_BUILD_VECTOR [[C]](s32), [[C]](s32)
+  ; PRESELECTION-NEXT: [[ADD:%[0-9]+]]:_(<2 x s32>) = G_ADD [[COPY]], [[BUILD_VECTOR]]
+  ; PRESELECTION-NEXT: $d0 = COPY [[ADD]](<2 x s32>)
+  ; PRESELECTION-NEXT: RET_ReallyLR implicit $d0
+entry:
+  %hc = bitcast <2 x i32> <i32 100000, i32 100000> to <2 x i32>
+  br label %cont
+cont:
+  %add = add <2 x i32> %a, %hc
+  ret <2 x i32> %add
+}

llvm/test/CodeGen/AArch64/GlobalISel/localizer-arm64-tti.ll

@@ ... @@ if.end:
   ret i32 0
 }

 define i32 @imm_cost_too_large_cost_of_2() {
   ; CHECK-LABEL: name: imm_cost_too_large_cost_of_2
   ; CHECK: bb.1.entry:
   ; CHECK-NEXT: successors: %bb.2(0x40000000), %bb.4(0x40000000)
   ; CHECK-NEXT: {{ $}}
-  ; CHECK-NEXT: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2228259
   ; CHECK-NEXT: [[GV:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var2
   ; CHECK-NEXT: [[GV1:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var3
-  ; CHECK-NEXT: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
+  ; CHECK-NEXT: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
   ; CHECK-NEXT: [[GV2:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var1
   ; CHECK-NEXT: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[GV2]](p0) :: (dereferenceable load (s32) from @var1)
+  ; CHECK-NEXT: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 -2228259
+  ; CHECK-NEXT: [[BITCAST:%[0-9]+]]:_(s32) = G_BITCAST [[C1]](s32)
   ; CHECK-NEXT: [[C2:%[0-9]+]]:_(s32) = G_CONSTANT i32 1
   ; CHECK-NEXT: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(ne), [[LOAD]](s32), [[C2]]
   ; CHECK-NEXT: G_BRCOND [[ICMP]](s1), %bb.4
   ; CHECK-NEXT: G_BR %bb.2
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: bb.2.if.then:
   ; CHECK-NEXT: successors: %bb.3(0x80000000)
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: [[GV3:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var2
-  ; CHECK-NEXT: G_STORE [[C]](s32), [[GV3]](p0) :: (store (s32) into @var2)
+  ; CHECK-NEXT: G_STORE [[BITCAST]](s32), [[GV3]](p0) :: (store (s32) into @var2)
   ; CHECK-NEXT: G_BR %bb.3
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: bb.3.if.then2:
   ; CHECK-NEXT: successors: %bb.4(0x80000000)
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: [[GV4:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var1
-  ; CHECK-NEXT: G_STORE [[C]](s32), [[GV4]](p0) :: (store (s32) into @var1)
+  ; CHECK-NEXT: G_STORE [[BITCAST]](s32), [[GV4]](p0) :: (store (s32) into @var1)
   ; CHECK-NEXT: G_BR %bb.4
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: bb.4.if.end:
   ; CHECK-NEXT: [[GV5:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var3
-  ; CHECK-NEXT: G_STORE [[C]](s32), [[GV5]](p0) :: (store (s32) into @var3)
+  ; CHECK-NEXT: G_STORE [[BITCAST]](s32), [[GV5]](p0) :: (store (s32) into @var3)
   ; CHECK-NEXT: [[C3:%[0-9]+]]:_(s32) = G_CONSTANT i32 0
   ; CHECK-NEXT: $w0 = COPY [[C3]](s32)
   ; CHECK-NEXT: RET_ReallyLR implicit $w0
 entry:
   %0 = load i32, ptr @var1, align 4
   %cst1 = bitcast i32 -2228259 to i32
   %cmp = icmp eq i32 %0, 1
   br i1 %cmp, label %if.then, label %if.end
@@ ... @@ if.end:
   ret i32 0
 }

 define i64 @imm_cost_too_large_cost_of_4() {
   ; CHECK-LABEL: name: imm_cost_too_large_cost_of_4
   ; CHECK: bb.1.entry:
   ; CHECK-NEXT: successors: %bb.2(0x40000000), %bb.4(0x40000000)
   ; CHECK-NEXT: {{ $}}
-  ; CHECK-NEXT: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 -2228259
   ; CHECK-NEXT: [[GV:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var2_64
   ; CHECK-NEXT: [[GV1:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var3_64
-  ; CHECK-NEXT: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
+  ; CHECK-NEXT: [[C:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
   ; CHECK-NEXT: [[GV2:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var1_64
   ; CHECK-NEXT: [[LOAD:%[0-9]+]]:_(s64) = G_LOAD [[GV2]](p0) :: (dereferenceable load (s64) from @var1_64, align 4)
+  ; CHECK-NEXT: [[C1:%[0-9]+]]:_(s64) = G_CONSTANT i64 -2228259
+  ; CHECK-NEXT: [[BITCAST:%[0-9]+]]:_(s64) = G_BITCAST [[C1]](s64)
   ; CHECK-NEXT: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 1
   ; CHECK-NEXT: [[ICMP:%[0-9]+]]:_(s1) = G_ICMP intpred(ne), [[LOAD]](s64), [[C2]]
   ; CHECK-NEXT: G_BRCOND [[ICMP]](s1), %bb.4
   ; CHECK-NEXT: G_BR %bb.2
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: bb.2.if.then:
   ; CHECK-NEXT: successors: %bb.3(0x80000000)
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: [[GV3:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var2_64
-  ; CHECK-NEXT: G_STORE [[C]](s64), [[GV3]](p0) :: (store (s64) into @var2_64)
+  ; CHECK-NEXT: G_STORE [[BITCAST]](s64), [[GV3]](p0) :: (store (s64) into @var2_64)
   ; CHECK-NEXT: G_BR %bb.3
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: bb.3.if.then2:
   ; CHECK-NEXT: successors: %bb.4(0x80000000)
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: [[GV4:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var1_64
-  ; CHECK-NEXT: G_STORE [[C]](s64), [[GV4]](p0) :: (store (s64) into @var1_64)
+  ; CHECK-NEXT: G_STORE [[BITCAST]](s64), [[GV4]](p0) :: (store (s64) into @var1_64)
   ; CHECK-NEXT: G_BR %bb.4
   ; CHECK-NEXT: {{ $}}
   ; CHECK-NEXT: bb.4.if.end:
   ; CHECK-NEXT: [[GV5:%[0-9]+]]:_(p0) = G_GLOBAL_VALUE @var3_64
-  ; CHECK-NEXT: G_STORE [[C]](s64), [[GV5]](p0) :: (store (s64) into @var3_64)
+  ; CHECK-NEXT: G_STORE [[BITCAST]](s64), [[GV5]](p0) :: (store (s64) into @var3_64)
   ; CHECK-NEXT: [[C3:%[0-9]+]]:_(s64) = G_CONSTANT i64 0
   ; CHECK-NEXT: $x0 = COPY [[C3]](s64)
   ; CHECK-NEXT: RET_ReallyLR implicit $x0
 entry:
   %0 = load i64, ptr @var1_64, align 4
   %cst1 = bitcast i64 -2228259 to i64
   %cmp = icmp eq i64 %0, 1
   br i1 %cmp, label %if.then, label %if.end
@@ ... @@

llvm/test/MachineVerifier/test_g_bitcast.mir

@@ ... @@ ; CHECK: Bad machine code: bitcast cannot convert between pointers and other types
     ; CHECK: Bad machine code: bitcast sizes must match
     %8:_(p1) = G_IMPLICIT_DEF
     %9:_(p3) = G_BITCAST %8

     ; CHECK: Bad machine code: bitcast sizes must match
     %10:_(p1) = G_IMPLICIT_DEF
     %11:_(p3) = G_BITCAST %8
-
-    ; CHECK: Bad machine code: bitcast must change the type
-    %12:_(s64) = G_BITCAST %0
 ...