This is an archive of the discontinued LLVM Phabricator instance.

[x86] use the IR type of formal args to create assertzext/assertsext and scalar truncate nodes
AbandonedPublic

Authored by spatel on Aug 23 2017, 9:13 AM.

Download Raw Diff

Details

Reviewers

aivchenk
craig.topper
aaboud
delena
zvi
qcolombet
andreadb
efriedma

Summary

This is an implementation of the idea I suggested in D37017 as an alternate way to solve the multiple assertzext problem. I'm posting this just so it's clear what I had drafted. It could be completely wrong for reasons I don't understand yet. :)

I've avoided the vector problems/questions by only using the IR type in a truncate of a scalar. I don't know what mapping we use for illegal vector types or AVX512 masks to formal args, but there must be some cases where vectors are mapped to scalars because I hit asserts without that limitation.
I loosened an assert in SelectionDAGISel::LowerArguments() to account for this new/unexpected behavior.
The tests diffs are a superset of those in D37017 currently because we handle assertsext here.

Diff Detail

Event Timeline

spatel created this revision.Aug 23 2017, 9:13 AM

Herald added a subscriber: mcrosier. · View Herald TranscriptAug 23 2017, 9:13 AM

aaboud added inline comments.Aug 23 2017, 1:37 PM

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
8734	Do we still need this? It is the reason for the second AssertZext instruction.

spatel added a reviewer: efriedma.Aug 23 2017, 3:02 PM

spatel added inline comments.

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
8734	I don't think we'll get the 2nd assert for the affected tests with this patch because we trunc to i1, but let me know if you're seeing something else. But yes, we do need this code - or at least there's some other change needed to account for that part. If I just don't set AssertOp, we'll get these x86 test fails and from what I saw it's because we need extra masking ops without the assert: Failing Tests (5): LLVM :: CodeGen/X86/bool-zext.ll LLVM :: CodeGen/X86/illegal-bitfield-loadstore.ll LLVM :: CodeGen/X86/negate-i1.ll LLVM :: CodeGen/X86/sext-i1.ll LLVM :: CodeGen/X86/tail-call-casts.ll

aaboud added inline comments.Aug 24 2017, 5:08 AM

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

8734

I was using this test to debug the second AssertSext:

target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64"


define i32  @sbar(i1 signext %a) {
 %b = xor i1 %a, 1
 %sext = sext i1 %b to i32
 ret i32 %sext
}

command line:
llc -mtriple=x86_64-unknown-unknown -o -

DAG after this line:

SelectionDAG has 9 nodes:
            t0: ch = EntryToken
          t2: i32,ch = CopyFromReg t0, Register:i32 %vreg0
        t4: i32 = AssertSext t2, ValueType:ch:i8
      t5: i8 = truncate t4
    t7: i8 = AssertSext t5, ValueType:ch:i1
  t8: i1 = truncate t7
  t0: ch = EntryToken

spatel added inline comments.Aug 24 2017, 6:07 AM

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
8734	Yes - that's exactly the case that this patch should be hitting. I see: Creating new node: t4: i32 = AssertSext t2, ValueType:ch:i1 Creating new node: t5: i1 = truncate t4 ...so there's no extra trunc+assertzext with this patch. You're not seeing that?

aaboud added inline comments.Aug 24 2017, 8:18 AM

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp
8734	You are right, this patch prevent this code from generating another AssertSext. Now we return from getCopyFromParts at line 222, before adding the new AsserSext.
lib/Target/X86/X86ISelLowering.cpp
3009	by the way, do we need these parentheses here and below with the one line "else if"/"else" ?

spatel added inline comments.Aug 24 2017, 8:44 AM

lib/Target/X86/X86ISelLowering.cpp
3009	Technically, no - but since one clause in this chain had them (and I thought that has a good thing for readability), then all clauses in this chain should have them for symmetry. I think the LLVM coding guidelines are purposely ambiguous on this, but my preference is to brace more. It improves readability and helps avoid bugs if someone comes along and wants to add more code or comments - which I probably should do if this patch is functionally correct and going to proceed :)

Ping.

Any thoughts about the validity of this approach? If not, I can clean up D37017 as the alternate way to improve the x86 codegen.

Using more aggressive AssertZext/etc. when we can seems obviously good. I don't see what that has to do with modifying the type of ArgValue, though.

In D37069#858329, @efriedma wrote:

Using more aggressive AssertZext/etc. when we can seems obviously good. I don't see what that has to do with modifying the type of ArgValue, though.

IIUC, you're suggesting the solution that @aaboud was probably making earlier, but I didn't recognize it at the time:

Fix the x86 (and other targets*) assertzext generation to use the IR type because that has to be equal or smaller than the register type (that's the first part of the x86 diff in this rev of the patch near line 3000).
Don't alter the ArgValue types.
Fix the later chunk in SelectionDAGISel::LowerArguments() to not create a redundant assertzext if the target has already created one.

This produces identical x86 test diffs as we see here. We'll still have a trunc-of-trunc for i32 -> i8 -> i1 in the affected x86 cases, but those will get folded by existing logic.
*For in-tree targets, I would need to make a similar fix as the x86 change to PowerPC, Mips, and AMDGPU because I see regression test failures for all of those.

But...
As I was looking at what Mips was doing I discovered:

performAssertZextCombine()
// fold (AssertZext (trunc (AssertZext x))) -> (trunc (AssertZext x))
// if the type of the extension of the innermost AssertZext node is
// smaller from that of the outermost node, eg:
// (AssertZext:i32 (trunc:i32 (AssertZext:i64 X, i32)), i8)
//   -> (trunc:i32 (AssertZext X, i8))

This is similar to what I proposed as a general DAGCombine in D37017. Should we go ahead with that patch and avoid all of the target-specific changes? The upside is that out-of-tree targets likely have the same bug, so if we add a generic combine, everyone gets the improvement. The downside is slightly less efficiency from creating an unnecessary assert node that ends up being removed.

Abandoning - I've updated D37017; I think that makes more sense.

Revision Contents

Path

Size

lib/

CodeGen/

SelectionDAG/

SelectionDAGBuilder.cpp

3 lines

Target/

X86/

X86ISelLowering.cpp

13 lines

test/

CodeGen/

X86/

bool-zext.ll

3 lines

critical-edge-split-2.ll

2 lines

fp128-select.ll

4 lines

illegal-bitfield-loadstore.ll

53 lines

4 lines

14 lines

46 lines

9 lines

17 lines

Diff 112381

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 8,639 Lines • ▼ Show 20 Lines	void SelectionDAGISel::LowerArguments(const Function &F) {
assert(NewRoot.getNode() && NewRoot.getValueType() == MVT::Other &&		assert(NewRoot.getNode() && NewRoot.getValueType() == MVT::Other &&
"LowerFormalArguments didn't return a valid chain!");		"LowerFormalArguments didn't return a valid chain!");
assert(InVals.size() == Ins.size() &&		assert(InVals.size() == Ins.size() &&
"LowerFormalArguments didn't emit the correct number of values!");		"LowerFormalArguments didn't emit the correct number of values!");
DEBUG({		DEBUG({
for (unsigned i = 0, e = Ins.size(); i != e; ++i) {		for (unsigned i = 0, e = Ins.size(); i != e; ++i) {
assert(InVals[i].getNode() &&		assert(InVals[i].getNode() &&
"LowerFormalArguments emitted a null value!");		"LowerFormalArguments emitted a null value!");
assert(EVT(Ins[i].VT) == InVals[i].getValueType() &&		assert((EVT(Ins[i].VT) == InVals[i].getValueType() \|\|
		EVT(Ins[i].ArgVT) == InVals[i].getValueType()) &&
"LowerFormalArguments emitted a value with the wrong type!");		"LowerFormalArguments emitted a value with the wrong type!");
}		}
});		});

// Update the DAG with the new chain value resulting from argument lowering.		// Update the DAG with the new chain value resulting from argument lowering.
DAG.setRoot(NewRoot);		DAG.setRoot(NewRoot);

// Set up the argument values.		// Set up the argument values.
▲ Show 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	for (unsigned Val = 0; Val != NumValues; ++Val) {
// function.		// function.
if (ArgHasUses \|\| isSwiftErrorArg) {		if (ArgHasUses \|\| isSwiftErrorArg) {
Optional<ISD::NodeType> AssertOp;		Optional<ISD::NodeType> AssertOp;
if (Arg.hasAttribute(Attribute::SExt))		if (Arg.hasAttribute(Attribute::SExt))
AssertOp = ISD::AssertSext;		AssertOp = ISD::AssertSext;
else if (Arg.hasAttribute(Attribute::ZExt))		else if (Arg.hasAttribute(Attribute::ZExt))
AssertOp = ISD::AssertZext;		AssertOp = ISD::AssertZext;

ArgValues.push_back(getCopyFromParts(DAG, dl, &InVals[i], NumParts,		ArgValues.push_back(getCopyFromParts(DAG, dl, &InVals[i], NumParts,
		aaboudUnsubmitted Not Done Reply Inline Actions Do we still need this? It is the reason for the second AssertZext instruction. aaboud: Do we still need this? It is the reason for the second AssertZext instruction.
		spatelAuthorUnsubmitted Not Done Reply Inline Actions I don't think we'll get the 2nd assert for the affected tests with this patch because we trunc to i1, but let me know if you're seeing something else. But yes, we do need this code - or at least there's some other change needed to account for that part. If I just don't set AssertOp, we'll get these x86 test fails and from what I saw it's because we need extra masking ops without the assert: Failing Tests (5): LLVM :: CodeGen/X86/bool-zext.ll LLVM :: CodeGen/X86/illegal-bitfield-loadstore.ll LLVM :: CodeGen/X86/negate-i1.ll LLVM :: CodeGen/X86/sext-i1.ll LLVM :: CodeGen/X86/tail-call-casts.ll spatel: I don't think we'll get the 2nd assert for the affected tests with this patch because we trunc…
		aaboudUnsubmitted Not Done Reply Inline Actions I was using this test to debug the second AssertSext: target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64" define i32 @sbar(i1 signext %a) { %b = xor i1 %a, 1 %sext = sext i1 %b to i32 ret i32 %sext } command line: llc -mtriple=x86_64-unknown-unknown -o - DAG after this line: SelectionDAG has 9 nodes: t0: ch = EntryToken t2: i32,ch = CopyFromReg t0, Register:i32 %vreg0 t4: i32 = AssertSext t2, ValueType:ch:i8 t5: i8 = truncate t4 t7: i8 = AssertSext t5, ValueType:ch:i1 t8: i1 = truncate t7 t0: ch = EntryToken aaboud: I was using this test to debug the second AssertSext: ``` target datalayout = "e-p:64:64:64…
		spatelAuthorUnsubmitted Not Done Reply Inline Actions Yes - that's exactly the case that this patch should be hitting. I see: Creating new node: t4: i32 = AssertSext t2, ValueType:ch:i1 Creating new node: t5: i1 = truncate t4 ...so there's no extra trunc+assertzext with this patch. You're not seeing that? spatel: Yes - that's exactly the case that this patch should be hitting. I see: Creating new node: t4…
		aaboudUnsubmitted Not Done Reply Inline Actions You are right, this patch prevent this code from generating another AssertSext. Now we return from getCopyFromParts at line 222, before adding the new AsserSext. aaboud: You are right, this patch prevent this code from generating another AssertSext. Now we return…
PartVT, VT, nullptr, AssertOp,		PartVT, VT, nullptr, AssertOp,
true));		true));
}		}

i += NumParts;		i += NumParts;
}		}

// We don't need to do anything else for unused arguments.		// We don't need to do anything else for unused arguments.
▲ Show 20 Lines • Show All 1,099 Lines • Show Last 20 Lines

lib/Target/X86/X86ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 2,991 Lines • ▼ Show 20 Lines	if (VA.isRegLoc()) {
ArgValue = DAG.getCopyFromReg(Chain, dl, Reg, RegVT);		ArgValue = DAG.getCopyFromReg(Chain, dl, Reg, RegVT);
}		}

// If this is an 8 or 16-bit value, it is really passed promoted to 32		// If this is an 8 or 16-bit value, it is really passed promoted to 32
// bits. Insert an assert[sz]ext to capture this, then truncate to the		// bits. Insert an assert[sz]ext to capture this, then truncate to the
// right size.		// right size.
if (VA.getLocInfo() == CCValAssign::SExt)		if (VA.getLocInfo() == CCValAssign::SExt)
ArgValue = DAG.getNode(ISD::AssertSext, dl, RegVT, ArgValue,		ArgValue = DAG.getNode(ISD::AssertSext, dl, RegVT, ArgValue,
DAG.getValueType(VA.getValVT()));		DAG.getValueType(Ins[InsIndex].ArgVT));
else if (VA.getLocInfo() == CCValAssign::ZExt)		else if (VA.getLocInfo() == CCValAssign::ZExt)
ArgValue = DAG.getNode(ISD::AssertZext, dl, RegVT, ArgValue,		ArgValue = DAG.getNode(ISD::AssertZext, dl, RegVT, ArgValue,
DAG.getValueType(VA.getValVT()));		DAG.getValueType(Ins[InsIndex].ArgVT));
else if (VA.getLocInfo() == CCValAssign::BCvt)		else if (VA.getLocInfo() == CCValAssign::BCvt)
ArgValue = DAG.getBitcast(VA.getValVT(), ArgValue);		ArgValue = DAG.getBitcast(VA.getValVT(), ArgValue);

if (VA.isExtInLoc()) {		if (VA.isExtInLoc()) {
// Handle MMX values passed in XMM regs.		// Handle MMX values passed in XMM regs.
if (RegVT.isVector() && VA.getValVT().getScalarType() != MVT::i1)		if (RegVT.isVector() && VA.getValVT().getScalarType() != MVT::i1) {
		aaboudUnsubmitted Not Done Reply Inline Actions by the way, do we need these parentheses here and below with the one line "else if"/"else" ? aaboud: by the way, do we need these parentheses here and below with the one line "else if"/"else" ?
		spatelAuthorUnsubmitted Not Done Reply Inline Actions Technically, no - but since one clause in this chain had them (and I thought that has a good thing for readability), then all clauses in this chain should have them for symmetry. I think the LLVM coding guidelines are purposely ambiguous on this, but my preference is to brace more. It improves readability and helps avoid bugs if someone comes along and wants to add more code or comments - which I probably should do if this patch is functionally correct and going to proceed :) spatel: Technically, no - but since one clause in this chain had them (and I thought that has a good…
ArgValue = DAG.getNode(X86ISD::MOVDQ2Q, dl, VA.getValVT(), ArgValue);		ArgValue = DAG.getNode(X86ISD::MOVDQ2Q, dl, VA.getValVT(), ArgValue);
else if (VA.getValVT().isVector() &&		} else if (VA.getValVT().isVector() &&
VA.getValVT().getScalarType() == MVT::i1 &&		VA.getValVT().getScalarType() == MVT::i1 &&
((VA.getLocVT() == MVT::i64) \|\| (VA.getLocVT() == MVT::i32) \|\|		((VA.getLocVT() == MVT::i64) \|\| (VA.getLocVT() == MVT::i32) \|\|
(VA.getLocVT() == MVT::i16) \|\| (VA.getLocVT() == MVT::i8))) {		(VA.getLocVT() == MVT::i16) \|\| (VA.getLocVT() == MVT::i8))) {
// Promoting a mask type (v*i1) into a register of type i64/i32/i16/i8		// Promoting a mask type (v*i1) into a register of type i64/i32/i16/i8
ArgValue = lowerRegToMasks(ArgValue, VA.getValVT(), RegVT, dl, DAG);		ArgValue = lowerRegToMasks(ArgValue, VA.getValVT(), RegVT, dl, DAG);
} else		} else if (!Ins[InsIndex].ArgVT.isVector() && !VA.getValVT().isVector()) {
		ArgValue = DAG.getNode(ISD::TRUNCATE, dl, Ins[InsIndex].ArgVT, ArgValue);
		} else {
ArgValue = DAG.getNode(ISD::TRUNCATE, dl, VA.getValVT(), ArgValue);		ArgValue = DAG.getNode(ISD::TRUNCATE, dl, VA.getValVT(), ArgValue);
}		}
		}
} else {		} else {
assert(VA.isMemLoc());		assert(VA.isMemLoc());
ArgValue =		ArgValue =
LowerMemArgument(Chain, CallConv, Ins, dl, DAG, VA, MFI, InsIndex);		LowerMemArgument(Chain, CallConv, Ins, dl, DAG, VA, MFI, InsIndex);
}		}

// If value is passed via pointer - do a load.		// If value is passed via pointer - do a load.
if (VA.getLocInfo() == CCValAssign::Indirect)		if (VA.getLocInfo() == CCValAssign::Indirect)
▲ Show 20 Lines • Show All 33,780 Lines • Show Last 20 Lines

test/CodeGen/X86/bool-zext.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=i686-unknown-unknown \| FileCheck %s -check-prefix=X32			; RUN: llc < %s -mtriple=i686-unknown-unknown \| FileCheck %s -check-prefix=X32
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown \| FileCheck %s -check-prefix=X64			; RUN: llc < %s -mtriple=x86_64-unknown-unknown \| FileCheck %s -check-prefix=X64

	; Check that the argument gets zero-extended before calling.			; It's not necessary to zero-extend the arg because it is specified 'zeroext'.
	define void @bar1(i1 zeroext %v1) nounwind ssp {			define void @bar1(i1 zeroext %v1) nounwind ssp {
	; X32-LABEL: bar1:			; X32-LABEL: bar1:
	; X32: # BB#0:			; X32: # BB#0:
	; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: pushl %eax			; X32-NEXT: pushl %eax
	; X32-NEXT: calll foo1			; X32-NEXT: calll foo1
	; X32-NEXT: addl $4, %esp			; X32-NEXT: addl $4, %esp
	; X32-NEXT: retl			; X32-NEXT: retl
	;			;
	; X64-LABEL: bar1:			; X64-LABEL: bar1:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %dil, %edi
	; X64-NEXT: xorl %eax, %eax			; X64-NEXT: xorl %eax, %eax
	; X64-NEXT: jmp foo1 # TAILCALL			; X64-NEXT: jmp foo1 # TAILCALL
	%conv = zext i1 %v1 to i32			%conv = zext i1 %v1 to i32
	%call = tail call i32 (...) @foo1(i32 %conv) nounwind			%call = tail call i32 (...) @foo1(i32 %conv) nounwind
	ret void			ret void
	}			}

	; Check that on x86-64 the arguments are simply forwarded.			; Check that on x86-64 the arguments are simply forwarded.
	Show All 38 Lines

test/CodeGen/X86/critical-edge-split-2.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=x86_64-unknown-unknown \| FileCheck %s			; RUN: llc < %s -mtriple=x86_64-unknown-unknown \| FileCheck %s

	%0 = type <{ %1, %1 }>			%0 = type <{ %1, %1 }>
	%1 = type { i8, i8, i8, i8 }			%1 = type { i8, i8, i8, i8 }

	@g_2 = global %0 zeroinitializer			@g_2 = global %0 zeroinitializer
	@g_4 = global %1 zeroinitializer, align 4			@g_4 = global %1 zeroinitializer, align 4

	; PR8642			; PR8642
	define i16 @test1(i1 zeroext %C, i8** nocapture %argv) nounwind ssp {			define i16 @test1(i1 zeroext %C, i8** nocapture %argv) nounwind ssp {
	; CHECK-LABEL: test1:			; CHECK-LABEL: test1:
	; CHECK: # BB#0: # %entry			; CHECK: # BB#0: # %entry
	; CHECK-NEXT: movw $1, %ax			; CHECK-NEXT: movw $1, %ax
	; CHECK-NEXT: testb %dil, %dil			; CHECK-NEXT: testl %edi, %edi
	; CHECK-NEXT: jne .LBB0_2			; CHECK-NEXT: jne .LBB0_2
	; CHECK-NEXT: # BB#1: # %cond.false.i			; CHECK-NEXT: # BB#1: # %cond.false.i
	; CHECK-NEXT: movl $g_4, %eax			; CHECK-NEXT: movl $g_4, %eax
	; CHECK-NEXT: movl $g_2+4, %ecx			; CHECK-NEXT: movl $g_2+4, %ecx
	; CHECK-NEXT: xorl %esi, %esi			; CHECK-NEXT: xorl %esi, %esi
	; CHECK-NEXT: cmpq %rax, %rcx			; CHECK-NEXT: cmpq %rax, %rcx
	; CHECK-NEXT: sete %sil			; CHECK-NEXT: sete %sil
	; CHECK-NEXT: movl $1, %eax			; CHECK-NEXT: movl $1, %eax
	Show All 17 Lines

test/CodeGen/X86/fp128-select.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -O2 -mtriple=x86_64-linux-android -mattr=+mmx \			; RUN: llc < %s -O2 -mtriple=x86_64-linux-android -mattr=+mmx \
	; RUN: -enable-legalize-types-checking \| FileCheck %s --check-prefix=MMX			; RUN: -enable-legalize-types-checking \| FileCheck %s --check-prefix=MMX
	; RUN: llc < %s -O2 -mtriple=x86_64-linux-gnu -mattr=+mmx \			; RUN: llc < %s -O2 -mtriple=x86_64-linux-gnu -mattr=+mmx \
	; RUN: -enable-legalize-types-checking \| FileCheck %s --check-prefix=MMX			; RUN: -enable-legalize-types-checking \| FileCheck %s --check-prefix=MMX
	; RUN: llc < %s -O2 -mtriple=x86_64-linux-android \			; RUN: llc < %s -O2 -mtriple=x86_64-linux-android \
	; RUN: -enable-legalize-types-checking \| FileCheck %s			; RUN: -enable-legalize-types-checking \| FileCheck %s
	; RUN: llc < %s -O2 -mtriple=x86_64-linux-gnu \			; RUN: llc < %s -O2 -mtriple=x86_64-linux-gnu \
	; RUN: -enable-legalize-types-checking \| FileCheck %s			; RUN: -enable-legalize-types-checking \| FileCheck %s

	define void @test_select(fp128* %p, fp128* %q, i1 zeroext %c) {			define void @test_select(fp128* %p, fp128* %q, i1 zeroext %c) {
	; MMX-LABEL: test_select:			; MMX-LABEL: test_select:
	; MMX: # BB#0:			; MMX: # BB#0:
	; MMX-NEXT: testb %dl, %dl			; MMX-NEXT: testl %edx, %edx
	; MMX-NEXT: jne .LBB0_1			; MMX-NEXT: jne .LBB0_1
	; MMX-NEXT: # BB#2:			; MMX-NEXT: # BB#2:
	; MMX-NEXT: movaps {{.*}}(%rip), %xmm0			; MMX-NEXT: movaps {{.*}}(%rip), %xmm0
	; MMX-NEXT: movaps %xmm0, (%rsi)			; MMX-NEXT: movaps %xmm0, (%rsi)
	; MMX-NEXT: retq			; MMX-NEXT: retq
	; MMX-NEXT: .LBB0_1:			; MMX-NEXT: .LBB0_1:
	; MMX-NEXT: movaps (%rdi), %xmm0			; MMX-NEXT: movaps (%rdi), %xmm0
	; MMX-NEXT: movaps %xmm0, (%rsi)			; MMX-NEXT: movaps %xmm0, (%rsi)
	; MMX-NEXT: retq			; MMX-NEXT: retq
	;			;
	; CHECK-LABEL: test_select:			; CHECK-LABEL: test_select:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: xorl %eax, %eax			; CHECK-NEXT: xorl %eax, %eax
	; CHECK-NEXT: testb %dl, %dl			; CHECK-NEXT: testl %edx, %edx
	; CHECK-NEXT: cmovneq (%rdi), %rax			; CHECK-NEXT: cmovneq (%rdi), %rax
	; CHECK-NEXT: movabsq $9223231299366420480, %rcx # imm = 0x7FFF800000000000			; CHECK-NEXT: movabsq $9223231299366420480, %rcx # imm = 0x7FFF800000000000
	; CHECK-NEXT: cmovneq 8(%rdi), %rcx			; CHECK-NEXT: cmovneq 8(%rdi), %rcx
	; CHECK-NEXT: movq %rcx, 8(%rsi)			; CHECK-NEXT: movq %rcx, 8(%rsi)
	; CHECK-NEXT: movq %rax, (%rsi)			; CHECK-NEXT: movq %rax, (%rsi)
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%a = load fp128, fp128* %p, align 2			%a = load fp128, fp128* %p, align 2
	%r = select i1 %c, fp128 %a, fp128 0xL00000000000000007FFF800000000000			%r = select i1 %c, fp128 %a, fp128 0xL00000000000000007FFF800000000000
	store fp128 %r, fp128* %q			store fp128 %r, fp128* %q
	ret void			ret void
	}			}

test/CodeGen/X86/illegal-bitfield-loadstore.ll

	Show First 20 Lines • Show All 81 Lines • ▼ Show 20 Lines
	; X86-NEXT: andl $16769023, %eax # imm = 0xFFDFFF			; X86-NEXT: andl $16769023, %eax # imm = 0xFFDFFF
	; X86-NEXT: orl %edx, %eax			; X86-NEXT: orl %edx, %eax
	; X86-NEXT: movw %ax, (%ecx)			; X86-NEXT: movw %ax, (%ecx)
	; X86-NEXT: popl %esi			; X86-NEXT: popl %esi
	; X86-NEXT: retl			; X86-NEXT: retl
	;			;
	; X64-LABEL: i24_insert_bit:			; X64-LABEL: i24_insert_bit:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %sil, %eax			; X64-NEXT: movzwl (%rdi), %eax
	; X64-NEXT: movzwl (%rdi), %ecx			; X64-NEXT: movzbl 2(%rdi), %ecx
	; X64-NEXT: movzbl 2(%rdi), %edx			; X64-NEXT: movb %cl, 2(%rdi)
	; X64-NEXT: movb %dl, 2(%rdi)			; X64-NEXT: shll $16, %ecx
	; X64-NEXT: shll $16, %edx			; X64-NEXT: orl %eax, %ecx
	; X64-NEXT: orl %ecx, %edx			; X64-NEXT: shll $13, %esi
	; X64-NEXT: shll $13, %eax			; X64-NEXT: andl $16769023, %ecx # imm = 0xFFDFFF
	; X64-NEXT: andl $16769023, %edx # imm = 0xFFDFFF			; X64-NEXT: orl %esi, %ecx
	; X64-NEXT: orl %eax, %edx			; X64-NEXT: movw %cx, (%rdi)
	; X64-NEXT: movw %dx, (%rdi)
	; X64-NEXT: retq			; X64-NEXT: retq
	%extbit = zext i1 %bit to i24			%extbit = zext i1 %bit to i24
	%b = load i24, i24* %a, align 1			%b = load i24, i24* %a, align 1
	%extbit.shl = shl nuw nsw i24 %extbit, 13			%extbit.shl = shl nuw nsw i24 %extbit, 13
	%c = and i24 %b, -8193			%c = and i24 %b, -8193
	%d = or i24 %c, %extbit.shl			%d = or i24 %c, %extbit.shl
	store i24 %d, i24* %a, align 1			store i24 %d, i24* %a, align 1
	ret void			ret void
	▲ Show 20 Lines • Show All 72 Lines • ▼ Show 20 Lines
	; X86-NEXT: movl $-8193, %edx # imm = 0xDFFF			; X86-NEXT: movl $-8193, %edx # imm = 0xDFFF
	; X86-NEXT: andl (%eax), %edx			; X86-NEXT: andl (%eax), %edx
	; X86-NEXT: orl %ecx, %edx			; X86-NEXT: orl %ecx, %edx
	; X86-NEXT: movl %edx, (%eax)			; X86-NEXT: movl %edx, (%eax)
	; X86-NEXT: retl			; X86-NEXT: retl
	;			;
	; X64-LABEL: i56_insert_bit:			; X64-LABEL: i56_insert_bit:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %sil, %eax			; X64-NEXT: # kill: %ESI<def> %ESI<kill> %RSI<def>
	; X64-NEXT: movzwl 4(%rdi), %ecx			; X64-NEXT: movzwl 4(%rdi), %eax
	; X64-NEXT: movzbl 6(%rdi), %edx			; X64-NEXT: movzbl 6(%rdi), %ecx
	; X64-NEXT: movl (%rdi), %esi			; X64-NEXT: movl (%rdi), %edx
	; X64-NEXT: movb %dl, 6(%rdi)			; X64-NEXT: movb %cl, 6(%rdi)
	; X64-NEXT: # kill: %EDX<def> %EDX<kill> %RDX<kill> %RDX<def>			; X64-NEXT: # kill: %ECX<def> %ECX<kill> %RCX<kill> %RCX<def>
	; X64-NEXT: shll $16, %edx			; X64-NEXT: shll $16, %ecx
	; X64-NEXT: orl %ecx, %edx			; X64-NEXT: orl %eax, %ecx
	; X64-NEXT: shlq $32, %rdx			; X64-NEXT: shlq $32, %rcx
	; X64-NEXT: orq %rdx, %rsi			; X64-NEXT: orq %rcx, %rdx
	; X64-NEXT: shlq $13, %rax			; X64-NEXT: shlq $13, %rsi
	; X64-NEXT: movabsq $72057594037919743, %rcx # imm = 0xFFFFFFFFFFDFFF			; X64-NEXT: movabsq $72057594037919743, %rax # imm = 0xFFFFFFFFFFDFFF
	; X64-NEXT: andq %rsi, %rcx			; X64-NEXT: andq %rdx, %rax
	; X64-NEXT: orq %rax, %rcx			; X64-NEXT: orq %rsi, %rax
	; X64-NEXT: movl %ecx, (%rdi)			; X64-NEXT: movl %eax, (%rdi)
	; X64-NEXT: shrq $32, %rcx			; X64-NEXT: shrq $32, %rax
	; X64-NEXT: movw %cx, 4(%rdi)			; X64-NEXT: movw %ax, 4(%rdi)
	; X64-NEXT: retq			; X64-NEXT: retq
	%extbit = zext i1 %bit to i56			%extbit = zext i1 %bit to i56
	%b = load i56, i56* %a, align 1			%b = load i56, i56* %a, align 1
	%extbit.shl = shl nuw nsw i56 %extbit, 13			%extbit.shl = shl nuw nsw i56 %extbit, 13
	%c = and i56 %b, -8193			%c = and i56 %b, -8193
	%d = or i56 %c, %extbit.shl			%d = or i56 %c, %extbit.shl
	store i56 %d, i56* %a, align 1			store i56 %d, i56* %a, align 1
	ret void			ret void
	}			}

test/CodeGen/X86/mask-negated-bool.ll

Show All 10 Lines	; CHECK-NEXT: retq
%neg = sub i32 0, %ext		%neg = sub i32 0, %ext
%and = and i32 %neg, 1		%and = and i32 %neg, 1
ret i32 %and		ret i32 %and
}		}

define i32 @mask_negated_zext_bool2(i1 zeroext %x) {		define i32 @mask_negated_zext_bool2(i1 zeroext %x) {
; CHECK-LABEL: mask_negated_zext_bool2:		; CHECK-LABEL: mask_negated_zext_bool2:
; CHECK: # BB#0:		; CHECK: # BB#0:
; CHECK-NEXT: movzbl %dil, %eax		; CHECK-NEXT: movl %edi, %eax
; CHECK-NEXT: retq		; CHECK-NEXT: retq
%ext = zext i1 %x to i32		%ext = zext i1 %x to i32
%neg = sub i32 0, %ext		%neg = sub i32 0, %ext
%and = and i32 %neg, 1		%and = and i32 %neg, 1
ret i32 %and		ret i32 %and
}		}

define <4 x i32> @mask_negated_zext_bool_vec(<4 x i1> %x) {		define <4 x i32> @mask_negated_zext_bool_vec(<4 x i1> %x) {
Show All 17 Lines	; CHECK-NEXT: retq
%neg = sub i32 0, %ext		%neg = sub i32 0, %ext
%and = and i32 %neg, 1		%and = and i32 %neg, 1
ret i32 %and		ret i32 %and
}		}

define i32 @mask_negated_sext_bool2(i1 zeroext %x) {		define i32 @mask_negated_sext_bool2(i1 zeroext %x) {
; CHECK-LABEL: mask_negated_sext_bool2:		; CHECK-LABEL: mask_negated_sext_bool2:
; CHECK: # BB#0:		; CHECK: # BB#0:
; CHECK-NEXT: movzbl %dil, %eax		; CHECK-NEXT: movl %edi, %eax
; CHECK-NEXT: retq		; CHECK-NEXT: retq
%ext = sext i1 %x to i32		%ext = sext i1 %x to i32
%neg = sub i32 0, %ext		%neg = sub i32 0, %ext
%and = and i32 %neg, 1		%and = and i32 %neg, 1
ret i32 %and		ret i32 %and
}		}

define <4 x i32> @mask_negated_sext_bool_vec(<4 x i1> %x) {		define <4 x i32> @mask_negated_sext_bool_vec(<4 x i1> %x) {
Show All 10 Lines

test/CodeGen/X86/negate-i1.ll

	Show First 20 Lines • Show All 52 Lines • ▼ Show 20 Lines
	; X32-NEXT: retl			; X32-NEXT: retl
	%b = sext i1 %a to i16			%b = sext i1 %a to i16
	ret i16 %b			ret i16 %b
	}			}

	define i16 @select_i16_neg1_or_0_zeroext(i1 zeroext %a) {			define i16 @select_i16_neg1_or_0_zeroext(i1 zeroext %a) {
	; X64-LABEL: select_i16_neg1_or_0_zeroext:			; X64-LABEL: select_i16_neg1_or_0_zeroext:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %dil, %eax			; X64-NEXT: negl %edi
	; X64-NEXT: negl %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: # kill: %AX<def> %AX<kill> %EAX<kill>
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-LABEL: select_i16_neg1_or_0_zeroext:			; X32-LABEL: select_i16_neg1_or_0_zeroext:
	; X32: # BB#0:			; X32: # BB#0:
	; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: negl %eax			; X32-NEXT: negl %eax
	; X32-NEXT: # kill: %AX<def> %AX<kill> %EAX<kill>			; X32-NEXT: # kill: %AX<def> %AX<kill> %EAX<kill>
	; X32-NEXT: retl			; X32-NEXT: retl
	Show All 17 Lines
	; X32-NEXT: retl			; X32-NEXT: retl
	%b = sext i1 %a to i32			%b = sext i1 %a to i32
	ret i32 %b			ret i32 %b
	}			}

	define i32 @select_i32_neg1_or_0_zeroext(i1 zeroext %a) {			define i32 @select_i32_neg1_or_0_zeroext(i1 zeroext %a) {
	; X64-LABEL: select_i32_neg1_or_0_zeroext:			; X64-LABEL: select_i32_neg1_or_0_zeroext:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %dil, %eax			; X64-NEXT: negl %edi
	; X64-NEXT: negl %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-LABEL: select_i32_neg1_or_0_zeroext:			; X32-LABEL: select_i32_neg1_or_0_zeroext:
	; X32: # BB#0:			; X32: # BB#0:
	; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: negl %eax			; X32-NEXT: negl %eax
	; X32-NEXT: retl			; X32-NEXT: retl
	%b = sext i1 %a to i32			%b = sext i1 %a to i32
	Show All 18 Lines
	; X32-NEXT: retl			; X32-NEXT: retl
	%b = sext i1 %a to i64			%b = sext i1 %a to i64
	ret i64 %b			ret i64 %b
	}			}

	define i64 @select_i64_neg1_or_0_zeroext(i1 zeroext %a) {			define i64 @select_i64_neg1_or_0_zeroext(i1 zeroext %a) {
	; X64-LABEL: select_i64_neg1_or_0_zeroext:			; X64-LABEL: select_i64_neg1_or_0_zeroext:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %dil, %eax			; X64-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; X64-NEXT: negq %rax			; X64-NEXT: negq %rdi
				; X64-NEXT: movq %rdi, %rax
	; X64-NEXT: retq			; X64-NEXT: retq
	;			;
	; X32-LABEL: select_i64_neg1_or_0_zeroext:			; X32-LABEL: select_i64_neg1_or_0_zeroext:
	; X32: # BB#0:			; X32: # BB#0:
	; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: negl %eax			; X32-NEXT: negl %eax
	; X32-NEXT: movl %eax, %edx			; X32-NEXT: movl %eax, %edx
	; X32-NEXT: retl			; X32-NEXT: retl
	%b = sext i1 %a to i64			%b = sext i1 %a to i64
	ret i64 %b			ret i64 %b
	}			}

test/CodeGen/X86/select_const.ll

	Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 1, i32 0			%sel = select i1 %cond, i32 1, i32 0
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_1_or_0_zeroext(i1 zeroext %cond) {			define i32 @select_1_or_0_zeroext(i1 zeroext %cond) {
	; CHECK-LABEL: select_1_or_0_zeroext:			; CHECK-LABEL: select_1_or_0_zeroext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 1, i32 0			%sel = select i1 %cond, i32 1, i32 0
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_1_or_0_signext(i1 signext %cond) {			define i32 @select_1_or_0_signext(i1 signext %cond) {
	; CHECK-LABEL: select_1_or_0_signext:			; CHECK-LABEL: select_1_or_0_signext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: andb $1, %dil			; CHECK-NEXT: andl $1, %edi
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 1, i32 0			%sel = select i1 %cond, i32 1, i32 0
	ret i32 %sel			ret i32 %sel
	}			}

	; select Cond, 0, -1 --> sext (!Cond)			; select Cond, 0, -1 --> sext (!Cond)

	define i32 @select_0_or_neg1(i1 %cond) {			define i32 @select_0_or_neg1(i1 %cond) {
	; CHECK-LABEL: select_0_or_neg1:			; CHECK-LABEL: select_0_or_neg1:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>			; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; CHECK-NEXT: andl $1, %edi			; CHECK-NEXT: andl $1, %edi
	; CHECK-NEXT: leal -1(%rdi), %eax			; CHECK-NEXT: leal -1(%rdi), %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 0, i32 -1			%sel = select i1 %cond, i32 0, i32 -1
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_0_or_neg1_zeroext(i1 zeroext %cond) {			define i32 @select_0_or_neg1_zeroext(i1 zeroext %cond) {
	; CHECK-LABEL: select_0_or_neg1_zeroext:			; CHECK-LABEL: select_0_or_neg1_zeroext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; CHECK-NEXT: decl %eax			; CHECK-NEXT: leal -1(%rdi), %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 0, i32 -1			%sel = select i1 %cond, i32 0, i32 -1
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_0_or_neg1_signext(i1 signext %cond) {			define i32 @select_0_or_neg1_signext(i1 signext %cond) {
	; CHECK-LABEL: select_0_or_neg1_signext:			; CHECK-LABEL: select_0_or_neg1_signext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: andb $1, %dil			; CHECK-NEXT: notl %edi
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: decl %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 0, i32 -1			%sel = select i1 %cond, i32 0, i32 -1
	ret i32 %sel			ret i32 %sel
	}			}

	; select Cond, -1, 0 --> sext (Cond)			; select Cond, -1, 0 --> sext (Cond)

	define i32 @select_neg1_or_0(i1 %cond) {			define i32 @select_neg1_or_0(i1 %cond) {
	; CHECK-LABEL: select_neg1_or_0:			; CHECK-LABEL: select_neg1_or_0:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: andl $1, %edi			; CHECK-NEXT: andl $1, %edi
	; CHECK-NEXT: negl %edi			; CHECK-NEXT: negl %edi
	; CHECK-NEXT: movl %edi, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 -1, i32 0			%sel = select i1 %cond, i32 -1, i32 0
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_neg1_or_0_zeroext(i1 zeroext %cond) {			define i32 @select_neg1_or_0_zeroext(i1 zeroext %cond) {
	; CHECK-LABEL: select_neg1_or_0_zeroext:			; CHECK-LABEL: select_neg1_or_0_zeroext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: negl %edi
	; CHECK-NEXT: negl %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 -1, i32 0			%sel = select i1 %cond, i32 -1, i32 0
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_neg1_or_0_signext(i1 signext %cond) {			define i32 @select_neg1_or_0_signext(i1 signext %cond) {
	; CHECK-LABEL: select_neg1_or_0_signext:			; CHECK-LABEL: select_neg1_or_0_signext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: movsbl %dil, %eax			; CHECK-NEXT: movl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 -1, i32 0			%sel = select i1 %cond, i32 -1, i32 0
	ret i32 %sel			ret i32 %sel
	}			}

	; select Cond, C+1, C --> add (zext Cond), C			; select Cond, C+1, C --> add (zext Cond), C

	define i32 @select_Cplus1_C(i1 %cond) {			define i32 @select_Cplus1_C(i1 %cond) {
	; CHECK-LABEL: select_Cplus1_C:			; CHECK-LABEL: select_Cplus1_C:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>			; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; CHECK-NEXT: andl $1, %edi			; CHECK-NEXT: andl $1, %edi
	; CHECK-NEXT: leal 41(%rdi), %eax			; CHECK-NEXT: leal 41(%rdi), %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 42, i32 41			%sel = select i1 %cond, i32 42, i32 41
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_Cplus1_C_zeroext(i1 zeroext %cond) {			define i32 @select_Cplus1_C_zeroext(i1 zeroext %cond) {
	; CHECK-LABEL: select_Cplus1_C_zeroext:			; CHECK-LABEL: select_Cplus1_C_zeroext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; CHECK-NEXT: addl $41, %eax			; CHECK-NEXT: leal 41(%rdi), %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 42, i32 41			%sel = select i1 %cond, i32 42, i32 41
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_Cplus1_C_signext(i1 signext %cond) {			define i32 @select_Cplus1_C_signext(i1 signext %cond) {
	; CHECK-LABEL: select_Cplus1_C_signext:			; CHECK-LABEL: select_Cplus1_C_signext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: andb $1, %dil			; CHECK-NEXT: movl $41, %eax
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: subl %edi, %eax
	; CHECK-NEXT: addl $41, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 42, i32 41			%sel = select i1 %cond, i32 42, i32 41
	ret i32 %sel			ret i32 %sel
	}			}

	; select Cond, C, C+1 --> add (sext Cond), C			; select Cond, C, C+1 --> add (sext Cond), C

	define i32 @select_C_Cplus1(i1 %cond) {			define i32 @select_C_Cplus1(i1 %cond) {
	; CHECK-LABEL: select_C_Cplus1:			; CHECK-LABEL: select_C_Cplus1:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: andl $1, %edi			; CHECK-NEXT: andl $1, %edi
	; CHECK-NEXT: movl $42, %eax			; CHECK-NEXT: movl $42, %eax
	; CHECK-NEXT: subl %edi, %eax			; CHECK-NEXT: subl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 41, i32 42			%sel = select i1 %cond, i32 41, i32 42
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_C_Cplus1_zeroext(i1 zeroext %cond) {			define i32 @select_C_Cplus1_zeroext(i1 zeroext %cond) {
	; CHECK-LABEL: select_C_Cplus1_zeroext:			; CHECK-LABEL: select_C_Cplus1_zeroext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: movzbl %dil, %ecx
	; CHECK-NEXT: movl $42, %eax			; CHECK-NEXT: movl $42, %eax
	; CHECK-NEXT: subl %ecx, %eax			; CHECK-NEXT: subl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 41, i32 42			%sel = select i1 %cond, i32 41, i32 42
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_C_Cplus1_signext(i1 signext %cond) {			define i32 @select_C_Cplus1_signext(i1 signext %cond) {
	; CHECK-LABEL: select_C_Cplus1_signext:			; CHECK-LABEL: select_C_Cplus1_signext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: andb $1, %dil			; CHECK-NEXT: andl $1, %edi
	; CHECK-NEXT: movzbl %dil, %ecx
	; CHECK-NEXT: movl $42, %eax			; CHECK-NEXT: movl $42, %eax
	; CHECK-NEXT: subl %ecx, %eax			; CHECK-NEXT: subl %edi, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 41, i32 42			%sel = select i1 %cond, i32 41, i32 42
	ret i32 %sel			ret i32 %sel
	}			}

	; If the constants differ by a small multiplier, use LEA.			; If the constants differ by a small multiplier, use LEA.
	; select Cond, C1, C2 --> add (mul (zext Cond), C1-C2), C2 --> LEA C2(Cond * (C1-C2))			; select Cond, C1, C2 --> add (mul (zext Cond), C1-C2), C2 --> LEA C2(Cond * (C1-C2))

	▲ Show 20 Lines • Show All 141 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i16 7, i16 71			%sel = select i1 %cond, i16 7, i16 71
	ret i16 %sel			ret i16 %sel
	}			}

	define i32 @select_pow2_diff_neg(i1 zeroext %cond) {			define i32 @select_pow2_diff_neg(i1 zeroext %cond) {
	; CHECK-LABEL: select_pow2_diff_neg:			; CHECK-LABEL: select_pow2_diff_neg:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: shlb $4, %dil			; CHECK-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; CHECK-NEXT: movzbl %dil, %eax			; CHECK-NEXT: shll $4, %edi
	; CHECK-NEXT: orl $-25, %eax			; CHECK-NEXT: leal -25(%rdi), %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 -9, i32 -25			%sel = select i1 %cond, i32 -9, i32 -25
	ret i32 %sel			ret i32 %sel
	}			}

	define i64 @select_pow2_diff_neg_invert(i1 zeroext %cond) {			define i64 @select_pow2_diff_neg_invert(i1 zeroext %cond) {
	; CHECK-LABEL: select_pow2_diff_neg_invert:			; CHECK-LABEL: select_pow2_diff_neg_invert:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	Show All 37 Lines
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 421, i32 42			%sel = select i1 %cond, i32 421, i32 42
	ret i32 %sel			ret i32 %sel
	}			}

	define i32 @select_C1_C2_zeroext(i1 zeroext %cond) {			define i32 @select_C1_C2_zeroext(i1 zeroext %cond) {
	; CHECK-LABEL: select_C1_C2_zeroext:			; CHECK-LABEL: select_C1_C2_zeroext:
	; CHECK: # BB#0:			; CHECK: # BB#0:
	; CHECK-NEXT: testb %dil, %dil			; CHECK-NEXT: testl %edi, %edi
	; CHECK-NEXT: movl $421, %ecx # imm = 0x1A5			; CHECK-NEXT: movl $421, %ecx # imm = 0x1A5
	; CHECK-NEXT: movl $42, %eax			; CHECK-NEXT: movl $42, %eax
	; CHECK-NEXT: cmovnel %ecx, %eax			; CHECK-NEXT: cmovnel %ecx, %eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%sel = select i1 %cond, i32 421, i32 42			%sel = select i1 %cond, i32 421, i32 42
	ret i32 %sel			ret i32 %sel
	}			}

	▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

test/CodeGen/X86/sext-i1.ll

	Show First 20 Lines • Show All 138 Lines • ▼ Show 20 Lines
	; X32-LABEL: select_0_or_1s_zeroext:			; X32-LABEL: select_0_or_1s_zeroext:
	; X32: # BB#0:			; X32: # BB#0:
	; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax			; X32-NEXT: movzbl {{[0-9]+}}(%esp), %eax
	; X32-NEXT: decl %eax			; X32-NEXT: decl %eax
	; X32-NEXT: retl			; X32-NEXT: retl
	;			;
	; X64-LABEL: select_0_or_1s_zeroext:			; X64-LABEL: select_0_or_1s_zeroext:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: movzbl %dil, %eax			; X64-NEXT: # kill: %EDI<def> %EDI<kill> %RDI<def>
	; X64-NEXT: decl %eax			; X64-NEXT: leal -1(%rdi), %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%not = xor i1 %cond, 1			%not = xor i1 %cond, 1
	%sext = sext i1 %not to i32			%sext = sext i1 %not to i32
	ret i32 %sext			ret i32 %sext
	}			}

	; sext (xor Bool, -1) --> sub (zext Bool), 1			; sext (xor Bool, -1) --> sub (zext Bool), 1

	define i32 @select_0_or_1s_signext(i1 signext %cond) {			define i32 @select_0_or_1s_signext(i1 signext %cond) {
	; X32-LABEL: select_0_or_1s_signext:			; X32-LABEL: select_0_or_1s_signext:
	; X32: # BB#0:			; X32: # BB#0:
	; X32-NEXT: movb {{[0-9]+}}(%esp), %al			; X32-NEXT: movb {{[0-9]+}}(%esp), %al
	; X32-NEXT: andb $1, %al			; X32-NEXT: andb $1, %al
	; X32-NEXT: movzbl %al, %eax			; X32-NEXT: movzbl %al, %eax
	; X32-NEXT: decl %eax			; X32-NEXT: decl %eax
	; X32-NEXT: retl			; X32-NEXT: retl
	;			;
	; X64-LABEL: select_0_or_1s_signext:			; X64-LABEL: select_0_or_1s_signext:
	; X64: # BB#0:			; X64: # BB#0:
	; X64-NEXT: andb $1, %dil			; X64-NEXT: notl %edi
	; X64-NEXT: movzbl %dil, %eax			; X64-NEXT: movl %edi, %eax
	; X64-NEXT: decl %eax
	; X64-NEXT: retq			; X64-NEXT: retq
	%not = xor i1 %cond, 1			%not = xor i1 %cond, 1
	%sext = sext i1 %not to i32			%sext = sext i1 %not to i32
	ret i32 %sext			ret i32 %sext
	}			}

test/CodeGen/X86/split-store.ll

Show First 20 Lines • Show All 173 Lines • ▼ Show 20 Lines	entry:
%t2 = shl nuw i24 %t1, 12		%t2 = shl nuw i24 %t1, 12
%t3 = zext i12 %tmp1 to i24		%t3 = zext i12 %tmp1 to i24
%t4 = or i24 %t2, %t3		%t4 = or i24 %t2, %t3
store i24 %t4, i24* %ref.tmp, align 2		store i24 %t4, i24* %ref.tmp, align 2
ret void		ret void
}		}

; getTypeSizeInBits(i14) != getTypeStoreSizeInBits(i14), so store split doesn't kick in.		; getTypeSizeInBits(i14) != getTypeStoreSizeInBits(i14), so store split doesn't kick in.
; CHECK-LABEL: int7_int7_pair
; CHECK: movzbl %sil, %eax
; CHECK: shll $7, %eax
; CHECK: andb $127, %dil
; CHECK: movzbl %dil, %ecx
; CHECK: orl %eax, %ecx
; CHECK: andl $16383, %ecx
; CHECK: movw %cx, (%rdx)
define void @int7_int7_pair(i7 signext %tmp1, i7 signext %tmp2, i14* %ref.tmp) {		define void @int7_int7_pair(i7 signext %tmp1, i7 signext %tmp2, i14* %ref.tmp) {
entry:		; CHECK-LABEL: int7_int7_pair:
		; CHECK: # BB#0:
		; CHECK-NEXT: shll $7, %esi
		; CHECK-NEXT: andl $127, %edi
		; CHECK-NEXT: orl %esi, %edi
		; CHECK-NEXT: andl $16383, %edi # imm = 0x3FFF
		; CHECK-NEXT: movw %di, (%rdx)
		; CHECK-NEXT: retq
%t1 = zext i7 %tmp2 to i14		%t1 = zext i7 %tmp2 to i14
%t2 = shl nuw i14 %t1, 7		%t2 = shl nuw i14 %t1, 7
%t3 = zext i7 %tmp1 to i14		%t3 = zext i7 %tmp1 to i14
%t4 = or i14 %t2, %t3		%t4 = or i14 %t2, %t3
store i14 %t4, i14* %ref.tmp, align 2		store i14 %t4, i14* %ref.tmp, align 2
ret void		ret void
}		}

▲ Show 20 Lines • Show All 56 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[x86] use the IR type of formal args to create assertzext/assertsext and scalar truncate nodesAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 112381

lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp

lib/Target/X86/X86ISelLowering.cpp

test/CodeGen/X86/bool-zext.ll

test/CodeGen/X86/critical-edge-split-2.ll

test/CodeGen/X86/fp128-select.ll

test/CodeGen/X86/illegal-bitfield-loadstore.ll

test/CodeGen/X86/mask-negated-bool.ll

test/CodeGen/X86/negate-i1.ll

test/CodeGen/X86/select_const.ll

test/CodeGen/X86/sext-i1.ll

test/CodeGen/X86/split-store.ll

[x86] use the IR type of formal args to create assertzext/assertsext and scalar truncate nodes
AbandonedPublic