This is an archive of the discontinued LLVM Phabricator instance.

[X86] combineADC - fold ADC(C1,C2,Carry) -> ADC(0,C1+C2,Carry)
ClosedPublic

Authored by RKSimon on Mar 25 2022, 7:19 AM.

Download Raw Diff

Details

Reviewers

craig.topper
pengfei
spatel
lebedev.ri

Commits

rG6697e3354fbe: [X86] combineADC - fold ADC(C1,C2,Carry) -> ADC(0,C1+C2,Carry)

Summary

If we're not relying on the flag result, we can fold the constants together into the RHS immediate operand and set the LHS operand to zero, simplifying for further folds.

We could do something similar if the flag result is in use and the constant fold doesn't affect it, but I don't have any real test cases for this yet.

As suggested by @davezarzycki on Issue #35256

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

RKSimon created this revision.Mar 25 2022, 7:19 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 25 2022, 7:19 AM

Herald added subscribers: StephenFan, javed.absar, hiraditya. · View Herald Transcript

RKSimon requested review of this revision.Mar 25 2022, 7:19 AM

Herald added a project: Restricted Project. · View Herald TranscriptMar 25 2022, 7:19 AM

Harbormaster completed remote builds in B156282: Diff 418216.Mar 25 2022, 8:00 AM

This is a great improvement but a scenario is still missing. For example, from a build of LLVM with this change applied (i.e. self hosted):

000000000180e020 <_ZNK4llvm12X86InstrInfo30getFMA3OpcodeToCommuteOperandsERKNS_12MachineInstrEjjRKNS_17X86InstrFMA3GroupE>:
180e020: mov 0x10(%rsi),%rax
180e024: movzwl (%rax),%r9d
180e028: mov 0x10(%rax),%rsi
180e02c: cmp %ecx,%edx
180e02e: mov %ecx,%edi
180e030: cmova %edx,%ecx
180e033: cmovb %edx,%edi
180e036: xor %eax,%eax
180e038: bt $0x2a,%rsi
180e03d: mov $0x0,%edx
180e042: adc $0x2,%edx
180e045: cmp $0x1,%edi

Sorry, I can't see it - what were you hoping it would fold to?

I was just surprised to see that the last four lines didn't swap the BT and the zeroing of EDX. Maybe this is a separate problem. This seems like the natural output of the last four lines of the example above:

xor %edx,%edx
bt $0x2a,%rsi
adc $0x2,%edx
cmp $0x1,%edi

Yes, the schedulers don't seem to do a good job of working around eflags uses.

Okay. I'll try to remember to create a followup bug after this lands. I'm sure I'm naive about it, but it seems to me that after register allocation, MOV $0, <reg> instructions should move earlier (when possible) in the basic block to get out of the way of EFLAG dependencies and allow the MOV $0, <reg> to XOR <reg>, <reg> optimization, but that's just me hand waving.

craig.topper added inline comments.Mar 26 2022, 5:49 PM

llvm/test/CodeGen/X86/combine-adc.ll
88	This still seems more complicated that it needs to be.

RKSimon added inline comments.Mar 27 2022, 10:49 AM

llvm/test/CodeGen/X86/combine-adc.ll
88	Yes combineCarryThroughADD is missing a fold to X86ISD::BT, its on my todo list.......

Allen added a subscriber: Allen.Mar 27 2022, 1:38 PM

RKSimon mentioned this in D122572: [X86] combineCarryThroughADD - recognise X86ISD::ADD(AND(X,1),-1) pattern can be folded to X86ISD::BT.Mar 28 2022, 4:20 AM

any further comments?

llvm/test/CodeGen/X86/combine-adc.ll
88	This is addressed in D122572

LGTM

This revision is now accepted and ready to land.Mar 29 2022, 10:24 AM

This revision was landed with ongoing or failed builds.Mar 30 2022, 1:13 AM

Closed by commit rG6697e3354fbe: [X86] combineADC - fold ADC(C1,C2,Carry) -> ADC(0,C1+C2,Carry) (authored by RKSimon). · Explain Why

This revision was automatically updated to reflect the committed changes.

RKSimon added a commit: rG6697e3354fbe: [X86] combineADC - fold ADC(C1,C2,Carry) -> ADC(0,C1+C2,Carry).

RKSimon mentioned this in rG481b18562077: [X86] combineCarryThroughADD - recognise X86ISD::ADD(AND(X,1),-1) pattern can….Mar 31 2022, 1:54 AM

Revision Contents

Path

Size

llvm/

lib/

Target/

X86/

X86ISelLowering.cpp

11 lines

test/

CodeGen/

X86/

23 lines

17 lines

5 lines

19 lines

scheduler-backtracking.ll

72 lines

setcc.ll

8 lines

Diff 419064

llvm/lib/Target/X86/X86ISelLowering.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 52,299 Lines • ▼ Show 20 Lines	if (LHSC && RHSC && LHSC->isZero() && RHSC->isZero() &&
SDValue Res1 = DAG.getNode(		SDValue Res1 = DAG.getNode(
ISD::AND, DL, VT,		ISD::AND, DL, VT,
DAG.getNode(X86ISD::SETCC_CARRY, DL, VT,		DAG.getNode(X86ISD::SETCC_CARRY, DL, VT,
DAG.getTargetConstant(X86::COND_B, DL, MVT::i8), CarryIn),		DAG.getTargetConstant(X86::COND_B, DL, MVT::i8), CarryIn),
DAG.getConstant(1, DL, VT));		DAG.getConstant(1, DL, VT));
return DCI.CombineTo(N, Res1, CarryOut);		return DCI.CombineTo(N, Res1, CarryOut);
}		}

		// Fold ADC(C1,C2,Carry) -> ADC(0,C1+C2,Carry)
		// iff the flag result is dead.
		// TODO: Allow flag result if C1+C2 doesn't signed/unsigned overflow.
		if (LHSC && RHSC && !LHSC->isZero() && !N->hasAnyUseOfValue(1)) {
		SDLoc DL(N);
		APInt Sum = LHSC->getAPIntValue() + RHSC->getAPIntValue();
		return DAG.getNode(X86ISD::ADC, DL, N->getVTList(),
		DAG.getConstant(0, DL, LHS.getValueType()),
		DAG.getConstant(Sum, DL, LHS.getValueType()), CarryIn);
		}

if (SDValue Flags = combineCarryThroughADD(CarryIn, DAG)) {		if (SDValue Flags = combineCarryThroughADD(CarryIn, DAG)) {
MVT VT = N->getSimpleValueType(0);		MVT VT = N->getSimpleValueType(0);
SDVTList VTs = DAG.getVTList(VT, MVT::i32);		SDVTList VTs = DAG.getVTList(VT, MVT::i32);
return DAG.getNode(X86ISD::ADC, SDLoc(N), VTs, LHS, RHS, Flags);		return DAG.getNode(X86ISD::ADC, SDLoc(N), VTs, LHS, RHS, Flags);
}		}

// Fold ADC(ADD(X,Y),0,Carry) -> ADC(X,Y,Carry)		// Fold ADC(ADD(X,Y),0,Carry) -> ADC(X,Y,Carry)
// iff the flag result is dead.		// iff the flag result is dead.
▲ Show 20 Lines • Show All 3,385 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/call-rv-marker.ll

	Show First 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
	;			;
	entry:			entry:
	%call = call i8* @foo1() [ "clang.arc.attachedcall"(i8* (i8) @objc_unsafeClaimAutoreleasedReturnValue) ]			%call = call i8* @foo1() [ "clang.arc.attachedcall"(i8* (i8) @objc_unsafeClaimAutoreleasedReturnValue) ]
	ret i8* %call			ret i8* %call
	}			}

	define void @rv_marker_2_select(i32 %c) {			define void @rv_marker_2_select(i32 %c) {
	; CHECK-LABEL: rv_marker_2_select:			; CHECK-LABEL: rv_marker_2_select:
	; CHECK: pushq %rax			; CHECK: pushq %rax
	; CHECK-NEXT: .cfi_def_cfa_offset 16			; CHECK-NEXT: .cfi_def_cfa_offset 16
				; CHECK-NEXT: xorl %eax, %eax
	; CHECK-NEXT: cmpl $1, %edi			; CHECK-NEXT: cmpl $1, %edi
	; CHECK-NEXT: movl $1, %edi			; CHECK-NEXT: adcl $1, %eax
	; CHECK-NEXT: adcl $0, %edi			; CHECK-NEXT: movl %eax, %edi
	; CHECK-NEXT: callq _foo0			; CHECK-NEXT: callq _foo0
	; CHECK-NEXT: movq %rax, %rdi			; CHECK-NEXT: movq %rax, %rdi
	; CHECK-NEXT: callq _objc_retainAutoreleasedReturnValue			; CHECK-NEXT: callq _objc_retainAutoreleasedReturnValue
	; CHECK-NEXT: movq %rax, %rdi			; CHECK-NEXT: movq %rax, %rdi
	; CHECK-NEXT: popq %rax			; CHECK-NEXT: popq %rax
	; CHECK-NEXT: jmp _foo2			; CHECK-NEXT: jmp _foo2
	;			;
	entry:			entry:
	%tobool.not = icmp eq i32 %c, 0			%tobool.not = icmp eq i32 %c, 0
	%.sink = select i1 %tobool.not, i32 2, i32 1			%.sink = select i1 %tobool.not, i32 2, i32 1
	%call1 = call i8* @foo0(i32 %.sink) [ "clang.arc.attachedcall"(i8* (i8) @objc_retainAutoreleasedReturnValue) ]			%call1 = call i8* @foo0(i32 %.sink) [ "clang.arc.attachedcall"(i8* (i8) @objc_retainAutoreleasedReturnValue) ]
	tail call void @foo2(i8* %call1)			tail call void @foo2(i8* %call1)
	ret void			ret void
	}			}
	▲ Show 20 Lines • Show All 186 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/combine-adc.ll

Show First 20 Lines • Show All 61 Lines • ▼ Show 20 Lines	; X64-NEXT: retq
%6 = extractvalue { i8, i32 } %4, 0		%6 = extractvalue { i8, i32 } %4, 0
%7 = icmp eq i8 %6, 0		%7 = icmp eq i8 %6, 0
%8 = add i32 %3, %1		%8 = add i32 %3, %1
%9 = or i32 %5, %8		%9 = or i32 %5, %8
%10 = select i1 %7, i32 0, i32 %9		%10 = select i1 %7, i32 0, i32 %9
ret i32 %10		ret i32 %10
}		}

; FIXME: Fail to add (non-overflowing) constants together
; FIXME: Fail to convert add+lshr+and to BT		; FIXME: Fail to convert add+lshr+and to BT
define i32 @adc_merge_constants(i32 %a0) nounwind {		define i32 @adc_merge_constants(i32 %a0) nounwind {
; X86-LABEL: adc_merge_constants:		; X86-LABEL: adc_merge_constants:
; X86: # %bb.0:		; X86: # %bb.0:
; X86-NEXT: movl {{[0-9]+}}(%esp), %eax		; X86-NEXT: movl {{[0-9]+}}(%esp), %ecx
; X86-NEXT: shrl $11, %eax		; X86-NEXT: shrl $11, %ecx
; X86-NEXT: andb $1, %al		; X86-NEXT: andb $1, %cl
; X86-NEXT: addb $-1, %al		; X86-NEXT: xorl %eax, %eax
; X86-NEXT: movl $55, %eax		; X86-NEXT: addb $-1, %cl
; X86-NEXT: adcl $-1, %eax		; X86-NEXT: adcl $54, %eax
; X86-NEXT: retl		; X86-NEXT: retl
;		;
; X64-LABEL: adc_merge_constants:		; X64-LABEL: adc_merge_constants:
; X64: # %bb.0:		; X64: # %bb.0:
; X64-NEXT: shrl $11, %edi		; X64-NEXT: shrl $11, %edi
; X64-NEXT: andb $1, %dil		; X64-NEXT: andb $1, %dil
		; X64-NEXT: xorl %eax, %eax
; X64-NEXT: addb $-1, %dil		; X64-NEXT: addb $-1, %dil
; X64-NEXT: movl $55, %eax		; X64-NEXT: adcl $54, %eax
		craig.topperUnsubmitted Not Done Reply Inline Actions This still seems more complicated that it needs to be. craig.topper: This still seems more complicated that it needs to be.
		RKSimonAuthorUnsubmitted Not Done Reply Inline Actions Yes combineCarryThroughADD is missing a fold to X86ISD::BT, its on my todo list....... RKSimon: Yes combineCarryThroughADD is missing a fold to X86ISD::BT, its on my todo list.......
		RKSimonAuthorUnsubmitted Not Done Reply Inline Actions This is addressed in D122572 RKSimon: This is addressed in D122572
; X64-NEXT: adcl $-1, %eax
; X64-NEXT: retq		; X64-NEXT: retq
%bit = lshr i32 %a0, 11		%bit = lshr i32 %a0, 11
%mask = and i32 %bit, 1		%mask = and i32 %bit, 1
%isz = trunc i32 %mask to i8		%isz = trunc i32 %mask to i8
%adc = tail call { i8, i32 } @llvm.x86.addcarry.32(i8 %isz, i32 55, i32 -1)		%adc = tail call { i8, i32 } @llvm.x86.addcarry.32(i8 %isz, i32 55, i32 -1)
%sum = extractvalue { i8, i32 } %adc, 1		%sum = extractvalue { i8, i32 } %adc, 1
ret i32 %sum		ret i32 %sum
}		}

declare { i8, i32 } @llvm.x86.addcarry.32(i8, i32, i32)		declare { i8, i32 } @llvm.x86.addcarry.32(i8, i32, i32)

llvm/test/CodeGen/X86/combine-add.ll

	Show First 20 Lines • Show All 432 Lines • ▼ Show 20 Lines

	; This would crash because we tried to transform an add-with-overflow			; This would crash because we tried to transform an add-with-overflow
	; based on the wrong result value.			; based on the wrong result value.

	define i1 @PR51238(i1 %b, i8 %x, i8 %y, i8 %z) {			define i1 @PR51238(i1 %b, i8 %x, i8 %y, i8 %z) {
	; CHECK-LABEL: PR51238:			; CHECK-LABEL: PR51238:
	; CHECK: # %bb.0:			; CHECK: # %bb.0:
	; CHECK-NEXT: notb %cl			; CHECK-NEXT: notb %cl
				; CHECK-NEXT: xorl %eax, %eax
	; CHECK-NEXT: addb %dl, %cl			; CHECK-NEXT: addb %dl, %cl
	; CHECK-NEXT: movb $1, %al			; CHECK-NEXT: adcb $1, %al
	; CHECK-NEXT: adcb $0, %al			; CHECK-NEXT: # kill: def $al killed $al killed $eax
	; CHECK-NEXT: retq			; CHECK-NEXT: retq
	%ny = xor i8 %y, -1			%ny = xor i8 %y, -1
	%nz = xor i8 %z, -1			%nz = xor i8 %z, -1
	%minxz = select i1 %b, i8 %x, i8 %nz			%minxz = select i1 %b, i8 %x, i8 %nz
	%cmpyz = icmp ult i8 %ny, %nz			%cmpyz = icmp ult i8 %ny, %nz
	%r = add i1 %cmpyz, true			%r = add i1 %cmpyz, true
	ret i1 %r			ret i1 %r
	}			}

llvm/test/CodeGen/X86/pr16031.ll

	; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
	; RUN: llc < %s -mtriple=i386-unknown-linux-gnu -mcpu=corei7-avx -enable-misched=false \| FileCheck %s			; RUN: llc < %s -mtriple=i386-unknown-linux-gnu -mcpu=corei7-avx -enable-misched=false \| FileCheck %s

	define i64 @main(i1 %tobool1) nounwind {			define i64 @main(i1 %tobool1) nounwind {
	; CHECK-LABEL: main:			; CHECK-LABEL: main:
	; CHECK: # %bb.0: # %entry			; CHECK: # %bb.0: # %entry
	; CHECK-NEXT: pushl %esi
	; CHECK-NEXT: testb $1, {{[0-9]+}}(%esp)			; CHECK-NEXT: testb $1, {{[0-9]+}}(%esp)
	; CHECK-NEXT: movl $-12, %eax			; CHECK-NEXT: movl $-12, %ecx
	; CHECK-NEXT: movl $-1, %ecx			; CHECK-NEXT: movl $-1, %eax
	; CHECK-NEXT: cmovel %ecx, %eax			; CHECK-NEXT: cmovnel %ecx, %eax
				; CHECK-NEXT: xorl %ecx, %ecx
				; CHECK-NEXT: movl %eax, %edx
				; CHECK-NEXT: addl $-1, %edx
				; CHECK-NEXT: movl $0, %edx
				; CHECK-NEXT: adcl $-2, %edx
				; CHECK-NEXT: cmovsl %ecx, %eax
	; CHECK-NEXT: xorl %edx, %edx			; CHECK-NEXT: xorl %edx, %edx
	; CHECK-NEXT: movl %eax, %esi
	; CHECK-NEXT: addl $-1, %esi
	; CHECK-NEXT: adcl $-1, %ecx
	; CHECK-NEXT: cmovsl %edx, %eax
	; CHECK-NEXT: xorl %edx, %edx
	; CHECK-NEXT: popl %esi
	; CHECK-NEXT: retl			; CHECK-NEXT: retl
	entry:			entry:
	%0 = zext i1 %tobool1 to i32			%0 = zext i1 %tobool1 to i32
	%. = xor i32 %0, 1			%. = xor i32 %0, 1
	%.21 = select i1 %tobool1, i32 -12, i32 -1			%.21 = select i1 %tobool1, i32 -12, i32 -1
	%conv = sext i32 %.21 to i64			%conv = sext i32 %.21 to i64
	%1 = add i64 %conv, -1			%1 = add i64 %conv, -1
	%cmp10 = icmp slt i64 %1, 0			%cmp10 = icmp slt i64 %1, 0
	%sub17 = select i1 %cmp10, i64 0, i64 %conv			%sub17 = select i1 %cmp10, i64 0, i64 %conv
	ret i64 %sub17			ret i64 %sub17
	}			}

llvm/test/CodeGen/X86/scheduler-backtracking.ll

Show First 20 Lines • Show All 684 Lines • ▼ Show 20 Lines	; LIN-NEXT: retq
ret i256 %z		ret i256 %z
}		}

declare i256 @llvm.ctlz.i256(i256, i1) nounwind readnone		declare i256 @llvm.ctlz.i256(i256, i1) nounwind readnone

define i64 @test4(i64 %a, i64 %b) nounwind {		define i64 @test4(i64 %a, i64 %b) nounwind {
; ILP-LABEL: test4:		; ILP-LABEL: test4:
; ILP: # %bb.0:		; ILP: # %bb.0:
		; ILP-NEXT: xorl %eax, %eax
; ILP-NEXT: xorl %ecx, %ecx		; ILP-NEXT: xorl %ecx, %ecx
; ILP-NEXT: xorl %edx, %edx
; ILP-NEXT: incq %rsi		; ILP-NEXT: incq %rsi
; ILP-NEXT: sete %dl		; ILP-NEXT: sete %cl
; ILP-NEXT: movl $2, %eax
; ILP-NEXT: cmpq %rdi, %rsi		; ILP-NEXT: cmpq %rdi, %rsi
; ILP-NEXT: sbbq $0, %rdx		; ILP-NEXT: sbbq $0, %rcx
; ILP-NEXT: movl $0, %edx		; ILP-NEXT: movl $0, %ecx
; ILP-NEXT: sbbq %rdx, %rdx
; ILP-NEXT: sbbq %rcx, %rcx		; ILP-NEXT: sbbq %rcx, %rcx
; ILP-NEXT: adcq $-1, %rax		; ILP-NEXT: movl $0, %ecx
		; ILP-NEXT: sbbq %rcx, %rcx
		; ILP-NEXT: adcq $1, %rax
; ILP-NEXT: retq		; ILP-NEXT: retq
;		;
; HYBRID-LABEL: test4:		; HYBRID-LABEL: test4:
; HYBRID: # %bb.0:		; HYBRID: # %bb.0:
		; HYBRID-NEXT: xorl %eax, %eax
; HYBRID-NEXT: xorl %ecx, %ecx		; HYBRID-NEXT: xorl %ecx, %ecx
; HYBRID-NEXT: xorl %edx, %edx
; HYBRID-NEXT: incq %rsi		; HYBRID-NEXT: incq %rsi
; HYBRID-NEXT: sete %dl		; HYBRID-NEXT: sete %cl
; HYBRID-NEXT: movl $2, %eax
; HYBRID-NEXT: cmpq %rdi, %rsi		; HYBRID-NEXT: cmpq %rdi, %rsi
; HYBRID-NEXT: sbbq $0, %rdx		; HYBRID-NEXT: sbbq $0, %rcx
; HYBRID-NEXT: movl $0, %edx		; HYBRID-NEXT: movl $0, %ecx
; HYBRID-NEXT: sbbq %rdx, %rdx		; HYBRID-NEXT: sbbq %rcx, %rcx
		; HYBRID-NEXT: movl $0, %ecx
; HYBRID-NEXT: sbbq %rcx, %rcx		; HYBRID-NEXT: sbbq %rcx, %rcx
; HYBRID-NEXT: adcq $-1, %rax		; HYBRID-NEXT: adcq $1, %rax
; HYBRID-NEXT: retq		; HYBRID-NEXT: retq
;		;
; BURR-LABEL: test4:		; BURR-LABEL: test4:
; BURR: # %bb.0:		; BURR: # %bb.0:
		; BURR-NEXT: xorl %eax, %eax
; BURR-NEXT: xorl %ecx, %ecx		; BURR-NEXT: xorl %ecx, %ecx
; BURR-NEXT: xorl %edx, %edx
; BURR-NEXT: incq %rsi		; BURR-NEXT: incq %rsi
; BURR-NEXT: sete %dl		; BURR-NEXT: sete %cl
; BURR-NEXT: movl $2, %eax
; BURR-NEXT: cmpq %rdi, %rsi		; BURR-NEXT: cmpq %rdi, %rsi
; BURR-NEXT: sbbq $0, %rdx		; BURR-NEXT: sbbq $0, %rcx
; BURR-NEXT: movl $0, %edx		; BURR-NEXT: movl $0, %ecx
; BURR-NEXT: sbbq %rdx, %rdx		; BURR-NEXT: sbbq %rcx, %rcx
		; BURR-NEXT: movl $0, %ecx
; BURR-NEXT: sbbq %rcx, %rcx		; BURR-NEXT: sbbq %rcx, %rcx
; BURR-NEXT: adcq $-1, %rax		; BURR-NEXT: adcq $1, %rax
; BURR-NEXT: retq		; BURR-NEXT: retq
;		;
; SRC-LABEL: test4:		; SRC-LABEL: test4:
; SRC: # %bb.0:		; SRC: # %bb.0:
; SRC-NEXT: xorl %eax, %eax
; SRC-NEXT: incq %rsi
; SRC-NEXT: sete %al
; SRC-NEXT: xorl %ecx, %ecx		; SRC-NEXT: xorl %ecx, %ecx
		; SRC-NEXT: incq %rsi
		; SRC-NEXT: sete %cl
		; SRC-NEXT: xorl %eax, %eax
; SRC-NEXT: cmpq %rdi, %rsi		; SRC-NEXT: cmpq %rdi, %rsi
; SRC-NEXT: sbbq $0, %rax		; SRC-NEXT: sbbq $0, %rcx
; SRC-NEXT: movl $0, %eax		; SRC-NEXT: movl $0, %ecx
; SRC-NEXT: sbbq %rax, %rax		; SRC-NEXT: sbbq %rcx, %rcx
		; SRC-NEXT: movl $0, %ecx
; SRC-NEXT: sbbq %rcx, %rcx		; SRC-NEXT: sbbq %rcx, %rcx
; SRC-NEXT: movl $2, %eax		; SRC-NEXT: adcq $1, %rax
; SRC-NEXT: adcq $-1, %rax
; SRC-NEXT: retq		; SRC-NEXT: retq
;		;
; LIN-LABEL: test4:		; LIN-LABEL: test4:
; LIN: # %bb.0:		; LIN: # %bb.0:
; LIN-NEXT: movl $2, %eax		; LIN-NEXT: xorl %eax, %eax
; LIN-NEXT: xorl %ecx, %ecx		; LIN-NEXT: xorl %ecx, %ecx
; LIN-NEXT: xorl %edx, %edx
; LIN-NEXT: incq %rsi		; LIN-NEXT: incq %rsi
; LIN-NEXT: sete %dl		; LIN-NEXT: sete %cl
; LIN-NEXT: cmpq %rdi, %rsi		; LIN-NEXT: cmpq %rdi, %rsi
; LIN-NEXT: sbbq $0, %rdx		; LIN-NEXT: sbbq $0, %rcx
; LIN-NEXT: movl $0, %edx		; LIN-NEXT: movl $0, %ecx
; LIN-NEXT: sbbq %rdx, %rdx		; LIN-NEXT: sbbq %rcx, %rcx
		; LIN-NEXT: movl $0, %ecx
; LIN-NEXT: sbbq %rcx, %rcx		; LIN-NEXT: sbbq %rcx, %rcx
; LIN-NEXT: adcq $-1, %rax		; LIN-NEXT: adcq $1, %rax
; LIN-NEXT: retq		; LIN-NEXT: retq
%r = zext i64 %b to i256		%r = zext i64 %b to i256
%u = add i256 %r, 1		%u = add i256 %r, 1
%w = and i256 %u, 1461501637330902918203684832716283019655932542975		%w = and i256 %u, 1461501637330902918203684832716283019655932542975
%x = zext i64 %a to i256		%x = zext i64 %a to i256
%c = icmp uge i256 %w, %x		%c = icmp uge i256 %w, %x
%y = select i1 %c, i64 0, i64 1		%y = select i1 %c, i64 0, i64 1
%z = add i64 %y, 1		%z = add i64 %y, 1
▲ Show 20 Lines • Show All 251 Lines • Show Last 20 Lines

llvm/test/CodeGen/X86/setcc.ll

Show All 40 Lines	; CHECK-NEXT: retq
ret i64 %if		ret i64 %if
}		}

@v4 = common global i32 0, align 4		@v4 = common global i32 0, align 4

define i32 @t4(i32 %a) {		define i32 @t4(i32 %a) {
; CHECK-LABEL: t4:		; CHECK-LABEL: t4:
; CHECK: ## %bb.0:		; CHECK: ## %bb.0:
; CHECK-NEXT: movq _v4@GOTPCREL(%rip), %rax		; CHECK-NEXT: movq _v4@GOTPCREL(%rip), %rcx
; CHECK-NEXT: cmpl $1, (%rax)		; CHECK-NEXT: xorl %eax, %eax
; CHECK-NEXT: movw $1, %ax		; CHECK-NEXT: cmpl $1, (%rcx)
; CHECK-NEXT: adcw $0, %ax		; CHECK-NEXT: adcw $1, %ax
; CHECK-NEXT: shll $16, %eax		; CHECK-NEXT: shll $16, %eax
; CHECK-NEXT: retq		; CHECK-NEXT: retq
%t0 = load i32, i32* @v4, align 4		%t0 = load i32, i32* @v4, align 4
%not.tobool = icmp eq i32 %t0, 0		%not.tobool = icmp eq i32 %t0, 0
%conv.i = sext i1 %not.tobool to i16		%conv.i = sext i1 %not.tobool to i16
%call.lobit = lshr i16 %conv.i, 15		%call.lobit = lshr i16 %conv.i, 15
%add.i.1 = add nuw nsw i16 %call.lobit, 1		%add.i.1 = add nuw nsw i16 %call.lobit, 1
%conv4.2 = zext i16 %add.i.1 to i32		%conv4.2 = zext i16 %add.i.1 to i32
▲ Show 20 Lines • Show All 43 Lines • Show Last 20 Lines