This is an archive of the discontinued LLVM Phabricator instance.


Differential D75801

[InstCombine] Remove known bits constant folding
Closed, Public

Authored by nikic on Mar 7 2020, 2:09 AM.

Details

Reviewers

spatel
lebedev.ri
xbolva00

Commits

rG2b52e4e629e6: [InstCombine] Remove known bits constant folding

Summary

If ExpensiveCombines is enabled (which is the case with -O3 on the legacy PM and always on the new PM), InstCombine tries to compute the known bits of all instructions in the hope that all bits end up being known. This is the most expensive individual part of InstCombine.
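To make the fold under discussion concrete, here is a minimal, self-contained sketch of the idea. It does not use LLVM's actual KnownBits class: KnownBits32, knownConstant, knownNothing, knownAnd, knownOr and example are hypothetical names invented for illustration, and the width is fixed at 32 bits. It only shows how every bit of a value can become known even though an operand is not a constant.

```cpp
#include <cstdint>

// Hypothetical, simplified stand-in for a known-bits record (32-bit only).
// A bit set in Zero is known to be 0; a bit set in One is known to be 1.
struct KnownBits32 {
  std::uint32_t Zero;
  std::uint32_t One;

  // Every bit is known exactly when Zero and One together cover all 32 bits.
  constexpr bool isConstant() const { return (Zero | One) == 0xFFFFFFFFu; }

  // Only meaningful when isConstant() is true.
  constexpr std::uint32_t getConstant() const { return One; }
};

// A literal constant: every bit is known.
constexpr KnownBits32 knownConstant(std::uint32_t C) { return {~C, C}; }

// An arbitrary value (for example a function argument): nothing is known.
constexpr KnownBits32 knownNothing() { return {0u, 0u}; }

// and: a result bit is 0 if either side is known 0, 1 only if both are known 1.
constexpr KnownBits32 knownAnd(KnownBits32 L, KnownBits32 R) {
  return {L.Zero | R.Zero, L.One & R.One};
}

// or: a result bit is 1 if either side is known 1, 0 only if both are known 0.
constexpr KnownBits32 knownOr(KnownBits32 L, KnownBits32 R) {
  return {L.Zero & R.Zero, L.One | R.One};
}

// Known bits of (x | 0xFF) & 0xFF for a completely unknown x.
constexpr KnownBits32 example() {
  return knownAnd(knownOr(knownNothing(), knownConstant(0xFFu)),
                  knownConstant(0xFFu));
}
```

In this sketch example().isConstant() holds and getConstant() yields 0xFF, so a fold of this kind could replace the expression with a constant. The intermediate x | 0xFF is not fully known, which is the common outcome: the analysis is paid for on every instruction but rarely ends with all bits known.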

How effective is it? If we add some statistics on how often the constant folding succeeds and how many KnownBits calculations are performed and run test-suite we get:

"instcombine.NumConstPropKnownBits": 642,
"instcombine.NumConstPropKnownBitsComputed": 18744965,

In other words, we get roughly one fold for every 29,000 KnownBits calculations (18744965 / 642). However, the truth is actually much worse: currently, known bits are computed before performing other folds, so there is a high chance that the cases folded by known bits would also have been handled by other folds.

What happens if we compute known bits after all other folds (hacky implementation: https://gist.github.com/nikic/751f25b3b9d9e0860db5dde934f70f46)?

"instcombine.NumConstPropKnownBits": 0,
"instcombine.NumConstPropKnownBitsComputed": 18105547,

So it turns out despite doing 18 million known bits calculations, the known bits fold does not do anything useful on test-suite. I was originally planning to move this into AggressiveInstCombine so it only runs once in the pipeline, but seeing this, I think we're better off removing it entirely.

As this is the only use of the "expensive combines" mechanism, it may be removed afterwards, but I'll leave that to a separate patch.

Event Timeline

nikic created this revision. Mar 7 2020, 2:09 AM

Herald added a project: Restricted Project. Mar 7 2020, 2:09 AM

Herald added subscribers: llvm-commits, arphaman.

nikic marked 5 inline comments as done. Mar 7 2020, 2:15 AM

nikic added inline comments.

test/Transforms/InstCombine/all-bits-shift.ll
36 ↗	(On Diff #248914)	I believe this test case was the original motivation for having this fold. However, I think this should be handled by InstCombineSimplifyDemanded, which we invoke in cases where we have a reasonable expectation of either demanded bits or known bits simplifications to occur (such as an "and" root, as is the case here). SimplifyDemanded currently doesn't handle this case due to what looks like an implementation bug to me: while SimplifyDemanded normally computes known bits for instructions it doesn't handle itself, it does not do so for some instructions it only partially handles (e.g. it handles a constant shift amount, but does not compute known bits if the shift amount is not constant).
test/Transforms/InstCombine/assume.ll
340	I'm not sure we really need to do anything about this, I think it's only important that we have an assume(false) here, and SimplifyCFG will deal with the rest. If we do want to improve on this, we could convert assume(false) into store undef (the InstCombine UB pattern) and then remove all instructions after store undef.
test/Transforms/InstCombine/expensive-combines.ll
14 ↗	(On Diff #248914)	We're missing a fold to replace call with `returned` attribute to the returned argument. Known bits calculation handles the particular case where the operand is constant, but non-constant operands are not handled at all right now.
test/Transforms/InstCombine/out-of-bounds-indexes.ll
10	This is an improvement.
test/Transforms/InstCombine/phi-shifts.ll
13	This is an improvement.

Harbormaster completed remote builds in B48447: Diff 248914. Mar 7 2020, 2:42 AM

nikic mentioned this in D75804: [InstCombine] Fix known bits handling in SimplifyDemandedUseBits. Mar 7 2020, 4:42 AM

nikic mentioned this in rG51a466a61f55: [InstCombine] Fix known bits handling in SimplifyDemandedUseBits. Mar 7 2020, 9:44 AM

nikic mentioned this in D75815: [InstCombine] Simplify calls with "returned" attribute. Mar 7 2020, 12:42 PM

jdoerfert added a subscriber: jdoerfert. Mar 7 2020, 7:08 PM

jdoerfert added inline comments.

test/Transforms/InstCombine/assume.ll
340	Can't we replace the UB instruction with unreachable?

nikic marked an inline comment as done. Mar 8 2020, 12:44 AM

nikic added inline comments.

test/Transforms/InstCombine/assume.ll
340	InstCombine preserves CFG, so it's not possible to use unreachable directly. Instead we need to use UB patterns understood by SimplifyCFG (assume false is one of them), which will convert them to proper unreachable.

Rebase this after fixes (if any) have landed?

This is the most expensive individual part of InstCombine.

That is an interesting statement.
It would be good to have some perf numbers here.

Do we do this fold somewhere else in the pipeline?

Rebase over D75804 and D75815.

nikic mentioned this in rG45555c381926: [InstSimplify] Simplify calls with "returned" attribute. Mar 9 2020, 11:20 AM

@lebedev.ri Finally got around to implementing some compile-time testing infrastructure today, here's some numbers (in terms of instructions retired) for this change:

Basically this is a 1-2% end-to-end improvement for -O3 compile-times.

In D75801#1922970, @nikic wrote:

@lebedev.ri Finally got around to implementing some compile-time testing infrastructure today, here's some numbers (in terms of instructions retired) for this change:
Basically this is a 1-2% end-to-end improvement for -O3 compile-times.

Thank you, so this does indeed have some measurable effect.
Sounds justified then.

In D75801#1912306, @lebedev.ri wrote:

Do we do this fold somewhere else in the pipeline?

?
Do we believe CVP/SCCP/??? will cover all the possible cases?

In D75801#1922970, @nikic wrote:

@lebedev.ri Finally got around to implementing some compile-time testing infrastructure today, here's some numbers (in terms of instructions retired) for this change:
Basically this is a 1-2% end-to-end improvement for -O3 compile-times.

http://llvm-compile-time-tracker.com/?config=O3&stat=wall-time
That looks great!
Is that end-to-end time for the entire compile or just 'opt -O3'?

In D75801#1923012, @lebedev.ri wrote:

In D75801#1912306, @lebedev.ri wrote:

Do we do this fold somewhere else in the pipeline?

?
Do we believe CVP/SCCP/??? will cover all the possible cases?

I believe this should be covered by two vectors: First, by known bits calculation as part of InstCombineSimplifyDemanded. We don't call this for literally everything, but it handles all the cases that are likely to be fully known via known bits (bitwise ops and shifts as root instructions). Second, we have the same known bits optimization as here also as part of SimplifyInstruction. That means we already perform this in all places that use that high-level interface (e.g. it happens during inlining). I do want to get rid of that known bits call as well, but I think it is the reason why the InstCombine fold triggers exactly zero times on test-suite.

In D75801#1926700, @spatel wrote:

http://llvm-compile-time-tracker.com/?config=O3&stat=wall-time
That looks great!
Is that end-to-end time for the entire compile or just 'opt -O3'?

Those are end-to-end times. Wall time is a bit too noisy to be really useful, so I tend to look at instructions, which is a reasonable proxy, and accurate to within 0.1% for most benchmarks.

test/Analysis/ValueTracking/known-signbit-shift.ll
46	I missed these Analysis test changes before. The shl here is poison because we know it wraps based on known bits. This optimization gets lost now. I could add it back explicitly (and better, by returning undef rather than zero) like this: https://gist.github.com/nikic/29135f304f7cf9de6d18dff7ca12659a I'm not sure whether that's worthwhile though, it seems that these tests are more about not crashing due to conflicting known bits than anything else.
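The wrap-around claim in the comment above can be checked with plain integer arithmetic. The following is only an illustrative sketch, not LLVM code: exactShl, wrappedShl and violatesNSW are hypothetical helper names, and the narrowing cast assumes the usual two's-complement behaviour of mainstream compilers. It computes the shift both exactly (in 64 bits) and as a wrapped 32-bit value, using the constants from the two test functions.

```cpp
#include <cstdint>
#include <limits>

// Mathematically exact result of V << Amt, computed in 64 bits.
constexpr std::int64_t exactShl(std::int32_t V, unsigned Amt) {
  return static_cast<std::int64_t>(V) * (std::int64_t{1} << Amt);
}

// The 32-bit result after wrap-around (assumes two's-complement narrowing).
constexpr std::int32_t wrappedShl(std::int32_t V, unsigned Amt) {
  return static_cast<std::int32_t>(static_cast<std::uint32_t>(V) << Amt);
}

// "nsw" promises the exact result fits in a signed 32-bit integer.
// If it does not, the IR-level shl nsw would produce poison.
constexpr bool violatesNSW(std::int32_t V, unsigned Amt) {
  return exactShl(V, Amt) > std::numeric_limits<std::int32_t>::max() ||
         exactShl(V, Amt) < std::numeric_limits<std::int32_t>::min();
}
```

Under these assumptions, violatesNSW is true for all four select operands (8193, 8192, -8193, -8194) with a shift amount of 18, so the shl nsw is poison on either arm of the select. The wrapped values, for example wrappedShl(8193, 18) == -2147221504 and wrappedShl(-8194, 18) == 2146959360, match the constants that appear in the updated CHECK lines once the shift is folded into the select without exploiting the poison.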

SGTM, but I'll defer to @spatel.

test/Analysis/ValueTracking/known-signbit-shift.ll
46	This still seems like a worthwhile fold.

nikic retitled this revision from [InstCombine] Remove known bits constant folding (WIP) to [InstCombine] Remove known bits constant folding. Mar 20 2020, 4:27 AM

nikic edited the summary of this revision.

nikic set the repository for this revision to rG LLVM Github Monorepo.

nikic marked an inline comment as done. Mar 20 2020, 5:05 AM

nikic added inline comments.

test/Analysis/ValueTracking/known-signbit-shift.ll
46	I've opened D76489 to address the reason why this difference exists in the first place (SimplifyDemanded produces a less good known bits result). We could still do a better overflow detection on top of that, but at least this removes the gap to what we currently do.

Please update after D76489.
If we're going to defer the removal of ExpensiveCombines to a follow-on NFC patch, add a FIXME note in this patch near its definition to indicate that it is deprecated/useless (just in case the follow-on doesn't happen immediately).

Rebase and add FIXME.

LGTM

This revision is now accepted and ready to land. Mar 20 2020, 10:51 AM

Closed by commit rG2b52e4e629e6: [InstCombine] Remove known bits constant folding (authored by nikic). Mar 20 2020, 1:01 PM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: hiraditya. Mar 20 2020, 1:01 PM

nikic mentioned this in D76540: [InstCombine] Remove ExpensiveCombines option. Mar 21 2020, 2:23 AM

nikic mentioned this in rGdc81923659c2: [InstCombine] Remove ExpensiveCombines option. Mar 22 2020, 9:04 AM

nikic mentioned this in D79294: [InstSimplify] Remove known bits constant folding. May 2 2020, 12:19 PM

nikic mentioned this in rG08556afc54e7: [InstSimplify] Remove known bits constant folding. May 3 2020, 11:41 AM

nikic mentioned this in rG5a2265647ed3: Reapply [InstSimplify] Remove known bits constant folding. May 8 2020, 1:34 AM

Revision Contents

Path	Size
llvm/lib/Transforms/InstCombine/InstructionCombining.cpp	20 lines
llvm/test/Analysis/ValueTracking/known-signbit-shift.ll	6 lines
llvm/test/Transforms/InstCombine/assume.ll	16 lines
llvm/test/Transforms/InstCombine/out-of-bounds-indexes.ll	29 lines
llvm/test/Transforms/InstCombine/phi-shifts.ll	22 lines

Diff 249166

lib/Transforms/InstCombine/InstructionCombining.cpp

[3,446 lines not shown]
if (!I->use_empty() &&
++NumConstProp;		++NumConstProp;
if (isInstructionTriviallyDead(I, &TLI))		if (isInstructionTriviallyDead(I, &TLI))
eraseInstFromFunction(*I);		eraseInstFromFunction(*I);
MadeIRChange = true;		MadeIRChange = true;
continue;		continue;
}		}
}		}

// In general, it is possible for computeKnownBits to determine all bits in
// a value even when the operands are not all constants.
Type *Ty = I->getType();
if (ExpensiveCombines && !I->use_empty() && Ty->isIntOrIntVectorTy()) {
KnownBits Known = computeKnownBits(I, /*Depth*/0, I);
if (Known.isConstant()) {
Constant *C = ConstantInt::get(Ty, Known.getConstant());
LLVM_DEBUG(dbgs() << "IC: ConstFold (all bits known) to: " << *C
<< " from: " << *I << '\n');

// Add operands to the worklist.
replaceInstUsesWith(*I, C);
++NumConstProp;
if (isInstructionTriviallyDead(I, &TLI))
eraseInstFromFunction(*I);
MadeIRChange = true;
continue;
}
}

// See if we can trivially sink this instruction to a successor basic block.		// See if we can trivially sink this instruction to a successor basic block.
if (EnableCodeSinking && I->hasOneUse()) {		if (EnableCodeSinking && I->hasOneUse()) {
BasicBlock *BB = I->getParent();		BasicBlock *BB = I->getParent();
Instruction *UserInst = cast<Instruction>(*I->user_begin());		Instruction *UserInst = cast<Instruction>(*I->user_begin());
BasicBlock *UserParent;		BasicBlock *UserParent;

// Get the block the use occurs in.		// Get the block the use occurs in.
if (PHINode *PN = dyn_cast<PHINode>(UserInst))		if (PHINode *PN = dyn_cast<PHINode>(UserInst))
[430 lines not shown]

test/Analysis/ValueTracking/known-signbit-shift.ll

[24 lines not shown]
;
%cmp = icmp slt i32 %shift, 0		%cmp = icmp slt i32 %shift, 0
ret i1 %cmp		ret i1 %cmp
}		}

; If sign bit is a known zero, it cannot be a known one.		; If sign bit is a known zero, it cannot be a known one.
; This test should not crash opt. The shift produces poison.		; This test should not crash opt. The shift produces poison.
define i32 @test_no_sign_bit_conflict1(i1 %b) {		define i32 @test_no_sign_bit_conflict1(i1 %b) {
; CHECK-LABEL: @test_no_sign_bit_conflict1(		; CHECK-LABEL: @test_no_sign_bit_conflict1(
; CHECK-NEXT: ret i32 0		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[B:%.*]], i32 -2147221504, i32 -2147483648
		; CHECK-NEXT: ret i32 [[SEL]]
;		;
%sel = select i1 %b, i32 8193, i32 8192		%sel = select i1 %b, i32 8193, i32 8192
%mul = shl nsw i32 %sel, 18		%mul = shl nsw i32 %sel, 18
ret i32 %mul		ret i32 %mul
}		}

; If sign bit is a known one, it cannot be a known zero.		; If sign bit is a known one, it cannot be a known zero.
; This test should not crash opt. The shift produces poison.		; This test should not crash opt. The shift produces poison.
define i32 @test_no_sign_bit_conflict2(i1 %b) {		define i32 @test_no_sign_bit_conflict2(i1 %b) {
; CHECK-LABEL: @test_no_sign_bit_conflict2(		; CHECK-LABEL: @test_no_sign_bit_conflict2(
; CHECK-NEXT: ret i32 0		; CHECK-NEXT: [[SEL:%.*]] = select i1 [[B:%.*]], i32 2147221504, i32 2146959360
		; CHECK-NEXT: ret i32 [[SEL]]
;		;
%sel = select i1 %b, i32 -8193, i32 -8194		%sel = select i1 %b, i32 -8193, i32 -8194
%mul = shl nsw i32 %sel, 18		%mul = shl nsw i32 %sel, 18
ret i32 %mul		ret i32 %mul
}		}

test/Transforms/InstCombine/assume.ll

[326 lines not shown]
;
tail call void @llvm.assume(i1 %cmp) ; %load has at least highest bit set		tail call void @llvm.assume(i1 %cmp) ; %load has at least highest bit set
%rval = icmp eq i32* %load, null		%rval = icmp eq i32* %load, null
ret i1 %rval		ret i1 %rval
}		}

; PR35846 - https://bugs.llvm.org/show_bug.cgi?id=35846		; PR35846 - https://bugs.llvm.org/show_bug.cgi?id=35846

define i32 @assumption_conflicts_with_known_bits(i32 %a, i32 %b) {		define i32 @assumption_conflicts_with_known_bits(i32 %a, i32 %b) {
; EXPENSIVE-ON-LABEL: @assumption_conflicts_with_known_bits(		; CHECK-LABEL: @assumption_conflicts_with_known_bits(
; EXPENSIVE-ON-NEXT: tail call void @llvm.assume(i1 false)		; CHECK-NEXT: [[AND1:%.*]] = and i32 [[B:%.*]], 3
; EXPENSIVE-ON-NEXT: ret i32 0		; CHECK-NEXT: tail call void @llvm.assume(i1 false)
;		; CHECK-NEXT: [[CMP2:%.*]] = icmp eq i32 [[AND1]], 0
; EXPENSIVE-OFF-LABEL: @assumption_conflicts_with_known_bits(		; CHECK-NEXT: tail call void @llvm.assume(i1 [[CMP2]])
; EXPENSIVE-OFF-NEXT: [[AND1:%.*]] = and i32 [[B:%.*]], 3		; CHECK-NEXT: ret i32 0
; EXPENSIVE-OFF-NEXT: tail call void @llvm.assume(i1 false)
; EXPENSIVE-OFF-NEXT: [[CMP2:%.*]] = icmp eq i32 [[AND1]], 0
; EXPENSIVE-OFF-NEXT: tail call void @llvm.assume(i1 [[CMP2]])
; EXPENSIVE-OFF-NEXT: ret i32 0
;		;
%and1 = and i32 %b, 3		%and1 = and i32 %b, 3
%B1 = lshr i32 %and1, %and1		%B1 = lshr i32 %and1, %and1
%B3 = shl nuw nsw i32 %and1, %B1		%B3 = shl nuw nsw i32 %and1, %B1
%cmp = icmp eq i32 %B3, 1		%cmp = icmp eq i32 %B3, 1
tail call void @llvm.assume(i1 %cmp)		tail call void @llvm.assume(i1 %cmp)
%cmp2 = icmp eq i32 %B1, %B3		%cmp2 = icmp eq i32 %B1, %B3
tail call void @llvm.assume(i1 %cmp2)		tail call void @llvm.assume(i1 %cmp2)
[67 lines not shown]

test/Transforms/InstCombine/out-of-bounds-indexes.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -instcombine -expensive-combines=0 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-OFF			; RUN: opt -S -instcombine -expensive-combines=0 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-OFF
	; RUN: opt -S -instcombine -expensive-combines=1 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-ON			; RUN: opt -S -instcombine -expensive-combines=1 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-ON
	; Check that we don't crash on unreasonable constant indexes			; Check that we don't crash on unreasonable constant indexes

	define i32 @test_out_of_bounds(i32 %a, i1 %x, i1 %y) {			define i32 @test_out_of_bounds(i32 %a, i1 %x, i1 %y) {
	; EXPENSIVE-OFF-LABEL: @test_out_of_bounds(			; CHECK-LABEL: @test_out_of_bounds(
	; EXPENSIVE-OFF-NEXT: entry:			; CHECK-NEXT: entry:
	; EXPENSIVE-OFF-NEXT: [[AND1:%.*]] = and i32 [[A:%.*]], 3			; CHECK-NEXT: [[AND1:%.*]] = and i32 [[A:%.*]], 3
	; EXPENSIVE-OFF-NEXT: tail call void @llvm.assume(i1 undef)			; CHECK-NEXT: tail call void @llvm.assume(i1 undef)
	; EXPENSIVE-OFF-NEXT: ret i32 [[AND1]]			; CHECK-NEXT: ret i32 [[AND1]]
	;
	; EXPENSIVE-ON-LABEL: @test_out_of_bounds(
	; EXPENSIVE-ON-NEXT: entry:
	; EXPENSIVE-ON-NEXT: [[AND1:%.*]] = and i32 [[A:%.*]], 3
	; EXPENSIVE-ON-NEXT: tail call void @llvm.assume(i1 false)
	; EXPENSIVE-ON-NEXT: ret i32 [[AND1]]
	;			;
	entry:			entry:
	%and1 = and i32 %a, 3			%and1 = and i32 %a, 3
	%B = lshr i32 %and1, -2147483648			%B = lshr i32 %and1, -2147483648
	%cmp = icmp eq i32 %B, 1			%cmp = icmp eq i32 %B, 1
	tail call void @llvm.assume(i1 %cmp)			tail call void @llvm.assume(i1 %cmp)
	ret i32 %and1			ret i32 %and1
	}			}

	define i128 @test_non64bit(i128 %a) {			define i128 @test_non64bit(i128 %a) {
	; EXPENSIVE-OFF-LABEL: @test_non64bit(			; CHECK-LABEL: @test_non64bit(
	; EXPENSIVE-OFF-NEXT: [[AND1:%.*]] = and i128 [[A:%.*]], 3			; CHECK-NEXT: [[AND1:%.*]] = and i128 [[A:%.*]], 3
	; EXPENSIVE-OFF-NEXT: tail call void @llvm.assume(i1 undef)			; CHECK-NEXT: tail call void @llvm.assume(i1 undef)
	; EXPENSIVE-OFF-NEXT: ret i128 [[AND1]]			; CHECK-NEXT: ret i128 [[AND1]]
	;
	; EXPENSIVE-ON-LABEL: @test_non64bit(
	; EXPENSIVE-ON-NEXT: [[AND1:%.*]] = and i128 [[A:%.*]], 3
	; EXPENSIVE-ON-NEXT: tail call void @llvm.assume(i1 false)
	; EXPENSIVE-ON-NEXT: ret i128 [[AND1]]
	;			;
	%and1 = and i128 %a, 3			%and1 = and i128 %a, 3
	%B = lshr i128 %and1, -1			%B = lshr i128 %and1, -1
	%cmp = icmp eq i128 %B, 1			%cmp = icmp eq i128 %B, 1
	tail call void @llvm.assume(i1 %cmp)			tail call void @llvm.assume(i1 %cmp)
	ret i128 %and1			ret i128 %and1
	}			}

	[9 lines not shown]

test/Transforms/InstCombine/phi-shifts.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt -S -instcombine -expensive-combines=0 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-OFF			; RUN: opt -S -instcombine -expensive-combines=0 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-OFF
	; RUN: opt -S -instcombine -expensive-combines=1 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-ON			; RUN: opt -S -instcombine -expensive-combines=1 < %s | FileCheck %s --check-prefixes=CHECK,EXPENSIVE-ON

	; OSS Fuzz: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=15217			; OSS Fuzz: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=15217
	define i64 @fuzz15217(i1 %cond, i8* %Ptr, i64 %Val) {			define i64 @fuzz15217(i1 %cond, i8* %Ptr, i64 %Val) {
	; EXPENSIVE-OFF-LABEL: @fuzz15217(			; CHECK-LABEL: @fuzz15217(
	; EXPENSIVE-OFF-NEXT: entry:			; CHECK-NEXT: entry:
	; EXPENSIVE-OFF-NEXT: br i1 [[COND:%.*]], label [[END:%.*]], label [[TWO:%.*]]			; CHECK-NEXT: br i1 [[COND:%.*]], label [[END:%.*]], label [[TWO:%.*]]
	; EXPENSIVE-OFF: two:			; CHECK: two:
	; EXPENSIVE-OFF-NEXT: br label [[END]]			; CHECK-NEXT: br label [[END]]
	; EXPENSIVE-OFF: end:			; CHECK: end:
	; EXPENSIVE-OFF-NEXT: ret i64 undef			; CHECK-NEXT: ret i64 undef
	;
	; EXPENSIVE-ON-LABEL: @fuzz15217(
	; EXPENSIVE-ON-NEXT: entry:
	; EXPENSIVE-ON-NEXT: br i1 [[COND:%.*]], label [[END:%.*]], label [[TWO:%.*]]
	; EXPENSIVE-ON: two:
	; EXPENSIVE-ON-NEXT: br label [[END]]
	; EXPENSIVE-ON: end:
	; EXPENSIVE-ON-NEXT: ret i64 0
	;			;
	entry:			entry:
	br i1 %cond, label %end, label %two			br i1 %cond, label %end, label %two

	two:			two:
	br label %end			br label %end

	end:			end:
	%tmp869.0 = phi i128 [ 0, %entry ], [ 18446744073709551616, %two ]			%tmp869.0 = phi i128 [ 0, %entry ], [ 18446744073709551616, %two ]
	%tmp29 = lshr i128 %tmp869.0, 64			%tmp29 = lshr i128 %tmp869.0, 64
	%B1 = lshr i128 %tmp29, 170141183460469231731687303715884105727			%B1 = lshr i128 %tmp29, 170141183460469231731687303715884105727
	%tmp30 = trunc i128 %B1 to i64			%tmp30 = trunc i128 %B1 to i64
	ret i64 %tmp30			ret i64 %tmp30
	}			}
