This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/InstCombine/
-
Transforms/
-
InstCombine/
-
InstCombineCompares.cpp
1/2
InstCombineInternal.h
-
InstCombineSelect.cpp
-
InstructionCombining.cpp
-
test/
-
ThinLTO/X86/
-
X86/
-
cfi-devirt.ll
-
Transforms/
-
InstCombine/
-
canonicalize-selects-icmp-condition-bittest.ll
-
icmp-mul-zext.ll
-
logical-select.ll
-
select-with-bitwise-ops.ll
-
shift.ll
-
LoopUnroll/
2
runtime-loop-multiple-exits.ll
-
PGOProfile/
-
chr.ll
-
SimplifyCFG/
-
merge-cond-stores.ll

Differential D83139

[InstCombine] Always try to invert non-canonical predicate of an icmp
ClosedPublic

Authored by lebedev.ri on Jul 3 2020, 10:31 AM.

Download Raw Diff

Details

Reviewers

spatel
jdoerfert
nikic

Commits

rGc3b8bd1eea5b: [InstCombine] Always try to invert non-canonical predicate of an icmp

Summary

The actual transform i was going after was:
https://rise4fun.com/Alive/Tp9H

Name: zz
Pre: isPowerOf2(C0) && isPowerOf2(C1) && C1 == C0 
%t0 = and i8 %x, C0
%r = icmp eq i8 %t0, C1
  =>
%t = icmp eq i8 %t0, 0
%r = xor i1 %t, -1

Name: zz
Pre: isPowerOf2(C0) 
%t0 = and i8 %x, C0
%r = icmp ne i8 %t0, 0
  =>
%t = icmp eq i8 %t0, 0
%r = xor i1 %t, -1

but as it can be seen from the current tests, we already canonicalize most of it,
and we are only missing handling multi-use non-canonical icmp predicates.

If we have both !=0 and ==0, even though we can CSE them,
we end up being stuck with them. We should canonicalize to the ==0.

I believe this is one of the cleanup steps i'll need after -scalarizer
if i end up proceeding with my WIP alloca promotion helper pass.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

lebedev.ri created this revision.Jul 3 2020, 10:31 AM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 3 2020, 10:32 AM

I'm wondering if it would make sense to treat this as a more general transform that starts from the icmp, rather than the select. I'm assuming the same issue principally also applied to, say, an icmp used in a br. Though probably the implementation would have to directly invert the users in that case, to avoid looping.

Harbormaster completed remote builds in B62868: Diff 275433.Jul 3 2020, 11:52 AM

Handle it at icmp level.
fcmp is a can of worms i'm not inclined to touch here..

Herald added a subscriber: zzheng. · View Herald TranscriptJul 3 2020, 2:05 PM

Harbormaster failed remote builds in B62883: Diff 275462!Jul 3 2020, 2:11 PM

LG from my side, but would be great if @spatel can chime in as well. We generally avoid folds that inspect users of an instruction, but I think there's reasonable motivation here. It would be great if you can provide your original test case that shows the missed CSE opportunity, as a sanity check that this can't be solved in some other way.

fcmp is a can of worms i'm not inclined to touch here..

Did you run into some particular issue on that front?

llvm/lib/Transforms/InstCombine/InstCombineInternal.h
220	Add a comment to keep this synced with canonicalizeICmpPredicate()?
231	Nice catch!
llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
1	I don't see what has changed here. Can you either precommit, or only adjust the changed part? (Not sure if whoever wrote this appreciates the large generated output.)

This revision is now accepted and ready to land.Jul 4 2020, 1:31 AM

lebedev.ri marked an inline comment as done.Jul 4 2020, 2:52 AM

lebedev.ri added inline comments.

llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll
1	Yeah, i'm not sure what to do with this test. I'm not even sure anything actually changed here, it is possible it just broke because it uses value names. It's just a bad test.

For example, we could end up with the following after -alloca-promotion-coercion -mem2reg on https://godbolt.org/z/bwuEmJ

*** IR Dump After Promote Memory to Register *** (function: _Z4loopi)
; ModuleID = '/tmp/test.cpp'
source_filename = "/tmp/test.cpp"
target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

; Function Attrs: uwtable
define dso_local void @_Z4loopi(i32 %width) local_unnamed_addr #0 {
entry:
  %0 = shufflevector <8 x i8> undef, <8 x i8> zeroinitializer, <8 x i32> <i32 8, i32 9, i32 10, i32 11, i32 12, i32 13, i32 14, i32 15>
  br label %for.cond

for.cond:                                         ; preds = %for.body, %entry
  %storage.apc.retyped.0 = phi <8 x i8> [ %0, %entry ], [ %10, %for.body ]
  %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
  %1 = mul i32 %i.0, -1
  %2 = trunc i32 %1 to i1
  %3 = zext i1 %2 to i64
  %cmp = icmp ne i32 %i.0, %width
  br i1 %cmp, label %for.body, label %for.cond.cleanup

for.cond.cleanup:                                 ; preds = %for.cond
  ret void

for.body:                                         ; preds = %for.cond
  %4 = bitcast <8 x i8> %storage.apc.retyped.0 to <2 x i32>
  %5 = extractelement <2 x i32> %4, i64 %3
  %call = call i32 @_Z3adji(i32 %5)
  %6 = bitcast <8 x i8> %storage.apc.retyped.0 to <2 x i32>
  %7 = extractelement <2 x i32> %6, i64 %3
  %add = add nsw i32 %7, %call
  %8 = bitcast <8 x i8> %storage.apc.retyped.0 to <2 x i32>
  %9 = insertelement <2 x i32> %8, i32 %add, i64 %3
  %10 = bitcast <2 x i32> %9 to <8 x i8>
  %inc = add nsw i32 %i.0, 1
  br label %for.cond
}

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture) #1

; Function Attrs: argmemonly nounwind willreturn writeonly
declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i1 immarg) #2

declare dso_local i32 @_Z3adji(i32) local_unnamed_addr #3

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture) #1

attributes #0 = { uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { argmemonly nounwind willreturn }
attributes #2 = { argmemonly nounwind willreturn writeonly }
attributes #3 = { "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

!llvm.module.flags = !{!0}
!llvm.ident = !{!1}

!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{!"clang version 11.0.0 (git@github.com:LebedevRI/llvm-project.git ff1dbd7ce139b7769158065864ea2615b38f3e16)"}

Then -instcombine -scalarizer helps cleanup it:

$ ./bin/opt /tmp/test.ll -instcombine -scalarizer -o - -S
; ModuleID = '/tmp/test.ll'
source_filename = "/tmp/test.cpp"
target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

; Function Attrs: uwtable
define dso_local void @_Z4loopi(i32 %width) local_unnamed_addr #0 {
entry:
  br label %for.cond

for.cond:                                         ; preds = %for.body, %entry
  %.i0 = phi i32 [ 0, %entry ], [ %.i01, %for.body ]
  %.i1 = phi i32 [ 0, %entry ], [ %.i12, %for.body ]
  %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
  %0 = and i32 %i.0, 1
  %1 = zext i32 %0 to i64
  %cmp = icmp eq i32 %i.0, %width
  br i1 %cmp, label %for.cond.cleanup, label %for.body

for.cond.cleanup:                                 ; preds = %for.cond
  ret void

for.body:                                         ; preds = %for.cond
  %.is.0 = icmp eq i64 %1, 0
  %.upto0 = select i1 %.is.0, i32 %.i0, i32 undef
  %.is.1 = icmp eq i64 %1, 1
  %2 = select i1 %.is.1, i32 %.i1, i32 %.upto0
  %call = call i32 @_Z3adji(i32 %2)
  %.is.03 = icmp eq i64 %1, 0
  %.upto04 = select i1 %.is.03, i32 %.i0, i32 undef
  %.is.15 = icmp eq i64 %1, 1
  %3 = select i1 %.is.15, i32 %.i1, i32 %.upto04
  %add = add nsw i32 %3, %call
  %.is.07 = icmp eq i64 %1, 0
  %.i01 = select i1 %.is.07, i32 %add, i32 %.i0
  %.is.19 = icmp eq i64 %1, 1
  %.i12 = select i1 %.is.19, i32 %add, i32 %.i1
  %inc = add nuw nsw i32 %i.0, 1
  br label %for.cond
}

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture) #1

; Function Attrs: argmemonly nounwind willreturn writeonly
declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i1 immarg) #2

declare dso_local i32 @_Z3adji(i32) local_unnamed_addr #3

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture) #1

attributes #0 = { uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { argmemonly nounwind willreturn }
attributes #2 = { argmemonly nounwind willreturn writeonly }
attributes #3 = { "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

!llvm.module.flags = !{!0}
!llvm.ident = !{!1}

!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{!"clang version 11.0.0 (git@github.com:LebedevRI/llvm-project.git ff1dbd7ce139b7769158065864ea2615b38f3e16)"}

We have icmp eq i64 %1, 0 and icmp eq i64 %1, 1. The following will depend on the exact pass ordering.
If we happen to run EarlyCSE first, we'd still be stuck with them:

$ opt-11 /tmp/test.ll -early-cse -instcombine -o - -S
; ModuleID = '/tmp/test.ll'
source_filename = "/tmp/test.cpp"
target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

; Function Attrs: uwtable
define dso_local void @_Z4loopi(i32 %width) local_unnamed_addr #0 {
entry:
  br label %for.cond

for.cond:                                         ; preds = %for.body, %entry
  %.i0 = phi i32 [ 0, %entry ], [ %.i01, %for.body ]
  %.i1 = phi i32 [ 0, %entry ], [ %.i12, %for.body ]
  %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
  %0 = and i32 %i.0, 1
  %cmp = icmp eq i32 %i.0, %width
  br i1 %cmp, label %for.cond.cleanup, label %for.body

for.cond.cleanup:                                 ; preds = %for.cond
  ret void

for.body:                                         ; preds = %for.cond
  %.is.0 = icmp eq i32 %0, 0
  %.is.1 = icmp ne i32 %0, 0
  %1 = select i1 %.is.1, i32 %.i1, i32 %.i0
  %call = call i32 @_Z3adji(i32 %1)
  %add = add nsw i32 %1, %call
  %.i01 = select i1 %.is.0, i32 %add, i32 %.i0
  %.i12 = select i1 %.is.1, i32 %add, i32 %.i1
  %inc = add nuw nsw i32 %i.0, 1
  br label %for.cond
}

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture) #1

; Function Attrs: argmemonly nounwind willreturn writeonly
declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i1 immarg) #2

declare dso_local i32 @_Z3adji(i32) local_unnamed_addr #3

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture) #1

attributes #0 = { uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { argmemonly nounwind willreturn }
attributes #2 = { argmemonly nounwind willreturn writeonly }
attributes #3 = { "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

!llvm.module.flags = !{!0}
!llvm.ident = !{!1}

!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{!"clang version 11.0.0 (git@github.com:LebedevRI/llvm-project.git ff1dbd7ce139b7769158065864ea2615b38f3e16)"}

But it'd be okay if we would do:

$ opt-11 /tmp/test.ll -instcombine -early-cse -o - -S
; ModuleID = '/tmp/test.ll'
source_filename = "/tmp/test.cpp"
target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

; Function Attrs: uwtable
define dso_local void @_Z4loopi(i32 %width) local_unnamed_addr #0 {
entry:
  br label %for.cond

for.cond:                                         ; preds = %for.body, %entry
  %.i0 = phi i32 [ 0, %entry ], [ %.i01, %for.body ]
  %.i1 = phi i32 [ 0, %entry ], [ %.i12, %for.body ]
  %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
  %0 = and i32 %i.0, 1
  %cmp = icmp eq i32 %i.0, %width
  br i1 %cmp, label %for.cond.cleanup, label %for.body

for.cond.cleanup:                                 ; preds = %for.cond
  ret void

for.body:                                         ; preds = %for.cond
  %.is.1 = icmp eq i32 %0, 0
  %1 = select i1 %.is.1, i32 %.i0, i32 %.i1
  %call = call i32 @_Z3adji(i32 %1)
  %add = add nsw i32 %1, %call
  %.i01 = select i1 %.is.1, i32 %add, i32 %.i0
  %.i12 = select i1 %.is.1, i32 %.i1, i32 %add
  %inc = add nuw nsw i32 %i.0, 1
  br label %for.cond
}

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture) #1

; Function Attrs: argmemonly nounwind willreturn writeonly
declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i1 immarg) #2

declare dso_local i32 @_Z3adji(i32) local_unnamed_addr #3

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture) #1

attributes #0 = { uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { argmemonly nounwind willreturn }
attributes #2 = { argmemonly nounwind willreturn writeonly }
attributes #3 = { "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

!llvm.module.flags = !{!0}
!llvm.ident = !{!1}

!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{!"clang version 11.0.0 (git@github.com:LebedevRI/llvm-project.git ff1dbd7ce139b7769158065864ea2615b38f3e16)"}

And with this patch, we will get good result regardless:

$ ./bin/opt /tmp/test.ll -early-cse -instcombine -o - -S
; ModuleID = '/tmp/test.ll'
source_filename = "/tmp/test.cpp"
target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
target triple = "x86_64-unknown-linux-gnu"

; Function Attrs: uwtable
define dso_local void @_Z4loopi(i32 %width) local_unnamed_addr #0 {
entry:
  br label %for.cond

for.cond:                                         ; preds = %for.body, %entry
  %.i0 = phi i32 [ 0, %entry ], [ %.i01, %for.body ]
  %.i1 = phi i32 [ 0, %entry ], [ %.i12, %for.body ]
  %i.0 = phi i32 [ 0, %entry ], [ %inc, %for.body ]
  %0 = and i32 %i.0, 1
  %cmp = icmp eq i32 %i.0, %width
  br i1 %cmp, label %for.cond.cleanup, label %for.body

for.cond.cleanup:                                 ; preds = %for.cond
  ret void

for.body:                                         ; preds = %for.cond
  %.is.0 = icmp eq i32 %0, 0
  %.is.1.not = icmp eq i32 %0, 0
  %1 = select i1 %.is.1.not, i32 %.i0, i32 %.i1
  %call = call i32 @_Z3adji(i32 %1)
  %add = add nsw i32 %1, %call
  %.i01 = select i1 %.is.0, i32 %add, i32 %.i0
  %.i12 = select i1 %.is.1.not, i32 %.i1, i32 %add
  %inc = add nuw nsw i32 %i.0, 1
  br label %for.cond
}

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture) #1

; Function Attrs: argmemonly nounwind willreturn writeonly
declare void @llvm.memset.p0i8.i64(i8* nocapture writeonly, i8, i64, i1 immarg) #2

declare dso_local i32 @_Z3adji(i32) local_unnamed_addr #3

; Function Attrs: argmemonly nounwind willreturn
declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture) #1

attributes #0 = { uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "min-legal-vector-width"="0" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
attributes #1 = { argmemonly nounwind willreturn }
attributes #2 = { argmemonly nounwind willreturn writeonly }
attributes #3 = { "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "frame-pointer"="none" "less-precise-fpmad"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="true" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+cx8,+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }

!llvm.module.flags = !{!0}
!llvm.ident = !{!1}

!0 = !{i32 1, !"wchar_size", i32 4}
!1 = !{!"clang version 11.0.0 (git@github.com:LebedevRI/llvm-project.git ff1dbd7ce139b7769158065864ea2615b38f3e16)"}

which is good, and just needs one more -early-cse.

In D83139#2131285, @nikic wrote:

fcmp is a can of worms i'm not inclined to touch here..

Did you run into some particular issue on that front?

We already have conflicting transforms there.
We consider ordered fp predicates to be non-canonical, but at the same time we canonicalize to ordered fp predicate if it's a icmp driving select.
And if we consider ordered to be canonical, then minnum (i think?) intrinsic recognition fails.

In D83139#2131285, @nikic wrote:

LG from my side, but would be great if @spatel can chime in as well. We generally avoid folds that inspect users of an instruction, but I think there's reasonable motivation here. It would be great if you can provide your original test case that shows the missed CSE opportunity, as a sanity check that this can't be solved in some other way.

LGTM too. If you can pre/post-commit the value name change diffs in the tests, that would make it much easier to see the true diffs.
I looked at doing something like this somewhere in the past, but I was too scared of the infinite loop potential, especially for min/max patterns (intrinsics coming soon?). So watch out for fallout from fuzzers and bots.

@nikic @spatel thank you for the reviews!

Closed by commit rGc3b8bd1eea5b: [InstCombine] Always try to invert non-canonical predicate of an icmp (authored by lebedev.ri). · Explain WhyJul 4 2020, 8:35 AM

This revision was automatically updated to reflect the committed changes.

Herald added subscribers: dexonsmith, steven_wu. · View Herald TranscriptJul 4 2020, 8:35 AM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

InstCombine/

InstCombineCompares.cpp

48 lines

InstCombineInternal.h

11 lines

InstCombineSelect.cpp

14 lines

InstructionCombining.cpp

4 lines

test/

ThinLTO/

X86/

cfi-devirt.ll

4 lines

Transforms/

InstCombine/

canonicalize-selects-icmp-condition-bittest.ll

24 lines

icmp-mul-zext.ll

10 lines

logical-select.ll

28 lines

select-with-bitwise-ops.ll

146 lines

shift.ll

4 lines

LoopUnroll/

runtime-loop-multiple-exits.ll

34 lines

PGOProfile/

chr.ll

238 lines

SimplifyCFG/

merge-cond-stores.ll

14 lines

Diff 275502

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp

Show First 20 Lines • Show All 5,277 Lines • ▼ Show 20 Lines	static ICmpInst *canonicalizeCmpWithConstant(ICmpInst &I) {

auto FlippedStrictness = getFlippedStrictnessPredicateAndConstant(Pred, Op1C);		auto FlippedStrictness = getFlippedStrictnessPredicateAndConstant(Pred, Op1C);
if (!FlippedStrictness)		if (!FlippedStrictness)
return nullptr;		return nullptr;

return new ICmpInst(FlippedStrictness->first, Op0, FlippedStrictness->second);		return new ICmpInst(FlippedStrictness->first, Op0, FlippedStrictness->second);
}		}

		/// If we have a comparison with a non-canonical predicate, if we can update
		/// all the users, invert the predicate and adjust all the users.
		static CmpInst *canonicalizeICmpPredicate(CmpInst &I) {
		// Is the predicate already canonical?
		CmpInst::Predicate Pred = I.getPredicate();
		if (isCanonicalPredicate(Pred))
		return nullptr;

		// Can all users be adjusted to predicate inversion?
		if (!canFreelyInvertAllUsersOf(&I, /IgnoredUser=/nullptr))
		return nullptr;

		// Ok, we can canonicalize comparison!
		// Let's first invert the comparison's predicate.
		I.setPredicate(CmpInst::getInversePredicate(Pred));
		I.setName(I.getName() + ".not");

		// And now let's adjust every user.
		for (User *U : I.users()) {
		switch (cast<Instruction>(U)->getOpcode()) {
		case Instruction::Select: {
		auto *SI = cast<SelectInst>(U);
		SI->swapValues();
		SI->swapProfMetadata();
		break;
		}
		case Instruction::Br:
		cast<BranchInst>(U)->swapSuccessors(); // swaps prof metadata too
		break;
		case Instruction::Xor:
		U->replaceAllUsesWith(&I);
		break;
		default:
		llvm_unreachable("Got unexpected user - out of sync with "
		"canFreelyInvertAllUsersOf() ?");
		}
		}

		return &I;
		}

/// Integer compare with boolean values can always be turned into bitwise ops.		/// Integer compare with boolean values can always be turned into bitwise ops.
static Instruction *canonicalizeICmpBool(ICmpInst &I,		static Instruction *canonicalizeICmpBool(ICmpInst &I,
InstCombiner::BuilderTy &Builder) {		InstCombiner::BuilderTy &Builder) {
Value A = I.getOperand(0), B = I.getOperand(1);		Value A = I.getOperand(0), B = I.getOperand(1);
assert(A->getType()->isIntOrIntVectorTy(1) && "Bools only");		assert(A->getType()->isIntOrIntVectorTy(1) && "Bools only");

// A boolean compared to true/false can be simplified to Op0/true/false in		// A boolean compared to true/false can be simplified to Op0/true/false in
// 14 out of the 20 (10 predicates * 2 constants) possible combinations.		// 14 out of the 20 (10 predicates * 2 constants) possible combinations.
▲ Show 20 Lines • Show All 222 Lines • ▼ Show 20 Lines	if (match(Op0, m_Select(m_Value(Cond), m_Value(SelectTrue),
}		}
}		}
}		}

if (Op0->getType()->isIntOrIntVectorTy(1))		if (Op0->getType()->isIntOrIntVectorTy(1))
if (Instruction *Res = canonicalizeICmpBool(I, Builder))		if (Instruction *Res = canonicalizeICmpBool(I, Builder))
return Res;		return Res;

if (ICmpInst *NewICmp = canonicalizeCmpWithConstant(I))		if (Instruction *Res = canonicalizeCmpWithConstant(I))
return NewICmp;		return Res;

		if (Instruction *Res = canonicalizeICmpPredicate(I))
		return Res;

if (Instruction *Res = foldICmpWithConstant(I))		if (Instruction *Res = foldICmpWithConstant(I))
return Res;		return Res;

if (Instruction *Res = foldICmpWithDominatingICmp(I))		if (Instruction *Res = foldICmpWithDominatingICmp(I))
return Res;		return Res;

if (Instruction *Res = foldICmpBinOp(I, Q))		if (Instruction *Res = foldICmpBinOp(I, Q))
▲ Show 20 Lines • Show All 687 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineInternal.h

Show First 20 Lines • Show All 209 Lines • ▼ Show 20 Lines	static inline bool isFreeToInvert(Value *V, bool WillInvertAllUses) {
// Selects with invertible operands are freely invertible		// Selects with invertible operands are freely invertible
if (match(V, m_Select(m_Value(), m_Not(m_Value()), m_Not(m_Value()))))		if (match(V, m_Select(m_Value(), m_Not(m_Value()), m_Not(m_Value()))))
return WillInvertAllUses;		return WillInvertAllUses;

return false;		return false;
}		}

/// Given i1 V, can every user of V be freely adapted if V is changed to !V ?		/// Given i1 V, can every user of V be freely adapted if V is changed to !V ?
		/// InstCombine's canonicalizeICmpPredicate() must be kept in sync with this fn.
///		///
/// See also: isFreeToInvert()		/// See also: isFreeToInvert()
		nikicUnsubmitted Not Done Reply Inline Actions Add a comment to keep this synced with canonicalizeICmpPredicate()? nikic: Add a comment to keep this synced with canonicalizeICmpPredicate()?
static inline bool canFreelyInvertAllUsersOf(Value V, Value IgnoredUser) {		static inline bool canFreelyInvertAllUsersOf(Value V, Value IgnoredUser) {
// Look at every user of V.		// Look at every user of V.
for (User *U : V->users()) {		for (Use &U : V->uses()) {
if (U == IgnoredUser)		if (U.getUser() == IgnoredUser)
continue; // Don't consider this user.		continue; // Don't consider this user.

auto *I = cast<Instruction>(U);		auto *I = cast<Instruction>(U.getUser());
switch (I->getOpcode()) {		switch (I->getOpcode()) {
case Instruction::Select:		case Instruction::Select:
		if (U.getOperandNo() != 0) // Only if the value is used as select cond.
		return false;
		nikicUnsubmitted Done Reply Inline Actions Nice catch! nikic: Nice catch!
		break;
case Instruction::Br:		case Instruction::Br:
		assert(U.getOperandNo() == 0 && "Must be branching on that value.");
break; // Free to invert by swapping true/false values/destinations.		break; // Free to invert by swapping true/false values/destinations.
case Instruction::Xor: // Can invert 'xor' if it's a 'not', by ignoring it.		case Instruction::Xor: // Can invert 'xor' if it's a 'not', by ignoring it.
if (!match(I, m_Not(m_Value())))		if (!match(I, m_Not(m_Value())))
return false; // Not a 'not'.		return false; // Not a 'not'.
break;		break;
default:		default:
return false; // Don't know, likely not freely invertible.		return false; // Don't know, likely not freely invertible.
}		}
▲ Show 20 Lines • Show All 845 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp

Show First 20 Lines • Show All 2,526 Lines • ▼ Show 20 Lines	if (Value *V = SimplifySelectInst(CondVal, TrueVal, FalseVal,
return replaceInstUsesWith(SI, V);		return replaceInstUsesWith(SI, V);

if (Instruction *I = canonicalizeSelectToShuffle(SI))		if (Instruction *I = canonicalizeSelectToShuffle(SI))
return I;		return I;

if (Instruction I = canonicalizeScalarSelectOfVecs(SI, this))		if (Instruction I = canonicalizeScalarSelectOfVecs(SI, this))
return I;		return I;

// Canonicalize a one-use integer compare with a non-canonical predicate by
// inverting the predicate and swapping the select operands. This matches a
// compare canonicalization for conditional branches.
// TODO: Should we do the same for FP compares?
CmpInst::Predicate Pred;		CmpInst::Predicate Pred;
if (match(CondVal, m_OneUse(m_ICmp(Pred, m_Value(), m_Value()))) &&
!isCanonicalPredicate(Pred)) {
// Swap true/false values and condition.
CmpInst *Cond = cast<CmpInst>(CondVal);
Cond->setPredicate(CmpInst::getInversePredicate(Pred));
SI.swapValues();
SI.swapProfMetadata();
Worklist.push(Cond);
return &SI;
}

if (SelType->isIntOrIntVectorTy(1) &&		if (SelType->isIntOrIntVectorTy(1) &&
TrueVal->getType() == CondVal->getType()) {		TrueVal->getType() == CondVal->getType()) {
if (match(TrueVal, m_One())) {		if (match(TrueVal, m_One())) {
// Change: A = select B, true, C --> A = or B, C		// Change: A = select B, true, C --> A = or B, C
return BinaryOperator::CreateOr(CondVal, FalseVal);		return BinaryOperator::CreateOr(CondVal, FalseVal);
}		}
if (match(TrueVal, m_Zero())) {		if (match(TrueVal, m_Zero())) {
▲ Show 20 Lines • Show All 411 Lines • Show Last 20 Lines

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

Show First 20 Lines • Show All 2,749 Lines • ▼ Show 20 Lines	Instruction *InstCombiner::visitBranchInst(BranchInst &BI) {

// If the condition is irrelevant, remove the use so that other		// If the condition is irrelevant, remove the use so that other
// transforms on the condition become more effective.		// transforms on the condition become more effective.
if (!isa<ConstantInt>(BI.getCondition()) &&		if (!isa<ConstantInt>(BI.getCondition()) &&
BI.getSuccessor(0) == BI.getSuccessor(1))		BI.getSuccessor(0) == BI.getSuccessor(1))
return replaceOperand(		return replaceOperand(
BI, 0, ConstantInt::getFalse(BI.getCondition()->getType()));		BI, 0, ConstantInt::getFalse(BI.getCondition()->getType()));

// Canonicalize, for example, icmp_ne -> icmp_eq or fcmp_one -> fcmp_oeq.		// Canonicalize, for example, fcmp_one -> fcmp_oeq.
CmpInst::Predicate Pred;		CmpInst::Predicate Pred;
if (match(&BI, m_Br(m_OneUse(m_Cmp(Pred, m_Value(), m_Value())),		if (match(&BI, m_Br(m_OneUse(m_FCmp(Pred, m_Value(), m_Value())),
m_BasicBlock(), m_BasicBlock())) &&		m_BasicBlock(), m_BasicBlock())) &&
!isCanonicalPredicate(Pred)) {		!isCanonicalPredicate(Pred)) {
// Swap destinations and condition.		// Swap destinations and condition.
CmpInst *Cond = cast<CmpInst>(BI.getCondition());		CmpInst *Cond = cast<CmpInst>(BI.getCondition());
Cond->setPredicate(CmpInst::getInversePredicate(Pred));		Cond->setPredicate(CmpInst::getInversePredicate(Pred));
BI.swapSuccessors();		BI.swapSuccessors();
Worklist.push(Cond);		Worklist.push(Cond);
return &BI;		return &BI;
▲ Show 20 Lines • Show All 1,103 Lines • Show Last 20 Lines

llvm/test/ThinLTO/X86/cfi-devirt.ll

Show First 20 Lines • Show All 104 Lines • ▼ Show 20 Lines	cont:
br i1 %6, label %cont2, label %trap		br i1 %6, label %cont2, label %trap

cont2:		cont2:
%7 = extractvalue { i8*, i1 } %5, 0		%7 = extractvalue { i8*, i1 } %5, 0
%8 = bitcast i8* %7 to i32 (%struct.A, i32)		%8 = bitcast i8* %7 to i32 (%struct.A, i32)

; Check that traps are conditional. Invalid TYPE_ID can cause		; Check that traps are conditional. Invalid TYPE_ID can cause
; unconditional traps.		; unconditional traps.
; CHECK-IR: br i1 {{.*}}, label %trap		; CHECK-IR: br i1 {{.*}}, label %trap, label %cont2

; We still have to call it as virtual.		; We still have to call it as virtual.
; CHECK-IR: %call3 = tail call i32 %8		; CHECK-IR: %call3 = tail call i32 %7
%call3 = tail call i32 %8(%struct.A* nonnull %obj, i32 %call)		%call3 = tail call i32 %8(%struct.A* nonnull %obj, i32 %call)
ret i32 %call3		ret i32 %call3
}		}
; CHECK-IR-LABEL: ret i32		; CHECK-IR-LABEL: ret i32
; CHECK-IR-LABEL: }		; CHECK-IR-LABEL: }

declare { i8, i1 } @llvm.type.checked.load(i8, i32, metadata)		declare { i8, i1 } @llvm.type.checked.load(i8, i32, metadata)
declare void @llvm.trap()		declare void @llvm.trap()

declare i32 @_ZN1B1fEi(%struct.B* %this, i32 %a)		declare i32 @_ZN1B1fEi(%struct.B* %this, i32 %a)
declare i32 @_ZN1A1nEi(%struct.A* %this, i32 %a)		declare i32 @_ZN1A1nEi(%struct.A* %this, i32 %a)
declare i32 @_ZN1C1fEi(%struct.C* %this, i32 %a)		declare i32 @_ZN1C1fEi(%struct.C* %this, i32 %a)

!0 = !{i64 16, !"_ZTS1A"}		!0 = !{i64 16, !"_ZTS1A"}
!1 = !{i64 16, !"_ZTS1B"}		!1 = !{i64 16, !"_ZTS1B"}
!2 = !{i64 16, !"_ZTS1C"}		!2 = !{i64 16, !"_ZTS1C"}

llvm/test/Transforms/InstCombine/canonicalize-selects-icmp-condition-bittest.ll

; NOTE: Assertions have been autogenerated by utils/update_test_checks.py		; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
; RUN: opt < %s -instcombine -S \| FileCheck %s		; RUN: opt < %s -instcombine -S \| FileCheck %s

declare void @use8(i8)		declare void @use8(i8)
declare void @use1(i1)		declare void @use1(i1)

; Basic case - all good.		; Basic case - all good.
define i8 @p0(i8 %x, i8 %v0, i8 %v1) {		define i8 @p0(i8 %x, i8 %v0, i8 %v1) {
; CHECK-LABEL: @p0(		; CHECK-LABEL: @p0(
; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1		; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1
; CHECK-NEXT: [[T1:%.*]] = icmp eq i8 [[T0]], 0		; CHECK-NEXT: [[T1_NOT:%.*]] = icmp eq i8 [[T0]], 0
; CHECK-NEXT: [[R:%.]] = select i1 [[T1]], i8 [[V1:%.]], i8 [[V0:%.*]], !prof !0		; CHECK-NEXT: [[R:%.]] = select i1 [[T1_NOT]], i8 [[V1:%.]], i8 [[V0:%.*]], !prof !0
; CHECK-NEXT: ret i8 [[R]]		; CHECK-NEXT: ret i8 [[R]]
;		;
%t0 = and i8 %x, 1		%t0 = and i8 %x, 1
%t1 = icmp eq i8 %t0, 1		%t1 = icmp eq i8 %t0, 1
%r = select i1 %t1, i8 %v0, i8 %v1, !prof !0		%r = select i1 %t1, i8 %v0, i8 %v1, !prof !0
ret i8 %r		ret i8 %r
}		}
define i8 @p1(i8 %x, i8 %v0, i8 %v1) {		define i8 @p1(i8 %x, i8 %v0, i8 %v1) {
; CHECK-LABEL: @p1(		; CHECK-LABEL: @p1(
; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1		; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1
; CHECK-NEXT: [[T1:%.*]] = icmp eq i8 [[T0]], 0		; CHECK-NEXT: [[T1_NOT:%.*]] = icmp eq i8 [[T0]], 0
; CHECK-NEXT: [[R:%.]] = select i1 [[T1]], i8 [[V1:%.]], i8 [[V0:%.*]]		; CHECK-NEXT: [[R:%.]] = select i1 [[T1_NOT]], i8 [[V1:%.]], i8 [[V0:%.*]]
; CHECK-NEXT: ret i8 [[R]]		; CHECK-NEXT: ret i8 [[R]]
;		;
%t0 = and i8 %x, 1		%t0 = and i8 %x, 1
%t1 = icmp ne i8 %t0, 0		%t1 = icmp ne i8 %t0, 0
%r = select i1 %t1, i8 %v0, i8 %v1		%r = select i1 %t1, i8 %v0, i8 %v1
ret i8 %r		ret i8 %r
}		}

Show All 13 Lines	;
ret i8 %r		ret i8 %r
}		}

; Extra use can be adjusted. While there, test multi-bb case.		; Extra use can be adjusted. While there, test multi-bb case.
define i8 @t3(i8 %x, i8 %v0, i8 %v1, i8 %v2, i8 %v3, i8* %out, i1 %c) {		define i8 @t3(i8 %x, i8 %v0, i8 %v1, i8 %v2, i8 %v3, i8* %out, i1 %c) {
; CHECK-LABEL: @t3(		; CHECK-LABEL: @t3(
; CHECK-NEXT: bb0:		; CHECK-NEXT: bb0:
; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1		; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1
; CHECK-NEXT: [[T1:%.*]] = icmp ne i8 [[T0]], 0		; CHECK-NEXT: [[T1_NOT:%.*]] = icmp eq i8 [[T0]], 0
; CHECK-NEXT: br i1 [[C:%.]], label [[BB1:%.]], label [[BB2:%.*]]		; CHECK-NEXT: br i1 [[C:%.]], label [[BB1:%.]], label [[BB2:%.*]]
; CHECK: bb1:		; CHECK: bb1:
; CHECK-NEXT: [[R0:%.]] = select i1 [[T1]], i8 [[V0:%.]], i8 [[V1:%.*]]		; CHECK-NEXT: [[R0:%.]] = select i1 [[T1_NOT]], i8 [[V1:%.]], i8 [[V0:%.*]]
; CHECK-NEXT: store i8 [[R0]], i8* [[OUT:%.*]], align 1		; CHECK-NEXT: store i8 [[R0]], i8* [[OUT:%.*]], align 1
; CHECK-NEXT: br label [[BB2]]		; CHECK-NEXT: br label [[BB2]]
; CHECK: bb2:		; CHECK: bb2:
; CHECK-NEXT: [[R1:%.]] = select i1 [[T1]], i8 [[V2:%.]], i8 [[V3:%.*]]		; CHECK-NEXT: [[R1:%.]] = select i1 [[T1_NOT]], i8 [[V3:%.]], i8 [[V2:%.*]]
; CHECK-NEXT: ret i8 [[R1]]		; CHECK-NEXT: ret i8 [[R1]]
;		;
bb0:		bb0:
%t0 = and i8 %x, 1		%t0 = and i8 %x, 1
%t1 = icmp eq i8 %t0, 1		%t1 = icmp eq i8 %t0, 1
br i1 %c, label %bb1, label %bb2		br i1 %c, label %bb1, label %bb2
bb1:		bb1:
%r0 = select i1 %t1, i8 %v0, i8 %v1		%r0 = select i1 %t1, i8 %v0, i8 %v1
store i8 %r0, i8* %out		store i8 %r0, i8* %out
br label %bb2		br label %bb2
bb2:		bb2:
%r1 = select i1 %t1, i8 %v2, i8 %v3		%r1 = select i1 %t1, i8 %v2, i8 %v3
ret i8 %r1		ret i8 %r1
}		}
define i8 @t4(i8 %x, i8 %v0, i8 %v1, i8 %v2, i8 %v3, i8* %out) {		define i8 @t4(i8 %x, i8 %v0, i8 %v1, i8 %v2, i8 %v3, i8* %out) {
; CHECK-LABEL: @t4(		; CHECK-LABEL: @t4(
; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1		; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1
; CHECK-NEXT: [[T1:%.*]] = icmp ne i8 [[T0]], 0		; CHECK-NEXT: [[T1_NOT:%.*]] = icmp eq i8 [[T0]], 0
; CHECK-NEXT: [[R0:%.]] = select i1 [[T1]], i8 [[V0:%.]], i8 [[V1:%.*]]		; CHECK-NEXT: [[R0:%.]] = select i1 [[T1_NOT]], i8 [[V1:%.]], i8 [[V0:%.*]]
; CHECK-NEXT: store i8 [[R0]], i8* [[OUT:%.*]], align 1		; CHECK-NEXT: store i8 [[R0]], i8* [[OUT:%.*]], align 1
; CHECK-NEXT: [[R1:%.]] = select i1 [[T1]], i8 [[V2:%.]], i8 [[V3:%.*]]		; CHECK-NEXT: [[R1:%.]] = select i1 [[T1_NOT]], i8 [[V3:%.]], i8 [[V2:%.*]]
; CHECK-NEXT: ret i8 [[R1]]		; CHECK-NEXT: ret i8 [[R1]]
;		;
%t0 = and i8 %x, 1		%t0 = and i8 %x, 1
%t1 = icmp ne i8 %t0, 0		%t1 = icmp ne i8 %t0, 0
%r0 = select i1 %t1, i8 %v0, i8 %v1		%r0 = select i1 %t1, i8 %v0, i8 %v1
store i8 %r0, i8* %out		store i8 %r0, i8* %out
%r1 = select i1 %t1, i8 %v2, i8 %v3		%r1 = select i1 %t1, i8 %v2, i8 %v3
ret i8 %r1		ret i8 %r1
Show All 16 Lines	;
%t0 = and i8 %x, 1		%t0 = and i8 %x, 1
%t1 = icmp eq i8 %t0, 3 ; checking some other bit		%t1 = icmp eq i8 %t0, 3 ; checking some other bit
%r = select i1 %t1, i8 %v0, i8 %v1		%r = select i1 %t1, i8 %v0, i8 %v1
ret i8 %r		ret i8 %r
}		}
define i8 @n7(i8 %x, i8 %v0, i8 %v1) {		define i8 @n7(i8 %x, i8 %v0, i8 %v1) {
; CHECK-LABEL: @n7(		; CHECK-LABEL: @n7(
; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1		; CHECK-NEXT: [[T0:%.]] = and i8 [[X:%.]], 1
; CHECK-NEXT: [[T1:%.*]] = icmp eq i8 [[T0]], 0		; CHECK-NEXT: [[T1_NOT_NOT:%.*]] = icmp eq i8 [[T0]], 0
; CHECK-NEXT: [[R:%.]] = select i1 [[T1]], i8 [[V0:%.]], i8 [[V1:%.*]]		; CHECK-NEXT: [[R:%.]] = select i1 [[T1_NOT_NOT]], i8 [[V0:%.]], i8 [[V1:%.*]]
; CHECK-NEXT: ret i8 [[R]]		; CHECK-NEXT: ret i8 [[R]]
;		;
%t0 = and i8 %x, 1		%t0 = and i8 %x, 1
%t1 = icmp ne i8 %t0, 1 ; not checking that it's zero		%t1 = icmp ne i8 %t0, 1 ; not checking that it's zero
%r = select i1 %t1, i8 %v0, i8 %v1		%r = select i1 %t1, i8 %v0, i8 %v1
ret i8 %r		ret i8 %r
}		}

Show All 18 Lines

llvm/test/Transforms/InstCombine/icmp-mul-zext.ll

	; NOTE: Assertions have been autogenerated by utils/update_test_checks.py			; NOTE: Assertions have been autogenerated by utils/update_test_checks.py
	; RUN: opt < %s -instcombine -S \| FileCheck %s			; RUN: opt < %s -instcombine -S \| FileCheck %s

	target datalayout = "n32"			target datalayout = "n32"

	define i32 @sterix(i32, i8, i64) {			define i32 @sterix(i32, i8, i64) {
	; CHECK-LABEL: @sterix(			; CHECK-LABEL: @sterix(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[CONV:%.]] = zext i32 [[TMP0:%.]] to i64			; CHECK-NEXT: [[CONV:%.]] = zext i32 [[TMP0:%.]] to i64
	; CHECK-NEXT: [[CONV1:%.]] = sext i8 [[TMP1:%.]] to i32			; CHECK-NEXT: [[CONV1:%.]] = sext i8 [[TMP1:%.]] to i32
	; CHECK-NEXT: [[MUL:%.*]] = mul i32 [[CONV1]], 1945964878			; CHECK-NEXT: [[MUL:%.*]] = mul i32 [[CONV1]], 1945964878
	; CHECK-NEXT: [[SH_PROM:%.]] = trunc i64 [[TMP2:%.]] to i32			; CHECK-NEXT: [[SH_PROM:%.]] = trunc i64 [[TMP2:%.]] to i32
	; CHECK-NEXT: [[SHR:%.*]] = lshr i32 [[MUL]], [[SH_PROM]]			; CHECK-NEXT: [[SHR:%.*]] = lshr i32 [[MUL]], [[SH_PROM]]
	; CHECK-NEXT: [[CONV2:%.*]] = zext i32 [[SHR]] to i64			; CHECK-NEXT: [[CONV2:%.*]] = zext i32 [[SHR]] to i64
	; CHECK-NEXT: [[MUL3:%.*]] = mul nuw nsw i64 [[CONV]], [[CONV2]]			; CHECK-NEXT: [[MUL3:%.*]] = mul nuw nsw i64 [[CONV]], [[CONV2]]
	; CHECK-NEXT: [[TMP3:%.*]] = icmp ugt i64 [[MUL3]], 4294967295			; CHECK-NEXT: [[TMP3:%.*]] = icmp ult i64 [[MUL3]], 4294967296
	; CHECK-NEXT: br i1 [[TMP3]], label [[LOR_END:%.]], label [[LOR_RHS:%.]]			; CHECK-NEXT: br i1 [[TMP3]], label [[LOR_RHS:%.]], label [[LOR_END:%.]]
	; CHECK: lor.rhs:			; CHECK: lor.rhs:
	; CHECK-NEXT: [[AND:%.*]] = and i64 [[MUL3]], [[TMP2]]			; CHECK-NEXT: [[AND:%.*]] = and i64 [[MUL3]], [[TMP2]]
	; CHECK-NEXT: [[CONV4:%.*]] = trunc i64 [[AND]] to i32			; CHECK-NEXT: [[CONV4:%.*]] = trunc i64 [[AND]] to i32
	; CHECK-NEXT: [[TOBOOL7:%.*]] = icmp eq i32 [[CONV4]], 0			; CHECK-NEXT: [[TOBOOL7_NOT:%.*]] = icmp eq i32 [[CONV4]], 0
	; CHECK-NEXT: [[PHI_CAST:%.*]] = zext i1 [[TOBOOL7]] to i32			; CHECK-NEXT: [[PHITMP:%.*]] = zext i1 [[TOBOOL7_NOT]] to i32
	; CHECK-NEXT: br label [[LOR_END]]			; CHECK-NEXT: br label [[LOR_END]]
	; CHECK: lor.end:			; CHECK: lor.end:
	; CHECK-NEXT: [[TMP4:%.]] = phi i32 [ 1, [[ENTRY:%.]] ], [ [[PHI_CAST]], [[LOR_RHS]] ]			; CHECK-NEXT: [[TMP4:%.]] = phi i32 [ 1, [[ENTRY:%.]] ], [ [[PHITMP]], [[LOR_RHS]] ]
	; CHECK-NEXT: ret i32 [[TMP4]]			; CHECK-NEXT: ret i32 [[TMP4]]
	;			;
	entry:			entry:
	%conv = zext i32 %0 to i64			%conv = zext i32 %0 to i64
	%conv1 = sext i8 %1 to i32			%conv1 = sext i8 %1 to i32
	%mul = mul i32 %conv1, 1945964878			%mul = mul i32 %conv1, 1945964878
	%sh_prom = trunc i64 %2 to i32			%sh_prom = trunc i64 %2 to i32
	%shr = lshr i32 %mul, %sh_prom			%shr = lshr i32 %mul, %sh_prom
	▲ Show 20 Lines • Show All 119 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/logical-select.ll

Show All 13 Lines	;
%h = xor i32 %f, -1		%h = xor i32 %f, -1
%i = and i32 %d, %h		%i = and i32 %d, %h
%j = or i32 %g, %i		%j = or i32 %g, %i
ret i32 %j		ret i32 %j
}		}

define i32 @bar(i32 %a, i32 %b, i32 %c, i32 %d) {		define i32 @bar(i32 %a, i32 %b, i32 %c, i32 %d) {
; CHECK-LABEL: @bar(		; CHECK-LABEL: @bar(
; CHECK-NEXT: [[E:%.]] = icmp slt i32 [[A:%.]], [[B:%.*]]		; CHECK-NEXT: [[E_NOT:%.]] = icmp slt i32 [[A:%.]], [[B:%.*]]
; CHECK-NEXT: [[TMP1:%.]] = select i1 [[E]], i32 [[C:%.]], i32 [[D:%.*]]		; CHECK-NEXT: [[TMP1:%.]] = select i1 [[E_NOT]], i32 [[C:%.]], i32 [[D:%.*]]
; CHECK-NEXT: ret i32 [[TMP1]]		; CHECK-NEXT: ret i32 [[TMP1]]
;		;
%e = icmp slt i32 %a, %b		%e = icmp slt i32 %a, %b
%f = sext i1 %e to i32		%f = sext i1 %e to i32
%g = and i32 %c, %f		%g = and i32 %c, %f
%h = xor i32 %f, -1		%h = xor i32 %f, -1
%i = and i32 %d, %h		%i = and i32 %d, %h
%j = or i32 %i, %g		%j = or i32 %i, %g
Show All 32 Lines

; PR32791 - https://bugs.llvm.org//show_bug.cgi?id=32791		; PR32791 - https://bugs.llvm.org//show_bug.cgi?id=32791
; The 2nd compare/select are canonicalized, so CSE and another round of instcombine or some other pass will fold this.		; The 2nd compare/select are canonicalized, so CSE and another round of instcombine or some other pass will fold this.

define i32 @fold_inverted_icmp_preds(i32 %a, i32 %b, i32 %c, i32 %d) {		define i32 @fold_inverted_icmp_preds(i32 %a, i32 %b, i32 %c, i32 %d) {
; CHECK-LABEL: @fold_inverted_icmp_preds(		; CHECK-LABEL: @fold_inverted_icmp_preds(
; CHECK-NEXT: [[CMP1:%.]] = icmp slt i32 [[A:%.]], [[B:%.*]]		; CHECK-NEXT: [[CMP1:%.]] = icmp slt i32 [[A:%.]], [[B:%.*]]
; CHECK-NEXT: [[SEL1:%.]] = select i1 [[CMP1]], i32 [[C:%.]], i32 0		; CHECK-NEXT: [[SEL1:%.]] = select i1 [[CMP1]], i32 [[C:%.]], i32 0
; CHECK-NEXT: [[CMP2:%.*]] = icmp slt i32 [[A]], [[B]]		; CHECK-NEXT: [[CMP2_NOT:%.*]] = icmp slt i32 [[A]], [[B]]
; CHECK-NEXT: [[SEL2:%.]] = select i1 [[CMP2]], i32 0, i32 [[D:%.]]		; CHECK-NEXT: [[SEL2:%.]] = select i1 [[CMP2_NOT]], i32 0, i32 [[D:%.]]
; CHECK-NEXT: [[OR:%.*]] = or i32 [[SEL1]], [[SEL2]]		; CHECK-NEXT: [[OR:%.*]] = or i32 [[SEL1]], [[SEL2]]
; CHECK-NEXT: ret i32 [[OR]]		; CHECK-NEXT: ret i32 [[OR]]
;		;
%cmp1 = icmp slt i32 %a, %b		%cmp1 = icmp slt i32 %a, %b
%sel1 = select i1 %cmp1, i32 %c, i32 0		%sel1 = select i1 %cmp1, i32 %c, i32 0
%cmp2 = icmp sge i32 %a, %b		%cmp2 = icmp sge i32 %a, %b
%sel2 = select i1 %cmp2, i32 %d, i32 0		%sel2 = select i1 %cmp2, i32 %d, i32 0
%or = or i32 %sel1, %sel2		%or = or i32 %sel1, %sel2
ret i32 %or		ret i32 %or
}		}

; The 2nd compare/select are canonicalized, so CSE and another round of instcombine or some other pass will fold this.		; The 2nd compare/select are canonicalized, so CSE and another round of instcombine or some other pass will fold this.

define i32 @fold_inverted_icmp_preds_reverse(i32 %a, i32 %b, i32 %c, i32 %d) {		define i32 @fold_inverted_icmp_preds_reverse(i32 %a, i32 %b, i32 %c, i32 %d) {
; CHECK-LABEL: @fold_inverted_icmp_preds_reverse(		; CHECK-LABEL: @fold_inverted_icmp_preds_reverse(
; CHECK-NEXT: [[CMP1:%.]] = icmp slt i32 [[A:%.]], [[B:%.*]]		; CHECK-NEXT: [[CMP1:%.]] = icmp slt i32 [[A:%.]], [[B:%.*]]
; CHECK-NEXT: [[SEL1:%.]] = select i1 [[CMP1]], i32 0, i32 [[C:%.]]		; CHECK-NEXT: [[SEL1:%.]] = select i1 [[CMP1]], i32 0, i32 [[C:%.]]
; CHECK-NEXT: [[CMP2:%.*]] = icmp slt i32 [[A]], [[B]]		; CHECK-NEXT: [[CMP2_NOT:%.*]] = icmp slt i32 [[A]], [[B]]
; CHECK-NEXT: [[SEL2:%.]] = select i1 [[CMP2]], i32 [[D:%.]], i32 0		; CHECK-NEXT: [[SEL2:%.]] = select i1 [[CMP2_NOT]], i32 [[D:%.]], i32 0
; CHECK-NEXT: [[OR:%.*]] = or i32 [[SEL1]], [[SEL2]]		; CHECK-NEXT: [[OR:%.*]] = or i32 [[SEL1]], [[SEL2]]
; CHECK-NEXT: ret i32 [[OR]]		; CHECK-NEXT: ret i32 [[OR]]
;		;
%cmp1 = icmp slt i32 %a, %b		%cmp1 = icmp slt i32 %a, %b
%sel1 = select i1 %cmp1, i32 0, i32 %c		%sel1 = select i1 %cmp1, i32 0, i32 %c
%cmp2 = icmp sge i32 %a, %b		%cmp2 = icmp sge i32 %a, %b
%sel2 = select i1 %cmp2, i32 0, i32 %d		%sel2 = select i1 %cmp2, i32 0, i32 %d
%or = or i32 %sel1, %sel2		%or = or i32 %sel1, %sel2
Show All 18 Lines	;
%or = or i32 %sel1, %sel2		%or = or i32 %sel1, %sel2
ret i32 %or		ret i32 %or
}		}

; The 2nd compare/select are canonicalized, so CSE and another round of instcombine or some other pass will fold this.		; The 2nd compare/select are canonicalized, so CSE and another round of instcombine or some other pass will fold this.

define <2 x i32> @fold_inverted_icmp_vector_preds(<2 x i32> %a, <2 x i32> %b, <2 x i32> %c, <2 x i32> %d) {		define <2 x i32> @fold_inverted_icmp_vector_preds(<2 x i32> %a, <2 x i32> %b, <2 x i32> %c, <2 x i32> %d) {
; CHECK-LABEL: @fold_inverted_icmp_vector_preds(		; CHECK-LABEL: @fold_inverted_icmp_vector_preds(
; CHECK-NEXT: [[CMP1:%.]] = icmp eq <2 x i32> [[A:%.]], [[B:%.*]]		; CHECK-NEXT: [[CMP1_NOT:%.]] = icmp eq <2 x i32> [[A:%.]], [[B:%.*]]
; CHECK-NEXT: [[SEL1:%.]] = select <2 x i1> [[CMP1]], <2 x i32> zeroinitializer, <2 x i32> [[C:%.]]		; CHECK-NEXT: [[SEL1:%.]] = select <2 x i1> [[CMP1_NOT]], <2 x i32> zeroinitializer, <2 x i32> [[C:%.]]
; CHECK-NEXT: [[CMP2:%.*]] = icmp eq <2 x i32> [[A]], [[B]]		; CHECK-NEXT: [[CMP2:%.*]] = icmp eq <2 x i32> [[A]], [[B]]
; CHECK-NEXT: [[SEL2:%.]] = select <2 x i1> [[CMP2]], <2 x i32> [[D:%.]], <2 x i32> zeroinitializer		; CHECK-NEXT: [[SEL2:%.]] = select <2 x i1> [[CMP2]], <2 x i32> [[D:%.]], <2 x i32> zeroinitializer
; CHECK-NEXT: [[OR:%.*]] = or <2 x i32> [[SEL1]], [[SEL2]]		; CHECK-NEXT: [[OR:%.*]] = or <2 x i32> [[SEL1]], [[SEL2]]
; CHECK-NEXT: ret <2 x i32> [[OR]]		; CHECK-NEXT: ret <2 x i32> [[OR]]
;		;
%cmp1 = icmp ne <2 x i32> %a, %b		%cmp1 = icmp ne <2 x i32> %a, %b
%sel1 = select <2 x i1> %cmp1, <2 x i32> %c, <2 x i32> <i32 0, i32 0>		%sel1 = select <2 x i1> %cmp1, <2 x i32> %c, <2 x i32> <i32 0, i32 0>
%cmp2 = icmp eq <2 x i32> %a, %b		%cmp2 = icmp eq <2 x i32> %a, %b
▲ Show 20 Lines • Show All 393 Lines • ▼ Show 20 Lines	;
%add = add <4 x i32> %or, %mask_flip1		%add = add <4 x i32> %or, %mask_flip1
ret <4 x i32> %add		ret <4 x i32> %add
}		}

; The 'ashr' guarantees that we have a bitmask, so this is select with truncated condition.		; The 'ashr' guarantees that we have a bitmask, so this is select with truncated condition.

define i32 @allSignBits(i32 %cond, i32 %tval, i32 %fval) {		define i32 @allSignBits(i32 %cond, i32 %tval, i32 %fval) {
; CHECK-LABEL: @allSignBits(		; CHECK-LABEL: @allSignBits(
; CHECK-NEXT: [[TMP1:%.]] = icmp sgt i32 [[COND:%.]], -1		; CHECK-NEXT: [[DOTNOT:%.]] = icmp slt i32 [[COND:%.]], 0
; CHECK-NEXT: [[TMP2:%.]] = select i1 [[TMP1]], i32 [[FVAL:%.]], i32 [[TVAL:%.*]]		; CHECK-NEXT: [[TMP1:%.]] = select i1 [[DOTNOT]], i32 [[TVAL:%.]], i32 [[FVAL:%.*]]
; CHECK-NEXT: ret i32 [[TMP2]]		; CHECK-NEXT: ret i32 [[TMP1]]
;		;
%bitmask = ashr i32 %cond, 31		%bitmask = ashr i32 %cond, 31
%not_bitmask = xor i32 %bitmask, -1		%not_bitmask = xor i32 %bitmask, -1
%a1 = and i32 %tval, %bitmask		%a1 = and i32 %tval, %bitmask
%a2 = and i32 %not_bitmask, %fval		%a2 = and i32 %not_bitmask, %fval
%sel = or i32 %a1, %a2		%sel = or i32 %a1, %a2
ret i32 %sel		ret i32 %sel
}		}

define <4 x i8> @allSignBits_vec(<4 x i8> %cond, <4 x i8> %tval, <4 x i8> %fval) {		define <4 x i8> @allSignBits_vec(<4 x i8> %cond, <4 x i8> %tval, <4 x i8> %fval) {
; CHECK-LABEL: @allSignBits_vec(		; CHECK-LABEL: @allSignBits_vec(
; CHECK-NEXT: [[TMP1:%.]] = icmp sgt <4 x i8> [[COND:%.]], <i8 -1, i8 -1, i8 -1, i8 -1>		; CHECK-NEXT: [[DOTNOT:%.]] = icmp sgt <4 x i8> [[COND:%.]], <i8 -1, i8 -1, i8 -1, i8 -1>
; CHECK-NEXT: [[TMP2:%.]] = select <4 x i1> [[TMP1]], <4 x i8> [[FVAL:%.]], <4 x i8> [[TVAL:%.*]]		; CHECK-NEXT: [[TMP1:%.]] = select <4 x i1> [[DOTNOT]], <4 x i8> [[FVAL:%.]], <4 x i8> [[TVAL:%.*]]
; CHECK-NEXT: ret <4 x i8> [[TMP2]]		; CHECK-NEXT: ret <4 x i8> [[TMP1]]
;		;
%bitmask = ashr <4 x i8> %cond, <i8 7, i8 7, i8 7, i8 7>		%bitmask = ashr <4 x i8> %cond, <i8 7, i8 7, i8 7, i8 7>
%not_bitmask = xor <4 x i8> %bitmask, <i8 -1, i8 -1, i8 -1, i8 -1>		%not_bitmask = xor <4 x i8> %bitmask, <i8 -1, i8 -1, i8 -1, i8 -1>
%a1 = and <4 x i8> %tval, %bitmask		%a1 = and <4 x i8> %tval, %bitmask
%a2 = and <4 x i8> %fval, %not_bitmask		%a2 = and <4 x i8> %fval, %not_bitmask
%sel = or <4 x i8> %a2, %a1		%sel = or <4 x i8> %a2, %a1
ret <4 x i8> %sel		ret <4 x i8> %sel
}		}
▲ Show 20 Lines • Show All 75 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/select-with-bitwise-ops.ll

Show First 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	;
%or = or <2 x i32> %y, <i32 4096, i32 4096>		%or = or <2 x i32> %y, <i32 4096, i32 4096>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_ne_0_and_4096_xor_4096(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_4096_xor_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_4096(		; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_4096(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_4096_and_not_4096(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_4096_and_not_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_4096_and_not_4096(		; CHECK-LABEL: @select_icmp_ne_0_and_4096_and_not_4096(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[AND2]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -4097		%and2 = and i32 %y, -4097
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
ret i32 %select		ret i32 %select
}		}
▲ Show 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	;
%or = or i32 %y, 32		%or = or i32 %y, 32
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_4096_xor_32(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_4096_xor_32(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_32(		; CHECK-LABEL: @select_icmp_ne_0_and_4096_xor_32(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 32		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 32
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 32		%xor = xor i32 %y, 32
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_4096_and_not_32(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_4096_and_not_32(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_4096_and_not_32(		; CHECK-LABEL: @select_icmp_ne_0_and_4096_and_not_32(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -33		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -33
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[AND2]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -33		%and2 = and i32 %y, -33
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
ret i32 %select		ret i32 %select
}		}
Show All 26 Lines	;
%or = or <2 x i32> %y, <i32 4096, i32 4096>		%or = or <2 x i32> %y, <i32 4096, i32 4096>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @select_icmp_ne_0_and_32_xor_4096(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_32_xor_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_32_xor_4096(		; CHECK-LABEL: @select_icmp_ne_0_and_32_xor_4096(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 32		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 32
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 32		%and = and i32 %x, 32
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_32_and_not_4096(i32 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_32_and_not_4096(i32 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_32_and_not_4096(		; CHECK-LABEL: @select_icmp_ne_0_and_32_and_not_4096(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 32		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 32
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[AND2]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 32		%and = and i32 %x, 32
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -4097		%and2 = and i32 %y, -4097
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
ret i32 %select		ret i32 %select
}		}

define i8 @select_icmp_ne_0_and_1073741824_or_8(i32 %x, i8 %y) {		define i8 @select_icmp_ne_0_and_1073741824_or_8(i32 %x, i8 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_1073741824_or_8(		; CHECK-LABEL: @select_icmp_ne_0_and_1073741824_or_8(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 1073741824		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 1073741824
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[OR:%.]] = or i8 [[Y:%.]], 8		; CHECK-NEXT: [[OR:%.]] = or i8 [[Y:%.]], 8
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i8 [[OR]], i8 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i8 [[OR]], i8 [[Y]]
; CHECK-NEXT: ret i8 [[SELECT]]		; CHECK-NEXT: ret i8 [[SELECT]]
;		;
%and = and i32 %x, 1073741824		%and = and i32 %x, 1073741824
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i8 %y, 8		%or = or i8 %y, 8
%select = select i1 %cmp, i8 %y, i8 %or		%select = select i1 %cmp, i8 %y, i8 %or
ret i8 %select		ret i8 %select
}		}

define i8 @select_icmp_ne_0_and_1073741824_xor_8(i32 %x, i8 %y) {		define i8 @select_icmp_ne_0_and_1073741824_xor_8(i32 %x, i8 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_1073741824_xor_8(		; CHECK-LABEL: @select_icmp_ne_0_and_1073741824_xor_8(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 1073741824		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 1073741824
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i8 [[Y:%.]], 8		; CHECK-NEXT: [[XOR:%.]] = xor i8 [[Y:%.]], 8
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i8 [[XOR]], i8 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i8 [[XOR]], i8 [[Y]]
; CHECK-NEXT: ret i8 [[SELECT]]		; CHECK-NEXT: ret i8 [[SELECT]]
;		;
%and = and i32 %x, 1073741824		%and = and i32 %x, 1073741824
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i8 %y, 8		%xor = xor i8 %y, 8
%select = select i1 %cmp, i8 %y, i8 %xor		%select = select i1 %cmp, i8 %y, i8 %xor
ret i8 %select		ret i8 %select
}		}

define i8 @select_icmp_ne_0_and_1073741824_and_not_8(i32 %x, i8 %y) {		define i8 @select_icmp_ne_0_and_1073741824_and_not_8(i32 %x, i8 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_1073741824_and_not_8(		; CHECK-LABEL: @select_icmp_ne_0_and_1073741824_and_not_8(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 1073741824		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 1073741824
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i8 [[Y:%.]], -9		; CHECK-NEXT: [[AND2:%.]] = and i8 [[Y:%.]], -9
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i8 [[AND2]], i8 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i8 [[AND2]], i8 [[Y]]
; CHECK-NEXT: ret i8 [[SELECT]]		; CHECK-NEXT: ret i8 [[SELECT]]
;		;
%and = and i32 %x, 1073741824		%and = and i32 %x, 1073741824
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i8 %y, -9		%and2 = and i8 %y, -9
%select = select i1 %cmp, i8 %y, i8 %and2		%select = select i1 %cmp, i8 %y, i8 %and2
ret i8 %select		ret i8 %select
}		}

define i32 @select_icmp_ne_0_and_8_or_1073741824(i8 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_8_or_1073741824(i8 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_8_or_1073741824(		; CHECK-LABEL: @select_icmp_ne_0_and_8_or_1073741824(
; CHECK-NEXT: [[AND:%.]] = and i8 [[X:%.]], 8		; CHECK-NEXT: [[AND:%.]] = and i8 [[X:%.]], 8
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i8 [[AND]], 0
; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 1073741824		; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 1073741824
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[OR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[OR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i8 %x, 8		%and = and i8 %x, 8
%cmp = icmp ne i8 0, %and		%cmp = icmp ne i8 0, %and
%or = or i32 %y, 1073741824		%or = or i32 %y, 1073741824
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_8_xor_1073741824(i8 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_8_xor_1073741824(i8 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_8_xor_1073741824(		; CHECK-LABEL: @select_icmp_ne_0_and_8_xor_1073741824(
; CHECK-NEXT: [[AND:%.]] = and i8 [[X:%.]], 8		; CHECK-NEXT: [[AND:%.]] = and i8 [[X:%.]], 8
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i8 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 1073741824		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 1073741824
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i8 %x, 8		%and = and i8 %x, 8
%cmp = icmp ne i8 0, %and		%cmp = icmp ne i8 0, %and
%xor = xor i32 %y, 1073741824		%xor = xor i32 %y, 1073741824
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}

define i32 @select_icmp_ne_0_and_8_and_not_1073741824(i8 %x, i32 %y) {		define i32 @select_icmp_ne_0_and_8_and_not_1073741824(i8 %x, i32 %y) {
; CHECK-LABEL: @select_icmp_ne_0_and_8_and_not_1073741824(		; CHECK-LABEL: @select_icmp_ne_0_and_8_and_not_1073741824(
; CHECK-NEXT: [[AND:%.]] = and i8 [[X:%.]], 8		; CHECK-NEXT: [[AND:%.]] = and i8 [[X:%.]], 8
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i8 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i8 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -1073741825		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -1073741825
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[AND2]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i8 %x, 8		%and = and i8 %x, 8
%cmp = icmp ne i8 0, %and		%cmp = icmp ne i8 0, %and
%and2 = and i32 %y, -1073741825		%and2 = and i32 %y, -1073741825
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
ret i32 %select		ret i32 %select
}		}
▲ Show 20 Lines • Show All 236 Lines • ▼ Show 20 Lines	;
%or = or <2 x i32> %y, <i32 2, i32 2>		%or = or <2 x i32> %y, <i32 2, i32 2>
%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or		%select = select <2 x i1> %cmp, <2 x i32> %y, <2 x i32> %or
ret <2 x i32> %select		ret <2 x i32> %select
}		}

define i32 @test69_xor(i32 %x, i32 %y) {		define i32 @test69_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @test69_xor(		; CHECK-LABEL: @test69_xor(
; CHECK-NEXT: [[TMP1:%.]] = trunc i32 [[X:%.]] to i8		; CHECK-NEXT: [[TMP1:%.]] = trunc i32 [[X:%.]] to i8
; CHECK-NEXT: [[CMP:%.*]] = icmp slt i8 [[TMP1]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp sgt i8 [[TMP1]], -1
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 128		%and = and i32 %x, 128
%cmp = icmp ne i32 %and, 0		%cmp = icmp ne i32 %and, 0
%xor = xor i32 %y, 2		%xor = xor i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
ret i32 %select		ret i32 %select
}		}

define i32 @test69_and(i32 %x, i32 %y) {		define i32 @test69_and(i32 %x, i32 %y) {
; CHECK-LABEL: @test69_and(		; CHECK-LABEL: @test69_and(
; CHECK-NEXT: [[TMP1:%.]] = trunc i32 [[X:%.]] to i8		; CHECK-NEXT: [[TMP1:%.]] = trunc i32 [[X:%.]] to i8
; CHECK-NEXT: [[CMP:%.*]] = icmp slt i8 [[TMP1]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp sgt i8 [[TMP1]], -1
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], 2		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], 2
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[AND2]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: ret i32 [[SELECT]]		; CHECK-NEXT: ret i32 [[SELECT]]
;		;
%and = and i32 %x, 128		%and = and i32 %x, 128
%cmp = icmp ne i32 %and, 0		%cmp = icmp ne i32 %and, 0
%and2 = and i32 %y, 2		%and2 = and i32 %y, 2
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
ret i32 %select		ret i32 %select
}		}
▲ Show 20 Lines • Show All 127 Lines • ▼ Show 20 Lines	;
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%res = mul i32 %select, %or ; to bump up use count of the Or		%res = mul i32 %select, %or ; to bump up use count of the Or
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_xor(i32 %x, i32 %y) {		define i32 @no_shift_xor_multiuse_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @no_shift_xor_multiuse_xor(		; CHECK-LABEL: @no_shift_xor_multiuse_xor(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%res = mul i32 %select, %xor ; to bump up use count of the Xor		%res = mul i32 %select, %xor ; to bump up use count of the Xor
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_and(i32 %x, i32 %y) {		define i32 @no_shift_xor_multiuse_and(i32 %x, i32 %y) {
; CHECK-LABEL: @no_shift_xor_multiuse_and(		; CHECK-LABEL: @no_shift_xor_multiuse_and(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[AND2]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[AND2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[AND2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -4097		%and2 = and i32 %y, -4097
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
%res = mul i32 %select, %and2 ; to bump up use count of the And		%res = mul i32 %select, %and2 ; to bump up use count of the And
ret i32 %res		ret i32 %res
}		}

define i32 @shift_xor_multiuse_or(i32 %x, i32 %y) {		define i32 @shift_xor_multiuse_or(i32 %x, i32 %y) {
; CHECK-LABEL: @shift_xor_multiuse_or(		; CHECK-LABEL: @shift_xor_multiuse_or(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 2048		; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[OR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[OR]], i32 [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[OR]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[OR]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i32 %y, 2048		%or = or i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%res = mul i32 %select, %or ; to bump up use count of the Or		%res = mul i32 %select, %or ; to bump up use count of the Or
ret i32 %res		ret i32 %res
}		}

define i32 @shift_xor_multiuse_xor(i32 %x, i32 %y) {		define i32 @shift_xor_multiuse_xor(i32 %x, i32 %y) {
; CHECK-LABEL: @shift_xor_multiuse_xor(		; CHECK-LABEL: @shift_xor_multiuse_xor(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2048		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[XOR]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 2048		%xor = xor i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%res = mul i32 %select, %xor ; to bump up use count of the Xor		%res = mul i32 %select, %xor ; to bump up use count of the Xor
ret i32 %res		ret i32 %res
}		}

define i32 @shift_xor_multiuse_and(i32 %x, i32 %y) {		define i32 @shift_xor_multiuse_and(i32 %x, i32 %y) {
; CHECK-LABEL: @shift_xor_multiuse_and(		; CHECK-LABEL: @shift_xor_multiuse_and(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -2049		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -2049
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[AND2]], i32 [[Y]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[AND2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[AND2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -2049		%and2 = and i32 %y, -2049
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
%res = mul i32 %select, %and2 ; to bump up use count of the and		%res = mul i32 %select, %and2 ; to bump up use count of the and
▲ Show 20 Lines • Show All 111 Lines • ▼ Show 20 Lines	;
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_cmp(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[AND]], 4096		; CHECK-NEXT: [[TMP1:%.*]] = xor i32 [[AND]], 4096
; CHECK-NEXT: [[TMP2:%.]] = or i32 [[TMP1]], [[Y:%.]]		; CHECK-NEXT: [[TMP2:%.]] = or i32 [[TMP1]], [[Y:%.]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[W:%.]], i32 [[Z:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[TMP2]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[TMP2]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i32 %y, 4096		%or = or i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp_with_xor(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp_with_xor(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @no_shift_xor_multiuse_cmp_with_and(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp_with_and(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp_with_and(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp_with_and(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[AND2]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -4097		%and2 = and i32 %y, -4097
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @shift_xor_multiuse_cmp(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_xor_multiuse_cmp(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_xor_multiuse_cmp(		; CHECK-LABEL: @shift_xor_multiuse_cmp(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 2048		; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[OR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[OR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i32 %y, 2048		%or = or i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @shift_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_xor_multiuse_cmp_with_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_xor_multiuse_cmp_with_xor(		; CHECK-LABEL: @shift_xor_multiuse_cmp_with_xor(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2048		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 2048		%xor = xor i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
ret i32 %res		ret i32 %res
}		}

define i32 @shift_xor_multiuse_cmp_with_and(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_xor_multiuse_cmp_with_and(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_xor_multiuse_cmp_with_and(		; CHECK-LABEL: @shift_xor_multiuse_cmp_with_and(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -2049		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -2049
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[AND2]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: ret i32 [[RES]]		; CHECK-NEXT: ret i32 [[RES]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -2049		%and2 = and i32 %y, -2049
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
▲ Show 20 Lines • Show All 125 Lines • ▼ Show 20 Lines	;
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
%res2 = mul i32 %res, %and2 ; to bump up the use count of the and		%res2 = mul i32 %res, %and2 ; to bump up the use count of the and
ret i32 %res2		ret i32 %res2
}		}

define i32 @no_shift_xor_multiuse_cmp_or(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp_or(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp_or(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp_or(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 4096		; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[OR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[OR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[OR]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[OR]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i32 %y, 4096		%or = or i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
%res2 = mul i32 %res, %or ; to bump up the use count of the or		%res2 = mul i32 %res, %or ; to bump up the use count of the or
ret i32 %res2		ret i32 %res2
}		}

define i32 @no_shift_xor_multiuse_cmp_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp_xor(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp_xor(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 4096
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[XOR]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 4096		%xor = xor i32 %y, 4096
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
%res2 = mul i32 %res, %xor ; to bump up the use count of the xor		%res2 = mul i32 %res, %xor ; to bump up the use count of the xor
ret i32 %res2		ret i32 %res2
}		}

define i32 @no_shift_xor_multiuse_cmp_and(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @no_shift_xor_multiuse_cmp_and(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @no_shift_xor_multiuse_cmp_and(		; CHECK-LABEL: @no_shift_xor_multiuse_cmp_and(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], -4097
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[AND2]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[AND2]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[AND2]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, -4097		%and2 = and i32 %y, -4097
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
%res2 = mul i32 %res, %and2 ; to bump up the use count of the and		%res2 = mul i32 %res, %and2 ; to bump up the use count of the and
ret i32 %res2		ret i32 %res2
}		}

define i32 @shift_xor_multiuse_cmp_or(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_xor_multiuse_cmp_or(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_xor_multiuse_cmp_or(		; CHECK-LABEL: @shift_xor_multiuse_cmp_or(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 2048		; CHECK-NEXT: [[OR:%.]] = or i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[OR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[OR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[OR]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[OR]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%or = or i32 %y, 2048		%or = or i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %or		%select = select i1 %cmp, i32 %y, i32 %or
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
%res2 = mul i32 %res, %or ; to bump up the use count of the or		%res2 = mul i32 %res, %or ; to bump up the use count of the or
ret i32 %res2		ret i32 %res2
}		}

define i32 @shift_xor_multiuse_cmp_xor(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_xor_multiuse_cmp_xor(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_xor_multiuse_cmp_xor(		; CHECK-LABEL: @shift_xor_multiuse_cmp_xor(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2048		; CHECK-NEXT: [[XOR:%.]] = xor i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[XOR]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[XOR]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[XOR]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[XOR]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%xor = xor i32 %y, 2048		%xor = xor i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %xor		%select = select i1 %cmp, i32 %y, i32 %xor
%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp		%select2 = select i1 %cmp, i32 %z, i32 %w ; to bump up use count of the cmp
%res = mul i32 %select, %select2		%res = mul i32 %select, %select2
%res2 = mul i32 %res, %xor ; to bump up the use count of the xor		%res2 = mul i32 %res, %xor ; to bump up the use count of the xor
ret i32 %res2		ret i32 %res2
}		}

define i32 @shift_xor_multiuse_cmp_and(i32 %x, i32 %y, i32 %z, i32 %w) {		define i32 @shift_xor_multiuse_cmp_and(i32 %x, i32 %y, i32 %z, i32 %w) {
; CHECK-LABEL: @shift_xor_multiuse_cmp_and(		; CHECK-LABEL: @shift_xor_multiuse_cmp_and(
; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096		; CHECK-NEXT: [[AND:%.]] = and i32 [[X:%.]], 4096
; CHECK-NEXT: [[CMP:%.*]] = icmp ne i32 [[AND]], 0		; CHECK-NEXT: [[CMP_NOT:%.*]] = icmp eq i32 [[AND]], 0
; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], 2048		; CHECK-NEXT: [[AND2:%.]] = and i32 [[Y:%.]], 2048
; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP]], i32 [[Y]], i32 [[AND2]]		; CHECK-NEXT: [[SELECT:%.*]] = select i1 [[CMP_NOT]], i32 [[AND2]], i32 [[Y]]
; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP]], i32 [[Z:%.]], i32 [[W:%.*]]		; CHECK-NEXT: [[SELECT2:%.]] = select i1 [[CMP_NOT]], i32 [[W:%.]], i32 [[Z:%.*]]
; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]		; CHECK-NEXT: [[RES:%.*]] = mul i32 [[SELECT]], [[SELECT2]]
; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[AND2]]		; CHECK-NEXT: [[RES2:%.*]] = mul i32 [[RES]], [[AND2]]
; CHECK-NEXT: ret i32 [[RES2]]		; CHECK-NEXT: ret i32 [[RES2]]
;		;
%and = and i32 %x, 4096		%and = and i32 %x, 4096
%cmp = icmp ne i32 0, %and		%cmp = icmp ne i32 0, %and
%and2 = and i32 %y, 2048		%and2 = and i32 %y, 2048
%select = select i1 %cmp, i32 %y, i32 %and2		%select = select i1 %cmp, i32 %y, i32 %and2
▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

llvm/test/Transforms/InstCombine/shift.ll

Show First 20 Lines • Show All 422 Lines • ▼ Show 20 Lines	;
%cmp = icmp ne <2 x i8> %shr, zeroinitializer		%cmp = icmp ne <2 x i8> %shr, zeroinitializer
ret <2 x i1> %cmp		ret <2 x i1> %cmp
}		}

define i8 @test28a(i8 %x, i8 %y) {		define i8 @test28a(i8 %x, i8 %y) {
; CHECK-LABEL: @test28a(		; CHECK-LABEL: @test28a(
; CHECK-NEXT: entry:		; CHECK-NEXT: entry:
; CHECK-NEXT: [[I1:%.]] = lshr i8 [[X:%.]], 7		; CHECK-NEXT: [[I1:%.]] = lshr i8 [[X:%.]], 7
; CHECK-NEXT: [[COND1:%.*]] = icmp slt i8 [[X]], 0		; CHECK-NEXT: [[COND1_NOT:%.*]] = icmp sgt i8 [[X]], -1
; CHECK-NEXT: br i1 [[COND1]], label [[BB1:%.]], label [[BB2:%.]]		; CHECK-NEXT: br i1 [[COND1_NOT]], label [[BB2:%.]], label [[BB1:%.]]
; CHECK: bb1:		; CHECK: bb1:
; CHECK-NEXT: ret i8 [[I1]]		; CHECK-NEXT: ret i8 [[I1]]
; CHECK: bb2:		; CHECK: bb2:
; CHECK-NEXT: [[I2:%.]] = add i8 [[I1]], [[Y:%.]]		; CHECK-NEXT: [[I2:%.]] = add i8 [[I1]], [[Y:%.]]
; CHECK-NEXT: ret i8 [[I2]]		; CHECK-NEXT: ret i8 [[I2]]
;		;
entry:		entry:
; This shouldn't be transformed.		; This shouldn't be transformed.
▲ Show 20 Lines • Show All 1,246 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll

	; RUN: opt < %s -loop-unroll -unroll-runtime=true -unroll-runtime-epilog=true -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -S \| FileCheck %s -check-prefix=EPILOG-NO-IC			; RUN: opt < %s -loop-unroll -unroll-runtime=true -unroll-runtime-epilog=true -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -S \| FileCheck %s -check-prefix=EPILOG-NO-IC
				nikicUnsubmitted Not Done Reply Inline Actions I don't see what has changed here. Can you either precommit, or only adjust the changed part? (Not sure if whoever wrote this appreciates the large generated output.) nikic: I don't see what has changed here. Can you either precommit, or only adjust the changed part?
				lebedev.riAuthorUnsubmitted Not Done Reply Inline Actions Yeah, i'm not sure what to do with this test. I'm not even sure anything actually changed here, it is possible it just broke because it uses value names. It's just a bad test. lebedev.ri: Yeah, i'm not sure what to do with this test. I'm not even sure anything actually changed here…
	; RUN: opt < %s -loop-unroll -unroll-runtime=true -unroll-runtime-epilog=true -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine -S \| FileCheck %s -check-prefix=EPILOG			; RUN: opt < %s -loop-unroll -unroll-runtime=true -unroll-runtime-epilog=true -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine -S \| FileCheck %s -check-prefix=EPILOG
	; RUN: opt < %s -loop-unroll -unroll-runtime -unroll-count=2 -unroll-runtime-epilog=true -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine			; RUN: opt < %s -loop-unroll -unroll-runtime -unroll-count=2 -unroll-runtime-epilog=true -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine -S
	; RUN: opt < %s -loop-unroll -unroll-runtime=true -unroll-runtime-epilog=false -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine -S \| FileCheck %s -check-prefix=PROLOG			; RUN: opt < %s -loop-unroll -unroll-runtime=true -unroll-runtime-epilog=false -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine -S \| FileCheck %s -check-prefix=PROLOG
	; RUN: opt < %s -loop-unroll -unroll-runtime -unroll-runtime-epilog=false -unroll-count=2 -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine			; RUN: opt < %s -loop-unroll -unroll-runtime -unroll-runtime-epilog=false -unroll-count=2 -unroll-runtime-multi-exit=true -verify-loop-lcssa -verify-dom-info -verify-loop-info -instcombine -S

	; REQUIRES: asserts			; REQUIRES: asserts

	; the third and fifth RUNs generate an epilog/prolog remainder block for all the test			; the third and fifth RUNs generate an epilog/prolog remainder block for all the test
	; cases below (it does not generate a loop).			; cases below (it does not generate a loop).

	; test with three exiting and three exit blocks.			; test with three exiting and three exit blocks.
	; none of the exit blocks have successors			; none of the exit blocks have successors
	define void @test1(i64 %trip, i1 %cond) {			define void @test1(i64 %trip, i1 %cond) {
	; EPILOG: test1(			; EPILOG: test1(
	; EPILOG-NEXT: entry:			; EPILOG-NEXT: entry:
	; EPILOG-NEXT: [[TMP0:%.]] = add i64 [[TRIP:%.]], -1			; EPILOG-NEXT: [[TMP0:%.]] = add i64 [[TRIP:%.]], -1
	; EPILOG-NEXT: [[XTRAITER:%.*]] = and i64 [[TRIP]], 7			; EPILOG-NEXT: [[XTRAITER:%.*]] = and i64 [[TRIP]], 7
	; EPILOG-NEXT: [[TMP1:%.*]] = icmp ult i64 [[TMP0]], 7			; EPILOG-NEXT: [[TMP1:%.*]] = icmp ult i64 [[TMP0]], 7
	; EPILOG-NEXT: br i1 [[TMP1]], label %exit2.loopexit.unr-lcssa, label [[ENTRY_NEW:%.*]]			; EPILOG-NEXT: br i1 [[TMP1]], label %exit2.loopexit.unr-lcssa, label [[ENTRY_NEW:%.*]]
	; EPILOG: entry.new:			; EPILOG: entry.new:
	; EPILOG-NEXT: [[UNROLL_ITER:%.*]] = and i64 [[TRIP]], -8			; EPILOG-NEXT: [[UNROLL_ITER:%.*]] = and i64 [[TRIP]], -8
	; EPILOG-NEXT: br label [[LOOP_HEADER:%.*]]			; EPILOG-NEXT: br label [[LOOP_HEADER:%.*]]
	; EPILOG: loop_latch.epil:			; EPILOG: loop_latch.epil:
	; EPILOG-NEXT: %epil.iter.sub = add i64 %epil.iter, -1			; EPILOG-NEXT: %epil.iter.sub = add i64 %epil.iter, -1
	; EPILOG-NEXT: %epil.iter.cmp = icmp eq i64 %epil.iter.sub, 0			; EPILOG-NEXT: %epil.iter.cmp.not = icmp eq i64 %epil.iter.sub, 0
	; EPILOG-NEXT: br i1 %epil.iter.cmp, label %exit2.loopexit.epilog-lcssa, label %loop_header.epil			; EPILOG-NEXT: br i1 %epil.iter.cmp.not, label %exit2.loopexit.epilog-lcssa, label %loop_header.epil
	; EPILOG: loop_latch.7:			; EPILOG: loop_latch.7:
	; EPILOG-NEXT: %niter.nsub.7 = add i64 %niter, -8			; EPILOG-NEXT: %niter.nsub.7 = add i64 %niter, -8
	; EPILOG-NEXT: %niter.ncmp.7 = icmp eq i64 %niter.nsub.7, 0			; EPILOG-NEXT: %niter.ncmp.7.not = icmp eq i64 %niter.nsub.7, 0
	; EPILOG-NEXT: br i1 %niter.ncmp.7, label %exit2.loopexit.unr-lcssa.loopexit, label %loop_header			; EPILOG-NEXT: br i1 %niter.ncmp.7.not, label %exit2.loopexit.unr-lcssa.loopexit, label %loop_header

	; PROLOG: test1(			; PROLOG: test1(
	; PROLOG-NEXT: entry:			; PROLOG-NEXT: entry:
	; PROLOG-NEXT: [[TMP0:%.]] = add i64 [[TRIP:%.]], -1			; PROLOG-NEXT: [[TMP0:%.]] = add i64 [[TRIP:%.]], -1
	; PROLOG-NEXT: [[XTRAITER:%.*]] = and i64 [[TRIP]], 7			; PROLOG-NEXT: [[XTRAITER:%.*]] = and i64 [[TRIP]], 7
	; PROLOG-NEXT: [[TMP1:%.*]] = icmp eq i64 [[XTRAITER]], 0			; PROLOG-NEXT: [[TMP1:%.*]] = icmp eq i64 [[XTRAITER]], 0
	; PROLOG-NEXT: br i1 [[TMP1]], label %loop_header.prol.loopexit, label %loop_header.prol.preheader			; PROLOG-NEXT: br i1 [[TMP1]], label %loop_header.prol.loopexit, label %loop_header.prol.preheader
	; PROLOG: loop_header.prol:			; PROLOG: loop_header.prol:
	; PROLOG-NEXT: %iv.prol = phi i64 [ 0, %loop_header.prol.preheader ], [ %iv_next.prol, %loop_latch.prol ]			; PROLOG-NEXT: %iv.prol = phi i64 [ 0, %loop_header.prol.preheader ], [ %iv_next.prol, %loop_latch.prol ]
	; PROLOG-NEXT: %prol.iter = phi i64 [ [[XTRAITER]], %loop_header.prol.preheader ], [ %prol.iter.sub, %loop_latch.prol ]			; PROLOG-NEXT: %prol.iter = phi i64 [ [[XTRAITER]], %loop_header.prol.preheader ], [ %prol.iter.sub, %loop_latch.prol ]
	; PROLOG-NEXT: br i1 %cond, label %loop_latch.prol, label %loop_exiting_bb1.prol			; PROLOG-NEXT: br i1 %cond, label %loop_latch.prol, label %loop_exiting_bb1.prol
	; PROLOG: loop_latch.prol:			; PROLOG: loop_latch.prol:
	; PROLOG-NEXT: %iv_next.prol = add i64 %iv.prol, 1			; PROLOG-NEXT: %iv_next.prol = add i64 %iv.prol, 1
	; PROLOG-NEXT: %prol.iter.sub = add i64 %prol.iter, -1			; PROLOG-NEXT: %prol.iter.sub = add i64 %prol.iter, -1
	; PROLOG-NEXT: %prol.iter.cmp = icmp eq i64 %prol.iter.sub, 0			; PROLOG-NEXT: %prol.iter.cmp.not = icmp eq i64 %prol.iter.sub, 0
	; PROLOG-NEXT: br i1 %prol.iter.cmp, label %loop_header.prol.loopexit.unr-lcssa, label %loop_header.prol			; PROLOG-NEXT: br i1 %prol.iter.cmp.not, label %loop_header.prol.loopexit.unr-lcssa, label %loop_header.prol
	; PROLOG: loop_latch.7:			; PROLOG: loop_latch.7:
	; PROLOG-NEXT: %iv_next.7 = add i64 %iv, 8			; PROLOG-NEXT: %iv_next.7 = add i64 %iv, 8
	; PROLOG-NEXT: %cmp.7 = icmp eq i64 %iv_next.7, %trip			; PROLOG-NEXT: %cmp.7.not = icmp eq i64 %iv_next.7, %trip
	; PROLOG-NEXT: br i1 %cmp.7, label %exit2.loopexit.unr-lcssa, label %loop_header			; PROLOG-NEXT: br i1 %cmp.7.not, label %exit2.loopexit.unr-lcssa, label %loop_header
	entry:			entry:
	br label %loop_header			br label %loop_header

	loop_header:			loop_header:
	%iv = phi i64 [ 0, %entry ], [ %iv_next, %loop_latch ]			%iv = phi i64 [ 0, %entry ], [ %iv_next, %loop_latch ]
	br i1 %cond, label %loop_latch, label %loop_exiting_bb1			br i1 %cond, label %loop_latch, label %loop_exiting_bb1

	loop_exiting_bb1:			loop_exiting_bb1:
	▲ Show 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	; EPILOG: loop_header:			; EPILOG: loop_header:
	; EPILOG-NEXT: %sum = phi i64 [ 0, %entry.new ], [ %sum.next.7, %loop_latch.7 ]			; EPILOG-NEXT: %sum = phi i64 [ 0, %entry.new ], [ %sum.next.7, %loop_latch.7 ]
	; EPILOG-NEXT: %niter = phi i64 [ %unroll_iter, %entry.new ], [ %niter.nsub.7, %loop_latch.7 ]			; EPILOG-NEXT: %niter = phi i64 [ %unroll_iter, %entry.new ], [ %niter.nsub.7, %loop_latch.7 ]
	; EPILOG: loop_exiting_bb1.7:			; EPILOG: loop_exiting_bb1.7:
	; EPILOG-NEXT: switch i64 %sum.next.6, label %loop_latch.7			; EPILOG-NEXT: switch i64 %sum.next.6, label %loop_latch.7
	; EPILOG: loop_latch.7:			; EPILOG: loop_latch.7:
	; EPILOG-NEXT: %sum.next.7 = add i64 %sum.next.6, %add			; EPILOG-NEXT: %sum.next.7 = add i64 %sum.next.6, %add
	; EPILOG-NEXT: %niter.nsub.7 = add i64 %niter, -8			; EPILOG-NEXT: %niter.nsub.7 = add i64 %niter, -8
	; EPILOG-NEXT: %niter.ncmp.7 = icmp eq i64 %niter.nsub.7, 0			; EPILOG-NEXT: %niter.ncmp.7.not = icmp eq i64 %niter.nsub.7, 0
	; EPILOG-NEXT: br i1 %niter.ncmp.7, label %exit2.loopexit.unr-lcssa.loopexit, label %loop_header			; EPILOG-NEXT: br i1 %niter.ncmp.7.not, label %exit2.loopexit.unr-lcssa.loopexit, label %loop_header

	; PROLOG: test3(			; PROLOG: test3(
	; PROLOG-NEXT: entry:			; PROLOG-NEXT: entry:
	; PROLOG-NEXT: [[TMP0:%.]] = add i64 [[TRIP:%.]], -1			; PROLOG-NEXT: [[TMP0:%.]] = add i64 [[TRIP:%.]], -1
	; PROLOG-NEXT: [[XTRAITER:%.*]] = and i64 [[TRIP]], 7			; PROLOG-NEXT: [[XTRAITER:%.*]] = and i64 [[TRIP]], 7
	; PROLOG-NEXT: [[TMP1:%.*]] = icmp eq i64 [[XTRAITER]], 0			; PROLOG-NEXT: [[TMP1:%.*]] = icmp eq i64 [[XTRAITER]], 0
	; PROLOG-NEXT: br i1 [[TMP1]], label %loop_header.prol.loopexit, label %loop_header.prol.preheader			; PROLOG-NEXT: br i1 [[TMP1]], label %loop_header.prol.loopexit, label %loop_header.prol.preheader
	; PROLOG: loop_header:			; PROLOG: loop_header:
	; PROLOG-NEXT: %iv = phi i64 [ %iv.unr, %entry.new ], [ %iv_next.7, %loop_latch.7 ]			; PROLOG-NEXT: %iv = phi i64 [ %iv.unr, %entry.new ], [ %iv_next.7, %loop_latch.7 ]
	; PROLOG-NEXT: %sum = phi i64 [ %sum.unr, %entry.new ], [ %sum.next.7, %loop_latch.7 ]			; PROLOG-NEXT: %sum = phi i64 [ %sum.unr, %entry.new ], [ %sum.next.7, %loop_latch.7 ]
	; PROLOG: loop_exiting_bb1.7:			; PROLOG: loop_exiting_bb1.7:
	; PROLOG-NEXT: switch i64 %sum.next.6, label %loop_latch.7			; PROLOG-NEXT: switch i64 %sum.next.6, label %loop_latch.7
	; PROLOG: loop_latch.7:			; PROLOG: loop_latch.7:
	; PROLOG-NEXT: %iv_next.7 = add nuw nsw i64 %iv, 8			; PROLOG-NEXT: %iv_next.7 = add nuw nsw i64 %iv, 8
	; PROLOG-NEXT: %sum.next.7 = add i64 %sum.next.6, %add			; PROLOG-NEXT: %sum.next.7 = add i64 %sum.next.6, %add
	; PROLOG-NEXT: %cmp.7 = icmp eq i64 %iv_next.7, %trip			; PROLOG-NEXT: %cmp.7.not = icmp eq i64 %iv_next.7, %trip
	; PROLOG-NEXT: br i1 %cmp.7, label %exit2.loopexit.unr-lcssa, label %loop_header			; PROLOG-NEXT: br i1 %cmp.7.not, label %exit2.loopexit.unr-lcssa, label %loop_header
	entry:			entry:
	br label %loop_header			br label %loop_header

	loop_header:			loop_header:
	%iv = phi i64 [ 0, %entry ], [ %iv_next, %loop_latch ]			%iv = phi i64 [ 0, %entry ], [ %iv_next, %loop_latch ]
	%sum = phi i64 [ 0, %entry ], [ %sum.next, %loop_latch ]			%sum = phi i64 [ 0, %entry ], [ %sum.next, %loop_latch ]
	br i1 undef, label %loop_latch, label %loop_exiting_bb1			br i1 undef, label %loop_latch, label %loop_exiting_bb1

	▲ Show 20 Lines • Show All 383 Lines • ▼ Show 20 Lines
	; branches back to the outer loop.			; branches back to the outer loop.
	define void @test8() {			define void @test8() {
	; EPILOG: test8(			; EPILOG: test8(
	; EPILOG-NOT: niter			; EPILOG-NOT: niter

	; PROLOG: test8(			; PROLOG: test8(
	; PROLOG: outerloop:			; PROLOG: outerloop:
	; PROLOG-NEXT: phi i64 [ 3, %bb ], [ 0, %outerloop.loopexit ]			; PROLOG-NEXT: phi i64 [ 3, %bb ], [ 0, %outerloop.loopexit ]
	; PROLOG: %lcmp.mod = icmp eq i64			; PROLOG: %lcmp.mod.not = icmp eq i64
	; PROLOG-NEXT: br i1 %lcmp.mod, label %innerH.prol.loopexit, label %innerH.prol.preheader			; PROLOG-NEXT: br i1 %lcmp.mod.not, label %innerH.prol.loopexit, label %innerH.prol.preheader
	; PROLOG: latch.6:			; PROLOG: latch.6:
	; PROLOG-NEXT: br i1 false, label %outerloop.loopexit.loopexit, label %latch.7			; PROLOG-NEXT: br i1 false, label %outerloop.loopexit.loopexit, label %latch.7
	; PROLOG: latch.7:			; PROLOG: latch.7:
	; PROLOG-NEXT: %i4.7 = add nuw nsw i64 %i3, 8			; PROLOG-NEXT: %i4.7 = add nuw nsw i64 %i3, 8
	; PROLOG-NEXT: %i6.7 = icmp ult i64 %i3, 92			; PROLOG-NEXT: %i6.7 = icmp ult i64 %i3, 92
	; PROLOG-NEXT: br i1 %i6.7, label %innerH, label %exit.unr-lcssa			; PROLOG-NEXT: br i1 %i6.7, label %innerH, label %exit.unr-lcssa
	bb:			bb:
	br label %outerloop			br label %outerloop
	Show All 20 Lines
	; a value from outer loop is used in exit block of inner loop.			; a value from outer loop is used in exit block of inner loop.
	; Don't create VMap entries for such values (%trip).			; Don't create VMap entries for such values (%trip).
	define i8 addrspace(1)* @test9(i8* nocapture readonly %arg, i32 %n) {			define i8 addrspace(1)* @test9(i8* nocapture readonly %arg, i32 %n) {
	; PROLOG: test9(			; PROLOG: test9(
	; PROLOG: header.prol:			; PROLOG: header.prol:
	; PROLOG-NEXT: %phi.prol = phi i64 [ 0, %header.prol.preheader ], [ %iv.next.prol, %latch.prol ]			; PROLOG-NEXT: %phi.prol = phi i64 [ 0, %header.prol.preheader ], [ %iv.next.prol, %latch.prol ]
	; PROLOG: latch.prol:			; PROLOG: latch.prol:
	; PROLOG-NOT: trip			; PROLOG-NOT: trip
	; PROLOG: br i1 %prol.iter.cmp, label %header.prol.loopexit.unr-lcssa, label %header.prol			; PROLOG: br i1 %prol.iter.cmp.not, label %header.prol.loopexit.unr-lcssa, label %header.prol
	bb:			bb:
	br label %outerloopHdr			br label %outerloopHdr

	outerloopHdr: ; preds = %outerLatch, %bb			outerloopHdr: ; preds = %outerLatch, %bb
	%trip = add i32 %n, -1			%trip = add i32 %n, -1
	%outercnd = icmp slt i32 0, %trip			%outercnd = icmp slt i32 0, %trip
	br i1 %outercnd, label %preheader, label %outerLatch			br i1 %outercnd, label %preheader, label %outerLatch

	Show All 22 Lines

llvm/test/Transforms/PGOProfile/chr.ll

	Show All 30 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP4:%.*]] = icmp eq i32 [[TMP3]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP3]], 0
	; CHECK-NEXT: br i1 [[TMP4]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP5]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 7			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 7
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB5:%.*]]			; CHECK-NEXT: br label [[BB5:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP4:%.*]] = icmp eq i32 [[TMP3]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP3]], 0
	; CHECK-NEXT: br i1 [[TMP4]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB2_NONCHR:%.]], label [[BB3_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[TMP5]], label [[BB2_NONCHR:%.]], label [[BB3_NONCHR:%.]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: call void @bar()			; CHECK-NEXT: call void @bar()
	; CHECK-NEXT: br label [[BB3_NONCHR]]			; CHECK-NEXT: br label [[BB3_NONCHR]]
	; CHECK: bb3.nonchr:			; CHECK: bb3.nonchr:
	; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP0]], 4			; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 4
	; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 0			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
	; CHECK-NEXT: br i1 [[TMP8]], label [[BB5]], label [[BB4_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP7]], label [[BB5]], label [[BB4_NONCHR:%.*]], !prof !16
	; CHECK: bb4.nonchr:			; CHECK: bb4.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB5]]			; CHECK-NEXT: br label [[BB5]]
	; CHECK: bb5:			; CHECK: bb5:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB1:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB1:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb1:			; CHECK: bb1:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB4:%.*]]			; CHECK-NEXT: br label [[BB4:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 255			; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 255
	; CHECK-NEXT: [[TMP4:%.*]] = icmp eq i32 [[TMP3]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP3]], 0
	; CHECK-NEXT: br i1 [[TMP4]], label [[BB4]], label [[BB0_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB4]], label [[BB0_NONCHR:%.*]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB2_NONCHR:%.]], label [[BB1_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[TMP5]], label [[BB2_NONCHR:%.]], label [[BB1_NONCHR:%.]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 0			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
	; CHECK-NEXT: br i1 [[TMP8]], label [[BB4]], label [[BB3_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP7]], label [[BB4]], label [[BB3_NONCHR:%.*]], !prof !16
	; CHECK: bb3.nonchr:			; CHECK: bb3.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB4]]			; CHECK-NEXT: br label [[BB4]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB2_NONCHR]]			; CHECK-NEXT: br label [[BB2_NONCHR]]
	; CHECK: bb4:			; CHECK: bb4:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP4:%.*]] = icmp eq i32 [[TMP3]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP3]], 0
	; CHECK-NEXT: br i1 [[TMP4]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP5]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[TMP7:%.]] = load i32, i32 [[I]], align 4			; CHECK-NEXT: [[TMP6:%.]] = load i32, i32 [[I]], align 4
	; CHECK-NEXT: [[TMP8:%.*]] = and i32 [[TMP7]], 12			; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP6]], 12
	; CHECK-NEXT: [[TMP9:%.*]] = icmp eq i32 [[TMP8]], 12			; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 12
	; CHECK-NEXT: br i1 [[TMP9]], label [[BB4:%.]], label [[BB3_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP8]], label [[BB4:%.]], label [[BB3_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb4:			; CHECK: bb4:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB7:%.*]]			; CHECK-NEXT: br label [[BB7:%.*]]
	; CHECK: bb3.split.nonchr:			; CHECK: bb3.split.nonchr:
	; CHECK-NEXT: [[TMP10:%.*]] = and i32 [[TMP7]], 4			; CHECK-NEXT: [[TMP9:%.*]] = and i32 [[TMP6]], 4
	; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i32 [[TMP10]], 0			; CHECK-NEXT: [[DOTNOT1:%.*]] = icmp eq i32 [[TMP9]], 0
	; CHECK-NEXT: br i1 [[TMP11]], label [[BB5_NONCHR:%.]], label [[BB4_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT1]], label [[BB5_NONCHR:%.]], label [[BB4_NONCHR:%.]], !prof !16
	; CHECK: bb4.nonchr:			; CHECK: bb4.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB5_NONCHR]]			; CHECK-NEXT: br label [[BB5_NONCHR]]
	; CHECK: bb5.nonchr:			; CHECK: bb5.nonchr:
	; CHECK-NEXT: [[TMP12:%.*]] = and i32 [[TMP7]], 8			; CHECK-NEXT: [[TMP10:%.*]] = and i32 [[TMP6]], 8
	; CHECK-NEXT: [[TMP13:%.*]] = icmp eq i32 [[TMP12]], 0			; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i32 [[TMP10]], 0
	; CHECK-NEXT: br i1 [[TMP13]], label [[BB7]], label [[BB6_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP11]], label [[BB7]], label [[BB6_NONCHR:%.*]], !prof !16
	; CHECK: bb6.nonchr:			; CHECK: bb6.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB7]]			; CHECK-NEXT: br label [[BB7]]
	; CHECK: bb7:			; CHECK: bb7:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 3
	; CHECK-NEXT: br i1 [[TMP2]], label [[ENTRY_SPLIT:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[ENTRY_SPLIT:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: entry.split:			; CHECK: entry.split:
	; CHECK-NEXT: [[TMP3:%.]] = add i32 [[SUM0:%.]], 85			; CHECK-NEXT: [[TMP3:%.]] = add i32 [[SUM0:%.]], 85
	; CHECK-NEXT: ret i32 [[TMP3]]			; CHECK-NEXT: ret i32 [[TMP3]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP4:%.*]] = add i32 [[SUM0]], 42			; CHECK-NEXT: [[TMP4:%.*]] = add i32 [[SUM0]], 42
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP5]], 0
	; CHECK-NEXT: [[SUM1_NONCHR:%.*]] = select i1 [[TMP6]], i32 [[SUM0]], i32 [[TMP4]], !prof !16			; CHECK-NEXT: [[SUM1_NONCHR:%.*]] = select i1 [[DOTNOT]], i32 [[SUM0]], i32 [[TMP4]], !prof !16
	; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 0			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
	; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[SUM1_NONCHR]], 43			; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[SUM1_NONCHR]], 43
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP8]], i32 [[SUM1_NONCHR]], i32 [[TMP9]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP7]], i32 [[SUM1_NONCHR]], i32 [[TMP8]], !prof !16
	; CHECK-NEXT: ret i32 [[SUM2_NONCHR]]			; CHECK-NEXT: ret i32 [[SUM2_NONCHR]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 1			%1 = and i32 %0, 1
	%2 = icmp eq i32 %1, 0			%2 = icmp eq i32 %1, 0
	%3 = add i32 %sum0, 42			%3 = add i32 %sum0, 42
	%sum1 = select i1 %2, i32 %sum0, i32 %3, !prof !15			%sum1 = select i1 %2, i32 %sum0, i32 %3, !prof !15
	Show All 37 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 15			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 15
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: [[TMP3:%.]] = add i32 [[SUM0:%.]], 85			; CHECK-NEXT: [[TMP3:%.]] = add i32 [[SUM0:%.]], 85
	; CHECK-NEXT: [[TMP4:%.*]] = add i32 [[SUM0]], 173			; CHECK-NEXT: [[TMP4:%.*]] = add i32 [[SUM0]], 173
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 255			; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 255
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP5]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 0			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
	; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[SUM0]], 42			; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[SUM0]], 42
	; CHECK-NEXT: [[SUM1_NONCHR:%.*]] = select i1 [[TMP8]], i32 [[SUM0]], i32 [[TMP9]], !prof !16			; CHECK-NEXT: [[SUM1_NONCHR:%.*]] = select i1 [[TMP7]], i32 [[SUM0]], i32 [[TMP8]], !prof !16
	; CHECK-NEXT: [[TMP10:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP9:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i32 [[TMP10]], 0			; CHECK-NEXT: [[TMP10:%.*]] = icmp eq i32 [[TMP9]], 0
	; CHECK-NEXT: [[TMP12:%.*]] = add i32 [[SUM1_NONCHR]], 43			; CHECK-NEXT: [[TMP11:%.*]] = add i32 [[SUM1_NONCHR]], 43
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP11]], i32 [[SUM1_NONCHR]], i32 [[TMP12]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP10]], i32 [[SUM1_NONCHR]], i32 [[TMP11]], !prof !16
	; CHECK-NEXT: [[TMP13:%.*]] = and i32 [[TMP0]], 4			; CHECK-NEXT: [[TMP12:%.*]] = and i32 [[TMP0]], 4
	; CHECK-NEXT: [[TMP14:%.*]] = icmp eq i32 [[TMP13]], 0			; CHECK-NEXT: [[TMP13:%.*]] = icmp eq i32 [[TMP12]], 0
	; CHECK-NEXT: [[TMP15:%.*]] = and i32 [[TMP0]], 8			; CHECK-NEXT: [[TMP14:%.*]] = and i32 [[TMP0]], 8
	; CHECK-NEXT: [[TMP16:%.*]] = icmp eq i32 [[TMP15]], 0			; CHECK-NEXT: [[TMP15:%.*]] = icmp eq i32 [[TMP14]], 0
	; CHECK-NEXT: [[SUM4_NONCHR_V:%.*]] = select i1 [[TMP16]], i32 44, i32 88			; CHECK-NEXT: [[SUM4_NONCHR_V:%.*]] = select i1 [[TMP15]], i32 44, i32 88
	; CHECK-NEXT: [[SUM4_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], [[SUM4_NONCHR_V]]			; CHECK-NEXT: [[SUM4_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], [[SUM4_NONCHR_V]]
	; CHECK-NEXT: [[SUM5_NONCHR:%.*]] = select i1 [[TMP14]], i32 [[SUM2_NONCHR]], i32 [[SUM4_NONCHR]], !prof !16			; CHECK-NEXT: [[SUM5_NONCHR:%.*]] = select i1 [[TMP13]], i32 [[SUM2_NONCHR]], i32 [[SUM4_NONCHR]], !prof !16
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[SUM6:%.*]] = phi i32 [ [[TMP4]], [[BB0]] ], [ [[SUM0]], [[ENTRY_SPLIT_NONCHR]] ], [ [[SUM5_NONCHR]], [[BB0_NONCHR]] ]			; CHECK-NEXT: [[SUM6:%.*]] = phi i32 [ [[TMP4]], [[BB0]] ], [ [[SUM0]], [[ENTRY_SPLIT_NONCHR]] ], [ [[SUM5_NONCHR]], [[BB0_NONCHR]] ]
	; CHECK-NEXT: ret i32 [[SUM6]]			; CHECK-NEXT: ret i32 [[SUM6]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 255			%1 = and i32 %0, 255
	▲ Show 20 Lines • Show All 66 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP5:%.*]] = and i1 [[TMP4]], [[TMP2]]			; CHECK-NEXT: [[TMP5:%.*]] = and i1 [[TMP4]], [[TMP2]]
	; CHECK-NEXT: br i1 [[TMP5]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP5]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: [[TMP6:%.*]] = add i32 [[SUM0]], 85			; CHECK-NEXT: [[TMP6:%.*]] = add i32 [[SUM0]], 85
	; CHECK-NEXT: [[TMP7:%.*]] = add i32 [[SUM0]], 173			; CHECK-NEXT: [[TMP7:%.*]] = add i32 [[SUM0]], 173
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP8:%.*]] = and i32 [[TMP0]], 255			; CHECK-NEXT: [[TMP8:%.*]] = and i32 [[TMP0]], 255
	; CHECK-NEXT: [[TMP9:%.*]] = icmp eq i32 [[TMP8]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP8]], 0
	; CHECK-NEXT: br i1 [[TMP9]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: [[TMP10:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP9:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i32 [[TMP10]], 0			; CHECK-NEXT: [[TMP10:%.*]] = icmp eq i32 [[TMP9]], 0
	; CHECK-NEXT: [[TMP12:%.*]] = add i32 [[SUM0]], 42			; CHECK-NEXT: [[TMP11:%.*]] = add i32 [[SUM0]], 42
	; CHECK-NEXT: [[SUM1_NONCHR:%.*]] = select i1 [[TMP11]], i32 [[SUM0]], i32 [[TMP12]], !prof !16			; CHECK-NEXT: [[SUM1_NONCHR:%.*]] = select i1 [[TMP10]], i32 [[SUM0]], i32 [[TMP11]], !prof !16
	; CHECK-NEXT: [[TMP13:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP12:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP14:%.*]] = icmp eq i32 [[TMP13]], 0			; CHECK-NEXT: [[TMP13:%.*]] = icmp eq i32 [[TMP12]], 0
	; CHECK-NEXT: [[TMP15:%.*]] = add i32 [[SUM1_NONCHR]], 43			; CHECK-NEXT: [[TMP14:%.*]] = add i32 [[SUM1_NONCHR]], 43
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP14]], i32 [[SUM1_NONCHR]], i32 [[TMP15]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP13]], i32 [[SUM1_NONCHR]], i32 [[TMP14]], !prof !16
	; CHECK-NEXT: [[TMP16:%.*]] = and i32 [[SUM0]], 4			; CHECK-NEXT: [[TMP15:%.*]] = and i32 [[SUM0]], 4
	; CHECK-NEXT: [[TMP17:%.*]] = icmp eq i32 [[TMP16]], 0			; CHECK-NEXT: [[TMP16:%.*]] = icmp eq i32 [[TMP15]], 0
	; CHECK-NEXT: [[TMP18:%.*]] = and i32 [[TMP0]], 8			; CHECK-NEXT: [[TMP17:%.*]] = and i32 [[TMP0]], 8
	; CHECK-NEXT: [[TMP19:%.*]] = icmp eq i32 [[TMP18]], 0			; CHECK-NEXT: [[TMP18:%.*]] = icmp eq i32 [[TMP17]], 0
	; CHECK-NEXT: [[SUM4_NONCHR_V:%.*]] = select i1 [[TMP19]], i32 44, i32 88			; CHECK-NEXT: [[SUM4_NONCHR_V:%.*]] = select i1 [[TMP18]], i32 44, i32 88
	; CHECK-NEXT: [[SUM4_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], [[SUM4_NONCHR_V]]			; CHECK-NEXT: [[SUM4_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], [[SUM4_NONCHR_V]]
	; CHECK-NEXT: [[SUM5_NONCHR:%.*]] = select i1 [[TMP17]], i32 [[SUM2_NONCHR]], i32 [[SUM4_NONCHR]], !prof !16			; CHECK-NEXT: [[SUM5_NONCHR:%.*]] = select i1 [[TMP16]], i32 [[SUM2_NONCHR]], i32 [[SUM4_NONCHR]], !prof !16
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[SUM6:%.*]] = phi i32 [ [[TMP7]], [[BB0]] ], [ [[SUM0]], [[ENTRY_SPLIT_NONCHR]] ], [ [[SUM5_NONCHR]], [[BB0_NONCHR]] ]			; CHECK-NEXT: [[SUM6:%.*]] = phi i32 [ [[TMP7]], [[BB0]] ], [ [[SUM0]], [[ENTRY_SPLIT_NONCHR]] ], [ [[SUM5_NONCHR]], [[BB0_NONCHR]] ]
	; CHECK-NEXT: ret i32 [[SUM6]]			; CHECK-NEXT: ret i32 [[SUM6]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 255			%1 = and i32 %0, 255
	▲ Show 20 Lines • Show All 67 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = and i1 [[TMP1]], [[V10]]			; CHECK-NEXT: [[TMP2:%.*]] = and i1 [[TMP1]], [[V10]]
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: [[V8:%.]] = add i32 [[SUM0:%.]], 43			; CHECK-NEXT: [[V8:%.]] = add i32 [[SUM0:%.]], 43
	; CHECK-NEXT: [[V13:%.*]] = add i32 [[SUM0]], 131			; CHECK-NEXT: [[V13:%.*]] = add i32 [[SUM0]], 131
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[V1:%.*]] = and i32 [[I0]], 255			; CHECK-NEXT: [[V1:%.*]] = and i32 [[I0]], 255
	; CHECK-NEXT: [[V2:%.*]] = icmp eq i32 [[V1]], 0			; CHECK-NEXT: [[V2_NOT:%.*]] = icmp eq i32 [[V1]], 0
	; CHECK-NEXT: br i1 [[V2]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[V2_NOT]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: [[V3_NONCHR:%.*]] = and i32 [[I0]], 2			; CHECK-NEXT: [[V3_NONCHR:%.*]] = and i32 [[I0]], 2
	; CHECK-NEXT: [[V4_NONCHR:%.*]] = icmp eq i32 [[V3_NONCHR]], 0			; CHECK-NEXT: [[V4_NONCHR:%.*]] = icmp eq i32 [[V3_NONCHR]], 0
	; CHECK-NEXT: [[V8_NONCHR:%.*]] = add i32 [[SUM0]], 43			; CHECK-NEXT: [[V8_NONCHR:%.*]] = add i32 [[SUM0]], 43
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[V4_NONCHR]], i32 [[SUM0]], i32 [[V8_NONCHR]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[V4_NONCHR]], i32 [[SUM0]], i32 [[V8_NONCHR]], !prof !16
	; CHECK-NEXT: [[V9_NONCHR:%.*]] = and i32 [[J0]], 4			; CHECK-NEXT: [[V9_NONCHR:%.*]] = and i32 [[J0]], 4
	; CHECK-NEXT: [[V10_NONCHR:%.*]] = icmp eq i32 [[V9_NONCHR]], 0			; CHECK-NEXT: [[V10_NONCHR:%.*]] = icmp eq i32 [[V9_NONCHR]], 0
	; CHECK-NEXT: [[V11_NONCHR:%.*]] = and i32 [[I0]], 8			; CHECK-NEXT: [[V11_NONCHR:%.*]] = and i32 [[I0]], 8
	▲ Show 20 Lines • Show All 133 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP1:%.*]] = icmp eq i32 [[TMP0]], 12			; CHECK-NEXT: [[TMP1:%.*]] = icmp eq i32 [[TMP0]], 12
	; CHECK-NEXT: br i1 [[TMP1]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP1]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[V9:%.*]] = and i32 [[J0]], 4			; CHECK-NEXT: [[V9:%.*]] = and i32 [[J0]], 4
	; CHECK-NEXT: [[V10:%.*]] = icmp eq i32 [[V9]], 0			; CHECK-NEXT: [[V10_NOT:%.*]] = icmp eq i32 [[V9]], 0
	; CHECK-NEXT: br i1 [[V10]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[V10_NOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[V11_NONCHR:%.*]] = and i32 [[J0]], 8			; CHECK-NEXT: [[V11_NONCHR:%.*]] = and i32 [[J0]], 8
	; CHECK-NEXT: [[V12_NONCHR:%.*]] = icmp eq i32 [[V11_NONCHR]], 0			; CHECK-NEXT: [[V12_NONCHR:%.*]] = icmp eq i32 [[V11_NONCHR]], 0
	; CHECK-NEXT: br i1 [[V12_NONCHR]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[V12_NONCHR]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	▲ Show 20 Lines • Show All 123 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: [[TMP3:%.]] = load i32, i32 [[J:%.*]], align 4			; CHECK-NEXT: [[TMP3:%.]] = load i32, i32 [[J:%.*]], align 4
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP5]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0			; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0
	; CHECK-NEXT: br i1 [[TMP7]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP6]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: [[TMP8:%.]] = load i32, i32 [[J]], align 4			; CHECK-NEXT: [[TMP7:%.]] = load i32, i32 [[J]], align 4
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[TMP9:%.*]] = phi i32 [ [[TMP3]], [[BB0]] ], [ [[TMP0]], [[BB1_NONCHR]] ], [ [[TMP8]], [[BB2_NONCHR]] ]			; CHECK-NEXT: [[TMP8:%.*]] = phi i32 [ [[TMP3]], [[BB0]] ], [ [[TMP0]], [[BB1_NONCHR]] ], [ [[TMP7]], [[BB2_NONCHR]] ]
	; CHECK-NEXT: ret i32 [[TMP9]]			; CHECK-NEXT: ret i32 [[TMP8]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 1			%1 = and i32 %0, 1
	%2 = icmp eq i32 %1, 0			%2 = icmp eq i32 %1, 0
	br i1 %2, label %bb1, label %bb0, !prof !15			br i1 %2, label %bb1, label %bb0, !prof !15

	bb0:			bb0:
	▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: [[TMP3:%.]] = load i32, i32 [[J:%.*]], align 4			; CHECK-NEXT: [[TMP3:%.]] = load i32, i32 [[J:%.*]], align 4
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP5]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[TMP6:%.]] = load i32, i32 [[J]], align 4			; CHECK-NEXT: [[TMP5:%.]] = load i32, i32 [[J]], align 4
	; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 0			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
	; CHECK-NEXT: br i1 [[TMP8]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP7]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[TMP9:%.*]] = phi i32 [ [[TMP3]], [[BB0]] ], [ [[TMP6]], [[BB2_NONCHR]] ], [ [[TMP6]], [[BB1_NONCHR]] ]			; CHECK-NEXT: [[TMP8:%.*]] = phi i32 [ [[TMP3]], [[BB0]] ], [ [[TMP5]], [[BB2_NONCHR]] ], [ [[TMP5]], [[BB1_NONCHR]] ]
	; CHECK-NEXT: [[TMP10:%.*]] = mul i32 [[TMP9]], 42			; CHECK-NEXT: [[TMP9:%.*]] = mul i32 [[TMP8]], 42
	; CHECK-NEXT: [[TMP11:%.*]] = add i32 [[TMP9]], -99			; CHECK-NEXT: [[TMP10:%.*]] = add i32 [[TMP8]], -99
	; CHECK-NEXT: [[TMP12:%.*]] = add i32 [[TMP10]], [[TMP11]]			; CHECK-NEXT: [[TMP11:%.*]] = add i32 [[TMP9]], [[TMP10]]
	; CHECK-NEXT: ret i32 [[TMP12]]			; CHECK-NEXT: ret i32 [[TMP11]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 1			%1 = and i32 %0, 1
	%2 = icmp eq i32 %1, 0			%2 = icmp eq i32 %1, 0
	br i1 %2, label %bb1, label %bb0, !prof !15			br i1 %2, label %bb1, label %bb0, !prof !15

	bb0:			bb0:
	▲ Show 20 Lines • Show All 412 Lines • ▼ Show 20 Lines
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: [[V40:%.*]] = add i32 [[TMP0]], 44			; CHECK-NEXT: [[V40:%.*]] = add i32 [[TMP0]], 44
	; CHECK-NEXT: [[V41:%.*]] = add i32 [[TMP0]], 99			; CHECK-NEXT: [[V41:%.*]] = add i32 [[TMP0]], 99
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP3:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP4:%.*]] = icmp eq i32 [[TMP3]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP3]], 0
	; CHECK-NEXT: br i1 [[TMP4]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB1_NONCHR:%.]], label [[BB0_NONCHR:%.]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB1_NONCHR]]			; CHECK-NEXT: br label [[BB1_NONCHR]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[V40_NONCHR:%.*]] = add i32 [[TMP0]], 44			; CHECK-NEXT: [[V40_NONCHR:%.*]] = add i32 [[TMP0]], 44
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 2			; CHECK-NEXT: [[TMP4:%.*]] = and i32 [[TMP0]], 2
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[TMP5:%.*]] = icmp eq i32 [[TMP4]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[TMP5]], label [[BB3]], label [[BB2_NONCHR:%.*]], !prof !16
	; CHECK: bb2.nonchr:			; CHECK: bb2.nonchr:
	; CHECK-NEXT: [[V41_NONCHR:%.*]] = add i32 [[TMP0]], 99			; CHECK-NEXT: [[V41_NONCHR:%.*]] = add i32 [[TMP0]], 99
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[TMP7:%.*]] = phi i32 [ [[V40]], [[BB0]] ], [ [[V40_NONCHR]], [[BB2_NONCHR]] ], [ [[V40_NONCHR]], [[BB1_NONCHR]] ]			; CHECK-NEXT: [[TMP6:%.*]] = phi i32 [ [[V40]], [[BB0]] ], [ [[V40_NONCHR]], [[BB2_NONCHR]] ], [ [[V40_NONCHR]], [[BB1_NONCHR]] ]
	; CHECK-NEXT: [[V42:%.*]] = phi i32 [ [[V41]], [[BB0]] ], [ [[V41_NONCHR]], [[BB2_NONCHR]] ], [ [[V40_NONCHR]], [[BB1_NONCHR]] ]			; CHECK-NEXT: [[V42:%.*]] = phi i32 [ [[V41]], [[BB0]] ], [ [[V41_NONCHR]], [[BB2_NONCHR]] ], [ [[V40_NONCHR]], [[BB1_NONCHR]] ]
	; CHECK-NEXT: [[V43:%.*]] = add i32 [[V42]], [[TMP7]]			; CHECK-NEXT: [[V43:%.*]] = add i32 [[V42]], [[TMP6]]
	; CHECK-NEXT: ret i32 [[V43]]			; CHECK-NEXT: ret i32 [[V43]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 1			%1 = and i32 %0, 1
	%2 = icmp eq i32 %1, 0			%2 = icmp eq i32 %1, 0
	br i1 %2, label %bb1, label %bb0, !prof !15			br i1 %2, label %bb1, label %bb0, !prof !15

	▲ Show 20 Lines • Show All 151 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: br label [[BB2]]			; CHECK-NEXT: br label [[BB2]]
	; CHECK: bb0.split.nonchr:			; CHECK: bb0.split.nonchr:
	; CHECK-NEXT: [[A4_NONCHR:%.*]] = and i32 [[LI]], 4			; CHECK-NEXT: [[A4_NONCHR:%.*]] = and i32 [[LI]], 4
	; CHECK-NEXT: [[CMP4_NONCHR:%.*]] = icmp eq i32 [[A4_NONCHR]], 0			; CHECK-NEXT: [[CMP4_NONCHR:%.*]] = icmp eq i32 [[A4_NONCHR]], 0
	; CHECK-NEXT: [[INC2_NONCHR:%.*]] = add i32 [[INC1]], 1			; CHECK-NEXT: [[INC2_NONCHR:%.*]] = add i32 [[INC1]], 1
	; CHECK-NEXT: br i1 [[CMP4_NONCHR]], label [[BB2]], label [[BB1_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[CMP4_NONCHR]], label [[BB2]], label [[BB1_NONCHR:%.*]], !prof !16
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: [[A1:%.*]] = and i32 [[LI]], 1			; CHECK-NEXT: [[A1:%.*]] = and i32 [[LI]], 1
	; CHECK-NEXT: [[CMP1:%.*]] = icmp eq i32 [[A1]], 0			; CHECK-NEXT: [[CMP1_NOT:%.*]] = icmp eq i32 [[A1]], 0
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[CMP1]], i32 [[SUM0]], i32 [[SUM1]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[CMP1_NOT]], i32 [[SUM0]], i32 [[SUM1]], !prof !16
	; CHECK-NEXT: [[SUM3_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], 44			; CHECK-NEXT: [[SUM3_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], 44
	; CHECK-NEXT: br label [[BB2]]			; CHECK-NEXT: br label [[BB2]]
	; CHECK: bb2:			; CHECK: bb2:
	; CHECK-NEXT: [[TMP2]] = phi i32 [ [[INC2]], [[BB0_SPLIT]] ], [ [[INC2_NONCHR]], [[BB1_NONCHR]] ], [ [[INC2_NONCHR]], [[BB0_SPLIT_NONCHR]] ]			; CHECK-NEXT: [[TMP2]] = phi i32 [ [[INC2]], [[BB0_SPLIT]] ], [ [[INC2_NONCHR]], [[BB1_NONCHR]] ], [ [[INC2_NONCHR]], [[BB0_SPLIT_NONCHR]] ]
	; CHECK-NEXT: [[SUM4:%.*]] = phi i32 [ [[SUM3]], [[BB0_SPLIT]] ], [ [[SUM3_NONCHR]], [[BB1_NONCHR]] ], [ [[SUM1]], [[BB0_SPLIT_NONCHR]] ]			; CHECK-NEXT: [[SUM4:%.*]] = phi i32 [ [[SUM3]], [[BB0_SPLIT]] ], [ [[SUM3_NONCHR]], [[BB1_NONCHR]] ], [ [[SUM1]], [[BB0_SPLIT_NONCHR]] ]
	; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[TMP2]], 100			; CHECK-NEXT: [[CMP:%.*]] = icmp eq i32 [[TMP2]], 100
	; CHECK-NEXT: br i1 [[CMP]], label [[BB3:%.*]], label [[BB0]], !prof !16			; CHECK-NEXT: br i1 [[CMP]], label [[BB3:%.*]], label [[BB0]], !prof !16
	; CHECK: bb3:			; CHECK: bb3:
	▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 9			; CHECK-NEXT: [[TMP2:%.*]] = icmp eq i32 [[TMP1]], 9
	; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15			; CHECK-NEXT: br i1 [[TMP2]], label [[BB0:%.]], label [[ENTRY_SPLIT_NONCHR:%.]], !prof !15
	; CHECK: bb0:			; CHECK: bb0:
	; CHECK-NEXT: [[TMP3:%.]] = add i32 [[SUM0:%.]], 85			; CHECK-NEXT: [[TMP3:%.]] = add i32 [[SUM0:%.]], 85
	; CHECK-NEXT: [[TMP4:%.*]] = add i32 [[SUM0]], 173			; CHECK-NEXT: [[TMP4:%.*]] = add i32 [[SUM0]], 173
	; CHECK-NEXT: br label [[BB3:%.*]]			; CHECK-NEXT: br label [[BB3:%.*]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 255			; CHECK-NEXT: [[TMP5:%.*]] = and i32 [[TMP0]], 255
	; CHECK-NEXT: [[TMP6:%.*]] = icmp eq i32 [[TMP5]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP5]], 0
	; CHECK-NEXT: br i1 [[TMP6]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16			; CHECK-NEXT: br i1 [[DOTNOT]], label [[BB3]], label [[BB0_NONCHR:%.*]], !prof !16
	; CHECK: bb0.nonchr:			; CHECK: bb0.nonchr:
	; CHECK-NEXT: [[TMP7:%.*]] = and i32 [[TMP0]], 1			; CHECK-NEXT: [[TMP6:%.*]] = and i32 [[TMP0]], 1
	; CHECK-NEXT: [[TMP8:%.*]] = icmp eq i32 [[TMP7]], 0			; CHECK-NEXT: [[TMP7:%.*]] = icmp eq i32 [[TMP6]], 0
	; CHECK-NEXT: [[TMP9:%.*]] = add i32 [[SUM0]], 85			; CHECK-NEXT: [[TMP8:%.*]] = add i32 [[SUM0]], 85
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP8]], i32 [[SUM0]], i32 [[TMP9]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[TMP7]], i32 [[SUM0]], i32 [[TMP8]], !prof !16
	; CHECK-NEXT: [[TMP10:%.*]] = and i32 [[TMP0]], 8			; CHECK-NEXT: [[TMP9:%.*]] = and i32 [[TMP0]], 8
	; CHECK-NEXT: [[TMP11:%.*]] = icmp eq i32 [[TMP10]], 0			; CHECK-NEXT: [[TMP10:%.*]] = icmp eq i32 [[TMP9]], 0
	; CHECK-NEXT: [[SUM4_NONCHR_V:%.*]] = select i1 [[TMP11]], i32 44, i32 88			; CHECK-NEXT: [[SUM4_NONCHR_V:%.*]] = select i1 [[TMP10]], i32 44, i32 88
	; CHECK-NEXT: [[SUM4_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], [[SUM4_NONCHR_V]]			; CHECK-NEXT: [[SUM4_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], [[SUM4_NONCHR_V]]
	; CHECK-NEXT: [[SUM5_NONCHR:%.*]] = select i1 [[TMP8]], i32 [[SUM2_NONCHR]], i32 [[SUM4_NONCHR]], !prof !16			; CHECK-NEXT: [[SUM5_NONCHR:%.*]] = select i1 [[TMP7]], i32 [[SUM2_NONCHR]], i32 [[SUM4_NONCHR]], !prof !16
	; CHECK-NEXT: br label [[BB3]]			; CHECK-NEXT: br label [[BB3]]
	; CHECK: bb3:			; CHECK: bb3:
	; CHECK-NEXT: [[SUM6:%.*]] = phi i32 [ [[TMP4]], [[BB0]] ], [ [[SUM0]], [[ENTRY_SPLIT_NONCHR]] ], [ [[SUM5_NONCHR]], [[BB0_NONCHR]] ]			; CHECK-NEXT: [[SUM6:%.*]] = phi i32 [ [[TMP4]], [[BB0]] ], [ [[SUM0]], [[ENTRY_SPLIT_NONCHR]] ], [ [[SUM5_NONCHR]], [[BB0_NONCHR]] ]
	; CHECK-NEXT: ret i32 [[SUM6]]			; CHECK-NEXT: ret i32 [[SUM6]]
	;			;
	entry:			entry:
	%0 = load i32, i32* %i			%0 = load i32, i32* %i
	%1 = and i32 %0, 255			%1 = and i32 %0, 255
	▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines
	; CHECK-NEXT: [[V9:%.]] = add i32 [[SUM0:%.]], 87			; CHECK-NEXT: [[V9:%.]] = add i32 [[SUM0:%.]], 87
	; CHECK-NEXT: br i1 [[J:%.]], label [[BB1:%.]], label [[BB4:%.*]]			; CHECK-NEXT: br i1 [[J:%.]], label [[BB1:%.]], label [[BB4:%.*]]
	; CHECK: bb1:			; CHECK: bb1:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB4]]			; CHECK-NEXT: br label [[BB4]]
	; CHECK: entry.split.nonchr:			; CHECK: entry.split.nonchr:
	; CHECK-NEXT: [[V8:%.*]] = add i32 [[SUM0]], 43			; CHECK-NEXT: [[V8:%.*]] = add i32 [[SUM0]], 43
	; CHECK-NEXT: [[V3:%.*]] = and i32 [[I0]], 2			; CHECK-NEXT: [[V3:%.*]] = and i32 [[I0]], 2
	; CHECK-NEXT: [[V4:%.*]] = icmp eq i32 [[V3]], 0			; CHECK-NEXT: [[V4_NOT:%.*]] = icmp eq i32 [[V3]], 0
	; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[V4]], i32 [[SUM0]], i32 [[V8]], !prof !16			; CHECK-NEXT: [[SUM2_NONCHR:%.*]] = select i1 [[V4_NOT]], i32 [[SUM0]], i32 [[V8]], !prof !16
	; CHECK-NEXT: [[V6_NONCHR:%.*]] = and i32 [[I0]], 4			; CHECK-NEXT: [[V6_NONCHR:%.*]] = and i32 [[I0]], 4
	; CHECK-NEXT: [[V5_NONCHR:%.*]] = icmp eq i32 [[V6_NONCHR]], 0			; CHECK-NEXT: [[V5_NONCHR:%.*]] = icmp eq i32 [[V6_NONCHR]], 0
	; CHECK-NEXT: [[V9_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], 44			; CHECK-NEXT: [[V9_NONCHR:%.*]] = add i32 [[SUM2_NONCHR]], 44
	; CHECK-NEXT: [[SUM3_NONCHR:%.*]] = select i1 [[V5_NONCHR]], i32 [[SUM2_NONCHR]], i32 [[V9_NONCHR]], !prof !16			; CHECK-NEXT: [[SUM3_NONCHR:%.*]] = select i1 [[V5_NONCHR]], i32 [[SUM2_NONCHR]], i32 [[V9_NONCHR]], !prof !16
	; CHECK-NEXT: br i1 [[J]], label [[BB1_NONCHR:%.*]], label [[BB4]]			; CHECK-NEXT: br i1 [[J]], label [[BB1_NONCHR:%.*]], label [[BB4]]
	; CHECK: bb1.nonchr:			; CHECK: bb1.nonchr:
	; CHECK-NEXT: call void @foo()			; CHECK-NEXT: call void @foo()
	; CHECK-NEXT: br label [[BB4]]			; CHECK-NEXT: br label [[BB4]]
	▲ Show 20 Lines • Show All 468 Lines • ▼ Show 20 Lines
	; Test a case with a really long use-def chains. This test checks that it's not			; Test a case with a really long use-def chains. This test checks that it's not
	; really slow and doesn't appear to be hanging. This is different from			; really slow and doesn't appear to be hanging. This is different from
	; test_chr_22 in that it has nested control structures (multiple scopes) and			; test_chr_22 in that it has nested control structures (multiple scopes) and
	; covers additional code.			; covers additional code.
	define i64 @test_chr_23(i64 %v0) !prof !14 {			define i64 @test_chr_23(i64 %v0) !prof !14 {
	; CHECK-LABEL: @test_chr_23(			; CHECK-LABEL: @test_chr_23(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[TMP0:%.]] = mul i64 [[V0:%.]], 50			; CHECK-NEXT: [[TMP0:%.]] = mul i64 [[V0:%.]], 50
	; CHECK-NEXT: [[V10:%.*]] = icmp ne i64 [[TMP0]], -50			; CHECK-NEXT: [[V10_NOT:%.*]] = icmp eq i64 [[TMP0]], -50
	; CHECK-NEXT: ret i64 99			; CHECK-NEXT: ret i64 99
	;			;
	entry:			entry:
	%v1 = add i64 %v0, 3			%v1 = add i64 %v0, 3
	%v2 = add i64 %v1, %v1			%v2 = add i64 %v1, %v1
	%v3 = add i64 %v2, %v1			%v3 = add i64 %v2, %v1
	%v4 = add i64 %v2, %v3			%v4 = add i64 %v2, %v3
	%v5 = add i64 %v4, %v2			%v5 = add i64 %v4, %v2
	▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

llvm/test/Transforms/SimplifyCFG/merge-cond-stores.ll

	Show All 36 Lines
	end:			end:
	ret void			ret void
	}			}

	; This is the same as test_simple, but the branch target order has been swapped			; This is the same as test_simple, but the branch target order has been swapped
	define void @test_simple_commuted(i32* %p, i32 %a, i32 %b) {			define void @test_simple_commuted(i32* %p, i32 %a, i32 %b) {
	; CHECK-LABEL: @test_simple_commuted(			; CHECK-LABEL: @test_simple_commuted(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[X1:%.]] = icmp eq i32 [[A:%.]], 0			; CHECK-NEXT: [[X1_NOT:%.]] = icmp eq i32 [[A:%.]], 0
	; CHECK-NEXT: [[X2:%.]] = icmp eq i32 [[B:%.]], 0			; CHECK-NEXT: [[X2:%.]] = icmp eq i32 [[B:%.]], 0
	; CHECK-NEXT: [[TMP0:%.*]] = or i1 [[X1]], [[X2]]			; CHECK-NEXT: [[TMP0:%.*]] = or i1 [[X1_NOT]], [[X2]]
	; CHECK-NEXT: br i1 [[TMP0]], label [[TMP1:%.]], label [[TMP2:%.]]			; CHECK-NEXT: br i1 [[TMP0]], label [[TMP1:%.]], label [[TMP2:%.]]
	; CHECK: 1:			; CHECK: 1:
	; CHECK-NEXT: [[SPEC_SELECT:%.*]] = zext i1 [[X2]] to i32			; CHECK-NEXT: [[SPEC_SELECT:%.*]] = zext i1 [[X2]] to i32
	; CHECK-NEXT: store i32 [[SPEC_SELECT]], i32* [[P:%.*]], align 4			; CHECK-NEXT: store i32 [[SPEC_SELECT]], i32* [[P:%.*]], align 4
	; CHECK-NEXT: br label [[TMP2]]			; CHECK-NEXT: br label [[TMP2]]
	; CHECK: 2:			; CHECK: 2:
	; CHECK-NEXT: ret void			; CHECK-NEXT: ret void
	;			;
	▲ Show 20 Lines • Show All 213 Lines • ▼ Show 20 Lines
	; This should get if-converted.			; This should get if-converted.
	define i32 @test_diamond_simple(i32* %p, i32* %q, i32 %a, i32 %b) {			define i32 @test_diamond_simple(i32* %p, i32* %q, i32 %a, i32 %b) {
	; CHECK-LABEL: @test_diamond_simple(			; CHECK-LABEL: @test_diamond_simple(
	; CHECK-NEXT: entry:			; CHECK-NEXT: entry:
	; CHECK-NEXT: [[X1:%.]] = icmp eq i32 [[A:%.]], 0			; CHECK-NEXT: [[X1:%.]] = icmp eq i32 [[A:%.]], 0
	; CHECK-NEXT: [[Z2:%.]] = select i1 [[X1]], i32 [[B:%.]], i32 0			; CHECK-NEXT: [[Z2:%.]] = select i1 [[X1]], i32 [[B:%.]], i32 0
	; CHECK-NEXT: [[X2:%.*]] = icmp eq i32 [[B]], 0			; CHECK-NEXT: [[X2:%.*]] = icmp eq i32 [[B]], 0
	; CHECK-NEXT: [[TMP0:%.*]] = or i32 [[A]], [[B]]			; CHECK-NEXT: [[TMP0:%.*]] = or i32 [[A]], [[B]]
	; CHECK-NEXT: [[TMP1:%.*]] = icmp eq i32 [[TMP0]], 0			; CHECK-NEXT: [[DOTNOT:%.*]] = icmp eq i32 [[TMP0]], 0
	; CHECK-NEXT: br i1 [[TMP1]], label [[TMP3:%.]], label [[TMP2:%.]]			; CHECK-NEXT: br i1 [[DOTNOT]], label [[TMP2:%.]], label [[TMP1:%.]]
	; CHECK: 2:			; CHECK: 1:
	; CHECK-NEXT: [[SIMPLIFYCFG_MERGE:%.*]] = select i1 [[X2]], i32 [[Z2]], i32 1			; CHECK-NEXT: [[SIMPLIFYCFG_MERGE:%.*]] = select i1 [[X2]], i32 [[Z2]], i32 1
	; CHECK-NEXT: store i32 [[SIMPLIFYCFG_MERGE]], i32* [[P:%.*]], align 4			; CHECK-NEXT: store i32 [[SIMPLIFYCFG_MERGE]], i32* [[P:%.*]], align 4
	; CHECK-NEXT: br label [[TMP3]]			; CHECK-NEXT: br label [[TMP2]]
	; CHECK: 3:			; CHECK: 2:
	; CHECK-NEXT: [[Z4:%.*]] = select i1 [[X2]], i32 [[Z2]], i32 3			; CHECK-NEXT: [[Z4:%.*]] = select i1 [[X2]], i32 [[Z2]], i32 3
	; CHECK-NEXT: ret i32 [[Z4]]			; CHECK-NEXT: ret i32 [[Z4]]
	;			;
	entry:			entry:
	%x1 = icmp eq i32 %a, 0			%x1 = icmp eq i32 %a, 0
	br i1 %x1, label %no1, label %yes1			br i1 %x1, label %no1, label %yes1

	yes1:			yes1:
	▲ Show 20 Lines • Show All 125 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[InstCombine] Always try to invert non-canonical predicate of an icmpClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 275502

llvm/lib/Transforms/InstCombine/InstCombineCompares.cpp

llvm/lib/Transforms/InstCombine/InstCombineInternal.h

llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp

llvm/lib/Transforms/InstCombine/InstructionCombining.cpp

llvm/test/ThinLTO/X86/cfi-devirt.ll

llvm/test/Transforms/InstCombine/canonicalize-selects-icmp-condition-bittest.ll

llvm/test/Transforms/InstCombine/icmp-mul-zext.ll

llvm/test/Transforms/InstCombine/logical-select.ll

llvm/test/Transforms/InstCombine/select-with-bitwise-ops.ll

llvm/test/Transforms/InstCombine/shift.ll

llvm/test/Transforms/LoopUnroll/runtime-loop-multiple-exits.ll

llvm/test/Transforms/PGOProfile/chr.ll

llvm/test/Transforms/SimplifyCFG/merge-cond-stores.ll

[InstCombine] Always try to invert non-canonical predicate of an icmp
ClosedPublic