This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/test/CodeGen/AMDGPU/
-
test/
-
CodeGen/
-
AMDGPU/
18/20
combine_andor_with_cmps.ll

Differential D153479

[NFC] Tests for future commit in DAGCombiner
ClosedPublic

Authored by kmitropoulou on Jun 21 2023, 3:55 PM.

Download Raw Diff

Details

Reviewers

arsenm
foad

Commits

rG11cd92a70f26: [NFC] Tests for future commit in DAGCombiner

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

kmitropoulou created this revision.Jun 21 2023, 3:55 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 21 2023, 3:55 PM

Herald added subscribers: kerbowa, jvesely. · View Herald Transcript

kmitropoulou requested review of this revision.Jun 21 2023, 3:55 PM

Herald added a project: Restricted Project. · View Herald TranscriptJun 21 2023, 3:55 PM

Herald added a subscriber: llvm-commits. · View Herald Transcript

arsenm added a subscriber: arsenm.Jun 21 2023, 4:07 PM

arsenm added inline comments.

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	Why use mir for this?
24	Should also do some SGPR tests

Harbormaster completed remote builds in B240368: Diff 533426.Jun 21 2023, 4:46 PM

Updating D153479: [NFC] Add tests for future commit in DAGCombiner

kmitropoulou marked 2 inline comments as done.Jun 21 2023, 6:18 PM

kmitropoulou added inline comments.

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll

CSE changes my optimization. Therefore, I need to do the checking earlier.

For example, the following test:

define i1 @test1(i32 %arg1, i32 %arg2) #0 {

%cmp1 = icmp slt i32 %arg1, 1000
%cmp2 = icmp slt i32 %arg2, 1000
%or  = or i1 %cmp1, %cmp2
ret i1 %or

}

will be optimized as follows with my optimization:

bb.0 (%ir-block.0):
  liveins: $vgpr0, $vgpr1
  %1:vgpr_32 = COPY $vgpr1
  %0:vgpr_32 = COPY $vgpr0
  %2:vgpr_32 = V_MIN_I32_e64 %0, %1, implicit $exec
  %3:sreg_32 = S_MOV_B32 1000
  %4:sreg_32_xm0_xexec = V_CMP_LT_I32_e64 killed %2, killed %3, implicit $exec
  %5:vgpr_32 = V_CNDMASK_B32_e64 0, 0, 0, 1, killed %4, implicit $exec
  $vgpr0 = COPY %5
  SI_RETURN implicit $vgpr0

This is the output after the instruction selection. After CSE, the predicate of the compare instruction changes:

; %bb.0:

s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
s_waitcnt_vscnt null, 0x0
v_min_i32_e32 v0, v0, v1
s_delay_alu instid0(VALU_DEP_1)
v_cmp_gt_i32_e32 vcc_lo, 0x3e8, v0
v_cndmask_b32_e64 v0, 0, 1, vcc_lo
s_setpc_b64 s[30:31]

kmitropoulou added reviewers: arsenm, foad.Jun 21 2023, 6:19 PM

Herald added subscribers: StephenFan, wdng. · View Herald TranscriptJun 21 2023, 6:19 PM

kmitropoulou retitled this revision from [NFC] Add tests for future commit in DAGCombiner to [NFC] Tests for future commit in DAGCombiner.Jun 21 2023, 6:21 PM

kmitropoulou marked an inline comment as done.

kmitropoulou added a subscriber: Restricted Project.

arsenm added inline comments.Jun 21 2023, 6:45 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	I don't understand. I assume you mean MachineCSE? Is your patch not actually a DAG combine as the description states? Can you stop somewhere after SIFixSGPRCopies instead?

Harbormaster completed remote builds in B240387: Diff 533448.Jun 21 2023, 6:48 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

kmitropoulou added inline comments.Jun 21 2023, 7:48 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	I am sorry I meant MachineCSE. The patch will upload implements is in DAGCombiner. The new checks are generated after amdgpu-isel .

kmitropoulou added inline comments.Jun 21 2023, 7:50 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	*The patch that I will upload implements the optimization in DAGCombiner.

Harbormaster completed remote builds in B240396: Diff 533458.Jun 21 2023, 8:35 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

kmitropoulou added inline comments.Jun 21 2023, 9:43 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	I am sorry I did not understand your comment earlier :) I update the test.

Harbormaster completed remote builds in B240403: Diff 533466.Jun 21 2023, 10:15 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

Harbormaster completed remote builds in B240416: Diff 533483.Jun 22 2023, 12:46 AM

arsenm added inline comments.Jun 22 2023, 6:41 AM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	My original point still stands. Why can't you test the end ISA? In general optimization patches are better of testing end to end unless you specifically need to check some intermediate state

Updating D153479: [NFC] Tests for future commit in DAGCombiner

kmitropoulou added inline comments.Jun 22 2023, 10:01 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll

Lets' say we have the following test:

define i1 @test1(i32 %arg1, i32 %arg2) #0 {

%cmp1 = icmp slt i32 %arg1, 1000
%cmp2 = icmp slt i32 %arg2, 1000
%or  = or i1 %cmp1, %cmp2
ret i1 %or

}

The dump after SI Fix SGPR copies is:

bb.0 (%ir-block.0):
  liveins: $vgpr0, $vgpr1
  %1:vgpr_32 = COPY $vgpr1
  %0:vgpr_32 = COPY $vgpr0
  %2:vgpr_32 = V_MIN_I32_e64 %0, %1, implicit $exec
  %3:sreg_32 = S_MOV_B32 1000
  %4:sreg_32_xm0_xexec = V_CMP_LT_I32_e64 killed %2, killed %3, implicit $exec
  %5:vgpr_32 = V_CNDMASK_B32_e64 0, 0, 0, 1, killed %4, implicit $exec
  $vgpr0 = COPY %5
  SI_RETURN implicit $vgpr0

The final output is:

%bb.0:

s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
s_waitcnt_vscnt null, 0x0
v_min_i32_e32 v0, v0, v1
s_delay_alu instid0(VALU_DEP_1)
v_cmp_gt_i32_e32 vcc_lo, 0x3e8, v0
v_cndmask_b32_e64 v0, 0, 1, vcc_lo
s_setpc_b64 s[30:31]

So, it is easier to check the correctness of the optimization after SI Fix SGPR copies.

If you do not like it, then I can change it.

Harbormaster completed remote builds in B240689: Diff 533855.Jun 22 2023, 10:26 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

Harbormaster completed remote builds in B240696: Diff 533862.Jun 22 2023, 11:48 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

Harbormaster completed remote builds in B241194: Diff 534578.Jun 26 2023, 9:30 AM

arsenm added inline comments.Jun 26 2023, 9:47 AM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	You don't actually care about the MIR here, so just go to ISA. You can disable the s_delay_salu insertion for gfx11 with the flag
950	amdgpu_gfx with inreg is less noisy

kmitropoulou added inline comments.Jun 26 2023, 10:28 AM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	Which flag?

foad added inline comments.Jun 26 2023, 10:47 AM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
3	`-amdgpu-enable-delay-alu=0`

Updating D153479: [NFC] Tests for future commit in DAGCombiner

Thank you all :)

Harbormaster completed remote builds in B241304: Diff 534739.Jun 26 2023, 3:15 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

kmitropoulou added a child revision: D153502: [DAGCombiner] Change foldAndOrOfSETCC() to optimize and/or patterns.Jun 26 2023, 5:12 PM

Harbormaster completed remote builds in B241331: Diff 534783.Jun 26 2023, 5:50 PM

arsenm accepted this revision.Jun 27 2023, 11:27 AM

This revision is now accepted and ready to land.Jun 27 2023, 11:27 AM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

arsenm added inline comments.Jul 6 2023, 2:57 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
18–19	Do you need tests where these use different constants?
274–275	Some tests where the compare types don't match?
1100	Don't need these attributes

Updating D153479: [NFC] Tests for future commit in DAGCombiner

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
274–275	In test62, the predicates are different and in test63, the compare types are different.

kmitropoulou added inline comments.Jul 6 2023, 3:32 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
18–19	I added test65.

Harbormaster completed remote builds in B243591: Diff 537899.Jul 6 2023, 5:55 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

Harbormaster completed remote builds in B245278: Diff 540263.Jul 13 2023, 9:57 PM

kmitropoulou added a child revision: D155267: [DAGCombiner] Change foldAndOrOfSETCC() to optimize and/or patterns with floating points..Jul 14 2023, 1:10 AM

arsenm added inline comments.Jul 14 2023, 5:15 AM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
1875	Also test f16 and v2f16

Updating D153479: [NFC] Tests for future commit in DAGCombiner

kmitropoulou marked an inline comment as done.Jul 14 2023, 4:08 PM

kmitropoulou added inline comments.Jul 14 2023, 4:11 PM

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll
1875	I added tests: test84, test85, test86, test87, test88, test89, tes90 and test91.

Harbormaster completed remote builds in B245513: Diff 540590.Jul 14 2023, 4:55 PM

Updating D153479: [NFC] Tests for future commit in DAGCombiner

Harbormaster completed remote builds in B245930: Diff 541135.Jul 17 2023, 4:46 PM

This revision was landed with ongoing or failed builds.Jul 17 2023, 5:09 PM

Closed by commit rG11cd92a70f26: [NFC] Tests for future commit in DAGCombiner (authored by kmitropoulou). · Explain Why

This revision was automatically updated to reflect the committed changes.

kmitropoulou added a commit: rG11cd92a70f26: [NFC] Tests for future commit in DAGCombiner.

kmitropoulou removed a child revision: D155267: [DAGCombiner] Change foldAndOrOfSETCC() to optimize and/or patterns with floating points..Aug 2 2023, 1:26 PM

Revision Contents

Path

Size

llvm/

test/

CodeGen/

AMDGPU/

combine_andor_with_cmps.ll

1100 lines

Diff 537891

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll

This file was added.

				; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py UTC_ARGS: --version 2
				; RUN: llc -march=amdgcn -mcpu=gfx1100 -verify-machineinstrs -amdgpu-enable-delay-alu=0 < %s \| FileCheck %s

				arsenmUnsubmitted Done Reply Inline Actions Why use mir for this? arsenm: Why use mir for this?
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions CSE changes my optimization. Therefore, I need to do the checking earlier. For example, the following test: define i1 @test1(i32 %arg1, i32 %arg2) #0 { %cmp1 = icmp slt i32 %arg1, 1000 %cmp2 = icmp slt i32 %arg2, 1000 %or = or i1 %cmp1, %cmp2 ret i1 %or } will be optimized as follows with my optimization: bb.0 (%ir-block.0): liveins: $vgpr0, $vgpr1 %1:vgpr_32 = COPY $vgpr1 %0:vgpr_32 = COPY $vgpr0 %2:vgpr_32 = V_MIN_I32_e64 %0, %1, implicit $exec %3:sreg_32 = S_MOV_B32 1000 %4:sreg_32_xm0_xexec = V_CMP_LT_I32_e64 killed %2, killed %3, implicit $exec %5:vgpr_32 = V_CNDMASK_B32_e64 0, 0, 0, 1, killed %4, implicit $exec $vgpr0 = COPY %5 SI_RETURN implicit $vgpr0 This is the output after the instruction selection. After CSE, the predicate of the compare instruction changes: ; %bb.0: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) s_waitcnt_vscnt null, 0x0 v_min_i32_e32 v0, v0, v1 s_delay_alu instid0(VALU_DEP_1) v_cmp_gt_i32_e32 vcc_lo, 0x3e8, v0 v_cndmask_b32_e64 v0, 0, 1, vcc_lo s_setpc_b64 s[30:31] kmitropoulou: CSE changes my optimization. Therefore, I need to do the checking earlier. For example, the…
				arsenmUnsubmitted Not Done Reply Inline Actions I don't understand. I assume you mean MachineCSE? Is your patch not actually a DAG combine as the description states? Can you stop somewhere after SIFixSGPRCopies instead? arsenm: I don't understand. I assume you mean MachineCSE? Is your patch not actually a DAG combine as…
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions I am sorry I meant MachineCSE. The patch will upload implements is in DAGCombiner. The new checks are generated after amdgpu-isel . kmitropoulou: I am sorry I meant MachineCSE. The patch will upload implements is in DAGCombiner. The new…
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions The patch that I will upload implements the optimization in DAGCombiner. kmitropoulou:* *The patch that I will upload implements the optimization in DAGCombiner.
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions I am sorry I did not understand your comment earlier :) I update the test. kmitropoulou: I am sorry I did not understand your comment earlier :) I update the test.
				arsenmUnsubmitted Not Done Reply Inline Actions My original point still stands. Why can't you test the end ISA? In general optimization patches are better of testing end to end unless you specifically need to check some intermediate state arsenm: My original point still stands. Why can't you test the end ISA? In general optimization patches…
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions Lets' say we have the following test: define i1 @test1(i32 %arg1, i32 %arg2) #0 { %cmp1 = icmp slt i32 %arg1, 1000 %cmp2 = icmp slt i32 %arg2, 1000 %or = or i1 %cmp1, %cmp2 ret i1 %or } The dump after SI Fix SGPR copies is: bb.0 (%ir-block.0): liveins: $vgpr0, $vgpr1 %1:vgpr_32 = COPY $vgpr1 %0:vgpr_32 = COPY $vgpr0 %2:vgpr_32 = V_MIN_I32_e64 %0, %1, implicit $exec %3:sreg_32 = S_MOV_B32 1000 %4:sreg_32_xm0_xexec = V_CMP_LT_I32_e64 killed %2, killed %3, implicit $exec %5:vgpr_32 = V_CNDMASK_B32_e64 0, 0, 0, 1, killed %4, implicit $exec $vgpr0 = COPY %5 SI_RETURN implicit $vgpr0 The final output is: %bb.0: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) s_waitcnt_vscnt null, 0x0 v_min_i32_e32 v0, v0, v1 s_delay_alu instid0(VALU_DEP_1) v_cmp_gt_i32_e32 vcc_lo, 0x3e8, v0 v_cndmask_b32_e64 v0, 0, 1, vcc_lo s_setpc_b64 s[30:31] So, it is easier to check the correctness of the optimization after SI Fix SGPR copies. If you do not like it, then I can change it. kmitropoulou: Lets' say we have the following test: define i1 @test1(i32 %arg1, i32 %arg2) #0 { %cmp1 =…
				arsenmUnsubmitted Done Reply Inline Actions You don't actually care about the MIR here, so just go to ISA. You can disable the s_delay_salu insertion for gfx11 with the flag arsenm: You don't actually care about the MIR here, so just go to ISA. You can disable the s_delay_salu…
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions Which flag? kmitropoulou: Which flag?
				foadUnsubmitted Done Reply Inline Actions `-amdgpu-enable-delay-alu=0` foad: `-amdgpu-enable-delay-alu=0`
				; The tests check the following optimization of DAGCombiner:
				; CMP(A,C)\|\|CMP(B,C) => CMP(MIN/MAX(A,B), C)
				; CMP(A,C)&&CMP(B,C) => CMP(MIN/MAX(A,B), C)

				define i1 @test1(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test1:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, 1000
				%cmp2 = icmp slt i32 %arg2, 1000
				arsenmUnsubmitted Done Reply Inline Actions Do you need tests where these use different constants? arsenm: Do you need tests where these use different constants?
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions I added test65. kmitropoulou: I added test65.
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test2(i32 %arg1, i32 %arg2) #0 {
				arsenmUnsubmitted Done Reply Inline Actions Should also do some SGPR tests arsenm: Should also do some SGPR tests
				; CHECK-LABEL: test2:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, 1000
				%cmp2 = icmp ult i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test3(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test3:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, 0x3e9, v0
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, 0x3e9, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sle i32 %arg1, 1000
				%cmp2 = icmp sle i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test4(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test4:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, 0x3e9, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, 0x3e9, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ule i32 %arg1, 1000
				%cmp2 = icmp ule i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test5(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test5:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, 1000
				%cmp2 = icmp sgt i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test6(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test6:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ugt i32 %arg1, 1000
				%cmp2 = icmp ugt i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test7(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test7:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, 0x3e7, v0
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, 0x3e7, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sge i32 %arg1, 1000
				%cmp2 = icmp sge i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test8(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test8:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, 0x3e7, v0
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, 0x3e7, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp uge i32 %arg1, 1000
				%cmp2 = icmp uge i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test9(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test9:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, %arg3
				%cmp2 = icmp slt i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test10(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test10:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, %arg3
				%cmp2 = icmp ult i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test11(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test11:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_le_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_le_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sle i32 %arg1, %arg3
				%cmp2 = icmp sle i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test12(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test12:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_le_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_le_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ule i32 %arg1, %arg3
				%cmp2 = icmp ule i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test13(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test13:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, %arg3
				%cmp2 = icmp sgt i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test14(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test14:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ugt i32 %arg1, %arg3
				%cmp2 = icmp ugt i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test15(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test15:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_ge_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_ge_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sge i32 %arg1, %arg3
				%cmp2 = icmp sge i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test16(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test16:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_ge_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_ge_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp uge i32 %arg1, %arg3
				%cmp2 = icmp uge i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test17(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test17:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, 1000
				%cmp2 = icmp slt i32 %arg2, 1000
				arsenmUnsubmitted Done Reply Inline Actions Some tests where the compare types don't match? arsenm: Some tests where the compare types don't match?
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions In test62, the predicates are different and in test63, the compare types are different. kmitropoulou: In test62, the predicates are different and in test63, the compare types are different.
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test18(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test18:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, 1000
				%cmp2 = icmp ult i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test19(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test19:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, 0x3e9, v0
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, 0x3e9, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sle i32 %arg1, 1000
				%cmp2 = icmp sle i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test20(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test20:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, 0x3e9, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, 0x3e9, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ule i32 %arg1, 1000
				%cmp2 = icmp ule i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test21(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test21:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, 1000
				%cmp2 = icmp sgt i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test22(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test22:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, 0x3e8, v0
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, 0x3e8, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ugt i32 %arg1, 1000
				%cmp2 = icmp ugt i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test23(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test23:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, 0x3e7, v0
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, 0x3e7, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sge i32 %arg1, 1000
				%cmp2 = icmp sge i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test24(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test24:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, 0x3e7, v0
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, 0x3e7, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp uge i32 %arg1, 1000
				%cmp2 = icmp uge i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test25(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test25:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, %arg3
				%cmp2 = icmp slt i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test26(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test26:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, %arg3
				%cmp2 = icmp ult i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test27(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test27:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_le_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_le_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sle i32 %arg1, %arg3
				%cmp2 = icmp sle i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test28(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test28:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_le_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_le_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ule i32 %arg1, %arg3
				%cmp2 = icmp ule i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test29(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test29:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, %arg3
				%cmp2 = icmp sgt i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test30(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test30:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ugt i32 %arg1, %arg3
				%cmp2 = icmp ugt i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test31(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test31:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_ge_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_ge_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sge i32 %arg1, %arg3
				%cmp2 = icmp sge i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test32(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test32:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_ge_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_ge_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp uge i32 %arg1, %arg3
				%cmp2 = icmp uge i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test33(i32 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test33:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v0, v1
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, 0x3e8, v0
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, %arg2
				%cmp2 = icmp slt i32 %arg1, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test34(i32 %arg1, i64 %arg2) #0 {
				; CHECK-LABEL: test34:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_mov_b64 s[0:1], 0x3e8
				; CHECK-NEXT: v_cmp_gt_i64_e32 vcc_lo, s[0:1], v[1:2]
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, 0x3e8, v0
				; CHECK-NEXT: s_or_b32 s0, s0, vcc_lo
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, 1000
				%cmp2 = icmp slt i64 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test35(i32 %arg1, i64 %arg2) #0 {
				; CHECK-LABEL: test35:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_mov_b64 s[0:1], 0x3e8
				; CHECK-NEXT: v_cmp_eq_u64_e32 vcc_lo, s[0:1], v[1:2]
				; CHECK-NEXT: v_cmp_eq_u32_e64 s0, 0x3e8, v0
				; CHECK-NEXT: s_or_b32 s0, s0, vcc_lo
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp eq i32 %arg1, 1000
				%cmp2 = icmp eq i64 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test36(i32 %arg1, i64 %arg2) #0 {
				; CHECK-LABEL: test36:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_mov_b64 s[0:1], 0x3e8
				; CHECK-NEXT: v_cmp_ne_u64_e32 vcc_lo, s[0:1], v[1:2]
				; CHECK-NEXT: v_cmp_ne_u32_e64 s0, 0x3e8, v0
				; CHECK-NEXT: s_or_b32 s0, s0, vcc_lo
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ne i32 %arg1, 1000
				%cmp2 = icmp ne i64 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define amdgpu_gfx void @test37(i32 inreg %arg1, i32 inreg %arg2) #0 {
				; CHECK-LABEL: test37:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmpk_lt_i32 s4, 0x3e9
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmpk_lt_i32 s5, 0x3e9
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_or_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sle i32 %arg1, 1000
				%cmp2 = icmp sle i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				store volatile i1 %or, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test38(i32 inreg %arg1, i32 inreg %arg2) #0 {
				; CHECK-LABEL: test38:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmpk_gt_i32 s4, 0x3e8
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmpk_gt_i32 s5, 0x3e8
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_or_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, 1000
				%cmp2 = icmp sgt i32 %arg2, 1000
				%or = or i1 %cmp1, %cmp2
				store volatile i1 %or, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test39(i32 inreg %arg1, i32 inreg %arg2, i32 inreg %arg3) #0 {
				; CHECK-LABEL: test39:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmp_lt_u32 s4, s6
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmp_lt_u32 s5, s6
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_or_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, %arg3
				%cmp2 = icmp ult i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				store volatile i1 %or, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test40(i32 inreg %arg1, i32 inreg %arg2, i32 inreg %arg3) #0 {
				; CHECK-LABEL: test40:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmp_ge_i32 s4, s6
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmp_ge_i32 s5, s6
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_or_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sge i32 %arg1, %arg3
				%cmp2 = icmp sge i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				store volatile i1 %or, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test41(i32 inreg %arg1, i32 inreg %arg2) #0 {
				; CHECK-LABEL: test41:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmpk_lt_u32 s4, 0x3e9
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmpk_lt_u32 s5, 0x3e9
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_and_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ule i32 %arg1, 1000
				%cmp2 = icmp ule i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				store volatile i1 %and, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test42(i32 inreg %arg1, i32 inreg %arg2) #0 {
				; CHECK-LABEL: test42:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmpk_gt_i32 s4, 0x3e7
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmpk_gt_i32 s5, 0x3e7
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_and_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sge i32 %arg1, 1000
				%cmp2 = icmp sge i32 %arg2, 1000
				%and = and i1 %cmp1, %cmp2
				store volatile i1 %and, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test43(i32 inreg %arg1, i32 inreg %arg2, i32 inreg %arg3) #0 {
				; CHECK-LABEL: test43:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmp_le_i32 s4, s6
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmp_le_i32 s5, s6
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_and_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sle i32 %arg1, %arg3
				%cmp2 = icmp sle i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				store volatile i1 %and, ptr addrspace(1) null
				ret void
				}

				define amdgpu_gfx void @test44(i32 inreg %arg1, i32 inreg %arg2, i32 inreg %arg3) #0 {
				; CHECK-LABEL: test44:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_cmp_ge_u32 s4, s6
				; CHECK-NEXT: v_mov_b32_e32 v0, 0
				; CHECK-NEXT: s_cselect_b32 s0, -1, 0
				; CHECK-NEXT: s_cmp_ge_u32 s5, s6
				; CHECK-NEXT: v_mov_b32_e32 v1, 0
				; CHECK-NEXT: s_cselect_b32 s1, -1, 0
				; CHECK-NEXT: s_and_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v2, 0, 1, s0
				; CHECK-NEXT: global_store_b8 v[0:1], v2, off dlc
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp uge i32 %arg1, %arg3
				%cmp2 = icmp uge i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				store volatile i1 %and, ptr addrspace(1) null
				ret void
				}

				define i1 @test45(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test45:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, v2, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg3, %arg1
				%cmp2 = icmp ult i32 %arg3, %arg2
				%or = and i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test46(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test46:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, v2, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg3, %arg1
				%cmp2 = icmp ult i32 %arg3, %arg2
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}


				define i1 @test47(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test47:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, v2, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ugt i32 %arg3, %arg1
				%cmp2 = icmp ugt i32 %arg3, %arg2
				%or = and i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test48(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test48:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, v2, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ugt i32 %arg3, %arg1
				%cmp2 = icmp ugt i32 %arg3, %arg2
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test49(i64 %arg1, i64 %arg2, i64 %arg3) #0 {
				; CHECK-LABEL: test49:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u64_e32 vcc_lo, v[0:1], v[4:5]
				; CHECK-NEXT: v_cmp_lt_u64_e64 s0, v[2:3], v[4:5]
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i64 %arg1, %arg3
				%cmp2 = icmp ult i64 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test50(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test50:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg3, %arg1
				%cmp2 = icmp sgt i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test51(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test51:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, v2, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, %arg3
				%cmp2 = icmp slt i32 %arg3, %arg2
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test52(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test52:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, v2, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, %arg3
				%cmp2 = icmp sgt i32 %arg3, %arg2
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test53(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test53:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg3, %arg1
				%cmp2 = icmp slt i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test54(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test54:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg3, %arg1
				%cmp2 = icmp sgt i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test55(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test55:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, v2, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg1, %arg3
				%cmp2 = icmp slt i32 %arg3, %arg2
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test56(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test56:
				; CHECK: ; %bb.0:
				arsenmUnsubmitted Done Reply Inline Actions amdgpu_gfx with inreg is less noisy arsenm: amdgpu_gfx with inreg is less noisy
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_i32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_gt_i32_e64 s0, v2, v1
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp slt i32 %arg1, %arg3
				%cmp2 = icmp sgt i32 %arg3, %arg2
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				define i1 @test57(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test57:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_i32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_lt_i32_e64 s0, v1, v2
				; CHECK-NEXT: s_and_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp sgt i32 %arg3, %arg1
				%cmp2 = icmp slt i32 %arg2, %arg3
				%and = and i1 %cmp1, %cmp2
				ret i1 %and
				}

				; The optimization does not apply to the following tests.

				define i1 @test58(float %arg1, float %arg2, float %arg3) #0 {
				; CHECK-LABEL: test58:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_nge_f32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_nge_f32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = fcmp ult float %arg1, %arg3
				%cmp2 = fcmp ult float %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test59(double %arg1, double %arg2, double %arg3) #0 {
				; CHECK-LABEL: test59:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_nge_f64_e32 vcc_lo, v[0:1], v[4:5]
				; CHECK-NEXT: v_cmp_nge_f64_e64 s0, v[2:3], v[4:5]
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = fcmp ult double %arg1, %arg3
				%cmp2 = fcmp ult double %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test60(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test60:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, v2, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, %arg3
				%cmp2 = icmp ult i32 %arg3, %arg2
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test61(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test61:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_le_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, %arg3
				%cmp2 = icmp ule i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test62(i32 %arg1, i32 %arg2, i32 %arg3) #0 {
				; CHECK-LABEL: test62:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_le_u32_e32 vcc_lo, v2, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, v1, v2
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ule i32 %arg3, %arg1
				%cmp2 = icmp ugt i32 %arg2, %arg3
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test63(i16 %arg1, i32 %arg2) #0 {
				; CHECK-LABEL: test63:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_gt_u16_e32 vcc_lo, 10, v0
				; CHECK-NEXT: v_cmp_gt_u32_e64 s0, 10, v1
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i16 %arg1, 10
				%cmp2 = icmp ult i32 %arg2, 10
				%or = or i1 %cmp1, %cmp2
				ret i1 %or
				}

				define i1 @test64(i32 %arg1, i32 %arg2, i32 %arg3, i32 %arg4) #0 {
				; CHECK-LABEL: test64:
				; CHECK: ; %bb.0:
				; CHECK-NEXT: s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0)
				; CHECK-NEXT: s_waitcnt_vscnt null, 0x0
				; CHECK-NEXT: v_cmp_lt_u32_e32 vcc_lo, v0, v2
				; CHECK-NEXT: v_cmp_lt_u32_e64 s0, v1, v2
				; CHECK-NEXT: v_cmp_lt_u32_e64 s1, v0, v3
				; CHECK-NEXT: s_or_b32 s0, vcc_lo, s0
				; CHECK-NEXT: s_or_b32 s1, s1, vcc_lo
				; CHECK-NEXT: s_or_b32 s0, s0, s1
				; CHECK-NEXT: v_cndmask_b32_e64 v0, 0, 1, s0
				; CHECK-NEXT: s_setpc_b64 s[30:31]
				%cmp1 = icmp ult i32 %arg1, %arg3
				%cmp2 = icmp ult i32 %arg2, %arg3
				%or1 = or i1 %cmp1, %cmp2
				%cmp3 = icmp ult i32 %arg1, %arg4
				%or2 = or i1 %cmp3, %cmp1
				%or3 = or i1 %or1, %or2
				ret i1 %or3
				}

				attributes #0 = { nounwind readnone }
				arsenmUnsubmitted Done Reply Inline Actions Don't need these attributes arsenm: Don't need these attributes
				arsenmUnsubmitted Done Reply Inline Actions Also test f16 and v2f16 arsenm: Also test f16 and v2f16
				kmitropoulouAuthorUnsubmitted Done Reply Inline Actions I added tests: test84, test85, test86, test87, test88, test89, tes90 and test91. kmitropoulou: I added tests: test84, test85, test86, test87, test88, test89, tes90 and test91.

This is an archive of the discontinued LLVM Phabricator instance.

[NFC] Tests for future commit in DAGCombinerClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 537891

llvm/test/CodeGen/AMDGPU/combine_andor_with_cmps.ll

[NFC] Tests for future commit in DAGCombiner
ClosedPublic