This is an archive of the discontinued LLVM Phabricator instance.


Differential D99171

[WebAssembly] Fold xor by inverting branch target
Closed · Public

Authored by samparker on Mar 23 2021, 4:09 AM.


Details

Reviewers

tlively
kripken
sunfish

Commits

rG92e777148359: [WebAssembly] Invert branch condition on xor input

Summary

I noticed this pattern appearing when running the Bullet physics engine on node. Folding away the xor looks beneficial across different architectures and runtimes. Measured speedups:

Benchmark	Macbook m1 (node)	Macbook m1 (wasmtime)	Ryzen 3 (node)
Bullet	1.4%	0.5%	1%
Adobe	2.4%	0.8%	2.3%

I have performed this transformation directly in v8 too and the numbers correlate.
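
As a rough illustration of the fold described in this summary, the C++ sketch below models a conditional branch whose boolean condition is first inverted with `xor 1`, and the equivalent branch with the opposite sense and no xor. The function names are hypothetical and only mirror the BR_IF and BR_UNLESS instructions that appear in the diff; this is not the backend's code.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical models of the two conditional branches: each returns true
// when the branch to the destination block is taken.
constexpr bool br_if(std::uint32_t cond) { return cond != 0; }
constexpr bool br_unless(std::uint32_t cond) { return cond == 0; }

// Before the fold: the boolean condition is inverted with an explicit xor.
constexpr bool branch_with_xor(std::uint32_t cond) { return br_if(cond ^ 1u); }

// After the fold: the xor is gone and the branch sense is inverted instead.
constexpr bool branch_folded(std::uint32_t cond) { return br_unless(cond); }

// Checks that both forms take the same branch for the two boolean inputs.
inline void check_fold() {
  const std::uint32_t bools[] = {0u, 1u};
  for (std::uint32_t cond : bools)
    assert(branch_with_xor(cond) == branch_folded(cond));
}
```

The equivalence is only checked for the inputs 0 and 1, which is why the review goes on to discuss whether the incoming i32 is known to be a boolean.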

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

samparker created this revision. Mar 23 2021, 4:09 AM

Herald added subscribers: ecnelises, hiraditya, jgravelle-google and 2 others. Mar 23 2021, 4:09 AM

samparker requested review of this revision. Mar 23 2021, 4:09 AM

Herald added a project: Restricted Project. Mar 23 2021, 4:09 AM

Herald added a subscriber: aheejin.

samparker edited the summary of this revision. Mar 23 2021, 4:11 AM

samparker edited the summary of this revision.

Harbormaster completed remote builds in B95227: Diff 332602. Mar 23 2021, 7:18 AM

samparker added inline comments. Mar 23 2021, 7:19 AM

llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td
33	Ah, I've remembered that I thought I should check that the incoming I32 is indeed a boolean value.

kripken added inline comments. Mar 23 2021, 7:57 PM

llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td
33	Yes, this seems like a valid pattern for any boolean value, not just one flowing into a br? That is, bool(x) ^ 1 => !bool(x) or in wasm (i32.xor X (i32.const 1)) => (i32.eqz X)
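
The general rewrite suggested in this comment can be checked with a small C++ sketch. It assumes plain 32-bit unsigned arithmetic as a stand-in for wasm i32 values; `xor_one`, `eqz`, and `rewrite_is_exact` are hypothetical helper names. The sketch also shows why the rewrite needs the operand to be a boolean: for any other value the two expressions disagree.

```cpp
#include <cassert>
#include <cstdint>

// Model of (i32.xor X (i32.const 1)).
constexpr std::uint32_t xor_one(std::uint32_t x) { return x ^ 1u; }

// Model of (i32.eqz X): 1 if X is zero, otherwise 0.
constexpr std::uint32_t eqz(std::uint32_t x) { return x == 0 ? 1u : 0u; }

// True when replacing the xor with eqz leaves the result unchanged.
constexpr bool rewrite_is_exact(std::uint32_t x) { return xor_one(x) == eqz(x); }

// Exact for the boolean values 0 and 1.
static_assert(rewrite_is_exact(0) && rewrite_is_exact(1), "boolean inputs");

// Not exact otherwise: 2 ^ 1 is 3, while eqz(2) is 0.
static_assert(!rewrite_is_exact(2), "non-boolean input");
```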

tlively added inline comments. Mar 23 2021, 8:36 PM

llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td
33	Is there a good way to check for boolean values from tablegen patterns?

craig.topper added a subscriber: craig.topper. Mar 23 2021, 8:40 PM

craig.topper added inline comments.

llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td
33	You'd need to use a PatFrag to call computeKnownBits I think.
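
A known-bits check of this kind can be sketched in standalone C++. The `KnownBits32` struct and both functions below are hypothetical stand-ins written for illustration, not LLVM's KnownBits class; the only thing taken from the review is the idea of requiring 31 leading bits that are known to be zero.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a known-bits result on an i32: a set bit in
// `zero` means the corresponding bit of the value is known to be 0.
struct KnownBits32 {
  std::uint32_t zero;
};

// Number of high bits known to be zero, counted downwards from bit 31.
inline unsigned count_min_leading_zeros(KnownBits32 known) {
  unsigned n = 0;
  for (int bit = 31; bit >= 0 && ((known.zero >> bit) & 1u); --bit)
    ++n;
  return n;
}

// Treat the value as a boolean when exactly the upper 31 bits are known
// zero, so only bit 0 can vary. A value whose 32 bits are all known zero
// (the constant 0) yields 32 and does not match this predicate.
inline bool is_bool_node(KnownBits32 known) {
  return count_min_leading_zeros(known) == 31;
}
```

For example, a comparison result would have `zero == 0xFFFFFFFE` in this model and pass the check, whereas a value that may occupy two low bits (`zero == 0xFFFFFFFC`) would not.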

Thanks all. I've added a PatLeaf to detect a boolean, but I wasn't sure how to write a negative test using LLVM IR considering that the branch always takes an i1. Any ideas?

Harbormaster completed remote builds in B95421: Diff 332883. Mar 24 2021, 4:23 AM

In D99171#2646847, @samparker wrote:

Thanks all. I've added a PatLeaf to detect a boolean, but I wasn't sure how to write a negative test using LLVM IR considering that the branch always takes an i1. Any ideas?

If you're starting from brcond, I don't think you need the check. I think the input to brcond should agree with what the target has defined for getBooleanContents. I assume that's ZeroOrOne for WebAssembly. If you were to generalize this for xors not being used by a brcond, you would need a check.

@samparker Can you add tests that do explicit xors in the IR to show that it is not folded out where it shouldn't be?

llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td
29–31	This looks generally useful; Can you move it to WebAssemblyInstrInfo.td and add a TODO about using it in more places?
33	@craig.topper, by "If you're starting from brcond, I don't think you need the check", do you mean that we shouldn't need to use `bool_node` here?

Can you add tests that do explicit xors in the IR to show that it is not folded out where it shouldn't be?

Well this is what I mean, that I don't think this is possible. Writing a test in IR, the br has to take an i1, though this is expanded to i32 during codegen. I could write some IR tests with xors, but they should all be valid. AFAIK, I'd need a way of writing a test input in selection dag form to write a negative test. I have missed some FP conditions in the existing tests, so I'm going to add those.

Moved PatLeaf to top-level file.
Added tests for missing FP conditions.
Added switch tests.
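
The arithmetic behind the xor_add_switch test in this revision can be worked through in a short C++ sketch. It assumes 32-bit unsigned arithmetic as a model of the IR, and the function names are hypothetical. Once 1 is added to the zero-extended compare result, the value is no longer a boolean, so treating the following `xor 1` as a negation would change the result.

```cpp
#include <cassert>
#include <cstdint>

// Mirrors the IR in xor_add_switch: %zext is 0 or 1, %add = %zext + 1,
// %xor = %add ^ 1.
constexpr std::uint32_t xor_of_add(std::uint32_t zext) {
  return (zext + 1u) ^ 1u;
}

// What the switch operand would be if the xor were (incorrectly) replaced
// by a boolean negation of %add.
constexpr std::uint32_t eqz_of_add(std::uint32_t zext) {
  return (zext + 1u) == 0 ? 1u : 0u;
}

// The real xor yields 0 and 3 for the two possible compare results.
static_assert(xor_of_add(0) == 0 && xor_of_add(1) == 3, "actual values");

// The negation would yield 0 in both cases, so the two differ when the
// compare result is 1.
static_assert(eqz_of_add(0) == 0 && eqz_of_add(1) == 0, "negated values");
static_assert(xor_of_add(1) != eqz_of_add(1), "fold would be wrong here");
```

This matches the expected output of that test, where both the i32.add and the i32.xor remain ahead of the br_table.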

Harbormaster completed remote builds in B96483: Diff 334373. Mar 31 2021, 2:04 AM

Looks good, thanks! Do you need me to land this for you?

This revision is now accepted and ready to land. Mar 31 2021, 12:21 PM

Cheers! And no thanks - I've had commit access for a few years.

This revision was landed with ongoing or failed builds. Apr 1 2021, 1:24 AM

Closed by commit rG92e777148359: [WebAssembly] Invert branch condition on xor input (authored by samparker).

This revision was automatically updated to reflect the committed changes.

samparker added a commit: rG92e777148359: [WebAssembly] Invert branch condition on xor input.

sbc100 added inline comments. Apr 4 2021, 6:23 PM

llvm/test/CodeGen/WebAssembly/comparisons-f32.ll
304	Probably cleaner not to include C++ name mangling here. Maybe just `call1`?

samparker added inline comments. Apr 6 2021, 1:01 AM

llvm/test/CodeGen/WebAssembly/comparisons-f32.ll
304	Done: f1313b3b249a

Revision Contents

Path	Size
llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td	2 lines
llvm/lib/Target/WebAssembly/WebAssemblyInstrInfo.td	5 lines
llvm/test/CodeGen/WebAssembly/comparisons-f32.ll	195 lines
llvm/test/CodeGen/WebAssembly/comparisons-f64.ll	195 lines

Diff 334616

llvm/lib/Target/WebAssembly/WebAssemblyInstrControl.td

	defm BR_UNLESS : I<(outs), (ins bb_op:$dst, I32:$cond),			defm BR_UNLESS : I<(outs), (ins bb_op:$dst, I32:$cond),
	(outs), (ins bb_op:$dst), []>;			(outs), (ins bb_op:$dst), []>;
	let isBarrier = 1 in			let isBarrier = 1 in
	defm BR : NRI<(outs), (ins bb_op:$dst),			defm BR : NRI<(outs), (ins bb_op:$dst),
	[(br bb:$dst)],			[(br bb:$dst)],
	"br \t$dst", 0x0c>;			"br \t$dst", 0x0c>;
	} // isBranch = 1, isTerminator = 1, hasCtrlDep = 1			} // isBranch = 1, isTerminator = 1, hasCtrlDep = 1

	def : Pat<(brcond (i32 (setne I32:$cond, 0)), bb:$dst),			def : Pat<(brcond (i32 (setne I32:$cond, 0)), bb:$dst),
	(BR_IF bb_op:$dst, I32:$cond)>;			(BR_IF bb_op:$dst, I32:$cond)>;
	def : Pat<(brcond (i32 (seteq I32:$cond, 0)), bb:$dst),			def : Pat<(brcond (i32 (seteq I32:$cond, 0)), bb:$dst),
	(BR_UNLESS bb_op:$dst, I32:$cond)>;			(BR_UNLESS bb_op:$dst, I32:$cond)>;
				def : Pat<(brcond (i32 (xor bool_node:$cond, (i32 1))), bb:$dst),
				(BR_UNLESS bb_op:$dst, I32:$cond)>;

	// A list of branch targets enclosed in {} and separated by comma.			// A list of branch targets enclosed in {} and separated by comma.
	// Used by br_table only.			// Used by br_table only.
	def BrListAsmOperand : AsmOperandClass { let Name = "BrList"; }			def BrListAsmOperand : AsmOperandClass { let Name = "BrList"; }
	let OperandNamespace = "WebAssembly", OperandType = "OPERAND_BRLIST" in			let OperandNamespace = "WebAssembly", OperandType = "OPERAND_BRLIST" in
	def brlist : Operand<i32> {			def brlist : Operand<i32> {
	let ParserMatchClass = BrListAsmOperand;			let ParserMatchClass = BrListAsmOperand;
	let PrintMethod = "printBrList";			let PrintMethod = "printBrList";

llvm/lib/Target/WebAssembly/WebAssemblyInstrInfo.td

def HeapType : Operand<i32> {		def HeapType : Operand<i32> {
let PrintMethod = "printWebAssemblyHeapTypeOperand";		let PrintMethod = "printWebAssemblyHeapTypeOperand";
}		}

let OperandType = "OPERAND_TYPEINDEX" in		let OperandType = "OPERAND_TYPEINDEX" in
def TypeIndex : Operand<i32>;		def TypeIndex : Operand<i32>;

} // OperandNamespace = "WebAssembly"		} // OperandNamespace = "WebAssembly"

		// TODO: Find more places to use this.
		def bool_node : PatLeaf<(i32 I32:$cond), [{
		return CurDAG->computeKnownBits(SDValue(N, 0)).countMinLeadingZeros() == 31;
		}]>;

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// WebAssembly Register to Stack instruction mapping		// WebAssembly Register to Stack instruction mapping
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

class StackRel;		class StackRel;
def getStackOpcode : InstrMapping {		def getStackOpcode : InstrMapping {
let FilterClass = "StackRel";		let FilterClass = "StackRel";
let RowFields = ["BaseName"];		let RowFields = ["BaseName"];

llvm/test/CodeGen/WebAssembly/comparisons-f32.ll

	; CHECK-NEXT: i32.const $push[[C0:[0-9]+]]=, 1			; CHECK-NEXT: i32.const $push[[C0:[0-9]+]]=, 1
	; CHECK-NEXT: i32.xor $push[[NUM2:[0-9]+]]=, $pop[[NUM0]], $pop[[C0]]{{$}}			; CHECK-NEXT: i32.xor $push[[NUM2:[0-9]+]]=, $pop[[NUM0]], $pop[[C0]]{{$}}
	; CHECK-NEXT: return $pop[[NUM2]]{{$}}			; CHECK-NEXT: return $pop[[NUM2]]{{$}}
	define i32 @uge_f32(float %x, float %y) {			define i32 @uge_f32(float %x, float %y) {
	%a = fcmp uge float %x, %y			%a = fcmp uge float %x, %y
	%b = zext i1 %a to i32			%b = zext i1 %a to i32
	ret i32 %b			ret i32 %b
	}			}

				; CHECK-LABEL: olt_f32_branch
				; CHECK: local.get $push[[L4:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 1
				; CHECK-NEXT: f32.lt $push[[NUM0:[0-9]+]]=, $pop[[L4]], $pop[[L3]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @olt_f32_branch(float %a, float %b) {
				entry:
				%cmp = fcmp olt float %a, %b
				br i1 %cmp, label %if.then, label %if.end

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ole_f32_branch
				; CHECK: local.get $push[[L4:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 1
				; CHECK-NEXT: f32.le $push[[NUM0:[0-9]+]]=, $pop[[L4]], $pop[[L3]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ole_f32_branch(float %a, float %b) {
				entry:
				%cmp = fcmp ole float %a, %b
				br i1 %cmp, label %if.then, label %if.end

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ugt_f32_branch
				; CHECK: local.get $push[[L4:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 1
				; CHECK-NEXT: f32.le $push[[NUM0:[0-9]+]]=, $pop[[L4]], $pop[[L3]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ugt_f32_branch(float %a, float %b) {
				entry:
				%cmp = fcmp ugt float %a, %b
				br i1 %cmp, label %if.end, label %if.then

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ogt_f32_branch
				; CHECK: local.get $push[[L4:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 1
				; CHECK-NEXT: f32.gt $push[[NUM0:[0-9]+]]=, $pop[[L4]], $pop[[L3]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ogt_f32_branch(float %a, float %b) {
				entry:
				%cmp = fcmp ogt float %a, %b
				br i1 %cmp, label %if.then, label %if.end

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ult_f32_branch
				; CHECK: local.get $push[[L4:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 1
				; CHECK-NEXT: f32.ge $push[[NUM0:[0-9]+]]=, $pop[[L4]], $pop[[L3]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ult_f32_branch(float %a, float %b) {
				entry:
				%cmp = fcmp ult float %a, %b
				br i1 %cmp, label %if.end, label %if.then

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ule_f32_branch
				; CHECK: local.get $push[[L4:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 1
				; CHECK-NEXT: f32.ge $push[[NUM0:[0-9]+]]=, $pop[[L4]], $pop[[L3]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ule_f32_branch(float %a, float %b) {
				entry:
				%cmp = fcmp ult float %a, %b
				br i1 %cmp, label %if.end, label %if.then

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: xor_zext_switch
				; CHECK: i32.const $push[[L1:[0-9]+]]=, 0
				; CHECK-NEXT: br_if 0, $pop[[L1]]
				; CHECK-NEXT: block
				; CHECK-NEXT: block
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L2:[0-9]+]]=, 1
				; CHECK-NEXT: f32.ge $push[[L0:[0-9]+]]=, $pop[[L3]], $pop[[L2]]
				; CHECK-NEXT: br_table $pop[[L0]], 0, 1, 0
				define void @xor_zext_switch(float %a, float %b) {
				entry:
				%cmp = fcmp ult float %a, %b
				%zext = zext i1 %cmp to i32
				%xor = xor i32 %zext, 1
				switch i32 %xor, label %exit [
				i32 0, label %sw.bb.1
				i32 1, label %sw.bb.2
				]

				sw.bb.1:
				tail call void @foo1()
				br label %exit

				sw.bb.2:
				tail call void @foo2()
				br label %exit

				exit:
				ret void
				}

				; CHECK-LABEL: xor_add_switch
				; CHECK: local.get $push[[L8:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L7:[0-9]+]]=, 1
				; CHECK-NEXT: f32.ge $push[[L1:[0-9]+]]=, $pop[[L8]], $pop[[L7]]
				; CHECK-NEXT: i32.const $push[[L2:[0-9]+]]=, 1
				; CHECK-NEXT: i32.xor $push[[L3:[0-9]+]]=, $pop[[L1]], $pop[[L2]]
				; CHECK-NEXT: i32.const $push[[L6:[0-9]+]]=, 1
				; CHECK-NEXT: i32.add $push[[L4:[0-9]+]]=, $pop[[L3]], $pop[[L6]]
				; CHECK-NEXT: i32.const $push[[L5:[0-9]+]]=, 1
				; CHECK-NEXT: i32.xor $push[[L0:[0-9]+]]=, $pop[[L4]], $pop[[L5]]
				; CHECK-NEXT: br_table $pop[[L0]], 0, 1, 2, 3
				define void @xor_add_switch(float %a, float %b) {
				entry:
				%cmp = fcmp ult float %a, %b
				%zext = zext i1 %cmp to i32
				%add = add nsw nuw i32 %zext, 1
				%xor = xor i32 %add, 1
				switch i32 %xor, label %exit [
				i32 0, label %sw.bb.1
				i32 1, label %sw.bb.2
				i32 2, label %sw.bb.3
				]

				sw.bb.1:
				tail call void @foo1()
				br label %exit

				sw.bb.2:
				tail call void @foo2()
				br label %exit

				sw.bb.3:
				tail call void @foo3()
				br label %exit

				exit:
				ret void
				}

				declare void @foo1()
				declare void @foo2()
				declare void @foo3()
				declare void @_Z5call1v()

llvm/test/CodeGen/WebAssembly/comparisons-f64.ll

	; CHECK-NEXT: i32.const $push[[C0:[0-9]+]]=, 1			; CHECK-NEXT: i32.const $push[[C0:[0-9]+]]=, 1
	; CHECK-NEXT: i32.xor $push[[NUM2:[0-9]+]]=, $pop[[NUM0]], $pop[[C0]]{{$}}			; CHECK-NEXT: i32.xor $push[[NUM2:[0-9]+]]=, $pop[[NUM0]], $pop[[C0]]{{$}}
	; CHECK-NEXT: return $pop[[NUM2]]{{$}}			; CHECK-NEXT: return $pop[[NUM2]]{{$}}
	define i32 @uge_f64(double %x, double %y) {			define i32 @uge_f64(double %x, double %y) {
	%a = fcmp uge double %x, %y			%a = fcmp uge double %x, %y
	%b = zext i1 %a to i32			%b = zext i1 %a to i32
	ret i32 %b			ret i32 %b
	}			}

				; CHECK-LABEL: olt_f64_branch:
				; CHECK: local.get $push[[L0:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L1:[0-9]+]]=, 1
				; CHECK-NEXT: f64.lt $push[[NUM0:[0-9]+]]=, $pop[[L0]], $pop[[L1]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @olt_f64_branch(double %a, double %b) {
				entry:
				%cmp = fcmp olt double %a, %b
				br i1 %cmp, label %if.then, label %if.end

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ole_f64_branch:
				; CHECK: local.get $push[[L0:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L1:[0-9]+]]=, 1
				; CHECK-NEXT: f64.le $push[[NUM0:[0-9]+]]=, $pop[[L0]], $pop[[L1]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ole_f64_branch(double %a, double %b) {
				entry:
				%cmp = fcmp ole double %a, %b
				br i1 %cmp, label %if.then, label %if.end

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ugt_f64_branch:
				; CHECK: local.get $push[[L0:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L1:[0-9]+]]=, 1
				; CHECK-NEXT: f64.le $push[[NUM0:[0-9]+]]=, $pop[[L0]], $pop[[L1]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ugt_f64_branch(double %a, double %b) {
				entry:
				%cmp = fcmp ugt double %a, %b
				br i1 %cmp, label %if.end, label %if.then

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ogt_f64_branch:
				; CHECK: local.get $push[[L0:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L1:[0-9]+]]=, 1
				; CHECK-NEXT: f64.gt $push[[NUM0:[0-9]+]]=, $pop[[L0]], $pop[[L1]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ogt_f64_branch(double %a, double %b) {
				entry:
				%cmp = fcmp ogt double %a, %b
				br i1 %cmp, label %if.then, label %if.end

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ult_f64_branch:
				; CHECK: local.get $push[[L0:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L1:[0-9]+]]=, 1
				; CHECK-NEXT: f64.ge $push[[NUM0:[0-9]+]]=, $pop[[L0]], $pop[[L1]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ult_f64_branch(double %a, double %b) {
				entry:
				%cmp = fcmp ult double %a, %b
				br i1 %cmp, label %if.end, label %if.then

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: ule_f64_branch:
				; CHECK: local.get $push[[L0:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L1:[0-9]+]]=, 1
				; CHECK-NEXT: f64.gt $push[[NUM0:[0-9]+]]=, $pop[[L0]], $pop[[L1]]
				; CHECK-NEXT: i32.eqz $push[[NUM3:[0-9]+]]=, $pop[[NUM0]]
				; CHECK-NEXT: br_if 0, $pop[[NUM3]]
				; CHECK-NEXT: call _Z5call1v
				define void @ule_f64_branch(double %a, double %b) {
				entry:
				%cmp = fcmp ule double %a, %b
				br i1 %cmp, label %if.end, label %if.then

				if.then:
				tail call void @_Z5call1v()
				br label %if.end

				if.end:
				ret void
				}

				; CHECK-LABEL: xor_zext_switch
				; CHECK: i32.const $push[[L1:[0-9]+]]=, 0
				; CHECK-NEXT: br_if 0, $pop[[L1]]
				; CHECK-NEXT: block
				; CHECK-NEXT: block
				; CHECK-NEXT: local.get $push[[L3:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L2:[0-9]+]]=, 1
				; CHECK-NEXT: f64.ge $push[[L0:[0-9]+]]=, $pop[[L3]], $pop[[L2]]
				; CHECK-NEXT: br_table $pop[[L0]], 0, 1, 0
				define void @xor_zext_switch(double %a, double %b) {
				entry:
				%cmp = fcmp ult double %a, %b
				%zext = zext i1 %cmp to i32
				%xor = xor i32 %zext, 1
				switch i32 %xor, label %exit [
				i32 0, label %sw.bb.1
				i32 1, label %sw.bb.2
				]

				sw.bb.1:
				tail call void @foo1()
				br label %exit

				sw.bb.2:
				tail call void @foo2()
				br label %exit

				exit:
				ret void
				}

				; CHECK-LABEL: xor_add_switch
				; CHECK: local.get $push[[L8:[0-9]+]]=, 0
				; CHECK-NEXT: local.get $push[[L7:[0-9]+]]=, 1
				; CHECK-NEXT: f64.ge $push[[L1:[0-9]+]]=, $pop[[L8]], $pop[[L7]]
				; CHECK-NEXT: i32.const $push[[L2:[0-9]+]]=, 1
				; CHECK-NEXT: i32.xor $push[[L3:[0-9]+]]=, $pop[[L1]], $pop[[L2]]
				; CHECK-NEXT: i32.const $push[[L6:[0-9]+]]=, 1
				; CHECK-NEXT: i32.add $push[[L4:[0-9]+]]=, $pop[[L3]], $pop[[L6]]
				; CHECK-NEXT: i32.const $push[[L5:[0-9]+]]=, 1
				; CHECK-NEXT: i32.xor $push[[L0:[0-9]+]]=, $pop[[L4]], $pop[[L5]]
				; CHECK-NEXT: br_table $pop[[L0]], 0, 1, 2, 3
				define void @xor_add_switch(double %a, double %b) {
				entry:
				%cmp = fcmp ult double %a, %b
				%zext = zext i1 %cmp to i32
				%add = add nsw nuw i32 %zext, 1
				%xor = xor i32 %add, 1
				switch i32 %xor, label %exit [
				i32 0, label %sw.bb.1
				i32 1, label %sw.bb.2
				i32 2, label %sw.bb.3
				]

				sw.bb.1:
				tail call void @foo1()
				br label %exit

				sw.bb.2:
				tail call void @foo2()
				br label %exit

				sw.bb.3:
				tail call void @foo3()
				br label %exit

				exit:
				ret void
				}

				declare void @foo1()
				declare void @foo2()
				declare void @foo3()
				declare void @_Z5call1v()