Details

Reviewers

aditya_nandakumar
dsanders
arsenm
volkan
paquette

Summary

This patch adds five new GICombineRules, one for each of the following unary FP instructions: G_FNEG, G_FABS, G_FPTRUNC, G_FSQRT, and G_FLOG2. Each rule matches the instruction when its operand is a constant, performs the FP operation on that constant, and replaces the original instruction with the resulting constant. The patch also adds AArch64 combiner tests that exercise the new rules.
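
As a rough illustration of the fold described in the summary, the sketch below evaluates a unary FP operation on a constant in plain standard C++. It is not the patch's code: `UnaryFpOp` and `foldUnaryFp` are hypothetical names, the value is held in a `double` purely for brevity (the review below moves to keeping everything in `APFloat`), and `FPTrunc` is assumed to narrow to `float`.

```cpp
#include <cassert>
#include <cmath>
#include <optional>

// Hypothetical stand-ins for the generic opcodes named in the summary.
enum class UnaryFpOp { FNeg, FAbs, FPTrunc, FSqrt, FLog2, Other };

// Fold `op(cst)` to a constant, or return std::nullopt when `op` is not one
// of the handled unary FP operations.
std::optional<double> foldUnaryFp(UnaryFpOp op, double cst) {
  switch (op) {
  case UnaryFpOp::FNeg:
    return -cst;
  case UnaryFpOp::FAbs:
    return std::fabs(cst);
  case UnaryFpOp::FPTrunc:
    // Assumption for this sketch: the destination type is float.
    return static_cast<double>(static_cast<float>(cst));
  case UnaryFpOp::FSqrt:
    return std::sqrt(cst);
  case UnaryFpOp::FLog2:
    return std::log2(cst);
  case UnaryFpOp::Other:
    return std::nullopt;
  }
  return std::nullopt;
}
```

The inputs used by the tests in this revision (for example the square root and base-2 logarithm of 4.0) fold the same way in this sketch.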

Diff Detail

Event Timeline

mkitzan created this revision. Aug 21 2020, 11:27 PM

Herald added a reviewer: paquette. Aug 21 2020, 11:27 PM

Herald added subscribers: llvm-commits, hiraditya, kristof.beyls.

mkitzan requested review of this revision. Aug 21 2020, 11:27 PM

Herald added a subscriber: wdng. Aug 21 2020, 11:27 PM

Harbormaster completed remote builds in B69210: Diff 287163. Aug 22 2020, 12:17 AM

Fixed clang-tidy feedback
(not enough)

Harbormaster completed remote builds in B69218: Diff 287195. Aug 22 2020, 10:15 AM

arsenm added inline comments. Aug 22 2020, 10:36 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1481	Why go through double instead of preserving the APFloat?

arsenm added inline comments. Aug 22 2020, 10:44 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1462	Wrong rounding mode

mkitzan added inline comments. Aug 22 2020, 10:46 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp

1481

Because GIDefMatchData wants to have the variable uninitialized, which would call the default ctor of APFloat which is private.

See following pseudo code:

APFloat MatchDataN; // calls APFloat()
if (matchCombineConstantFoldFpUnary(MI, MatchDataN))
  replaceInstWithAPFloat(MI, MatchDataN); // dummy function for example

The error looks like:

llvm-project/build/lib/Target/AArch64/AArch64GenPreLegalizeGICombiner.inc:343:11: error: calling a private constructor of class 'llvm::APFloat'
  APFloat MatchData23;
          ^
llvm-project/llvm/include/llvm/ADT/APFloat.h:842:3: note: implicitly declared private here
  APFloat() : U(IEEEdouble()) {
  ^

arsenm added inline comments. Aug 22 2020, 10:57 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1481	I guess you could work around this by keeping it wrapped in Optional<APFloat>
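
The workaround suggested here can be sketched in standard C++ with `std::optional` standing in for LLVM's `Optional`. This is only an illustration under assumptions: `Value`, `matchFold`, and `applyFold` are hypothetical names, and `Value` merely imitates a type whose default constructor is not accessible.

```cpp
#include <cassert>
#include <optional>
#include <type_traits>

// Hypothetical value type with no accessible default constructor, imitating
// the situation reported in the compiler error above.
class Value {
public:
  explicit Value(double v) : v_(v) {}
  double get() const { return v_; }

private:
  double v_;
};

static_assert(!std::is_default_constructible_v<Value>,
              "Value cannot be declared without an initializer");

// "Match" step: fills the match data only when the fold applies.
bool matchFold(bool isConstant, double input, std::optional<Value> &info) {
  if (!isConstant)
    return false;
  info.emplace(-input); // pretend the matched operation is a negation
  return true;
}

// "Apply" step: consumes the match data produced by a successful match.
double applyFold(const std::optional<Value> &info) {
  assert(info.has_value() && "match must have succeeded");
  return info->get();
}
```

Because `std::optional<Value>` is default-constructible (it starts out empty), generated code can declare the match data without an initializer and let the match step fill it in.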

mkitzan added inline comments. Aug 22 2020, 11:27 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1462	Should the correct rounding mode be `rmNearestTiesToEven`? Is that only for `G_FPTRUNC` to `LLT::scalar(16)` and not `LLT::scalar(32)`?
1481	That could work. I ended up liking the current solution with `replaceInstWithFConstant` over my initial prototype where I tried passing around the `APFloat&`, because `buildFConstant(DstOp, double)` will convert the `double` to the appropriate `APFloat` depending on the `LLT` of the `DstOp`. That way we can just take advantage of the existing `replaceInstWithFConstant` function.

mkitzan removed a reviewer: jpaquette. Aug 22 2020, 11:32 AM

Rebased and resolved conflicts with new changes to Combine.td.

arsenm added inline comments. Aug 24 2020, 10:26 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1462	All of the non-constrained FP instructions assume rmNearestTiesToEven independent of the type
1481	This adds limitations on supporting other FP types, like fp128. It's best to keep everything in APFloat
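
To make the rounding mode concrete, here is a small standard C++ sketch of round-to-nearest, ties-to-even when narrowing `double` to `float`. It does not use `APFloat`; it assumes an IEEE 754 platform running in the default rounding mode, and `truncToFloat` and `tieAboveOne` are hypothetical helper names.

```cpp
#include <cassert>
#include <cmath>

// Narrow a double to float. On IEEE 754 targets in the default rounding mode
// this rounds to nearest, with ties going to the even mantissa.
float truncToFloat(double x) { return static_cast<float>(x); }

// Floats just above 1.0 are spaced 2^-23 apart, so 1 + k * 2^-24 with odd k
// lies exactly halfway between two neighbouring floats.
double tieAboveOne(int k) { return 1.0 + k * std::ldexp(1.0, -24); }
```

With `k = 1` the tie sits between 1.0 (even mantissa) and 1 + 2^-23 (odd mantissa), so it rounds down to 1.0. With `k = 3` it sits between 1 + 2^-23 (odd) and 1 + 2^-22 (even), so it rounds up.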

match / apply functions use Optional<APFloat> to pass info about the constant FP.
Updated rounding modes to be rmNearestTiesToEven
Refactored constantFoldFpUnary

arsenm added inline comments. Aug 26 2020, 4:51 PM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1563–1566	We should probably have a getFltSemanticForLLT utility somewhere for this
1583	setInstrAndDebugLoc?

mkitzan added inline comments. Aug 27 2020, 7:02 PM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1563–1566	Doesn't appear to be one for `LLT`. I suspect because unlike `Type`, with its discrete number of `TypeID`s, `LLT` is very open ended.
1583	Right. Will pick this up in the next fixup.

Rebased and fixed merge conflicts
Changed setInstr to setInstrAndDebugLoc

arsenm added inline comments. Aug 28 2020, 11:16 AM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1563–1566	Yes, that's why there should be one. An FP LLT isn't any arbitrary number of bits, there's still only the handful of valid FP type combinations. We're probably going to have to add something to track f16 vs. bf16, but for now it's still a switch over a handful of valid FP sizes

Added LLT helper function to get float semantic from scalar LLT (IEEE fp semantics only at the moment)
Updated constantFoldFpUnary to use the new helper function
Rebased / fixed merge conflicts
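
The "switch over a handful of valid FP sizes" idea can be sketched without any LLVM types. The code below is a hypothetical illustration, not the helper added in this revision: `IeeeSemantics` and `semanticsForScalarBits` are invented names, and the set of widths (16, 32, 64, 128) is an assumption.

```cpp
#include <cassert>
#include <optional>

// Hypothetical stand-in for the IEEE formats a scalar width can denote.
enum class IeeeSemantics { Half, Single, Double, Quad };

// Map a scalar bit width to IEEE semantics. Widths with no IEEE format in
// this sketch yield std::nullopt.
std::optional<IeeeSemantics> semanticsForScalarBits(unsigned bits) {
  switch (bits) {
  case 16:
    return IeeeSemantics::Half; // a bare width cannot tell f16 from bf16
  case 32:
    return IeeeSemantics::Single;
  case 64:
    return IeeeSemantics::Double;
  case 128:
    return IeeeSemantics::Quad;
  default:
    return std::nullopt;
  }
}
```

A width-only lookup like this is exactly where the f16 vs. bf16 ambiguity mentioned above would surface, since both are 16 bits wide.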

paquette added inline comments. Sep 4 2020, 2:08 PM

llvm/include/llvm/Target/GlobalISel/Combine.td
294	Is there any reason these are all separate combines, when they're all using the same function? Have you found it useful to be able to turn these on/off per-opcode? Most other combines look like this:

def fconstant_matchinfo: GIDefMatchData<"Optional<APFloat>">;
def constant_fold_unary: GICombineRule <
  (defs root:$root, fconstant_matchinfo:$info),
  (match (wip_match_opcode G_FNEG, G_FABS, G_FPTRUNC, G_FSQRT, G_FLOG2):$root,
    [{ return Helper.matchCombineConstantFoldFpUnary(*${root}, ${info}); }]),
  (apply [{ return Helper.applyCombineConstantFoldFpUnary(*${root}, ${info}); }])
>;
295	Do you need separate `matchinfo` definitions? I've noticed all the combines do this, but I think it would be better to just say def fconstant_matchinfo: GIDefMatchData<"Optional<APFloat>">; and then reuse it in every combine that uses it versus redefining matchinfo for every combine. (The rest of the combines could probably be cleaned up similarly in a later commit if this works)
llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1449	This should probably assert or be a `llvm_unreachable`, since we never expect to run this with any other opcode.

mkitzan added inline comments. Sep 4 2020, 2:43 PM

llvm/include/llvm/Target/GlobalISel/Combine.td
294	No reason, except I didn't see that we could have a list of opcodes. Will fix that.
295	When they are all refactored into a single combine rule, we'll end up with one `matchinfo` for free.
llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1449	Makes sense to have an `assert`. If somehow control flow ended up there, then the developer would likely want the compiler to `assert` rather than `return None`.

Collapsed the many combine rules into a single combine rule with a list of matchable opcodes
Made the default switch case in constantFoldFpUnary be llvm_unreachable

arsenm added inline comments. Sep 9 2020, 8:04 AM

llvm/include/llvm/Target/GlobalISel/Combine.td
295	Tablegen isn't smart enough to reuse identical matchinfos (although this should really be fixed)

arsenm added inline comments. Sep 14 2020, 4:06 PM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1443	Don't need to bother with the type check?
1459	Why special case S64 and not use getFltSemanticForLLT?
llvm/lib/CodeGen/LowLevelType.cpp
64 ↗	(On Diff #290052)	I would expect this to just do .getScalarSizeInBits

mkitzan added inline comments. Sep 14 2020, 6:37 PM

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp
1443	Probably not anymore
1459	You're right, no longer necessary
llvm/lib/CodeGen/LowLevelType.cpp
64 ↗	(On Diff #290052)	I figured calling it on the vector element type would likely be the typical intended use, and calling it on the aggregate vector type would likely be a bug (and wouldn't really make much sense).

Rebased and fixed merge conflicts
Removed unnecessary type checks in constantFoldFpUnary
Removed G_FPTRUNC special case for S64 in constantFoldFpUnary

arsenm accepted this revision. Sep 16 2020, 7:53 AM

This revision is now accepted and ready to land. Sep 16 2020, 7:53 AM

mkitzan mentioned this in rGc4e589b7954c: [GISel] Add new combines for unary FP instrs with constant operand. Sep 16 2020, 10:34 AM

Committed in c4e589b7954

Diff 288505

llvm/include/llvm/CodeGen/GlobalISel/CombinerHelper.h

Context not available.
	#ifndef LLVM_CODEGEN_GLOBALISEL_COMBINER_HELPER_H
	#define LLVM_CODEGEN_GLOBALISEL_COMBINER_HELPER_H

		#include "llvm/ADT/APFloat.h"
	#include "llvm/CodeGen/LowLevelType.h"
	#include "llvm/CodeGen/Register.h"
	#include "llvm/Support/Alignment.h"
Context not available.
	bool applyCombineShiftToUnmerge(MachineInstr &MI, const unsigned &ShiftVal);
	bool tryCombineShiftToUnmerge(MachineInstr &MI, unsigned TargetShiftAmount);

		/// Transform fp_instr(cst) to constant result of the fp operation.
		bool matchCombineConstantFoldFpUnary(MachineInstr &MI,
		Optional<APFloat> &Cst);
		bool applyCombineConstantFoldFpUnary(MachineInstr &MI,
		Optional<APFloat> &Cst);

	/// Transform IntToPtr(PtrToInt(x)) to x if cast is in the same address space.
	bool matchCombineI2PToP2I(MachineInstr &MI, Register &Reg);
	bool applyCombineI2PToP2I(MachineInstr &MI, Register &Reg);
Context not available.

llvm/include/llvm/Target/GlobalISel/Combine.td

Context not available.
	(apply [{ return Helper.applySimplifyAddToSub(*${root}, ${info});}])
	>;

		// Fold fneg(cst) to cst result of fneg operation
		def constant_fneg_matchinfo: GIDefMatchData<"Optional<APFloat>">;
		def constant_fneg: GICombineRule <
		(defs root:$root, constant_fneg_matchinfo:$info),
		(match (wip_match_opcode G_FNEG):$root,
		[{ return Helper.matchCombineConstantFoldFpUnary(*${root}, ${info}); }]),
		(apply [{ return Helper.applyCombineConstantFoldFpUnary(*${root}, ${info}); }])
		>;

		// Fold fabs(cst) to cst result of fabs operation
		def constant_fabs_matchinfo: GIDefMatchData<"Optional<APFloat>">;
		def constant_fabs: GICombineRule <
		(defs root:$root, constant_fabs_matchinfo:$info),
		(match (wip_match_opcode G_FABS):$root,
		[{ return Helper.matchCombineConstantFoldFpUnary(*${root}, ${info}); }]),
		(apply [{ return Helper.applyCombineConstantFoldFpUnary(*${root}, ${info}); }])
		>;

		// Fold fptrunc(cst) to cst result of fptrunc operation
		def constant_fptrunc_matchinfo: GIDefMatchData<"Optional<APFloat>">;
		def constant_fptrunc: GICombineRule <
		(defs root:$root, constant_fptrunc_matchinfo:$info),
		(match (wip_match_opcode G_FPTRUNC):$root,
		[{ return Helper.matchCombineConstantFoldFpUnary(*${root}, ${info}); }]),
		(apply [{ return Helper.applyCombineConstantFoldFpUnary(*${root}, ${info}); }])
		>;

		// Fold fsqrt(cst) to cst result of fsqrt operation
		def constant_fsqrt_matchinfo: GIDefMatchData<"Optional<APFloat>">;
		def constant_fsqrt: GICombineRule <
		(defs root:$root, constant_fsqrt_matchinfo:$info),
		(match (wip_match_opcode G_FSQRT):$root,
		[{ return Helper.matchCombineConstantFoldFpUnary(*${root}, ${info}); }]),
		(apply [{ return Helper.applyCombineConstantFoldFpUnary(*${root}, ${info}); }])
		>;

		// Fold flog2(cst) to cst result of flog2 operation
		def constant_flog2_matchinfo: GIDefMatchData<"Optional<APFloat>">;
		def constant_flog2: GICombineRule <
		(defs root:$root, constant_flog2_matchinfo:$info),
		(match (wip_match_opcode G_FLOG2):$root,
		[{ return Helper.matchCombineConstantFoldFpUnary(*${root}, ${info}); }]),
		(apply [{ return Helper.applyCombineConstantFoldFpUnary(*${root}, ${info}); }])
		>;

	// Fold int2ptr(ptr2int(x)) -> x
	def p2i_to_i2p_matchinfo: GIDefMatchData<"Register">;
	def p2i_to_i2p: GICombineRule<
Context not available.

	def width_reduction_combines : GICombineGroup<[reduce_shl_of_extend]>;

		def constant_fp_combines : GICombineGroup<[constant_fneg, constant_fabs,
		constant_fptrunc, constant_fsqrt,
		constant_flog2]>;

	def select_combines : GICombineGroup<[select_undef_cmp, select_constant_cmp]>;

	def trivial_combines : GICombineGroup<[copy_prop, mul_to_shl, add_p2i_to_ptradd]>;
Context not available.
	hoist_logic_op_with_same_opcode_hands,
	shl_ashr_to_sext_inreg, sext_inreg_of_load,
	width_reduction_combines, select_combines,
	known_bits_simplifications, constant_fp_combines]>;
Context not available.

llvm/lib/CodeGen/GlobalISel/CombinerHelper.cpp

Context not available.
	return false;
	}

		static Optional<APFloat> constantFoldFpUnary(unsigned Opcode, LLT DstTy,
		const Register Op,
		const MachineRegisterInfo &MRI) {
		const auto S16 = LLT::scalar(16);
		const auto S32 = LLT::scalar(32);
		const auto S64 = LLT::scalar(64);
		const ConstantFP *MaybeCst = getConstantFPVRegVal(Op, MRI);

		if (!MaybeCst || !(DstTy == S16 || DstTy == S32 || DstTy == S64))
		return None;
		APFloat V = MaybeCst->getValueAPF();
		switch (Opcode) {
		default:
		return None;
		case TargetOpcode::G_FNEG: {
		V.changeSign();
		return V;
		}
		case TargetOpcode::G_FABS: {
		V.clearSign();
		return V;
		}
		case TargetOpcode::G_FPTRUNC: {
		if (DstTy == S64) {
		bool Unused;
		V.convert(APFloat::IEEEdouble(), APFloat::rmNearestTiesToEven, &Unused);
		return V;
		}
		break;
		}
		case TargetOpcode::G_FSQRT: {
		bool Unused;
		V.convert(APFloat::IEEEdouble(), APFloat::rmNearestTiesToEven, &Unused);
		V = APFloat(sqrt(V.convertToDouble()));
		break;
		}
		case TargetOpcode::G_FLOG2: {
		bool Unused;
		V.convert(APFloat::IEEEdouble(), APFloat::rmNearestTiesToEven, &Unused);
		V = APFloat(log2(V.convertToDouble()));
		break;
		}
		}

		// Convert `APFloat` to appropriate IEEE type depending on `DstTy`. Otherwise,
		// `buildFConstant` will assert on size mismatch. Only `G_FPTRUNC`, `G_FSQRT`,
		// and `G_FLOG2` reach here.
		bool Unused;
		if (DstTy == S16)
		V.convert(APFloat::IEEEhalf(), APFloat::rmNearestTiesToEven, &Unused);
		else if (DstTy == S32)
		V.convert(APFloat::IEEEsingle(), APFloat::rmNearestTiesToEven, &Unused);

		return V;
		}

		bool CombinerHelper::matchCombineConstantFoldFpUnary(MachineInstr &MI,
		Optional<APFloat> &Cst) {
		Register DstReg = MI.getOperand(0).getReg();
		Register SrcReg = MI.getOperand(1).getReg();
		LLT DstTy = MRI.getType(DstReg);
		Cst = constantFoldFpUnary(MI.getOpcode(), DstTy, SrcReg, MRI);
		return Cst.hasValue();
		}

		bool CombinerHelper::applyCombineConstantFoldFpUnary(MachineInstr &MI,
		Optional<APFloat> &Cst) {
		assert(Cst.hasValue() && "Optional is unexpectedly empty!");
		Builder.setInstrAndDebugLoc(MI);
		MachineFunction &MF = Builder.getMF();
		auto *FPVal = ConstantFP::get(MF.getFunction().getContext(), *Cst);
		Register DstReg = MI.getOperand(0).getReg();
		Builder.buildFConstant(DstReg, *FPVal);
		MI.eraseFromParent();
		return true;
		}

	bool CombinerHelper::matchPtrAddImmedChain(MachineInstr &MI,
	PtrAddChain &MatchInfo) {
	// We're trying to match the following pattern:
Context not available.

llvm/test/CodeGen/AArch64/GlobalISel/combine-fabs.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s
				# RUN: llc -debugify-and-strip-all-safe -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s

				---
				name: test_combine_half_fabs_neg_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_half_fabs_neg_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4580
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s16) = G_FCONSTANT half 0xHC580
				%1:_(s16) = G_FABS %0
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_half_fabs_pos_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_half_fabs_pos_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4580
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s16) = G_FCONSTANT half 0xH4580
				%1:_(s16) = G_FABS %0
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_float_fabs_neg_constant
				body: |
				bb.1:
				liveins: $w0
				; CHECK-LABEL: name: test_combine_float_fabs_neg_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float 5.500000e+00
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s32) = G_FCONSTANT float -5.500000e+00
				%1:_(s32) = G_FABS %0
				$w0 = COPY %1(s32)
				...
				---
				name: test_combine_float_fabs_pos_constant
				body: |
				bb.1:
				liveins: $w0
				; CHECK-LABEL: name: test_combine_float_fabs_pos_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float 5.500000e+00
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s32) = G_FCONSTANT float 5.500000e+00
				%1:_(s32) = G_FABS %0
				$w0 = COPY %1(s32)
				...
				---
				name: test_combine_double_fabs_neg_constant
				body: |
				bb.1:
				liveins: $x0
				; CHECK-LABEL: name: test_combine_double_fabs_neg_constant
				; CHECK: [[C:%[0-9]+]]:_(s64) = G_FCONSTANT double 4.200000e+00
				; CHECK: $x0 = COPY [[C]](s64)
				%0:_(s64) = G_FCONSTANT double -4.200000e+00
				%1:_(s64) = G_FABS %0
				$x0 = COPY %1(s64)
				...
				---
				name: test_combine_double_fabs_pos_constant
				body: |
				bb.1:
				liveins: $x0
				; CHECK-LABEL: name: test_combine_double_fabs_pos_constant
				; CHECK: [[C:%[0-9]+]]:_(s64) = G_FCONSTANT double 4.200000e+00
				; CHECK: $x0 = COPY [[C]](s64)
				%0:_(s64) = G_FCONSTANT double 4.200000e+00
				%1:_(s64) = G_FABS %0
				$x0 = COPY %1(s64)
				...
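
The half-precision constants in these tests are written as raw bit patterns (`0xH4580`, `0xHC580`). As a reading aid, the following standard C++ sketch decodes an IEEE 754 binary16 pattern into a `float`. `decodeHalf` is a hypothetical helper written for this illustration and is not part of the patch.

```cpp
#include <cassert>
#include <cmath>
#include <cstdint>
#include <limits>

// Decode an IEEE 754 binary16 bit pattern (the digits after `0xH`) to float.
float decodeHalf(std::uint16_t bits) {
  const int sign = (bits >> 15) & 0x1;
  const int exponent = (bits >> 10) & 0x1F;
  const int mantissa = bits & 0x3FF;
  float magnitude;
  if (exponent == 0) {
    // Zero or subnormal: no implicit leading 1, fixed exponent of -14.
    magnitude = std::ldexp(static_cast<float>(mantissa), -24);
  } else if (exponent == 0x1F) {
    // All-ones exponent encodes infinity (mantissa 0) or NaN.
    magnitude = mantissa == 0 ? std::numeric_limits<float>::infinity()
                              : std::numeric_limits<float>::quiet_NaN();
  } else {
    // Normal: (1 + mantissa / 2^10) * 2^(exponent - 15).
    magnitude = std::ldexp(static_cast<float>(mantissa + 1024), exponent - 25);
  }
  return sign ? -magnitude : magnitude;
}
```

Under this decoding, `0x4580` is 5.5 and `0xC580` is -5.5, which is why G_FABS of `0xHC580` is expected to fold to `0xH4580`.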

llvm/test/CodeGen/AArch64/GlobalISel/combine-flog2.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s
				# RUN: llc -debugify-and-strip-all-safe -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s

				---
				name: test_combine_half_flog2_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_half_flog2_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4000
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s16) = G_FCONSTANT half 4.000000e+00
				%1:_(s16) = G_FLOG2 %0
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_float_flog2_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_float_flog2_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float 2.000000e+00
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s32) = G_FCONSTANT float 4.000000e+00
				%1:_(s32) = G_FLOG2 %0
				$w0 = COPY %1(s32)
				...
				---
				name: test_combine_double_flog2_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_double_flog2_constant
				; CHECK: [[C:%[0-9]+]]:_(s64) = G_FCONSTANT double 2.000000e+00
				; CHECK: $x0 = COPY [[C]](s64)
				%0:_(s64) = G_FCONSTANT double 4.000000e+00
				%1:_(s64) = G_FLOG2 %0
				$x0 = COPY %1(s64)
				...

llvm/test/CodeGen/AArch64/GlobalISel/combine-fneg.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s
				# RUN: llc -debugify-and-strip-all-safe -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s

				---
				name: test_combine_half_fneg_neg_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_half_fneg_neg_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4580
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s16) = G_FCONSTANT half 0xHC580
				%1:_(s16) = G_FNEG %0
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_half_fneg_pos_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_half_fneg_pos_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xHC580
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s16) = G_FCONSTANT half 0xH4580
				%1:_(s16) = G_FNEG %0
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_float_fneg_neg_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_float_fneg_neg_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float 5.500000e+00
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s32) = G_FCONSTANT float -5.500000e+00
				%1:_(s32) = G_FNEG %0
				$w0 = COPY %1(s32)
				...
				---
				name: test_combine_float_fneg_pos_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_float_fneg_pos_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float -5.500000e+00
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s32) = G_FCONSTANT float 5.500000e+00
				%1:_(s32) = G_FNEG %0
				$w0 = COPY %1(s32)
				...
				---
				name: test_combine_double_fneg_neg_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_double_fneg_neg_constant
				; CHECK: [[C:%[0-9]+]]:_(s64) = G_FCONSTANT double 4.200000e+00
				; CHECK: $x0 = COPY [[C]](s64)
				%0:_(s64) = G_FCONSTANT double -4.200000e+00
				%1:_(s64) = G_FNEG %0
				$x0 = COPY %1(s64)
				...
				---
				name: test_combine_double_fneg_pos_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_double_fneg_pos_constant
				; CHECK: [[C:%[0-9]+]]:_(s64) = G_FCONSTANT double -4.200000e+00
				; CHECK: $x0 = COPY [[C]](s64)
				%0:_(s64) = G_FCONSTANT double 4.200000e+00
				%1:_(s64) = G_FNEG %0
				$x0 = COPY %1(s64)
				...

llvm/test/CodeGen/AArch64/GlobalISel/combine-fptrunc.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s
				# RUN: llc -debugify-and-strip-all-safe -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s

				---
				name: test_combine_float_to_half_fptrunc_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_float_to_half_fptrunc_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4580
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s32) = G_FCONSTANT float 5.500000e+00
				%1:_(s16) = G_FPTRUNC %0(s32)
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_double_to_half_fptrunc_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_double_to_half_fptrunc_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4433
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s64) = G_FCONSTANT double 4.200000e+00
				%1:_(s16) = G_FPTRUNC %0(s64)
				$h0 = COPY %1(s16)
				...
				---
				name: test_combine_double_to_float_fptrunc_constant
				body: |
				bb.1:
				; CHECK-LABEL: name: test_combine_double_to_float_fptrunc_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float 0x4010CCCCC0000000
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s64) = G_FCONSTANT double 4.200000e+00
				%1:_(s32) = G_FPTRUNC %0(s64)
				$w0 = COPY %1(s32)
				...
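
The expected `float 0x4010CCCCC0000000` above may look odd for a 32-bit value. LLVM's textual format prints a `float` constant that has no exact decimal spelling as the hexadecimal bit pattern of that value widened to `double`. The standard C++ sketch below reproduces the pattern, assuming IEEE 754 `float`/`double` and the default rounding mode; `floatAsDoubleBits` is a hypothetical helper name.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

static_assert(sizeof(double) == sizeof(std::uint64_t),
              "this sketch assumes a 64-bit IEEE 754 double");

// Narrow to float, widen back to double, and return the raw 64-bit pattern.
std::uint64_t floatAsDoubleBits(double x) {
  const double widened = static_cast<double>(static_cast<float>(x));
  std::uint64_t bits;
  std::memcpy(&bits, &widened, sizeof bits);
  return bits;
}
```

For 4.2 the narrowed value keeps only the top 23 mantissa bits, so the low bits of the widened pattern are zero, matching the CHECK line in the double-to-float test.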

llvm/test/CodeGen/AArch64/GlobalISel/combine-fsqrt.mir

This file was added.

				# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s
				# RUN: llc -debugify-and-strip-all-safe -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s

				---
				name: test_combine_half_fsqrt_constant
				body: |
				bb.1:
				liveins:
				; CHECK-LABEL: name: test_combine_half_fsqrt_constant
				; CHECK: [[C:%[0-9]+]]:_(s16) = G_FCONSTANT half 0xH4000
				; CHECK: $h0 = COPY [[C]](s16)
				%0:_(s16) = G_FCONSTANT half 4.000000e+00
				%1:_(s16) = G_FSQRT %0
				$h0 = COPY %1
				...
				---
				name: test_combine_float_fsqrt_constant
				body: |
				bb.1:
				liveins:
				; CHECK-LABEL: name: test_combine_float_fsqrt_constant
				; CHECK: [[C:%[0-9]+]]:_(s32) = G_FCONSTANT float 2.000000e+00
				; CHECK: $w0 = COPY [[C]](s32)
				%0:_(s32) = G_FCONSTANT float 4.000000e+00
				%1:_(s32) = G_FSQRT %0
				$w0 = COPY %1
				...
				---
				name: test_combine_double_fsqrt_constant
				body: |
				bb.1:
				liveins:
				; CHECK-LABEL: name: test_combine_double_fsqrt_constant
				; CHECK: [[C:%[0-9]+]]:_(s64) = G_FCONSTANT double 2.000000e+00
				; CHECK: $x0 = COPY [[C]](s64)
				%0:_(s64) = G_FCONSTANT double 4.000000e+00
				%1:_(s64) = G_FSQRT %0
				$x0 = COPY %1
				...

This is an archive of the discontinued LLVM Phabricator instance.

[GISel] Add combines for unary FP instrs with constant operand
ClosedPublic

