This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
flang/
-
include/flang/Optimizer/Builder/
-
flang/
-
Optimizer/
-
Builder/
-
IntrinsicCall.h
-
lib/
-
Lower/
-
ConvertExpr.cpp
-
ConvertExprToHLFIR.cpp
-
Optimizer/Builder/
-
Builder/
-
IntrinsicCall.cpp
-
test/Lower/
-
Lower/
-
HLFIR/
-
binary-ops.f90
-
assignment.f90
-
complex-operations.f90

Differential D149546

[Flang] Change complex divide lowering
ClosedPublic

Authored by kiranchandramohan on Apr 30 2023, 5:56 AM.

Download Raw Diff

Details

Reviewers

vzakhari
PeteSteinfeld
DavidTruby
SBallantyne
jeanPerier
Renaud-K

Commits

rGc3a0df1903bb: [Flang] Change complex divide lowering
rG1b9d0deb6d53: [Flang] Change complex divide lowering

Summary

Currently complex division is lowered to a fir.divc operation and the
fir.divc is later converted to a sequence of llvm operations to perform
complex division, however this causes issues for extreme values when
the calculations overflow.

This patch changes the lowering of complex division to use the Intrinsic
Call functionality to lower into library calls (for single, double,
extended and quad precisions) or an MLIR complex dialect division operation
(for half and bfloat precisions).

A new wrapper function genLibSplitComplexArgsCall is written to handle
the case of the arguments of the Complex Library calls being split to
its real and imaginary real components.

Note 1: If the Complex To Standard conversion of division operation
matures then we can use it for all precisions. Currently it has the
same issues as the conversion of fir.divc.
Note 2: A previous patch (D145808) did the same but during conversion of
the fir.divc operation. But using function calls at that stage leads to
ABI issues since the conversion to LLVM is not aware of the complex target
rewrite.
Note 3: If the patch is accepted, fir.divc can be removed from FIR. We
can use the complex.div operation where any transformation is required.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

kiranchandramohan created this revision.Apr 30 2023, 5:56 AM

Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptApr 30 2023, 5:56 AM

Herald added subscribers: sunshaoce, bzcheeseman, mehdi_amini, rriddle. · View Herald Transcript

kiranchandramohan requested review of this revision.Apr 30 2023, 5:56 AM

Herald added subscribers: stephenneuendorffer, jdoerfert. · View Herald TranscriptApr 30 2023, 5:56 AM

Harbormaster completed remote builds in B229122: Diff 518299.Apr 30 2023, 6:12 AM

kiranchandramohan added reviewers: vzakhari, PeteSteinfeld, DavidTruby, SBallantyne.May 2 2023, 2:07 AM

Thanks for fixing this up, Kiran.

All builds, tests, and looks good. All tests that failed with the previous effort now pass.

This revision is now accepted and ready to land.May 3 2023, 10:03 AM

Would it be possible to continue to use MLIR ComplexToStandard when building with fast math? When I did some tests before, using ComplexToStandard is significantly faster than doing the library calls.

In D149546#4318688, @DavidTruby wrote:

Would it be possible to continue to use MLIR ComplexToStandard when building with fast math? When I did some tests before, using ComplexToStandard is significantly faster than doing the library calls.

With the present settings, the MLIR operations are used by default and would cause correctness issues for Complex Divide when not using fast math. If we can change the setting to only use MLIR Operations with fast-math or approx-func then we can switch to MLIR Complex operations for divide at all precisions and take advantage of ComplexToStandard.

Sorry not sure I understand; from what I can see this patch changes it to always use a function call rather than any MLIR operation. I was wondering if we can keep the MLIR operation when fastmath is turned on

In D149546#4319063, @DavidTruby wrote:

Sorry not sure I understand; from what I can see this patch changes it to always use a function call rather than any MLIR operation. I was wondering if we can keep the MLIR operation when fastmath is turned on

Currently, the default setting of mathRuntimeVersion is fastVersion. So if I add MLIR complex.div generation then it will always be generated and that will cause precision issues.

I think we should explore setting the mathRuntimeVersion to preciseVersion or fastVersion based on user flags in a separate patch. Once that is done, we can enable MLIR complex.div generation with fast-math or approx-func or something similar.

Thank you for fixing this, Kiran!

Ok I understand, this patch looks good to fix the issue in the mean time, thanks!

Closed by commit rG1b9d0deb6d53: [Flang] Change complex divide lowering (authored by kiranchandramohan). · Explain WhyMay 5 2023, 6:08 AM

This revision was automatically updated to reflect the committed changes.

kiranchandramohan added a commit: rG1b9d0deb6d53: [Flang] Change complex divide lowering.

Hello,

The signature for __divsc3 is

complex float __divsc3 (float a, float b, float c, float d)

not

complex float __divsc3 (complex float a, complex float b)

Can you please fix it?

This revision is now accepted and ready to land.May 5 2023, 10:35 AM

kiranchandramohan added a reverting change: rG28e99cccf101: Revert "[Flang] Change complex divide lowering".May 5 2023, 10:44 AM

In D149546#4322664, @Renaud-K wrote:
Hello,

The signature for __divsc3 is
complex float __divsc3 (float a, float b, float c, float d)
not
complex float __divsc3 (complex float a, complex float b)
Can you please fix it?

Thanks @Renaud-K. Will revert for now.

Note 3: If the patch is accepted, fir.divc can be removed from FIR.

Are you sure you want to get rid of exposing the operation to the compiler's optimizer? There may be special-case transformations such a cplx1 * cplx2 / cplx2 -> cplx1 (see Fortran rules about arithmetic simplification), {0,0} / cplx -> {0,0}, or even floating point projection {re1, 0} / {re2, 0} -> re1 / re2 that could be performed for better performance.

In D149546#4323204, @schweitz wrote:

Note 3: If the patch is accepted, fir.divc can be removed from FIR.

Are you sure you want to get rid of exposing the operation to the compiler's optimizer? There may be special-case transformations such a cplx1 * cplx2 / cplx2 -> cplx1 (see Fortran rules about arithmetic simplification), {0,0} / cplx -> {0,0}, or even floating point projection {re1, 0} / {re2, 0} -> re1 / re2 that could be performed for better performance.

Hi Eric,
I was hoping can use the MLIR complex dialect complex.div for these transformations just like we use the math dialect operations.

Update to fix issue pointed by @Renaud-K. This is accomplished by
a wrapper function that correctly calls the library functions with
the complex arguments split into the real and imaginary parts, the
result value is still a complex value.

Also updated HLFIR to use the new lowering.

kiranchandramohan edited the summary of this revision. (Show Details)May 9 2023, 7:21 AM

kiranchandramohan added a reviewer: jeanPerier.

kiranchandramohan added a reviewer: Renaud-K.May 9 2023, 7:26 AM

Harbormaster completed remote builds in B230867: Diff 520695.May 9 2023, 7:35 AM

Thanks for updating the HLFIR path too!

Closed by commit rGc3a0df1903bb: [Flang] Change complex divide lowering (authored by kiranchandramohan). · Explain WhyMay 11 2023, 5:04 AM

This revision was automatically updated to reflect the committed changes.

kiranchandramohan added a commit: rGc3a0df1903bb: [Flang] Change complex divide lowering.

Revision Contents

Path

Size

flang/

include/

flang/

Optimizer/

Builder/

IntrinsicCall.h

5 lines

lib/

Lower/

ConvertExpr.cpp

27 lines

ConvertExprToHLFIR.cpp

17 lines

Optimizer/

Builder/

IntrinsicCall.cpp

69 lines

test/

Lower/

HLFIR/

binary-ops.f90

7 lines

assignment.f90

6 lines

complex-operations.f90

93 lines

Diff 521262

flang/include/flang/Optimizer/Builder/IntrinsicCall.h

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	/// must have the same type.			/// must have the same type.
	mlir::Value genMax(fir::FirOpBuilder &, mlir::Location,			mlir::Value genMax(fir::FirOpBuilder &, mlir::Location,
	llvm::ArrayRef<mlir::Value> args);			llvm::ArrayRef<mlir::Value> args);

	/// Generate minimum. Same constraints as genMax.			/// Generate minimum. Same constraints as genMax.
	mlir::Value genMin(fir::FirOpBuilder &, mlir::Location,			mlir::Value genMin(fir::FirOpBuilder &, mlir::Location,
	llvm::ArrayRef<mlir::Value> args);			llvm::ArrayRef<mlir::Value> args);

				/// Generate Complex divide with the given expected
				/// result type.
				mlir::Value genDivC(fir::FirOpBuilder &builder, mlir::Location loc,
				mlir::Type resultType, mlir::Value x, mlir::Value y);

	/// Generate power function x**y with the given expected			/// Generate power function x**y with the given expected
	/// result type.			/// result type.
	mlir::Value genPow(fir::FirOpBuilder &, mlir::Location, mlir::Type resultType,			mlir::Value genPow(fir::FirOpBuilder &, mlir::Location, mlir::Type resultType,
	mlir::Value x, mlir::Value y);			mlir::Value x, mlir::Value y);

	} // namespace fir			} // namespace fir

	#endif // FORTRAN_LOWER_INTRINSICCALL_H			#endif // FORTRAN_LOWER_INTRINSICCALL_H

flang/lib/Lower/ConvertExpr.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,074 Lines • ▼ Show 20 Lines	#define GENBIN(GenBinEvOp, GenBinTyCat, GenBinFirOp) \
GENBIN(Subtract, Integer, mlir::arith::SubIOp)		GENBIN(Subtract, Integer, mlir::arith::SubIOp)
GENBIN(Subtract, Real, mlir::arith::SubFOp)		GENBIN(Subtract, Real, mlir::arith::SubFOp)
GENBIN(Subtract, Complex, fir::SubcOp)		GENBIN(Subtract, Complex, fir::SubcOp)
GENBIN(Multiply, Integer, mlir::arith::MulIOp)		GENBIN(Multiply, Integer, mlir::arith::MulIOp)
GENBIN(Multiply, Real, mlir::arith::MulFOp)		GENBIN(Multiply, Real, mlir::arith::MulFOp)
GENBIN(Multiply, Complex, fir::MulcOp)		GENBIN(Multiply, Complex, fir::MulcOp)
GENBIN(Divide, Integer, mlir::arith::DivSIOp)		GENBIN(Divide, Integer, mlir::arith::DivSIOp)
GENBIN(Divide, Real, mlir::arith::DivFOp)		GENBIN(Divide, Real, mlir::arith::DivFOp)
GENBIN(Divide, Complex, fir::DivcOp)
		template <int KIND>
		ExtValue genval(const Fortran::evaluate::Divide<Fortran::evaluate::Type<
		Fortran::common::TypeCategory::Complex, KIND>> &op) {
		mlir::Type ty =
		converter.genType(Fortran::common::TypeCategory::Complex, KIND);
		mlir::Value lhs = genunbox(op.left());
		mlir::Value rhs = genunbox(op.right());
		return fir::genDivC(builder, getLoc(), ty, lhs, rhs);
		}

template <Fortran::common::TypeCategory TC, int KIND>		template <Fortran::common::TypeCategory TC, int KIND>
ExtValue genval(		ExtValue genval(
const Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>> &op) {		const Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>> &op) {
mlir::Type ty = converter.genType(TC, KIND);		mlir::Type ty = converter.genType(TC, KIND);
mlir::Value lhs = genunbox(op.left());		mlir::Value lhs = genunbox(op.left());
mlir::Value rhs = genunbox(op.right());		mlir::Value rhs = genunbox(op.right());
return fir::genPow(builder, getLoc(), ty, lhs, rhs);		return fir::genPow(builder, getLoc(), ty, lhs, rhs);
▲ Show 20 Lines • Show All 3,985 Lines • ▼ Show 20 Lines	#define GENBIN(GenBinEvOp, GenBinTyCat, GenBinFirOp) \
GENBIN(Subtract, Integer, mlir::arith::SubIOp)		GENBIN(Subtract, Integer, mlir::arith::SubIOp)
GENBIN(Subtract, Real, mlir::arith::SubFOp)		GENBIN(Subtract, Real, mlir::arith::SubFOp)
GENBIN(Subtract, Complex, fir::SubcOp)		GENBIN(Subtract, Complex, fir::SubcOp)
GENBIN(Multiply, Integer, mlir::arith::MulIOp)		GENBIN(Multiply, Integer, mlir::arith::MulIOp)
GENBIN(Multiply, Real, mlir::arith::MulFOp)		GENBIN(Multiply, Real, mlir::arith::MulFOp)
GENBIN(Multiply, Complex, fir::MulcOp)		GENBIN(Multiply, Complex, fir::MulcOp)
GENBIN(Divide, Integer, mlir::arith::DivSIOp)		GENBIN(Divide, Integer, mlir::arith::DivSIOp)
GENBIN(Divide, Real, mlir::arith::DivFOp)		GENBIN(Divide, Real, mlir::arith::DivFOp)
GENBIN(Divide, Complex, fir::DivcOp)
		template <int KIND>
		CC genarr(const Fortran::evaluate::Divide<Fortran::evaluate::Type<
		Fortran::common::TypeCategory::Complex, KIND>> &x) {
		mlir::Location loc = getLoc();
		mlir::Type ty =
		converter.genType(Fortran::common::TypeCategory::Complex, KIND);
		auto lf = genarr(x.left());
		auto rf = genarr(x.right());
		return [=](IterSpace iters) -> ExtValue {
		mlir::Value lhs = fir::getBase(lf(iters));
		mlir::Value rhs = fir::getBase(rf(iters));
		return fir::genDivC(builder, loc, ty, lhs, rhs);
		};
		}

template <Fortran::common::TypeCategory TC, int KIND>		template <Fortran::common::TypeCategory TC, int KIND>
CC genarr(		CC genarr(
const Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>> &x) {		const Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>> &x) {
mlir::Location loc = getLoc();		mlir::Location loc = getLoc();
mlir::Type ty = converter.genType(TC, KIND);		mlir::Type ty = converter.genType(TC, KIND);
auto lf = genarr(x.left());		auto lf = genarr(x.left());
auto rf = genarr(x.right());		auto rf = genarr(x.right());
▲ Show 20 Lines • Show All 2,386 Lines • Show Last 20 Lines

flang/lib/Lower/ConvertExprToHLFIR.cpp

	Show First 20 Lines • Show All 942 Lines • ▼ Show 20 Lines
	GENBIN(Subtract, Integer, mlir::arith::SubIOp)			GENBIN(Subtract, Integer, mlir::arith::SubIOp)
	GENBIN(Subtract, Real, mlir::arith::SubFOp)			GENBIN(Subtract, Real, mlir::arith::SubFOp)
	GENBIN(Subtract, Complex, fir::SubcOp)			GENBIN(Subtract, Complex, fir::SubcOp)
	GENBIN(Multiply, Integer, mlir::arith::MulIOp)			GENBIN(Multiply, Integer, mlir::arith::MulIOp)
	GENBIN(Multiply, Real, mlir::arith::MulFOp)			GENBIN(Multiply, Real, mlir::arith::MulFOp)
	GENBIN(Multiply, Complex, fir::MulcOp)			GENBIN(Multiply, Complex, fir::MulcOp)
	GENBIN(Divide, Integer, mlir::arith::DivSIOp)			GENBIN(Divide, Integer, mlir::arith::DivSIOp)
	GENBIN(Divide, Real, mlir::arith::DivFOp)			GENBIN(Divide, Real, mlir::arith::DivFOp)
	GENBIN(Divide, Complex, fir::DivcOp)
				template <int KIND>
				struct BinaryOp<Fortran::evaluate::Divide<
				Fortran::evaluate::Type<Fortran::common::TypeCategory::Complex, KIND>>> {
				using Op = Fortran::evaluate::Divide<
				Fortran::evaluate::Type<Fortran::common::TypeCategory::Complex, KIND>>;
				static hlfir::EntityWithAttributes gen(mlir::Location loc,
				fir::FirOpBuilder &builder, const Op &,
				hlfir::Entity lhs, hlfir::Entity rhs) {
				mlir::Type ty = Fortran::lower::getFIRType(
				builder.getContext(), Fortran::common::TypeCategory::Complex, KIND,
				/params=/std::nullopt);
				return hlfir::EntityWithAttributes{
				fir::genDivC(builder, loc, ty, lhs, rhs)};
				}
				};

	template <Fortran::common::TypeCategory TC, int KIND>			template <Fortran::common::TypeCategory TC, int KIND>
	struct BinaryOp<Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>>> {			struct BinaryOp<Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>>> {
	using Op = Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>>;			using Op = Fortran::evaluate::Power<Fortran::evaluate::Type<TC, KIND>>;
	static hlfir::EntityWithAttributes gen(mlir::Location loc,			static hlfir::EntityWithAttributes gen(mlir::Location loc,
	fir::FirOpBuilder &builder, const Op &,			fir::FirOpBuilder &builder, const Op &,
	hlfir::Entity lhs, hlfir::Entity rhs) {			hlfir::Entity lhs, hlfir::Entity rhs) {
	mlir::Type ty = Fortran::lower::getFIRType(builder.getContext(), TC, KIND,			mlir::Type ty = Fortran::lower::getFIRType(builder.getContext(), TC, KIND,
	▲ Show 20 Lines • Show All 741 Lines • Show Last 20 Lines

flang/lib/Optimizer/Builder/IntrinsicCall.cpp

Show First 20 Lines • Show All 1,178 Lines • ▼ Show 20 Lines	if (soughtFuncType == libFuncType) {
libCall = builder.create<fir::CallOp>(loc, libFuncType.getResults(),		libCall = builder.create<fir::CallOp>(loc, libFuncType.getResults(),
nullptr, operands);		nullptr, operands);
}		}

LLVM_DEBUG(libCall.dump(); llvm::dbgs() << "\n");		LLVM_DEBUG(libCall.dump(); llvm::dbgs() << "\n");
return libCall.getResult(0);		return libCall.getResult(0);
}		}

		static mlir::Value genLibSplitComplexArgsCall(
		fir::FirOpBuilder &builder, mlir::Location loc, llvm::StringRef libFuncName,
		mlir::FunctionType libFuncType, llvm::ArrayRef<mlir::Value> args) {
		assert(args.size() == 2 && "Incorrect #args to genLibSplitComplexArgsCall");

		auto getSplitComplexArgsType = [&builder, &args]() -> mlir::FunctionType {
		mlir::Type ctype = args[0].getType();
		auto fKind = ctype.cast<fir::ComplexType>().getFKind();
		mlir::Type ftype;

		if (fKind == 2)
		ftype = builder.getF16Type();
		else if (fKind == 3)
		ftype = builder.getBF16Type();
		else if (fKind == 4)
		ftype = builder.getF32Type();
		else if (fKind == 8)
		ftype = builder.getF64Type();
		else if (fKind == 10)
		ftype = builder.getF80Type();
		else if (fKind == 16)
		ftype = builder.getF128Type();
		else
		assert(0 && "Unsupported Complex Type");

		return builder.getFunctionType({ftype, ftype, ftype, ftype}, {ctype});
		};

		llvm::SmallVector<mlir::Value, 4> splitArgs;
		mlir::Value cplx1 = args[0];
		auto real1 = fir::factory::Complex{builder, loc}.extractComplexPart(
		cplx1, /isImagPart=/false);
		splitArgs.push_back(real1);
		auto imag1 = fir::factory::Complex{builder, loc}.extractComplexPart(
		cplx1, /isImagPart=/true);
		splitArgs.push_back(imag1);
		mlir::Value cplx2 = args[1];
		auto real2 = fir::factory::Complex{builder, loc}.extractComplexPart(
		cplx2, /isImagPart=/false);
		splitArgs.push_back(real2);
		auto imag2 = fir::factory::Complex{builder, loc}.extractComplexPart(
		cplx2, /isImagPart=/true);
		splitArgs.push_back(imag2);

		return genLibCall(builder, loc, libFuncName, getSplitComplexArgsType(),
		splitArgs);
		}

template <typename T>		template <typename T>
static mlir::Value genMathOp(fir::FirOpBuilder &builder, mlir::Location loc,		static mlir::Value genMathOp(fir::FirOpBuilder &builder, mlir::Location loc,
llvm::StringRef mathLibFuncName,		llvm::StringRef mathLibFuncName,
mlir::FunctionType mathLibFuncType,		mlir::FunctionType mathLibFuncType,
llvm::ArrayRef<mlir::Value> args) {		llvm::ArrayRef<mlir::Value> args) {
// TODO: we have to annotate the math operations with flags		// TODO: we have to annotate the math operations with flags
// that will allow to define FP accuracy/exception		// that will allow to define FP accuracy/exception
// behavior per operation, so that after early multi-module		// behavior per operation, so that after early multi-module
▲ Show 20 Lines • Show All 145 Lines • ▼ Show 20 Lines	static constexpr MathOperation mathOperations[] = {
{"cos", "ccosf", genComplexComplexFuncType<4>,		{"cos", "ccosf", genComplexComplexFuncType<4>,
genComplexMathOp<mlir::complex::CosOp>},		genComplexMathOp<mlir::complex::CosOp>},
{"cos", "ccos", genComplexComplexFuncType<8>,		{"cos", "ccos", genComplexComplexFuncType<8>,
genComplexMathOp<mlir::complex::CosOp>},		genComplexMathOp<mlir::complex::CosOp>},
{"cosh", "coshf", genF32F32FuncType, genLibCall},		{"cosh", "coshf", genF32F32FuncType, genLibCall},
{"cosh", "cosh", genF64F64FuncType, genLibCall},		{"cosh", "cosh", genF64F64FuncType, genLibCall},
{"cosh", "ccoshf", genComplexComplexFuncType<4>, genLibCall},		{"cosh", "ccoshf", genComplexComplexFuncType<4>, genLibCall},
{"cosh", "ccosh", genComplexComplexFuncType<8>, genLibCall},		{"cosh", "ccosh", genComplexComplexFuncType<8>, genLibCall},
		{"divc",
		{},
		genComplexComplexComplexFuncType<2>,
		genComplexMathOp<mlir::complex::DivOp>},
		{"divc",
		{},
		genComplexComplexComplexFuncType<3>,
		genComplexMathOp<mlir::complex::DivOp>},
		{"divc", "__divsc3", genComplexComplexComplexFuncType<4>,
		genLibSplitComplexArgsCall},
		{"divc", "__divdc3", genComplexComplexComplexFuncType<8>,
		genLibSplitComplexArgsCall},
		{"divc", "__divxc3", genComplexComplexComplexFuncType<10>,
		genLibSplitComplexArgsCall},
		{"divc", "__divtc3", genComplexComplexComplexFuncType<16>,
		genLibSplitComplexArgsCall},
{"erf", "erff", genF32F32FuncType, genMathOp<mlir::math::ErfOp>},		{"erf", "erff", genF32F32FuncType, genMathOp<mlir::math::ErfOp>},
{"erf", "erf", genF64F64FuncType, genMathOp<mlir::math::ErfOp>},		{"erf", "erf", genF64F64FuncType, genMathOp<mlir::math::ErfOp>},
{"erfc", "erfcf", genF32F32FuncType, genLibCall},		{"erfc", "erfcf", genF32F32FuncType, genLibCall},
{"erfc", "erfc", genF64F64FuncType, genLibCall},		{"erfc", "erfc", genF64F64FuncType, genLibCall},
{"exp", "expf", genF32F32FuncType, genMathOp<mlir::math::ExpOp>},		{"exp", "expf", genF32F32FuncType, genMathOp<mlir::math::ExpOp>},
{"exp", "exp", genF64F64FuncType, genMathOp<mlir::math::ExpOp>},		{"exp", "exp", genF64F64FuncType, genMathOp<mlir::math::ExpOp>},
{"exp", "cexpf", genComplexComplexFuncType<4>,		{"exp", "cexpf", genComplexComplexFuncType<4>,
genComplexMathOp<mlir::complex::ExpOp>},		genComplexMathOp<mlir::complex::ExpOp>},
▲ Show 20 Lines • Show All 4,300 Lines • ▼ Show 20 Lines
mlir::Value fir::genMin(fir::FirOpBuilder &builder, mlir::Location loc,		mlir::Value fir::genMin(fir::FirOpBuilder &builder, mlir::Location loc,
llvm::ArrayRef<mlir::Value> args) {		llvm::ArrayRef<mlir::Value> args) {
assert(args.size() > 0 && "min requires at least one argument");		assert(args.size() > 0 && "min requires at least one argument");
return IntrinsicLibrary{builder, loc}		return IntrinsicLibrary{builder, loc}
.genExtremum<Extremum::Min, ExtremumBehavior::MinMaxss>(args[0].getType(),		.genExtremum<Extremum::Min, ExtremumBehavior::MinMaxss>(args[0].getType(),
args);		args);
}		}

		mlir::Value fir::genDivC(fir::FirOpBuilder &builder, mlir::Location loc,
		mlir::Type type, mlir::Value x, mlir::Value y) {
		return IntrinsicLibrary{builder, loc}.genRuntimeCall("divc", type, {x, y});
		}

mlir::Value fir::genPow(fir::FirOpBuilder &builder, mlir::Location loc,		mlir::Value fir::genPow(fir::FirOpBuilder &builder, mlir::Location loc,
mlir::Type type, mlir::Value x, mlir::Value y) {		mlir::Type type, mlir::Value x, mlir::Value y) {
// TODO: since there is no libm version of pow with integer exponent,		// TODO: since there is no libm version of pow with integer exponent,
// we have to provide an alternative implementation for		// we have to provide an alternative implementation for
// "precise/strict" FP mode.		// "precise/strict" FP mode.
// One option is to generate internal function with inlined		// One option is to generate internal function with inlined
// implementation and mark it 'strictfp'.		// implementation and mark it 'strictfp'.
// Another option is to implement it in Fortran runtime library		// Another option is to implement it in Fortran runtime library
Show All 10 Lines

flang/test/Lower/HLFIR/binary-ops.f90

	Show First 20 Lines • Show All 125 Lines • ▼ Show 20 Lines
	complex :: x, y, z			complex :: x, y, z
	x = y / z			x = y / z
	end subroutine			end subroutine
	! CHECK-LABEL: func.func @_QPcomplex_div(			! CHECK-LABEL: func.func @_QPcomplex_div(
	! CHECK: %[[VAL_4:.]]:2 = hlfir.declare %{{.}}y"} : (!fir.ref<!fir.complex<4>>) -> (!fir.ref<!fir.complex<4>>, !fir.ref<!fir.complex<4>>)			! CHECK: %[[VAL_4:.]]:2 = hlfir.declare %{{.}}y"} : (!fir.ref<!fir.complex<4>>) -> (!fir.ref<!fir.complex<4>>, !fir.ref<!fir.complex<4>>)
	! CHECK: %[[VAL_5:.]]:2 = hlfir.declare %{{.}}z"} : (!fir.ref<!fir.complex<4>>) -> (!fir.ref<!fir.complex<4>>, !fir.ref<!fir.complex<4>>)			! CHECK: %[[VAL_5:.]]:2 = hlfir.declare %{{.}}z"} : (!fir.ref<!fir.complex<4>>) -> (!fir.ref<!fir.complex<4>>, !fir.ref<!fir.complex<4>>)
	! CHECK: %[[VAL_6:.*]] = fir.load %[[VAL_4]]#0 : !fir.ref<!fir.complex<4>>			! CHECK: %[[VAL_6:.*]] = fir.load %[[VAL_4]]#0 : !fir.ref<!fir.complex<4>>
	! CHECK: %[[VAL_7:.*]] = fir.load %[[VAL_5]]#0 : !fir.ref<!fir.complex<4>>			! CHECK: %[[VAL_7:.*]] = fir.load %[[VAL_5]]#0 : !fir.ref<!fir.complex<4>>
	! CHECK: %[[VAL_8:.*]] = fir.divc %[[VAL_6]], %[[VAL_7]] : !fir.complex<4>			! CHECK: %[[VAL_8:.*]] = fir.extract_value %[[VAL_6]], [0 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[VAL_9:.*]] = fir.extract_value %[[VAL_6]], [1 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[VAL_10:.*]] = fir.extract_value %[[VAL_7]], [0 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[VAL_11:.*]] = fir.extract_value %[[VAL_7]], [1 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[VAL_12:.*]] = fir.call @__divsc3(%[[VAL_8]], %[[VAL_9]], %[[VAL_10]], %[[VAL_11]]) fastmath<contract> : (f32, f32, f32, f32) -> !fir.complex<4>

	subroutine int_power(x, y, z)			subroutine int_power(x, y, z)
	integer :: x, y, z			integer :: x, y, z
	x = y**z			x = y**z
	end subroutine			end subroutine
	! CHECK-LABEL: func.func @_QPint_power(			! CHECK-LABEL: func.func @_QPint_power(
	! CHECK: %[[VAL_4:.]]:2 = hlfir.declare %{{.}}y"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)			! CHECK: %[[VAL_4:.]]:2 = hlfir.declare %{{.}}y"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)
	! CHECK: %[[VAL_5:.]]:2 = hlfir.declare %{{.}}z"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)			! CHECK: %[[VAL_5:.]]:2 = hlfir.declare %{{.}}z"} : (!fir.ref<i32>) -> (!fir.ref<i32>, !fir.ref<i32>)
	▲ Show 20 Lines • Show All 209 Lines • Show Last 20 Lines

flang/test/Lower/assignment.f90

	Show First 20 Lines • Show All 245 Lines • ▼ Show 20 Lines
	end			end

	! CHECK-LABEL: func @_QPdivc(			! CHECK-LABEL: func @_QPdivc(
	! CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.complex<4>> {fir.bindc_name = "a"},			! CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.complex<4>> {fir.bindc_name = "a"},
	! CHECK-SAME: %[[B:.*]]: !fir.ref<!fir.complex<4>> {fir.bindc_name = "b"}			! CHECK-SAME: %[[B:.*]]: !fir.ref<!fir.complex<4>> {fir.bindc_name = "b"}
	! CHECK: %[[FCTRES:.*]] = fir.alloca !fir.complex<4>			! CHECK: %[[FCTRES:.*]] = fir.alloca !fir.complex<4>
	! CHECK: %[[A_VAL:.*]] = fir.load %[[A]] : !fir.ref<!fir.complex<4>>			! CHECK: %[[A_VAL:.*]] = fir.load %[[A]] : !fir.ref<!fir.complex<4>>
	! CHECK: %[[B_VAL:.*]] = fir.load %[[B]] : !fir.ref<!fir.complex<4>>			! CHECK: %[[B_VAL:.*]] = fir.load %[[B]] : !fir.ref<!fir.complex<4>>
	! CHECK: %[[DIV:.*]] = fir.divc %[[A_VAL]], %[[B_VAL]] : !fir.complex<4>			! CHECK: %[[A_REAL:.*]] = fir.extract_value %[[A_VAL]], [0 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[A_IMAG:.*]] = fir.extract_value %[[A_VAL]], [1 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[B_REAL:.*]] = fir.extract_value %[[B_VAL]], [0 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[B_IMAG:.*]] = fir.extract_value %[[B_VAL]], [1 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[DIV:.*]] = fir.call @__divsc3(%[[A_REAL]], %[[A_IMAG]], %[[B_REAL]], %[[B_IMAG]]) fastmath<contract> : (f32, f32, f32, f32) -> !fir.complex<4>
	! CHECK: fir.store %[[DIV]] to %[[FCTRES]] : !fir.ref<!fir.complex<4>>			! CHECK: fir.store %[[DIV]] to %[[FCTRES]] : !fir.ref<!fir.complex<4>>
	! CHECK: %[[RET:.*]] = fir.load %[[FCTRES]] : !fir.ref<!fir.complex<4>>			! CHECK: %[[RET:.*]] = fir.load %[[FCTRES]] : !fir.ref<!fir.complex<4>>
	! CHECK: return %[[RET]] : !fir.complex<4>			! CHECK: return %[[RET]] : !fir.complex<4>

	subroutine real_constant()			subroutine real_constant()
	real(2) :: a			real(2) :: a
	real(4) :: b			real(4) :: b
	real(8) :: c			real(8) :: c
	▲ Show 20 Lines • Show All 74 Lines • Show Last 20 Lines

flang/test/Lower/complex-operations.f90

	Show All 21 Lines
	subroutine mul_test(a,b,c)			subroutine mul_test(a,b,c)
	complex :: a, b, c			complex :: a, b, c
	! CHECK-NOT: fir.extract_value			! CHECK-NOT: fir.extract_value
	! CHECK-NOT: fir.insert_value			! CHECK-NOT: fir.insert_value
	! CHECK: fir.mulc {{.*}}: !fir.complex			! CHECK: fir.mulc {{.*}}: !fir.complex
	a = b * c			a = b * c
	end subroutine mul_test			end subroutine mul_test

	! CHECK-LABEL: @_QPdiv_test			! CHECK-LABEL: @_QPdiv_test_half
	subroutine div_test(a,b,c)			! CHECK-SAME: %[[AREF:.]]: !fir.ref<!fir.complex<2>> {{.}}, %[[BREF:.]]: !fir.ref<!fir.complex<2>> {{.}}, %[[CREF:.]]: !fir.ref<!fir.complex<2>> {{.}})
	complex :: a, b, c			! CHECK: %[[BVAL:.*]] = fir.load %[[BREF]] : !fir.ref<!fir.complex<2>>
	! CHECK-NOT: fir.extract_value			! CHECK: %[[CVAL:.*]] = fir.load %[[CREF]] : !fir.ref<!fir.complex<2>>
	! CHECK-NOT: fir.insert_value			! CHECK: %[[BVAL_CVT:.*]] = fir.convert %[[BVAL]] : (!fir.complex<2>) -> complex<f16>
	! CHECK: fir.divc {{.*}}: !fir.complex			! CHECK: %[[CVAL_CVT:.*]] = fir.convert %[[CVAL]] : (!fir.complex<2>) -> complex<f16>
				! CHECK: %[[AVAL_CVT:.*]] = complex.div %[[BVAL_CVT]], %[[CVAL_CVT]] : complex<f16>
				! CHECK: %[[AVAL:.*]] = fir.convert %[[AVAL_CVT]] : (complex<f16>) -> !fir.complex<2>
				! CHECK: fir.store %[[AVAL]] to %[[AREF]] : !fir.ref<!fir.complex<2>>
				subroutine div_test_half(a,b,c)
				complex(kind=2) :: a, b, c
				a = b / c
				end subroutine div_test_half

				! CHECK-LABEL: @_QPdiv_test_bfloat
				! CHECK-SAME: %[[AREF:.]]: !fir.ref<!fir.complex<3>> {{.}}, %[[BREF:.]]: !fir.ref<!fir.complex<3>> {{.}}, %[[CREF:.]]: !fir.ref<!fir.complex<3>> {{.}})
				! CHECK: %[[BVAL:.*]] = fir.load %[[BREF]] : !fir.ref<!fir.complex<3>>
				! CHECK: %[[CVAL:.*]] = fir.load %[[CREF]] : !fir.ref<!fir.complex<3>>
				! CHECK: %[[BVAL_CVT:.*]] = fir.convert %[[BVAL]] : (!fir.complex<3>) -> complex<bf16>
				! CHECK: %[[CVAL_CVT:.*]] = fir.convert %[[CVAL]] : (!fir.complex<3>) -> complex<bf16>
				! CHECK: %[[AVAL_CVT:.*]] = complex.div %[[BVAL_CVT]], %[[CVAL_CVT]] : complex<bf16>
				! CHECK: %[[AVAL:.*]] = fir.convert %[[AVAL_CVT]] : (complex<bf16>) -> !fir.complex<3>
				! CHECK: fir.store %[[AVAL]] to %[[AREF]] : !fir.ref<!fir.complex<3>>
				subroutine div_test_bfloat(a,b,c)
				complex(kind=3) :: a, b, c
				a = b / c
				end subroutine div_test_bfloat

				! CHECK-LABEL: @_QPdiv_test_single
				! CHECK-SAME: %[[AREF:.]]: !fir.ref<!fir.complex<4>> {{.}}, %[[BREF:.]]: !fir.ref<!fir.complex<4>> {{.}}, %[[CREF:.]]: !fir.ref<!fir.complex<4>> {{.}})
				! CHECK: %[[BVAL:.*]] = fir.load %[[BREF]] : !fir.ref<!fir.complex<4>>
				! CHECK: %[[CVAL:.*]] = fir.load %[[CREF]] : !fir.ref<!fir.complex<4>>
				! CHECK: %[[BREAL:.*]] = fir.extract_value %[[BVAL]], [0 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[BIMAG:.*]] = fir.extract_value %[[BVAL]], [1 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[CREAL:.*]] = fir.extract_value %[[CVAL]], [0 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[CIMAG:.*]] = fir.extract_value %[[CVAL]], [1 : index] : (!fir.complex<4>) -> f32
				! CHECK: %[[AVAL:.*]] = fir.call @__divsc3(%[[BREAL]], %[[BIMAG]], %[[CREAL]], %[[CIMAG]]) fastmath<contract> : (f32, f32, f32, f32) -> !fir.complex<4>
				! CHECK: fir.store %[[AVAL]] to %[[AREF]] : !fir.ref<!fir.complex<4>>
				subroutine div_test_single(a,b,c)
				complex(kind=4) :: a, b, c
				a = b / c
				end subroutine div_test_single

				! CHECK-LABEL: @_QPdiv_test_double
				! CHECK-SAME: %[[AREF:.]]: !fir.ref<!fir.complex<8>> {{.}}, %[[BREF:.]]: !fir.ref<!fir.complex<8>> {{.}}, %[[CREF:.]]: !fir.ref<!fir.complex<8>> {{.}})
				! CHECK: %[[BVAL:.*]] = fir.load %[[BREF]] : !fir.ref<!fir.complex<8>>
				! CHECK: %[[CVAL:.*]] = fir.load %[[CREF]] : !fir.ref<!fir.complex<8>>
				! CHECK: %[[BREAL:.*]] = fir.extract_value %[[BVAL]], [0 : index] : (!fir.complex<8>) -> f64
				! CHECK: %[[BIMAG:.*]] = fir.extract_value %[[BVAL]], [1 : index] : (!fir.complex<8>) -> f64
				! CHECK: %[[CREAL:.*]] = fir.extract_value %[[CVAL]], [0 : index] : (!fir.complex<8>) -> f64
				! CHECK: %[[CIMAG:.*]] = fir.extract_value %[[CVAL]], [1 : index] : (!fir.complex<8>) -> f64
				! CHECK: %[[AVAL:.*]] = fir.call @__divdc3(%[[BREAL]], %[[BIMAG]], %[[CREAL]], %[[CIMAG]]) fastmath<contract> : (f64, f64, f64, f64) -> !fir.complex<8>
				! CHECK: fir.store %[[AVAL]] to %[[AREF]] : !fir.ref<!fir.complex<8>>
				subroutine div_test_double(a,b,c)
				complex(kind=8) :: a, b, c
				a = b / c
				end subroutine div_test_double

				! CHECK-LABEL: @_QPdiv_test_extended
				! CHECK-SAME: %[[AREF:.]]: !fir.ref<!fir.complex<10>> {{.}}, %[[BREF:.]]: !fir.ref<!fir.complex<10>> {{.}}, %[[CREF:.]]: !fir.ref<!fir.complex<10>> {{.}})
				! CHECK: %[[BVAL:.*]] = fir.load %[[BREF]] : !fir.ref<!fir.complex<10>>
				! CHECK: %[[CVAL:.*]] = fir.load %[[CREF]] : !fir.ref<!fir.complex<10>>
				! CHECK: %[[BREAL:.*]] = fir.extract_value %[[BVAL]], [0 : index] : (!fir.complex<10>) -> f80
				! CHECK: %[[BIMAG:.*]] = fir.extract_value %[[BVAL]], [1 : index] : (!fir.complex<10>) -> f80
				! CHECK: %[[CREAL:.*]] = fir.extract_value %[[CVAL]], [0 : index] : (!fir.complex<10>) -> f80
				! CHECK: %[[CIMAG:.*]] = fir.extract_value %[[CVAL]], [1 : index] : (!fir.complex<10>) -> f80
				! CHECK: %[[AVAL:.*]] = fir.call @__divxc3(%[[BREAL]], %[[BIMAG]], %[[CREAL]], %[[CIMAG]]) fastmath<contract> : (f80, f80, f80, f80) -> !fir.complex<10>
				! CHECK: fir.store %[[AVAL]] to %[[AREF]] : !fir.ref<!fir.complex<10>>
				subroutine div_test_extended(a,b,c)
				complex(kind=10) :: a, b, c
				a = b / c
				end subroutine div_test_extended

				! CHECK-LABEL: @_QPdiv_test_quad
				! CHECK-SAME: %[[AREF:.]]: !fir.ref<!fir.complex<16>> {{.}}, %[[BREF:.]]: !fir.ref<!fir.complex<16>> {{.}}, %[[CREF:.]]: !fir.ref<!fir.complex<16>> {{.}})
				! CHECK: %[[BVAL:.*]] = fir.load %[[BREF]] : !fir.ref<!fir.complex<16>>
				! CHECK: %[[CVAL:.*]] = fir.load %[[CREF]] : !fir.ref<!fir.complex<16>>
				! CHECK: %[[BREAL:.*]] = fir.extract_value %[[BVAL]], [0 : index] : (!fir.complex<16>) -> f128
				! CHECK: %[[BIMAG:.*]] = fir.extract_value %[[BVAL]], [1 : index] : (!fir.complex<16>) -> f128
				! CHECK: %[[CREAL:.*]] = fir.extract_value %[[CVAL]], [0 : index] : (!fir.complex<16>) -> f128
				! CHECK: %[[CIMAG:.*]] = fir.extract_value %[[CVAL]], [1 : index] : (!fir.complex<16>) -> f128
				! CHECK: %[[AVAL:.*]] = fir.call @__divtc3(%[[BREAL]], %[[BIMAG]], %[[CREAL]], %[[CIMAG]]) fastmath<contract> : (f128, f128, f128, f128) -> !fir.complex<16>
				! CHECK: fir.store %[[AVAL]] to %[[AREF]] : !fir.ref<!fir.complex<16>>
				subroutine div_test_quad(a,b,c)
				complex(kind=16) :: a, b, c
	a = b / c			a = b / c
	end subroutine div_test			end subroutine div_test_quad