This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
flang/
-
lib/Optimizer/Transforms/
-
Optimizer/
-
Transforms/
4/8
SimplifyIntrinsics.cpp
-
test/Transforms/
-
Transforms/
-
simplifyintrinsics.fir

Differential D133820

[flang] Support multidimensional reductions in SimplifyIntrinsicsPass.
ClosedPublic

Authored by vzakhari on Sep 13 2022, 5:26 PM.

Download Raw Diff

Details

Reviewers

Leporacanthicus
clementval

Commits

rG8bd76ac15153: [flang] Support multidimensional reductions in SimplifyIntrinsicsPass.

Summary

Create simplified functions for each rank with "x<rank>" suffix
that implement multidimensional reductions. To enable this I had to fix
an issue with taking incorrect box shape in cases of sliced embox/rebox.

Depends on D133818

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

vzakhari created this revision.Sep 13 2022, 5:26 PM

Herald added a project: Restricted Project. · View Herald TranscriptSep 13 2022, 5:26 PM

Herald added subscribers: mehdi_amini, jdoerfert. · View Herald Transcript

vzakhari requested review of this revision.Sep 13 2022, 5:26 PM

Harbormaster completed remote builds in B186494: Diff 459928.Sep 13 2022, 5:50 PM

Again, thanks for the work here. It looks roughly like I had imagined [although I was kind of wishfully thinking that maybe we could just treat ND as 1D, just larger - at least for things where "ordering isn't important", such as sum or maxval]

flang/lib/Optimizer/Transforms/SimplifyIntrinsics.cpp
136	Not that it matters a huge amount, but isn't Fortran limiting number of dimensions to 6? And thus asking for 16 is a bit excessive?
138	Nit: the "rank must be positive" is a bit misleading for an unsigned value. :)
177	Would it make more sense to reverse this before the first for-loop (line 161), and then reverse it again before the second loop - makes it easier to read the for-loop, if nothing else.
194	Do we not need to do some "combine" here? I will give it a try in a bit, to see if I can prove this hypothesis.

vzakhari added inline comments.Sep 14 2022, 8:33 AM

flang/lib/Optimizer/Transforms/SimplifyIntrinsics.cpp
136	Flang's limit is actually 15, so I will change it to 15.
138	"Positive" does not include zero, so the message should be correct :) I will change it to "rank cannot be zero".
177	Sorry, I am not getting it. It is empty before the first for-loop (line 161), so there is nothing to reverse.
194	The "combine" happens via the loop block arguments. Basically, we may think of an inner loop as a function that takes the reduction value that is an input for the outer loop, and the result of the "function" is passed back to the outer loop.

vzakhari updated this revision to Diff 460113.Sep 14 2022, 8:39 AM

In D133820#3789150, @Leporacanthicus wrote:

Again, thanks for the work here. It looks roughly like I had imagined [although I was kind of wishfully thinking that maybe we could just treat ND as 1D, just larger - at least for things where "ordering isn't important", such as sum or maxval]

If you mean that we could have used a single loop vs the loop nest, then I do not think it is easily possible. fir.coordinate_of provided with the multiple indices accounts for the dimension strides (which can be anything in case of slicing) - that we would have to do manually if using a single loop. Using the loop nest looks much cleaner to me and consistent with the rest of FIR generation.

In D133820#3789710, @vzakhari wrote:

In D133820#3789150, @Leporacanthicus wrote:

Again, thanks for the work here. It looks roughly like I had imagined [although I was kind of wishfully thinking that maybe we could just treat ND as 1D, just larger - at least for things where "ordering isn't important", such as sum or maxval]

If you mean that we could have used a single loop vs the loop nest, then I do not think it is easily possible. fir.coordinate_of provided with the multiple indices accounts for the dimension strides (which can be anything in case of slicing) - that we would have to do manually if using a single loop. Using the loop nest looks much cleaner to me and consistent with the rest of FIR generation.

Also importantly, multidimensional iteration spaces and access coordinates are consistent with other aspects of MLIR, so more generally amenable to reusing the standard MLIR optimization passes rather than reinventing them.

Fixed minor typo in a comment.

Harbormaster completed remote builds in B186653: Diff 460131.Sep 14 2022, 9:38 AM

LGTM

This revision is now accepted and ready to land.Sep 19 2022, 11:17 AM

Closed by commit rG8bd76ac15153: [flang] Support multidimensional reductions in SimplifyIntrinsicsPass. (authored by vzakhari). · Explain WhySep 19 2022, 12:17 PM

This revision was automatically updated to reflect the committed changes.

vzakhari added a commit: rG8bd76ac15153: [flang] Support multidimensional reductions in SimplifyIntrinsicsPass..

Revision Contents

Path

Size

flang/

lib/

Optimizer/

Transforms/

SimplifyIntrinsics.cpp

142 lines

test/

Transforms/

simplifyintrinsics.fir

195 lines

Diff 461301

flang/lib/Optimizer/Transforms/SimplifyIntrinsics.cpp

Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines

class SimplifyIntrinsicsPass		class SimplifyIntrinsicsPass
: public fir::impl::SimplifyIntrinsicsBase<SimplifyIntrinsicsPass> {		: public fir::impl::SimplifyIntrinsicsBase<SimplifyIntrinsicsPass> {
using FunctionTypeGeneratorTy =		using FunctionTypeGeneratorTy =
llvm::function_ref<mlir::FunctionType(fir::FirOpBuilder &)>;		llvm::function_ref<mlir::FunctionType(fir::FirOpBuilder &)>;
using FunctionBodyGeneratorTy =		using FunctionBodyGeneratorTy =
llvm::function_ref<void(fir::FirOpBuilder &, mlir::func::FuncOp &)>;		llvm::function_ref<void(fir::FirOpBuilder &, mlir::func::FuncOp &)>;
using GenReductionBodyTy = llvm::function_ref<void(		using GenReductionBodyTy = llvm::function_ref<void(
fir::FirOpBuilder &builder, mlir::func::FuncOp &funcOp)>;		fir::FirOpBuilder &builder, mlir::func::FuncOp &funcOp, unsigned rank)>;

public:		public:
/// Generate a new function implementing a simplified version		/// Generate a new function implementing a simplified version
/// of a Fortran runtime function defined by \p basename name.		/// of a Fortran runtime function defined by \p basename name.
/// \p typeGenerator is a callback that generates the new function's type.		/// \p typeGenerator is a callback that generates the new function's type.
/// \p bodyGenerator is a callback that generates the new function's body.		/// \p bodyGenerator is a callback that generates the new function's body.
/// The new function is created in the \p builder's Module.		/// The new function is created in the \p builder's Module.
mlir::func::FuncOp getOrCreateFunction(fir::FirOpBuilder &builder,		mlir::func::FuncOp getOrCreateFunction(fir::FirOpBuilder &builder,
Show All 32 Lines	using InitValGeneratorTy = llvm::function_ref<mlir::Value(
fir::FirOpBuilder &, mlir::Location, const mlir::Type &)>;		fir::FirOpBuilder &, mlir::Location, const mlir::Type &)>;

/// Generate the reduction loop into \p funcOp.		/// Generate the reduction loop into \p funcOp.
///		///
/// \p initVal is a function, called to get the initial value for		/// \p initVal is a function, called to get the initial value for
/// the reduction value		/// the reduction value
/// \p genBody is called to fill in the actual reduciton operation		/// \p genBody is called to fill in the actual reduciton operation
/// for example add for SUM, MAX for MAXVAL, etc.		/// for example add for SUM, MAX for MAXVAL, etc.
		/// \p rank is the rank of the input argument.
static void genReductionLoop(fir::FirOpBuilder &builder,		static void genReductionLoop(fir::FirOpBuilder &builder,
mlir::func::FuncOp &funcOp,		mlir::func::FuncOp &funcOp,
InitValGeneratorTy initVal,		InitValGeneratorTy initVal,
BodyOpGeneratorTy genBody) {		BodyOpGeneratorTy genBody, unsigned rank) {
auto loc = mlir::UnknownLoc::get(builder.getContext());		auto loc = mlir::UnknownLoc::get(builder.getContext());
mlir::Type elementType = funcOp.getResultTypes()[0];		mlir::Type elementType = funcOp.getResultTypes()[0];
builder.setInsertionPointToEnd(funcOp.addEntryBlock());		builder.setInsertionPointToEnd(funcOp.addEntryBlock());

mlir::IndexType idxTy = builder.getIndexType();		mlir::IndexType idxTy = builder.getIndexType();

mlir::Block::BlockArgListType args = funcOp.front().getArguments();		mlir::Block::BlockArgListType args = funcOp.front().getArguments();
mlir::Value arg = args[0];		mlir::Value arg = args[0];

mlir::Value zeroIdx = builder.createIntegerConstant(loc, idxTy, 0);		mlir::Value zeroIdx = builder.createIntegerConstant(loc, idxTy, 0);

fir::SequenceType::Shape flatShape = {fir::SequenceType::getUnknownExtent()};		fir::SequenceType::Shape flatShape(rank,
		fir::SequenceType::getUnknownExtent());
mlir::Type arrTy = fir::SequenceType::get(flatShape, elementType);		mlir::Type arrTy = fir::SequenceType::get(flatShape, elementType);
mlir::Type boxArrTy = fir::BoxType::get(arrTy);		mlir::Type boxArrTy = fir::BoxType::get(arrTy);
mlir::Value array = builder.create<fir::ConvertOp>(loc, boxArrTy, arg);		mlir::Value array = builder.create<fir::ConvertOp>(loc, boxArrTy, arg);
auto dims =		mlir::Value init = initVal(builder, loc, elementType);
builder.create<fir::BoxDimsOp>(loc, idxTy, idxTy, idxTy, array, zeroIdx);
mlir::Value len = dims.getResult(1);		llvm::SmallVector<mlir::Value, 15> bounds;
		LeporacanthicusUnsubmitted Not Done Reply Inline Actions Not that it matters a huge amount, but isn't Fortran limiting number of dimensions to 6? And thus asking for 16 is a bit excessive? Leporacanthicus: Not that it matters a huge amount, but isn't Fortran limiting number of dimensions to 6? And…
		vzakhariAuthorUnsubmitted Done Reply Inline Actions Flang's limit is actually 15, so I will change it to 15. vzakhari: Flang's limit is actually 15, so I will change it to 15.

		assert(rank > 0 && "rank cannot be zero");
		LeporacanthicusUnsubmitted Not Done Reply Inline Actions Nit: the "rank must be positive" is a bit misleading for an unsigned value. :) Leporacanthicus: Nit: the "rank must be positive" is a bit misleading for an unsigned value. :)
		vzakhariAuthorUnsubmitted Done Reply Inline Actions "Positive" does not include zero, so the message should be correct :) I will change it to "rank cannot be zero". vzakhari: "Positive" does not include zero, so the message should be correct :) I will change it to…
mlir::Value one = builder.createIntegerConstant(loc, idxTy, 1);		mlir::Value one = builder.createIntegerConstant(loc, idxTy, 1);
mlir::Value step = one;

		// Compute all the upper bounds before the loop nest.
		// It is not strictly necessary for performance, since the loop nest
		// does not have any store operations and any LICM optimization
		// should be able to optimize the redundancy.
		for (unsigned i = 0; i < rank; ++i) {
		mlir::Value dimIdx = builder.createIntegerConstant(loc, idxTy, i);
		auto dims =
		builder.create<fir::BoxDimsOp>(loc, idxTy, idxTy, idxTy, array, dimIdx);
		mlir::Value len = dims.getResult(1);
// We use C indexing here, so len-1 as loopcount		// We use C indexing here, so len-1 as loopcount
mlir::Value loopCount = builder.create<mlir::arith::SubIOp>(loc, len, one);		mlir::Value loopCount = builder.create<mlir::arith::SubIOp>(loc, len, one);
mlir::Value init = initVal(builder, loc, elementType);		bounds.push_back(loopCount);
		}

		// Create a loop nest consisting of DoLoopOp operations.
		// Collect the loops' induction variables into indices array,
		// which will be used in the innermost loop to load the input
		// array's element.
		// The loops are generated such that the innermost loop processes
		// the 0 dimension.
		llvm::SmallVector<mlir::Value, 15> indices;
		for (unsigned i = rank; 0 < i; --i) {
		mlir::Value step = one;
		mlir::Value loopCount = bounds[i - 1];
auto loop = builder.create<fir::DoLoopOp>(loc, zeroIdx, loopCount, step,		auto loop = builder.create<fir::DoLoopOp>(loc, zeroIdx, loopCount, step,
/unordered=/false,		/unordered=/false,
/finalCountValue=/false, init);		/finalCountValue=/false, init);
mlir::Value reductionVal = loop.getRegionIterArgs()[0];		init = loop.getRegionIterArgs()[0];
		indices.push_back(loop.getInductionVar());
// Begin loop code		// Set insertion point to the loop body so that the next loop
mlir::OpBuilder::InsertPoint loopEndPt = builder.saveInsertionPoint();		// is inserted inside the current one.
builder.setInsertionPointToStart(loop.getBody());		builder.setInsertionPointToStart(loop.getBody());
		}

		// Reverse the indices such that they are ordered as:
		// <dim-0-idx, dim-1-idx, ...>
		std::reverse(indices.begin(), indices.end());
		LeporacanthicusUnsubmitted Not Done Reply Inline Actions Would it make more sense to reverse this before the first for-loop (line 161), and then reverse it again before the second loop - makes it easier to read the for-loop, if nothing else. Leporacanthicus: Would it make more sense to reverse this before the first for-loop (line 161), and then reverse…
		vzakhariAuthorUnsubmitted Done Reply Inline Actions Sorry, I am not getting it. It is empty before the first for-loop (line 161), so there is nothing to reverse. vzakhari: Sorry, I am not getting it. It is empty before the first for-loop (line 161), so there is…

		// We are in the innermost loop: generate the reduction body.
mlir::Type eleRefTy = builder.getRefType(elementType);		mlir::Type eleRefTy = builder.getRefType(elementType);
mlir::Value index = loop.getInductionVar();
mlir::Value addr =		mlir::Value addr =
builder.create<fir::CoordinateOp>(loc, eleRefTy, array, index);		builder.create<fir::CoordinateOp>(loc, eleRefTy, array, indices);
mlir::Value elem = builder.create<fir::LoadOp>(loc, addr);		mlir::Value elem = builder.create<fir::LoadOp>(loc, addr);

reductionVal = genBody(builder, loc, elementType, elem, reductionVal);		mlir::Value reductionVal = genBody(builder, loc, elementType, elem, init);

builder.create<fir::ResultOp>(loc, reductionVal);
// End of loop.
builder.restoreInsertionPoint(loopEndPt);

mlir::Value resultVal = loop.getResult(0);		// Unwind the loop nest and insert ResultOp on each level
builder.create<mlir::func::ReturnOp>(loc, resultVal);		// to return the updated value of the reduction to the enclosing
		// loops.
		for (unsigned i = 0; i < rank; ++i) {
		auto result = builder.create<fir::ResultOp>(loc, reductionVal);
		// Proceed to the outer loop.
		auto loop = mlir::cast<fir::DoLoopOp>(result->getParentOp());
		reductionVal = loop.getResult(0);
		LeporacanthicusUnsubmitted Not Done Reply Inline Actions Do we not need to do some "combine" here? I will give it a try in a bit, to see if I can prove this hypothesis. Leporacanthicus: Do we not need to do some "combine" here? I will give it a try in a bit, to see if I can prove…
		vzakhariAuthorUnsubmitted Done Reply Inline Actions The "combine" happens via the loop block arguments. Basically, we may think of an inner loop as a function that takes the reduction value that is an input for the outer loop, and the result of the "function" is passed back to the outer loop. vzakhari: The "combine" happens via the loop block arguments. Basically, we may think of an inner loop…
		// Set insertion point after the loop operation that we have
		// just processed.
		builder.setInsertionPointAfter(loop.getOperation());
		}

		// End of loop nest. The insertion point is after the outermost loop.
		// Return the reduction value from the function.
		builder.create<mlir::func::ReturnOp>(loc, reductionVal);
}		}

/// Generate function body of the simplified version of RTNAME(Sum)		/// Generate function body of the simplified version of RTNAME(Sum)
/// with signature provided by \p funcOp. The caller is responsible		/// with signature provided by \p funcOp. The caller is responsible
/// for saving/restoring the original insertion point of \p builder.		/// for saving/restoring the original insertion point of \p builder.
/// \p funcOp is expected to be empty on entry to this function.		/// \p funcOp is expected to be empty on entry to this function.
		/// \p rank specifies the rank of the input argument.
static void genRuntimeSumBody(fir::FirOpBuilder &builder,		static void genRuntimeSumBody(fir::FirOpBuilder &builder,
mlir::func::FuncOp &funcOp) {		mlir::func::FuncOp &funcOp, unsigned rank) {
// function RTNAME(Sum)<T>_simplified(arr)		// function RTNAME(Sum)<T>x<rank>_simplified(arr)
// T, dimension(:) :: arr		// T, dimension(:) :: arr
// T sum = 0		// T sum = 0
// integer iter		// integer iter
// do iter = 0, extent(arr)		// do iter = 0, extent(arr)
// sum = sum + arr[iter]		// sum = sum + arr[iter]
// end do		// end do
// RTNAME(Sum)<T>_simplified = sum		// RTNAME(Sum)<T>x<rank>_simplified = sum
// end function RTNAME(Sum)<T>_simplified		// end function RTNAME(Sum)<T>x<rank>_simplified
auto zero = [](fir::FirOpBuilder builder, mlir::Location loc,		auto zero = [](fir::FirOpBuilder builder, mlir::Location loc,
mlir::Type elementType) {		mlir::Type elementType) {
if (auto ty = elementType.dyn_cast<mlir::FloatType>()) {		if (auto ty = elementType.dyn_cast<mlir::FloatType>()) {
const llvm::fltSemantics &sem = ty.getFloatSemantics();		const llvm::fltSemantics &sem = ty.getFloatSemantics();
return builder.createRealConstant(loc, elementType,		return builder.createRealConstant(loc, elementType,
llvm::APFloat::getZero(sem));		llvm::APFloat::getZero(sem));
}		}
return builder.createIntegerConstant(loc, elementType, 0);		return builder.createIntegerConstant(loc, elementType, 0);
};		};

auto genBodyOp = [](fir::FirOpBuilder builder, mlir::Location loc,		auto genBodyOp = [](fir::FirOpBuilder builder, mlir::Location loc,
mlir::Type elementType, mlir::Value elem1,		mlir::Type elementType, mlir::Value elem1,
mlir::Value elem2) -> mlir::Value {		mlir::Value elem2) -> mlir::Value {
if (elementType.isa<mlir::FloatType>())		if (elementType.isa<mlir::FloatType>())
return builder.create<mlir::arith::AddFOp>(loc, elem1, elem2);		return builder.create<mlir::arith::AddFOp>(loc, elem1, elem2);
if (elementType.isa<mlir::IntegerType>())		if (elementType.isa<mlir::IntegerType>())
return builder.create<mlir::arith::AddIOp>(loc, elem1, elem2);		return builder.create<mlir::arith::AddIOp>(loc, elem1, elem2);

llvm_unreachable("unsupported type");		llvm_unreachable("unsupported type");
return {};		return {};
};		};

genReductionLoop(builder, funcOp, zero, genBodyOp);		genReductionLoop(builder, funcOp, zero, genBodyOp, rank);
}		}

static void genRuntimeMaxvalBody(fir::FirOpBuilder &builder,		static void genRuntimeMaxvalBody(fir::FirOpBuilder &builder,
mlir::func::FuncOp &funcOp) {		mlir::func::FuncOp &funcOp, unsigned rank) {
auto init = [](fir::FirOpBuilder builder, mlir::Location loc,		auto init = [](fir::FirOpBuilder builder, mlir::Location loc,
mlir::Type elementType) {		mlir::Type elementType) {
if (auto ty = elementType.dyn_cast<mlir::FloatType>()) {		if (auto ty = elementType.dyn_cast<mlir::FloatType>()) {
const llvm::fltSemantics &sem = ty.getFloatSemantics();		const llvm::fltSemantics &sem = ty.getFloatSemantics();
return builder.createRealConstant(		return builder.createRealConstant(
loc, elementType, llvm::APFloat::getLargest(sem, /Negative=/true));		loc, elementType, llvm::APFloat::getLargest(sem, /Negative=/true));
}		}
unsigned bits = elementType.getIntOrFloatBitWidth();		unsigned bits = elementType.getIntOrFloatBitWidth();
int64_t minInt = llvm::APInt::getSignedMinValue(bits).getSExtValue();		int64_t minInt = llvm::APInt::getSignedMinValue(bits).getSExtValue();
return builder.createIntegerConstant(loc, elementType, minInt);		return builder.createIntegerConstant(loc, elementType, minInt);
};		};

auto genBodyOp = [](fir::FirOpBuilder builder, mlir::Location loc,		auto genBodyOp = [](fir::FirOpBuilder builder, mlir::Location loc,
mlir::Type elementType, mlir::Value elem1,		mlir::Type elementType, mlir::Value elem1,
mlir::Value elem2) -> mlir::Value {		mlir::Value elem2) -> mlir::Value {
if (elementType.isa<mlir::FloatType>())		if (elementType.isa<mlir::FloatType>())
return builder.create<mlir::arith::MaxFOp>(loc, elem1, elem2);		return builder.create<mlir::arith::MaxFOp>(loc, elem1, elem2);
if (elementType.isa<mlir::IntegerType>())		if (elementType.isa<mlir::IntegerType>())
return builder.create<mlir::arith::MaxSIOp>(loc, elem1, elem2);		return builder.create<mlir::arith::MaxSIOp>(loc, elem1, elem2);

llvm_unreachable("unsupported type");		llvm_unreachable("unsupported type");
return {};		return {};
};		};
genReductionLoop(builder, funcOp, init, genBodyOp);		genReductionLoop(builder, funcOp, init, genBodyOp, rank);
}		}

/// Generate function type for the simplified version of RTNAME(DotProduct)		/// Generate function type for the simplified version of RTNAME(DotProduct)
/// operating on the given \p elementType.		/// operating on the given \p elementType.
static mlir::FunctionType genRuntimeDotType(fir::FirOpBuilder &builder,		static mlir::FunctionType genRuntimeDotType(fir::FirOpBuilder &builder,
const mlir::Type &elementType) {		const mlir::Type &elementType) {
mlir::Type boxType = fir::BoxType::get(builder.getNoneType());		mlir::Type boxType = fir::BoxType::get(builder.getNoneType());
return mlir::FunctionType::get(builder.getContext(), {boxType, boxType},		return mlir::FunctionType::get(builder.getContext(), {boxType, boxType},
▲ Show 20 Lines • Show All 165 Lines • ▼ Show 20 Lines	static bool isZero(mlir::Value val) {
if (auto op = expectConvertOp(val)) {		if (auto op = expectConvertOp(val)) {
assert(op->getOperands().size() != 0);		assert(op->getOperands().size() != 0);
if (mlir::Operation *defOp = op->getOperand(0).getDefiningOp())		if (mlir::Operation *defOp = op->getOperand(0).getDefiningOp())
return mlir::matchPattern(defOp, mlir::m_Zero());		return mlir::matchPattern(defOp, mlir::m_Zero());
}		}
return false;		return false;
}		}

static mlir::Value findShape(mlir::Value val) {		static mlir::Value findBoxDef(mlir::Value val) {
if (auto op = expectConvertOp(val)) {		if (auto op = expectConvertOp(val)) {
assert(op->getOperands().size() != 0);		assert(op->getOperands().size() != 0);
if (auto box = mlir::dyn_cast_or_null<fir::EmboxOp>(		if (auto box = mlir::dyn_cast_or_null<fir::EmboxOp>(
op->getOperand(0).getDefiningOp()))		op->getOperand(0).getDefiningOp()))
return box.getShape();		return box.getResult();
		if (auto box = mlir::dyn_cast_or_null<fir::ReboxOp>(
		op->getOperand(0).getDefiningOp()))
		return box.getResult();
}		}
return {};		return {};
}		}

static unsigned getDimCount(mlir::Value val) {		static unsigned getDimCount(mlir::Value val) {
if (mlir::Value shapeVal = findShape(val)) {		// In order to find the dimensions count, we look for EmboxOp/ReboxOp
mlir::Type resType = shapeVal.getDefiningOp()->getResultTypes()[0];		// and take the count from its result type. Note that in case
return fir::getRankOfShapeType(resType);		// of sliced emboxing the operand and the result of EmboxOp/ReboxOp
}		// have different types.
		// Actually, we can take the box type from the operand of
		// the first ConvertOp that has non-opaque box type that we meet
		// going through the ConvertOp chain.
		if (mlir::Value emboxVal = findBoxDef(val))
		if (auto boxTy = emboxVal.getType().dyn_cast<fir::BoxType>())
		if (auto seqTy = boxTy.getEleTy().dyn_cast<fir::SequenceType>())
		return seqTy.getDimension();
return 0;		return 0;
}		}

/// Given the call operation's box argument \p val, discover		/// Given the call operation's box argument \p val, discover
/// the element type of the underlying array object.		/// the element type of the underlying array object.
/// \returns the element type or llvm::None if the type cannot		/// \returns the element type or llvm::None if the type cannot
/// be reliably found.		/// be reliably found.
/// We expect that the argument is a result of fir.convert		/// We expect that the argument is a result of fir.convert
Show All 14 Lines	if (!elementType.isa<mlir::NoneType>())
return elementType;		return elementType;
} while (true);		} while (true);
}		}

void SimplifyIntrinsicsPass::simplifyReduction(fir::CallOp call,		void SimplifyIntrinsicsPass::simplifyReduction(fir::CallOp call,
const fir::KindMapping &kindMap,		const fir::KindMapping &kindMap,
GenReductionBodyTy genBodyFunc) {		GenReductionBodyTy genBodyFunc) {
mlir::SymbolRefAttr callee = call.getCalleeAttr();		mlir::SymbolRefAttr callee = call.getCalleeAttr();
mlir::StringRef funcName = callee.getLeafReference().getValue();
mlir::Operation::operand_range args = call.getArgs();		mlir::Operation::operand_range args = call.getArgs();
// args[1] and args[2] are source filename and line number, ignored.		// args[1] and args[2] are source filename and line number, ignored.
const mlir::Value &dim = args[3];		const mlir::Value &dim = args[3];
const mlir::Value &mask = args[4];		const mlir::Value &mask = args[4];
// dim is zero when it is absent, which is an implementation		// dim is zero when it is absent, which is an implementation
// detail in the runtime library.		// detail in the runtime library.
bool dimAndMaskAbsent = isZero(dim) && isOperandAbsent(mask);		bool dimAndMaskAbsent = isZero(dim) && isOperandAbsent(mask);
unsigned rank = getDimCount(args[0]);		unsigned rank = getDimCount(args[0]);
if (dimAndMaskAbsent && rank == 1) {		if (dimAndMaskAbsent && rank > 0) {
mlir::Location loc = call.getLoc();		mlir::Location loc = call.getLoc();
fir::FirOpBuilder builder(call, kindMap);		fir::FirOpBuilder builder(call, kindMap);

// Support only floating point and integer results now.		// Support only floating point and integer results now.
mlir::Type resultType = call.getResult(0).getType();		mlir::Type resultType = call.getResult(0).getType();
if (!resultType.isa<mlir::FloatType>() &&		if (!resultType.isa<mlir::FloatType>() &&
!resultType.isa<mlir::IntegerType>())		!resultType.isa<mlir::IntegerType>())
return;		return;

auto argType = getArgElementType(args[0]);		auto argType = getArgElementType(args[0]);
if (!argType)		if (!argType)
return;		return;
assert(*argType == resultType &&		assert(*argType == resultType &&
"Argument/result types mismatch in reduction");		"Argument/result types mismatch in reduction");

auto typeGenerator = [&resultType](fir::FirOpBuilder &builder) {		auto typeGenerator = [&resultType](fir::FirOpBuilder &builder) {
return genNoneBoxType(builder, resultType);		return genNoneBoxType(builder, resultType);
};		};
		auto bodyGenerator = [&rank, &genBodyFunc](fir::FirOpBuilder &builder,
		mlir::func::FuncOp &funcOp) {
		genBodyFunc(builder, funcOp, rank);
		};
		// Mangle the function name with the rank value as "x<rank>".
		std::string funcName =
		(mlir::Twine{callee.getLeafReference().getValue(), "x"} +
		mlir::Twine{rank})
		.str();
mlir::func::FuncOp newFunc =		mlir::func::FuncOp newFunc =
getOrCreateFunction(builder, funcName, typeGenerator, genBodyFunc);		getOrCreateFunction(builder, funcName, typeGenerator, bodyGenerator);
auto newCall =		auto newCall =
builder.create<fir::CallOp>(loc, newFunc, mlir::ValueRange{args[0]});		builder.create<fir::CallOp>(loc, newFunc, mlir::ValueRange{args[0]});
call->replaceAllUsesWith(newCall.getResults());		call->replaceAllUsesWith(newCall.getResults());
call->dropAllReferences();		call->dropAllReferences();
call->erase();		call->erase();
}		}
}		}

▲ Show 20 Lines • Show All 99 Lines • Show Last 20 Lines

flang/test/Transforms/simplifyintrinsics.fir

Show All 28 Lines


// CHECK-LABEL: func.func @sum_1d_array_int(		// CHECK-LABEL: func.func @sum_1d_array_int(
// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xi32>> {fir.bindc_name = "a"}) -> i32 {		// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xi32>> {fir.bindc_name = "a"}) -> i32 {
// CHECK: %[[SHAPE:.]] = fir.shape %{{.}} : (index) -> !fir.shape<1>		// CHECK: %[[SHAPE:.]] = fir.shape %{{.}} : (index) -> !fir.shape<1>
// CHECK: %[[A_BOX_I32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xi32>>, !fir.shape<1>) -> !fir.box<!fir.array<10xi32>>		// CHECK: %[[A_BOX_I32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xi32>>, !fir.shape<1>) -> !fir.box<!fir.array<10xi32>>
// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<10xi32>>) -> !fir.box<none>		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<10xi32>>) -> !fir.box<none>
// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
// CHECK: %[[RES:.*]] = fir.call @_FortranASumInteger4_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32		// CHECK: %[[RES:.*]] = fir.call @_FortranASumInteger4x1_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32
// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
// CHECK: return %{{.*}} : i32		// CHECK: return %{{.*}} : i32
// CHECK: }		// CHECK: }
// CHECK: func.func private @_FortranASumInteger4(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i32 attributes {fir.runtime}		// CHECK: func.func private @_FortranASumInteger4(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i32 attributes {fir.runtime}

// CHECK-LABEL: func.func private @_FortranASumInteger4_simplified(		// CHECK-LABEL: func.func private @_FortranASumInteger4x1_simplified(
// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {		// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index		// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index
// CHECK: %[[ARR_BOX_I32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xi32>>		// CHECK: %[[ARR_BOX_I32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xi32>>
// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_I32]], %[[CINDEX_0]] : (!fir.box<!fir.array<?xi32>>, index) -> (index, index, index)		// CHECK: %[[CI32_0:.*]] = arith.constant 0 : i32
// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMIDX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_I32]], %[[DIMIDX_0]] : (!fir.box<!fir.array<?xi32>>, index) -> (index, index, index)
// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index		// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index
// CHECK: %[[CI32_0:.*]] = arith.constant 0 : i32
// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[SUM:.*]] = %[[CI32_0]]) -> (i32) {		// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[SUM:.*]] = %[[CI32_0]]) -> (i32) {
// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER]] : (!fir.box<!fir.array<?xi32>>, index) -> !fir.ref<i32>		// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER]] : (!fir.box<!fir.array<?xi32>>, index) -> !fir.ref<i32>
// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<i32>		// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<i32>
// CHECK: %[[NEW_SUM:.*]] = arith.addi %[[ITEM_VAL]], %[[SUM]] : i32		// CHECK: %[[NEW_SUM:.*]] = arith.addi %[[ITEM_VAL]], %[[SUM]] : i32
// CHECK: fir.result %[[NEW_SUM]] : i32		// CHECK: fir.result %[[NEW_SUM]] : i32
// CHECK: }		// CHECK: }
// CHECK: return %[[RES]] : i32		// CHECK: return %[[RES]] : i32
// CHECK: }		// CHECK: }

// -----		// -----

// Call to SUM with 2D I32 arrays is not replaced.		// Call to SUM with 2D I32 arrays is replaced.
module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {		module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
func.func @sum_2d_array_int(%arg0: !fir.ref<!fir.array<10x10xi32>> {fir.bindc_name = "a"}) -> i32 {		func.func @sum_2d_array_int(%arg0: !fir.ref<!fir.array<10x10xi32>> {fir.bindc_name = "a"}) -> i32 {
%c10 = arith.constant 10 : index		%c10 = arith.constant 10 : index
%c10_0 = arith.constant 10 : index		%c10_0 = arith.constant 10 : index
%0 = fir.alloca i32 {bindc_name = "test_sum_3", uniq_name = "_QFtest_sum_3Etest_sum_3"}		%0 = fir.alloca i32 {bindc_name = "test_sum_3", uniq_name = "_QFtest_sum_3Etest_sum_3"}
%1 = fir.shape %c10, %c10_0 : (index, index) -> !fir.shape<2>		%1 = fir.shape %c10, %c10_0 : (index, index) -> !fir.shape<2>
%2 = fir.embox %arg0(%1) : (!fir.ref<!fir.array<10x10xi32>>, !fir.shape<2>) -> !fir.box<!fir.array<10x10xi32>>		%2 = fir.embox %arg0(%1) : (!fir.ref<!fir.array<10x10xi32>>, !fir.shape<2>) -> !fir.box<!fir.array<10x10xi32>>
%3 = fir.absent !fir.box<i1>		%3 = fir.absent !fir.box<i1>
Show All 12 Lines	module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
func.func private @_FortranASumInteger4(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i32 attributes {fir.runtime}		func.func private @_FortranASumInteger4(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i32 attributes {fir.runtime}
fir.global linkonce @_QQcl.2E2F6973756D5F332E66393000 constant : !fir.char<1,13> {		fir.global linkonce @_QQcl.2E2F6973756D5F332E66393000 constant : !fir.char<1,13> {
%0 = fir.string_lit "./isum_3.f90\00"(13) : !fir.char<1,13>		%0 = fir.string_lit "./isum_3.f90\00"(13) : !fir.char<1,13>
fir.has_value %0 : !fir.char<1,13>		fir.has_value %0 : !fir.char<1,13>
}		}
}		}

// CHECK-LABEL: func.func @sum_2d_array_int({{.*}} !fir.ref<!fir.array<10x10xi32>> {fir.bindc_name = "a"}) -> i32 {		// CHECK-LABEL: func.func @sum_2d_array_int({{.*}} !fir.ref<!fir.array<10x10xi32>> {fir.bindc_name = "a"}) -> i32 {
// CHECK-NOT: fir.call @_FortranASumInteger4_simplified({{.*}})		// CHECK: %[[SHAPE:.]] = fir.shape %{{.}} : (index, index) -> !fir.shape<2>
// CHECK: fir.call @_FortranASumInteger4({{.*}}) : (!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i32		// CHECK: %[[A_BOX_I32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10x10xi32>>, !fir.shape<2>) -> !fir.box<!fir.array<10x10xi32>>
// CHECK-NOT: fir.call @_FortranASumInteger4_simplified({{.*}})		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<10x10xi32>>) -> !fir.box<none>
		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
		// CHECK: %[[RES:.*]] = fir.call @_FortranASumInteger4x2_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32
		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
		// CHECK: return %{{.*}} : i32
		// CHECK: }
		// CHECK: func.func private @_FortranASumInteger4(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i32 attributes {fir.runtime}

		// CHECK-LABEL: func.func private @_FortranASumInteger4x2_simplified(
		// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
		// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[ARR_BOX_I32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?x?xi32>>
		// CHECK: %[[CI32_0:.*]] = arith.constant 0 : i32
		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMIDX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[DIMS_0:.*]]:3 = fir.box_dims %[[ARR_BOX_I32]], %[[DIMIDX_0]] : (!fir.box<!fir.array<?x?xi32>>, index) -> (index, index, index)
		// CHECK: %[[EXTENT_0:.*]] = arith.subi %[[DIMS_0]]#1, %[[CINDEX_1]] : index
		// CHECK: %[[DIMIDX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMS_1:.*]]:3 = fir.box_dims %[[ARR_BOX_I32]], %[[DIMIDX_1]] : (!fir.box<!fir.array<?x?xi32>>, index) -> (index, index, index)
		// CHECK: %[[EXTENT_1:.*]] = arith.subi %[[DIMS_1]]#1, %[[CINDEX_1]] : index
		// CHECK: %[[RES_1:.]] = fir.do_loop %[[ITER_1:.]] = %[[CINDEX_0]] to %[[EXTENT_1]] step %[[CINDEX_1]] iter_args(%[[SUM_1:.*]] = %[[CI32_0]]) -> (i32) {
		// CHECK: %[[RES_0:.]] = fir.do_loop %[[ITER_0:.]] = %[[CINDEX_0]] to %[[EXTENT_0]] step %[[CINDEX_1]] iter_args(%[[SUM_0:.*]] = %[[SUM_1]]) -> (i32) {
		// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER_0]], %[[ITER_1]] : (!fir.box<!fir.array<?x?xi32>>, index, index) -> !fir.ref<i32>
		// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<i32>
		// CHECK: %[[NEW_SUM:.*]] = arith.addi %[[ITEM_VAL]], %[[SUM_0]] : i32
		// CHECK: fir.result %[[NEW_SUM]] : i32
		// CHECK: }
		// CHECK: fir.result %[[RES_0]]
		// CHECK: }
		// CHECK: return %[[RES_1]] : i32
		// CHECK: }

// -----		// -----

// Call to SUM with 1D F64 is replaced.		// Call to SUM with 1D F64 is replaced.
module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {		module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
func.func @sum_1d_real(%arg0: !fir.ref<!fir.array<10xf64>> {fir.bindc_name = "a"}) -> f64 {		func.func @sum_1d_real(%arg0: !fir.ref<!fir.array<10xf64>> {fir.bindc_name = "a"}) -> f64 {
%c10 = arith.constant 10 : index		%c10 = arith.constant 10 : index
%0 = fir.alloca f64 {bindc_name = "sum_1d_real", uniq_name = "_QFsum_1d_realEsum_1d_real"}		%0 = fir.alloca f64 {bindc_name = "sum_1d_real", uniq_name = "_QFsum_1d_realEsum_1d_real"}
Show All 22 Lines

// CHECK-LABEL: func.func @sum_1d_real(		// CHECK-LABEL: func.func @sum_1d_real(
// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xf64>> {fir.bindc_name = "a"}) -> f64 {		// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xf64>> {fir.bindc_name = "a"}) -> f64 {
// CHECK: %[[CINDEX_10:.*]] = arith.constant 10 : index		// CHECK: %[[CINDEX_10:.*]] = arith.constant 10 : index
// CHECK: %[[SHAPE:.*]] = fir.shape %[[CINDEX_10]] : (index) -> !fir.shape<1>		// CHECK: %[[SHAPE:.*]] = fir.shape %[[CINDEX_10]] : (index) -> !fir.shape<1>
// CHECK: %[[A_BOX_F64:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xf64>>, !fir.shape<1>) -> !fir.box<!fir.array<10xf64>>		// CHECK: %[[A_BOX_F64:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xf64>>, !fir.shape<1>) -> !fir.box<!fir.array<10xf64>>
// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_F64]] : (!fir.box<!fir.array<10xf64>>) -> !fir.box<none>		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_F64]] : (!fir.box<!fir.array<10xf64>>) -> !fir.box<none>
// CHECK-NOT: fir.call @_FortranASumReal8({{.*}})		// CHECK-NOT: fir.call @_FortranASumReal8({{.*}})
// CHECK: %[[RES:.*]] = fir.call @_FortranASumReal8_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> f64		// CHECK: %[[RES:.*]] = fir.call @_FortranASumReal8x1_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> f64
// CHECK-NOT: fir.call @_FortranASumReal8({{.*}})		// CHECK-NOT: fir.call @_FortranASumReal8({{.*}})
// CHECK: return %{{.*}} : f64		// CHECK: return %{{.*}} : f64
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func private @_FortranASumReal8_simplified(		// CHECK-LABEL: func.func private @_FortranASumReal8x1_simplified(
// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> f64 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {		// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> f64 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index		// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index
// CHECK: %[[ARR_BOX_F64:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xf64>>		// CHECK: %[[ARR_BOX_F64:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xf64>>
// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_F64]], %[[CINDEX_0]] : (!fir.box<!fir.array<?xf64>>, index) -> (index, index, index)		// CHECK: %[[ZERO:.*]] = arith.constant 0.000000e+00 : f64
// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMIDX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_F64]], %[[DIMIDX_0]] : (!fir.box<!fir.array<?xf64>>, index) -> (index, index, index)
// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index		// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index
// CHECK: %[[ZERO:.*]] = arith.constant 0.000000e+00 : f64
// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[SUM]] = %[[ZERO]]) -> (f64) {		// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[SUM]] = %[[ZERO]]) -> (f64) {
// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_F64]], %[[ITER]] : (!fir.box<!fir.array<?xf64>>, index) -> !fir.ref<f64>		// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_F64]], %[[ITER]] : (!fir.box<!fir.array<?xf64>>, index) -> !fir.ref<f64>
// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<f64>		// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<f64>
// CHECK: %[[NEW_SUM:.*]] = arith.addf %[[ITEM_VAL]], %[[SUM]] : f64		// CHECK: %[[NEW_SUM:.*]] = arith.addf %[[ITEM_VAL]], %[[SUM]] : f64
// CHECK: fir.result %[[NEW_SUM]] : f64		// CHECK: fir.result %[[NEW_SUM]] : f64
// CHECK: }		// CHECK: }
// CHECK: return %[[RES]] : f64		// CHECK: return %[[RES]] : f64
// CHECK: }		// CHECK: }
Show All 30 Lines

// CHECK-LABEL: func.func @sum_1d_real(		// CHECK-LABEL: func.func @sum_1d_real(
// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xf32>> {fir.bindc_name = "a"}) -> f32 {		// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xf32>> {fir.bindc_name = "a"}) -> f32 {
// CHECK: %[[CINDEX_10:.*]] = arith.constant 10 : index		// CHECK: %[[CINDEX_10:.*]] = arith.constant 10 : index
// CHECK: %[[SHAPE:.*]] = fir.shape %[[CINDEX_10]] : (index) -> !fir.shape<1>		// CHECK: %[[SHAPE:.*]] = fir.shape %[[CINDEX_10]] : (index) -> !fir.shape<1>
// CHECK: %[[A_BOX_F32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xf32>>, !fir.shape<1>) -> !fir.box<!fir.array<10xf32>>		// CHECK: %[[A_BOX_F32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xf32>>, !fir.shape<1>) -> !fir.box<!fir.array<10xf32>>
// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_F32]] : (!fir.box<!fir.array<10xf32>>) -> !fir.box<none>		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_F32]] : (!fir.box<!fir.array<10xf32>>) -> !fir.box<none>
// CHECK-NOT: fir.call @_FortranASumReal4({{.*}})		// CHECK-NOT: fir.call @_FortranASumReal4({{.*}})
// CHECK: %[[RES:.*]] = fir.call @_FortranASumReal4_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> f32		// CHECK: %[[RES:.*]] = fir.call @_FortranASumReal4x1_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> f32
// CHECK-NOT: fir.call @_FortranASumReal4({{.*}})		// CHECK-NOT: fir.call @_FortranASumReal4({{.*}})
// CHECK: return %{{.*}} : f32		// CHECK: return %{{.*}} : f32
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func private @_FortranASumReal4_simplified(		// CHECK-LABEL: func.func private @_FortranASumReal4x1_simplified(
// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> f32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {		// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> f32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index		// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index
// CHECK: %[[ARR_BOX_F32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xf32>>		// CHECK: %[[ARR_BOX_F32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xf32>>
// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_F32]], %[[CINDEX_0]] : (!fir.box<!fir.array<?xf32>>, index) -> (index, index, index)		// CHECK: %[[ZERO:.*]] = arith.constant 0.000000e+00 : f32
// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMIDX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_F32]], %[[DIMIDX_0]] : (!fir.box<!fir.array<?xf32>>, index) -> (index, index, index)
// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index		// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index
// CHECK: %[[ZERO:.*]] = arith.constant 0.000000e+00 : f32
// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[SUM]] = %[[ZERO]]) -> (f32) {		// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[SUM]] = %[[ZERO]]) -> (f32) {
// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_F32]], %[[ITER]] : (!fir.box<!fir.array<?xf32>>, index) -> !fir.ref<f32>		// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_F32]], %[[ITER]] : (!fir.box<!fir.array<?xf32>>, index) -> !fir.ref<f32>
// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<f32>		// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<f32>
// CHECK: %[[NEW_SUM:.*]] = arith.addf %[[ITEM_VAL]], %[[SUM]] : f32		// CHECK: %[[NEW_SUM:.*]] = arith.addf %[[ITEM_VAL]], %[[SUM]] : f32
// CHECK: fir.result %[[NEW_SUM]] : f32		// CHECK: fir.result %[[NEW_SUM]] : f32
// CHECK: }		// CHECK: }
// CHECK: return %[[RES]] : f32		// CHECK: return %[[RES]] : f32
// CHECK: }		// CHECK: }
Show All 26 Lines	module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
func.func private @_FortranACppSumComplex4(!fir.ref<complex<f32>>, !fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> none attributes {fir.runtime}		func.func private @_FortranACppSumComplex4(!fir.ref<complex<f32>>, !fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> none attributes {fir.runtime}
fir.global linkonce @_QQcl.2E2F6973756D5F362E66393000 constant : !fir.char<1,13> {		fir.global linkonce @_QQcl.2E2F6973756D5F362E66393000 constant : !fir.char<1,13> {
%0 = fir.string_lit "./isum_6.f90\00"(13) : !fir.char<1,13>		%0 = fir.string_lit "./isum_6.f90\00"(13) : !fir.char<1,13>
fir.has_value %0 : !fir.char<1,13>		fir.has_value %0 : !fir.char<1,13>
}		}
}		}

// CHECK-LABEL: func.func @sum_1d_complex(%{{.*}}: !fir.ref<!fir.array<10x!fir.complex<4>>> {fir.bindc_name = "a"}) -> !fir.complex<4> {		// CHECK-LABEL: func.func @sum_1d_complex(%{{.*}}: !fir.ref<!fir.array<10x!fir.complex<4>>> {fir.bindc_name = "a"}) -> !fir.complex<4> {
// CHECK-NOT: fir.call @_FortranACppSumComplex4_simplified({{.*}})		// CHECK-NOT: fir.call @_FortranACppSumComplex4x1_simplified({{.*}})
// CHECK: fir.call @_FortranACppSumComplex4({{.*}}) : (!fir.ref<complex<f32>>, !fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> none		// CHECK: fir.call @_FortranACppSumComplex4({{.*}}) : (!fir.ref<complex<f32>>, !fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> none
// CHECK-NOT: fir.call @_FortranACppSumComplex4_simplified({{.*}})		// CHECK-NOT: fir.call @_FortranACppSumComplex4x1_simplified({{.*}})

// -----		// -----

// Test that two functions calling the same SUM function		// Test that two functions calling the same SUM function
// generates only ONE function declaration (and that both		// generates only ONE function declaration (and that both
// calls are converted)		// calls are converted)
module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {		module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
func.func @sum_1d_calla(%arg0: !fir.ref<!fir.array<10xi32>> {fir.bindc_name = "a"}) -> i32 {		func.func @sum_1d_calla(%arg0: !fir.ref<!fir.array<10xi32>> {fir.bindc_name = "a"}) -> i32 {
Show All 36 Lines	module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
fir.global linkonce @_QQcl.2E2F6973756D5F372E66393000 constant : !fir.char<1,13> {		fir.global linkonce @_QQcl.2E2F6973756D5F372E66393000 constant : !fir.char<1,13> {
%0 = fir.string_lit "./isum_7.f90\00"(13) : !fir.char<1,13>		%0 = fir.string_lit "./isum_7.f90\00"(13) : !fir.char<1,13>
fir.has_value %0 : !fir.char<1,13>		fir.has_value %0 : !fir.char<1,13>
}		}
}		}

// CHECK-LABEL: func.func @sum_1d_calla(%{{.*}}) -> i32 {		// CHECK-LABEL: func.func @sum_1d_calla(%{{.*}}) -> i32 {
// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
// CHECK: fir.call @_FortranASumInteger4_simplified(%{{.*}})		// CHECK: fir.call @_FortranASumInteger4x1_simplified(%{{.*}})
// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func @sum_1d_callb(%{{.*}}) -> i32 {		// CHECK-LABEL: func.func @sum_1d_callb(%{{.*}}) -> i32 {
// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
// CHECK: fir.call @_FortranASumInteger4_simplified(%{{.*}})		// CHECK: fir.call @_FortranASumInteger4x1_simplified(%{{.*}})
// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})		// CHECK-NOT: fir.call @_FortranASumInteger4({{.*}})
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func private @_FortranASumInteger4_simplified({{.}}) -> i32 {{.}} {		// CHECK-LABEL: func.func private @_FortranASumInteger4x1_simplified({{.}}) -> i32 {{.}} {
// CHECK: return %{{.*}} : i32		// CHECK: return %{{.*}} : i32
// CHECK: }		// CHECK: }
// CHECK-NOT: func.func private @_FortranASumInteger4_simplified({{.*}})		// CHECK-NOT: func.func private @_FortranASumInteger4x1_simplified({{.*}})

// -----		// -----

module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {		module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
func.func @sum_1d_stride(%arg0: !fir.ref<!fir.array<20xi32>> {fir.bindc_name = "a"}) -> i32 {		func.func @sum_1d_stride(%arg0: !fir.ref<!fir.array<20xi32>> {fir.bindc_name = "a"}) -> i32 {
%c20 = arith.constant 20 : index		%c20 = arith.constant 20 : index
%0 = fir.alloca i32 {bindc_name = "sum_1d_stride", uniq_name = "_QFsum_1d_strideEsum_1d_stride"}		%0 = fir.alloca i32 {bindc_name = "sum_1d_stride", uniq_name = "_QFsum_1d_strideEsum_1d_stride"}
%c1 = arith.constant 1 : index		%c1 = arith.constant 1 : index
Show All 26 Lines

// CHECK-LABEL: func.func @sum_1d_stride(%{{.*}} -> i32 {		// CHECK-LABEL: func.func @sum_1d_stride(%{{.*}} -> i32 {
// CHECK: %[[CI64_2:.*]] = arith.constant 2 : i64		// CHECK: %[[CI64_2:.*]] = arith.constant 2 : i64
// CHECK: %[[CINDEX_2:.*]] = fir.convert %[[CI64_2]] : (i64) -> index		// CHECK: %[[CINDEX_2:.*]] = fir.convert %[[CI64_2]] : (i64) -> index
// CHECK: %[[SHAPE:.]] = fir.shape %{{.}}		// CHECK: %[[SHAPE:.]] = fir.shape %{{.}}
// CHECK: %[[SLICE:.]] = fir.slice %{{.}}, %{{.*}}, %[[CINDEX_2]] : (index, index, index) -> !fir.slice<1>		// CHECK: %[[SLICE:.]] = fir.slice %{{.}}, %{{.*}}, %[[CINDEX_2]] : (index, index, index) -> !fir.slice<1>
// CHECK: %[[A_BOX_I32:.]] = fir.embox %{{.}}(%[[SHAPE]]) {{\[}}%[[SLICE]]] : (!fir.ref<!fir.array<20xi32>>, !fir.shape<1>, !fir.slice<1>) -> !fir.box<!fir.array<?xi32>>		// CHECK: %[[A_BOX_I32:.]] = fir.embox %{{.}}(%[[SHAPE]]) {{\[}}%[[SLICE]]] : (!fir.ref<!fir.array<20xi32>>, !fir.shape<1>, !fir.slice<1>) -> !fir.box<!fir.array<?xi32>>
// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<?xi32>>) -> !fir.box<none>		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<?xi32>>) -> !fir.box<none>
// CHECK: %{{.*}} = fir.call @_FortranASumInteger4_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32		// CHECK: %{{.*}} = fir.call @_FortranASumInteger4x1_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32
// CHECK: return %{{.*}} : i32		// CHECK: return %{{.*}} : i32
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func private @_FortranASumInteger4_simplified(%{{.*}}) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {		// CHECK-LABEL: func.func private @_FortranASumInteger4x1_simplified(%{{.*}}) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
// CHECK: %[[ARR_BOX_I32:.]] = fir.convert %{{.}} : (!fir.box<none>) -> !fir.box<!fir.array<?xi32>>		// CHECK: %[[ARR_BOX_I32:.]] = fir.convert %{{.}} : (!fir.box<none>) -> !fir.box<!fir.array<?xi32>>
// CHECK: %[[DIMS:.]]:3 = fir.box_dims %[[ARR_BOX_I32]], %{{.}} : (!fir.box<!fir.array<?xi32>>, index) -> (index, index, index)
// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMS:.]]:3 = fir.box_dims %[[ARR_BOX_I32]], %{{.}} : (!fir.box<!fir.array<?xi32>>, index) -> (index, index, index)
// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index		// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index
// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %{{.}} to %[[EXTENT]] step %[[CINDEX_1]] iter_args({{.}}) -> (i32) {		// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %{{.}} to %[[EXTENT]] step %[[CINDEX_1]] iter_args({{.}}) -> (i32) {
// CHECK: %{{.*}} = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER]] : (!fir.box<!fir.array<?xi32>>, index) -> !fir.ref<i32>		// CHECK: %{{.*}} = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER]] : (!fir.box<!fir.array<?xi32>>, index) -> !fir.ref<i32>
// CHECK: }		// CHECK: }
// CHECK: return %[[RES]] : i32		// CHECK: return %[[RES]] : i32
// CHECK: }		// CHECK: }

// -----		// -----
▲ Show 20 Lines • Show All 414 Lines • ▼ Show 20 Lines	module attributes {fir.defaultkind = "a1c4d8i4l4r4", fir.kindmap = "", llvm.target_triple = "native"} {
}		}
}		}

// CHECK-LABEL: func.func @maxval_1d_array_int(		// CHECK-LABEL: func.func @maxval_1d_array_int(
// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xi32>> {fir.bindc_name = "a"}) -> i32 {		// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xi32>> {fir.bindc_name = "a"}) -> i32 {
// CHECK: %[[SHAPE:.]] = fir.shape %{{.}} : (index) -> !fir.shape<1>		// CHECK: %[[SHAPE:.]] = fir.shape %{{.}} : (index) -> !fir.shape<1>
// CHECK: %[[A_BOX_I32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xi32>>, !fir.shape<1>) -> !fir.box<!fir.array<10xi32>>		// CHECK: %[[A_BOX_I32:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xi32>>, !fir.shape<1>) -> !fir.box<!fir.array<10xi32>>
// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<10xi32>>) -> !fir.box<none>		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_I32]] : (!fir.box<!fir.array<10xi32>>) -> !fir.box<none>
// CHECK: %[[RES:.*]] = fir.call @_FortranAMaxvalInteger4_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32		// CHECK: %[[RES:.*]] = fir.call @_FortranAMaxvalInteger4x1_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> i32
// CHECK: return %{{.*}} : i32		// CHECK: return %{{.*}} : i32
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func private @_FortranAMaxvalInteger4_simplified(		// CHECK-LABEL: func.func private @_FortranAMaxvalInteger4x1_simplified(
// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {		// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> i32 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index		// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index
// CHECK: %[[ARR_BOX_I32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xi32>>		// CHECK: %[[ARR_BOX_I32:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xi32>>
// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_I32]], %[[CINDEX_0]] : (!fir.box<!fir.array<?xi32>>, index) -> (index, index, index)		// CHECK: %[[CI32_MININT:.*]] = arith.constant -2147483648 : i32
// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMIDX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_I32]], %[[DIMIDX_0]] : (!fir.box<!fir.array<?xi32>>, index) -> (index, index, index)
// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index		// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index
// CHECK: %[[CI32_MININT:.*]] = arith.constant -2147483648 : i32
// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[MAX:.*]] = %[[CI32_MININT]]) -> (i32) {		// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[MAX:.*]] = %[[CI32_MININT]]) -> (i32) {
// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER]] : (!fir.box<!fir.array<?xi32>>, index) -> !fir.ref<i32>		// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_I32]], %[[ITER]] : (!fir.box<!fir.array<?xi32>>, index) -> !fir.ref<i32>
// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<i32>		// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<i32>
// CHECK: %[[NEW_MAX:.*]] = arith.maxsi %[[ITEM_VAL]], %[[MAX]] : i32		// CHECK: %[[NEW_MAX:.*]] = arith.maxsi %[[ITEM_VAL]], %[[MAX]] : i32
// CHECK: fir.result %[[NEW_MAX]] : i32		// CHECK: fir.result %[[NEW_MAX]] : i32
// CHECK: }		// CHECK: }
// CHECK: return %[[RES]] : i32		// CHECK: return %[[RES]] : i32
// CHECK: }		// CHECK: }
Show All 29 Lines


// CHECK-LABEL: func.func @maxval_1d_real(		// CHECK-LABEL: func.func @maxval_1d_real(
// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xf64>> {fir.bindc_name = "a"}) -> f64 {		// CHECK-SAME: %[[A:.*]]: !fir.ref<!fir.array<10xf64>> {fir.bindc_name = "a"}) -> f64 {
// CHECK: %[[CINDEX_10:.*]] = arith.constant 10 : index		// CHECK: %[[CINDEX_10:.*]] = arith.constant 10 : index
// CHECK: %[[SHAPE:.*]] = fir.shape %[[CINDEX_10]] : (index) -> !fir.shape<1>		// CHECK: %[[SHAPE:.*]] = fir.shape %[[CINDEX_10]] : (index) -> !fir.shape<1>
// CHECK: %[[A_BOX_F64:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xf64>>, !fir.shape<1>) -> !fir.box<!fir.array<10xf64>>		// CHECK: %[[A_BOX_F64:.*]] = fir.embox %[[A]](%[[SHAPE]]) : (!fir.ref<!fir.array<10xf64>>, !fir.shape<1>) -> !fir.box<!fir.array<10xf64>>
// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_F64]] : (!fir.box<!fir.array<10xf64>>) -> !fir.box<none>		// CHECK: %[[A_BOX_NONE:.*]] = fir.convert %[[A_BOX_F64]] : (!fir.box<!fir.array<10xf64>>) -> !fir.box<none>
// CHECK: %[[RES:.*]] = fir.call @_FortranAMaxvalReal8_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> f64		// CHECK: %[[RES:.*]] = fir.call @_FortranAMaxvalReal8x1_simplified(%[[A_BOX_NONE]]) : (!fir.box<none>) -> f64
// CHECK: return %{{.*}} : f64		// CHECK: return %{{.*}} : f64
// CHECK: }		// CHECK: }

// CHECK-LABEL: func.func private @_FortranAMaxvalReal8_simplified(		// CHECK-LABEL: func.func private @_FortranAMaxvalReal8x1_simplified(
// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> f64 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {		// CHECK-SAME: %[[ARR:.*]]: !fir.box<none>) -> f64 attributes {llvm.linkage = #llvm.linkage<linkonce_odr>} {
// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index		// CHECK: %[[CINDEX_0:.*]] = arith.constant 0 : index
// CHECK: %[[ARR_BOX_F64:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xf64>>		// CHECK: %[[ARR_BOX_F64:.*]] = fir.convert %[[ARR]] : (!fir.box<none>) -> !fir.box<!fir.array<?xf64>>
// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_F64]], %[[CINDEX_0]] : (!fir.box<!fir.array<?xf64>>, index) -> (index, index, index)		// CHECK: %[[NEG_DBL_MAX:.*]] = arith.constant -1.7976931348623157E+308 : f64
// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index		// CHECK: %[[CINDEX_1:.*]] = arith.constant 1 : index
		// CHECK: %[[DIMIDX_0:.*]] = arith.constant 0 : index
		// CHECK: %[[DIMS:.*]]:3 = fir.box_dims %[[ARR_BOX_F64]], %[[DIMIDX_0]] : (!fir.box<!fir.array<?xf64>>, index) -> (index, index, index)
// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index		// CHECK: %[[EXTENT:.*]] = arith.subi %[[DIMS]]#1, %[[CINDEX_1]] : index
// CHECK: %[[NEG_DBL_MAX:.*]] = arith.constant -1.7976931348623157E+308 : f64
// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[MAX]] = %[[NEG_DBL_MAX]]) -> (f64) {		// CHECK: %[[RES:.]] = fir.do_loop %[[ITER:.]] = %[[CINDEX_0]] to %[[EXTENT]] step %[[CINDEX_1]] iter_args(%[[MAX]] = %[[NEG_DBL_MAX]]) -> (f64) {
// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_F64]], %[[ITER]] : (!fir.box<!fir.array<?xf64>>, index) -> !fir.ref<f64>		// CHECK: %[[ITEM:.*]] = fir.coordinate_of %[[ARR_BOX_F64]], %[[ITER]] : (!fir.box<!fir.array<?xf64>>, index) -> !fir.ref<f64>
// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<f64>		// CHECK: %[[ITEM_VAL:.*]] = fir.load %[[ITEM]] : !fir.ref<f64>
// CHECK: %[[NEW_MAX:.*]] = arith.maxf %[[ITEM_VAL]], %[[MAX]] : f64		// CHECK: %[[NEW_MAX:.*]] = arith.maxf %[[ITEM_VAL]], %[[MAX]] : f64
// CHECK: fir.result %[[NEW_MAX]] : f64		// CHECK: fir.result %[[NEW_MAX]] : f64
// CHECK: }		// CHECK: }
// CHECK: return %[[RES]] : f64		// CHECK: return %[[RES]] : f64
// CHECK: }		// CHECK: }

		// -----

		// SUM reduction of sliced explicit-shape array is replaced with
		// 2D simplified implementation.
		func.func @sum_sliced_embox_i64(%arg0: !fir.ref<!fir.array<10x10x10xi64>> {fir.bindc_name = "a"}) -> f32 {
		%c10 = arith.constant 10 : index
		%c10_0 = arith.constant 10 : index
		%c10_1 = arith.constant 10 : index
		%0 = fir.alloca f32 {bindc_name = "sum_sliced_embox_i64", uniq_name = "_QFsum_sliced_embox_i64Esum_sliced_embox_i64"}
		%1 = fir.alloca i64 {bindc_name = "sum_sliced_i64", uniq_name = "_QFsum_sliced_embox_i64Esum_sliced_i64"}
		%c1 = arith.constant 1 : index
		%c1_i64 = arith.constant 1 : i64
		%2 = fir.convert %c1_i64 : (i64) -> index
		%3 = arith.addi %c1, %c10 : index
		%4 = arith.subi %3, %c1 : index
		%c1_i64_2 = arith.constant 1 : i64
		%5 = fir.convert %c1_i64_2 : (i64) -> index
		%6 = arith.addi %c1, %c10_0 : index
		%7 = arith.subi %6, %c1 : index
		%c1_i64_3 = arith.constant 1 : i64
		%8 = fir.undefined index
		%9 = fir.shape %c10, %c10_0, %c10_1 : (index, index, index) -> !fir.shape<3>
		%10 = fir.slice %c1, %4, %2, %c1, %7, %5, %c1_i64_3, %8, %8 : (index, index, index, index, index, index, i64, index, index) -> !fir.slice<3>
		%11 = fir.embox %arg0(%9) [%10] : (!fir.ref<!fir.array<10x10x10xi64>>, !fir.shape<3>, !fir.slice<3>) -> !fir.box<!fir.array<?x?xi64>>
		%12 = fir.absent !fir.box<i1>
		%c0 = arith.constant 0 : index
		%13 = fir.address_of(@_QQcl.2E2F746573742E66393000) : !fir.ref<!fir.char<1,11>>
		%c3_i32 = arith.constant 3 : i32
		%14 = fir.convert %11 : (!fir.box<!fir.array<?x?xi64>>) -> !fir.box<none>
		%15 = fir.convert %13 : (!fir.ref<!fir.char<1,11>>) -> !fir.ref<i8>
		%16 = fir.convert %c0 : (index) -> i32
		%17 = fir.convert %12 : (!fir.box<i1>) -> !fir.box<none>
		%18 = fir.call @_FortranASumInteger8(%14, %15, %c3_i32, %16, %17) : (!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i64
		fir.store %18 to %1 : !fir.ref<i64>
		%19 = fir.load %0 : !fir.ref<f32>
		return %19 : f32
		}
		func.func private @_FortranASumInteger8(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i64 attributes {fir.runtime}
		fir.global linkonce @_QQcl.2E2F746573742E66393000 constant : !fir.char<1,11> {
		%0 = fir.string_lit "./test.f90\00"(11) : !fir.char<1,11>
		fir.has_value %0 : !fir.char<1,11>
		}

		// CHECK-NOT: call{{.*}}_FortranASumInteger8(
		// CHECK: call @_FortranASumInteger8x2_simplified(
		// CHECK-NOT: call{{.*}}_FortranASumInteger8(

		// -----

		// SUM reduction of sliced assumed-shape array is replaced with
		// 2D simplified implementation.
		func.func @_QPsum_sliced_rebox_i64(%arg0: !fir.box<!fir.array<?x?x?xi64>> {fir.bindc_name = "a"}) -> f32 {
		%0 = fir.alloca i64 {bindc_name = "sum_sliced_i64", uniq_name = "_QFsum_sliced_rebox_i64Esum_sliced_i64"}
		%1 = fir.alloca f32 {bindc_name = "sum_sliced_rebox_i64", uniq_name = "_QFsum_sliced_rebox_i64Esum_sliced_rebox_i64"}
		%c1 = arith.constant 1 : index
		%c1_i64 = arith.constant 1 : i64
		%2 = fir.convert %c1_i64 : (i64) -> index
		%c0 = arith.constant 0 : index
		%3:3 = fir.box_dims %arg0, %c0 : (!fir.box<!fir.array<?x?x?xi64>>, index) -> (index, index, index)
		%4 = arith.addi %c1, %3#1 : index
		%5 = arith.subi %4, %c1 : index
		%c1_i64_0 = arith.constant 1 : i64
		%6 = fir.convert %c1_i64_0 : (i64) -> index
		%c1_1 = arith.constant 1 : index
		%7:3 = fir.box_dims %arg0, %c1_1 : (!fir.box<!fir.array<?x?x?xi64>>, index) -> (index, index, index)
		%8 = arith.addi %c1, %7#1 : index
		%9 = arith.subi %8, %c1 : index
		%c1_i64_2 = arith.constant 1 : i64
		%10 = fir.undefined index
		%11 = fir.slice %c1, %5, %2, %c1, %9, %6, %c1_i64_2, %10, %10 : (index, index, index, index, index, index, i64, index, index) -> !fir.slice<3>
		%12 = fir.rebox %arg0 [%11] : (!fir.box<!fir.array<?x?x?xi64>>, !fir.slice<3>) -> !fir.box<!fir.array<?x?xi64>>
		%13 = fir.absent !fir.box<i1>
		%c0_3 = arith.constant 0 : index
		%14 = fir.address_of(@_QQcl.2E2F746573742E66393000) : !fir.ref<!fir.char<1,11>>
		%c8_i32 = arith.constant 8 : i32
		%15 = fir.convert %12 : (!fir.box<!fir.array<?x?xi64>>) -> !fir.box<none>
		%16 = fir.convert %14 : (!fir.ref<!fir.char<1,11>>) -> !fir.ref<i8>
		%17 = fir.convert %c0_3 : (index) -> i32
		%18 = fir.convert %13 : (!fir.box<i1>) -> !fir.box<none>
		%19 = fir.call @_FortranASumInteger8(%15, %16, %c8_i32, %17, %18) : (!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i64
		fir.store %19 to %0 : !fir.ref<i64>
		%20 = fir.load %1 : !fir.ref<f32>
		return %20 : f32
		}
		func.func private @_FortranASumInteger8(!fir.box<none>, !fir.ref<i8>, i32, i32, !fir.box<none>) -> i64 attributes {fir.runtime}
		fir.global linkonce @_QQcl.2E2F746573742E66393000 constant : !fir.char<1,11> {
		%0 = fir.string_lit "./test.f90\00"(11) : !fir.char<1,11>
		fir.has_value %0 : !fir.char<1,11>
		}

		// CHECK-NOT: call{{.*}}_FortranASumInteger8(
		// CHECK: call @_FortranASumInteger8x2_simplified(
		// CHECK-NOT: call{{.*}}_FortranASumInteger8(