This is an archive of the discontinued LLVM Phabricator instance.

Interesting, I think the issue is actually that the hlfir.transpose is being rewritten to an hlfir.elemental with a different !hlfir.expr type (some constant extent information is lost):

Here is a repro with a single use of the result:

func.func @transpose4(%arg0: !fir.ref<!fir.array<2x2xf32>>, %n : index) {
  %x_shape = fir.shape %n, %n : (index, index) -> !fir.shape<2>
  %x:2 = hlfir.declare %arg0(%x_shape) {uniq_name = "a"} : (!fir.ref<!fir.array<2x2xf32>>, !fir.shape<2>) -> (!fir.ref<!fir.array<2x2xf32>>, !fir.ref<!fir.array<2x2xf32>>)
  %transpose = hlfir.transpose %x#0 : (!fir.ref<!fir.array<2x2xf32>>) -> !hlfir.expr<2x2xf32>
  hlfir.destroy %transpose : !hlfir.expr<2x2xf32>
  return
}

The hlfir.transpose -> !hlfir.expr<2x2xf32> is actually being rewritten to an hlfir.elemental -> !hlfir.expr<?x?xf32> because the fir.shape of the transpose argument does not have constant arguments. When using replaceOp, the mlir::ConversionPatternRewriter (see [1]) does a check, is unhappy with this type mismatch, and refuses to do the replacement (see [2]). These checks are not done when manually using replaceAllUsesWith and eraseOp, and the operand types just change to !hlfir.expr<?x?xf32>.
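
To make the mismatch concrete, here is a small standalone C++ model. It is not flang or MLIR code: `ExprType`, `typeFromShape`, and `toString` are hypothetical names invented for this illustration, and the element type is hard-coded to f32. It only sketches the idea that a type derived from shape operands keeps the extents that are known constants, so a shape built from `%n, %n` yields `?x?` and no longer compares equal to the declared `2x2` type.

```cpp
#include <cstdint>
#include <optional>
#include <string>
#include <vector>

// Toy stand-in for !hlfir.expr<...xf32>: each extent is either a known
// constant or dynamic (std::nullopt, printed as '?').
using Extent = std::optional<std::int64_t>;

struct ExprType {
  std::vector<Extent> extents;
  bool operator==(const ExprType &other) const {
    return extents == other.extents;
  }
};

// Hypothetical helper: build the type from the shape operands, keeping only
// the extents that are known constants. This mirrors the idea, not the API.
ExprType typeFromShape(const std::vector<Extent> &shapeOperands) {
  return ExprType{shapeOperands};
}

// Print the toy type in the same spelling as the IR in this thread.
std::string toString(const ExprType &type) {
  std::string text = "!hlfir.expr<";
  for (const Extent &extent : type.extents)
    text += (extent ? std::to_string(*extent) : std::string("?")) + "x";
  return text + "f32>";
}
```

In this model, `typeFromShape({std::nullopt, std::nullopt})` plays the role of the elemental built from `fir.shape %n, %n`, and comparing it with `typeFromShape({2, 2})` plays the role of the type check that rejects the replacement.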

While I think it may just be OK to change the hlfir.expr operand type in such a way on operations, if an hlfir.expr<> makes it to a block argument, it will be invalid to silently pass an hlfir.expr<?> to an hlfir.expr<2> block argument.

Lowering could be improved to build a better shape for this case (first example of the bug), but the second test mentioned in this bug would be a bit harder to fix via lowering. And the reverse could be true after MLIR folding: using the shape could allow building an hlfir.elemental with an !hlfir.expr<cst> while the transpose had !hlfir.expr<T>.

So I am thinking we may want to "force" the replacement hlfir.elemental result type to match the hlfir.transpose here for robustness. If I were sure we could always replace the uses with a "better typed" hlfir.expr during this pass, I would go for it, but I think the block argument case would be problematic, and although block arguments are not used now, I do not want to rule them out. Introducing an hlfir.expr_cast would require some thinking about the implications (but why not).
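
The "force the result type" idea can be sketched as an optional override with a fallback. The following is a toy C++ model under stated assumptions: types are plain strings, and `deriveTypeFromShape` and `pickElementalType` are hypothetical names, not flang functions.

```cpp
#include <optional>
#include <string>

// Toy model: result types are plain strings such as "!hlfir.expr<2x2xf32>".
// Hypothetical stand-in for whatever computes the type from shape operands.
std::string deriveTypeFromShape(bool extentsAreConstant) {
  return extentsAreConstant ? "!hlfir.expr<2x2xf32>" : "!hlfir.expr<?x?xf32>";
}

// If the caller supplies a type, use it verbatim; otherwise fall back to the
// type derived from the shape.
std::string pickElementalType(
    bool extentsAreConstant,
    std::optional<std::string> forcedType = std::nullopt) {
  if (forcedType)
    return *forcedType;
  return deriveTypeFromShape(extentsAreConstant);
}
```

Passing the type of the operation being replaced as `forcedType` keeps the replacement type identical even when the shape operands are not constant, at the cost of not picking up extent information that the shape could otherwise provide.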

What do you think?

[1]: although the pass patterns take an mlir::PatternRewriter, it is actually an mlir::ConversionPatternRewriter instance because that is what mlir::applyFullConversion uses.
[2]: https://github.com/llvm/llvm-project/blob/ffe0495105fb67da4e07d1a22d684239ea46a57f/mlir/lib/Transforms/Utils/DialectConversion.cpp#L2486

Ensure that the type of the hlfir.expr is not changed

Thanks for taking a look. It is a shame that it isn't safe to improve the shape information of hlfir expressions. The problems with block arguments aren't clear to me - could you expand on that?

I've updated the patch to make sure the type stays the same. I propose we merge this as it is, and come back to this later if we want to intentionally tweak shapes.

Harbormaster completed remote builds in B240206: Diff 533207. (Jun 21 2023, 4:12 AM)

Thanks for the update, this looks good to me!

It is a shame that it isn't safe to improve the shape information of hlfir expressions. The problems with block arguments aren't clear to me - could you expand on that?

Sure, here is an illustration with an hlfir.expr block argument. It is a weird implementation of CALL FOO(MERGE(SUM(TRANSPOSE(X)), SUM(MATMUL(Y,Y)), CONDITION)).

func.func @block_arg_test(%argx: !fir.ref<!fir.array<?x?xf32>>, %argy: !fir.ref<!fir.array<?x?xf32>>, %n : index, %condition : i1) {
  %c2 = arith.constant 2 : index
  %x_shape = fir.shape %c2, %c2 : (index, index) -> !fir.shape<2>
  %x:2 = hlfir.declare %argx(%x_shape) {uniq_name = "x"} : (!fir.ref<!fir.array<?x?xf32>>, !fir.shape<2>) -> (!fir.box<!fir.array<?x?xf32>>, !fir.ref<!fir.array<?x?xf32>>)
  %y_shape = fir.shape %n, %n : (index, index) -> !fir.shape<2>
  %y:2 = hlfir.declare %argy(%y_shape) {uniq_name = "y"} : (!fir.ref<!fir.array<?x?xf32>>, !fir.shape<2>) -> (!fir.box<!fir.array<?x?xf32>>, !fir.ref<!fir.array<?x?xf32>>)
  cf.cond_br %condition, ^bb1, ^bb2

  // EXPR <- TRANSPOSE(X)
  ^bb1:
  %transpose = hlfir.transpose %x#0 : (!fir.box<!fir.array<?x?xf32>>) -> !hlfir.expr<?x?xf32>
  cf.br ^bb3(%transpose : !hlfir.expr<?x?xf32>)

  // EXPR <- MATMUL(Y, Y)
  ^bb2:
  %matmul = hlfir.matmul %y#0 %y#0 : (!fir.box<!fir.array<?x?xf32>>, !fir.box<!fir.array<?x?xf32>>) -> !hlfir.expr<?x?xf32>
  cf.br ^bb3(%matmul : !hlfir.expr<?x?xf32>)

  // FOO(SUM(EXPR))
  ^bb3(%block_arg: !hlfir.expr<?x?xf32>): 
  %sum = hlfir.sum %block_arg : (!hlfir.expr<?x?xf32>) -> f32
  fir.call @foo(%sum) : (f32) -> ()
  hlfir.destroy %block_arg : !hlfir.expr<?x?xf32>
  return
}
func.func private @foo(f32)

With your previous patch (manually using replaceAllUsesWith without preserving the type), there is still an MLIR verifier error here because the cf.br ^bb3(%transpose) operand type is updated to !hlfir.expr<2x2xf32> and now mismatches the block argument type in the definition of ^bb3. The ^bb3 definition cannot be "force" updated here because it has another use, cf.br ^bb3(%matmul), that still passes an !hlfir.expr<?x?xf32>.

fir-opt --simplify-hlfir-intrinsics

repro.mlir:12:3: error: type mismatch for bb argument #0 of successor #0
  cf.br ^bb3(%transpose : !hlfir.expr<?x?xf32>)
  ^
repro.mlir:12:3: note: see current operation: "cf.br"(%6)[^bb3] : (!hlfir.expr<2x2xf32>) -> ()

It is possible (and desired!) for the fir.shape operands to become constant after MLIR folding/inlining/load-to-store....
If we wanted this to work, while still being able to improve the hlfir.expr type when possible, we would need to add something like hlfir.expr_convert and register it in the MLIR pass as a way to legalize hlfir.expr<> mismatches between dynamic and constant extents (it would be inserted before the branch to ^bb3(%transpose)).
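
The block argument constraint can be sketched with a toy C++ model. This is not MLIR code: `BlockModel`, `verifyBranches`, and `refineOnePredecessor` are hypothetical names, types are plain strings, and the conversion operation discussed above is not modeled. It only illustrates why refining the operand type of one predecessor breaks a block that has a second predecessor.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Toy model of a block with one argument, plus the operand type passed by
// each predecessor branch. All names here are hypothetical.
struct BlockModel {
  std::string argumentType;
  std::vector<std::string> branchOperandTypes;
};

// Mirrors the rule behind the verifier error above: every branch operand
// type must equal the block argument type exactly.
bool verifyBranches(const BlockModel &block) {
  for (const std::string &operandType : block.branchOperandTypes)
    if (operandType != block.argumentType)
      return false;
  return true;
}

// Refine the operand type of one predecessor without touching the block
// argument or the other predecessors (what a type-changing rewrite does).
BlockModel refineOnePredecessor(BlockModel block, std::size_t index,
                                const std::string &refinedType) {
  block.branchOperandTypes.at(index) = refinedType;
  return block;
}
```

With two predecessors both passing `!hlfir.expr<?x?xf32>`, the model verifies; after refining only the first one to `!hlfir.expr<2x2xf32>`, it does not, and the block argument cannot be changed to match because the second predecessor still passes the dynamic type.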

Now, all this is a bit theoretical since hlfir bufferization currently does not allow hlfir.expr<> block arguments, lowering does not generate such code, and MLIR block merging is disabled. Before enabling it, I would rather have a strong use case and a clear idea of what it implies regarding hlfir.expr lifetime and destruction.

This revision is now accepted and ready to land. (Jun 21 2023, 6:39 AM)

jeanPerier added inline comments. (Jun 21 2023, 6:42 AM)

flang/lib/Optimizer/HLFIR/Transforms/SimplifyHLFIRIntrinsics.cpp
Lines 68–72: Note that you should be able to go back to replaceOp (which does the assert as a verification) if you want to, now that the type is preserved.

Move back to using replaceOp()

Harbormaster completed remote builds in B240253: Diff 533273. (Jun 21 2023, 8:46 AM)

Closed by commit rG74adc3e0ebfb: [flang][hlfir] fix missing conversion in transpose simplification (authored by tblah). (Jun 21 2023, 9:59 AM)

This revision was automatically updated to reflect the committed changes.

tblah added a commit: rG74adc3e0ebfb: [flang][hlfir] fix missing conversion in transpose simplification.

Revision Contents

Path                                                              Size
flang/include/flang/Optimizer/Builder/HLFIRTools.h                4 lines
flang/lib/Optimizer/Builder/HLFIRTools.cpp                        12 lines
flang/lib/Optimizer/HLFIR/Transforms/SimplifyHLFIRIntrinsics.cpp  11 lines
flang/test/HLFIR/simplify-hlfir-intrinsics.fir                    91 lines

Diff 533313

flang/include/flang/Optimizer/Builder/HLFIRTools.h

 /// input entity type if it is scalar. Will crash if the entity is not a
 /// variable.
 mlir::Type getVariableElementType(hlfir::Entity variable);
 
 using ElementalKernelGenerator = std::function<hlfir::Entity(
     mlir::Location, fir::FirOpBuilder &, mlir::ValueRange)>;
 /// Generate an hlfir.elementalOp given call back to generate the element
 /// value at for each iteration.
+/// If exprType is specified, this will be the return type of the elemental op
 hlfir::ElementalOp genElementalOp(mlir::Location loc,
                                   fir::FirOpBuilder &builder,
                                   mlir::Type elementType, mlir::Value shape,
                                   mlir::ValueRange typeParams,
-                                  const ElementalKernelGenerator &genKernel);
+                                  const ElementalKernelGenerator &genKernel,
+                                  mlir::Type exprType = mlir::Type{});
 
 /// Structure to describe a loop nest.
 struct LoopNest {
   fir::DoLoopOp outerLoop;
   fir::DoLoopOp innerLoop;
   llvm::SmallVector<mlir::Value> oneBasedIndices;
 };

flang/lib/Optimizer/Builder/HLFIRTools.cpp

@@ static hlfir::ExprType getArrayExprType(mlir::Type elementType,
   if (auto shapeOp = shape.getDefiningOp<fir::ShapeOp>())
     for (auto extent : llvm::enumerate(shapeOp.getExtents()))
       if (auto cstExtent = fir::getIntIfConstant(extent.value()))
         typeShape[extent.index()] = *cstExtent;
   return hlfir::ExprType::get(elementType.getContext(), typeShape, elementType,
                               isPolymorphic);
 }
 
-hlfir::ElementalOp
-hlfir::genElementalOp(mlir::Location loc, fir::FirOpBuilder &builder,
-                      mlir::Type elementType, mlir::Value shape,
-                      mlir::ValueRange typeParams,
-                      const ElementalKernelGenerator &genKernel) {
-  mlir::Type exprType = getArrayExprType(elementType, shape, false);
+hlfir::ElementalOp hlfir::genElementalOp(
+    mlir::Location loc, fir::FirOpBuilder &builder, mlir::Type elementType,
+    mlir::Value shape, mlir::ValueRange typeParams,
+    const ElementalKernelGenerator &genKernel, mlir::Type exprType) {
+  if (!exprType)
+    exprType = getArrayExprType(elementType, shape, false);
   auto elementalOp =
       builder.create<hlfir::ElementalOp>(loc, exprType, shape, typeParams);
   auto insertPt = builder.saveInsertionPoint();
   builder.setInsertionPointToStart(elementalOp.getBody());
   mlir::Value elementResult = genKernel(loc, builder, elementalOp.getIndices());
   // Numerical and logical scalars may be lowered to another type than the
   // Fortran expression type (e.g i1 instead of fir.logical). Array expression
   // values are typed according to their Fortran type. Insert a cast if needed

flang/lib/Optimizer/HLFIR/Transforms/SimplifyHLFIRIntrinsics.cpp

@@ auto genKernel = [&array](mlir::Location loc, fir::FirOpBuilder &builder,
       assert(inputIndices.size() == 2 && "checked in TransposeOp::validate");
       mlir::ValueRange transposedIndices{{inputIndices[1], inputIndices[0]}};
       hlfir::Entity element =
           hlfir::getElementAt(loc, builder, array, transposedIndices);
       hlfir::Entity val = hlfir::loadTrivialScalar(loc, builder, element);
       return val;
     };
     hlfir::ElementalOp elementalOp = hlfir::genElementalOp(
-        loc, builder, elementType, resultShape, typeParams, genKernel);
+        loc, builder, elementType, resultShape, typeParams, genKernel,
+        transpose.getResult().getType());
 
-    rewriter.replaceOp(transpose, elementalOp.getResult());
+    // it wouldn't be safe to replace block arguments with a different
+    // hlfir.expr type. Types can differ due to differing amounts of shape
+    // information
+    assert(elementalOp.getResult().getType() ==
+           transpose.getResult().getType());
+
+    rewriter.replaceOp(transpose, elementalOp);
     return mlir::success();
   }
 
 private:
   static mlir::Value genResultShape(mlir::Location loc,
                                     fir::FirOpBuilder &builder,
                                     hlfir::Entity array) {
     mlir::Value inShape = hlfir::genShape(loc, builder, array);
     llvm::SmallVector<mlir::Value> inExtents =

flang/test/HLFIR/simplify-hlfir-intrinsics.fir

 // CHECK: %[[OUT_SHAPE:.*]] = fir.shape %[[C2]], %[[EXTENT0]] : (index, index) -> !fir.shape<2>
 // CHECK: %[[EXPR:.*]] = hlfir.elemental %[[OUT_SHAPE]] : (!fir.shape<2>) -> !hlfir.expr<2x?xi32> {
 // CHECK: ^bb0(%[[I:.*]]: index, %[[J:.*]]: index):
 // CHECK: %[[ELEMENT:.*]] = hlfir.apply %[[ARG0]], %[[J]], %[[I]] : (!hlfir.expr<?x2xi32>, index, index) -> i32
 // CHECK: hlfir.yield_element %[[ELEMENT]] : i32
 // CHECK: }
 // CHECK: return
 // CHECK: }
 
+// expr with multiple uses
+func.func @transpose4(%arg0: !hlfir.expr<2x2xf32>, %arg1: !fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf32>>>>) {
+  %0 = hlfir.transpose %arg0 : (!hlfir.expr<2x2xf32>) -> !hlfir.expr<2x2xf32>
+  %1 = hlfir.shape_of %0 : (!hlfir.expr<2x2xf32>) -> !fir.shape<2>
+  %2 = hlfir.elemental %1 : (!fir.shape<2>) -> !hlfir.expr<2x2xf32> {
+  ^bb0(%arg2: index, %arg3: index):
+    %3 = hlfir.apply %0, %arg2, %arg3 : (!hlfir.expr<2x2xf32>, index, index) -> f32
+    %4 = math.cos %3 fastmath<contract> : f32
+    hlfir.yield_element %4 : f32
+  }
+  hlfir.assign %2 to %arg1 realloc : !hlfir.expr<2x2xf32>, !fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf32>>>>
+  hlfir.destroy %2 : !hlfir.expr<2x2xf32>
+  hlfir.destroy %0 : !hlfir.expr<2x2xf32>
+  return
+}
+// CHECK-LABEL: func.func @transpose4(
+// CHECK-SAME: %[[ARG0:.*]]: !hlfir.expr<2x2xf32>
+// CHECK-SAME: %[[ARG1:.*]]:
+// CHECK: %[[SHAPE0:.*]] = fir.shape
+// CHECK: %[[TRANSPOSE:.*]] = hlfir.elemental %[[SHAPE0]] : (!fir.shape<2>) -> !hlfir.expr<2x2xf32> {
+// CHECK: ^bb0(%[[I:.*]]: index, %[[J:.*]]: index):
+// CHECK: %[[ELE:.*]] = hlfir.apply %[[ARG0]], %[[J]], %[[I]] : (!hlfir.expr<2x2xf32>, index, index) -> f32
+// CHECK: hlfir.yield_element %[[ELE]] : f32
+// CHECK: }
+// CHECK: %[[SHAPE1:.*]] = hlfir.shape_of %[[TRANSPOSE]] : (!hlfir.expr<2x2xf32>) -> !fir.shape<2>
+// CHECK: %[[COS:.*]] = hlfir.elemental %[[SHAPE1]] : (!fir.shape<2>) -> !hlfir.expr<2x2xf32> {
+// CHECK: ^bb0(%[[I:.*]]: index, %[[J:.*]]: index):
+// CHECK: %[[ELE:.*]] = hlfir.apply %[[TRANSPOSE]], %[[I]], %[[J]] : (!hlfir.expr<2x2xf32>, index, index) -> f32
+// CHECK: %[[COS_ELE:.*]] = math.cos %[[ELE]] fastmath<contract> : f32
+// CHECK: hlfir.yield_element %[[COS_ELE]] : f32
+// CHECK: }
+// CHECK: hlfir.assign %[[COS]] to %[[ARG1]] realloc
+// CHECK: hlfir.destroy %[[COS]] : !hlfir.expr<2x2xf32>
+// CHECK: hlfir.destroy %[[TRANSPOSE]] : !hlfir.expr<2x2xf32>
+// CHECK: return
+// CHECK: }
+
+// regression test
+func.func @transpose5(%arg0: !fir.ref<tuple<!fir.box<!fir.array<2x2xf64>>, !fir.box<!fir.array<2x2xf64>>>> {fir.host_assoc}) attributes {fir.internal_proc} {
+  %0 = fir.address_of(@_QFEb) : !fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf64>>>>
+  %1:2 = hlfir.declare %0 {fortran_attrs = #fir.var_attrs<allocatable>, uniq_name = "_QFEb"} : (!fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf64>>>>) -> (!fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf64>>>>, !fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf64>>>>)
+  %c0_i32 = arith.constant 0 : i32
+  %2 = fir.coordinate_of %arg0, %c0_i32 : (!fir.ref<tuple<!fir.box<!fir.array<2x2xf64>>, !fir.box<!fir.array<2x2xf64>>>>, i32) -> !fir.ref<!fir.box<!fir.array<2x2xf64>>>
+  %3 = fir.load %2 : !fir.ref<!fir.box<!fir.array<2x2xf64>>>
+  %4 = fir.box_addr %3 : (!fir.box<!fir.array<2x2xf64>>) -> !fir.ref<!fir.array<2x2xf64>>
+  %c0 = arith.constant 0 : index
+  %5:3 = fir.box_dims %3, %c0 : (!fir.box<!fir.array<2x2xf64>>, index) -> (index, index, index)
+  %c1 = arith.constant 1 : index
+  %6:3 = fir.box_dims %3, %c1 : (!fir.box<!fir.array<2x2xf64>>, index) -> (index, index, index)
+  %7 = fir.shape %5#1, %6#1 : (index, index) -> !fir.shape<2>
+  %8:2 = hlfir.declare %4(%7) {uniq_name = "_QFEa"} : (!fir.ref<!fir.array<2x2xf64>>, !fir.shape<2>) -> (!fir.ref<!fir.array<2x2xf64>>, !fir.ref<!fir.array<2x2xf64>>)
+  %c1_i32 = arith.constant 1 : i32
+  %9 = fir.coordinate_of %arg0, %c1_i32 : (!fir.ref<tuple<!fir.box<!fir.array<2x2xf64>>, !fir.box<!fir.array<2x2xf64>>>>, i32) -> !fir.ref<!fir.box<!fir.array<2x2xf64>>>
+  %10 = fir.load %9 : !fir.ref<!fir.box<!fir.array<2x2xf64>>>
+  %11 = fir.box_addr %10 : (!fir.box<!fir.array<2x2xf64>>) -> !fir.ref<!fir.array<2x2xf64>>
+  %c0_0 = arith.constant 0 : index
+  %12:3 = fir.box_dims %10, %c0_0 : (!fir.box<!fir.array<2x2xf64>>, index) -> (index, index, index)
+  %c1_1 = arith.constant 1 : index
+  %13:3 = fir.box_dims %10, %c1_1 : (!fir.box<!fir.array<2x2xf64>>, index) -> (index, index, index)
+  %14 = fir.shape %12#1, %13#1 : (index, index) -> !fir.shape<2>
+  %15:2 = hlfir.declare %11(%14) {uniq_name = "_QFEc"} : (!fir.ref<!fir.array<2x2xf64>>, !fir.shape<2>) -> (!fir.ref<!fir.array<2x2xf64>>, !fir.ref<!fir.array<2x2xf64>>)
+  %16 = hlfir.transpose %8#0 : (!fir.ref<!fir.array<2x2xf64>>) -> !hlfir.expr<2x2xf64>
+  %17 = hlfir.shape_of %16 : (!hlfir.expr<2x2xf64>) -> !fir.shape<2>
+  %18 = hlfir.elemental %17 : (!fir.shape<2>) -> !hlfir.expr<?x?xf64> {
+  ^bb0(%arg1: index, %arg2: index):
+    %19 = hlfir.apply %16, %arg1, %arg2 : (!hlfir.expr<2x2xf64>, index, index) -> f64
+    %20 = math.cos %19 fastmath<contract> : f64
+    hlfir.yield_element %20 : f64
+  }
+  hlfir.assign %18 to %1#0 realloc : !hlfir.expr<?x?xf64>, !fir.ref<!fir.box<!fir.heap<!fir.array<?x?xf64>>>>
+  hlfir.destroy %18 : !hlfir.expr<?x?xf64>
+  hlfir.destroy %16 : !hlfir.expr<2x2xf64>
+  return
+}
+// CHECK-LABEL: func.func @transpose5(
+// ...
+// CHECK: %[[TRANSPOSE:.*]] = hlfir.elemental %[[SHAPE0:.*]]
+// CHECK: ^bb0(%[[I:.*]]: index, %[[J:.*]]: index):
+// CHECK: %[[ELE:.*]] = hlfir.designate %[[ARRAY:.*]] (%[[J]], %[[I]])
+// CHECK: %[[LOAD:.*]] = fir.load %[[ELE]]
+// CHECK: hlfir.yield_element %[[LOAD]]
+// CHECK: }
+// CHECK: %[[SHAPE1:.*]] = hlfir.shape_of %[[TRANSPOSE]]
+// CHECK: %[[COS:.*]] = hlfir.elemental %[[SHAPE1]]
+// ...
+// CHECK: hlfir.assign %[[COS]] to %{{.*}} realloc
+// CHECK: hlfir.destroy %[[COS]]
+// CHECK: hlfir.destroy %[[TRANSPOSE]]
+// CHECK: return
+// CHECK: }

[flang][hlfir] fix missing conversion in transpose simplification (Closed, Public)
