This is an archive of the discontinued LLVM Phabricator instance.

Paths

flang/
  include/flang/Optimizer/
    CodeGen/
      CGOps.td
      CGPasses.td
      CMakeLists.txt
    Dialect/
      FIRDialect.h
    Support/
      InitFIR.h
  lib/Optimizer/
    CMakeLists.txt
    CodeGen/
      CGOps.h
      CGOps.cpp
      PassDetail.h
      PreCGRewrite.cpp
  test/Fir/
    cg-ops.fir
  tools/
    fir-opt/
      fir-opt.cpp
    tco/
      tco.cpp

Differential D98063

[flang][fir] Add the pre-code gen rewrite pass and codegen ops.
ClosedPublic

Authored by schweitz on Mar 5 2021, 11:21 AM.


Details

Reviewers

clementval
kiranchandramohan
jeanPerier
svedanayagam
sscalpone
awarzynski
jdoerfert
mehdi_amini

Commits

rG97d8972c9cd1: [flang][fir] Add the pre-code gen rewrite pass and codegen ops.

Summary

Before the conversion to LLVM-IR dialect and ultimately LLVM IR, FIR is
partially rewritten into a codegen form. This patch adds that pass, the
fircg dialect, and the small set of Ops in the fircg (sub) dialect.
Fircg is not part of the FIR dialect and should never be used outside of
the (closed) conversion to LLVM IR.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

schweitz created this revision.Mar 5 2021, 11:21 AM

Herald added a reviewer: awarzynski.Mar 5 2021, 11:21 AM

Herald added subscribers: mehdi_amini, jdoerfert, mgorny.

schweitz requested review of this revision.Mar 5 2021, 11:21 AM

Herald added a reviewer: jdoerfert.Mar 5 2021, 11:21 AM

Herald added a project: Restricted Project.

Herald added subscribers: llvm-commits, sstefan1.

mehdi_amini added inline comments.Mar 5 2021, 12:08 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
55	Is the notion of "extended form" for embox documented anywhere? If not can you expand the doc here to describe what it is? (`rewriteStaticShape` and `rewriteDynamicShape` aren't documented, but that may not be necessary with a longer description for the pattern here, ideally with snippets example) (same for all patterns)
133	Nit: you can omit the `, 8` everywhere, `SmallVector` now computes a default. (that is unless you know that 8 is better than the default in this particular case of course)
200	Is this comment up-to-date?
235	By making this a module pass, we're losing out on parallelism. Can you make it an operation pass and filter here instead?
	void runOnOperation() {
	  Operation *op = getOperation();
	  if (auto func = op->dyn_cast<mlir::FuncOp>())
	    runOn(func, func.getBody());
	  if (auto global : op->dyn_cast<fir::GlobalOp>())
	    runOn(global, global.getRegion());
	}
241	This test looks spurious? The loop inside the body would not execute if there are no regions, right?
249	I don't quite get the delayed erasing; this whole simplification could be done without keeping state in a vector:
	region.walk([](Operation *op) {
	  if (auto embox = dyn_cast<EmboxOp>(op)) {
	    if (embox.getShape()) {
	      op->erase();
	      return;
	    }
	  }
	});
	(Note that `op->erase()` should already assert for `op->use_empty()`, so there is no need to duplicate that here.) But also, could you do it more simply?
	region.walk([](Operation *op) {
	  if (isOpTriviallyDead(op))
	    op->erase();
	});
flang/tools/fir-opt/fir-opt.cpp
21	What is the intended difference between `registerFIRPasses` and `registerOptPasses` ?
22	Why this change?
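
The operation-pass suggestion for line 235 can be modeled outside MLIR. The sketch below is only an illustration under stated assumptions: `ToyKind`, `ToyTopLevelOp`, `rewriteRegion`, `runOnOperation`, and `runOnModule` are hypothetical names, not MLIR or FIR classes. It shows the difference in shape between a pass that is invoked once per top-level op and filters on the op kind, and a pass that is invoked once and walks the whole module itself.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-ins for top-level ops in a module; not MLIR classes.
enum class ToyKind { Func, Global, Other };

struct ToyTopLevelOp {
  ToyKind kind = ToyKind::Other;
  std::string name;
  int regionsRewritten = 0;
};

// Plays the role of the shared runOn(op, region) helper in the comment above.
void rewriteRegion(ToyTopLevelOp &op) { ++op.regionsRewritten; }

// Operation-pass style: invoked once per op, filters on the op kind, and
// touches nothing outside the op it was handed.
void runOnOperation(ToyTopLevelOp &op) {
  if (op.kind == ToyKind::Func || op.kind == ToyKind::Global)
    rewriteRegion(op);
}

// Module-pass style, for contrast: a single invocation visits every op.
void runOnModule(std::vector<ToyTopLevelOp> &module) {
  for (ToyTopLevelOp &op : module)
    runOnOperation(op);
}
```

Because `runOnOperation` reads and writes only its own argument, a scheduler would be free to call it on different ops independently, which is the property the reviewer is asking for.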

Harbormaster completed remote builds in B92366: Diff 328597.Mar 6 2021, 5:12 AM

SouraVX added a subscriber: SouraVX.Mar 8 2021, 10:02 AM

SouraVX added inline comments.

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
300	NIT: These comments don't make sense here. They must have gone in while preparing the patch. Could you please update them accordingly?

SouraVX added inline comments.Mar 8 2021, 10:08 AM

flang/include/flang/Optimizer/CodeGen/CGPasses.td
27	I'm not sure whether we need the `OpenACC` dialect too (since OpenACC doesn't lower as `OpenMP`). @clementval, do you have any comments/thoughts on this?

clementval added inline comments.Mar 8 2021, 11:25 AM

flang/include/flang/Optimizer/CodeGen/CGPasses.td
27	We will probably need it later but since it is not done yet it can be added in a later patch.

mehdi_amini added inline comments.Mar 8 2021, 11:46 AM

flang/include/flang/Optimizer/CodeGen/CGPasses.td
27	You only need to list here the dialects that this pass introduces that aren't in the input. So basically if you take a `fir` op (or other dialect) and you turn it into an `OpenACC` op then you need to add the dependency (same for the other dialects listed here).

clementval added inline comments.Mar 8 2021, 11:52 AM

flang/include/flang/Optimizer/CodeGen/CGPasses.td
27	That's what I thought. Since there is no `fir` op to `openacc` op conversion it is not needed. At least for now.
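
The rule stated in this exchange can be written down as a set difference. This is a minimal sketch of the reasoning only, assuming dialects are identified by plain strings; `dialectsToDeclare` is a hypothetical helper and not part of MLIR.

```cpp
#include <algorithm>
#include <iterator>
#include <set>
#include <string>

// Hypothetical helper: returns the dialects a pass would need to list as
// dependent, i.e. those it may create ops from that are not already
// present in its input.
std::set<std::string>
dialectsToDeclare(const std::set<std::string> &mayCreate,
                  const std::set<std::string> &inInput) {
  std::set<std::string> result;
  std::set_difference(mayCreate.begin(), mayCreate.end(), inInput.begin(),
                      inInput.end(), std::inserter(result, result.end()));
  return result;
}
```

Under this reading, a pass that turns `fir` ops into `fircg` ops on `fir` input would declare only `fircg`, and a dialect the pass never produces (such as `openacc` here) would not appear in the list at all.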

schweitz added inline comments.Mar 8 2021, 12:52 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
133	Thanks.
200	Removed.
235	It might be possible, but a quick experiment showed a bunch of tests regressing.
241	Removed.
249	This should be a very general DCE like the one in MLIR's Transforms/CSE.cpp. That seems to have been mangled but can be brought back.
flang/tools/fir-opt/fir-opt.cpp
21	Fixed.
22	A synch problem with the source.

schweitz updated this revision to Diff 329114.Mar 8 2021, 12:54 PM

schweitz added inline comments.Mar 8 2021, 1:25 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
300	Removed. The pass is now defined in a tablegen file.

(seems like you need to run git clang-format)

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
235	What kind of regressions?
249	Can you just call this instead? https://github.com/llvm/llvm-project/blob/main/mlir/include/mlir/Transforms/RegionUtils.h#L57 In general, generic algorithms in passes like this end up being replicated everywhere when they could be exposed as general utilities. If `simplifyRegions` does not fit the bill here, can we introduce another utility there?
flang/tools/fir-opt/fir-opt.cpp
21	It is still unclear to me what is the intent here: `registerOptimizerPasses` is not documented, and `registerMLIRPassesForFortranTools` says "Register the standard passes we use" which seems like it should have all the required passes. Why do you introduce `registerOptimizerPasses` at all?
23	This does not seem needed

mehdi_amini added inline comments.Mar 8 2021, 6:06 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
235	Actually in case I wasn't clear, the snippet I wrote above requires to schedule the pass twice in a pipeline, the test would look like this: `fir-opt --pass-pipeline="func(cg-rewrite),fir.global(cg-rewrite)"` If you can provide an .mlir test where this regresses, I'd be happy to take a look.
flang/test/Fir/cg-ops.fir
18	Your test does not have any global op?

Harbormaster completed remote builds in B92733: Diff 329114.Mar 8 2021, 8:14 PM

schweitz updated this revision to Diff 329360.Mar 9 2021, 9:01 AM

Harbormaster completed remote builds in B92896: Diff 329360.Mar 9 2021, 6:12 PM

schweitz updated this revision to Diff 331348.Mar 17 2021, 12:42 PM

schweitz marked 2 inline comments as done.Mar 17 2021, 12:57 PM

schweitz added inline comments.

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
235	I spent some time working to make this pass multithreaded, including converting data structures to be thread local, etc. We have approximately 800 basic tests and many thousands of Fortran tests. The attempted changes caused many of these tests to regress and start failing. The existing pass accomplishes its task as is. There is no data-based argument that the pass is a bottleneck. The code is being upstreamed in a series of many patches, and there will be ample opportunity to write and debug a parallel algorithm in subsequent patches. Finally, anyone that wants to can contribute improvements on fir-dev.
249	Possibly. For the time being, we prefer to keep the simple dead-code cleanup.
flang/tools/fir-opt/fir-opt.cpp
21	Upstreaming the code is being done in a series of many patches. I have chosen to group code in ways that make the upstreaming process more manageable. For example, the MLIR registration interfaces have been changing, and we want those calls isolated until they can be upstreamed. Furthermore, link times are already expensive, so we have made efforts to reduce that impact by selecting specific required libraries from MLIR. For these reasons there is no reason to change this now. There will be opportunities to regroup registration calls in subsequent patches.

Harbormaster completed remote builds in B94302: Diff 331348.Mar 17 2021, 1:18 PM

mehdi_amini added inline comments.Mar 17 2021, 1:39 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
235	"The code is being upstreamed in a series of many patches, and there will be ample opportunity to write and debug a parallel algorithm in subsequent patches." The problem is that I could fix your pass right after you land it, but that would create issues for you downstream, and right now it seems you can't show upstream why it is an issue with a test for this pass. That seems problematic to me for upstream development right now. The "FuncOp Pass" vs "Module Pass" question is about more than just "waiting for a bottleneck": it is about system design as a whole. The fact that you bring up thread_local here seems fishy and makes me worried about what kind of skeletons we'll find down the road. It is also, in our experience, almost impossible to come back and fix this later after it creeps everywhere in the compiler. We have a good example of this: LLVM itself. I tried for a while various approaches to revisit some of the LLVM internals, but this is ingrained too far by now. MLIR was designed with more care to avoid any global state and produce "crash reproducers" that can be as hermetic as possible, for example. It'll be quite sad to me to give up on all this so early and without very strong justifications.
249	I'm fine if you don't want to call `simplifyRegions`, but then can you refactor this as a pre-patch into a utility in a common place to avoid code duplication then? I'm not enthusiastic to see each pass reimplementing generic utility / algorithm, and I don't see really a reason why we should at all. This has community/project wise implications as well, the CIRCT folks were reporting performance issues with this kind of code path just a few days ago: https://llvm.discourse.group/t/speeding-up-canonicalize/3015 ; we were able to work together to improve the general tooling and infra around this.
flang/tools/fir-opt/fir-opt.cpp
21	Can you clarify how this is impacting link time here? Can you introduce the APIs when they make sense in-tree, please? I have no way to figure out whether there is a good reason for what you send for review otherwise. I don't know what state your downstream project is in, but I'm concerned about the issue it is causing right now and the restriction it puts on the code structure and organization. For example, the regressions you mentioned that you spot downstream but that can't be reproduced upstream with a lit test.

Re the comment on losing on parallelism, the uploaded code is functionally correct and complete. We (or the community at large) can certainly make this execute in parallel once the up-streaming is done. Fast execution and maintainability are in everyone’s interests, including ours, and we will be motivated to update the code after we've up streamed code gen and its tests.

We would like to upstream this part of the patch with an understanding that it will be fixed at a later date.

In D98063#2638971, @svedanayagam wrote:

Re the comment on losing on parallelism, the uploaded code is functionally correct and complete.

This is true, but there are still a bunch of comments I have that are still unaddressed, even with the parallelism aside: i.e. refactoring DCE and not duplicating the registration function when it isn't needed or justified.

We (or the community at large) can certainly make this execute in parallel once the up-streaming is done. Fast execution and maintainability are in everyone’s interests, including ours, and we will be motivated to update the code after we've up streamed code gen and its tests.

We would like to upstream this part of the patch with an understanding that it will be fixed at a later date.

This does not answer my previous question: are you OK with me sending a patch to fix the parallelism there immediately after this patch lands?
I am under the impression right now that this is not possible because it would break some downstream tests in some ways that are unclear to me at the moment.

The OpenMP for Flang team has consistently been asking for upstreaming the fir-dev branch. I made an initial attempt to upstream a portion in May last year (https://reviews.llvm.org/D79731) which was discarded since it did not have any community support. The current situation presents a few difficulties for the OpenMP team.

The OpenMP dialect is in the llvm-project repo, while FIR codegen is developed in the fir-dev repo. So we have to first make a patch to llvm-project/mlir and then wait for it to be merged into the fir-dev repo (which can take from one to two weeks) and then make the relevant changes in the fir-dev repo. If FIR codegen was also upstream then this delay and committing to multiple repositories can be avoided.
Since the bridge code (parse-tree to FIR) and codegen is not available in llvm-project/flang, any commits that we make to fir-dev cannot be upstreamed. So all our changes are also increasing the diff between fir-dev and upstream llvm-project/flang. Left uncontrolled this might become an untameable monster and we might never be able to fully upstream fir-dev.
Since the OpenMP code which works with FIR is in fir-dev we cannot often show the context to MLIR core team. On at least one occasion this has become an issue while seeking help. If the code is upstream this will facilitate better discussions with the MLIR core team.

Given the issues mentioned above, we favour a faster upstreaming process so that the entire community can work on a single code base.

The MLIR core team has been very helpful with the OpenMP dialect work and we have benefited from their advice and review comments. I also fondly recall that Mehdi stepped in to support Flang/F18 (https://lists.llvm.org/pipermail/llvm-dev/2020-January/138219.html) when there was an opinion to consider other candidates. Mehdi has been very gracious with his time and has provided several reviews for FIR upstreaming. It would be unwise not to consider his review comments. We should try to address the DCE and registration comments. If addressing a comment requires substantial work, then maybe it can be deferred, with a suitable mechanism to track it publicly.

I hope we can soon find a way out of this impasse and make progress.

kiranchandramohan added inline comments.Mar 22 2021, 12:24 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp

249

Is the suggestion here to move the following DCE code to a file inside the mlir directory tree?

// Clean up the region.
  void simplifyRegion(mlir::Region &region) {
    for (auto &block : region.getBlocks())
      for (auto &op : block.getOperations()) {
        for (auto &reg : op.getRegions())
          simplifyRegion(reg);
        maybeEraseOp(&op);
      }
    doDCE();
  }

  /// Run a simple DCE cleanup to remove any dead code after the rewrites.
  void doDCE() {
    std::vector<mlir::Operation *> workList;
    workList.swap(opsToErase);
    while (!workList.empty()) {
      for (auto *op : workList) {
        std::vector<mlir::Value> opOperands(op->operand_begin(),
                                            op->operand_end());
        LLVM_DEBUG(llvm::dbgs() << "DCE on " << *op << '\n');
        ++numDCE;
        op->erase();
        for (auto opnd : opOperands)
          maybeEraseOp(opnd.getDefiningOp());
      }
      workList.clear();
      workList.swap(opsToErase);
    }
  }

  void maybeEraseOp(mlir::Operation *op) {
    if (!op)
      return;
    if (op->hasTrait<mlir::OpTrait::IsTerminator>())
      return;
    if (mlir::isOpTriviallyDead(op))
      opsToErase.push_back(op);
  }
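
The worklist structure of the DCE quoted above can be tried out in isolation. The following is a toy model under stated assumptions, not the FIR or MLIR implementation: `ToyOp`, `isTriviallyDead`, and `runToyDCE` are hypothetical names, and use counts are tracked by hand where MLIR would track them through its own use lists. It erases every dead op, then revisits the producers of the erased op's operands, which is the same propagation that `doDCE()` and `maybeEraseOp()` perform.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-in for mlir::Operation; not the MLIR API.
struct ToyOp {
  std::string name;
  std::vector<ToyOp *> operands; // ops whose results this op consumes
  int numUses = 0;               // number of live uses of this op's result
  bool hasSideEffects = false;   // e.g. a store or a call
  bool isTerminator = false;
  bool erased = false;
};

// Plays the role of mlir::isOpTriviallyDead in the snippet above.
bool isTriviallyDead(const ToyOp &op) {
  return !op.erased && op.numUses == 0 && !op.hasSideEffects;
}

// Erases dead ops and then reconsiders the producers of their operands.
// Returns the number of ops erased (the numDCE statistic in the patch).
int runToyDCE(const std::vector<ToyOp *> &ops) {
  std::vector<ToyOp *> workList;
  for (ToyOp *op : ops)
    if (!op->isTerminator && isTriviallyDead(*op))
      workList.push_back(op);

  int numErased = 0;
  while (!workList.empty()) {
    ToyOp *op = workList.back();
    workList.pop_back();
    if (op->erased)
      continue; // defensive: never erase the same op twice
    op->erased = true;
    ++numErased;
    for (ToyOp *producer : op->operands) {
      --producer->numUses;
      if (!producer->isTerminator && isTriviallyDead(*producer))
        workList.push_back(producer);
    }
  }
  return numErased;
}
```

For a chain such as a constant feeding a conversion feeding an unused op, erasing the unused op makes the conversion dead, which in turn makes the constant dead, while an op with side effects is left alone.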

mehdi_amini added inline comments.Mar 22 2021, 1:20 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
249	Yes, this isn't more involved than what you're mentioning. I'll be happy to help with this kind of thing; I just don't know how to proceed at the moment. The DCE refactoring is trivial; I'm more concerned about the testing problem with the pass and the inability to have IR tests upstream when there are issues.

schweitz added inline comments.Mar 22 2021, 3:48 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
249	The original suggestion was to make this pass not work on ModuleOp so that it would allow multithreading. That suggestion did not contain enough information. The code snippet does not compile. Fixing the compilation led to the test included in this patch failing. Subsequently an incorrect guess was made that appeared to fix the test in this patch. However, the guessed at solution was wrong as other tests not merged regressed. This was all done in an effort to follow your original suggestion. Clearly it is just not possible to merge tests that require code that has not been merged.

mehdi_amini added inline comments.Mar 22 2021, 6:23 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
249	I am not sure what you're trying to say here, would you like me to send a patch that can be applied on top of this one and change this pass in a way that works with the test in-tree?

mehdi_amini added inline comments.Mar 22 2021, 6:46 PM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
249	Here is the patch: https://reviews.llvm.org/D99132 for this part of the discussion.

Respond to review comments.

LGTM

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp
249	Seems like you updated to use this version of the patch. I'm sorry it took so long to get there for this revision: if something isn't clear in any of my comments, feel free to ask me to clarify, and I'm always happy to send a patch when it helps!

This revision is now accepted and ready to land.Mar 24 2021, 12:59 PM

Harbormaster completed remote builds in B95486: Diff 332984.Mar 24 2021, 1:21 PM

This revision was landed with ongoing or failed builds.Mar 24 2021, 7:27 PM

Closed by commit rG97d8972c9cd1: [flang][fir] Add the pre-code gen rewrite pass and codegen ops. (authored by schweitz).

This revision was automatically updated to reflect the committed changes.

schweitz added a commit: rG97d8972c9cd1: [flang][fir] Add the pre-code gen rewrite pass and codegen ops..

I made a trivial fix to get the builds to pass.
https://reviews.llvm.org/rG502f27e66fd9fe44cd45ec5acae3e18f15f2d8c6

In D98063#2650019, @kiranchandramohan wrote:

I made a trivial fix to get the builds to pass.
https://reviews.llvm.org/rG502f27e66fd9fe44cd45ec5acae3e18f15f2d8c6

I've just checked, all Flang public workers are back to :green:. Thank you for your prompt fix @kiranchandramohan ! @schweitz , thanks for working on this!

Revision Contents

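Path                                                  Size
flang/include/flang/Optimizer/CodeGen/CGOps.td        177 lines
flang/include/flang/Optimizer/CodeGen/CGPasses.td     16 lines
flang/include/flang/Optimizer/CodeGen/CMakeLists.txt  4 lines
flang/include/flang/Optimizer/Dialect/FIRDialect.h    10 lines
flang/include/flang/Optimizer/Support/InitFIR.h       14 lines
flang/lib/Optimizer/CMakeLists.txt                    5 lines
flang/lib/Optimizer/CodeGen/CGOps.h                   24 lines
flang/lib/Optimizer/CodeGen/CGOps.cpp                 64 lines
flang/lib/Optimizer/CodeGen/PassDetail.h              26 lines
flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp          263 lines
flang/test/Fir/cg-ops.fir                             30 lines
flang/tools/fir-opt/fir-opt.cpp                       4 lines
flang/tools/tco/tco.cpp                               2 lines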

Diff 333191

flang/include/flang/Optimizer/CodeGen/CGOps.td

This file was added.

				//===-- CGOps.td - FIR operation definitions ---------------- tablegen --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				///
				/// \file
				/// Definition of the FIRCG dialect operations
				///
				//===----------------------------------------------------------------------===//

				#ifndef FORTRAN_DIALECT_FIRCG_OPS
				#define FORTRAN_DIALECT_FIRCG_OPS

				include "mlir/IR/SymbolInterfaces.td"
				include "flang/Optimizer/Dialect/FIRTypes.td"

				def fircg_Dialect : Dialect {
				let name = "fircg";
				let cppNamespace = "::fir::cg";
				}

				// Base class for FIR CG operations.
				// All operations automatically get a prefix of "fircg.".
				class fircg_Op<string mnemonic, list<OpTrait> traits>
				: Op<fircg_Dialect, mnemonic, traits>;

				// Extended embox operation.
				def fircg_XEmboxOp : fircg_Op<"ext_embox", [AttrSizedOperandSegments]> {
				let summary = "for internal conversion only";

				let description = [{
				Prior to lowering to LLVM IR dialect, a non-scalar non-trivial embox op will
				be converted to an extended embox. This op will have the following sets of
				arguments.

				- memref: The memory reference being emboxed.
				- shape: A vector that is the runtime shape of the underlying array.
				- shift: A vector that is the runtime origin of the first element.
				The default is a vector of the value 1.
				- slice: A vector of triples that describe an array slice.
				- subcomponent: A vector of indices for subobject slicing.
				- LEN type parameters: A vector of runtime LEN type parameters that
				describe an correspond to the elemental derived type.

				The memref and shape arguments are mandatory. The rest are optional.
				}];

				let arguments = (ins
				AnyReferenceLike:$memref,
				Variadic<AnyIntegerType>:$shape,
				Variadic<AnyIntegerType>:$shift,
				Variadic<AnyIntegerType>:$slice,
				Variadic<AnyCoordinateType>:$subcomponent,
				Variadic<AnyIntegerType>:$lenParams
				);
				let results = (outs fir_BoxType);

				let assemblyFormat = [{
				$memref (`(`$shape^`)`)? (`origin` $shift^)? (`[`$slice^`]`)?
				(`path` $subcomponent^)? (`typeparams` $lenParams^)? attr-dict
				`:` functional-type(operands, results)
				}];

				let extraClassDeclaration = [{
				// The rank of the entity being emboxed
				unsigned getRank() { return shape().size(); }

				// The rank of the result. A slice op can reduce the rank.
				unsigned getOutRank();

				// The shape operands are mandatory and always start at 1.
				unsigned shapeOffset() { return 1; }
				unsigned shiftOffset() { return shapeOffset() + shape().size(); }
				unsigned sliceOffset() { return shiftOffset() + shift().size(); }
				unsigned subcomponentOffset() { return sliceOffset() + slice().size(); }
				unsigned lenParamOffset() {
				return subcomponentOffset() + subcomponent().size();
				}
				}];
				}

				// Extended rebox operation.
				def fircg_XReboxOp : fircg_Op<"ext_rebox", [AttrSizedOperandSegments]> {
				let summary = "for internal conversion only";

				let description = [{
				Prior to lowering to LLVM IR dialect, a non-scalar non-trivial rebox op will
				be converted to an extended rebox. This op will have the following sets of
				arguments.

				- box: The box being reboxed.
				- shape: A vector that is the new runtime shape for the array
				- shift: A vector that is the new runtime origin of the first element.
				The default is a vector of the value 1.
				- slice: A vector of triples that describe an array slice.
				- subcomponent: A vector of indices for subobject slicing.

				The box argument is mandatory, the other arguments are optional.
				There must not both be a shape and slice/subcomponent arguments
				}];

				let arguments = (ins
				fir_BoxType:$box,
				Variadic<AnyIntegerType>:$shape,
				Variadic<AnyIntegerType>:$shift,
				Variadic<AnyIntegerType>:$slice,
				Variadic<AnyCoordinateType>:$subcomponent
				);
				let results = (outs fir_BoxType);

				let assemblyFormat = [{
				$box (`(`$shape^`)`)? (`origin` $shift^)? (`[`$slice^`]`)?
				(`path` $subcomponent^) ? attr-dict
				`:` functional-type(operands, results)
				}];

				let extraClassDeclaration = [{
				// The rank of the entity being reboxed
				unsigned getRank();
				// The rank of the result box
				unsigned getOutRank();
				}];
				}


				// Extended array coordinate operation.
				def fircg_XArrayCoorOp : fircg_Op<"ext_array_coor", [AttrSizedOperandSegments]> {
				let summary = "for internal conversion only";

				let description = [{
				Prior to lowering to LLVM IR dialect, a non-scalar non-trivial embox op will
				be converted to an extended embox. This op will have the following sets of
				arguments.

				- memref: The memory reference of the array's data. It can be a fir.box if
				the underlying data is not contiguous.
				- shape: A vector that is the runtime shape of the underlying array.
				- shift: A vector that is the runtime origin of the first element.
				The default is a vector of the value 1.
				- slice: A vector of triples that describe an array slice.
				- subcomponent: A vector of indices that describe subobject slicing.
				- indices: A vector of runtime values that describe the coordinate of
				the element of the array to be computed.
				- LEN type parameters: A vector of runtime LEN type parameters that
				describe an correspond to the elemental derived type.

				The memref and indices arguments are mandatory.
				The shape argument is mandatory if the memref is not a box, and should be
				omitted otherwise. The rest of the arguments are optional.
				}];

				let arguments = (ins
				AnyRefOrBox:$memref,
				Variadic<AnyIntegerType>:$shape,
				Variadic<AnyIntegerType>:$shift,
				Variadic<AnyIntegerType>:$slice,
				Variadic<AnyCoordinateType>:$subcomponent,
				Variadic<AnyCoordinateType>:$indices,
				Variadic<AnyIntegerType>:$lenParams
				);
				let results = (outs fir_ReferenceType);

				let assemblyFormat = [{
				$memref (`(`$shape^`)`)? (`origin` $shift^)? (`[`$slice^`]`)?
				(`path` $subcomponent^)? `<`$indices`>` (`typeparams` $lenParams^)?
				attr-dict `:` functional-type(operands, results)
				}];

				let extraClassDeclaration = [{
				unsigned getRank();
				}];
				}

				#endif
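
The offset helpers declared for `fircg.ext_embox` chain the sizes of the variadic operand groups. The sketch below reproduces that arithmetic in a standalone struct so it can be checked by hand; `XEmboxSegments` and `numOperands` are hypothetical names introduced for illustration, and the struct stores the group sizes directly where the generated op class would query its operand ranges.

```cpp
// Hypothetical stand-in holding the size of each variadic operand group of
// one fircg.ext_embox; not the class generated from CGOps.td.
struct XEmboxSegments {
  unsigned shape = 0;
  unsigned shift = 0;
  unsigned slice = 0;
  unsigned subcomponent = 0;
  unsigned lenParams = 0;

  // Operand 0 is the memref, so the shape operands always start at 1.
  unsigned shapeOffset() const { return 1; }
  unsigned shiftOffset() const { return shapeOffset() + shape; }
  unsigned sliceOffset() const { return shiftOffset() + shift; }
  unsigned subcomponentOffset() const { return sliceOffset() + slice; }
  unsigned lenParamOffset() const {
    return subcomponentOffset() + subcomponent;
  }
  // Total operand count: the memref plus every group.
  unsigned numOperands() const { return lenParamOffset() + lenParams; }
};
```

As a worked case, a rank-2 array with a slice (two triples, so six slice operands) and no shift, subcomponent, or LEN parameters gives a shift offset of 3, a slice offset of 3, a subcomponent offset of 9, and nine operands in total. An absent group contributes zero, so its offset coincides with the offset of the next group.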

flang/include/flang/Optimizer/CodeGen/CGPasses.td

	//===-- CGPasses.td - code gen pass definition file --------- tablegen --===//			//===-- CGPasses.td - code gen pass definition file --------- tablegen --===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//
	// This file contains definitions for passes within the Optimizer/CodeGen/			// This file contains definitions for passes within the Optimizer/CodeGen/
	// directory.			// directory.
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef FLANG_OPTIMIZER_CODEGEN_PASSES			#ifndef FORTRAN_OPTIMIZER_CODEGEN_FIR_PASSES
	#define FLANG_OPTIMIZER_CODEGEN_PASSES			#define FORTRAN_OPTIMIZER_CODEGEN_FIR_PASSES

	include "mlir/Pass/PassBase.td"			include "mlir/Pass/PassBase.td"

	def CodeGenRewrite : Pass<"cg-rewrite", "mlir::ModuleOp"> {			def CodeGenRewrite : Pass<"cg-rewrite"> {
	let summary = "Rewrite some FIR ops into their code-gen forms.";			let summary = "Rewrite some FIR ops into their code-gen forms.";
	let description = [{			let description = [{
	Fuse specific subgraphs into single Ops for code generation.			Fuse specific subgraphs into single Ops for code generation.
	}];			}];
	let constructor = "fir::createFirCodeGenRewritePass()";			let constructor = "fir::createFirCodeGenRewritePass()";
	let dependentDialects = ["fir::FIROpsDialect"];			let dependentDialects = [
				"fir::FIROpsDialect", "fir::FIRCodeGenDialect", "mlir::BuiltinDialect",
				"mlir::LLVM::LLVMDialect", "mlir::omp::OpenMPDialect"
				];
				let statistics = [
				Statistic<"numDCE", "num-dce'd", "Number of operations eliminated">
				];
	}			}

	#endif // FLANG_OPTIMIZER_CODEGEN_PASSES			#endif // FORTRAN_OPTIMIZER_CODEGEN_FIR_PASSES

flang/include/flang/Optimizer/CodeGen/CMakeLists.txt

				set(LLVM_TARGET_DEFINITIONS CGOps.td)
				mlir_tablegen(CGOps.h.inc -gen-op-decls)
				mlir_tablegen(CGOps.cpp.inc -gen-op-defs)
				add_public_tablegen_target(CGOpsIncGen)

	set(LLVM_TARGET_DEFINITIONS CGPasses.td)			set(LLVM_TARGET_DEFINITIONS CGPasses.td)
	mlir_tablegen(CGPasses.h.inc -gen-pass-decls -name OptCodeGen)			mlir_tablegen(CGPasses.h.inc -gen-pass-decls -name OptCodeGen)
	add_public_tablegen_target(FIROptCodeGenPassIncGen)			add_public_tablegen_target(FIROptCodeGenPassIncGen)

flang/include/flang/Optimizer/Dialect/FIRDialect.h

	(34 lines of context not shown)

	private:			private:
	// Register the Attributes of this dialect.			// Register the Attributes of this dialect.
	void registerAttributes();			void registerAttributes();
	// Register the Types of this dialect.			// Register the Types of this dialect.
	void registerTypes();			void registerTypes();
	};			};

				/// The FIR codegen dialect is a dialect containing a small set of transient
				/// operations used exclusively during code generation.
				class FIRCodeGenDialect final : public mlir::Dialect {
				public:
				explicit FIRCodeGenDialect(mlir::MLIRContext *ctx);
				virtual ~FIRCodeGenDialect();

				static llvm::StringRef getDialectNamespace() { return "fircg"; }
				};

	} // namespace fir			} // namespace fir

	#endif // FORTRAN_OPTIMIZER_DIALECT_FIRDIALECT_H			#endif // FORTRAN_OPTIMIZER_DIALECT_FIRDIALECT_H

flang/include/flang/Optimizer/Support/InitFIR.h

	(15 lines of context not shown)
	#include "flang/Optimizer/Dialect/FIRDialect.h"			#include "flang/Optimizer/Dialect/FIRDialect.h"
	#include "mlir/Conversion/Passes.h"			#include "mlir/Conversion/Passes.h"
	#include "mlir/Dialect/Affine/Passes.h"			#include "mlir/Dialect/Affine/Passes.h"
	#include "mlir/InitAllDialects.h"			#include "mlir/InitAllDialects.h"
	#include "mlir/Pass/Pass.h"			#include "mlir/Pass/Pass.h"
	#include "mlir/Pass/PassRegistry.h"			#include "mlir/Pass/PassRegistry.h"
	#include "mlir/Transforms/LocationSnapshot.h"			#include "mlir/Transforms/LocationSnapshot.h"
	#include "mlir/Transforms/Passes.h"			#include "mlir/Transforms/Passes.h"
				#include "flang/Optimizer/CodeGen/CodeGen.h"

	namespace fir::support {			namespace fir::support {

	// The definitive list of dialects used by flang.			// The definitive list of dialects used by flang.
	#define FLANG_DIALECT_LIST \			#define FLANG_DIALECT_LIST \
	mlir::AffineDialect, FIROpsDialect, mlir::LLVM::LLVMDialect, \			mlir::AffineDialect, FIROpsDialect, FIRCodeGenDialect, \
	mlir::acc::OpenACCDialect, mlir::omp::OpenMPDialect, \			mlir::LLVM::LLVMDialect, mlir::acc::OpenACCDialect, \
	mlir::scf::SCFDialect, mlir::StandardOpsDialect, \			mlir::omp::OpenMPDialect, mlir::scf::SCFDialect, \
	mlir::vector::VectorDialect			mlir::StandardOpsDialect, mlir::vector::VectorDialect

	/// Register all the dialects used by flang.			/// Register all the dialects used by flang.
	inline void registerDialects(mlir::DialectRegistry &registry) {			inline void registerDialects(mlir::DialectRegistry &registry) {
	registry.insert<FLANG_DIALECT_LIST>();			registry.insert<FLANG_DIALECT_LIST>();
	}			}

	/// Forced load of all the dialects used by flang. Lowering is not an MLIR			/// Forced load of all the dialects used by flang. Lowering is not an MLIR
	/// pass, but a producer of FIR and MLIR. It is therefore a requirement that the			/// pass, but a producer of FIR and MLIR. It is therefore a requirement that the
	/// dialects be preloaded to be able to build the IR.			/// dialects be preloaded to be able to build the IR.
	inline void loadDialects(mlir::MLIRContext &context) {			inline void loadDialects(mlir::MLIRContext &context) {
	context.loadDialect<FLANG_DIALECT_LIST>();			context.loadDialect<FLANG_DIALECT_LIST>();
	}			}

	/// Register the standard passes we use. This comes from registerAllPasses(),			/// Register the standard passes we use. This comes from registerAllPasses(),
	/// but is a smaller set since we aren't using many of the passes found there.			/// but is a smaller set since we aren't using many of the passes found there.
	inline void registerFIRPasses() {			inline void registerMLIRPassesForFortranTools() {
	mlir::registerCanonicalizerPass();			mlir::registerCanonicalizerPass();
	mlir::registerCSEPass();			mlir::registerCSEPass();
	mlir::registerAffineLoopFusionPass();			mlir::registerAffineLoopFusionPass();
	mlir::registerLoopInvariantCodeMotionPass();			mlir::registerLoopInvariantCodeMotionPass();
	mlir::registerLoopCoalescingPass();			mlir::registerLoopCoalescingPass();
	mlir::registerStripDebugInfoPass();			mlir::registerStripDebugInfoPass();
	mlir::registerPrintOpStatsPass();			mlir::registerPrintOpStatsPass();
	mlir::registerInlinerPass();			mlir::registerInlinerPass();
	mlir::registerSCCPPass();			mlir::registerSCCPPass();
	mlir::registerMemRefDataFlowOptPass();			mlir::registerMemRefDataFlowOptPass();
	mlir::registerSymbolDCEPass();			mlir::registerSymbolDCEPass();
	mlir::registerLocationSnapshotPass();			mlir::registerLocationSnapshotPass();
	mlir::registerAffinePipelineDataTransferPass();			mlir::registerAffinePipelineDataTransferPass();

	mlir::registerAffineVectorizePass();			mlir::registerAffineVectorizePass();
	mlir::registerAffineLoopUnrollPass();			mlir::registerAffineLoopUnrollPass();
	mlir::registerAffineLoopUnrollAndJamPass();			mlir::registerAffineLoopUnrollAndJamPass();
	mlir::registerSimplifyAffineStructuresPass();			mlir::registerSimplifyAffineStructuresPass();
	mlir::registerAffineLoopInvariantCodeMotionPass();			mlir::registerAffineLoopInvariantCodeMotionPass();
	mlir::registerAffineLoopTilingPass();			mlir::registerAffineLoopTilingPass();
	mlir::registerAffineDataCopyGenerationPass();			mlir::registerAffineDataCopyGenerationPass();

	mlir::registerConvertAffineToStandardPass();			mlir::registerConvertAffineToStandardPass();

				// Flang passes
				fir::registerOptCodeGenPasses();
	}			}

	} // namespace fir::support			} // namespace fir::support

	#endif // FORTRAN_OPTIMIZER_SUPPORT_INITFIR_H			#endif // FORTRAN_OPTIMIZER_SUPPORT_INITFIR_H
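
The `FLANG_DIALECT_LIST` macro in InitFIR.h keeps a single list of dialect types and expands it into two variadic template calls, `registry.insert<...>()` and `context.loadDialect<...>()`, so adding `FIRCodeGenDialect` in one place updates both. Below is a minimal, self-contained sketch of that pattern. `ToyRegistry`, the `Toy*` structs, `TOY_DIALECT_LIST`, and `registerToyDialects` are hypothetical stand-ins for illustration only, not MLIR or flang APIs.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-ins for dialect classes; these are not the MLIR types.
struct ToyFIR { static const char *name() { return "fir"; } };
struct ToyFIRCG { static const char *name() { return "fircg"; } };
struct ToyLLVM { static const char *name() { return "llvm"; } };

// Single source of truth, in the spirit of FLANG_DIALECT_LIST.
#define TOY_DIALECT_LIST ToyFIR, ToyFIRCG, ToyLLVM

struct ToyRegistry {
  std::vector<std::string> names;

  // Variadic insert: records the name of every type in the pack, in order.
  template <typename... Dialects> void insert() {
    // Expanding the pack inside a braced list runs push_back once per type.
    int expand[] = {0, (names.push_back(Dialects::name()), 0)...};
    (void)expand;
  }
};

// The macro expands to a comma-separated template argument list.
inline void registerToyDialects(ToyRegistry &registry) {
  registry.insert<TOY_DIALECT_LIST>();
}
```

Because the list lives in one macro, a second consumer (the analogue of `loadDialects`) can reuse `TOY_DIALECT_LIST` without the two call sites drifting apart.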

flang/lib/Optimizer/CMakeLists.txt

	get_property(dialect_libs GLOBAL PROPERTY MLIR_DIALECT_LIBS)			get_property(dialect_libs GLOBAL PROPERTY MLIR_DIALECT_LIBS)

	add_flang_library(FIROptimizer			add_flang_library(FIROptimizer
	Dialect/FIRAttr.cpp			Dialect/FIRAttr.cpp
	Dialect/FIRDialect.cpp			Dialect/FIRDialect.cpp
	Dialect/FIROps.cpp			Dialect/FIROps.cpp
	Dialect/FIRType.cpp			Dialect/FIRType.cpp

	Support/FIRContext.cpp			Support/FIRContext.cpp
	Support/InternalNames.cpp			Support/InternalNames.cpp
	Support/KindMapping.cpp			Support/KindMapping.cpp

				CodeGen/CGOps.cpp
				CodeGen/PreCGRewrite.cpp

	Transforms/Inliner.cpp			Transforms/Inliner.cpp

	DEPENDS			DEPENDS
	FIROpsIncGen			FIROpsIncGen
				FIROptCodeGenPassIncGen
	FIROptTransformsPassIncGen			FIROptTransformsPassIncGen
				CGOpsIncGen
	${dialect_libs}			${dialect_libs}

	LINK_LIBS			LINK_LIBS
	${dialect_libs}			${dialect_libs}
	MLIRLLVMToLLVMIRTranslation			MLIRLLVMToLLVMIRTranslation
	MLIRTargetLLVMIRExport			MLIRTargetLLVMIRExport

	LINK_COMPONENTS			LINK_COMPONENTS
	AsmParser			AsmParser
	AsmPrinter			AsmPrinter
	Remarks			Remarks
	)			)

flang/lib/Optimizer/CodeGen/CGOps.h

This file was added.

				//===-- CGOps.h -------------------------------------------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// Coding style: https://mlir.llvm.org/getting_started/DeveloperGuide/
				//
				//===----------------------------------------------------------------------===//

				#ifndef OPTIMIZER_CODEGEN_CGOPS_H
				#define OPTIMIZER_CODEGEN_CGOPS_H

				#include "flang/Optimizer/Dialect/FIRType.h"
				#include "mlir/Dialect/StandardOps/IR/Ops.h"

				using namespace mlir;

				#define GET_OP_CLASSES
				#include "flang/Optimizer/CodeGen/CGOps.h.inc"

				#endif

flang/lib/Optimizer/CodeGen/CGOps.cpp

This file was added.

				//===-- CGOps.cpp -- FIR codegen operations -------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// Coding style: https://mlir.llvm.org/getting_started/DeveloperGuide/
				//
				//===----------------------------------------------------------------------===//

				#include "CGOps.h"
				#include "flang/Optimizer/Dialect/FIRDialect.h"
				#include "flang/Optimizer/Dialect/FIROps.h"
				#include "flang/Optimizer/Dialect/FIRType.h"

				/// FIR codegen dialect constructor.
				fir::FIRCodeGenDialect::FIRCodeGenDialect(mlir::MLIRContext *ctx)
				: mlir::Dialect("fircg", ctx, mlir::TypeID::get<FIRCodeGenDialect>()) {
				addOperations<
				#define GET_OP_LIST
				#include "flang/Optimizer/CodeGen/CGOps.cpp.inc"
				>();
				}

				// anchor the class vtable to this compilation unit
				fir::FIRCodeGenDialect::~FIRCodeGenDialect() {
				// do nothing
				}

				#define GET_OP_CLASSES
				#include "flang/Optimizer/CodeGen/CGOps.cpp.inc"

				unsigned fir::cg::XEmboxOp::getOutRank() {
				if (slice().empty())
				return getRank();
				auto outRank = fir::SliceOp::getOutputRank(slice());
				assert(outRank >= 1);
				return outRank;
				}

				unsigned fir::cg::XReboxOp::getOutRank() {
				if (auto seqTy =
				fir::dyn_cast_ptrOrBoxEleTy(getType()).dyn_cast<fir::SequenceType>())
				return seqTy.getDimension();
				return 0;
				}

				unsigned fir::cg::XReboxOp::getRank() {
				if (auto seqTy = fir::dyn_cast_ptrOrBoxEleTy(box().getType())
				.dyn_cast<fir::SequenceType>())
				return seqTy.getDimension();
				return 0;
				}

				unsigned fir::cg::XArrayCoorOp::getRank() {
				auto memrefTy = memref().getType();
				if (memrefTy.isa<fir::BoxType>())
				if (auto seqty =
				fir::dyn_cast_ptrOrBoxEleTy(memrefTy).dyn_cast<fir::SequenceType>())
				return seqty.getDimension();
				return shape().size();
				}
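
The empty out-of-line destructor in CGOps.cpp ("anchor the class vtable to this compilation unit") follows a common LLVM idiom: when a class's first non-inline virtual function is defined in exactly one source file, compilers following the Itanium C++ ABI emit the vtable only there instead of in every file that includes the header. A minimal sketch of the idiom, collapsed into one file for illustration; `ToyDialectBase` and `ToyCodeGenDialect` are hypothetical names, not the MLIR classes.

```cpp
#include <cassert>
#include <memory>
#include <string>

// Hypothetical base class standing in for mlir::Dialect.
struct ToyDialectBase {
  virtual ~ToyDialectBase() = default;
  virtual std::string getNamespace() const = 0;
};

// "Header" part: the destructor is declared here but not defined inline.
class ToyCodeGenDialect final : public ToyDialectBase {
public:
  ~ToyCodeGenDialect() override;
  std::string getNamespace() const override { return "fircg"; }
};

// "Source file" part: this out-of-line definition is the anchor. It does
// nothing at run time; it only pins the vtable to this translation unit.
ToyCodeGenDialect::~ToyCodeGenDialect() {}

// Usage: destroying through the base pointer runs the anchored destructor.
inline std::string namespaceViaBase() {
  std::unique_ptr<ToyDialectBase> dialect(new ToyCodeGenDialect());
  return dialect->getNamespace();
}
```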

flang/lib/Optimizer/CodeGen/PassDetail.h

This file was added.

				//===- PassDetail.h - Optimizer code gen Pass class details ------ C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef OPTMIZER_CODEGEN_PASSDETAIL_H
				#define OPTMIZER_CODEGEN_PASSDETAIL_H

				#include "flang/Optimizer/Dialect/FIRDialect.h"
				#include "mlir/Dialect/LLVMIR/LLVMDialect.h"
				#include "mlir/Dialect/OpenMP/OpenMPDialect.h"
				#include "mlir/IR/BuiltinDialect.h"
				#include "mlir/Pass/Pass.h"
				#include "mlir/Pass/PassRegistry.h"

				namespace fir {

				#define GEN_PASS_CLASSES
				#include "flang/Optimizer/CodeGen/CGPasses.h.inc"

				} // namespace fir

				#endif // OPTMIZER_CODEGEN_PASSDETAIL_H

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp

This file was added.

				//===-- PreCGRewrite.cpp --------------------------------------------------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// Coding style: https://mlir.llvm.org/getting_started/DeveloperGuide/
				//
				//===----------------------------------------------------------------------===//

				#include "CGOps.h"
				#include "PassDetail.h"
				#include "flang/Optimizer/CodeGen/CodeGen.h"
				#include "flang/Optimizer/Dialect/FIRDialect.h"
				#include "flang/Optimizer/Dialect/FIROps.h"
				#include "flang/Optimizer/Dialect/FIRType.h"
				#include "flang/Optimizer/Support/FIRContext.h"
				#include "mlir/Transforms/DialectConversion.h"
				#include "llvm/ADT/STLExtras.h"

				//===----------------------------------------------------------------------===//
				// Codegen rewrite: rewriting of subgraphs of ops
				//===----------------------------------------------------------------------===//

				using namespace fir;

				#define DEBUG_TYPE "flang-codegen-rewrite"

				static void populateShape(llvm::SmallVectorImpl<mlir::Value> &vec,
				ShapeOp shape) {
				vec.append(shape.extents().begin(), shape.extents().end());
				}

				// Operands of fir.shape_shift split into two vectors.
				static void populateShapeAndShift(llvm::SmallVectorImpl<mlir::Value> &shapeVec,
				llvm::SmallVectorImpl<mlir::Value> &shiftVec,
				ShapeShiftOp shift) {
				auto endIter = shift.pairs().end();
				for (auto i = shift.pairs().begin(); i != endIter;) {
				shiftVec.push_back(*i++);
				shapeVec.push_back(*i++);
				}
				}

				static void populateShift(llvm::SmallVectorImpl<mlir::Value> &vec,
				ShiftOp shift) {
				vec.append(shift.origins().begin(), shift.origins().end());
				}

				namespace {

				/// Convert fir.embox to the extended form where necessary.
				///
				mehdi_amini: Is the notion of "extended form" for embox documented anywhere? If not, can you expand the doc here to describe what it is? (`rewriteStaticShape` and `rewriteDynamicShape` aren't documented, but that may not be necessary with a longer description for the pattern here, ideally with example snippets.) The same applies to all patterns.
				/// The embox operation can take arguments that specify multidimensional array
				/// properties at runtime. These properties may be shared between distinct
				/// objects that have the same properties. Before we lower these small DAGs to
				/// LLVM-IR, we gather all the information into a single extended operation. For
				/// example,
				/// ```
				/// %1 = fir.shape_shift %4, %5 : (index, index) -> !fir.shapeshift<1>
				/// %2 = fir.slice %6, %7, %8 : (index, index, index) -> !fir.slice<1>
				/// %3 = fir.embox %0 (%1) [%2] : (!fir.ref<!fir.array<?xi32>>, !fir.shapeshift<1>, !fir.slice<1>) -> !fir.box<!fir.array<?xi32>>
				/// ```
				/// can be rewritten as
				/// ```
				/// %1 = fircg.ext_embox %0(%5) origin %4[%6, %7, %8] : (!fir.ref<!fir.array<?xi32>>, index, index, index, index, index) -> !fir.box<!fir.array<?xi32>>
				/// ```
				class EmboxConversion : public mlir::OpRewritePattern<EmboxOp> {
				public:
				using OpRewritePattern::OpRewritePattern;

				mlir::LogicalResult
				matchAndRewrite(EmboxOp embox,
				mlir::PatternRewriter &rewriter) const override {
				auto shapeVal = embox.getShape();
				// If the embox does not include a shape, then do not convert it
				if (shapeVal)
				return rewriteDynamicShape(embox, rewriter, shapeVal);
				if (auto boxTy = embox.getType().dyn_cast<BoxType>())
				if (auto seqTy = boxTy.getEleTy().dyn_cast<SequenceType>())
				if (seqTy.hasConstantShape())
				return rewriteStaticShape(embox, rewriter, seqTy);
				return mlir::failure();
				}

				mlir::LogicalResult rewriteStaticShape(EmboxOp embox,
				mlir::PatternRewriter &rewriter,
				SequenceType seqTy) const {
				auto loc = embox.getLoc();
				llvm::SmallVector<mlir::Value> shapeOpers;
				auto idxTy = rewriter.getIndexType();
				for (auto ext : seqTy.getShape()) {
				auto iAttr = rewriter.getIndexAttr(ext);
				auto extVal = rewriter.create<mlir::ConstantOp>(loc, idxTy, iAttr);
				shapeOpers.push_back(extVal);
				}
				auto xbox = rewriter.create<cg::XEmboxOp>(
				loc, embox.getType(), embox.memref(), shapeOpers, llvm::None,
				llvm::None, llvm::None, embox.lenParams());
				LLVM_DEBUG(llvm::dbgs() << "rewriting " << embox << " to " << xbox << '\n');
				rewriter.replaceOp(embox, xbox.getOperation()->getResults());
				return mlir::success();
				}

				mlir::LogicalResult rewriteDynamicShape(EmboxOp embox,
				mlir::PatternRewriter &rewriter,
				mlir::Value shapeVal) const {
				auto loc = embox.getLoc();
				auto shapeOp = dyn_cast<ShapeOp>(shapeVal.getDefiningOp());
				llvm::SmallVector<mlir::Value> shapeOpers;
				llvm::SmallVector<mlir::Value> shiftOpers;
				if (shapeOp) {
				populateShape(shapeOpers, shapeOp);
				} else {
				auto shiftOp = dyn_cast<ShapeShiftOp>(shapeVal.getDefiningOp());
				assert(shiftOp && "shape is neither fir.shape nor fir.shape_shift");
				populateShapeAndShift(shapeOpers, shiftOpers, shiftOp);
				}
				llvm::SmallVector<mlir::Value> sliceOpers;
				llvm::SmallVector<mlir::Value> subcompOpers;
				if (auto s = embox.getSlice())
				if (auto sliceOp = dyn_cast_or_null<SliceOp>(s.getDefiningOp())) {
				sliceOpers.append(sliceOp.triples().begin(), sliceOp.triples().end());
				subcompOpers.append(sliceOp.fields().begin(), sliceOp.fields().end());
				}
				auto xbox = rewriter.create<cg::XEmboxOp>(
				loc, embox.getType(), embox.memref(), shapeOpers, shiftOpers,
				sliceOpers, subcompOpers, embox.lenParams());
				LLVM_DEBUG(llvm::dbgs() << "rewriting " << embox << " to " << xbox << '\n');
				rewriter.replaceOp(embox, xbox.getOperation()->getResults());
				return mlir::success();
				mehdi_amini: Nit: you can omit the `, 8` everywhere; `SmallVector` now computes a default (unless you know that 8 is better than the default in this particular case, of course).
				schweitz: Thanks.
				}
				};

				/// Convert fir.rebox to the extended form where necessary.
				///
				/// For example,
				/// ```
				/// %5 = fir.rebox %3(%1) : (!fir.box<!fir.array<?xi32>>, !fir.shapeshift<1>) -> !fir.box<!fir.array<?xi32>>
				/// ```
				/// converted to
				/// ```
				/// %5 = fircg.ext_rebox %3(%13) origin %12 : (!fir.box<!fir.array<?xi32>>, index, index) -> !fir.box<!fir.array<?xi32>>
				/// ```
				class ReboxConversion : public mlir::OpRewritePattern<ReboxOp> {
				public:
				using OpRewritePattern::OpRewritePattern;

				mlir::LogicalResult
				matchAndRewrite(ReboxOp rebox,
				mlir::PatternRewriter &rewriter) const override {
				auto loc = rebox.getLoc();
				llvm::SmallVector<mlir::Value> shapeOpers;
				llvm::SmallVector<mlir::Value> shiftOpers;
				if (auto shapeVal = rebox.shape()) {
				if (auto shapeOp = dyn_cast<ShapeOp>(shapeVal.getDefiningOp()))
				populateShape(shapeOpers, shapeOp);
				else if (auto shiftOp = dyn_cast<ShapeShiftOp>(shapeVal.getDefiningOp()))
				populateShapeAndShift(shapeOpers, shiftOpers, shiftOp);
				else if (auto shiftOp = dyn_cast<ShiftOp>(shapeVal.getDefiningOp()))
				populateShift(shiftOpers, shiftOp);
				else
				return mlir::failure();
				}
				llvm::SmallVector<mlir::Value> sliceOpers;
				llvm::SmallVector<mlir::Value> subcompOpers;
				if (auto s = rebox.slice())
				if (auto sliceOp = dyn_cast_or_null<SliceOp>(s.getDefiningOp())) {
				sliceOpers.append(sliceOp.triples().begin(), sliceOp.triples().end());
				subcompOpers.append(sliceOp.fields().begin(), sliceOp.fields().end());
				}

				auto xRebox = rewriter.create<cg::XReboxOp>(
				loc, rebox.getType(), rebox.box(), shapeOpers, shiftOpers, sliceOpers,
				subcompOpers);
				LLVM_DEBUG(llvm::dbgs()
				<< "rewriting " << rebox << " to " << xRebox << '\n');
				rewriter.replaceOp(rebox, xRebox.getOperation()->getResults());
				return mlir::success();
				}
				};

				/// Convert all fir.array_coor to the extended form.
				///
				/// For example,
				/// ```
				/// %4 = fir.array_coor %addr (%1) [%2] %0 : (!fir.ref<!fir.array<?xi32>>, !fir.shapeshift<1>, !fir.slice<1>, index) -> !fir.ref<i32>
				/// ```
				/// converted to
				/// ```
				/// %40 = fircg.ext_array_coor %addr(%9) origin %8[%4, %5, %6<%39> : (!fir.ref<!fir.array<?xi32>>, index, index, index, index, index, index) -> !fir.ref<i32>
				/// ```
				class ArrayCoorConversion : public mlir::OpRewritePattern<ArrayCoorOp> {
				public:
				using OpRewritePattern::OpRewritePattern;

				mlir::LogicalResult
				matchAndRewrite(ArrayCoorOp arrCoor,
				mehdi_amini: Is this comment up-to-date?
				schweitz: Removed.
				mlir::PatternRewriter &rewriter) const override {
				auto loc = arrCoor.getLoc();
				llvm::SmallVector<mlir::Value> shapeOpers;
				llvm::SmallVector<mlir::Value> shiftOpers;
				if (auto shapeVal = arrCoor.shape()) {
				if (auto shapeOp = dyn_cast<ShapeOp>(shapeVal.getDefiningOp()))
				populateShape(shapeOpers, shapeOp);
				else if (auto shiftOp = dyn_cast<ShapeShiftOp>(shapeVal.getDefiningOp()))
				populateShapeAndShift(shapeOpers, shiftOpers, shiftOp);
				else if (auto shiftOp = dyn_cast<ShiftOp>(shapeVal.getDefiningOp()))
				populateShift(shiftOpers, shiftOp);
				else
				return mlir::failure();
				}
				llvm::SmallVector<mlir::Value> sliceOpers;
				llvm::SmallVector<mlir::Value> subcompOpers;
				if (auto s = arrCoor.slice())
				if (auto sliceOp = dyn_cast_or_null<SliceOp>(s.getDefiningOp())) {
				sliceOpers.append(sliceOp.triples().begin(), sliceOp.triples().end());
				subcompOpers.append(sliceOp.fields().begin(), sliceOp.fields().end());
				}
				auto xArrCoor = rewriter.create<cg::XArrayCoorOp>(
				loc, arrCoor.getType(), arrCoor.memref(), shapeOpers, shiftOpers,
				sliceOpers, subcompOpers, arrCoor.indices(), arrCoor.lenParams());
				LLVM_DEBUG(llvm::dbgs()
				<< "rewriting " << arrCoor << " to " << xArrCoor << '\n');
				rewriter.replaceOp(arrCoor, xArrCoor.getOperation()->getResults());
				return mlir::success();
				}
				};

				class CodeGenRewrite : public CodeGenRewriteBase<CodeGenRewrite> {
				public:
				void runOnOperation() override final {
				auto op = getOperation();
				mehdi_amini: By making this a module pass, we're losing out on parallelism. Can you make it an operation pass and filter here instead? `void runOnOperation() { Operation op = getOperation(); if (auto func = op->dyn_cast<mlir::FuncOp>()) runOn(func, func.getBody()); if (auto global : op->dyn_cast<fir::GlobalOp>()) runOn(global, global.getRegion()); }`
				schweitz: It might be possible, but a quick experiment showed a bunch of tests regressing.
				mehdi_amini: What kind of regressions?
				mehdi_amini: Actually, in case I wasn't clear, the snippet I wrote above requires scheduling the pass twice in a pipeline; the test would look like this: `fir-opt --pass-pipeline="func(cg-rewrite),fir.global(cg-rewrite)"`. If you can provide an .mlir test where this regresses, I'd be happy to take a look.
				schweitz: I spent some time working to make this pass multithreaded, including converting data structures to be thread local, etc. We have approximately 800 basic tests and many thousands of Fortran tests. The attempted changes caused many of these tests to regress and start failing. The existing pass accomplishes its task as is. There is no data-based argument that the pass is a bottleneck. The code is being upstreamed in a series of many patches, and there will be ample opportunity to write and debug a parallel algorithm in subsequent patches. Finally, anyone who wants to can contribute improvements on fir-dev.
				mehdi_amini: > The code is being upstreamed in a series of many patches, and there will be ample opportunity to write and debug a parallel algorithm in subsequent patches. The problem is that I could your pass right after you land it, but that will create issues for you downstream, and right now it seems you can't show upstream why it is an issue with a test for this pass. That seems problematic to me for upstream development right now. The "FuncOp Pass" vs "Module Pass" question is more than just "waiting for a bottleneck": it is about system design as a whole. The fact that you bring up thread_local here seems fishy and makes me worried about what kind of skeletons we'll find down the road. It is also, in our experience, almost impossible to come back and fix this later after it creeps everywhere in the compiler. We have a good example of this: LLVM itself. I tried for a while various approaches to revisit some of the LLVM internals, but this is ingrained too far by now. MLIR was designed with more care to avoid any global state and to produce "crash reproducers" that can be as hermetic as possible, for example. It would be quite sad to me to give up on all this so early and without very strong justification.
				auto &context = getContext();
				mlir::OpBuilder rewriter(&context);
				mlir::ConversionTarget target(context);
				target.addLegalDialect<FIROpsDialect, FIRCodeGenDialect,
				mlir::StandardOpsDialect>();
				target.addIllegalOp<ArrayCoorOp>();
				mehdi_amini: This test looks spurious? The loop inside the body would not execute if there are no regions, right?
				schweitz: Removed.
				target.addIllegalOp<ReboxOp>();
				target.addDynamicallyLegalOp<EmboxOp>([](EmboxOp embox) {
				return !(embox.getShape() ||
				embox.getType().cast<BoxType>().getEleTy().isa<SequenceType>());
				});
				mlir::OwningRewritePatternList patterns;
				patterns.insert<EmboxConversion, ArrayCoorConversion, ReboxConversion>(
				&context);
				mehdi_amini: I don't quite get the delayed erasing; this whole simplification could be done without keeping state in a vector: `region.walk([] (Operation op) { if (auto embox = dyn_cast<EmboxOp>(op)) { if (embox.getShape()) { op->erase(); return; } } }` (note that `op->erase()` should already assert for `op->use_empty()`, no need to duplicate here). But also, could you do it more simply: `region.walk([] (Operation op) { if (isOpTriviallyDead(op)) op->erase(); }`?
				schweitz: This should be a very general DCE like in mlir Transforms/CSE.cpp. That seems to have been mangled but can be brought back.
				mehdi_amini: Can you just call this instead? https://github.com/llvm/llvm-project/blob/main/mlir/include/mlir/Transforms/RegionUtils.h#L57 In general, generic algorithms in passes like this end up being replicated everywhere when they could be exposed as general utilities. If `simplifyRegions` does not fit the bill here, can we introduce another utility there?
				schweitz: Possibly. For the time being, we prefer to keep the simple dead-code cleanup.
				mehdi_amini: I'm fine if you don't want to call `simplifyRegions`, but then can you refactor this as a pre-patch into a utility in a common place to avoid code duplication? I'm not enthusiastic about seeing each pass reimplement a generic utility or algorithm, and I don't really see a reason why we should at all. This has community-wide and project-wide implications as well: the CIRCT folks were reporting performance issues with this kind of code path just a few days ago (https://llvm.discourse.group/t/speeding-up-canonicalize/3015), and we were able to work together to improve the general tooling and infra around this.
				kiranchandramohan: Is the suggestion here to move the following DCE code to a file inside the mlir directory tree? `// Clean up the region. void simplifyRegion(mlir::Region &region) { for (auto &block : region.getBlocks()) for (auto &op : block.getOperations()) { for (auto &reg : op.getRegions()) simplifyRegion(reg); maybeEraseOp(&op); } doDCE(); } /// Run a simple DCE cleanup to remove any dead code after the rewrites. void doDCE() { std::vector<mlir::Operation *> workList; workList.swap(opsToErase); while (!workList.empty()) { for (auto op : workList) { std::vector<mlir::Value> opOperands(op->operand_begin(), op->operand_end()); LLVM_DEBUG(llvm::dbgs() << "DCE on " << op << '\n'); ++numDCE; op->erase(); for (auto opnd : opOperands) maybeEraseOp(opnd.getDefiningOp()); } workList.clear(); workList.swap(opsToErase); } } void maybeEraseOp(mlir::Operation *op) { if (!op) return; if (op->hasTrait<mlir::OpTrait::IsTerminator>()) return; if (mlir::isOpTriviallyDead(op)) opsToErase.push_back(op); }`
				mehdi_amini: Yes, this isn't more involved than what you're mentioning. I'll be happy to help with this kind of thing; I just don't know how to proceed at the moment. The DCE refactoring is trivial. I'm more concerned about the testing problem with the pass and the inability to have IR tests upstream when there are issues.
				schweitz: The original suggestion was to make this pass not work on ModuleOp so that it would allow multithreading. That suggestion did not contain enough information. The code snippet does not compile. Fixing the compilation led to the test included in this patch failing. Subsequently, an incorrect guess was made that appeared to fix the test in this patch. However, the guessed-at solution was wrong, as other tests that have not been merged regressed. This was all done in an effort to follow your original suggestion. Clearly, it is just not possible to merge tests that require code that has not been merged.
				mehdi_amini: I am not sure what you're trying to say here. Would you like me to send a patch that can be applied on top of this one and changes this pass in a way that works with the test in-tree?
				mehdi_amini: Here is the patch: https://reviews.llvm.org/D99132 for this part of the discussion.
				mehdi_amini: Seems like you updated to use this version of the patch. I'm sorry it took so long to get there for this revision. If something isn't clear in any of my comments, feel free to ask me to clarify, and I'm always happy to send a patch when it helps!
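
The worklist cleanup debated in this thread (erase a trivially dead op, then re-check the ops that defined its operands) can be illustrated without MLIR. The sketch below is a toy model under stated assumptions: `ToyOp`, `computeUses`, `isTriviallyDead`, and `runToyDCE` are hypothetical names, ops are addressed by index rather than by pointer, and "trivially dead" is reduced to "no uses and no side effects". It is not the code from this patch or from D99132.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy stand-in for an operation; this is NOT mlir::Operation.
struct ToyOp {
  std::vector<std::size_t> operands; // index of the op defining each operand
  bool hasSideEffects = false;       // e.g. a terminator or a store
  unsigned numUses = 0;              // operand slots that reference this op
  bool erased = false;
};

// Recompute use counts from the operand lists.
static void computeUses(std::vector<ToyOp> &ops) {
  for (ToyOp &op : ops)
    op.numUses = 0;
  for (const ToyOp &op : ops)
    if (!op.erased)
      for (std::size_t def : op.operands)
        ++ops[def].numUses;
}

// Simplified notion of "trivially dead": unused and free of side effects.
static bool isTriviallyDead(const ToyOp &op) {
  return !op.erased && !op.hasSideEffects && op.numUses == 0;
}

// Erase dead ops, then revisit the definers of their operands, which may
// have become dead in turn. Returns the number of erased ops.
static unsigned runToyDCE(std::vector<ToyOp> &ops) {
  computeUses(ops);
  std::vector<std::size_t> workList;
  for (std::size_t i = 0; i < ops.size(); ++i)
    if (isTriviallyDead(ops[i]))
      workList.push_back(i);

  unsigned numErased = 0;
  while (!workList.empty()) {
    std::size_t idx = workList.back();
    workList.pop_back();
    if (!isTriviallyDead(ops[idx]))
      continue; // already handled
    ops[idx].erased = true;
    ++numErased;
    for (std::size_t def : ops[idx].operands) {
      --ops[def].numUses;
      if (isTriviallyDead(ops[def]))
        workList.push_back(def);
    }
  }
  return numErased;
}
```

For instance, model the state after `fir.embox` has been rewritten: op 0 is the constant, op 1 the now-unused `fir.shape_shift` (operands 0, 0), op 2 the now-unused `fir.slice` (operands 0, 0, 0), op 3 the extended embox (five operands, all op 0), and op 4 a side-effecting user of op 3. Running `runToyDCE` erases ops 1 and 2 and keeps the rest, because the constant is still used by op 3.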
				if (mlir::failed(
				mlir::applyPartialConversion(op, target, std::move(patterns)))) {
				mlir::emitError(mlir::UnknownLoc::get(&context),
				"error in running the pre-codegen conversions");
				signalPassFailure();
				}
				}
				};

				} // namespace

				std::unique_ptr<mlir::Pass> fir::createFirCodeGenRewritePass() {
				return std::make_unique<CodeGenRewrite>();
				}
				SouraVX: NIT: These comments don't make sense here? They must have gone in while preparing the patch? Could you please update them accordingly.
				schweitz: Removed. The pass is now defined in a tablegen file.
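
The `populateShapeAndShift` helper earlier in this file walks the operands of `fir.shape_shift`, which are interleaved as (lower bound, extent) pairs, and splits them into a shift vector and a shape vector. That is why `fir.shape_shift %4, %5` becomes `(%5) origin %4` in the extended ops. A minimal stand-alone sketch of the same split, using plain integers in place of `mlir::Value`; `splitShapeShift` is a hypothetical name used only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Split an interleaved [lb0, ext0, lb1, ext1, ...] list.
// Returns {extents, lowerBounds}, mirroring shapeVec and shiftVec.
static std::pair<std::vector<int>, std::vector<int>>
splitShapeShift(const std::vector<int> &pairs) {
  assert(pairs.size() % 2 == 0 && "expected (lower bound, extent) pairs");
  std::vector<int> extents;
  std::vector<int> lowerBounds;
  for (std::size_t i = 0; i < pairs.size(); i += 2) {
    lowerBounds.push_back(pairs[i]);  // first element of the pair: the shift
    extents.push_back(pairs[i + 1]);  // second element of the pair: the shape
  }
  return {extents, lowerBounds};
}
```

For a rank-2 array with bounds `1:10` and `0:19`, the interleaved input would be `{1, 10, 0, 20}`, giving extents `{10, 20}` and lower bounds `{1, 0}`.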

flang/test/Fir/cg-ops.fir

This file was added.

				// RUN: fir-opt --pass-pipeline="func(cg-rewrite),fir.global(cg-rewrite),cse" %s | FileCheck %s

				// CHECK-LABEL: func @codegen(
				// CHECK-SAME: %[[arg:.*]]: !fir
				func @codegen(%addr : !fir.ref<!fir.array<?xi32>>) {
				// CHECK: %[[zero:.*]] = constant 0 : index
				%0 = constant 0 : index
				%1 = fir.shape_shift %0, %0 : (index, index) -> !fir.shapeshift<1>
				%2 = fir.slice %0, %0, %0 : (index, index, index) -> !fir.slice<1>
				// CHECK: %[[box:.*]] = fircg.ext_embox %[[arg]](%[[zero]]) origin %[[zero]][%[[zero]], %[[zero]], %[[zero]]] : (!fir.ref<!fir.array<?xi32>>, index, index, index, index, index) -> !fir.box<!fir.array<?xi32>>
				%3 = fir.embox %addr (%1) [%2] : (!fir.ref<!fir.array<?xi32>>, !fir.shapeshift<1>, !fir.slice<1>) -> !fir.box<!fir.array<?xi32>>
				// CHECK: fircg.ext_array_coor %[[arg]](%[[zero]]) origin %[[zero]][%[[zero]], %[[zero]], %[[zero]]]<%[[zero]]> : (!fir.ref<!fir.array<?xi32>>, index, index, index, index, index, index) -> !fir.ref<i32>
				%4 = fir.array_coor %addr (%1) [%2] %0 : (!fir.ref<!fir.array<?xi32>>, !fir.shapeshift<1>, !fir.slice<1>, index) -> !fir.ref<i32>
				// CHECK: fircg.ext_rebox %[[box]](%[[zero]]) origin %[[zero]] : (!fir.box<!fir.array<?xi32>>, index, index) -> !fir.box<!fir.array<?xi32>>
				%5 = fir.rebox %3(%1) : (!fir.box<!fir.array<?xi32>>, !fir.shapeshift<1>) -> !fir.box<!fir.array<?xi32>>
				return
				}

				mehdi_amini (Done): Your test does not have any global op?
				// CHECK-LABEL: fir.global @box_global
				fir.global @box_global : !fir.box<!fir.array<?xi32>> {
				// CHECK: %[[arr:.*]] = fir.zero_bits !fir.ref
				%arr = fir.zero_bits !fir.ref<!fir.array<?xi32>>
				// CHECK: %[[zero:.*]] = constant 0 : index
				%0 = constant 0 : index
				%1 = fir.shape_shift %0, %0 : (index, index) -> !fir.shapeshift<1>
				%2 = fir.slice %0, %0, %0 : (index, index, index) -> !fir.slice<1>
				// CHECK: fircg.ext_embox %[[arr]](%[[zero]]) origin %[[zero]][%[[zero]], %[[zero]], %[[zero]]] : (!fir.ref<!fir.array<?xi32>>, index, index, index, index, index) -> !fir.box<!fir.array<?xi32>>
				%3 = fir.embox %arr (%1) [%2] : (!fir.ref<!fir.array<?xi32>>, !fir.shapeshift<1>, !fir.slice<1>) -> !fir.box<!fir.array<?xi32>>
				fir.has_value %3 : !fir.box<!fir.array<?xi32>>
				}
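
The CHECK lines in this test show `fir.embox` with a `!fir.shapeshift<1>` and a `!fir.slice<1>` operand becoming a single `fircg.ext_embox` with six operands: the memory reference plus five `index` values. A minimal sketch of that operand flattening follows. It is a toy model, not flang's implementation: `Value`, `ShapeShift`, `Slice`, `ExtEmbox` and `flattenEmbox` are hypothetical names, and it assumes that `fir.shape_shift` carries one (lower bound, extent) pair per dimension and that `fir.slice` carries one triple per dimension.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for an SSA value; this is NOT a flang or MLIR type.
using Value = std::string;

// Models fir.shape_shift: one (lower bound, extent) pair per dimension
// (assumed operand layout).
struct ShapeShift {
  std::vector<Value> lowerBounds;
  std::vector<Value> extents;
};

// Models fir.slice: one triple per dimension, stored flat (3 * rank values).
struct Slice {
  std::vector<Value> triples;
};

// Models the flattened operand list of the extended op.
struct ExtEmbox {
  Value memref;
  std::vector<Value> shape; // printed in parentheses
  std::vector<Value> shift; // printed after "origin"
  std::vector<Value> slice; // printed in square brackets

  std::size_t numOperands() const {
    return 1 + shape.size() + shift.size() + slice.size();
  }
};

// Fold the shape_shift and slice operands directly into the extended op,
// so the intermediate shape/slice values are no longer needed.
ExtEmbox flattenEmbox(const Value &memref, const ShapeShift &ss,
                      const Slice &sl) {
  assert(ss.lowerBounds.size() == ss.extents.size());
  assert(sl.triples.size() == 3 * ss.extents.size());
  return ExtEmbox{memref, ss.extents, ss.lowerBounds, sl.triples};
}
```

For the rank-1 case in the test, this gives 1 + 1 + 1 + 3 = 6 operands, matching the `(!fir.ref<!fir.array<?xi32>>, index, index, index, index, index)` signature in the CHECK lines.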

flang/tools/fir-opt/fir-opt.cpp

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "mlir/Support/MlirOptMain.h"			#include "mlir/Support/MlirOptMain.h"
	#include "flang/Optimizer/Support/InitFIR.h"			#include "flang/Optimizer/Support/InitFIR.h"

	using namespace mlir;			using namespace mlir;

	int main(int argc, char **argv) {			int main(int argc, char **argv) {
	fir::support::registerFIRPasses();			fir::support::registerMLIRPassesForFortranTools();
	DialectRegistry registry;			DialectRegistry registry;
				mehdi_amini (Not Done): What is the intended difference between `registerFIRPasses` and `registerOptPasses`?
				schweitz (Author, Done): Fixed.
				mehdi_amini (Not Done): It is still unclear to me what the intent is here: `registerOptimizerPasses` is not documented, and `registerMLIRPassesForFortranTools` says "Register the standard passes we use", which suggests it should already cover all the required passes. Why do you introduce `registerOptimizerPasses` at all?
				schweitz (Author, Done): Upstreaming the code is being done in a series of many patches. I have chosen to group code in ways that make the upstreaming process more manageable. For example, the MLIR registration interfaces have been changing, and we want those calls isolated until they can be upstreamed. Furthermore, link times are already expensive, so we have made efforts to reduce that impact by selecting only the specific libraries required from MLIR. For these reasons, I see no need to change this now. There will be opportunities to regroup registration calls in subsequent patches.
				mehdi_amini (Not Done): Can you clarify how this is impacting link time here? Please introduce the APIs when they make sense in-tree! Otherwise I have no way to figure out whether there is a good reason for what you send for review. I don't know what state your downstream project is in, but I'm concerned about the restrictions it is currently placing on the code structure and organization. For example, the regressions you mentioned that you spot downstream but that can't be reproduced upstream with a lit test.
	fir::support::registerDialects(registry);			fir::support::registerDialects(registry);
	mehdi_amini (Not Done): Why this change?
	schweitz (Author, Done): A sync problem with the source.
	return failed(MlirOptMain(argc, argv, "FIR modular optimizer driver\n",			return failed(MlirOptMain(argc, argv, "FIR modular optimizer driver\n",
				mehdi_amini (Done): This does not seem needed.
	registry, /*preloadDialectsInContext*/ false));			registry, /*preloadDialectsInContext=*/false));
	}			}

flang/tools/tco/tco.cpp

compileFIR(const mlir::PassPipelineCLParser &passPipeline) {

// pass manager failed		// pass manager failed
printModuleBody(*owningRef, errs());		printModuleBody(*owningRef, errs());
errs() << "\n\nFAILED: " << inputFilename << '\n';		errs() << "\n\nFAILED: " << inputFilename << '\n';
return mlir::failure();		return mlir::failure();
}		}

int main(int argc, char **argv) {		int main(int argc, char **argv) {
fir::support::registerFIRPasses();		fir::support::registerMLIRPassesForFortranTools();
[[maybe_unused]] InitLLVM y(argc, argv);		[[maybe_unused]] InitLLVM y(argc, argv);
mlir::registerPassManagerCLOptions();		mlir::registerPassManagerCLOptions();
mlir::PassPipelineCLParser passPipe("", "Compiler passes to run");		mlir::PassPipelineCLParser passPipe("", "Compiler passes to run");
cl::ParseCommandLineOptions(argc, argv, "Tilikum Crossing Optimizer\n");		cl::ParseCommandLineOptions(argc, argv, "Tilikum Crossing Optimizer\n");
return mlir::failed(compileFIR(passPipe));		return mlir::failed(compileFIR(passPipe));
}		}

Revision Contents

Diff 333191

flang/include/flang/Optimizer/CodeGen/CGOps.td

flang/include/flang/Optimizer/CodeGen/CGPasses.td

flang/include/flang/Optimizer/CodeGen/CMakeLists.txt

flang/include/flang/Optimizer/Dialect/FIRDialect.h

flang/include/flang/Optimizer/Support/InitFIR.h

flang/lib/Optimizer/CMakeLists.txt

flang/lib/Optimizer/CodeGen/CGOps.h

flang/lib/Optimizer/CodeGen/CGOps.cpp

flang/lib/Optimizer/CodeGen/PassDetail.h

flang/lib/Optimizer/CodeGen/PreCGRewrite.cpp

flang/test/Fir/cg-ops.fir

flang/tools/fir-opt/fir-opt.cpp

flang/tools/tco/tco.cpp
