This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
flang/
-
lib/Frontend/
-
Frontend/
1/1
FrontendActions.cpp
-
test/
-
Driver/
-
mlir-debug-pass-pipeline.f90
-
mlir-pass-pipeline.f90
-
Lower/OpenMP/
-
OpenMP/
-
omp-declare-target-func-and-subr.f90
-
mlir/
-
include/mlir/
-
mlir/
-
Dialect/OpenMP/
-
OpenMP/
-
CMakeLists.txt
-
OpenMPPasses.h
-
OpenMPPasses.td
-
InitAllPasses.h
-
lib/Dialect/OpenMP/
-
Dialect/
-
OpenMP/
-
CMakeLists.txt
-
Transforms/
-
CMakeLists.txt
-
FilterDeviceHostFunctions.cpp
-
test/Dialect/OpenMP/
-
Dialect/
-
OpenMP/
-
filter-device-host-functions.mlir

Differential D147641

[Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device
ClosedPublic

Authored by skatrak on Apr 5 2023, 10:05 AM.

Download Raw Diff

Details

Reviewers

dpalermo
jsjodin
agozillon
domada
ftynse
jdoerfert
nicolasvasilache
kiranchandramohan
sscalpone
awarzynski

Commits

rGdebdfc0ae21b: [Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device

Summary

This patch adds support for selecting which functions are lowered to LLVM IR from MLIR depending on declare target information and whether host or device code is being generated.

The approach proposed by this patch is to perform the filtering in two stages:

An MLIR transformation pass, which is added to the Flang translation flow before the VerifierPass. The functions that are kept are those that match the OpenMP processor (host or device) the compiler invocation is targeting, according to the presence of the -fopenmp-is-device compiler option and declare target information. All functions contaning an omp.target are also kept, regardless of the declare target information of the function, due to the need for keeping target regions visible for both host and device compilation.
A filtering step during translation to LLVM IR, which is peformed for those functions that were kept because of the presence of a target region inside. If the targeted OpenMP processor does not match the declare target information of the function, then it is removed from the LLVM IR after its contents have been processed and translated. Since they should only contain an omp.target operation which, in turn, should have been outlined into another LLVM function, the wrapper can be deleted at that point.

Depends on D150328 and D150329.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

skatrak created this revision.Apr 5 2023, 10:05 AM

Herald added a reviewer: ftynse. · View Herald TranscriptApr 5 2023, 10:05 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: sunshaoce, Moerafaat, zero9178 and 25 others. · View Herald Transcript

skatrak requested review of this revision.Apr 5 2023, 10:05 AM

Herald added a reviewer: jdoerfert. · View Herald TranscriptApr 5 2023, 10:05 AM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: jplehr, sstefan1, stephenneuendorffer, nicolasvasilache. · View Herald Transcript

Harbormaster completed remote builds in B223836: Diff 511148.Apr 5 2023, 11:41 AM

Update and add test.

Herald added a subscriber: bviyer. · View Herald TranscriptApr 25 2023, 3:54 AM

Harbormaster completed remote builds in B227979: Diff 516735.Apr 25 2023, 4:40 AM

agozillon mentioned this in D146063: [Flang][OpenMP][MLIR] Add lowering from parse tree to MLIR support for Declare Target for functions, subroutines and global data.Apr 28 2023, 8:23 AM

Rebase and fix tests

Harbormaster completed remote builds in B229467: Diff 518778.May 2 2023, 10:56 AM

generating functions and then deleting them is costly and will likely not work. We need to not emit them in the first place.

In D147641#4313262, @jdoerfert wrote:

generating functions and then deleting them is costly and will likely not work. We need to not emit them in the first place.

I agree that we should strive to avoid doing unnecessary work rather than discarding it after the fact. However, the current implementation of target region outlining interacts with this in a way that prevents us from doing that in all cases. Let's say we implemented this function filtering patch as a semantic analysis pass:

We could forward the information from the frontend about whether we're compiling for the host or device (-fopenmp-is-device) to this new pass.
We could add the pass after the ImplicitDeclareTargetCapture pass implemented in D146063 or similar, so implicit declare target information is already propagated and available.
Using the knowledge about whether functions are only intended for the host, device or both, and what we're compiling for, we could remove from the parse tree those functions that won't need lowering.
For the host that would mean removing declare target device_type(nohost) functions, and for the device it would mean removing declare target device_type(host) and non-declare target functions.

However, the issues come when we consider the interaction of that approach with target regions:

Target regions (which should be lowered for the device) can appear inside of host functions.
Reverse-offload target regions (which should eventually be lowered for the host) can appear inside of device functions and target regions.
The first two points mean that, while generating host code, you must also look for target regions inside device functions and vice versa.
Target region outlining currently happens very late, at the MLIR to LLVM IR translation stage, during the lowering of their parent function, when the related omp.target MLIR operation is visited. D147172 was the main patch implementing this, and D147940 is a proposal to extend it for the device.
If we remove a function before that point due to being intended for another device, then there won't be any outlining of the target regions defined inside of it either.

I've got a couple of ideas about to handle this problem, but I'm not particularly convinced about any of them:

We could relax the requirement of the hypothetical semantics pass to avoid discarding too early functions that contain target regions, so that they can be outlined later. But then they will have to be pruned after translating them from MLIR to LLVM IR, like this patch is doing. We would be implementing the same kind of idea in two different places so that we could reduce the number of functions that we unnecessarily lower. Perhaps it would also be possible to reduce further the work done on these functions by only lowering the target regions inside (perhaps by hijacking ModuleTranslation::convertBlock), and not the rest of operations in them.
Another option could be to try to move part of the outlining work into a semantics pass as well, so that we wouldn't have to keep around the parent functions intended for the other device. We would have to do all of the used variable capturing there (which it's not currently implemented AFAIK), so that they can be part of the new function as well. The result of this would be that at the MLIR level there would be some compiler-generated functions with just an omp.target operation inside (not called from anywhere else), together with the rest of functions in the module that apply to the device we're compiling for (possibly containing other omp.target operations that should actually be implemented by the opposite device). We would have to mark these compiler-generated functions somehow to make sure that they are treated differently during MLIR to LLVM IR translation -- for compiler-generated functions, only the contents of the omp.target operation inside should be lowered, whereas for other functions everything should be lowered and any occurrences of that operation should become a kernel call or a reverse-offload call, ignoring the region attached to them.

The implementation in this patch, although not very efficient, is a very simple first approach that doesn't require significantly reworking different parts of offloading support. I'd like to know what you think about this, or if @jsjodin, @domada, @agozillon, @kiranchandramohan or others have any thoughts about it as well. It'd be good to reach some sort of consensus before committing to significant work.

I don't think moving the outlining early on is the best approach since we want this code to be shared with clang. Would it be sufficient to filter the functions that do not contain target regions after the analysis in https://reviews.llvm.org/D146063 somewhere in the front-end and let the MLIR->LLVM IR translation handle the functions that has target regions?

Ping @jdoerfert. Do you have any other thoughts different to what Jan's suggesting? Just looking to make sure we're in the same page to avoid redoing the work multiple times.

Rebase and update to depend on declare target work split into multiple patches.

Herald added a reviewer: nicolasvasilache. · View Herald TranscriptMay 18 2023, 7:56 AM

Herald added a reviewer: kiranchandramohan. · View Herald Transcript

skatrak edited the summary of this revision. (Show Details)May 18 2023, 7:57 AM

skatrak added parent revisions: D150328: [Flang][OpenMP][MLIR] Add declare target attribute set and interface for the OpenMP dialect, D150329: [Flang][OpenMP][MLIR] Add lowering from PFT to MLIR (FIR) for OpenMP declare target directive in Flang, D150323: [Flang][OpenMP] A semantic analysis pass for marking functions, subroutines and their interfaces as implicitly declare target when called inside of a declare target function or target region.

Harbormaster completed remote builds in B232870: Diff 523385.May 18 2023, 7:58 AM

skatrak removed a parent revision: D146063: [Flang][OpenMP][MLIR] Add lowering from parse tree to MLIR support for Declare Target for functions, subroutines and global data.May 18 2023, 8:00 AM

jsjodin added inline comments.Jun 6 2023, 7:50 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
1051 ↗	(On Diff #523385)	The maskedFunction information lives only inside this function. Can we make it in to a local variable and pass it in to convertOneFunction?

Ping again in case there any opinions against following Jan's suggestion before starting its implementation. In summary, the idea would be to:

Add a semantics pass to filter out functions that are not intended for the device we're compiling, according to implicit/explicit declare target information, except for those with a target region inside.
Keep also the approach in this first patch, so that the IR for the enclosing functions where target regions are located is deleted after the target regions themselves are outlined by the OpenMPIRBuilder.

I am thinking that there could be situations where this approach might break. Let's say there's a host-only subroutine f() called by a host-only subroutine g() that contains a target region as well (f() not called from inside the target region). Then, during compilation for the device, f() would be removed in semantics whereas g() would be kept due to it having a target region to be processed later during MLIR to LLVM IR translation. What would happen then with the call to the f() subroutine that would have become undefined?

Should we just replace the body of functions to be deleted in semantics by an empty body or a default 'return' statement instead? Or would it maybe be a preferable alternative to delete these functions altogether but also delete any calls to them? The first option could become difficult if these subroutines/functions return complex data types, and also we may end up generating many unused functions. The only issue I see with the second option is what would happen if function calls are used as part of an expression to initialize some data. I'm leaning towards just deleting all statements except variable declarations and target regions from functions intended for another device, which seems like solves all of the issues above.

Let me know if you have any more thoughts on this, and thank you for taking the time.

In D147641#4413305, @skatrak wrote:

Ping again in case there any opinions against following Jan's suggestion before starting its implementation. In summary, the idea would be to:

Add a semantics pass to filter out functions that are not intended for the device we're compiling, according to implicit/explicit declare target information, except for those with a target region inside.

Keep also the approach in this first patch, so that the IR for the enclosing functions where target regions are located is deleted after the target regions themselves are outlined by the OpenMPIRBuilder.

I am thinking that there could be situations where this approach might break. Let's say there's a host-only subroutine f() called by a host-only subroutine g() that contains a target region as well (f() not called from inside the target region). Then, during compilation for the device, f() would be removed in semantics whereas g() would be kept due to it having a target region to be processed later during MLIR to LLVM IR translation. What would happen then with the call to the f() subroutine that would have become undefined?

Should we just replace the body of functions to be deleted in semantics by an empty body or a default 'return' statement instead? Or would it maybe be a preferable alternative to delete these functions altogether but also delete any calls to them? The first option could become difficult if these subroutines/functions return complex data types, and also we may end up generating many unused functions. The only issue I see with the second option is what would happen if function calls are used as part of an expression to initialize some data. I'm leaning towards just deleting all statements except variable declarations and target regions from functions intended for another device, which seems like solves all of the issues above.

Let me know if you have any more thoughts on this, and thank you for taking the time.

Can we add declarations for them instead of having empty bodies?

In D147641#4417720, @jsjodin wrote:

Can we add declarations for them instead of having empty bodies?

Since semantics work at the PFT level, I've looked into how to define the Fortran-equivalent to forward declarations. It seems like external procedures could be one solution to this. So it should be possible to delete all function and subroutine definitions intended for another device and add an external declaration to all their callers. These calls and external procedure declarations will remain in MLIR but they will be removed in the MLIR to LLVM IR translation step.

Implement early function filtering as an MLIR pass.

Herald added a reviewer: sscalpone. · View Herald TranscriptJun 29 2023, 4:30 AM

Herald added a reviewer: awarzynski. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Harbormaster completed remote builds in B242034: Diff 535717.Jun 29 2023, 4:47 AM

skatrak edited the summary of this revision. (Show Details)Jun 29 2023, 5:43 AM

Update after splitting off pass infrastructure into D154194.

skatrak added a parent revision: D154194: [MLIR][OpenMP] Set up MLIR transform pass infrastructure for the OpenMP dialect.Jun 30 2023, 3:14 AM

skatrak edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B242364: Diff 536167.Jun 30 2023, 3:15 AM

skatrak mentioned this in D154247: [Flang][OpenMP][MLIR] An mlir transformation pass for marking FuncOp's implicitly called from TargetOp's and declare target marked FuncOp's as implicitly declare target.Jul 4 2023, 3:20 AM

Move MLIR transformation pass to Flang and implement 2-stage filtering to work together with early target region outlining.

Herald added subscribers: gysit, Dinistro. · View Herald TranscriptJul 7 2023, 5:49 AM

skatrak edited the summary of this revision. (Show Details)Jul 7 2023, 5:50 AM

skatrak edited the summary of this revision. (Show Details)

skatrak removed parent revisions: D154194: [MLIR][OpenMP] Set up MLIR transform pass infrastructure for the OpenMP dialect, D150323: [Flang][OpenMP] A semantic analysis pass for marking functions, subroutines and their interfaces as implicitly declare target when called inside of a declare target function or target region.

Harbormaster completed remote builds in B243749: Diff 538106.Jul 7 2023, 6:10 AM

jsjodin added inline comments.Jul 13 2023, 6:52 AM

flang/lib/Frontend/FrontendActions.cpp
698	I don't think this is needed. The FIR should already have been filtered by the front end.
flang/lib/Optimizer/Transforms/OMPFunctionFiltering.cpp
48 ↗	(On Diff #538106)	This could potentially be simplified if the early outlining is run before this pass. I think this is still okay though since the first instruction in the outlined functions is a TargetOp so it should be efficient, and it still works without the early outlining. So I think it is okay to keep as-is.

Address review comments and integrate with recently-landed early target region outlining. Small refactor to improve readability and minimize TargetOp search overhead.

Move misplaced function.

Thank you for your feedback Jan, your comments should have been addressed.

flang/lib/Optimizer/Transforms/OMPFunctionFiltering.cpp
48 ↗	(On Diff #538106)	I thought about this, but I think it's still necessary to check here whether there are `omp.target` ops inside because the outlined functions are host functions in the device pass. So they would by default be filtered out here and the `omp.target` inside wouldn't reach the MLIR to LLVM IR translation stage. But you're right. By running this after early outlining, the check should be very quick.

Harbormaster completed remote builds in B245358: Diff 540368.Jul 14 2023, 5:39 AM

LGTM, Please wait for another acceptance.

flang/lib/Optimizer/Transforms/OMPFunctionFiltering.cpp
48 ↗	(On Diff #538106)	It would be possible to check the early outlining interface to see if the parent function name is set, but that solution is less desirable imo.

This revision is now accepted and ready to land.Jul 14 2023, 7:21 AM

See comment inline.

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	The change in `convertOneFunction` is intrusive. Code here must not be aware of the OpenMP attributes.

This revision now requires changes to proceed.Jul 14 2023, 7:38 AM

jsjodin added inline comments.Jul 14 2023, 7:45 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	The change in `convertOneFunction` is intrusive. Code here must not be aware of the OpenMP attributes. Would it be ensure that the omp.declare_target dialect attribute is available so that convertDialectAttributes can handle everything?

jsjodin added inline comments.Jul 14 2023, 7:47 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	The change in `convertOneFunction` is intrusive. Code here must not be aware of the OpenMP attributes. Would it be ensure that the omp.declare_target dialect attribute is available so that convertDialectAttributes can handle everything?
934–944 ↗	(On Diff #540368)	The change in `convertOneFunction` is intrusive. Code here must not be aware of the OpenMP attributes. Would it be ensure that the omp.declare_target dialect attribute is available so that convertDialectAttributes can handle everything? Correction: Would it be possible to ensure that the omp.declare_target dialect attribute is available so that convertDialectAttributes can handle everything?

skatrak added inline comments.Jul 14 2023, 8:23 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	The change in `convertOneFunction` is intrusive. Code here must not be aware of the OpenMP attributes. The problem here is that the `amendOperation` flow is called if there is an `omp.declare_target` attribute explicitly set for the function. However, functions without the attribute must also be treated as declare target with `device type = host`, so they must be filtered out during device compilation. So, the options I see are either to access the attribute directly here or ensure that all functions have the attribute, either by setting it here or at an earlier stage. Though it might be enough to mark outlined functions with the attribute, which should be the only ones that will be removed at this stage.

jsjodin added inline comments.Jul 14 2023, 8:28 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	The change in `convertOneFunction` is intrusive. Code here must not be aware of the OpenMP attributes. The problem here is that the `amendOperation` flow is called if there is an `omp.declare_target` attribute explicitly set for the function. However, functions without the attribute must also be treated as declare target with `device type = host`, so they must be filtered out during device compilation. So, the options I see are either to access the attribute directly here or ensure that all functions have the attribute, either by setting it here or at an earlier stage. Though it might be enough to mark outlined functions with the attribute, which should be the only ones that will be removed at this stage. I think the best option is to modify the early outlining to set the attribute. You can add that to this patch and a test if the current tests don't already cover this.

agozillon added inline comments.Jul 14 2023, 8:37 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	The type of the attribute is just a boolean though, so it has no dependencies on the OpenMPDialect, we already use something similar inside of the getOpenMPBuilder call: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Target/LLVMIR/ModuleTranslation.cpp#L1311 Is it possible to do a similar segment of code inside of the destructor of ModuleTranslation, where we already have some OpenMP specific code (destruction of the OMPIRBuilder)? And is that a more acceptable approach?

Remove handling of OpenMP dialect attributes from generic translation. Not needed because the early outlining pass already adds the omp.declare_target attribute to the new function, so the second-stage filter can assume it will be always explicitly set.

Thanks @skatrak.

This revision is now accepted and ready to land.Jul 14 2023, 10:04 AM

skatrak marked 6 inline comments as done.Jul 14 2023, 10:09 AM

skatrak added inline comments.

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	After some testing it turns out that, in this specific case, we can assume that the attribute will be explicitly set by the early outlining. So there is no need to check for implicit host functions here. The type of the attribute is just a boolean though, so it has no dependencies on the OpenMPDialect, we already use something similar inside of the getOpenMPBuilder call: https://github.com/llvm/llvm-project/blob/main/mlir/lib/Target/LLVMIR/ModuleTranslation.cpp#L1311 Is it possible to do a similar segment of code inside of the destructor of ModuleTranslation, where we already have some OpenMP specific code (destruction of the OMPIRBuilder)? And is that a more acceptable approach? I think that is also something we should try to remove from this file if we can. In D147219 I implemented an approach that moves the OMPIRBuilder configuration to the amendOperation flow as well, which has the advantage of allowing the use of OpenMP custom attribute types. It's not been accepted, but I think it's a better way to go about that.

Harbormaster completed remote builds in B245424: Diff 540468.Jul 14 2023, 10:16 AM

agozillon added inline comments.Jul 14 2023, 10:27 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	While I'm all for doing that, I don't think moving it into amendOperation would work at this time, as the attributes are tied to the ModuleOp and I believe the amendOperation for the ModuleOp is ran as one of the final things in the lowering process. The information in the Config for IRBuilder is required very early on, before lowering global ops (for declare target) which happens prior to even the function lowering which is when TargetOp will currently need it. I could be incorrect though!

Thank you all for the comments, I'll land this patch shortly.

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
934–944 ↗	(On Diff #540368)	Yes, I noticed that issue as well. If you look at that patch, you can see that I moved the `convertOperation()` call to the module inside of `translateModuleToLLVMIR()` to be before translating the functions inside. Doing it in that order means that the `OpenMPIRBuilder` is initialized before translating the contents of the module. I believe that is a good approach to avoid having dialect-specific attributes handling in the generic `ModuleTranslation`, since (at least at this time) there is no particular reason why the module should be converted/amended last.

Closed by commit rGdebdfc0ae21b: [Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device (authored by skatrak). · Explain WhyJul 17 2023, 1:08 AM

This revision was automatically updated to reflect the committed changes.

skatrak marked an inline comment as done.

skatrak added a commit: rGdebdfc0ae21b: [Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device.

Revision Contents

Path

Size

flang/

lib/

Frontend/

FrontendActions.cpp

3 lines

test/

Driver/

mlir-debug-pass-pipeline.f90

1 line

mlir-pass-pipeline.f90

1 line

Lower/

OpenMP/

omp-declare-target-func-and-subr.f90

55 lines

mlir/

include/

mlir/

Dialect/

OpenMP/

7 lines

39 lines

22 lines

2 lines

lib/

Dialect/

OpenMP/

CMakeLists.txt

2 lines

Transforms/

CMakeLists.txt

16 lines

FilterDeviceHostFunctions.cpp

72 lines

test/

Dialect/

OpenMP/

filter-device-host-functions.mlir

111 lines

Diff 535717

flang/lib/Frontend/FrontendActions.cpp

Show All 27 Lines
#include "flang/Parser/provenance.h"		#include "flang/Parser/provenance.h"
#include "flang/Parser/source.h"		#include "flang/Parser/source.h"
#include "flang/Parser/unparse.h"		#include "flang/Parser/unparse.h"
#include "flang/Semantics/runtime-type-info.h"		#include "flang/Semantics/runtime-type-info.h"
#include "flang/Semantics/semantics.h"		#include "flang/Semantics/semantics.h"
#include "flang/Semantics/unparse-with-symbols.h"		#include "flang/Semantics/unparse-with-symbols.h"
#include "flang/Tools/CrossToolHelpers.h"		#include "flang/Tools/CrossToolHelpers.h"

		#include "mlir/Dialect/OpenMP/OpenMPPasses.h"
#include "mlir/IR/Dialect.h"		#include "mlir/IR/Dialect.h"
#include "mlir/Parser/Parser.h"		#include "mlir/Parser/Parser.h"
#include "mlir/Pass/PassManager.h"		#include "mlir/Pass/PassManager.h"
#include "mlir/Support/LLVM.h"		#include "mlir/Support/LLVM.h"
#include "mlir/Target/LLVMIR/Import.h"		#include "mlir/Target/LLVMIR/Import.h"
#include "mlir/Target/LLVMIR/ModuleTranslation.h"		#include "mlir/Target/LLVMIR/ModuleTranslation.h"
#include "clang/Basic/Diagnostic.h"		#include "clang/Basic/Diagnostic.h"
#include "clang/Basic/DiagnosticFrontend.h"		#include "clang/Basic/DiagnosticFrontend.h"
▲ Show 20 Lines • Show All 252 Lines • ▼ Show 20 Lines	bool CodeGenAction::beginSourceFileAction() {
Fortran::parser::Program &parseTree{*ci.getParsing().parseTree()};		Fortran::parser::Program &parseTree{*ci.getParsing().parseTree()};
lb.lower(parseTree, ci.getInvocation().getSemanticsContext());		lb.lower(parseTree, ci.getInvocation().getSemanticsContext());

// run the default passes.		// run the default passes.
mlir::PassManager pm((*mlirModule)->getName(),		mlir::PassManager pm((*mlirModule)->getName(),
mlir::OpPassManager::Nesting::Implicit);		mlir::OpPassManager::Nesting::Implicit);
pm.enableVerifier(/verifyPasses=/true);		pm.enableVerifier(/verifyPasses=/true);
pm.addPass(std::make_unique<Fortran::lower::VerifierPass>());		pm.addPass(std::make_unique<Fortran::lower::VerifierPass>());
		pm.addPass(mlir::omp::createFilterDeviceHostFunctionsPass());

if (mlir::failed(pm.run(*mlirModule))) {		if (mlir::failed(pm.run(*mlirModule))) {
unsigned diagID = ci.getDiagnostics().getCustomDiagID(		unsigned diagID = ci.getDiagnostics().getCustomDiagID(
clang::DiagnosticsEngine::Error,		clang::DiagnosticsEngine::Error,
"verification of lowering to FIR failed");		"verification of lowering to FIR failed");
ci.getDiagnostics().Report(diagID);		ci.getDiagnostics().Report(diagID);
return false;		return false;
}		}
▲ Show 20 Lines • Show All 374 Lines • ▼ Show 20 Lines	void CodeGenAction::generateLLVMIR() {
fir::support::loadDialects(*mlirCtx);		fir::support::loadDialects(*mlirCtx);
fir::support::registerLLVMTranslation(*mlirCtx);		fir::support::registerLLVMTranslation(*mlirCtx);

// Set-up the MLIR pass manager		// Set-up the MLIR pass manager
mlir::PassManager pm((*mlirModule)->getName(),		mlir::PassManager pm((*mlirModule)->getName(),
mlir::OpPassManager::Nesting::Implicit);		mlir::OpPassManager::Nesting::Implicit);

pm.addPass(std::make_unique<Fortran::lower::VerifierPass>());		pm.addPass(std::make_unique<Fortran::lower::VerifierPass>());
		pm.addPass(mlir::omp::createFilterDeviceHostFunctionsPass());
pm.enableVerifier(/verifyPasses=/true);		pm.enableVerifier(/verifyPasses=/true);

		jsjodinUnsubmitted Done Reply Inline Actions I don't think this is needed. The FIR should already have been filtered by the front end. jsjodin: I don't think this is needed. The FIR should already have been filtered by the front end.
// Create the pass pipeline		// Create the pass pipeline
fir::createMLIRToLLVMPassPipeline(pm, level, opts.StackArrays,		fir::createMLIRToLLVMPassPipeline(pm, level, opts.StackArrays,
opts.Underscoring, opts.LoopVersioning,		opts.Underscoring, opts.LoopVersioning,
opts.getDebugInfo());		opts.getDebugInfo());
(void)mlir::applyPassManagerCLOptions(pm);		(void)mlir::applyPassManagerCLOptions(pm);

// run the pass manager		// run the pass manager
if (!mlir::succeeded(pm.run(*mlirModule))) {		if (!mlir::succeeded(pm.run(*mlirModule))) {
▲ Show 20 Lines • Show All 346 Lines • Show Last 20 Lines

flang/test/Driver/mlir-debug-pass-pipeline.f90

	Show All 19 Lines
	! DEBUG-DIRECTIVES: warning: Unsupported debug option: line-directives-only			! DEBUG-DIRECTIVES: warning: Unsupported debug option: line-directives-only
	!			!
	! DEBUG-ERR: error: invalid value 'invalid' in '-debug-info-kind=invalid'			! DEBUG-ERR: error: invalid value 'invalid' in '-debug-info-kind=invalid'
	! DEBUG-ERR-NOT: Pass statistics report			! DEBUG-ERR-NOT: Pass statistics report

	! ALL: Pass statistics report			! ALL: Pass statistics report

	! ALL: Fortran::lower::VerifierPass			! ALL: Fortran::lower::VerifierPass
				! ALL-NEXT: FilterDeviceHostFunctions
	! ALL-NEXT: 'func.func' Pipeline			! ALL-NEXT: 'func.func' Pipeline
	! ALL-NEXT: InlineElementals			! ALL-NEXT: InlineElementals
	! ALL-NEXT: LowerHLFIROrderedAssignments			! ALL-NEXT: LowerHLFIROrderedAssignments
	! ALL-NEXT: LowerHLFIRIntrinsics			! ALL-NEXT: LowerHLFIRIntrinsics
	! ALL-NEXT: BufferizeHLFIR			! ALL-NEXT: BufferizeHLFIR
	! ALL-NEXT: ConvertHLFIRtoFIR			! ALL-NEXT: ConvertHLFIRtoFIR
	! ALL-NEXT: CSE			! ALL-NEXT: CSE
	! Ideally, we need an output with only the pass names, but			! Ideally, we need an output with only the pass names, but
	▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

flang/test/Driver/mlir-pass-pipeline.f90

	! Test the MLIR pass pipeline			! Test the MLIR pass pipeline

	! RUN: %flang_fc1 -S -mmlir --mlir-pass-statistics -mmlir --mlir-pass-statistics-display=pipeline -o /dev/null %s 2>&1 \| FileCheck --check-prefixes=ALL %s			! RUN: %flang_fc1 -S -mmlir --mlir-pass-statistics -mmlir --mlir-pass-statistics-display=pipeline -o /dev/null %s 2>&1 \| FileCheck --check-prefixes=ALL %s
	! -O0 is the default:			! -O0 is the default:
	! RUN: %flang_fc1 -S -mmlir --mlir-pass-statistics -mmlir --mlir-pass-statistics-display=pipeline %s -O0 -o /dev/null 2>&1 \| FileCheck --check-prefixes=ALL %s			! RUN: %flang_fc1 -S -mmlir --mlir-pass-statistics -mmlir --mlir-pass-statistics-display=pipeline %s -O0 -o /dev/null 2>&1 \| FileCheck --check-prefixes=ALL %s
	! RUN: %flang_fc1 -S -mmlir --mlir-pass-statistics -mmlir --mlir-pass-statistics-display=pipeline %s -O2 -o /dev/null 2>&1 \| FileCheck --check-prefixes=ALL,O2 %s			! RUN: %flang_fc1 -S -mmlir --mlir-pass-statistics -mmlir --mlir-pass-statistics-display=pipeline %s -O2 -o /dev/null 2>&1 \| FileCheck --check-prefixes=ALL,O2 %s

	! REQUIRES: asserts			! REQUIRES: asserts

	end program			end program

	! ALL: Pass statistics report			! ALL: Pass statistics report

	! ALL: Fortran::lower::VerifierPass			! ALL: Fortran::lower::VerifierPass
				! ALL-NEXT: FilterDeviceHostFunctions
	! O2-NEXT: Canonicalizer			! O2-NEXT: Canonicalizer
	! O2-NEXT: 'func.func' Pipeline			! O2-NEXT: 'func.func' Pipeline
	! O2-NEXT: SimplifyHLFIRIntrinsics			! O2-NEXT: SimplifyHLFIRIntrinsics
	! ALL: InlineElementals			! ALL: InlineElementals
	! ALL: LowerHLFIROrderedAssignments			! ALL: LowerHLFIROrderedAssignments
	! ALL-NEXT: LowerHLFIRIntrinsics			! ALL-NEXT: LowerHLFIRIntrinsics
	! ALL-NEXT: BufferizeHLFIR			! ALL-NEXT: BufferizeHLFIR
	! ALL-NEXT: ConvertHLFIRtoFIR			! ALL-NEXT: ConvertHLFIRtoFIR
	▲ Show 20 Lines • Show All 53 Lines • Show Last 20 Lines

flang/test/Lower/OpenMP/omp-declare-target-func-and-subr.f90

	!RUN: %flang_fc1 -emit-fir -fopenmp %s -o - \| FileCheck %s			!RUN: %flang_fc1 -emit-fir -fopenmp %s -o - \| FileCheck %s --check-prefixes ALL,HOST
				!RUN: %flang_fc1 -emit-fir -fopenmp -fopenmp-is-device %s -o - \| FileCheck %s --check-prefixes ALL,DEVICE

	! Check specification valid forms of declare target with functions			! Check specification valid forms of declare target with functions
	! utilising device_type and to clauses as well as the default			! utilising device_type and to clauses as well as the default
	! zero clause declare target			! zero clause declare target

	! CHECK-LABEL: func.func @_QPfunc_t_device()			! DEVICE-LABEL: func.func @_QPfunc_t_device()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (nohost), capture_clause = (to)>{{.}}			! DEVICE-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (nohost), capture_clause = (to)>{{.}}
	FUNCTION FUNC_T_DEVICE() RESULT(I)			FUNCTION FUNC_T_DEVICE() RESULT(I)
	!$omp declare target to(FUNC_T_DEVICE) device_type(nohost)			!$omp declare target to(FUNC_T_DEVICE) device_type(nohost)
	INTEGER :: I			INTEGER :: I
	I = 1			I = 1
	END FUNCTION FUNC_T_DEVICE			END FUNCTION FUNC_T_DEVICE

	! CHECK-LABEL: func.func @_QPfunc_t_host()			! HOST-LABEL: func.func @_QPfunc_t_host()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (host), capture_clause = (to)>{{.}}			! HOST-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (host), capture_clause = (to)>{{.}}
	FUNCTION FUNC_T_HOST() RESULT(I)			FUNCTION FUNC_T_HOST() RESULT(I)
	!$omp declare target to(FUNC_T_HOST) device_type(host)			!$omp declare target to(FUNC_T_HOST) device_type(host)
	INTEGER :: I			INTEGER :: I
	I = 1			I = 1
	END FUNCTION FUNC_T_HOST			END FUNCTION FUNC_T_HOST

	! CHECK-LABEL: func.func @_QPfunc_t_any()			! ALL-LABEL: func.func @_QPfunc_t_any()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	FUNCTION FUNC_T_ANY() RESULT(I)			FUNCTION FUNC_T_ANY() RESULT(I)
	!$omp declare target to(FUNC_T_ANY) device_type(any)			!$omp declare target to(FUNC_T_ANY) device_type(any)
	INTEGER :: I			INTEGER :: I
	I = 1			I = 1
	END FUNCTION FUNC_T_ANY			END FUNCTION FUNC_T_ANY

	! CHECK-LABEL: func.func @_QPfunc_default_t_any()			! ALL-LABEL: func.func @_QPfunc_default_t_any()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	FUNCTION FUNC_DEFAULT_T_ANY() RESULT(I)			FUNCTION FUNC_DEFAULT_T_ANY() RESULT(I)
	!$omp declare target to(FUNC_DEFAULT_T_ANY)			!$omp declare target to(FUNC_DEFAULT_T_ANY)
	INTEGER :: I			INTEGER :: I
	I = 1			I = 1
	END FUNCTION FUNC_DEFAULT_T_ANY			END FUNCTION FUNC_DEFAULT_T_ANY

	! CHECK-LABEL: func.func @_QPfunc_default_any()			! ALL-LABEL: func.func @_QPfunc_default_any()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	FUNCTION FUNC_DEFAULT_ANY() RESULT(I)			FUNCTION FUNC_DEFAULT_ANY() RESULT(I)
	!$omp declare target			!$omp declare target
	INTEGER :: I			INTEGER :: I
	I = 1			I = 1
	END FUNCTION FUNC_DEFAULT_ANY			END FUNCTION FUNC_DEFAULT_ANY

	! CHECK-LABEL: func.func @_QPfunc_default_extendedlist()			! ALL-LABEL: func.func @_QPfunc_default_extendedlist()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	FUNCTION FUNC_DEFAULT_EXTENDEDLIST() RESULT(I)			FUNCTION FUNC_DEFAULT_EXTENDEDLIST() RESULT(I)
	!$omp declare target(FUNC_DEFAULT_EXTENDEDLIST)			!$omp declare target(FUNC_DEFAULT_EXTENDEDLIST)
	INTEGER :: I			INTEGER :: I
	I = 1			I = 1
	END FUNCTION FUNC_DEFAULT_EXTENDEDLIST			END FUNCTION FUNC_DEFAULT_EXTENDEDLIST

	!! -----			!! -----

	! Check specification valid forms of declare target with subroutines			! Check specification valid forms of declare target with subroutines
	! utilising device_type and to clauses as well as the default			! utilising device_type and to clauses as well as the default
	! zero clause declare target			! zero clause declare target

	! CHECK-LABEL: func.func @_QPsubr_t_device()			! DEVICE-LABEL: func.func @_QPsubr_t_device()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (nohost), capture_clause = (to)>{{.}}			! DEVICE-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (nohost), capture_clause = (to)>{{.}}
	SUBROUTINE SUBR_T_DEVICE()			SUBROUTINE SUBR_T_DEVICE()
	!$omp declare target to(SUBR_T_DEVICE) device_type(nohost)			!$omp declare target to(SUBR_T_DEVICE) device_type(nohost)
	END			END

	! CHECK-LABEL: func.func @_QPsubr_t_host()			! HOST-LABEL: func.func @_QPsubr_t_host()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (host), capture_clause = (to)>{{.}}			! HOST-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (host), capture_clause = (to)>{{.}}
	SUBROUTINE SUBR_T_HOST()			SUBROUTINE SUBR_T_HOST()
	!$omp declare target to(SUBR_T_HOST) device_type(host)			!$omp declare target to(SUBR_T_HOST) device_type(host)
	END			END

	! CHECK-LABEL: func.func @_QPsubr_t_any()			! ALL-LABEL: func.func @_QPsubr_t_any()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	SUBROUTINE SUBR_T_ANY()			SUBROUTINE SUBR_T_ANY()
	!$omp declare target to(SUBR_T_ANY) device_type(any)			!$omp declare target to(SUBR_T_ANY) device_type(any)
	END			END

	! CHECK-LABEL: func.func @_QPsubr_default_t_any()			! ALL-LABEL: func.func @_QPsubr_default_t_any()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	SUBROUTINE SUBR_DEFAULT_T_ANY()			SUBROUTINE SUBR_DEFAULT_T_ANY()
	!$omp declare target to(SUBR_DEFAULT_T_ANY)			!$omp declare target to(SUBR_DEFAULT_T_ANY)
	END			END

	! CHECK-LABEL: func.func @_QPsubr_default_any()			! ALL-LABEL: func.func @_QPsubr_default_any()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	SUBROUTINE SUBR_DEFAULT_ANY()			SUBROUTINE SUBR_DEFAULT_ANY()
	!$omp declare target			!$omp declare target
	END			END

	! CHECK-LABEL: func.func @_QPsubr_default_extendedlist()			! ALL-LABEL: func.func @_QPsubr_default_extendedlist()
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}			! ALL-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (any), capture_clause = (to)>{{.}}
	SUBROUTINE SUBR_DEFAULT_EXTENDEDLIST()			SUBROUTINE SUBR_DEFAULT_EXTENDEDLIST()
	!$omp declare target(SUBR_DEFAULT_EXTENDEDLIST)			!$omp declare target(SUBR_DEFAULT_EXTENDEDLIST)
	END			END

	!! -----			!! -----

	! CHECK-LABEL: func.func @_QPrecursive_declare_target			! DEVICE-LABEL: func.func @_QPrecursive_declare_target
	! CHECK-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (nohost), capture_clause = (to)>{{.}}			! DEVICE-SAME: {{.}}attributes {omp.declare_target = #omp.declaretarget<device_type = (nohost), capture_clause = (to)>{{.}}
	RECURSIVE FUNCTION RECURSIVE_DECLARE_TARGET(INCREMENT) RESULT(K)			RECURSIVE FUNCTION RECURSIVE_DECLARE_TARGET(INCREMENT) RESULT(K)
	!$omp declare target to(RECURSIVE_DECLARE_TARGET) device_type(nohost)			!$omp declare target to(RECURSIVE_DECLARE_TARGET) device_type(nohost)
	INTEGER :: INCREMENT, K			INTEGER :: INCREMENT, K
	IF (INCREMENT == 10) THEN			IF (INCREMENT == 10) THEN
	K = INCREMENT			K = INCREMENT
	ELSE			ELSE
	K = RECURSIVE_DECLARE_TARGET(INCREMENT + 1)			K = RECURSIVE_DECLARE_TARGET(INCREMENT + 1)
	END IF			END IF
	END FUNCTION RECURSIVE_DECLARE_TARGET			END FUNCTION RECURSIVE_DECLARE_TARGET

mlir/include/mlir/Dialect/OpenMP/CMakeLists.txt

	Show All 15 Lines
	add_dependencies(OpenMPDialectDocGen omp_common_td)			add_dependencies(OpenMPDialectDocGen omp_common_td)
	add_mlir_interface(OpenMPOpsInterfaces)			add_mlir_interface(OpenMPOpsInterfaces)

	set(LLVM_TARGET_DEFINITIONS OpenMPTypeInterfaces.td)			set(LLVM_TARGET_DEFINITIONS OpenMPTypeInterfaces.td)
	mlir_tablegen(OpenMPTypeInterfaces.h.inc -gen-type-interface-decls)			mlir_tablegen(OpenMPTypeInterfaces.h.inc -gen-type-interface-decls)
	mlir_tablegen(OpenMPTypeInterfaces.cpp.inc -gen-type-interface-defs)			mlir_tablegen(OpenMPTypeInterfaces.cpp.inc -gen-type-interface-defs)
	add_public_tablegen_target(MLIROpenMPTypeInterfacesIncGen)			add_public_tablegen_target(MLIROpenMPTypeInterfacesIncGen)
	add_dependencies(mlir-generic-headers MLIROpenMPTypeInterfacesIncGen)			add_dependencies(mlir-generic-headers MLIROpenMPTypeInterfacesIncGen)

				set(LLVM_TARGET_DEFINITIONS OpenMPPasses.td)
				mlir_tablegen(OpenMPPasses.h.inc -gen-pass-decls -name OpenMP)
				mlir_tablegen(OpenMPPasses.capi.h.inc -gen-pass-capi-header --prefix OpenMP)
				mlir_tablegen(OpenMPPasses.capi.cpp.inc -gen-pass-capi-impl --prefix OpenMP)
				add_public_tablegen_target(MLIROpenMPPassIncGen)
				add_mlir_doc(Passes OpenMPPasses ./ -gen-pass-doc)

mlir/include/mlir/Dialect/OpenMP/OpenMPPasses.h

This file was added.

				//===- OpenMPPasses.h - OpenMP pass entry points ----------------- C++ --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This header file defines prototypes that expose pass constructors.
				//
				//===----------------------------------------------------------------------===//
				#ifndef MLIR_DIALECT_OPENMP_OPENMPPASSES_H_
				#define MLIR_DIALECT_OPENMP_OPENMPPASSES_H_

				#include "mlir/Pass/Pass.h"

				namespace mlir {
				namespace omp {

				#define GEN_PASS_DECL
				#include "mlir/Dialect/OpenMP/OpenMPPasses.h.inc"

				/// Create a pass to filter out functions intended for the host when compiling
				/// for the device and vice versa.
				std::unique_ptr<Pass> createFilterDeviceHostFunctionsPass();

				} // namespace omp

				//===----------------------------------------------------------------------===//
				// Registration
				//===----------------------------------------------------------------------===//

				/// Generate the code for registering passes.
				#define GEN_PASS_REGISTRATION
				#include "mlir/Dialect/OpenMP/OpenMPPasses.h.inc"

				} // namespace mlir

				#endif // MLIR_DIALECT_OPENMP_OPENMPPASSES_H_

mlir/include/mlir/Dialect/OpenMP/OpenMPPasses.td

This file was added.

				//===-- OpenMPPasses.td - OpenMp pass definition file ------- tablegen --===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#ifndef MLIR_DIALECT_OPENMP_OPENMPPASSES_TD_
				#define MLIR_DIALECT_OPENMP_OPENMPPASSES_TD_

				include "mlir/Pass/PassBase.td"

				def FilterDeviceHostFunctions : Pass<"omp-filter-device-host-functions"> {
				let summary = "Filters out functions intended for the host when compiling for the device and vice versa.";
				let constructor = "mlir::omp::createFilterDeviceHostFunctionsPass()";
				let dependentDialects = [
				"func::FuncDialect"
				];
				}

				#endif // MLIR_DIALECT_OPENMP_OPENMPPASSES_TD_

mlir/include/mlir/InitAllPasses.h

Show All 22 Lines
#include "mlir/Dialect/Bufferization/Transforms/Passes.h"		#include "mlir/Dialect/Bufferization/Transforms/Passes.h"
#include "mlir/Dialect/Func/Transforms/Passes.h"		#include "mlir/Dialect/Func/Transforms/Passes.h"
#include "mlir/Dialect/GPU/Transforms/Passes.h"		#include "mlir/Dialect/GPU/Transforms/Passes.h"
#include "mlir/Dialect/LLVMIR/Transforms/Passes.h"		#include "mlir/Dialect/LLVMIR/Transforms/Passes.h"
#include "mlir/Dialect/Linalg/Passes.h"		#include "mlir/Dialect/Linalg/Passes.h"
#include "mlir/Dialect/Math/Transforms/Passes.h"		#include "mlir/Dialect/Math/Transforms/Passes.h"
#include "mlir/Dialect/MemRef/Transforms/Passes.h"		#include "mlir/Dialect/MemRef/Transforms/Passes.h"
#include "mlir/Dialect/NVGPU/Passes.h"		#include "mlir/Dialect/NVGPU/Passes.h"
		#include "mlir/Dialect/OpenMP/OpenMPPasses.h"
#include "mlir/Dialect/SCF/Transforms/Passes.h"		#include "mlir/Dialect/SCF/Transforms/Passes.h"
#include "mlir/Dialect/SPIRV/Transforms/Passes.h"		#include "mlir/Dialect/SPIRV/Transforms/Passes.h"
#include "mlir/Dialect/Shape/Transforms/Passes.h"		#include "mlir/Dialect/Shape/Transforms/Passes.h"
#include "mlir/Dialect/SparseTensor/Pipelines/Passes.h"		#include "mlir/Dialect/SparseTensor/Pipelines/Passes.h"
#include "mlir/Dialect/SparseTensor/Transforms/Passes.h"		#include "mlir/Dialect/SparseTensor/Transforms/Passes.h"
#include "mlir/Dialect/Tensor/Transforms/Passes.h"		#include "mlir/Dialect/Tensor/Transforms/Passes.h"
#include "mlir/Dialect/Tosa/Transforms/Passes.h"		#include "mlir/Dialect/Tosa/Transforms/Passes.h"
#include "mlir/Dialect/Transform/Transforms/Passes.h"		#include "mlir/Dialect/Transform/Transforms/Passes.h"
Show All 29 Lines	inline void registerAllPasses() {
registerGpuSerializeToCubinPass();		registerGpuSerializeToCubinPass();
registerGpuSerializeToHsacoPass();		registerGpuSerializeToHsacoPass();
registerLinalgPasses();		registerLinalgPasses();
registerNVGPUPasses();		registerNVGPUPasses();
registerSparseTensorPasses();		registerSparseTensorPasses();
LLVM::registerLLVMPasses();		LLVM::registerLLVMPasses();
math::registerMathPasses();		math::registerMathPasses();
memref::registerMemRefPasses();		memref::registerMemRefPasses();
		registerOpenMPPasses();
registerSCFPasses();		registerSCFPasses();
registerShapePasses();		registerShapePasses();
spirv::registerSPIRVPasses();		spirv::registerSPIRVPasses();
tensor::registerTensorPasses();		tensor::registerTensorPasses();
tosa::registerTosaOptPasses();		tosa::registerTosaOptPasses();
transform::registerTransformPasses();		transform::registerTransformPasses();
vector::registerVectorPasses();		vector::registerVectorPasses();
arm_sme::registerArmSMEPasses();		arm_sme::registerArmSMEPasses();

// Dialect pipelines		// Dialect pipelines
sparse_tensor::registerSparseTensorPipelines();		sparse_tensor::registerSparseTensorPipelines();
}		}

} // namespace mlir		} // namespace mlir

#endif // MLIR_INITALLPASSES_H_		#endif // MLIR_INITALLPASSES_H_

mlir/lib/Dialect/OpenMP/CMakeLists.txt

	add_mlir_dialect_library(MLIROpenMPDialect			add_mlir_dialect_library(MLIROpenMPDialect
	IR/OpenMPDialect.cpp			IR/OpenMPDialect.cpp

	ADDITIONAL_HEADER_DIRS			ADDITIONAL_HEADER_DIRS
	${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/OpenMP			${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/OpenMP

	DEPENDS			DEPENDS
	MLIROpenMPOpsIncGen			MLIROpenMPOpsIncGen
	MLIROpenMPOpsInterfacesIncGen			MLIROpenMPOpsInterfacesIncGen
	MLIROpenMPTypeInterfacesIncGen			MLIROpenMPTypeInterfacesIncGen

	LINK_LIBS PUBLIC			LINK_LIBS PUBLIC
	MLIRIR			MLIRIR
	MLIRLLVMDialect			MLIRLLVMDialect
	MLIRFuncDialect			MLIRFuncDialect
	)			)

				add_subdirectory(Transforms)

mlir/lib/Dialect/OpenMP/Transforms/CMakeLists.txt

This file was added.

				add_mlir_dialect_library(MLIROpenMPTransforms
				FilterDeviceHostFunctions.cpp

				ADDITIONAL_HEADER_DIRS
				${MLIR_MAIN_INCLUDE_DIR}/mlir/Dialect/OpenMP

				DEPENDS
				MLIROpenMPPassIncGen

				LINK_LIBS PUBLIC
				MLIRIR
				MLIRFuncDialect
				MLIROpenMPDialect
				MLIRPass
				MLIRTransforms
				)

mlir/lib/Dialect/OpenMP/Transforms/FilterDeviceHostFunctions.cpp

This file was added.

				//===- FilterDeviceHostFunctions.cpp - MLIR OpenMP pass implementation ----===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//
				//
				// This file implements transforms to filter out functions intended for the host
				// when compiling for the device and vice versa.
				//
				//===----------------------------------------------------------------------===//

				#include "mlir/Dialect/OpenMP/OpenMPPasses.h"

				#include "mlir/Dialect/Func/IR/FuncOps.h"
				#include "mlir/Dialect/OpenMP/OpenMPDialect.h"
				#include "mlir/Dialect/OpenMP/OpenMPInterfaces.h"
				#include "mlir/IR/BuiltinOps.h"
				#include "llvm/ADT/SmallVector.h"

				namespace mlir {
				namespace omp {
				#define GEN_PASS_DEF_FILTERDEVICEHOSTFUNCTIONS
				#include "mlir/Dialect/OpenMP/OpenMPPasses.h.inc"
				} // namespace omp
				} // namespace mlir

				using namespace mlir;
				using namespace mlir::omp;

				namespace {
				class FilterDeviceHostFunctionsPass
				: public omp::impl::FilterDeviceHostFunctionsBase<
				FilterDeviceHostFunctionsPass> {
				public:
				FilterDeviceHostFunctionsPass() = default;

				void runOnOperation() override {
				auto op = dyn_cast<OffloadModuleInterface>(getOperation());
				if (!op)
				return;

				bool isDeviceCompilation = op.getIsDevice();
				op->walk<WalkOrder::PostOrder>([&](func::FuncOp funcOp) {
				// Do not filter functions with target regions inside, because they have
				// to be available for both host and device so that regular and reverse
				// offloading can be supported.
				bool hasTarget = false;
				funcOp->walk([&](TargetOp) { hasTarget = true; });
				if (hasTarget)
				return;

				DeclareTargetDeviceType declareType = DeclareTargetDeviceType::host;
				auto declareTargetOp =
				dyn_cast<DeclareTargetInterface>(funcOp.getOperation());
				if (declareTargetOp && declareTargetOp.isDeclareTarget())
				declareType = declareTargetOp.getDeclareTargetDeviceType();

				if ((isDeviceCompilation &&
				declareType == DeclareTargetDeviceType::host) \|\|
				(!isDeviceCompilation &&
				declareType == DeclareTargetDeviceType::nohost))
				funcOp->erase();
				});
				}
				};
				} // namespace

				std::unique_ptr<Pass> mlir::omp::createFilterDeviceHostFunctionsPass() {
				return std::make_unique<FilterDeviceHostFunctionsPass>();
				}

mlir/test/Dialect/OpenMP/filter-device-host-functions.mlir

This file was added.

				// RUN: mlir-opt %s -split-input-file --pass-pipeline='builtin.module(omp-filter-device-host-functions)' \| FileCheck %s

				// CHECK: func.func @any
				// CHECK: func.func @nohost
				// CHECK-NOT: func.func @host
				// CHECK-NOT: func.func @none
				// CHECK: func.func @nohost_target
				// CHECK: func.func @host_target
				// CHECK: func.func @none_target
				module attributes {omp.is_device = true} {
				func.func @any() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (any), capture_clause = (to)>
				} {
				func.return
				}
				func.func @nohost() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (nohost), capture_clause = (to)>
				} {
				func.return
				}
				func.func @host() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (host), capture_clause = (to)>
				} {
				func.return
				}
				func.func @none() -> () {
				func.return
				}
				func.func @nohost_target() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (nohost), capture_clause = (to)>
				} {
				omp.target {}
				func.return
				}
				func.func @host_target() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (host), capture_clause = (to)>
				} {
				omp.target {}
				func.return
				}
				func.func @none_target() -> () {
				omp.target {}
				func.return
				}
				}

				// -----

				// CHECK: func.func @any
				// CHECK-NOT: func.func @nohost
				// CHECK: func.func @host
				// CHECK: func.func @none
				// CHECK: func.func @nohost_target
				// CHECK: func.func @host_target
				// CHECK: func.func @none_target
				module attributes {omp.is_device = false} {
				func.func @any() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (any), capture_clause = (to)>
				} {
				func.return
				}
				func.func @nohost() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (nohost), capture_clause = (to)>
				} {
				func.return
				}
				func.func @host() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (host), capture_clause = (to)>
				} {
				func.return
				}
				func.func @none() -> () {
				func.return
				}
				func.func @nohost_target() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (nohost), capture_clause = (to)>
				} {
				omp.target {}
				func.return
				}
				func.func @host_target() -> ()
				attributes {
				omp.declare_target =
				#omp.declaretarget<device_type = (host), capture_clause = (to)>
				} {
				omp.target {}
				func.return
				}
				func.func @none_target() -> () {
				omp.target {}
				func.return
				}
				}

This is an archive of the discontinued LLVM Phabricator instance.

[Flang][OpenMP][MLIR] Filter emitted code depending on declare target and deviceClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 535717

flang/lib/Frontend/FrontendActions.cpp

flang/test/Driver/mlir-debug-pass-pipeline.f90

flang/test/Driver/mlir-pass-pipeline.f90

flang/test/Lower/OpenMP/omp-declare-target-func-and-subr.f90

mlir/include/mlir/Dialect/OpenMP/CMakeLists.txt

mlir/include/mlir/Dialect/OpenMP/OpenMPPasses.h

mlir/include/mlir/Dialect/OpenMP/OpenMPPasses.td

mlir/include/mlir/InitAllPasses.h

mlir/lib/Dialect/OpenMP/CMakeLists.txt

mlir/lib/Dialect/OpenMP/Transforms/CMakeLists.txt

mlir/lib/Dialect/OpenMP/Transforms/FilterDeviceHostFunctions.cpp

mlir/test/Dialect/OpenMP/filter-device-host-functions.mlir

[Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device
ClosedPublic