This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/include/llvm/Frontend/OpenMP/
-
include/
-
llvm/
-
Frontend/
-
OpenMP/
-
OMPIRBuilder.h
-
mlir/
-
lib/Target/LLVMIR/
-
Target/
-
LLVMIR/
-
CMakeLists.txt
-
Dialect/OpenMP/
-
OpenMP/
18/18
OpenMPToLLVMIRTranslation.cpp
2/2
ModuleTranslation.cpp
-
test/Target/LLVMIR/
-
Target/
-
LLVMIR/
-
openmp-llvm.mlir

Differential D147219

[OpenMP][Flang][MLIR] Lowering of requires directive from MLIR to LLVM IR
ClosedPublic

Authored by skatrak on Mar 30 2023, 4:47 AM.

Download Raw Diff

Details

Reviewers

dpalermo
jsjodin
domada
agozillon
sscalpone
ftynse
jdoerfert
nicolasvasilache
kiranchandramohan
awarzynski

Commits

rG9058762789c0: [OpenMP][Flang][MLIR] Lowering of requires directive from MLIR to LLVM IR

Summary

Default atomic ordering information is processed in the OpenMP dialect
to LLVM IR lowering stage at every spot where an operation can be
affected by it. The rest of clauses are stored globally in the
OpenMPIRBuilderConfig object before starting that lowering stage, so
that the OMPIRBuilder can conditionally modify code generation
depending on these. At the end of the process, the omp.requires
attribute is itself lowered into a global constructor that passes these
clauses as flags to the OpenMP runtime.

Depends on D147217, D147218 and D158278.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

skatrak created this revision.Mar 30 2023, 4:47 AM

Herald added a reviewer: sscalpone. · View Herald TranscriptMar 30 2023, 4:47 AM

Herald added a reviewer: ftynse. · View Herald Transcript

Herald added projects: Restricted Project, Restricted Project. · View Herald Transcript

Herald added subscribers: sunshaoce, Moerafaat, zero9178 and 25 others. · View Herald Transcript

skatrak requested review of this revision.Mar 30 2023, 4:47 AM

Herald added a reviewer: jdoerfert. · View Herald TranscriptMar 30 2023, 4:47 AM

Herald added a reviewer: nicolasvasilache. · View Herald Transcript

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: jplehr, sstefan1, stephenneuendorffer and 2 others. · View Herald Transcript

Harbormaster completed remote builds in B222707: Diff 509618.Mar 30 2023, 5:32 AM

Herald added a reviewer: kiranchandramohan. · View Herald TranscriptMar 30 2023, 5:32 AM

Fix unit test failure

Herald added a reviewer: awarzynski. · View Herald TranscriptApr 11 2023, 9:05 AM

Herald added a subscriber: bviyer. · View Herald Transcript

Harbormaster completed remote builds in B224799: Diff 512485.Apr 11 2023, 9:19 AM

awarzynski added inline comments.Apr 11 2023, 10:16 AM

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	Can you explain the rationale for this extra line? I'm guessing that `ret` is being generated somewhere? I'm mostly just curious. And also wondering whether there's any way to make this test less fragile :)

skatrak added inline comments.Apr 12 2023, 4:17 AM

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	This test started failing because the codegen changes resulted in an `lretl` instruction appear somewhere below (outside of the _QQmain region of code). The change I made was just to make sure that we're only checking that there's no "ret" within the region, which is probably not an ideal way to test either. But since I don't think this test is really related to the patch, I didn't want to make significant changes to it. I'm not sure how we could make it less fragile, since that part of the test is basically interpreting the input as something it's not. So, if I understand it correctly, we're basically checking "garbage" data making sure that a pattern doesn't appear by chance. We can tighten the pattern to reduce the likelihood of false positives/negatives (which is what I did with that added line), but I suppose they are never going to be zero. A solution could be to use `llvm-readelf -h` instead of `llvm-objdump`, and then check for the "Machine" header.

awarzynski added inline comments.Apr 16 2023, 12:56 PM

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	Sorry about the delay and thanks for the explanation. The intent of this test is to verify that this fails: ! RUN: %flang_fc1 -emit-obj -triple <some-triple> -o - \| llvm-objdump --triple <some-other-triple> I couldn't find any other way to leverage this apart from checking for the "return" instruction in the output from `llvm-objdump`. Sadly, it looks like in your case the disassembler generates a "return" instruction anyway. Although it's under some other Asm label, it's still there. I appreciate your efforts to preserve the intended behavior of this test, but this patch demonstrates how fragile it is regardless. I couldn't find any other way to make it more robust and am leaning towards deleting it altogether (atm there are other tests to verify the `--target` flag). Having said that, it is still very unclear to me why your changes affect the non-OpenMP code generation. This test is meant to be unrelated to OpenMP, so shouldn't be affected by what you are updating here, right? Any idea?

For code in mlir/lib/Target/LLVMIR/ModuleTranslation.cpp, mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp we should have tests in mlir/test/Target/LLVMIR/.

This revision now requires changes to proceed.Apr 16 2023, 1:20 PM

Update unit tests.

Harbormaster completed remote builds in B226118: Diff 514263.Apr 17 2023, 10:23 AM

Undo changes to unrelated unit test.

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	I couldn't find any other way to leverage this apart from checking for the "return" instruction in the output from `llvm-objdump`. Sadly, it looks like in your case the disassembler generates a "return" instruction anyway. Although it's under some other Asm label, it's still there. I appreciate your efforts to preserve the intended behavior of this test, but this patch demonstrates how fragile it is regardless. I couldn't find any other way to make it more robust and am leaning towards deleting it altogether (atm there are other tests to verify the `--target` flag). I see, I'm not sure either whether this is a good test to make. If `llvm-objdump` was able to detect if a user is trying to disassemble using the wrong triple and emit an error, then we could check that. Since that seems not to be the case, we could maybe check that `flang -c` produces a binary for the intended architecture by running it through `llvm-readelf -h`. This would only check the headers and not the machine code, but maybe that's more useful than removing the test. If other tests already check this, then maybe removing this wouldn't really be an issue. Having said that, it is still very unclear to me why your changes affect the non-OpenMP code generation. This test is meant to be unrelated to OpenMP, so shouldn't be affected by what you are updating here, right? Any idea? Thank you for pointing this out, you are right. I spent some time looking into it, and it seems like the issue is that the registration function is being generated also when `-fopenmp` is not specified, which is a bug. It just turned out that this unit test picked it up by chance because the generated binary for aarch64 happened to contain a `ret` instruction if interpreted as x86_64. I'll revert changes to that unit test here and fix the bug in one of the patches this one depends on.

Harbormaster completed remote builds in B226351: Diff 514592.Apr 18 2023, 3:56 AM

awarzynski added inline comments.Apr 19 2023, 1:57 PM

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	Since that seems not to be the case, we could maybe check that flang -c produces a binary for the intended architecture by running it through llvm-readelf -h. This would only check the headers and not the machine code, but maybe that's more useful than removing the test. If other tests already check this, then maybe removing this wouldn't really be an issue. I really like this idea - the test could verify `e_machine`: https://en.wikipedia.org/wiki/Executable_and_Linkable_Format. Are interested in trying? I'd happily review (no spare cycles to do it myself ATM). No worries if not, this is completely unrelated to this patch!

Rebase and update.

skatrak marked an inline comment as done.Apr 21 2023, 9:35 AM

skatrak added inline comments.

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	Sure @awarzynski, I think I should be able to find some time for this next week if that's alright. Do you know if this the only test of this type or if there are others that could be changed at the same time?

awarzynski added inline comments.Apr 23 2023, 9:22 AM

flang/test/Driver/code-gen-aarch64.f90
21 ↗	(On Diff #512485)	Thanks! There are 3 tests like this: flang/test/Driver/code-gen-x86.f90 flang/test/Driver/code-gen-aarch64.f90 flang/test/Driver/code-gen-rv64.f90 IIUC, the RISC-V one is already doing something similar to what we've discussed here.

Harbormaster completed remote builds in B227171: Diff 515727.Apr 24 2023, 2:28 AM

Fix formatting.

Harbormaster completed remote builds in B227734: Diff 516414.Apr 24 2023, 8:01 AM

Update with latest changes to parent patch.

Harbormaster completed remote builds in B227972: Diff 516727.Apr 25 2023, 6:02 AM

kiranchandramohan mentioned this in D147344: [OpenMP][MLIR] Add lowering of omp::FlagsAttr to LLVM-IR.Apr 27 2023, 3:55 AM

Update

Harbormaster completed remote builds in B228594: Diff 517619.Apr 27 2023, 10:29 AM

Could you rebase this patch?

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1083–1086	Should this be part of Semantics?
1601–1620	Can this be an operation in the OpenMP dialect?
mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
1360	Could you move this to `ModuleTranslation::getOpenMPBuilder`?

Thank you @kiranchandramohan for the feedback. Before working on the larger items, I'm requesting some clarification so that I've got a clearer idea of what the preferred approach is. Meanwhile, I'll start working on rebasing the complete stack of related patches and move the code in ModuleTranslation.cpp as suggested.

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1083–1086	It would be possible to gather the `atomic_default_mem_order` clause present in any 'requires' directives in the compilation unit, and then apply it (or 'monotonic', if not present) to all atomic operations with no memory order clauses as a semantic pass. I suppose that would also come together with making the memory order attribute mandatory in MLIR for these operations, since we would be handling the defaults prior to that. Is this a preferable approach?
1601–1620	Do you mean removing the `omp.requires` attribute and instead having an `omp.requires` operation to be specified similarly to this? module { omp.requires { clauses = #omp<clause_requires reverse_offload\|unified_shared_memory> } ... } Doing this would allow us to lower the operation in `OpenMPDialectLLVMIRTranslationInterface::convertOperation` instead of `OpenMPDialectLLVMIRTranslationInterface::amendOperation`, but the clauses from this directive must be used to modify the lowering of some other operations, not just to produce the registration function. The current approach allows accessing this information through the `OffloadModuleInterface` to be passed along to the `OpenMPIRBuilder` (in ModuleTranslation.cpp, currently), but if we had an operation instead of an attribute we would have to search for it in the module to extract this, which I'm not sure it would be a cleaner solution. Also there would be no guarantee that no more than a single occurrence of this operation would be present within the module at all times (e.g. if the input is handwritten MLIR). Or is your suggestion to lower every instance of a 'requires' directive in Fortran to its own `omp.requires` operation at the spot where it appears? Something like this: module { func.func @f() { omp.requires { clauses = #omp<clause_requires reverse_offload> } ... } ... func.func @g() { omp.requires { clauses = #omp<clause_requires unified_shared_memory> } ... } } The issue in that case becomes where to gather all of the clauses and lower to a single registration function in LLVM IR, since `convertOperation` would be called once per instance otherwise. The gathering of the clauses coming from different instances of 'requires' would currently be done in the bridge, so that the single module attribute is produced: D147218. I suppose a solution to issues on both approaches would be to update the 'requires' flags in the `OpenMPIRBuilder::Config` structure in `convertOperation` and replace the `OpenMPIRBuilder::createRegisterRequires` function for a new `OpenMPIRBuilder::createOrUpdateRegisterRequires`, and make sure that only a single registration function is generated, eventually with the whole set of flags passed to the runtime (each subsequent call would update the 'flags' argument instead of creating the function and registering it as a constructor). I guess it wouldn't be an issue the fact that the config structure wouldn't have the complete set of clauses at all times, since they are supposed (and checked in semantics: D149337) to come before any operations that may be affected by them. But I'm also not sure about how clean this solution would be either. Let me know what your thoughts are on whether any of these approaches to supporting 'requires' via an operation is preferable to an attribute, or if you have another idea, since any of these changes would result in multiple updates across all related pending patches so I want to make sure we agree on an approach before committing the time.

Rebase and move OpenMPIRBuilder configuration to ModuleTranslation::getOpenMPBuilder

Herald added a project: Restricted Project. · View Herald TranscriptMay 2 2023, 9:23 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

skatrak marked an inline comment as done.May 2 2023, 9:24 AM

Harbormaster completed remote builds in B229461: Diff 518767.May 2 2023, 10:32 AM

Thanks for the updates. Please see comments inline.

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1083–1086	Yes, that would be my preference. And it is fine for these instances of atomic operations to have the memory order attribute. Since these are anyway accepted as optional, there is no change in the operation definition. I think in general, we should try to keep the translation as dumb and minimal as possible. So we can exclude this from here.
1601–1620	Thanks for the detailed reply and exploring the options. My question was basically coming from the point of view that `convertRequiresAttr` is generating some llvm IR instructions. So, we could model these instructions as an operation in the OpenMP dialect. But you make a good point (point copied below). but if we had an operation instead of an attribute we would have to search for it in the module to extract this, which I'm not sure it would be a cleaner solution. Also there would be no guarantee that no more than a single occurrence of this operation would be present within the module at all times (e.g. if the input is handwritten MLIR). Can we move this to `getOpenMPBuilder`? This way if we decide to add an omp.module then we can just replace the use of the interface with this module operation. There are a few concerns regarding the amendOperation flow, -> Just like you said about having the operation multiple times, the attribute can also appear multiple times. This can be handled by checking that this attribute is only present in the module. -> There is no equivalent to amendOperation in the the regular Dialect Conversion flow. If some one is using their own OpenMP libraries downstream and would like to use the Dialect Conversion mechanism, then there will be a problem. -> The discardable nature of these attributes. What do you think about this?

kiranchandramohan added inline comments.May 2 2023, 4:52 PM

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1083–1086	The other option is to let the Openmpirbuilder handle this through the config. Why is this (the registration fn and the default ordering) done separately now?

Thank you Kiran for your suggestions. I'd be happy to follow any approach that's considered best for this, so I tried to explain the best I could my thinking about the points you raised and hopefully we can decide on a solution that can inform some of the other related work that's dealing with the same issues, as well.

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1083–1086	Yes, that would be my preference. And it is fine for these instances of atomic operations to have the memory order attribute. Since these are anyway accepted as optional, there is no change in the operation definition. I think in general, we should try to keep the translation as dumb and minimal as possible. So we can exclude this from here. Sure, it makes sense, I will start working on that. If there are no instances of `requires atomic_default_mem_order`, should the semantic pass leave atomic operations unchanged or should it explicitly add the memory order clause with the default value? The other option is to let the Openmpirbuilder handle this through the config. Why is this (the registration fn and the default ordering) done separately now? It made sense to split the 'requires' clauses into two attributes because there are different requirements for generating the registration function and for honoring the `atomic_default_mem_order`. The first should be produced if there are device constructs in the compilation unit, even if there are no 'requires' directives, whereas the second is honored as long as it appears in a 'requires' directive. By splitting these into two MLIR attributes it was possible to just add the ones that applied and avoid generating the registration function when it shouldn't be. With regards to the way in which this patch currently picks up the value of the `omp.atomic_default_mem_order`, I agree that this could have been moved to the `OpenMPIRBuilder` and updated the `createAtomic...` functions to take that into account if an explicit memory order was not given. I only added it where I did because that's where the default was being picked up already, so it required less code changes. I can make this change instead of moving the logic to semantics, but I understand that the other approach is preferred.
1601–1620	Can we move this to `getOpenMPBuilder`? This way if we decide to add an omp.module then we can just replace the use of the interface with this module operation. This could be done there, yes. However, it doesn't seem quite right to me lowering OpenMP-specific attributes inside of ModuleTranslation, even though this case would be a relatively small addition. In the case of adding an `omp.module` later I would assume that the proper spot to lower these attributes would be `OpenMPDialectLLVMIRTranslationInterface::convertOperation`, so it would have to be moved anyways. It makes sense to set up the `OpenMPIRBuilder::Config` in `getOpenMPBuilder` because it's where the OpenMP IR builder is created, so it's configured at the same time. But I'm not sure about doing any lowering there. There are a few concerns regarding the amendOperation flow, -> Just like you said about having the operation multiple times, the attribute can also appear multiple times. This can be handled by checking that this attribute is only present in the module. That's a good point, if we keep this approach I'll add that check. -> There is no equivalent to amendOperation in the the regular Dialect Conversion flow. If some one is using their own OpenMP libraries downstream and would like to use the Dialect Conversion mechanism, then there will be a problem. I'm not quite familiar with the details of the regular dialect conversion flow, so I'd be happy if you could point out if I'm missing something here. The OpenMP dialect currently only has implemented lowering to the LLVM IR dialect via the Dialect Conversion mechanism (mlir/lib/Conversion/OpenMPToLLVM/OpenMPToLLVM.cpp), which handles some but not all of the operations. This is currently triggered during flang compilation (flang/lib/Optimizer/CodeGen/CodeGen.cpp), resulting in an MLIR module with mostly LLVM IR dialect operations and possibly some remaining OpenMP dialect operations. This final MLIR representation is then translated to LLVM IR using the `ModuleTranslation` class, which in turn relies on all these dialect-specific translation interfaces that can define `amendOperation`. If I understand it correctly, the Dialect Conversion mechanism on its own would not be enough to take advantage of all current support for OpenMP lowering regardless of whether we relied on dialect attributes or not, since some operations are translated directly to LLVM IR at the end of the MLIR flow. That last part is what's built on top of the `OpenMPIRBuilder`. So it feels to me like the more general issue really comes down to having the `OpenMPIRBuilder`, which translates OpenMP operations into LLVM IR instead of into LLVM IR dialect operations, rather than to using dialect attributes. However, I do see how it would be possible to move logic from the `OpenMPIRBuilder` to the `ConvertOpenMPToLLVMPass`, but I'm not sure if we could do the same for OpenMP dialect attributes attached to operations from other dialects. The point I'm trying to make, I guess, is that we want to keep the `OpenMPIRBuilder`, so keeping OpenMP dialect-related information as attributes wouldn't make much of a difference as to the limitations of the Dialect Conversion mechanism. But again, I'm not that familiar with this, so if I'm wrong I'd be happy to be corrected and understand it better. -> The discardable nature of these attributes. According to the MLIR Language Reference, discardable attributes are just attributes that have semantics defined externally to the operation. I haven't been able to find any documentation related to the "discardability" of these attributes, but to me it should mean that the operation doesn't become invalid if it's taken away. So it could be the case that printing in text format some operation could ignore them, hence losing that information if that text is used to continue the compilation process. However, if we make sure that this is not the case for the `builtin.module` operation, there is no reason for any transformation/optimization/lowering passes outside of the OpenMP dialect to remove it, so I think it should not be a reason to decide against it. I can see how these kinds of attributes could be lost if, during a pass, the operation they are attached to is replaced by something else without forwarding them or taking them into consideration to impact in some way the results of the process. And it wouldn't make sense to try to make every pass working on those operations aware of these attributes, so it makes sense that they would be discardable in this sense. However, in the case of the `builtin.module` I feel like this should not be a concern, since that kind of situation doesn't seem likely or even something that should be supported.

Thanks @skatrak for the detailed reply. It will be good to discuss this a bit more and once we are all convinced with the approach then we can go fast.

My reply is inline.

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1601–1620	Can we move this to getOpenMPBuilder? This way if we decide to add an omp.module then we can just replace the use of the interface with this module operation. This could be done there, yes. However, it doesn't seem quite right to me lowering OpenMP-specific attributes inside of ModuleTranslation, even though this case would be a relatively small addition. In the case of adding an omp.module later I would assume that the proper spot to lower these attributes would be OpenMPDialectLLVMIRTranslationInterface::convertOperation, so it would have to be moved anyways. It makes sense to set up the OpenMPIRBuilder::Config in getOpenMPBuilder because it's where the OpenMP IR builder is created, so it's configured at the same time. But I'm not sure about doing any lowering there. The concern here was that even though we are representing this as an attribute it seems to have an IR representation in LLVM IR. Also FWIU, it seems that we are generating a global constructor and not a metadata or a function attribute. The fact that it is a global constructor makes me feel that this should be provided as an option to the initializer/constructor in the OpenMPIRBuilder which can internally generate these. There is no equivalent to amendOperation in the regular Dialect Conversion flow. If someone is using their own OpenMP libraries downstream and would like to use the Dialect Conversion mechanism, then there will be a problem. I'm not quite familiar with the details of the regular dialect conversion flow, so I'd be happy if you could point out if I'm missing something here. The OpenMP dialect currently only has implemented lowering to the LLVM IR dialect via the Dialect Conversion mechanism (mlir/lib/Conversion/OpenMPToLLVM/OpenMPToLLVM.cpp), which handles some but not all of the operations. This is currently triggered during flang compilation (flang/lib/Optimizer/CodeGen/CodeGen.cpp), resulting in an MLIR module with mostly LLVM IR dialect operations and possibly some remaining OpenMP dialect operations. This final MLIR representation is then translated to LLVM IR using the ModuleTranslation class, which in turn relies on all these dialect-specific translation interfaces that can define amendOperation. If I understand it correctly, the Dialect Conversion mechanism on its own would not be enough to take advantage of all current support for OpenMP lowering regardless of whether we relied on dialect attributes or not, since some operations are translated directly to LLVM IR at the end of the MLIR flow. That last part is what's built on top of the OpenMPIRBuilder. So it feels to me like the more general issue really comes down to having the OpenMPIRBuilder, which translates OpenMP operations into LLVM IR instead of into LLVM IR dialect operations, rather than to using dialect attributes. However, I do see how it would be possible to move logic from the OpenMPIRBuilder to the ConvertOpenMPToLLVMPass, but I'm not sure if we could do the same for OpenMP dialect attributes attached to operations from other dialects. The point I'm trying to make, I guess, is that we want to keep the OpenMPIRBuilder, so keeping OpenMP dialect-related information as attributes wouldn't make much of a difference as to the limitations of the Dialect Conversion mechanism. But again, I'm not that familiar with this, so if I'm wrong I'd be happy to be corrected and understand it better. This is a general point that I am making here. The OpenMP Dialect sits in MLIR tree and it will have users both in upstream llvm-project as well as downstream. The current method we have chosen to lower the OpenMP dialect is through the OpenMPIRBuilder at the translation phase. This is done so as to share the code with Clang and have a single implementation of OpenMP in llvm-project as far as possible. I am not suggesting a change in this flow. But what we have to keep in mind is that there will be others who might be lowering the OpenMP dialect differently for e.g. because they might have their own OpenMP runtime. So, in the best case, the design of the dialect should not be dependent on the OpenMPIRBuilder lowering, it should be general enough so that other users can also use it. Now the problem here is that AFAIS, the Dialect conversions are usually defined for Operations and we do not have a counterpart for `amendOperation` that takes an Operation and an attribute. So how will a general user lower the OpenMP dialect? The best case here is again that there is an `omp.module` operation that when it undergoes DialectConversion will do all the required changes. With the current representation that we have chosen, they will have to add a custom Conversion Patter for the Builtin Module where the OpenMP attributes are interpreted. I don't know whether this is possible currently. Would be good to give this a try so that we know there is a way for DialectConversion users.

skatrak mentioned this in D149337: [Flang][OpenMP][Semantics] Add semantics checks to requires directive.May 8 2023, 3:12 AM

skatrak added inline comments.May 8 2023, 3:28 AM

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1083–1086	This is now implemented in semantics in patch D149337. I'm currently working on updating the rest of patches to stop using the `omp.atomic_default_mem_order` attribute.

Remove atomic_default_mem_order handling, as it's now done in semantics.

Harbormaster completed remote builds in B230620: Diff 520341.May 8 2023, 6:18 AM

Rebase.

Harbormaster completed remote builds in B230664: Diff 520405.May 8 2023, 10:17 AM

Fix comments.

Harbormaster completed remote builds in B230830: Diff 520641.May 9 2023, 3:10 AM

skatrak added inline comments.May 9 2023, 5:01 AM

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1601–1620	The concern here was that even though we are representing this as an attribute it seems to have an IR representation in LLVM IR. Also FWIU, it seems that we are generating a global constructor and not a metadata or a function attribute. The fact that it is a global constructor makes me feel that this should be provided as an option to the initializer/constructor in the OpenMPIRBuilder which can internally generate these. Yes, in this case we are generating a new function in the IR that's registered as a global constructor, and not metadata/attributes. I see your point, and I think that the approach of moving this code generation to the `OpenMPIRBuilder::initialize` method makes sense as well. The only considerations for doing this I think would be to first set the IRBuilder configuration before calling the initializer (which makes sense, but it's done the other way around at the moment) and letting the initializer know that it should optionally generate this constructor. The flag to tell whether to generate the constructor would be necessary to ensure that it's only produced if there are device constructs in the compilation unit, but it can also serve temporarily to prevent clang from generating it twice, as it currently does not use the IRBuilder for this. Now the problem here is that AFAIS, the Dialect conversions are usually defined for Operations and we do not have a counterpart for `amendOperation` that takes an Operation and an attribute. So how will a general user lower the OpenMP dialect? The best case here is again that there is an `omp.module` operation that when it undergoes DialectConversion will do all the required changes. With the current representation that we have chosen, they will have to add a custom Conversion Patter for the Builtin Module where the OpenMP attributes are interpreted. I don't know whether this is possible currently. Would be good to give this a try so that we know there is a way for DialectConversion users. I can look into this, and see if I can make something like that work. But this requirement does make a good case for defining and using an `omp.module` to hold all these attributes we're working with. I think it's also a similar consideration with function attributes for declare target in D146063. I think we're back to the first discussion here from a few months ago. We've now seen what using dialect attributes looks like, and what the problems related to that could be, and we have to decide whether to stick with them or to follow an alternative approach. The main downside that I see to defining `omp.module` and `omp.func` or similar is the difficulty around being able to rely on the existing infrastructure without breaking optimization/canonicalization/lowering passes. It's probably easy to inadvertently introduce bugs and the extent of the changes to replace `builtin.module` for `omp.module` and `func.func` for `omp.func` would likely be quite large. So would you say that keeping the current approach of representing global OpenMP-specific information with discardable attributes is dependent on there being a clean way of lowering them in MLIR via DialectConversion? In your opinion, is this just an issue for this operation because it produces more than just LLVM IR metadata/attributes, or is it also applicable to other simpler flags (e.g. `omp.is_device`)? @jsjodin, @agozillon, @domada, do you have any other thoughts about this?

agozillon added inline comments.May 9 2023, 6:12 AM

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1601–1620	Not sure it's a good enough case, but that is just my opinion and I'm more than happy to defer the judgement to everyone else, here's my original version of trying to maintain both a builtin.module and an omp.module in Flang for an is_device which passed the test-suite at the time of making: https://github.com/ROCm-Developer-Tools/llvm-project/pull/193/files The dozens of test changes aren't necessarily required (some are, but not quite at that level), however, all of the MLIR opt related components do not play well with other modules as there is some inherent expectation that it is an builtin.module that's being utilised and if it's not, generate one implicitly. So, you can ignore those (although if we want the most sensible user experience for opt passes I think those change are likely required). While I'm very sure someone with more knowledge could do a much better job at shrinking the change list and making a more elegant solution, I think you still very likely are going to run into a lot of issues where bugs are created unintentionally by implementers through catering to only one module type and not considering both, as all an omp.module really is, is a superset of builtin.module, but there is no inheritance for operations in MLIR (although, I did try, but I can't remember the mileage I got out of it) as far as I am aware, so you have to make sure things stay appropriately aligned, if Flang only had one module type, it would likely not be a problem, as you could focus on catering to that sole module type without any regard for another (as is done today). And there is as you said a lot of optimization passes that depend on builtin.modules, a lot of these can be generalized (and I do with a fair number in the pull request), but future passes have to take into consideration the chance of being used with two module types as well. I'm not sure that's the same case for https://reviews.llvm.org/D146063 with function attributes at least as they're never actually lowered in any specialized way based on the attribute, just filtered out (and that might happen in the semantic phase at least partially based on where the current discussion on the patch goes), it might change if we ever want specialised lowering of device functions though. However, for data on GlobalOps, the attribute does act as an indicator that lowering changes are required, e.g. removal of original Global (certain cases), addition of metadata to the module or new globals in addition to the original. So you may end up also having to have an omp.GlobalOp and omp.all that can be made declare target, this might be an exhaustive list based on the Fortran/C++ programming language + OpenMP spec, but it is perhaps not in terms of MLIR dialects as new operations for other dialects that support a mix of omp are made. Although, perhaps the MLIR way is just that they have to convert their operations to the omp operations that support declare target at some point in their pipeline or maybe we only need to care about what exists in terms of LLVM IR / Dialect. We could have a seperate omp.declare_target operation, like I originally tried, but lowering functions if we ever decide to do so becomes a bit painful as any declare targets that resides inside of a function (which I think is the required placement in Fortran for a lot of constructs) has to have some kind of deferral mechanism when it's being lowered to LLVM-IR as convertOperation will process it when the function is being lowered (and so not completely lowered yet). This at least was how it was with my original dialect implementation, you could perhaps design omp.declare_target to act as a region container, similar to target.op, but that may create not so easy to read IR. Anyway, I have droned on a little bit here, so I hope it's somewhat intelligible and useful with not too many silly assumptions on my part although I know there will be some. I don't mind which direction is chosen (I'm sure they'll all have their pros and cons, but I do have a lot of love for the simplicity of the attributes), just that we do chose one!

jsjodin added inline comments.May 9 2023, 9:40 AM

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1601–1620	If we look at the message for the commit that added amendOperation it says: Port the translation of five dialects that define LLVM IR intrinsics (LLVMAVX512, LLVMArmNeon, LLVMArmSVE, NVVM, ROCDL) to the new dialect interface-based mechanism. This allows us to remove individual translations that were created for each of these dialects and just use one common MLIR-to-LLVM-IR translation that potentially supports all dialects instead, based on what is registered and including any combination of translatable dialects. This removal was one of the main goals of the refactoring. To support the addition of GPU-related metadata, the translation interface is extended with the `amendOperation` function that allows the interface implementation to post-process any translated operation with dialect attributes from the dialect for which the interface is implemented regardless of the operation's dialect. This is currently applied to "kernel" functions, but can be used to construct other metadata in dialect-specific ways without necessarily affecting operations. If I understand the wording correctly, the second paragraph says that it is okay to add dialect attributes to operations of a different dialect, and process them using amendOperation, regardless of the operation's dialect. Also it is okay to affect the operation (but not necessarily), so generating code (modifying the module operation) should be okay. So I don't really see a problem using amendOperation if we follow that logic. One issue is that this directly accesses attributes and will not go through any interfaces that have been defined. FWIW, my personal opinion about interfaces is that they are best used to capture abstract or derived properties that are not tied to a specific dialect so that generic analyzes can be written without knowing the specifics of any dialect, and that dialect attributes are owned by a specific dialect and can be used directly. So using amendOperation should be okay imo.

Thanks for the detailed discussion @skatrak @agozillon @jsjodin.

LGTM.

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1601–1620	Not sure it's a good enough case, but that is just my opinion and I'm more than happy to defer the judgement to everyone else, here's my original version of trying to maintain both a builtin.module and an omp.module in Flang for an is_device which passed the test-suite at the time of making: https://github.com/ROCm-Developer-Tools/llvm-project/pull/193/files Thanks for the pointer to this pull-request. As discussed in https://discourse.llvm.org/t/supporting-top-level-ops-other-than-builtin-module/65224, it seems having a non-builtin-module as a top-level Op is a welcome change. It might be worth seeing if some of those changes can make it into upstream MLIR when you get some time.
1601–1620	If we look at the message for the commit that added amendOperation it says: Port the translation of five dialects that define LLVM IR intrinsics (LLVMAVX512, LLVMArmNeon, LLVMArmSVE, NVVM, ROCDL) to the new dialect interface-based mechanism. This allows us to remove individual translations that were created for each of these dialects and just use one common MLIR-to-LLVM-IR translation that potentially supports all dialects instead, based on what is registered and including any combination of translatable dialects. This removal was one of the main goals of the refactoring. To support the addition of GPU-related metadata, the translation interface is extended with the `amendOperation` function that allows the interface implementation to post-process any translated operation with dialect attributes from the dialect for which the interface is implemented regardless of the operation's dialect. This is currently applied to "kernel" functions, but can be used to construct other metadata in dialect-specific ways without necessarily affecting operations. If I understand the wording correctly, the second paragraph says that it is okay to add dialect attributes to operations of a different dialect, and process them using amendOperation, regardless of the operation's dialect. Also it is okay to affect the operation (but not necessarily), so generating code (modifying the module operation) should be okay. So I don't really see a problem using amendOperation if we follow that logic. One issue is that this directly accesses attributes and will not go through any interfaces that have been defined. FWIW, my personal opinion about interfaces is that they are best used to capture abstract or derived properties that are not tied to a specific dialect so that generic analyzes can be written without knowing the specifics of any dialect, and that dialect attributes are owned by a specific dialect and can be used directly. So using amendOperation should be okay imo. The second paragraph, although it says that it can post-process any translated operation, mostly speaks about metadata. We are using it here to generate global variables. The concern is also that there does not seem to be a similar functionality like amendOperation in DialectConversion unlike convertOperation. The interface we are using is just a place-holder to capture all the changes required to enable device offloading/target operations handling. Ideally, finally, that should just dissolve into an `omp.module` operation.
1601–1620	The concern here was that even though we are representing this as an attribute it seems to have an IR representation in LLVM IR. Also FWIU, it seems that we are generating a global constructor and not a metadata or a function attribute. The fact that it is a global constructor makes me feel that this should be provided as an option to the initializer/constructor in the OpenMPIRBuilder which can internally generate these. Yes, in this case we are generating a new function in the IR that's registered as a global constructor, and not metadata/attributes. I see your point, and I think that the approach of moving this code generation to the `OpenMPIRBuilder::initialize` method makes sense as well. The only considerations for doing this I think would be to first set the IRBuilder configuration before calling the initializer (which makes sense, but it's done the other way around at the moment) and letting the initializer know that it should optionally generate this constructor. The flag to tell whether to generate the constructor would be necessary to ensure that it's only produced if there are device constructs in the compilation unit, but it can also serve temporarily to prevent clang from generating it twice, as it currently does not use the IRBuilder for this. Now the problem here is that AFAIS, the Dialect conversions are usually defined for Operations and we do not have a counterpart for `amendOperation` that takes an Operation and an attribute. So how will a general user lower the OpenMP dialect? The best case here is again that there is an `omp.module` operation that when it undergoes DialectConversion will do all the required changes. With the current representation that we have chosen, they will have to add a custom Conversion Patter for the Builtin Module where the OpenMP attributes are interpreted. I don't know whether this is possible currently. Would be good to give this a try so that we know there is a way for DialectConversion users. I can look into this, and see if I can make something like that work. But this requirement does make a good case for defining and using an `omp.module` to hold all these attributes we're working with. I think it's also a similar consideration with function attributes for declare target in D146063. I think we're back to the first discussion here from a few months ago. We've now seen what using dialect attributes looks like, and what the problems related to that could be, and we have to decide whether to stick with them or to follow an alternative approach. The main downside that I see to defining `omp.module` and `omp.func` or similar is the difficulty around being able to rely on the existing infrastructure without breaking optimization/canonicalization/lowering passes. It's probably easy to inadvertently introduce bugs and the extent of the changes to replace `builtin.module` for `omp.module` and `func.func` for `omp.func` would likely be quite large. So would you say that keeping the current approach of representing global OpenMP-specific information with discardable attributes is dependent on there being a clean way of lowering them in MLIR via DialectConversion? In your opinion, is this just an issue for this operation because it produces more than just LLVM IR metadata/attributes, or is it also applicable to other simpler flags (e.g. `omp.is_device`)? The concern is primarily that there does not seem to be something similar to `amendOperation` in DialectConversion. So we are getting into a lowering flow that might not work directly with DialectConversion. Anyway, at the moment, I do not have an alternative suggestion. So we can go ahead with this.

This revision is now accepted and ready to land.May 9 2023, 3:21 PM

skatrak mentioned this in D150275: [WIP] PoC Lowering of dialect attribute on builtin.module via DialectConversion.May 10 2023, 8:42 AM

skatrak added inline comments.May 10 2023, 9:00 AM

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp
1601–1620	Hi Kiran, I spent some time trying to prototype something like what you're describing. A DialectConversion-based way to lower dialect attributes attached to builtin modules, or at least my understanding of what that would look like. I managed to implement something, although I suppose there are still many shortcomings to that approach: D150275. At least I think it shows it's doable. Thank you for all the feedback. I will merge this after all parent patches are approved as well.

skatrak mentioned this in D147217: [OpenMP][OMPIRBuilder] OpenMPIRBuilder support for requires directive.May 31 2023, 10:09 AM

Update and remove OpenMP dialect dependency from the generic LLVM IR translation.

Followed @kiranchandramohan's suggestion in the comments for D147172, since the
current approach prevents the use of OpenMP dialect-specific attributes for
initializing the OpenMPIRBuilder configuration. One of these is defined for
representing 'requires' clauses, which need to be stored in the
OpenMPIRBuilderConfig, so a different approach is necessary. The approach
implemented in this review is based on the amendOperation translation flow.

Herald added a project: Restricted Project. · View Herald TranscriptJun 2 2023, 7:06 AM

Herald added subscribers: cfe-commits, hiraditya. · View Herald Transcript

Harbormaster completed remote builds in B236180: Diff 527854.Jun 2 2023, 9:19 AM

skatrak mentioned this in D147641: [Flang][OpenMP][MLIR] Filter emitted code depending on declare target and device.Jul 14 2023, 10:09 AM

Update patch.

Herald added subscribers: gysit, Dinistro. · View Herald TranscriptAug 17 2023, 8:30 AM

I think the change of approach to initialize the OpenMPIRBuilderConfig through the amendOperation() flow (rather than directly inside of ModuleTranslation::getOpenMPBuilder()) after @kiranchandramohan 's acceptance of the patch requires another review before landing. This patch still depends on a few pending patches so, in case significant changes have to be made to these, I think it's best to leave this review for last.

kiranchandramohan added inline comments.Aug 17 2023, 9:24 AM

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp
1372–1374	This change has to go in another patch reviewed by the general MLIR/LLVM dialect community. Not sure about mentioning too much about OpenMPIRbuilder or the OpenMPDialect here.

Harbormaster completed remote builds in B253228: Diff 551147.Aug 17 2023, 9:39 AM

Update to split generic MLIR into its own patch.

skatrak added a parent revision: D158278: [MLIR] Move builtin.module LLVM IR translation to before nested operations.Aug 18 2023, 7:18 AM

skatrak edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B253486: Diff 551502.Aug 18 2023, 8:43 AM

Update patch.

Harbormaster completed remote builds in B253809: Diff 551968.Aug 21 2023, 4:33 AM

LG.

This revision is now accepted and ready to land.Sep 1 2023, 9:15 AM

Rebase.

Harbormaster completed remote builds in B257137: Diff 556657.Sep 13 2023, 7:08 AM

Closed by commit rG9058762789c0: [OpenMP][Flang][MLIR] Lowering of requires directive from MLIR to LLVM IR (authored by skatrak). · Explain WhySep 14 2023, 2:36 AM

This revision was automatically updated to reflect the committed changes.

skatrak added a commit: rG9058762789c0: [OpenMP][Flang][MLIR] Lowering of requires directive from MLIR to LLVM IR.

Revision Contents

Path

Size

llvm/

include/

llvm/

Frontend/

OpenMP/

OMPIRBuilder.h

1 line

mlir/

lib/

Target/

LLVMIR/

CMakeLists.txt

1 line

Dialect/

OpenMP/

OpenMPToLLVMIRTranslation.cpp

71 lines

ModuleTranslation.cpp

33 lines

test/

Target/

LLVMIR/

openmp-llvm.mlir

48 lines

Diff 518767

llvm/include/llvm/Frontend/OpenMP/OMPIRBuilder.h

Show First 20 Lines • Show All 153 Lines • ▼ Show 20 Lines	if (Separator.has_value())
return *Separator;		return *Separator;
if (isTargetCodegen())		if (isTargetCodegen())
return "$";		return "$";
return ".";		return ".";
}		}

void setIsEmbedded(bool Value) { IsEmbedded = Value; }		void setIsEmbedded(bool Value) { IsEmbedded = Value; }
void setIsTargetCodegen(bool Value) { IsTargetCodegen = Value; }		void setIsTargetCodegen(bool Value) { IsTargetCodegen = Value; }
		void setOpenMPOffloadMandatory(bool Value) { OpenMPOffloadMandatory = Value; }
void setFirstSeparator(StringRef FS) { FirstSeparator = FS; }		void setFirstSeparator(StringRef FS) { FirstSeparator = FS; }
void setSeparator(StringRef S) { Separator = S; }		void setSeparator(StringRef S) { Separator = S; }

void setHasRequiresReverseOffload(bool Value);		void setHasRequiresReverseOffload(bool Value);
void setHasRequiresUnifiedAddress(bool Value);		void setHasRequiresUnifiedAddress(bool Value);
void setHasRequiresUnifiedSharedMemory(bool Value);		void setHasRequiresUnifiedSharedMemory(bool Value);
void setHasRequiresDynamicAllocators(bool Value);		void setHasRequiresDynamicAllocators(bool Value);
};		};
▲ Show 20 Lines • Show All 2,305 Lines • Show Last 20 Lines

mlir/lib/Target/LLVMIR/CMakeLists.txt

Show All 32 Lines	add_mlir_translation_library(MLIRTargetLLVMIRExport
FrontendOpenMP		FrontendOpenMP
TransformUtils		TransformUtils
TargetParser		TargetParser

LINK_LIBS PUBLIC		LINK_LIBS PUBLIC
MLIRDLTIDialect		MLIRDLTIDialect
MLIRLLVMDialect		MLIRLLVMDialect
MLIRLLVMIRTransforms		MLIRLLVMIRTransforms
		MLIROpenMPDialect
MLIRTranslateLib		MLIRTranslateLib
MLIROpenMPDialect		MLIROpenMPDialect
)		)

add_mlir_translation_library(MLIRToLLVMIRTranslationRegistration		add_mlir_translation_library(MLIRToLLVMIRTranslationRegistration
ConvertToLLVMIR.cpp		ConvertToLLVMIR.cpp

LINK_LIBS PUBLIC		LINK_LIBS PUBLIC
Show All 39 Lines

mlir/lib/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.cpp

Show All 10 Lines
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
#include "mlir/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.h"		#include "mlir/Target/LLVMIR/Dialect/OpenMP/OpenMPToLLVMIRTranslation.h"
#include "mlir/Dialect/OpenMP/OpenMPDialect.h"		#include "mlir/Dialect/OpenMP/OpenMPDialect.h"
#include "mlir/Dialect/OpenMP/OpenMPInterfaces.h"		#include "mlir/Dialect/OpenMP/OpenMPInterfaces.h"
#include "mlir/IR/IRMapping.h"		#include "mlir/IR/IRMapping.h"
#include "mlir/IR/Operation.h"		#include "mlir/IR/Operation.h"
#include "mlir/Support/LLVM.h"		#include "mlir/Support/LLVM.h"
		#include "mlir/Support/LogicalResult.h"
#include "mlir/Target/LLVMIR/Dialect/OpenMPCommon.h"		#include "mlir/Target/LLVMIR/Dialect/OpenMPCommon.h"
#include "mlir/Target/LLVMIR/ModuleTranslation.h"		#include "mlir/Target/LLVMIR/ModuleTranslation.h"
#include "mlir/Transforms/RegionUtils.h"		#include "mlir/Transforms/RegionUtils.h"

#include "llvm/ADT/SetVector.h"		#include "llvm/ADT/SetVector.h"
#include "llvm/ADT/TypeSwitch.h"		#include "llvm/ADT/TypeSwitch.h"
#include "llvm/Frontend/OpenMP/OMPConstants.h"		#include "llvm/Frontend/OpenMP/OMPConstants.h"
#include "llvm/Frontend/OpenMP/OMPIRBuilder.h"		#include "llvm/Frontend/OpenMP/OMPIRBuilder.h"
#include "llvm/IR/DebugInfoMetadata.h"		#include "llvm/IR/DebugInfoMetadata.h"
#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
		#include "llvm/Transforms/Utils/ModuleUtils.h"
		#include <optional>

using namespace mlir;		using namespace mlir;

namespace {		namespace {
static llvm::omp::ScheduleKind		static llvm::omp::ScheduleKind
convertToScheduleKind(std::optional<omp::ClauseScheduleKind> schedKind) {		convertToScheduleKind(std::optional<omp::ClauseScheduleKind> schedKind) {
if (!schedKind.has_value())		if (!schedKind.has_value())
return llvm::omp::OMP_SCHEDULE_Default;		return llvm::omp::OMP_SCHEDULE_Default;
▲ Show 20 Lines • Show All 991 Lines • ▼ Show 20 Lines	ompBuilder->applySimd(
loop.getIfExpr() ? moduleTranslation.lookupValue(loop.getIfExpr())		loop.getIfExpr() ? moduleTranslation.lookupValue(loop.getIfExpr())
: nullptr,		: nullptr,
llvm::omp::OrderKind::OMP_ORDER_unknown, simdlen, safelen);		llvm::omp::OrderKind::OMP_ORDER_unknown, simdlen, safelen);

builder.restoreIP(afterIP);		builder.restoreIP(afterIP);
return success();		return success();
}		}

		static std::optional<omp::ClauseMemoryOrderKind>
		getAtomicDefaultMemOrder(Operation &opInst) {
		// Try to get the omp.atomic_default_mem_order attribute, if present
		if (auto offloadModule =
		opInst.getParentOfType<omp::OffloadModuleInterface>())
		return offloadModule.getAtomicDefaultMemOrder();

		return std::nullopt;
		}

/// Convert an Atomic Ordering attribute to llvm::AtomicOrdering.		/// Convert an Atomic Ordering attribute to llvm::AtomicOrdering.
llvm::AtomicOrdering		static llvm::AtomicOrdering
convertAtomicOrdering(std::optional<omp::ClauseMemoryOrderKind> ao) {		convertAtomicOrdering(std::optional<omp::ClauseMemoryOrderKind> ao,
		std::optional<omp::ClauseMemoryOrderKind> defaultAo) {
		// If not specified, try using the default atomic ordering gathered from a
		// requires atomic_mem_default_order clause, if present
if (!ao)		if (!ao)
return llvm::AtomicOrdering::Monotonic; // Default Memory Ordering		ao = defaultAo.value_or(omp::ClauseMemoryOrderKind::Relaxed);

switch (*ao) {		switch (*ao) {
case omp::ClauseMemoryOrderKind::Seq_cst:		case omp::ClauseMemoryOrderKind::Seq_cst:
return llvm::AtomicOrdering::SequentiallyConsistent;		return llvm::AtomicOrdering::SequentiallyConsistent;
case omp::ClauseMemoryOrderKind::Acq_rel:		case omp::ClauseMemoryOrderKind::Acq_rel:
return llvm::AtomicOrdering::AcquireRelease;		return llvm::AtomicOrdering::AcquireRelease;
case omp::ClauseMemoryOrderKind::Acquire:		case omp::ClauseMemoryOrderKind::Acquire:
return llvm::AtomicOrdering::Acquire;		return llvm::AtomicOrdering::Acquire;
case omp::ClauseMemoryOrderKind::Release:		case omp::ClauseMemoryOrderKind::Release:
return llvm::AtomicOrdering::Release;		return llvm::AtomicOrdering::Release;
case omp::ClauseMemoryOrderKind::Relaxed:		case omp::ClauseMemoryOrderKind::Relaxed:
return llvm::AtomicOrdering::Monotonic;		return llvm::AtomicOrdering::Monotonic;
}		}
llvm_unreachable("Unknown ClauseMemoryOrderKind kind");		llvm_unreachable("Unknown ClauseMemoryOrderKind kind");
}		}

/// Convert omp.atomic.read operation to LLVM IR.		/// Convert omp.atomic.read operation to LLVM IR.
static LogicalResult		static LogicalResult
convertOmpAtomicRead(Operation &opInst, llvm::IRBuilderBase &builder,		convertOmpAtomicRead(Operation &opInst, llvm::IRBuilderBase &builder,
LLVM::ModuleTranslation &moduleTranslation) {		LLVM::ModuleTranslation &moduleTranslation) {

auto readOp = cast<omp::AtomicReadOp>(opInst);		auto readOp = cast<omp::AtomicReadOp>(opInst);
llvm::OpenMPIRBuilder *ompBuilder = moduleTranslation.getOpenMPBuilder();		llvm::OpenMPIRBuilder *ompBuilder = moduleTranslation.getOpenMPBuilder();

llvm::OpenMPIRBuilder::LocationDescription ompLoc(builder);		llvm::OpenMPIRBuilder::LocationDescription ompLoc(builder);

llvm::AtomicOrdering AO = convertAtomicOrdering(readOp.getMemoryOrderVal());		auto defaultAO = getAtomicDefaultMemOrder(opInst);
		llvm::AtomicOrdering AO =
		convertAtomicOrdering(readOp.getMemoryOrderVal(), defaultAO);
llvm::Value *x = moduleTranslation.lookupValue(readOp.getX());		llvm::Value *x = moduleTranslation.lookupValue(readOp.getX());
		kiranchandramohanUnsubmitted Done Reply Inline Actions Should this be part of Semantics? kiranchandramohan: Should this be part of Semantics?
		skatrakAuthorUnsubmitted Done Reply Inline Actions It would be possible to gather the `atomic_default_mem_order` clause present in any 'requires' directives in the compilation unit, and then apply it (or 'monotonic', if not present) to all atomic operations with no memory order clauses as a semantic pass. I suppose that would also come together with making the memory order attribute mandatory in MLIR for these operations, since we would be handling the defaults prior to that. Is this a preferable approach? skatrak: It would be possible to gather the `atomic_default_mem_order` clause present in any 'requires'…
		kiranchandramohanUnsubmitted Done Reply Inline Actions Yes, that would be my preference. And it is fine for these instances of atomic operations to have the memory order attribute. Since these are anyway accepted as optional, there is no change in the operation definition. I think in general, we should try to keep the translation as dumb and minimal as possible. So we can exclude this from here. kiranchandramohan: Yes, that would be my preference. And it is fine for these instances of atomic operations to…
		kiranchandramohanUnsubmitted Done Reply Inline Actions The other option is to let the Openmpirbuilder handle this through the config. Why is this (the registration fn and the default ordering) done separately now? kiranchandramohan: The other option is to let the Openmpirbuilder handle this through the config. Why is this (the…
		skatrakAuthorUnsubmitted Done Reply Inline Actions Yes, that would be my preference. And it is fine for these instances of atomic operations to have the memory order attribute. Since these are anyway accepted as optional, there is no change in the operation definition. I think in general, we should try to keep the translation as dumb and minimal as possible. So we can exclude this from here. Sure, it makes sense, I will start working on that. If there are no instances of `requires atomic_default_mem_order`, should the semantic pass leave atomic operations unchanged or should it explicitly add the memory order clause with the default value? The other option is to let the Openmpirbuilder handle this through the config. Why is this (the registration fn and the default ordering) done separately now? It made sense to split the 'requires' clauses into two attributes because there are different requirements for generating the registration function and for honoring the `atomic_default_mem_order`. The first should be produced if there are device constructs in the compilation unit, even if there are no 'requires' directives, whereas the second is honored as long as it appears in a 'requires' directive. By splitting these into two MLIR attributes it was possible to just add the ones that applied and avoid generating the registration function when it shouldn't be. With regards to the way in which this patch currently picks up the value of the `omp.atomic_default_mem_order`, I agree that this could have been moved to the `OpenMPIRBuilder` and updated the `createAtomic...` functions to take that into account if an explicit memory order was not given. I only added it where I did because that's where the default was being picked up already, so it required less code changes. I can make this change instead of moving the logic to semantics, but I understand that the other approach is preferred. skatrak: > Yes, that would be my preference. And it is fine for these instances of atomic operations to…
		skatrakAuthorUnsubmitted Done Reply Inline Actions This is now implemented in semantics in patch D149337. I'm currently working on updating the rest of patches to stop using the `omp.atomic_default_mem_order` attribute. skatrak: This is now implemented in semantics in patch D149337. I'm currently working on updating the…
llvm::Value *v = moduleTranslation.lookupValue(readOp.getV());		llvm::Value *v = moduleTranslation.lookupValue(readOp.getV());

llvm::Type *elementType =		llvm::Type *elementType =
moduleTranslation.convertType(readOp.getElementType());		moduleTranslation.convertType(readOp.getElementType());

llvm::OpenMPIRBuilder::AtomicOpValue V = {v, elementType, false, false};		llvm::OpenMPIRBuilder::AtomicOpValue V = {v, elementType, false, false};
llvm::OpenMPIRBuilder::AtomicOpValue X = {x, elementType, false, false};		llvm::OpenMPIRBuilder::AtomicOpValue X = {x, elementType, false, false};
builder.restoreIP(ompBuilder->createAtomicRead(ompLoc, X, V, AO));		builder.restoreIP(ompBuilder->createAtomicRead(ompLoc, X, V, AO));
return success();		return success();
}		}

/// Converts an omp.atomic.write operation to LLVM IR.		/// Converts an omp.atomic.write operation to LLVM IR.
static LogicalResult		static LogicalResult
convertOmpAtomicWrite(Operation &opInst, llvm::IRBuilderBase &builder,		convertOmpAtomicWrite(Operation &opInst, llvm::IRBuilderBase &builder,
LLVM::ModuleTranslation &moduleTranslation) {		LLVM::ModuleTranslation &moduleTranslation) {
auto writeOp = cast<omp::AtomicWriteOp>(opInst);		auto writeOp = cast<omp::AtomicWriteOp>(opInst);
llvm::OpenMPIRBuilder *ompBuilder = moduleTranslation.getOpenMPBuilder();		llvm::OpenMPIRBuilder *ompBuilder = moduleTranslation.getOpenMPBuilder();

llvm::OpenMPIRBuilder::LocationDescription ompLoc(builder);		llvm::OpenMPIRBuilder::LocationDescription ompLoc(builder);
llvm::AtomicOrdering ao = convertAtomicOrdering(writeOp.getMemoryOrderVal());		auto defaultAO = getAtomicDefaultMemOrder(opInst);
		llvm::AtomicOrdering ao =
		convertAtomicOrdering(writeOp.getMemoryOrderVal(), defaultAO);
llvm::Value *expr = moduleTranslation.lookupValue(writeOp.getValue());		llvm::Value *expr = moduleTranslation.lookupValue(writeOp.getValue());
llvm::Value *dest = moduleTranslation.lookupValue(writeOp.getAddress());		llvm::Value *dest = moduleTranslation.lookupValue(writeOp.getAddress());
llvm::Type *ty = moduleTranslation.convertType(writeOp.getValue().getType());		llvm::Type *ty = moduleTranslation.convertType(writeOp.getValue().getType());
llvm::OpenMPIRBuilder::AtomicOpValue x = {dest, ty, /isSigned=/false,		llvm::OpenMPIRBuilder::AtomicOpValue x = {dest, ty, /isSigned=/false,
/isVolatile=/false};		/isVolatile=/false};
builder.restoreIP(ompBuilder->createAtomicWrite(ompLoc, x, expr, ao));		builder.restoreIP(ompBuilder->createAtomicWrite(ompLoc, x, expr, ao));
return success();		return success();
}		}
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	convertOmpAtomicUpdate(omp::AtomicUpdateOp &opInst,
llvm::Value *llvmExpr = moduleTranslation.lookupValue(mlirExpr);		llvm::Value *llvmExpr = moduleTranslation.lookupValue(mlirExpr);
llvm::Value *llvmX = moduleTranslation.lookupValue(opInst.getX());		llvm::Value *llvmX = moduleTranslation.lookupValue(opInst.getX());
llvm::Type *llvmXElementType = moduleTranslation.convertType(		llvm::Type *llvmXElementType = moduleTranslation.convertType(
opInst.getRegion().getArgument(0).getType());		opInst.getRegion().getArgument(0).getType());
llvm::OpenMPIRBuilder::AtomicOpValue llvmAtomicX = {llvmX, llvmXElementType,		llvm::OpenMPIRBuilder::AtomicOpValue llvmAtomicX = {llvmX, llvmXElementType,
/isSigned=/false,		/isSigned=/false,
/isVolatile=/false};		/isVolatile=/false};

		auto defaultAO = getAtomicDefaultMemOrder(*opInst.getOperation());
llvm::AtomicOrdering atomicOrdering =		llvm::AtomicOrdering atomicOrdering =
convertAtomicOrdering(opInst.getMemoryOrderVal());		convertAtomicOrdering(opInst.getMemoryOrderVal(), defaultAO);

// Generate update code.		// Generate update code.
LogicalResult updateGenStatus = success();		LogicalResult updateGenStatus = success();
auto updateFn = [&opInst, &moduleTranslation, &updateGenStatus](		auto updateFn = [&opInst, &moduleTranslation, &updateGenStatus](
llvm::Value *atomicx,		llvm::Value *atomicx,
llvm::IRBuilder<> &builder) -> llvm::Value * {		llvm::IRBuilder<> &builder) -> llvm::Value * {
Block &bb = *opInst.getRegion().begin();		Block &bb = *opInst.getRegion().begin();
moduleTranslation.mapValue(*opInst.getRegion().args_begin(), atomicx);		moduleTranslation.mapValue(*opInst.getRegion().args_begin(), atomicx);
▲ Show 20 Lines • Show All 71 Lines • ▼ Show 20 Lines	llvm::Type *llvmXElementType = moduleTranslation.convertType(
atomicCaptureOp.getAtomicReadOp().getElementType());		atomicCaptureOp.getAtomicReadOp().getElementType());
llvm::OpenMPIRBuilder::AtomicOpValue llvmAtomicX = {llvmX, llvmXElementType,		llvm::OpenMPIRBuilder::AtomicOpValue llvmAtomicX = {llvmX, llvmXElementType,
/isSigned=/false,		/isSigned=/false,
/isVolatile=/false};		/isVolatile=/false};
llvm::OpenMPIRBuilder::AtomicOpValue llvmAtomicV = {llvmV, llvmXElementType,		llvm::OpenMPIRBuilder::AtomicOpValue llvmAtomicV = {llvmV, llvmXElementType,
/isSigned=/false,		/isSigned=/false,
/isVolatile=/false};		/isVolatile=/false};

		auto defaultAO = getAtomicDefaultMemOrder(*atomicCaptureOp.getOperation());
llvm::AtomicOrdering atomicOrdering =		llvm::AtomicOrdering atomicOrdering =
convertAtomicOrdering(atomicCaptureOp.getMemoryOrderVal());		convertAtomicOrdering(atomicCaptureOp.getMemoryOrderVal(), defaultAO);

LogicalResult updateGenStatus = success();		LogicalResult updateGenStatus = success();
auto updateFn = [&](llvm::Value *atomicx,		auto updateFn = [&](llvm::Value *atomicx,
llvm::IRBuilder<> &builder) -> llvm::Value * {		llvm::IRBuilder<> &builder) -> llvm::Value * {
if (atomicWriteOp)		if (atomicWriteOp)
return moduleTranslation.lookupValue(atomicWriteOp.getValue());		return moduleTranslation.lookupValue(atomicWriteOp.getValue());
Block &bb = *atomicUpdateOp.getRegion().begin();		Block &bb = *atomicUpdateOp.getRegion().begin();
moduleTranslation.mapValue(*atomicUpdateOp.getRegion().args_begin(),		moduleTranslation.mapValue(*atomicUpdateOp.getRegion().args_begin(),
▲ Show 20 Lines • Show All 320 Lines • ▼ Show 20 Lines	ompBuilder->createGlobalFlag(
attribute		attribute
.getAssumeNoNestedParallelism() /LangOpts().OpenMPNoNestedParallelism/		.getAssumeNoNestedParallelism() /LangOpts().OpenMPNoNestedParallelism/
,		,
"__omp_rtl_assume_no_nested_parallelism");		"__omp_rtl_assume_no_nested_parallelism");

return success();		return success();
}		}

static bool getTargetEntryUniqueInfo(llvm::TargetRegionEntryInfo &targetInfo,		static bool getTargetEntryUniqueInfo(llvm::TargetRegionEntryInfo &targetInfo,
omp::TargetOp targetOp,		omp::TargetOp targetOp,
llvm::StringRef parentName = "") {		llvm::StringRef parentName = "") {
auto fileLoc = targetOp.getLoc()->findInstanceOf<FileLineColLoc>();		auto fileLoc = targetOp.getLoc()->findInstanceOf<FileLineColLoc>();

assert(fileLoc && "No file found from location");		assert(fileLoc && "No file found from location");
StringRef fileName = fileLoc.getFilename().getValue();		StringRef fileName = fileLoc.getFilename().getValue();

llvm::sys::fs::UniqueID id;		llvm::sys::fs::UniqueID id;
if (auto ec = llvm::sys::fs::getUniqueID(fileName, id)) {		if (auto ec = llvm::sys::fs::getUniqueID(fileName, id)) {
targetOp.emitError("Unable to get unique ID for file");		targetOp.emitError("Unable to get unique ID for file");
return false;		return false;
}		}

uint64_t line = fileLoc.getLine();		uint64_t line = fileLoc.getLine();
targetInfo = llvm::TargetRegionEntryInfo(parentName, id.getDevice(),		targetInfo = llvm::TargetRegionEntryInfo(parentName, id.getDevice(),
id.getFile(), line);		id.getFile(), line);
return true;		return true;
}		}

		kiranchandramohanUnsubmitted Done Reply Inline Actions Can this be an operation in the OpenMP dialect? kiranchandramohan: Can this be an operation in the OpenMP dialect?
		skatrakAuthorUnsubmitted Done Reply Inline Actions Do you mean removing the `omp.requires` attribute and instead having an `omp.requires` operation to be specified similarly to this? module { omp.requires { clauses = #omp<clause_requires reverse_offload\|unified_shared_memory> } ... } Doing this would allow us to lower the operation in `OpenMPDialectLLVMIRTranslationInterface::convertOperation` instead of `OpenMPDialectLLVMIRTranslationInterface::amendOperation`, but the clauses from this directive must be used to modify the lowering of some other operations, not just to produce the registration function. The current approach allows accessing this information through the `OffloadModuleInterface` to be passed along to the `OpenMPIRBuilder` (in ModuleTranslation.cpp, currently), but if we had an operation instead of an attribute we would have to search for it in the module to extract this, which I'm not sure it would be a cleaner solution. Also there would be no guarantee that no more than a single occurrence of this operation would be present within the module at all times (e.g. if the input is handwritten MLIR). Or is your suggestion to lower every instance of a 'requires' directive in Fortran to its own `omp.requires` operation at the spot where it appears? Something like this: module { func.func @f() { omp.requires { clauses = #omp<clause_requires reverse_offload> } ... } ... func.func @g() { omp.requires { clauses = #omp<clause_requires unified_shared_memory> } ... } } The issue in that case becomes where to gather all of the clauses and lower to a single registration function in LLVM IR, since `convertOperation` would be called once per instance otherwise. The gathering of the clauses coming from different instances of 'requires' would currently be done in the bridge, so that the single module attribute is produced: D147218. I suppose a solution to issues on both approaches would be to update the 'requires' flags in the `OpenMPIRBuilder::Config` structure in `convertOperation` and replace the `OpenMPIRBuilder::createRegisterRequires` function for a new `OpenMPIRBuilder::createOrUpdateRegisterRequires`, and make sure that only a single registration function is generated, eventually with the whole set of flags passed to the runtime (each subsequent call would update the 'flags' argument instead of creating the function and registering it as a constructor). I guess it wouldn't be an issue the fact that the config structure wouldn't have the complete set of clauses at all times, since they are supposed (and checked in semantics: D149337) to come before any operations that may be affected by them. But I'm also not sure about how clean this solution would be either. Let me know what your thoughts are on whether any of these approaches to supporting 'requires' via an operation is preferable to an attribute, or if you have another idea, since any of these changes would result in multiple updates across all related pending patches so I want to make sure we agree on an approach before committing the time. skatrak: Do you mean removing the `omp.requires` attribute and instead having an `omp.requires`…
		kiranchandramohanUnsubmitted Done Reply Inline Actions Thanks for the detailed reply and exploring the options. My question was basically coming from the point of view that `convertRequiresAttr` is generating some llvm IR instructions. So, we could model these instructions as an operation in the OpenMP dialect. But you make a good point (point copied below). but if we had an operation instead of an attribute we would have to search for it in the module to extract this, which I'm not sure it would be a cleaner solution. Also there would be no guarantee that no more than a single occurrence of this operation would be present within the module at all times (e.g. if the input is handwritten MLIR). Can we move this to `getOpenMPBuilder`? This way if we decide to add an omp.module then we can just replace the use of the interface with this module operation. There are a few concerns regarding the amendOperation flow, -> Just like you said about having the operation multiple times, the attribute can also appear multiple times. This can be handled by checking that this attribute is only present in the module. -> There is no equivalent to amendOperation in the the regular Dialect Conversion flow. If some one is using their own OpenMP libraries downstream and would like to use the Dialect Conversion mechanism, then there will be a problem. -> The discardable nature of these attributes. What do you think about this? kiranchandramohan: Thanks for the detailed reply and exploring the options. My question was basically coming from…
		skatrakAuthorUnsubmitted Done Reply Inline Actions Can we move this to `getOpenMPBuilder`? This way if we decide to add an omp.module then we can just replace the use of the interface with this module operation. This could be done there, yes. However, it doesn't seem quite right to me lowering OpenMP-specific attributes inside of ModuleTranslation, even though this case would be a relatively small addition. In the case of adding an `omp.module` later I would assume that the proper spot to lower these attributes would be `OpenMPDialectLLVMIRTranslationInterface::convertOperation`, so it would have to be moved anyways. It makes sense to set up the `OpenMPIRBuilder::Config` in `getOpenMPBuilder` because it's where the OpenMP IR builder is created, so it's configured at the same time. But I'm not sure about doing any lowering there. There are a few concerns regarding the amendOperation flow, -> Just like you said about having the operation multiple times, the attribute can also appear multiple times. This can be handled by checking that this attribute is only present in the module. That's a good point, if we keep this approach I'll add that check. -> There is no equivalent to amendOperation in the the regular Dialect Conversion flow. If some one is using their own OpenMP libraries downstream and would like to use the Dialect Conversion mechanism, then there will be a problem. I'm not quite familiar with the details of the regular dialect conversion flow, so I'd be happy if you could point out if I'm missing something here. The OpenMP dialect currently only has implemented lowering to the LLVM IR dialect via the Dialect Conversion mechanism (mlir/lib/Conversion/OpenMPToLLVM/OpenMPToLLVM.cpp), which handles some but not all of the operations. This is currently triggered during flang compilation (flang/lib/Optimizer/CodeGen/CodeGen.cpp), resulting in an MLIR module with mostly LLVM IR dialect operations and possibly some remaining OpenMP dialect operations. This final MLIR representation is then translated to LLVM IR using the `ModuleTranslation` class, which in turn relies on all these dialect-specific translation interfaces that can define `amendOperation`. If I understand it correctly, the Dialect Conversion mechanism on its own would not be enough to take advantage of all current support for OpenMP lowering regardless of whether we relied on dialect attributes or not, since some operations are translated directly to LLVM IR at the end of the MLIR flow. That last part is what's built on top of the `OpenMPIRBuilder`. So it feels to me like the more general issue really comes down to having the `OpenMPIRBuilder`, which translates OpenMP operations into LLVM IR instead of into LLVM IR dialect operations, rather than to using dialect attributes. However, I do see how it would be possible to move logic from the `OpenMPIRBuilder` to the `ConvertOpenMPToLLVMPass`, but I'm not sure if we could do the same for OpenMP dialect attributes attached to operations from other dialects. The point I'm trying to make, I guess, is that we want to keep the `OpenMPIRBuilder`, so keeping OpenMP dialect-related information as attributes wouldn't make much of a difference as to the limitations of the Dialect Conversion mechanism. But again, I'm not that familiar with this, so if I'm wrong I'd be happy to be corrected and understand it better. -> The discardable nature of these attributes. According to the MLIR Language Reference, discardable attributes are just attributes that have semantics defined externally to the operation. I haven't been able to find any documentation related to the "discardability" of these attributes, but to me it should mean that the operation doesn't become invalid if it's taken away. So it could be the case that printing in text format some operation could ignore them, hence losing that information if that text is used to continue the compilation process. However, if we make sure that this is not the case for the `builtin.module` operation, there is no reason for any transformation/optimization/lowering passes outside of the OpenMP dialect to remove it, so I think it should not be a reason to decide against it. I can see how these kinds of attributes could be lost if, during a pass, the operation they are attached to is replaced by something else without forwarding them or taking them into consideration to impact in some way the results of the process. And it wouldn't make sense to try to make every pass working on those operations aware of these attributes, so it makes sense that they would be discardable in this sense. However, in the case of the `builtin.module` I feel like this should not be a concern, since that kind of situation doesn't seem likely or even something that should be supported. skatrak: > Can we move this to `getOpenMPBuilder`? This way if we decide to add an omp.module then we…
		kiranchandramohanUnsubmitted Done Reply Inline Actions Can we move this to getOpenMPBuilder? This way if we decide to add an omp.module then we can just replace the use of the interface with this module operation. This could be done there, yes. However, it doesn't seem quite right to me lowering OpenMP-specific attributes inside of ModuleTranslation, even though this case would be a relatively small addition. In the case of adding an omp.module later I would assume that the proper spot to lower these attributes would be OpenMPDialectLLVMIRTranslationInterface::convertOperation, so it would have to be moved anyways. It makes sense to set up the OpenMPIRBuilder::Config in getOpenMPBuilder because it's where the OpenMP IR builder is created, so it's configured at the same time. But I'm not sure about doing any lowering there. The concern here was that even though we are representing this as an attribute it seems to have an IR representation in LLVM IR. Also FWIU, it seems that we are generating a global constructor and not a metadata or a function attribute. The fact that it is a global constructor makes me feel that this should be provided as an option to the initializer/constructor in the OpenMPIRBuilder which can internally generate these. There is no equivalent to amendOperation in the regular Dialect Conversion flow. If someone is using their own OpenMP libraries downstream and would like to use the Dialect Conversion mechanism, then there will be a problem. I'm not quite familiar with the details of the regular dialect conversion flow, so I'd be happy if you could point out if I'm missing something here. The OpenMP dialect currently only has implemented lowering to the LLVM IR dialect via the Dialect Conversion mechanism (mlir/lib/Conversion/OpenMPToLLVM/OpenMPToLLVM.cpp), which handles some but not all of the operations. This is currently triggered during flang compilation (flang/lib/Optimizer/CodeGen/CodeGen.cpp), resulting in an MLIR module with mostly LLVM IR dialect operations and possibly some remaining OpenMP dialect operations. This final MLIR representation is then translated to LLVM IR using the ModuleTranslation class, which in turn relies on all these dialect-specific translation interfaces that can define amendOperation. If I understand it correctly, the Dialect Conversion mechanism on its own would not be enough to take advantage of all current support for OpenMP lowering regardless of whether we relied on dialect attributes or not, since some operations are translated directly to LLVM IR at the end of the MLIR flow. That last part is what's built on top of the OpenMPIRBuilder. So it feels to me like the more general issue really comes down to having the OpenMPIRBuilder, which translates OpenMP operations into LLVM IR instead of into LLVM IR dialect operations, rather than to using dialect attributes. However, I do see how it would be possible to move logic from the OpenMPIRBuilder to the ConvertOpenMPToLLVMPass, but I'm not sure if we could do the same for OpenMP dialect attributes attached to operations from other dialects. The point I'm trying to make, I guess, is that we want to keep the OpenMPIRBuilder, so keeping OpenMP dialect-related information as attributes wouldn't make much of a difference as to the limitations of the Dialect Conversion mechanism. But again, I'm not that familiar with this, so if I'm wrong I'd be happy to be corrected and understand it better. This is a general point that I am making here. The OpenMP Dialect sits in MLIR tree and it will have users both in upstream llvm-project as well as downstream. The current method we have chosen to lower the OpenMP dialect is through the OpenMPIRBuilder at the translation phase. This is done so as to share the code with Clang and have a single implementation of OpenMP in llvm-project as far as possible. I am not suggesting a change in this flow. But what we have to keep in mind is that there will be others who might be lowering the OpenMP dialect differently for e.g. because they might have their own OpenMP runtime. So, in the best case, the design of the dialect should not be dependent on the OpenMPIRBuilder lowering, it should be general enough so that other users can also use it. Now the problem here is that AFAIS, the Dialect conversions are usually defined for Operations and we do not have a counterpart for `amendOperation` that takes an Operation and an attribute. So how will a general user lower the OpenMP dialect? The best case here is again that there is an `omp.module` operation that when it undergoes DialectConversion will do all the required changes. With the current representation that we have chosen, they will have to add a custom Conversion Patter for the Builtin Module where the OpenMP attributes are interpreted. I don't know whether this is possible currently. Would be good to give this a try so that we know there is a way for DialectConversion users. kiranchandramohan: >> Can we move this to getOpenMPBuilder? This way if we decide to add an omp.module then we can…
		skatrakAuthorUnsubmitted Done Reply Inline Actions The concern here was that even though we are representing this as an attribute it seems to have an IR representation in LLVM IR. Also FWIU, it seems that we are generating a global constructor and not a metadata or a function attribute. The fact that it is a global constructor makes me feel that this should be provided as an option to the initializer/constructor in the OpenMPIRBuilder which can internally generate these. Yes, in this case we are generating a new function in the IR that's registered as a global constructor, and not metadata/attributes. I see your point, and I think that the approach of moving this code generation to the `OpenMPIRBuilder::initialize` method makes sense as well. The only considerations for doing this I think would be to first set the IRBuilder configuration before calling the initializer (which makes sense, but it's done the other way around at the moment) and letting the initializer know that it should optionally generate this constructor. The flag to tell whether to generate the constructor would be necessary to ensure that it's only produced if there are device constructs in the compilation unit, but it can also serve temporarily to prevent clang from generating it twice, as it currently does not use the IRBuilder for this. Now the problem here is that AFAIS, the Dialect conversions are usually defined for Operations and we do not have a counterpart for `amendOperation` that takes an Operation and an attribute. So how will a general user lower the OpenMP dialect? The best case here is again that there is an `omp.module` operation that when it undergoes DialectConversion will do all the required changes. With the current representation that we have chosen, they will have to add a custom Conversion Patter for the Builtin Module where the OpenMP attributes are interpreted. I don't know whether this is possible currently. Would be good to give this a try so that we know there is a way for DialectConversion users. I can look into this, and see if I can make something like that work. But this requirement does make a good case for defining and using an `omp.module` to hold all these attributes we're working with. I think it's also a similar consideration with function attributes for declare target in D146063. I think we're back to the first discussion here from a few months ago. We've now seen what using dialect attributes looks like, and what the problems related to that could be, and we have to decide whether to stick with them or to follow an alternative approach. The main downside that I see to defining `omp.module` and `omp.func` or similar is the difficulty around being able to rely on the existing infrastructure without breaking optimization/canonicalization/lowering passes. It's probably easy to inadvertently introduce bugs and the extent of the changes to replace `builtin.module` for `omp.module` and `func.func` for `omp.func` would likely be quite large. So would you say that keeping the current approach of representing global OpenMP-specific information with discardable attributes is dependent on there being a clean way of lowering them in MLIR via DialectConversion? In your opinion, is this just an issue for this operation because it produces more than just LLVM IR metadata/attributes, or is it also applicable to other simpler flags (e.g. `omp.is_device`)? @jsjodin, @agozillon, @domada, do you have any other thoughts about this? skatrak: > The concern here was that even though we are representing this as an attribute it seems to…
		agozillonUnsubmitted Done Reply Inline Actions Not sure it's a good enough case, but that is just my opinion and I'm more than happy to defer the judgement to everyone else, here's my original version of trying to maintain both a builtin.module and an omp.module in Flang for an is_device which passed the test-suite at the time of making: https://github.com/ROCm-Developer-Tools/llvm-project/pull/193/files The dozens of test changes aren't necessarily required (some are, but not quite at that level), however, all of the MLIR opt related components do not play well with other modules as there is some inherent expectation that it is an builtin.module that's being utilised and if it's not, generate one implicitly. So, you can ignore those (although if we want the most sensible user experience for opt passes I think those change are likely required). While I'm very sure someone with more knowledge could do a much better job at shrinking the change list and making a more elegant solution, I think you still very likely are going to run into a lot of issues where bugs are created unintentionally by implementers through catering to only one module type and not considering both, as all an omp.module really is, is a superset of builtin.module, but there is no inheritance for operations in MLIR (although, I did try, but I can't remember the mileage I got out of it) as far as I am aware, so you have to make sure things stay appropriately aligned, if Flang only had one module type, it would likely not be a problem, as you could focus on catering to that sole module type without any regard for another (as is done today). And there is as you said a lot of optimization passes that depend on builtin.modules, a lot of these can be generalized (and I do with a fair number in the pull request), but future passes have to take into consideration the chance of being used with two module types as well. I'm not sure that's the same case for https://reviews.llvm.org/D146063 with function attributes at least as they're never actually lowered in any specialized way based on the attribute, just filtered out (and that might happen in the semantic phase at least partially based on where the current discussion on the patch goes), it might change if we ever want specialised lowering of device functions though. However, for data on GlobalOps, the attribute does act as an indicator that lowering changes are required, e.g. removal of original Global (certain cases), addition of metadata to the module or new globals in addition to the original. So you may end up also having to have an omp.GlobalOp and omp.all that can be made declare target, this might be an exhaustive list based on the Fortran/C++ programming language + OpenMP spec, but it is perhaps not in terms of MLIR dialects as new operations for other dialects that support a mix of omp are made. Although, perhaps the MLIR way is just that they have to convert their operations to the omp operations that support declare target at some point in their pipeline or maybe we only need to care about what exists in terms of LLVM IR / Dialect. We could have a seperate omp.declare_target operation, like I originally tried, but lowering functions if we ever decide to do so becomes a bit painful as any declare targets that resides inside of a function (which I think is the required placement in Fortran for a lot of constructs) has to have some kind of deferral mechanism when it's being lowered to LLVM-IR as convertOperation will process it when the function is being lowered (and so not completely lowered yet). This at least was how it was with my original dialect implementation, you could perhaps design omp.declare_target to act as a region container, similar to target.op, but that may create not so easy to read IR. Anyway, I have droned on a little bit here, so I hope it's somewhat intelligible and useful with not too many silly assumptions on my part although I know there will be some. I don't mind which direction is chosen (I'm sure they'll all have their pros and cons, but I do have a lot of love for the simplicity of the attributes), just that we do chose one! agozillon: Not sure it's a good enough case, but that is just my opinion and I'm more than happy to defer…
		jsjodinUnsubmitted Done Reply Inline Actions If we look at the message for the commit that added amendOperation it says: Port the translation of five dialects that define LLVM IR intrinsics (LLVMAVX512, LLVMArmNeon, LLVMArmSVE, NVVM, ROCDL) to the new dialect interface-based mechanism. This allows us to remove individual translations that were created for each of these dialects and just use one common MLIR-to-LLVM-IR translation that potentially supports all dialects instead, based on what is registered and including any combination of translatable dialects. This removal was one of the main goals of the refactoring. To support the addition of GPU-related metadata, the translation interface is extended with the `amendOperation` function that allows the interface implementation to post-process any translated operation with dialect attributes from the dialect for which the interface is implemented regardless of the operation's dialect. This is currently applied to "kernel" functions, but can be used to construct other metadata in dialect-specific ways without necessarily affecting operations. If I understand the wording correctly, the second paragraph says that it is okay to add dialect attributes to operations of a different dialect, and process them using amendOperation, regardless of the operation's dialect. Also it is okay to affect the operation (but not necessarily), so generating code (modifying the module operation) should be okay. So I don't really see a problem using amendOperation if we follow that logic. One issue is that this directly accesses attributes and will not go through any interfaces that have been defined. FWIW, my personal opinion about interfaces is that they are best used to capture abstract or derived properties that are not tied to a specific dialect so that generic analyzes can be written without knowing the specifics of any dialect, and that dialect attributes are owned by a specific dialect and can be used directly. So using amendOperation should be okay imo. jsjodin: If we look at the message for the commit that added amendOperation it says: Port the…
		kiranchandramohanUnsubmitted Done Reply Inline Actions If we look at the message for the commit that added amendOperation it says: Port the translation of five dialects that define LLVM IR intrinsics (LLVMAVX512, LLVMArmNeon, LLVMArmSVE, NVVM, ROCDL) to the new dialect interface-based mechanism. This allows us to remove individual translations that were created for each of these dialects and just use one common MLIR-to-LLVM-IR translation that potentially supports all dialects instead, based on what is registered and including any combination of translatable dialects. This removal was one of the main goals of the refactoring. To support the addition of GPU-related metadata, the translation interface is extended with the `amendOperation` function that allows the interface implementation to post-process any translated operation with dialect attributes from the dialect for which the interface is implemented regardless of the operation's dialect. This is currently applied to "kernel" functions, but can be used to construct other metadata in dialect-specific ways without necessarily affecting operations. If I understand the wording correctly, the second paragraph says that it is okay to add dialect attributes to operations of a different dialect, and process them using amendOperation, regardless of the operation's dialect. Also it is okay to affect the operation (but not necessarily), so generating code (modifying the module operation) should be okay. So I don't really see a problem using amendOperation if we follow that logic. One issue is that this directly accesses attributes and will not go through any interfaces that have been defined. FWIW, my personal opinion about interfaces is that they are best used to capture abstract or derived properties that are not tied to a specific dialect so that generic analyzes can be written without knowing the specifics of any dialect, and that dialect attributes are owned by a specific dialect and can be used directly. So using amendOperation should be okay imo. The second paragraph, although it says that it can post-process any translated operation, mostly speaks about metadata. We are using it here to generate global variables. The concern is also that there does not seem to be a similar functionality like amendOperation in DialectConversion unlike convertOperation. The interface we are using is just a place-holder to capture all the changes required to enable device offloading/target operations handling. Ideally, finally, that should just dissolve into an `omp.module` operation. kiranchandramohan: > If we look at the message for the commit that added amendOperation it says: > Port the…
		kiranchandramohanUnsubmitted Done Reply Inline Actions Not sure it's a good enough case, but that is just my opinion and I'm more than happy to defer the judgement to everyone else, here's my original version of trying to maintain both a builtin.module and an omp.module in Flang for an is_device which passed the test-suite at the time of making: https://github.com/ROCm-Developer-Tools/llvm-project/pull/193/files Thanks for the pointer to this pull-request. As discussed in https://discourse.llvm.org/t/supporting-top-level-ops-other-than-builtin-module/65224, it seems having a non-builtin-module as a top-level Op is a welcome change. It might be worth seeing if some of those changes can make it into upstream MLIR when you get some time. kiranchandramohan: > Not sure it's a good enough case, but that is just my opinion and I'm more than happy to…
		kiranchandramohanUnsubmitted Done Reply Inline Actions The concern here was that even though we are representing this as an attribute it seems to have an IR representation in LLVM IR. Also FWIU, it seems that we are generating a global constructor and not a metadata or a function attribute. The fact that it is a global constructor makes me feel that this should be provided as an option to the initializer/constructor in the OpenMPIRBuilder which can internally generate these. Yes, in this case we are generating a new function in the IR that's registered as a global constructor, and not metadata/attributes. I see your point, and I think that the approach of moving this code generation to the `OpenMPIRBuilder::initialize` method makes sense as well. The only considerations for doing this I think would be to first set the IRBuilder configuration before calling the initializer (which makes sense, but it's done the other way around at the moment) and letting the initializer know that it should optionally generate this constructor. The flag to tell whether to generate the constructor would be necessary to ensure that it's only produced if there are device constructs in the compilation unit, but it can also serve temporarily to prevent clang from generating it twice, as it currently does not use the IRBuilder for this. Now the problem here is that AFAIS, the Dialect conversions are usually defined for Operations and we do not have a counterpart for `amendOperation` that takes an Operation and an attribute. So how will a general user lower the OpenMP dialect? The best case here is again that there is an `omp.module` operation that when it undergoes DialectConversion will do all the required changes. With the current representation that we have chosen, they will have to add a custom Conversion Patter for the Builtin Module where the OpenMP attributes are interpreted. I don't know whether this is possible currently. Would be good to give this a try so that we know there is a way for DialectConversion users. I can look into this, and see if I can make something like that work. But this requirement does make a good case for defining and using an `omp.module` to hold all these attributes we're working with. I think it's also a similar consideration with function attributes for declare target in D146063. I think we're back to the first discussion here from a few months ago. We've now seen what using dialect attributes looks like, and what the problems related to that could be, and we have to decide whether to stick with them or to follow an alternative approach. The main downside that I see to defining `omp.module` and `omp.func` or similar is the difficulty around being able to rely on the existing infrastructure without breaking optimization/canonicalization/lowering passes. It's probably easy to inadvertently introduce bugs and the extent of the changes to replace `builtin.module` for `omp.module` and `func.func` for `omp.func` would likely be quite large. So would you say that keeping the current approach of representing global OpenMP-specific information with discardable attributes is dependent on there being a clean way of lowering them in MLIR via DialectConversion? In your opinion, is this just an issue for this operation because it produces more than just LLVM IR metadata/attributes, or is it also applicable to other simpler flags (e.g. `omp.is_device`)? The concern is primarily that there does not seem to be something similar to `amendOperation` in DialectConversion. So we are getting into a lowering flow that might not work directly with DialectConversion. Anyway, at the moment, I do not have an alternative suggestion. So we can go ahead with this. kiranchandramohan: > > The concern here was that even though we are representing this as an attribute it seems to…
		skatrakAuthorUnsubmitted Done Reply Inline Actions Hi Kiran, I spent some time trying to prototype something like what you're describing. A DialectConversion-based way to lower dialect attributes attached to builtin modules, or at least my understanding of what that would look like. I managed to implement something, although I suppose there are still many shortcomings to that approach: D150275. At least I think it shows it's doable. Thank you for all the feedback. I will merge this after all parent patches are approved as well. skatrak: Hi Kiran, I spent some time trying to prototype something like what you're describing. A…
static bool targetOpSupported(Operation &opInst) {		static bool targetOpSupported(Operation &opInst) {
auto targetOp = cast<omp::TargetOp>(opInst);		auto targetOp = cast<omp::TargetOp>(opInst);
if (targetOp.getIfExpr()) {		if (targetOp.getIfExpr()) {
opInst.emitError("If clause not yet supported");		opInst.emitError("If clause not yet supported");
return false;		return false;
}		}

if (targetOp.getDevice()) {		if (targetOp.getDevice()) {
▲ Show 20 Lines • Show All 65 Lines • ▼ Show 20 Lines	convertOmpTarget(Operation &opInst, llvm::IRBuilderBase &builder,

builder.restoreIP(moduleTranslation.getOpenMPBuilder()->createTarget(		builder.restoreIP(moduleTranslation.getOpenMPBuilder()->createTarget(
ompLoc, builder.saveIP(), entryInfo, defaultValTeams, defaultValThreads,		ompLoc, builder.saveIP(), entryInfo, defaultValTeams, defaultValThreads,
inputs, bodyCB));		inputs, bodyCB));

return bodyGenStatus;		return bodyGenStatus;
}		}

		/// Converts the module-level set of OpenMP requires clauses into LLVM IR using
		/// OpenMPIRBuilder.
		static LogicalResult
		convertRequiresAttr(Operation &op, omp::ClauseRequiresAttr requiresAttr,
		LLVM::ModuleTranslation &moduleTranslation) {
		auto *ompBuilder = moduleTranslation.getOpenMPBuilder();

		// No need to read requiresAttr here, because it has already been done in
		// translateModuleToLLVMIR(). There, flags are stored in the
		// OpenMPIRBuilderConfig object, available to the OpenMPIRBuilder.
		auto *regFn =
		ompBuilder->createRegisterRequires(ompBuilder->createPlatformSpecificName(
		{"omp_offloading", "requires_reg"}));

		// Add registration function as global constructor
		if (regFn)
		llvm::appendToGlobalCtors(ompBuilder->M, regFn, /* Priority = */ 0);

		return success();
		}

namespace {		namespace {

/// Implementation of the dialect interface that converts operations belonging		/// Implementation of the dialect interface that converts operations belonging
/// to the OpenMP dialect to LLVM IR.		/// to the OpenMP dialect to LLVM IR.
class OpenMPDialectLLVMIRTranslationInterface		class OpenMPDialectLLVMIRTranslationInterface
: public LLVMTranslationDialectInterface {		: public LLVMTranslationDialectInterface {
public:		public:
using LLVMTranslationDialectInterface::LLVMTranslationDialectInterface;		using LLVMTranslationDialectInterface::LLVMTranslationDialectInterface;

/// Translates the given operation to LLVM IR using the provided IR builder		/// Translates the given operation to LLVM IR using the provided IR builder
/// and saving the state in `moduleTranslation`.		/// and saving the state in `moduleTranslation`.
LogicalResult		LogicalResult
convertOperation(Operation *op, llvm::IRBuilderBase &builder,		convertOperation(Operation *op, llvm::IRBuilderBase &builder,
LLVM::ModuleTranslation &moduleTranslation) const final;		LLVM::ModuleTranslation &moduleTranslation) const final;

		/// Given an OpenMP MLIR attribute, create the corresponding LLVM-IR, runtime
		/// calls, or operation amendments
LogicalResult		LogicalResult
amendOperation(Operation *op, NamedAttribute attribute,		amendOperation(Operation *op, NamedAttribute attribute,
LLVM::ModuleTranslation &moduleTranslation) const final;		LLVM::ModuleTranslation &moduleTranslation) const final;
};		};

} // namespace		} // namespace

/// Given an OpenMP MLIR attribute, create the corresponding LLVM-IR, runtime
/// calls, or operation amendments
LogicalResult OpenMPDialectLLVMIRTranslationInterface::amendOperation(		LogicalResult OpenMPDialectLLVMIRTranslationInterface::amendOperation(
Operation *op, NamedAttribute attribute,		Operation *op, NamedAttribute attribute,
LLVM::ModuleTranslation &moduleTranslation) const {		LLVM::ModuleTranslation &moduleTranslation) const {

return llvm::TypeSwitch<Attribute, LogicalResult>(attribute.getValue())		return llvm::TypeSwitch<Attribute, LogicalResult>(attribute.getValue())
.Case([&](mlir::omp::FlagsAttr rtlAttr) {		.Case([&](mlir::omp::FlagsAttr rtlAttr) {
return convertFlagsAttr(op, rtlAttr, moduleTranslation);		return convertFlagsAttr(op, rtlAttr, moduleTranslation);
})		})
		.Case([&](omp::ClauseRequiresAttr requiresAttr) {
		return convertRequiresAttr(*op, requiresAttr, moduleTranslation);
		})
.Default([&](Attribute attr) {		.Default([&](Attribute attr) {
// fall through for omp attributes that do not require lowering and/or		// Fall through for omp attributes that do not require lowering and/or
// have no concrete definition and thus no type to define a case on		// have no concrete definition and thus no type to define a case on.
		// The omp.atomic_default_mem_order attribute is read directly during
		// OpenMP atomic ops lowering
return success();		return success();
});		});

return failure();		return failure();
}		}

/// Given an OpenMP MLIR operation, create the corresponding LLVM IR		/// Given an OpenMP MLIR operation, create the corresponding LLVM IR
/// (including OpenMP runtime calls).		/// (including OpenMP runtime calls).
▲ Show 20 Lines • Show All 118 Lines • Show Last 20 Lines

mlir/lib/Target/LLVMIR/ModuleTranslation.cpp

Show First 20 Lines • Show All 1,255 Lines • ▼ Show 20 Lines	SmallVector<llvm::Value *> ModuleTranslation::lookupValues(ValueRange values) {
return remapped;		return remapped;
}		}

llvm::OpenMPIRBuilder *ModuleTranslation::getOpenMPBuilder() {		llvm::OpenMPIRBuilder *ModuleTranslation::getOpenMPBuilder() {
if (!ompBuilder) {		if (!ompBuilder) {
ompBuilder = std::make_unique<llvm::OpenMPIRBuilder>(*llvmModule);		ompBuilder = std::make_unique<llvm::OpenMPIRBuilder>(*llvmModule);
ompBuilder->initialize();		ompBuilder->initialize();

bool isDevice = false;		// Set OpenMP IR Builder configuration
if (auto offloadMod =		if (auto offloadMod = dyn_cast<omp::OffloadModuleInterface>(mlirModule)) {
dyn_cast<mlir::omp::OffloadModuleInterface>(mlirModule))		llvm::OpenMPIRBuilderConfig config;
isDevice = offloadMod.getIsDevice();		config.setIsEmbedded(offloadMod.getIsDevice());
		config.setIsTargetCodegen(false);
// TODO: set the flags when available		config.setOpenMPOffloadMandatory(false);
llvm::OpenMPIRBuilderConfig Config(
isDevice, /* IsTargetCodegen */ false,		const auto requiresFlags = offloadMod.getRequires();
/* OpenMPOffloadMandatory */ false,		config.setHasRequiresReverseOffload(bitEnumContainsAll(
/* HasRequiresReverseOffload */ false,		requiresFlags, omp::ClauseRequires::reverse_offload));
/* HasRequiresUnifiedAddress */ false,		config.setHasRequiresUnifiedAddress(bitEnumContainsAll(
/* HasRequiresUnifiedSharedMemory */ false,		requiresFlags, omp::ClauseRequires::unified_address));
/* HasRequiresDynamicAllocators */ false);		config.setHasRequiresUnifiedSharedMemory(bitEnumContainsAll(
ompBuilder->setConfig(Config);		requiresFlags, omp::ClauseRequires::unified_shared_memory));
		config.setHasRequiresDynamicAllocators(bitEnumContainsAll(
		requiresFlags, omp::ClauseRequires::dynamic_allocators));
		ompBuilder->setConfig(config);
		}
}		}
return ompBuilder.get();		return ompBuilder.get();
}		}

const llvm::DILocation *		const llvm::DILocation *
ModuleTranslation::translateLoc(Location loc, llvm::DILocalScope *scope) {		ModuleTranslation::translateLoc(Location loc, llvm::DILocalScope *scope) {
return debugTranslation->translateLoc(loc, scope);		return debugTranslation->translateLoc(loc, scope);
}		}
▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	mlir::translateModuleToLLVMIR(Operation *module, llvm::LLVMContext &llvmContext,
std::unique_ptr<llvm::Module> llvmModule =		std::unique_ptr<llvm::Module> llvmModule =
prepareLLVMModule(module, llvmContext, name);		prepareLLVMModule(module, llvmContext, name);
if (!llvmModule)		if (!llvmModule)
return nullptr;		return nullptr;

LLVM::ensureDistinctSuccessors(module);		LLVM::ensureDistinctSuccessors(module);

ModuleTranslation translator(module, std::move(llvmModule));		ModuleTranslation translator(module, std::move(llvmModule));

if (failed(translator.convertFunctionSignatures()))		if (failed(translator.convertFunctionSignatures()))
return nullptr;		return nullptr;
		kiranchandramohanUnsubmitted Done Reply Inline Actions Could you move this to `ModuleTranslation::getOpenMPBuilder`? kiranchandramohan: Could you move this to `ModuleTranslation::getOpenMPBuilder`?
if (failed(translator.convertGlobals()))		if (failed(translator.convertGlobals()))
return nullptr;		return nullptr;
if (failed(translator.createAccessGroupMetadata()))		if (failed(translator.createAccessGroupMetadata()))
return nullptr;		return nullptr;
if (failed(translator.createAliasScopeMetadata()))		if (failed(translator.createAliasScopeMetadata()))
return nullptr;		return nullptr;
if (failed(translator.createTBAAMetadata()))		if (failed(translator.createTBAAMetadata()))
return nullptr;		return nullptr;
if (failed(translator.convertFunctions()))		if (failed(translator.convertFunctions()))
return nullptr;		return nullptr;

// Convert other top-level operations if possible.		// Convert other top-level operations if possible.
llvm::IRBuilder<> llvmBuilder(llvmContext);		llvm::IRBuilder<> llvmBuilder(llvmContext);
for (Operation &o : getModuleBody(module).getOperations()) {		for (Operation &o : getModuleBody(module).getOperations()) {
		kiranchandramohanUnsubmitted Done Reply Inline Actions This change has to go in another patch reviewed by the general MLIR/LLVM dialect community. Not sure about mentioning too much about OpenMPIRbuilder or the OpenMPDialect here. kiranchandramohan: This change has to go in another patch reviewed by the general MLIR/LLVM dialect community. Not…
if (!isa<LLVM::LLVMFuncOp, LLVM::GlobalOp, LLVM::GlobalCtorsOp,		if (!isa<LLVM::LLVMFuncOp, LLVM::GlobalOp, LLVM::GlobalCtorsOp,
LLVM::GlobalDtorsOp, LLVM::MetadataOp>(&o) &&		LLVM::GlobalDtorsOp, LLVM::MetadataOp>(&o) &&
!o.hasTrait<OpTrait::IsTerminator>() &&		!o.hasTrait<OpTrait::IsTerminator>() &&
failed(translator.convertOperation(o, llvmBuilder))) {		failed(translator.convertOperation(o, llvmBuilder))) {
return nullptr;		return nullptr;
}		}
}		}

Show All 9 Lines

mlir/test/Target/LLVMIR/openmp-llvm.mlir

	Show First 20 Lines • Show All 2,487 Lines • ▼ Show 20 Lines
	// -----			// -----

	// CHECK: @__omp_rtl_debug_kind = weak_odr hidden constant i32 0			// CHECK: @__omp_rtl_debug_kind = weak_odr hidden constant i32 0
	// CHECK: @__omp_rtl_assume_teams_oversubscription = weak_odr hidden constant i32 1			// CHECK: @__omp_rtl_assume_teams_oversubscription = weak_odr hidden constant i32 1
	// CHECK: @__omp_rtl_assume_threads_oversubscription = weak_odr hidden constant i32 0			// CHECK: @__omp_rtl_assume_threads_oversubscription = weak_odr hidden constant i32 0
	// CHECK: @__omp_rtl_assume_no_thread_state = weak_odr hidden constant i32 1			// CHECK: @__omp_rtl_assume_no_thread_state = weak_odr hidden constant i32 1
	// CHECK: @__omp_rtl_assume_no_nested_parallelism = weak_odr hidden constant i32 0			// CHECK: @__omp_rtl_assume_no_nested_parallelism = weak_odr hidden constant i32 0
	module attributes {omp.flags = #omp.flags<assume_teams_oversubscription = true, assume_no_thread_state = true>} {}			module attributes {omp.flags = #omp.flags<assume_teams_oversubscription = true, assume_no_thread_state = true>} {}

				// -----

				// Check that the atomic default memory order is picked up by atomic operations.
				module attributes {
				omp.atomic_default_mem_order = #omp<memoryorderkind seq_cst>
				} {
				// CHECK-LABEL: @omp_atomic_default_mem_order
				// CHECK-SAME: (ptr %[[ARG0:.]], ptr %[[ARG1:.]], i32 %[[EXPR:.*]])
				llvm.func @omp_atomic_default_mem_order(%arg0 : !llvm.ptr<i32>,
				%arg1 : !llvm.ptr<i32>,
				%expr : i32) -> () {

				// CHECK: %[[X1:.*]] = load atomic i32, ptr %[[ARG0]] seq_cst, align 4
				// CHECK: store i32 %[[X1]], ptr %[[ARG1]], align 4
				omp.atomic.read %arg1 = %arg0 : !llvm.ptr<i32>, i32

				// CHECK: store atomic i32 %[[EXPR]], ptr %[[ARG1]] seq_cst, align 4
				// CHECK: call void @__kmpc_flush(ptr @{{.*}})
				omp.atomic.write %arg1 = %expr : !llvm.ptr<i32>, i32

				// CHECK: atomicrmw add ptr %[[ARG1]], i32 %[[EXPR]] seq_cst
				omp.atomic.update %arg1 : !llvm.ptr<i32> {
				^bb0(%xval: i32):
				%newval = llvm.add %xval, %expr : i32
				omp.yield(%newval : i32)
				}

				// CHECK: %[[xval:.*]] = atomicrmw xchg ptr %[[ARG0]], i32 %[[EXPR]] seq_cst
				// CHECK: store i32 %[[xval]], ptr %[[ARG1]]
				omp.atomic.capture {
				omp.atomic.read %arg1 = %arg0 : !llvm.ptr<i32>, i32
				omp.atomic.write %arg0 = %expr : !llvm.ptr<i32>, i32
				}

				llvm.return
				}
				}

				// -----

				// Check that OpenMP requires flags are registered by a global constructor.
				// CHECK: @llvm.global_ctors = appending global [1 x { i32, ptr, ptr }]
				// CHECK-SAME: [{ i32, ptr, ptr } { i32 0, ptr @[[REG_FN:.*]], ptr null }]
				// CHECK: define {{.}} @[[REG_FN]]({{.}})
				// CHECK-NOT: }
				// CHECK: call void @__tgt_register_requires(i64 10)
				module attributes {omp.requires = #omp<clause_requires reverse_offload\|unified_shared_memory>} {}