This is an archive of the discontinued LLVM Phabricator instance.

Differential D125668

[flang][OpenMP] Lowering support for atomic update construct
ClosedPublic

Authored by NimishMishra on May 16 2022, 1:03 AM.

Details

Reviewers

jdoerfert
peixin
shraiysh
kiranchandramohan
kiranktp
clementval
sscalpone

Commits

rGa56b76d9ca52: [flang][OpenMP] Lowering support for atomic update construct

Summary

This patch adds lowering support for the atomic update construct. A region is associated with every omp.atomic.update operation. The region contains (1) the evaluation of the expression on the RHS of the atomic assignment statement, and (2) an omp.yield operation that yields the extended value of the expression evaluated in (1).

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

NimishMishra created this revision. May 16 2022, 1:03 AM

Herald added a project: Restricted Project. May 16 2022, 1:03 AM

Herald added subscribers: mehdi_amini, jdoerfert, guansong, yaxunl.

NimishMishra requested review of this revision. May 16 2022, 1:03 AM

Herald added a reviewer: jdoerfert. May 16 2022, 1:03 AM

Herald added a subscriber: sstefan1.

Harbormaster completed remote builds in B164596: Diff 429635. May 16 2022, 1:03 AM

NimishMishra updated this revision to Diff 429682. May 16 2022, 4:34 AM

Harbormaster completed remote builds in B164622: Diff 429682. May 16 2022, 5:15 AM

NimishMishra added reviewers: peixin, shraiysh, kiranchandramohan, kiranktp, clementval. May 16 2022, 5:16 AM

Please check D125793. This patch should fail for pointers. If this fails, please implement atomic update depending on D125793.

NimishMishra edited the summary of this revision. May 23 2022, 2:55 AM

NimishMishra added a parent revision: D125793: [flang][OpenMP] Fix pointer variables in atomic read/write.

nit: name the test file to atomic-update.f90 in accordance with the recent changes by Peixin.

Minor changes requested. Thanks!

flang/lib/Lower/OpenMP.cpp
1188–1189	nit: move declarations closer to their use.
flang/test/Lower/OpenMP/atomic03.f90
48 ↗	(On Diff #429682)	Can we please add more testcases? There are many possibilities with memory order and hint, and it would be nice to test most of them (ideally all of them). Having more testcases that test different scenarios can't hurt us. [Suggestion] Maybe we can separate them into different subroutines (based on memory order) for clarity. Feel free to separate them however you'd like to ensure readability.

This revision now requires changes to proceed. May 23 2022, 3:39 AM

peixin added inline comments. May 23 2022, 3:59 AM

flang/test/Lower/OpenMP/atomic03.f90
48 ↗	(On Diff #429682)	Please add one more test case for pointer similar to D125793.

peixin requested changes to this revision. May 24 2022, 5:09 AM

peixin added inline comments.

flang/lib/Lower/OpenMP.cpp
1202	This should be incorrect for pointers. You may use the following code: mlir::Value address = fir::getBase(converter.genExprAddr( *Fortran::semantics::GetExpr(assignmentStmtVariable), stmtCtx));

NimishMishra edited the summary of this revision. May 27 2022, 7:12 PM

NimishMishra removed a parent revision: D125793: [flang][OpenMP] Fix pointer variables in atomic read/write.

Addressed comments

Harbormaster completed remote builds in B168255: Diff 434749. Jun 7 2022, 3:37 AM

Please add a description to the summary of the patch to describe what is being implemented in the patch along with a brief description of how it is implemented.

flang/lib/Lower/OpenMP.cpp
283	Nit: please document this option.
300	Would it be better to pass the `mlir::Value result` as an argument instead of `Expr` and also reuse it below at the terminator insertion?
349	Nit: expand auto
1220	Nit: Can this function share a lot of code with the `genOmpAtomicUpdate`? The hint clause setting, getting the designator, creating the update op etc seem to be similar?
1253	Can you add a comment here on why the omp::AtomicUpdateOp is created here? Something like the following. "If atomic-clause is not present on the construct, the behavior is as if the update clause is specified."
flang/test/Lower/OpenMP/atomic-update.f90
2	Nit: please test with the flang driver as well.
2	Nit: atomic and atomic update constructs.
4	Nit: can this be on the same line as the one above?
47–50	If the options in `{{.*}}` are significant then capture in a variable and use it in the operation where it is used. If it is not significant then please remove. This comment applies to all the tests.

NimishMishra added a child revision: Restricted Differential Revision. Jun 7 2022, 8:48 AM

NimishMishra added a child revision: D127272: [flang][OpenMP] Lowering support for atomic capture. Jun 8 2022, 12:56 AM

NimishMishra removed a child revision: D127272: [flang][OpenMP] Lowering support for atomic capture. Jun 8 2022, 1:02 AM

NimishMishra mentioned this in D127272: [flang][OpenMP] Lowering support for atomic capture. Jun 8 2022, 1:38 AM

NimishMishra added inline comments. Jun 9 2022, 10:52 PM

flang/lib/Lower/OpenMP.cpp

300

This is causing a change.

If earlier the IR was

omp.atomic.update %[[VAR_Y]] : !fir.ref<i32> {
   ^bb0(%[[ARG:.*]]: i32):
    fir.store %[[ARG]] to %[[TEMP_9]] : !fir.ref<i32>
    %4 = fir.load %[[TEMP_9]] : !fir.ref<i32>
    %5 = arith.constant 1 : i32
    %[[RESULT:.*]] = arith.addi %4, %5 : i32
     omp.yield(%[[RESULT]] : i32)
}

Now it becomes:

%4 = fir.load %[[TEMP_9]] : !fir.ref<i32>
%5 = arith.constant 1 : i32
%[[RESULT:.*]] = arith.addi %4, %5 : i32
omp.atomic.update %[[VAR_Y]] : !fir.ref<i32> {
   ^bb0(%[[ARG:.*]]: i32):
    fir.store %[[ARG]] to %[[TEMP_9]] : !fir.ref<i32>
     omp.yield(%[[RESULT]] : i32)
}

This does seem OK to me. But still if you could confirm it once.

shraiysh added inline comments. Jun 13 2022, 7:39 AM

flang/lib/Lower/OpenMP.cpp
300	This is not accurate IR. The "before" IR was correct (not optimal but correct). The value would get updated with that IR. The "after" IR is both incorrect and not optimal. It is loading an un-related value before the atomic update begins and so the result will not accurately "update" the value. The standard mentions that the evaluation of `expr` in `x += expr` is atomic. In this case, `expr` is `%5`, not `%4` or `%[[RESULT]]`. Hence the evaluation of `%5` can be outside the construct, but not of `%[[RESULT]]`. Solution: The best way to do this would be to avoid generation of load-store operations and directly use the argument. This load-store is required for worksharing loops but it would be great if we could avoid generating them for update. The `arith.addi` operation must be inside the update region. If it seems very hard to avoid load-store then the "before" IR is at least correct.

NimishMishra mentioned this in D127468: [flang][OpenMP] Initial support the lowering of copyin clause. Jun 15 2022, 8:25 AM

shraiysh added inline comments. Jun 21 2022, 3:07 AM

flang/lib/Lower/OpenMP.cpp
300	> The standard mentions that the evaluation of expr in x += expr is atomic. In this case, expr is %5, not %4 or %[[RESULT]]. Hence the evaluation of %5 can be outside the construct, but not of %[[RESULT]].
Correction (apologies for mistyping some stuff): The standard mentions that only the evaluation of `expr` in `x += expr` need not be atomic, but the update to `x` must be atomic. In this case, `expr` is `%5`, not `%4` or `%[[RESULT]]`. Hence the evaluation of `%5` can be outside the construct, but both `%4` and `%[[RESULT]]` must be inside the atomic region. This still does not change the solution and the fact that the "after" IR is incorrect.
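
The point of this thread can be sketched outside MLIR. The following is a minimal C++ illustration, not code from the patch: `atomicUpdate` is a hypothetical helper that models `omp.atomic.update` as a compare-exchange loop, with the lambda playing the role of the region (its parameter is the block argument, its return value is what `omp.yield` yields). `correctUpdate` keeps the addition inside the region as in the "before" IR, while `staleUpdate` mirrors the rejected "after" IR by computing the result from an earlier load.

```cpp
#include <atomic>
#include <cassert>
#include <thread>
#include <vector>

// Hypothetical model of omp.atomic.update: retry until the value produced by
// `region` is stored over the exact value it was computed from.
template <typename Region>
void atomicUpdate(std::atomic<int> &x, Region region) {
  int current = x.load();
  // On failure, compare_exchange_weak refreshes `current`, so the region is
  // re-evaluated against the latest value of x.
  while (!x.compare_exchange_weak(current, region(current))) {
  }
}

// Mirrors the "before" IR: `expr` may be evaluated outside, but the addition
// that reads x happens inside the region.
int correctUpdate(int numThreads, int itersPerThread) {
  std::atomic<int> x{0};
  std::vector<std::thread> workers;
  for (int t = 0; t < numThreads; ++t) {
    workers.emplace_back([&x, itersPerThread] {
      for (int i = 0; i < itersPerThread; ++i) {
        const int expr = 1; // evaluated outside the atomic region
        atomicUpdate(x, [expr](int arg) { return arg + expr; });
      }
    });
  }
  for (std::thread &worker : workers)
    worker.join();
  return x.load();
}

// Mirrors the rejected "after" IR: the result is computed from a load taken
// before the region, and the region ignores its argument.
int staleUpdate(int valueAtEarlyLoad, int valueAtUpdate, int expr) {
  std::atomic<int> x{valueAtUpdate};
  const int result = valueAtEarlyLoad + expr; // hoisted out of the region
  atomicUpdate(x, [result](int /*arg*/) { return result; });
  return x.load();
}
```

In this model, `correctUpdate(4, 1000)` returns 4000 regardless of thread interleaving, whereas `staleUpdate(0, 10, 1)` returns 1: the value 10 that was written between the early load and the update is lost.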

Rebased and addressed comments

Herald added a reviewer: sscalpone. Jun 24 2022, 12:13 AM

NimishMishra edited the summary of this revision. Jun 24 2022, 12:16 AM

Harbormaster completed remote builds in B171787: Diff 439650. Jun 24 2022, 12:39 AM

peixin added inline comments. Jun 25 2022, 2:22 AM

flang/lib/Lower/OpenMP.cpp
274	I wonder whether this should be split into a separate function instead of handling the special case in the current `createBodyOfOp`, which is complex enough already. Atomic update uses only part of the code in `createBodyOfOp`, and it has nothing to do with `outerCombined` or privatization. Atomic update is really a special case: the statements in its region are fixed by the parser, so it is not a general body like other regions. Would it be better to create a dedicated function to support this?
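
The trade-off raised here can be sketched in plain C++. This is an illustrative example only: `LoopOp`, `AtomicUpdateOp`, `ParallelOp`, `terminatorFor`, and `atomicUpdateTerminator` are hypothetical stand-ins, not the MLIR classes or the flang functions. It contrasts one generic template that special-cases an operation with `if constexpr` against a dedicated function for that operation.

```cpp
#include <cassert>
#include <string>
#include <type_traits>

// Hypothetical operation tags; they only exist to select a code path.
struct LoopOp {};
struct AtomicUpdateOp {};
struct ParallelOp {};

// Option 1: a single generic builder. Every special case adds another
// compile-time branch, and callers that never need it still pay in
// readability.
template <typename Op>
std::string terminatorFor() {
  if constexpr (std::is_same_v<Op, LoopOp>)
    return "yield()";
  else if constexpr (std::is_same_v<Op, AtomicUpdateOp>)
    return "yield(result)";
  else
    return "terminator";
}

// Option 2: the special case gets its own function, so the generic builder
// no longer needs to know about it.
std::string atomicUpdateTerminator() { return "yield(result)"; }
```

Both options produce the same result for the special case; the difference is where the complexity lives, which is the reviewer's concern about `createBodyOfOp`.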

created a new function for atomic update lowering

modified the generated IR to use %arg0 in the region instead of separately loading the region argument

NimishMishra added inline comments. Jun 30 2022, 5:16 AM

flang/test/Lower/OpenMP/atomic-update.f90
96	Missed this. Will fix in next diff upload
114	Missed this too. Will fix in next diff upload

Harbormaster completed remote builds in B173012: Diff 441359. Jun 30 2022, 5:30 AM

peixin added inline comments. Jul 1 2022, 10:27 PM

flang/lib/Lower/OpenMP.cpp
281	`// bind the argument to the symbol` This is not necessary. The code can comment itself.
289	Why do you generate the result twice?
1095	`address = fir::getBase(converter.genExprAddr( *Fortran::semantics::GetExpr(assignmentStmtVariable), stmtCtx));` Move this out of `if condition`.
1111	`createBodyOfAtomicUpdateOp` is only used here, and it's not one large function, so it is OK to inline it directly.

Addressed comments.

Harbormaster completed remote builds in B173520: Diff 442043. Jul 4 2022, 3:27 AM

The generated code in testcases looks good. Please wait for @peixin's approval for the code part, but functionality-wise this LGTM.

Thanks for the update. The structure of lowering atomic update operation is much more clear now. I have a few more comments.

flang/lib/Lower/OpenMP.cpp
263	Remove the `inline`.
284
1091	The vector of symbols does not seem necessary. One symbol is enough, since the update operation always has one argument. Use a more descriptive name such as `updateSymbol`, or a better one.
1111	No need to pass currentLocation. It can be obtained from `converter`.
1112	`symbolVector` is from `assignmentStmtVariable`. You can move the code of getting the `updateSymbol` inside the callee.

peixin added inline comments. Jul 5 2022, 3:57 AM

flang/lib/Lower/OpenMP.cpp
263	I didn't mean to add the `inline` attribute. What I meant is: remove this function and move the code into `genOmpAtomicUpdateStatement`.

Addressed comments

Harbormaster completed remote builds in B175335: Diff 444566. Jul 14 2022, 3:25 AM

LGTM except for a few nits.

flang/lib/Lower/OpenMP.cpp
1111	Nit: Add one empty line after 1149.
1112	Nit: Remove the empty line.
1114	Nit
1115	Nit: Remove the empty line.
1121	`tiv` means type of induction variable. It's not appropriate for update variable.
1122	Then you can remove the following: tiv.push_back(varType); locs.push_back(currentLocation);
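
The last two suggestions amount to initializing the one-element containers directly instead of declaring them empty and pushing back. A minimal sketch, assuming `std::vector` and `std::string` as stand-ins for `SmallVector`, `Type`, and `Location` (the function names are illustrative, not from the patch):

```cpp
#include <cassert>
#include <string>
#include <vector>

// Original shape: declare an empty container, then push the single element.
std::vector<std::string> viaPushBack(const std::string &varType) {
  std::vector<std::string> tiv;
  tiv.push_back(varType);
  return tiv;
}

// Suggested shape: a better name and a brace initializer, which makes the
// separate push_back call unnecessary.
std::vector<std::string> viaInitializer(const std::string &varType) {
  std::vector<std::string> varTys = {varType};
  return varTys;
}
```

Both functions return the same one-element container; the second form states the intent at the declaration.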

This revision was not accepted when it landed; it landed in state Needs Review. Jul 14 2022, 5:52 AM

Closed by commit rGa56b76d9ca52: [flang][OpenMP] Lowering support for atomic update construct (authored by NimishMishra).

This revision was automatically updated to reflect the committed changes.

NimishMishra added a commit: rGa56b76d9ca52: [flang][OpenMP] Lowering support for atomic update construct.

I am getting:

/.../llvm-project/flang/lib/Lower/OpenMP.cpp:1134:21: error: variable 'updateSymbol' is used uninitialized whenever 'if' condition is false [-Werror,-Wsometimes-uninitialized]

If I understand the code correctly, updateSymbol is sometimes initialized. Then, we always dereference the pointer later in the function:

converter.bindSymbol(*updateSymbol, val);

I could silence the warning with:

const Fortran::semantics::Symbol *updateSymbol = nullptr;

but that wouldn't fix the fundamental problem. Would you mind taking a look? Thanks!
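
The warning pattern, and one common way to address it, can be sketched as follows. This is a generic illustration under assumed names (`Symbol`, `boundName`, `isDataRef`); it is not the code in OpenMP.cpp and not necessarily the fix applied in any follow-up revision. The idea is to initialize the pointer unconditionally and to check it before the dereference, so that the "condition is false" path is handled explicitly instead of merely silencing the warning.

```cpp
#include <cassert>
#include <optional>
#include <string>

// Hypothetical stand-in for a semantics symbol.
struct Symbol {
  std::string name;
};

// Returns the name of the symbol to bind, or std::nullopt when the
// condition that assigns the pointer does not hold.
std::optional<std::string> boundName(const Symbol *candidate, bool isDataRef) {
  const Symbol *updateSymbol = nullptr; // always initialized
  if (isDataRef)
    updateSymbol = candidate;
  if (!updateSymbol) // guard the dereference on the "false" path
    return std::nullopt;
  return updateSymbol->name;
}
```

In real lowering code the guarded path would more likely be an assertion or a diagnostic than a silent return; the sketch only shows that initialization and the dereference guard belong together.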

In D125668#3655687, @kazu wrote:
I am getting:
/.../llvm-project/flang/lib/Lower/OpenMP.cpp:1134:21: error: variable 'updateSymbol' is used uninitialized whenever 'if' condition is false [-Werror,-Wsometimes-uninitialized]
If I understand the code correctly, updateSymbol is sometimes initialized. Then, we always dereference the pointer later in the function:
converter.bindSymbol(*updateSymbol, val);
I could silence the warning with:
const Fortran::semantics::Symbol *updateSymbol = nullptr;
but that wouldn't fix the fundamental problem. Would you mind taking a look? Thanks!

Thanks @kazu for reaching out. I have put a potential fix at https://reviews.llvm.org/D129914.

@peixin @kiranchandramohan Could you take a quick look at the fix? The issue is failing some builds with [-Werror,-Wsometimes-uninitialized]

kiranchandramohan mentioned this in D129914: [flang][OpenMP] Fix warning due to uninitialized pointer dereference during atomic update lowering. Jul 15 2022, 9:21 PM

Revision Contents

Path	Size
flang/lib/Lower/OpenMP.cpp	137 lines
flang/test/Lower/OpenMP/atomic-update.f90	135 lines

Diff 439650

flang/lib/Lower/OpenMP.cpp

(254 lines not shown; enclosing context: if (eval.block) {)
               "expected terminator op");
      }
    if (!eval.isDirective() && eval.hasNestedEvaluations())
      createEmptyRegionBlocks(firOpBuilder, eval.getNestedEvaluations());
  }
  /// Create the body (block) for an OpenMP Operation.

  ///
  /// \param [in] op - the operation the body belongs to.
  /// \param [inout] converter - converter to use for the clauses.
  /// \param [in] loc - location in source code.
  /// \param [in] eval - current PFT node/evaluation.
  /// \oaran [in] clauses - list of clauses to process.
  /// \param [in] args - block arguments (induction variable[s]) for the
  //// region.
  /// \param [in] outerCombined - is this an outer operation - prevents
  /// privatization.
+ /// \param [in] expr - the expression whose evaluation's extended

+ //// value is required
  template <typename Op>
  static void
  createBodyOfOp(Op &op, Fortran::lower::AbstractConverter &converter,
                 mlir::Location &loc, Fortran::lower::pft::Evaluation &eval,
                 const Fortran::parser::OmpClauseList *clauses = nullptr,
                 const SmallVector<const Fortran::semantics::Symbol *> &args = {},

-                bool outerCombined = false) {
+                bool outerCombined = false,
+                const Fortran::parser::Expr *expr = nullptr) {

    fir::FirOpBuilder &firOpBuilder = converter.getFirOpBuilder();

peixin (inline comment, Not Done), suggested change with no comment text:
  // Set the insert for the terminator operation to go at the end of the
- // block
+ // block.
  mlir::Block &block = op.getRegion().back();

    // If an argument for the region is provided then create the block with that
    // argument. Also update the symbol's address with the mlir argument value.
    // e.g. For loops the argument is the induction variable. And all further
    // uses of the induction variable should use this mlir value.
    mlir::Operation *storeOp = nullptr;

    if (args.size()) {
      std::size_t loopVarTypeSize = 0;
      for (const Fortran::semantics::Symbol *arg : args)
        loopVarTypeSize = std::max(loopVarTypeSize, arg->GetUltimate().size());
-     mlir::Type loopVarType = getLoopVarType(converter, loopVarTypeSize);
+     mlir::Type varType;
+     if constexpr (std::is_same_v<Op, omp::AtomicUpdateOp>) {
+       Fortran::lower::StatementContext statementCtx;
+       mlir::Value result = fir::getBase(converter.genExprValue(
+           *Fortran::semantics::GetExpr(*expr), statementCtx));
+       varType = result.getType();
+     } else {

+       varType = getLoopVarType(converter, loopVarTypeSize);
+     }
      SmallVector<Type> tiv;
      SmallVector<Location> locs;
      for (int i = 0; i < (int)args.size(); i++) {
-       tiv.push_back(loopVarType);
+       tiv.push_back(varType);
        locs.push_back(loc);
      }
      firOpBuilder.createBlock(&op.getRegion(), {}, tiv, locs);
+     if constexpr (!std::is_same_v<Op, omp::AtomicUpdateOp>) {
+       // No need to create a temporary for the argument in case of
+       // omp::AtomicUpdateOp
        int argIndex = 0;
        // The argument is not currently in memory, so make a temporary for the
        // argument, and store it there, then bind that location to the argument.
        for (const Fortran::semantics::Symbol *arg : args) {
          mlir::Value val =
              fir::getBase(op.getRegion().front().getArgument(argIndex));
          mlir::Value temp = firOpBuilder.createTemporary(
-             loc, loopVarType,
+             loc, varType,
              llvm::ArrayRef<mlir::NamedAttribute>{
                  Fortran::lower::getAdaptToByRefAttr(firOpBuilder)});
          storeOp = firOpBuilder.create<fir::StoreOp>(loc, val, temp);
          converter.bindSymbol(*arg, temp);
          argIndex++;
        }
+     }
    } else {
      firOpBuilder.createBlock(&op.getRegion());
    }
    // Set the insert for the terminator operation to go at the end of the
    // block - this is either empty or the block with the stores above,
    // the end of the block works for both.
    mlir::Block &block = op.getRegion().back();
    firOpBuilder.setInsertionPointToEnd(&block);
    // If it is an unstructured region and is not the outer region of a combined
    // construct, create empty blocks for all evaluations.
    if (eval.lowerAsUnstructured() && !outerCombined)
      createEmptyRegionBlocks(firOpBuilder, eval.getNestedEvaluations());
    // Insert the terminator.
    if constexpr (std::is_same_v<Op, omp::WsLoopOp> ||
                  std::is_same_v<Op, omp::SimdLoopOp>) {
      mlir::ValueRange results;
      firOpBuilder.create<mlir::omp::YieldOp>(loc, results);
+   } else if constexpr (std::is_same_v<Op, omp::AtomicUpdateOp>) {
+     Fortran::lower::StatementContext statementCtx;
+     mlir::Value result = fir::getBase(converter.genExprValue(

+         *Fortran::semantics::GetExpr(*expr), statementCtx));
+     firOpBuilder.create<mlir::omp::YieldOp>(loc, result);
    } else {
      firOpBuilder.create<mlir::omp::TerminatorOp>(loc);
    }
    // Reset the insert point to before the terminator.
    if (storeOp)
      firOpBuilder.setInsertionPointAfter(storeOp);
    else

(709 lines not shown; enclosing context: if (auto ompClause = std::get_if<Fortran::parser::OmpClause>(&clause.u)) {)
                     &ompMemoryOrderClause->v.u)) {
        memory_order = mlir::omp::ClauseMemoryOrderKindAttr::get(
            firOpBuilder.getContext(), omp::ClauseMemoryOrderKind::Release);
      }
+ static void genOmpAtomicUpdateStatement(
+     Fortran::lower::AbstractConverter &converter,
+     Fortran::lower::pft::Evaluation &eval,
+     const Fortran::parser::Variable &assignmentStmtVariable,
+     const Fortran::parser::Expr &assignmentStmtExpr,
+     const Fortran::parser::OmpAtomicClauseList *leftHandClauseList,
+     const Fortran::parser::OmpAtomicClauseList *rightHandClauseList) {
+   // Generate `omp.atomic.update` operation for atomic assignment statements
+   auto &firOpBuilder = converter.getFirOpBuilder();
+   auto currentLocation = converter.getCurrentLocation();
+   mlir::Value address;
+   SmallVector<const Fortran::semantics::Symbol *> symbolVector;
+   Fortran::lower::StatementContext stmtCtx;
+   if (auto varDesignator = std::get_if<
+           Fortran::common::Indirection<Fortran::parser::Designator>>(

+           &assignmentStmtVariable.u)) {
+     if (const auto *name = getDesignatorNameIfDataRef(varDesignator->value())) {
+       address = fir::getBase(converter.genExprAddr(
+           *Fortran::semantics::GetExpr(assignmentStmtVariable), stmtCtx));

+       symbolVector.push_back(name->symbol);
+     }
+   // If no hint clause is specified, the effect is as if
+   // hint(omp_sync_hint_none) had been specified.
+   mlir::IntegerAttr hint = nullptr;
+   mlir::omp::ClauseMemoryOrderKindAttr memory_order = nullptr;
+   if (leftHandClauseList)
+     genOmpAtomicHintAndMemoryOrderClauses(converter, *leftHandClauseList, hint,
+                                           memory_order);
+   if (rightHandClauseList)
+     genOmpAtomicHintAndMemoryOrderClauses(converter, *rightHandClauseList, hint,
+                                           memory_order);
+   auto atomicUpdateOp = firOpBuilder.create<mlir::omp::AtomicUpdateOp>(
+       currentLocation, address, hint, memory_order);
+   createBodyOfOp<omp::AtomicUpdateOp>(atomicUpdateOp, converter,

peixin (inline comment, Not Done):
  currentLocation, address, hint, memory_order);
- //// Generate body of Atomic Update operation
+ // Generate body of Atomic Update operation
  // If an argument for the region is provided then create the block with that
Nit: Add one empty line after 1149.

+                                       currentLocation, eval, nullptr,

+                                       symbolVector, false, &assignmentStmtExpr);
+ }

peixin (inline comment, Not Done):
  // If an argument for the region is provided then create the block with that
- // argument. Also update the symbol's address with the mlir argument value.
+ // argument. Also update the symbol's address with the argument mlir value.
  mlir::Type varType =
Nit

  static void
  genOmpAtomicWrite(Fortran::lower::AbstractConverter &converter,
                    Fortran::lower::pft::Evaluation &eval,
                    const Fortran::parser::OmpAtomicWrite &atomicWrite) {
    auto &firOpBuilder = converter.getFirOpBuilder();
    auto currentLocation = converter.getCurrentLocation();

peixin (inline comment, Not Done):
  .getType();
- SmallVector<Type> tiv;
+ SmallVector<Type> varTys = {varType};
  SmallVector<Location> locs;
`tiv` means type of induction variable. It's not appropriate for update variable.

    // Get the value and address of atomic write operands.

peixin (inline comment, Not Done):
  SmallVector<Type> tiv;
- SmallVector<Location> locs;
+ SmallVector<Location> locs = {currentLocation};
  tiv.push_back(varType);
Then you can remove the following:
tiv.push_back(varType);
locs.push_back(currentLocation);

    const Fortran::parser::OmpAtomicClauseList &rightHandClauseList =
        std::get<2>(atomicWrite.t);
    const Fortran::parser::OmpAtomicClauseList &leftHandClauseList =
        std::get<0>(atomicWrite.t);
    const auto &assignmentStmtExpr =
        std::get<Fortran::parser::Expr>(std::get<3>(atomicWrite.t).statement.t);
    const auto &assignmentStmtVariable = std::get<Fortran::parser::Variable>(
        std::get<3>(atomicWrite.t).statement.t);
(41 lines not shown; enclosing context: genOmpAtomicHintAndMemoryOrderClauses(converter, leftHandClauseList, hint,)
                                          memory_order);
    genOmpAtomicHintAndMemoryOrderClauses(converter, rightHandClauseList, hint,
                                          memory_order);
    firOpBuilder.create<mlir::omp::AtomicReadOp>(currentLocation, from_address,
                                                 to_address, hint, memory_order);
  }
  static void
+ genOmpAtomicUpdate(Fortran::lower::AbstractConverter &converter,
+                    Fortran::lower::pft::Evaluation &eval,
+                    const Fortran::parser::OmpAtomicUpdate &atomicUpdate) {
+   const Fortran::parser::OmpAtomicClauseList &rightHandClauseList =
+       std::get<2>(atomicUpdate.t);
+   const Fortran::parser::OmpAtomicClauseList &leftHandClauseList =
+       std::get<0>(atomicUpdate.t);
+   const auto &assignmentStmtExpr =
+       std::get<Fortran::parser::Expr>(std::get<3>(atomicUpdate.t).statement.t);
+   const auto &assignmentStmtVariable = std::get<Fortran::parser::Variable>(

shraiysh (inline comment, Done):

nit: move declarations closer to their use.

std::get<3>(atomicUpdate.t).statement.t);

genOmpAtomicUpdateStatement(converter, eval, assignmentStmtVariable,

assignmentStmtExpr, &leftHandClauseList,

&rightHandClauseList);

}

static void genOmpAtomic(Fortran::lower::AbstractConverter &converter,

Fortran::lower::pft::Evaluation &eval,

const Fortran::parser::OmpAtomic &atomicConstruct) {

const Fortran::parser::OmpAtomicClauseList &atomicClauseList =

std::get<Fortran::parser::OmpAtomicClauseList>(atomicConstruct.t);

const auto &assignmentStmtExpr = std::get<Fortran::parser::Expr>(

peixin (inline comment, Done):

This is probably incorrect for pointers. You could use the following code instead:

mlir::Value address = fir::getBase(converter.genExprAddr(
      *Fortran::semantics::GetExpr(assignmentStmtVariable), stmtCtx));

std::get<Fortran::parser::Statement<Fortran::parser::AssignmentStmt>>(

atomicConstruct.t)

.statement.t);

const auto &assignmentStmtVariable = std::get<Fortran::parser::Variable>(

std::get<Fortran::parser::Statement<Fortran::parser::AssignmentStmt>>(

atomicConstruct.t)

.statement.t);

// If atomic-clause is not present on the construct, the behaviour is as if

// the update clause is specified

genOmpAtomicUpdateStatement(converter, eval, assignmentStmtVariable,

assignmentStmtExpr, &atomicClauseList, nullptr);

}

static void

genOMP(Fortran::lower::AbstractConverter &converter,

Fortran::lower::pft::Evaluation &eval,

const Fortran::parser::OpenMPAtomicConstruct &atomicConstruct) {

std::visit(Fortran::common::visitors{

kiranchandramohan (inline comment, Done):

Nit: Can this function share a lot of code with `genOmpAtomicUpdate`? The hint clause setting, getting the designator, creating the update op, etc. seem to be similar.
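The sharing asked for in this comment shows up in the diff as both `genOmpAtomic` and `genOmpAtomicUpdate` forwarding to `genOmpAtomicUpdateStatement`. The following is a minimal, self-contained sketch of that dispatch shape only. The node structs, the `lower` and `lowerUpdateStatement` functions, and the returned strings are all hypothetical simplifications, and the local `visitors` helper is an assumption written in the same spirit as `Fortran::common::visitors`; none of it is the flang API.

```cpp
#include <string>
#include <variant>

// Hypothetical, simplified stand-ins for the parser node types.
struct AtomicRead {};
struct AtomicWrite {};
struct AtomicUpdate { std::string var; };
struct AtomicPlain { std::string var; }; // no atomic-clause on the construct
struct AtomicCapture {};

using AtomicConstruct = std::variant<AtomicRead, AtomicWrite, AtomicUpdate,
                                     AtomicPlain, AtomicCapture>;

// Overload set built from lambdas (C++17), used with std::visit.
template <typename... Ts> struct visitors : Ts... {
  using Ts::operator()...;
};
template <typename... Ts> visitors(Ts...) -> visitors<Ts...>;

// Shared helper: both the explicit-update and the no-clause forms end up here.
std::string lowerUpdateStatement(const std::string &var) {
  return "omp.atomic.update " + var;
}

std::string lower(const AtomicConstruct &construct) {
  return std::visit(
      visitors{
          [](const AtomicRead &) { return std::string("omp.atomic.read"); },
          [](const AtomicWrite &) { return std::string("omp.atomic.write"); },
          [](const AtomicUpdate &u) { return lowerUpdateStatement(u.var); },
          // Without an atomic-clause, behave as if update was specified.
          [](const AtomicPlain &p) { return lowerUpdateStatement(p.var); },
          // Catch-all for forms that are not handled yet.
          [](const auto &) { return std::string("TODO: atomic capture"); }},
      construct);
}
```

Keeping the common work in one helper means the two entry points differ only in how they extract the variable and clause lists from their node.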

[&](const Fortran::parser::OmpAtomicRead &atomicRead) {

genOmpAtomicRead(converter, eval, atomicRead);

},

[&](const Fortran::parser::OmpAtomicWrite &atomicWrite) {

genOmpAtomicWrite(converter, eval, atomicWrite);

},

[&](const Fortran::parser::OmpAtomic &atomicConstruct) {

genOmpAtomic(converter, eval, atomicConstruct);

},

[&](const Fortran::parser::OmpAtomicUpdate &atomicUpdate) {

genOmpAtomicUpdate(converter, eval, atomicUpdate);

},

[&](const auto &) {

- TODO(converter.getCurrentLocation(),

- "Atomic update & capture");

+ TODO(converter.getCurrentLocation(), "Atomic capture");

},

atomicConstruct.u);

}

void Fortran::lower::genOpenMPConstruct(

Fortran::lower::AbstractConverter &converter,

Fortran::lower::pft::Evaluation &eval,

const Fortran::parser::OpenMPConstruct &ompConstruct) {

std::visit(

common::visitors{

[&](const Fortran::parser::OpenMPStandaloneConstruct

&standaloneConstruct) {

genOMP(converter, eval, standaloneConstruct);

},

[&](const Fortran::parser::OpenMPSectionsConstruct

&sectionsConstruct) {

genOMP(converter, eval, sectionsConstruct);

kiranchandramohan (inline comment, Done):

Can you add a comment here on why the omp::AtomicUpdateOp is created here? Something like the following.

"If atomic-clause is not present on the construct, the behavior is as if the update clause is specified."

},

[&](const Fortran::parser::OpenMPSectionConstruct &sectionConstruct) {

genOMP(converter, eval, sectionConstruct);

},

[&](const Fortran::parser::OpenMPLoopConstruct &loopConstruct) {

genOMP(converter, eval, loopConstruct);

},

[&](const Fortran::parser::OpenMPDeclarativeAllocate

[89 lines not shown]

flang/test/Lower/OpenMP/atomic-update.f90

This file was added.

				! This test checks lowering of atomic and atomic update constructs
				! RUN: bbc -fopenmp -emit-fir %s -o - | FileCheck %s
				kiranchandramohan (inline comment, Done): Nit: please test with the flang driver as well.
				kiranchandramohan (inline comment, Done): Nit: atomic and atomic update constructs.
				! RUN: %flang_fc1 -emit-fir -fopenmp %s -o - | FileCheck %s

				kiranchandramohan (inline comment, Done): Nit: can this be on the same line as the one above?
				program OmpAtomicUpdate
				use omp_lib
				integer :: x, y, z
				integer, pointer :: a, b
				integer, target :: c, d
				a=>c
				b=>d

				!CHECK: func.func @_QQmain() {
				!CHECK: %[[A:.*]] = fir.alloca !fir.box<!fir.ptr<i32>> {bindc_name = "a", uniq_name = "_QFEa"}
				!CHECK: %[[A_ADDR:.*]] = fir.alloca !fir.ptr<i32> {uniq_name = "_QFEa.addr"}
				!CHECK: %{{.*}} = fir.zero_bits !fir.ptr<i32>
				!CHECK: fir.store %{{.*}} to %[[A_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: %[[B:.*]] = fir.alloca !fir.box<!fir.ptr<i32>> {bindc_name = "b", uniq_name = "_QFEb"}
				!CHECK: %[[B_ADDR:.*]] = fir.alloca !fir.ptr<i32> {uniq_name = "_QFEb.addr"}
				!CHECK: %{{.*}} = fir.zero_bits !fir.ptr<i32>
				!CHECK: fir.store %{{.*}} to %[[B_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: %[[C_ADDR:.*]] = fir.address_of(@_QFEc) : !fir.ref<i32>
				!CHECK: %[[D_ADDR:.*]] = fir.address_of(@_QFEd) : !fir.ref<i32>
				!CHECK: %[[X:.*]] = fir.alloca i32 {bindc_name = "x", uniq_name = "_QFEx"}
				!CHECK: %[[Y:.*]] = fir.alloca i32 {bindc_name = "y", uniq_name = "_QFEy"}
				!CHECK: %[[Z:.*]] = fir.alloca i32 {bindc_name = "z", uniq_name = "_QFEz"}
				!CHECK: %{{.*}} = fir.convert %[[C_ADDR]] : (!fir.ref<i32>) -> !fir.ptr<i32>
				!CHECK: fir.store %{{.*}} to %[[A_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: %{{.*}} = fir.convert %[[D_ADDR]] : (!fir.ref<i32>) -> !fir.ptr<i32>
				!CHECK: fir.store {{.*}} to %[[B_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: %[[LOADED_A:.*]] = fir.load %[[A_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: omp.atomic.update %[[LOADED_A]] : !fir.ptr<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %[[LOADED_A:.*]] = fir.load %[[A_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: %{{.*}} = fir.load %[[LOADED_A]] : !fir.ptr<i32>
				!CHECK: %[[LOADED_B:.*]] = fir.load %[[B_ADDR]] : !fir.ref<!fir.ptr<i32>>
				!CHECK: %{{.*}} = fir.load %[[LOADED_B]] : !fir.ptr<i32>
				!CHECK: %[[RESULT:.*]] = arith.addi %{{.*}}, %{{.*}} : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!$omp atomic update
				a = a + b

				!CHECK: omp.atomic.update %[[Y]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %[[LOADED_Y:.*]] = fir.load %[[Y]] : !fir.ref<i32>
				!CHECK: {{.*}} = arith.constant 1 : i32
				!CHECK: %[[RESULT:.*]] = arith.addi %[[LOADED_Y]], {{.*}} : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				kiranchandramohan (inline comment, Done): If the options in `{{.*}}` are significant, then capture them in a variable and use it in the operation where it is used. If they are not significant, then please remove them. This comment applies to all the tests.
				!CHECK: omp.atomic.update %[[Z]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %[[LOADED_X:.*]] = fir.load %[[X]] : !fir.ref<i32>
				!CHECK: %[[LOADED_Z:.*]] = fir.load %[[Z]] : !fir.ref<i32>
				!CHECK: %[[RESULT:.*]] = arith.muli %[[LOADED_X]], %[[LOADED_Z]] : i32
				!CHECK: omp.yield(%16 : i32)
				!CHECK: }
				!$omp atomic
				y = y + 1
				!$omp atomic update
				z = x * z

				!CHECK: omp.atomic.update memory_order(relaxed) hint(uncontended) %[[X]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %[[LOADED_X:.*]] = fir.load %[[X]] : !fir.ref<i32>
				!CHECK: %{{.*}} = arith.constant 1 : i32
				!CHECK: %[[RESULT:.*]] = arith.subi %[[LOADED_X]], {{.*}} : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!CHECK: omp.atomic.update memory_order(relaxed) %[[Y]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %[[LOADED_X:.*]] = fir.load %[[X]] : !fir.ref<i32>
				!CHECK: %[[LOADED_Y:.*]] = fir.load %[[Y]] : !fir.ref<i32>
				!CHECK: %[[LOADED_Z:.*]] = fir.load %[[Z]] : !fir.ref<i32>
				!CHECK: %{{.*}} = arith.cmpi sgt, %[[LOADED_X]], %[[LOADED_Y]] : i32
				!CHECK: %{{.*}} = arith.select %{{.*}}, %[[LOADED_X]], %[[LOADED_Y]] : i32
				!CHECK: %{{.*}} = arith.cmpi sgt, %{{.*}}, %[[LOADED_Z]] : i32
				!CHECK: %[[RESULT:.*]] = arith.select %{{.*}}, %{{.*}}, %[[LOADED_Z]] : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!CHECK: omp.atomic.update memory_order(relaxed) hint(contended) %[[Z]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %[[LOADED_Z:.*]] = fir.load %[[Z]] : !fir.ref<i32>
				!CHECK: %[[LOADED_X:.*]] = fir.load %[[X]] : !fir.ref<i32>
				!CHECK: %[[RESULT:.*]] = arith.addi %[[LOADED_Z]], %[[LOADED_X]] : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!$omp atomic relaxed update hint(omp_sync_hint_uncontended)
				x = x - 1
				!$omp atomic update relaxed
				y = max(x, y, z)
				!$omp atomic relaxed hint(omp_sync_hint_contended)
				z = z + x

				!CHECK: omp.atomic.update memory_order(release) hint(contended) %[[Z]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				NimishMishra (author, inline comment, Done): Missed this. Will fix in the next diff upload.
				!CHECK: %{{.*}} = arith.constant 10 : i32
				!CHECK: %[[LOADED_Z:.*]] = fir.load %[[Z]] : !fir.ref<i32>
				!CHECK: %[[RESULT:.*]] = arith.muli {{.*}}, %[[LOADED_Z]] : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!CHECK: omp.atomic.update memory_order(release) hint(speculative) %[[X]] : !fir.ref<i32> {
				!CHECK: ^bb0(%arg0: i32):
				!CHECK: %[[LOADED_X:.*]] = fir.load %[[X]] : !fir.ref<i32>
				!CHECK: %[[LOADED_Z:.*]] = fir.load %[[Z]] : !fir.ref<i32>
				!CHECK: %[[RESULT:.*]] = arith.divsi %[[LOADED_X]], %[[LOADED_Z]] : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }

				!$omp atomic release update hint(omp_lock_hint_contended)
				z = z * 10
				!$omp atomic hint(omp_lock_hint_speculative) update release
				x = x / z

				NimishMishra (author, inline comment, Done): Missed this too. Will fix in the next diff upload.
				!CHECK: omp.atomic.update memory_order(seq_cst) hint(nonspeculative) %[[Y]] : !fir.ref<i32> {
				!CHECK: ^bb0(%[[ARG:.*]]: i32):
				!CHECK: %{{.*}} = arith.constant 10 : i32
				!CHECK: %[[LOADED_Y:.*]] = fir.load %[[Y]] : !fir.ref<i32>
				!CHECK: %[[RESULT:.*]] = arith.addi %{{.*}}, %[[LOADED_Y]] : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!CHECK: omp.atomic.update memory_order(seq_cst) %[[Z]] : !fir.ref<i32> {
				!CHECK: ^bb0(%arg0: i32):
				!CHECK: %[[LOADED_Y:.*]] = fir.load %[[Y]] : !fir.ref<i32>
				!CHECK: %[[LOADED_Z:.*]] = fir.load %[[Z]] : !fir.ref<i32>
				!CHECK: %[[RESULT:.*]] = arith.addi %[[LOADED_Y]], %[[LOADED_Z]] : i32
				!CHECK: omp.yield(%[[RESULT]] : i32)
				!CHECK: }
				!CHECK: return
				!CHECK: }
				!$omp atomic hint(omp_sync_hint_nonspeculative) seq_cst
				y = 10 + y
				!$omp atomic seq_cst update
				z = y + z
				end program OmpAtomicUpdate


Revision Contents

Diff 439650

flang/lib/Lower/OpenMP.cpp

flang/test/Lower/OpenMP/atomic-update.f90
