This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
include/clang/AST/
-
clang/
-
AST/
5/5
OpenMPClause.h
-
lib/
-
CodeGen/
-
CGOpenMPRuntime.h
57/59
CGOpenMPRuntime.cpp
-
Sema/
15/15
SemaOpenMP.cpp
-
Serialization/
11/11
ASTReader.cpp
2/2
ASTWriter.cpp
-
test/OpenMP/
-
OpenMP/
-
target_update_ast_print.cpp
-
target_update_codegen.cpp
-
target_update_messages.cpp
1/1
target_update_to_messages.cpp

Differential D79972

[OpenMP5.0] map item can be non-contiguous for target update
AbandonedPublic

Authored by cchen on May 14 2020, 3:40 PM.

Download Raw Diff

Details

Reviewers

ABataev
jdoerfert

Summary

In order not to modify the tgt_target_data_update information but still be
able to pass the extra information for non-contiguous map item (offset,
count, and stride for each dimension), this patch overload arg when
the maptype is set as OMP_MAP_DESCRIPTOR. The origin arg is for
passing the pointer information, however, the overloaded arg is an
array of descriptor_dim:

struct descriptor_dim {
  int64_t offset;
  int64_t count;
  int64_t stride
};

and the array size is the same as dimension size. In addition, since we
have count and stride information in descriptor_dim, we can replace/overload the
arg_size parameter by using dimension size.

More details can be found here: https://github.com/chichunchen/openmp-50-design/blob/master/target_update_noncontiguous.pptx

Edit:
The runtime implementation I'm thinking of is to convert the non-contiguous data into several chunks of contiguous.

For example:

int arr[3][3][3];
#pragma omp target update to (arr[1:2][1:2][0:2])

We can visualize the noncontiguous data as below (X is the data we want to transfer, O is the data want don't bother with):

Dim 0 = {Offset: 0, Count: 1, Stride: 4 bytes (int)}
XXO

Dim 1 = {Offset: 1, Count: 2, Stride: 12 bytes (4 * 3 - since Dim 0 has 3 elements)
OOO
XXO
XXO

Dim 2 = {Offset: 1, Count: 2, Stride: 36 bytes (12 * 3 since Dim 1 has 3 elements)
OOO OOO OOO
OOO XXO XXO
OOO XXO XXO

For the visualization, we know that we want to transfer 4 contiguous chunks and the runtime code could be something similar to:

// we expect this loop to transfer 4 contiguous chunks:
// arr[1][1][0:2]
// arr[1][2][0:2]
// arr[2][1][0:2]
// arr[2][2][0:2]
for (int i = Dim[2].offset; i < Dim[2].count; i++) {
  for (int j = Dim[1].offset; j < Dim[1].count; j++) {
    ptr = bast_ptr + Dim[2].stride * i + Dim[1].stride * j + Dim[2].stride * Dim[0].offset;
    size = Dim[0].count * Dim[0].stride;  // we can hoist it I think
    transfer(ptr, size, /*flag or some other stuff...*/);
  }
}

For this design, we can support strides by just adding an extra dimension. For instance:

int arr[5][5][5]
#pragma omp target update to(arr[0:2:2][1:2:1][0:2:2])

Dim 0 = {offset: 0, count: 1, stride: 4 bytes (int) } // the extra dimension for supporting stride
XO

Dim 1 = {offset: 0, count: 3, stride 8 bytes (4 * 2) }
XOXOX

Dim 2 = {offset: 0, count: 2, stride: 40 bytes (8 * 5) }
OOOOO
XOXOX
XOXOX
OOOOO
OOOOO

Dim 3 = {offset: 0, count: 2, stride: 200 bytes (40 * 5) }
...

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

cchen created this revision.May 14 2020, 3:40 PM

Herald added a reviewer: jdoerfert. · View Herald TranscriptMay 14 2020, 3:40 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: cfe-commits, guansong, yaxunl. · View Herald Transcript

Harbormaster failed remote builds in B56801: Diff 264118!May 14 2020, 4:54 PM

Rebase

Harbormaster failed remote builds in B56881: Diff 264266!May 15 2020, 10:51 AM

Remove redundant code

Herald added a subscriber: sstefan1. · View Herald TranscriptMay 21 2020, 11:48 AM

ping

Harbormaster failed remote builds in B57565: Diff 265562!May 21 2020, 1:00 PM

How are you going to pass this non-contiguous data in the runtime? Are you going to map it in a loop or convert this non-contiguous data into the contiguous and map it as a contiguous chunk of data? Your presentation provides interface only interface changes but has nothing about implementation in the runtime.

In D79972#2049838, @ABataev wrote:

How are you going to pass this non-contiguous data in the runtime? Are you going to map it in a loop or convert this non-contiguous data into the contiguous and map it as a contiguous chunk of data? Your presentation provides interface only interface changes but has nothing about implementation in the runtime.

Hi Alexey, thanks for asking. The runtime implementation I'm thinking of is to convert the non-contiguous data into several chunks of contiguous.

For example:

int arr[3][3][3];

#pragma omp target update to (arr[1:2][1:2][0:2])

We can visualize the noncontiguous data as below (X is the data we want to transfer, O is the data want don't bother with):

Dim 0 = {Offset: 0, Count: 1, Stride: 4bytes (int)}
XXO

Dim 1 = {Offset: 1, Count: 2, Stride: 12bytes (4 * 3 - since Dim 0 has 3 elements)
OOO
XXO
XXO

Dim 2 = {Offset: 1, Count: 2, Stride: 36 bytes (12 * 3 since Dim 1 has 3 elements)
OOO
OOO
OOO
\\\\\
OOO
XXO
XXO
\\\\\
OOO
XXO
XXO

For the visualization, we know that we want to transfer 4 contiguous chunks and the runtime code could be something similar to:

// we expect this loop to transfer 4 contiguous chunks:
// arr[1][1][0:2]
// arr[1][2][0:2]
// arr[2][1][0:2]
// arr[2][2][0:2]
for (int i = Dim[2].offset; i < Dim[2].count; i++) {
  for (int j = Dim[1].offset; j < Dim[1].count; j++) {
    ptr = bast_ptr + Dim[2].stride * i + Dim[1].stride * j + Dim[2].stride * Dim[0].offset;
    size = Dim[0].count * Dim[0].stride;  // we can hoist it I think
    transfer(ptr, size, /*flag or some other stuff...*/);
  }
}

Is my guess correct that for OpenMP >= 50 for target update directive we always emit possibly non-continuous runtime calls?

clang/include/clang/AST/OpenMPClause.h
5346	Why do you need this bool flag? Seems to me, it is set to `true` always if `OpenMP >= 50 && Directive == OMPD_target_update`. Could check it during the codegen rather than introduce this new extra data here?

cchen marked an inline comment as done.May 27 2020, 2:38 PM

cchen added inline comments.

clang/include/clang/AST/OpenMPClause.h
5346	You're right, I shouldn't add bool here since we only need it in OMPToClause and OMPFromClause. I was adding it since I'm assuming they should have the same type for the inherited TrailingObject.

cchen edited the summary of this revision. (Show Details)May 27 2020, 2:46 PM

In D79972#2058516, @ABataev wrote:

Is my guess correct that for OpenMP >= 50 for target update directive we always emit possibly non-continuous runtime calls?

My intent is to emit possibly non-contiguous runtime calls only if the analysis in Sema set the IsNonContiguous flag to true.

cchen edited the summary of this revision. (Show Details)May 27 2020, 2:51 PM

In D79972#2058555, @cchen wrote:

In D79972#2058516, @ABataev wrote:

Is my guess correct that for OpenMP >= 50 for target update directive we always emit possibly non-continuous runtime calls?

My intent is to emit possibly non-contiguous runtime calls only if the analysis in Sema set the IsNonContiguous flag to true.

But this analysis only checks for the directive and the version,nothing else.

In D79972#2058608, @ABataev wrote:

In D79972#2058555, @cchen wrote:

In D79972#2058516, @ABataev wrote:

Is my guess correct that for OpenMP >= 50 for target update directive we always emit possibly non-continuous runtime calls?

My intent is to emit possibly non-contiguous runtime calls only if the analysis in Sema set the IsNonContiguous flag to true.

But this analysis only checks for the directive and the version,nothing else.

The context of the checks for the directive and version:

bool NotWhole =
  checkArrayExpressionDoesNotReferToWholeSize(SemaRef, OASE, CurType);
bool NotUnity =
  checkArrayExpressionDoesNotReferToUnitySize(SemaRef, OASE, CurType);

if (AllowWholeSizeArraySection) {
  // Any array section is currently allowed. Allowing a whole size array
  // section implies allowing a unity array section as well.
  //
  // If this array section refers to the whole dimension we can still
  // accept other array sections before this one, except if the base is a
  // pointer. Otherwise, only unitary sections are accepted.
  if (NotWhole || IsPointer)
    AllowWholeSizeArraySection = false;
} else if (DKind == OMPD_target_update &&
           SemaRef.getLangOpts().OpenMP >= 50) {
  IsNonContiguousRef = true;
} else if (AllowUnitySizeArraySection && NotUnity) {
  // A unity or whole array section is not allowed and that is not
  // compatible with the properties of the current array section.
  SemaRef.Diag(
    ELoc, diag::err_array_section_does_not_specify_contiguous_storage)
    << OASE->getSourceRange();
  return false;
}

The original analysis checks for non-contiguous by finding if there is more than one "array-section" expression with length greater than one. Therefore, I added my check there to allow more than one array-section with length greater than one by depending on the existing analysis (and also set IsNonContiguous to true so that we can pass it to codegen rather than doing analysis in codegen). This change allows me to pass all the existing lit test but still emit the "non-contiguous" runtime.

cchen marked an inline comment as done.May 27 2020, 3:48 PM

cchen added inline comments.

clang/lib/Sema/SemaOpenMP.cpp
16621	@ABataev , I guess you're saying the condition should be `!AllowWholeSizeArraySection && DKind == OMPD_target_update && SemaRef.getLangOpts().OpenMP >= 50`?

Did you think about implementing it in the compiler instead of the runtime?

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7624–7627	Do you really need to count `DimSize` for array shaping operators and array subscript expressions? I don't see tests for it.
clang/lib/Sema/SemaOpenMP.cpp
16621	No, what I want is to try to simplify the code. I see now why do you need this flag. I'm just thinking can we avoid adding this flag to the clause and save some mem space?

Fix based on feedback

cchen marked 2 inline comments as done.May 28 2020, 9:54 AM

cchen added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7624–7627	You're right, I don't need to count `DimSize` for array shaping and array subscript.
clang/lib/Sema/SemaOpenMP.cpp
16621	But we also don't want to do the analysis in codegen I guess? Also, if we emit non-contiguous runtime for every target update call, we need to change tons of stuff (tons of lit tests, runtime implementation, etc...).

ABataev added inline comments.May 28 2020, 10:09 AM

clang/lib/Sema/SemaOpenMP.cpp
16621	Maybe make it a part of `MappableComponent`, if possible, and put it into `PointerIntPair<Expr *, 1, bool> AssociatedExpression;`?

Harbormaster failed remote builds in B58252: Diff 266921!May 28 2020, 10:57 AM

Use PointerIntPair to pass non-contiguous information in AST
Error out in Sema if we don't have enough size information for cases involving pointers
Allows *arr[N][M] since we don't need size information for the last dimension
Add more test cases

Harbormaster failed remote builds in B58670: Diff 267725!Jun 1 2020, 2:40 PM

Still: Did you think about implementing it in the compiler instead of the runtime?

clang/include/clang/AST/OpenMPClause.h
4752–4753	I would suggest to pass `Expr *` and `bool` as separate parameters here rather than as `PointerIntPair`
4766	`isNonContiguous()`
clang/lib/Sema/SemaOpenMP.cpp
16442	Add default initializer
16623–16627	Remove braces here, they are not needed.
clang/lib/Serialization/ASTWriter.cpp
6584	There is a member function `writeBool`
6609	Same, use `writeBool`

In D79972#2068976, @ABataev wrote:

Still: Did you think about implementing it in the compiler instead of the runtime?

I'm not sure I understand your question, which part of code are you asking?
The main work compiler needs to do is to send the {offset, count, stride} struct to runtime.

In D79972#2069322, @cchen wrote:

In D79972#2068976, @ABataev wrote:

Still: Did you think about implementing it in the compiler instead of the runtime?

I'm not sure I understand your question, which part of code are you asking?
The main work compiler needs to do is to send the {offset, count, stride} struct to runtime.

I mean did you think about calling __tgt_target_data_update function in a loop in the compiler-generated code instead of putting it into the runtime?

In D79972#2069358, @ABataev wrote:

In D79972#2069322, @cchen wrote:

In D79972#2068976, @ABataev wrote:

Still: Did you think about implementing it in the compiler instead of the runtime?

I'm not sure I understand your question, which part of code are you asking?
The main work compiler needs to do is to send the {offset, count, stride} struct to runtime.

I mean did you think about calling __tgt_target_data_update function in a loop in the compiler-generated code instead of putting it into the runtime?

Oh, I would prefer to call tgt_target_data_update once in the compiler and I'm also doing it now.

In D79972#2069366, @cchen wrote:

In D79972#2069358, @ABataev wrote:

In D79972#2069322, @cchen wrote:

In D79972#2068976, @ABataev wrote:

Still: Did you think about implementing it in the compiler instead of the runtime?

I'm not sure I understand your question, which part of code are you asking?
The main work compiler needs to do is to send the {offset, count, stride} struct to runtime.

I mean did you think about calling __tgt_target_data_update function in a loop in the compiler-generated code instead of putting it into the runtime?

Oh, I would prefer to call tgt_target_data_update once in the compiler and I'm also doing it now.

I was not quite correct. What I mean, is to generate the array with the array section as VLA in the compiler, and fill it in the loop generated by the compiler for non-contiguous sections but not in the runtime?
Say, we have the code:

int arr[3][3]
...
 #pragma omp update to(arr[1:2][1:2]

In this case, we're going to transfer the next elements:

000
0xx
0xx

In the compiler-generated code we emit something like this:

void *bptr[<n>];
void *ptr[<n>];
int64 sizes[<n>];
int64 maptypes[<n>];
for (int i = 0; i < <n>; ++i) {
  bptr[i] = &arr[1+i][1];
  ptr[i] = &arr[1+i][1];
  sizes[i] = ...;'
  maptypes[i] = ...;
}
call void @__tgt_target_data_update(i64 -1, i32 <n>, bptr, ptr, sizes, maptypes);

With this solution, you won't need to modify the runtime and add a new mapping flag.

Fix based on feedback

In D79972#2069435, @ABataev wrote:
In D79972#2069366, @cchen wrote:

In D79972#2069358, @ABataev wrote:

In D79972#2069322, @cchen wrote:

In D79972#2068976, @ABataev wrote:

Still: Did you think about implementing it in the compiler instead of the runtime?

I'm not sure I understand your question, which part of code are you asking?
The main work compiler needs to do is to send the {offset, count, stride} struct to runtime.

I mean did you think about calling __tgt_target_data_update function in a loop in the compiler-generated code instead of putting it into the runtime?

Oh, I would prefer to call tgt_target_data_update once in the compiler and I'm also doing it now.

I was not quite correct. What I mean, is to generate the array with the array section as VLA in the compiler, and fill it in the loop generated by the compiler for non-contiguous sections but not in the runtime?
Say, we have the code:
int arr[3][3]
...
 #pragma omp update to(arr[1:2][1:2]
In this case, we're going to transfer the next elements:
000
0xx
0xx
In the compiler-generated code we emit something like this:
void *bptr[<n>];
void *ptr[<n>];
int64 sizes[<n>];
int64 maptypes[<n>];
for (int i = 0; i < <n>; ++i) {
  bptr[i] = &arr[1+i][1];
  ptr[i] = &arr[1+i][1];
  sizes[i] = ...;'
  maptypes[i] = ...;
}
call void @__tgt_target_data_update(i64 -1, i32 <n>, bptr, ptr, sizes, maptypes);
With this solution, you won't need to modify the runtime and add a new mapping flag.

For my current implementation, we have discussed in the bi-weekly meeting several weeks back, and there was a general consensus that it was an acceptable approach.

The major advantage of sending a descriptor to runtime can be elaborated in the following example:

#define N 10000
int a[N][2];
…
#pragma amp target update to (a[0:N][0:1])

This would require passing through O(N) entries in the tgt_target_data_update call, or 10000 entries. The current implementation only require a descriptor with 2 entries. I think this could be a real concern -
splitting out the transfers in compiler-generated code results in a list containing one entry per non-contiguous chunk (easily hitting scaling issues), while the descriptor approach is bounded by the number of dimensions.
That seems like a pretty compelling reason to use the descriptor - it’s much more space efficient.

Also, the descriptor idea is very similar to how Cray supported Fortran dope vectors for years (we send in a pointer to a dope vector rather than a pointer to the data, and a flag to indicate it’s a dope vector, and the runtime library handles it as a dope vector).
I think the runtime library changes will not be very extensive or difficult at all and we’re very willing to implement the runtime for non-contiguous.

Harbormaster failed remote builds in B58829: Diff 268009!Jun 2 2020, 4:29 PM

ping

Do you have a test for mapping of something like arr[0][:n], where the base is an array subscript and the remaining part is an array section?

clang/include/clang/AST/OpenMPClause.h
4756–4757	I think you can initialize `AssociatedExpressionNonContiguousPr` using just `AssociatedExpressionNonContiguousPr(AssociatedExpression, IsNonContiguous)` form, no?
clang/lib/CodeGen/CGOpenMPRuntime.cpp
7114	Restore original formatting
7602–7603	Better to convert it to `!IsNonContiguous && isFinalArraySectionExpression(I->getAssociatedExpression())`.
7622	Use prefix form `++DimSize`.
7679	No need for parameter name comment here, it is required only if the `true\|false` constants are used
7736	Same, comment not required
7913–7927	Can we merge the functionality in this new function with the existing ones somehow? It is not the best idea to duplicate functionality using copy-paste if any.
8583–8585	Why removed the comment?
8950	Same question as before - can we merge this functionality with the existing functions?
clang/lib/Sema/SemaOpenMP.cpp
18468–18473	Use `.emplace_back(SimpleRefExpr, D, false);`
clang/lib/Serialization/ASTReader.cpp
12514–12516	`.emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);`
12632–12634	Same, use `emplace_back()`
12683–12684	Same, use `emplace_back()`
12733–12734	Same, use `emplace_back()`
12819–12820	Same, use `emplace_back()`
clang/test/OpenMP/target_update_to_messages.cpp
147	Delete this extra line

Fix based on feedback

In D79972#2082017, @ABataev wrote:

Do you have a test for mapping of something like arr[0][:n], where the base is an array subscript and the remaining part is an array section?

I'm not having it right now, but it seems like if the base is an array subscript and the remaining part is an array section, then this map-item will always be contiguous, and will not trigger my code in Codegen. I can still add a test for Sema though.

cchen marked 26 inline comments as done.Jun 9 2020, 2:17 PM

Harbormaster failed remote builds in B59694: Diff 269663!Jun 9 2020, 3:31 PM

ABataev added inline comments.Jun 10 2020, 8:51 AM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7046–7049	I'm not sure about the value of this flag. If I do recall it correctly, this value might be used for something different by XL compiler, for example. Maybe use some other value, maybe high bits? It is a kind of service flag, not data mapping attribute, so better to move it to high bits (the bit before OMP_MAP_MEMBER_OF maybe?).
7806–7807	Use range-based loop, if possible.
7809–7811	Do you really need to analyze array subscript expressions here? I though that we should analyze only array sections, no?
7868	Same, try to use range-based loop, if possible.
7873	Same question about array subscript expressions.
8218–8223	Just `generateInfoForComponentList( L.MapType, L.MapModifiers, L.Components, CurBasePointers, CurPointers, CurSizes, CurTypes, CurDims, PartialStruct, IsFirstComponentList, L.IsImplicit, /OverlappedElements=/llvm::None, L.Components.back().isNonContiguous(), &CurOffsets, &CurCounts, &CurStrides);`
8768–8771	Can we encapsulate these new data into `CGOpenMPRuntime::TargetDataInfo`?

Fix based on feedback

Harbormaster completed remote builds in B59871: Diff 269961.Jun 10 2020, 3:02 PM

ABataev added inline comments.Jun 11 2020, 12:12 PM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7327–7330	I would prefer to pack these 4 params into a single parameter (a struct). Also, can we put `Dims` parameter into the list of the optional parameters?
7805	Expand `auto` here to a real type
7821	What if the base is a pointer, not an array?
7831–7838	The code for `SizeV` must be under the control of the next `if`: if (DimSizes.size() < Components.size() - 1) { .... }
7834	Create directly as of `CGF.Int64Ty` type.
7859	Expand `auto` here to a real type

ABataev added inline comments.Jun 11 2020, 12:12 PM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7807	Can we have anything else except for array section here? If not, use just `cast`. If yes, use `continue` to simplify complexity: if (!OASE) continue; ...
7861–7864	Can we have anything else except for array section here? If not, use just `cast`. If yes, use `continue` to simplify complexity: if (!OASE) continue; ...
7872–7873	Do you really to pass real offsets here? Can we use pointers instead?

cchen marked 4 inline comments as done.Jun 12 2020, 3:48 PM

cchen added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7821	The `if (ElementType)` condition only push back stride when base is not pointer. I'm now allowing one dimension size to be unknown (pointer as base) and sema has analysis to check if more than one indirection as base. My last codegen test case is for testing pointer as base.
7831–7838	I don't think I understand this one. Why do you remove SizeV in the if condition?
7834	Doing this I'll get assertion error in this exact line if on a 32-bits target.
7872–7873	Do you mean I should set the type of Offset to Expr*?

Fix based on feedback

Harbormaster failed remote builds in B60181: Diff 270536!Jun 12 2020, 4:03 PM

ABataev added inline comments.Jun 15 2020, 11:53 AM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7327	Can we encapsulate `Dims` into `StructNonContiguousInfo`?
7831–7832	If only `CAT` or `VAT` is allowed, then transform this if into: else { assert(VAT&& ...);
7831–7838	This is for `SizeV`. You don't use it if `DimSizes.size() < Components.size() - 1` is `false`, looks like memory leak.
7834	Hmm, why, can you investigate?
7869–7872	Can we have anything else except for array section here? If not, use just cast. If yes, use continue to simplify complexity: if (!OASE) continue; ...
7872–7873	Currently, you're passing offsets to the runtime. Can we pass pointers instead? I mean, for `a[b]` you pass `b` to the runtime, can we pass `&a[b]` instead?
7911	Avoid expressions with some side effects, like `*DI++`

Resolve issues

cchen marked 3 inline comments as done.Jun 15 2020, 3:12 PM

cchen added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7834	My comment was not accurate, I've updated it. What I want to convey is that we can only have `CAT, VAT, or pointer` here, since analysis in Sema has a restriction for it. (SemaOpenMP line 16623)
7869–7872	Not sure about this one, I've added: if (!OASE) continue; ...
7872–7873	Yes, I'm fine either passing index or passing address, though I'm curious why you're recommending passing address.

Harbormaster failed remote builds in B60378: Diff 270880!Jun 15 2020, 4:03 PM

ABataev added inline comments.Jun 16 2020, 5:53 AM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7834	It does not relate to the comments thread but I got it. Anyway, try to investigate why the compiler crashes if you try to cr4eate a constant ща ]СПАюШте64Ен] directly.
7872–7873	It is going to simplify the codegen. Currently, to get the offset, you need to dig through all the elements of the array section. If, instead, you use the pointers, you would not need to do this and you can rely on something like `CGF.EmitArraySectionLValue()`. At least, I hope so.
7903	The check is not required, you already checked that the expression must be array section only.

cchen marked 2 inline comments as done.Jun 16 2020, 9:34 AM

cchen added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7834	I'll investigate it, thanks.
7872–7873	After discussed with my colleagues, I think passing relative offset makes more sense. For a 1-dim array, storing the offset as a pointer could work, but it seems strange to me to store as a pointer when there are 2+ dimensions with multiple disjoint chunks of memory because the pointer can only point to the offset for the first chunk. That is, a pointer would refer to an absolute location in a single chunk, whereas the offset is relative to the start of any chunk. For example: int a[4][4]; #pragma omp target update to(a[1:2][1:2]) This is two disjoint chunks of memory: XXXX XOOX XOOX XXXX The offset for the outer dimension could be store as a pointer, since there is only one instance of that dimension: Dim1: Offset=&a[1] But, the inner dimension is "instantiated" twice, once for each element in the outer dimension. So, there are really two absolute pointers, depending on which instance (element in the outer dimension) you're talking about: Dim2: Offset=&a[1][1] Dim2: Offset=&a[2][1] We could set the policy that the absolute offset would always be expressed as the offset in the first instance, but then wouldn't we need to refer to that location when computing the offset for all of the other instances? That seems unintuitive to me, and potentially complicates the implementation. The relative offset makes a lot more senes to me - for a starting point, what relative offset is needed for each dimension. The starting point for the outermost dimension does require the base address, but all inner dimensions have a variable starting pointer based on which element in the outer dimensions you're currently looking at.

Fix Int64Ty issue (The bitNum of APInt I used before is 32)

Harbormaster failed remote builds in B60535: Diff 271183!Jun 16 2020, 1:46 PM

ABataev added inline comments.Jun 16 2020, 2:35 PM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7920	Use preincrement
8224	No need to add `/OverlappedElements=/llvm::None` here, it is default value.
8878–8880	Better just to have something like this: if (!IsNonContiguous \|\| Info.Offsets.empty() \|\| Info.NumberOfPtrs == 0) return; ...

Fix based on feedback

Harbormaster failed remote builds in B60555: Diff 271221!Jun 16 2020, 3:24 PM

cchen marked 28 inline comments as done.Jun 17 2020, 12:50 PM

How do you plan to support
#pragma omp target update to (arr[1:2][1:2][0:2], x, b[1:5][0:2])
Are you going to split this into 3 updates since your are using the arg fields.

In D79972#2104854, @RaviNarayanaswamy wrote:

How do you plan to support
#pragma omp target update to (arr[1:2][1:2][0:2], x, b[1:5][0:2])
Are you going to split this into 3 updates since your are using the arg fields.

There's only one runtime call for your case. and args will be { descriptor_1, x, descriptor_2 }, where descriptor_1 will be { { 1, 2, 80 }, { 1, 2, 20 }, { 0, 2, 4 } }, descriptor_2 will be { { 1, 5, 16 }, { 0, 2, 4 } }. There's analysis in Sema that detecting if the item is non-contiguous or not and codegen only generate descriptor for non-contiguous item.

Updated test for clarification

In D79972#2104854, @RaviNarayanaswamy wrote:

How do you plan to support
#pragma omp target update to (arr[1:2][1:2][0:2], x, b[1:5][0:2])
Are you going to split this into 3 updates since your are using the arg fields.

I have added a test basically base on the case in your comment (CK19 in target_update_codegen.cpp). Thanks.

Harbormaster failed remote builds in B61314: Diff 272564!Jun 22 2020, 4:10 PM

ping

ABataev added inline comments.Jun 25 2020, 10:08 AM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7842–7844	No need for braces here
8935	`C.getTypeAlignInChars(C.VoidPtrTy)`->`CGF.getPointerAlign()`
10317–10320	Better just to pass `Info.Offsets`, `Info.Counts` and `Info.Strides` as arguments to `generateAllInfo()` function and do not create local copies at all.
10570–10573	Same, pass the fields as arguments instead.
clang/lib/Serialization/ASTReader.cpp
12515–12516	Still calling an extra constructor here, just `.emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);`
12633–12635	Just `Components.emplace_back(AssociatedExprPr, AssociatedDecl, IsNonContiguous);`
12684–12685	`.emplace_back(AssociatedExprPr, AssociatedDecl, IsNonContiguous);`
12734–12735	.`emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);`
12776–12778	`.emplace_back(AssociatedExpr, AssociatedDecl, /IsNonContiguous/ false);`
12819–12821	`.emplace_back(AssociatedExpr, AssociatedDecl, /IsNonContiguous=/false));`

Fix coding style

ABataev added inline comments.Jun 25 2020, 12:47 PM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
8761–8763	Do you really need to pass `Dims` here if you have `Dims` data member in `Info` parameter? Why you can't use `Info.Dims` instead?
8881–8909	Maybe worth it to outline it into a separate function to reduce code size and the complexity of this function? And just call this new function here.
clang/lib/Sema/SemaOpenMP.cpp
16624	Better to use integer value as selectors, not boolean.
18519–18523	`.emplace_back(SimpleRefExpr, D, /IsNonContiguous=/false);`
18588	Add a comment for `false` argument with the name of parameter.

Harbormaster failed remote builds in B61801: Diff 273484!Jun 25 2020, 1:06 PM

cchen marked an inline comment as done.Jun 25 2020, 1:14 PM

cchen added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
8761–8763	I think I haven't added Dims in TargetDataInfo atm, I'll add into it and then use it via Info.

cchen marked an inline comment as done.Jun 25 2020, 1:28 PM

cchen added inline comments.

clang/lib/Sema/SemaOpenMP.cpp
16624	The selector for `err_omp_section_length_undefined` is a bool value. (true for unknown bound false for not a array type, so always be true here). Do you mean that I need to create a new kind of diagnosis message here and use integer as selectors?

ABataev added inline comments.Jun 25 2020, 1:41 PM

clang/lib/Sema/SemaOpenMP.cpp
16624	No, it is an integer, starts from `0`

Fix based on feedback

ABataev added inline comments.Jun 25 2020, 2:32 PM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
10315–10316	Better to pass `Info` here directly.

Harbormaster failed remote builds in B61823: Diff 273517!Jun 25 2020, 2:45 PM

Pass Info directly

Harbormaster failed remote builds in B61831: Diff 273529!Jun 25 2020, 3:51 PM

ABataev added inline comments.Jun 26 2020, 6:04 AM

clang/lib/Sema/SemaOpenMP.cpp
16664	`/IsNonContiguous=/false`
16684	`/IsNonContiguous=/false`
18469–18473	`.emplace_back(SimpleRefExpr, D, /IsNonContiguous=/false);`

Fix based on feedback

Harbormaster failed remote builds in B61944: Diff 273747!Jun 26 2020, 8:45 AM

Rebase and resolve conflictions

Harbormaster failed remote builds in B61959: Diff 273778!Jun 26 2020, 12:02 PM

cchen marked 21 inline comments as done.Jun 29 2020, 1:27 PM

cchen marked an inline comment as done.Jun 30 2020, 9:52 AM

cchen added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7046–7049	Hi @ABataev, is there any place I can find which value has been used for lower bits (like 0x800, 0x1000)?

ABataev added a subscriber: kkwli0.Jun 30 2020, 10:26 AM

ABataev added inline comments.

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7046–7049	I rather doubt. You can try to ask @kkwli0

kkwli0 added inline comments.Jun 30 2020, 12:49 PM

clang/lib/CodeGen/CGOpenMPRuntime.cpp
7046–7049	We are using 0x800. I think your current choice should be fine.

@ABataev , I'm considering emitting an extra dimension for a non-contiguous descriptor to support stride in this patch (stride = 1 in array section is just a special case for computing stride, however, the formula computing stride do not change). Do you think I should do it in this patch?

Computing of stride after support stride in array section:

int arr[5][5][5];
#pragma omp target update to(arr[0:2:2][1:2:1][0:2:2]

D0: { offset = 0, count = 1, stride = 4 }                                           // offset, count, dimension size always be 0, 1, 1 for this extra dimension, stride is the unit size
D1: { offset = 0, count = 2, stride = 4 * 1 * 2 = 8 }                        // stride = unit size * (production of dimension size of D0) * D1.stride = 4 * 1 * 2 = 8
D2: { offset = 0, count = 1, stride = 4 * (1 * 5) * 1 = 20  }             // stride = unit size * (production of dimension size of D0, D1) * D2.stride = 4 * 5 * 1 = 20
D3: { offset = 0, count = 2, stride = 4 * (1 * 5 * 5) * 2 = 200 }      // stride = unit size * (production of dimension size of D0, D1, D2) * D3.stride = 4 * 25 * 2 = 200

For the case in this patch (stride = 1), we can use the same formula for computing stride with extra dimension:

int arr[5][5][5];
#pragma omp target update to(arr[0:2][1:2][0:2]

D0: { offset = 0, count = 1, stride = 4 }                                          // offset, count, dimension size always be 0, 1, 1 for this extra dimension, stride is the unit size
D1: { offset = 0, count = 2, stride = 4 * 1 * 1 = 4 }                        // stride = unit size * (production of dimension size of D0) * D1.stride = 4 * 1 * 1 = 4
D2: { offset = 0, count = 1, stride = 4 * (1 * 5) * 1 = 20  }            // stride = unit size * (production of dimension size of D0, D1) * D2.stride = 4 * 5 * 1 = 20
D3: { offset = 0, count = 2, stride = 4 * (1 * 5 * 5) * 1 = 100 }     // stride = unit size * (production of dimension size of D0, D1, D2) * D3.stride = 4 * 25 * 1 = 100

The extra dimension does not affect the runtime implementation at all since runtime will try to merge inner dimensions if they are contiguous. Take the above case for example (arr[0:2][1:2][0:2]):
The product of count and stride for D0 is 4 which is the same as the stride of D1, therefore, runtime just ignores D0.

In D79972#2124108, @cchen wrote:
@ABataev , I'm considering emitting an extra dimension for a non-contiguous descriptor to support stride in this patch (stride = 1 in array section is just a special case for computing stride, however, the formula computing stride do not change). Do you think I should do it in this patch?

Computing of stride after support stride in array section:
int arr[5][5][5];
#pragma omp target update to(arr[0:2:2][1:2:1][0:2:2]

D0: { offset = 0, count = 1, stride = 4 }                                           // offset, count, dimension size always be 0, 1, 1 for this extra dimension, stride is the unit size
D1: { offset = 0, count = 2, stride = 4 * 1 * 2 = 8 }                        // stride = unit size * (production of dimension size of D0) * D1.stride = 4 * 1 * 2 = 8
D2: { offset = 0, count = 1, stride = 4 * (1 * 5) * 1 = 20  }             // stride = unit size * (production of dimension size of D0, D1) * D2.stride = 4 * 5 * 1 = 20
D3: { offset = 0, count = 2, stride = 4 * (1 * 5 * 5) * 2 = 200 }      // stride = unit size * (production of dimension size of D0, D1, D2) * D3.stride = 4 * 25 * 2 = 200
For the case in this patch (stride = 1), we can use the same formula for computing stride with extra dimension:
int arr[5][5][5];
#pragma omp target update to(arr[0:2][1:2][0:2]

D0: { offset = 0, count = 1, stride = 4 }                                          // offset, count, dimension size always be 0, 1, 1 for this extra dimension, stride is the unit size
D1: { offset = 0, count = 2, stride = 4 * 1 * 1 = 4 }                        // stride = unit size * (production of dimension size of D0) * D1.stride = 4 * 1 * 1 = 4
D2: { offset = 0, count = 1, stride = 4 * (1 * 5) * 1 = 20  }            // stride = unit size * (production of dimension size of D0, D1) * D2.stride = 4 * 5 * 1 = 20
D3: { offset = 0, count = 2, stride = 4 * (1 * 5 * 5) * 1 = 100 }     // stride = unit size * (production of dimension size of D0, D1, D2) * D3.stride = 4 * 25 * 1 = 100
The extra dimension does not affect the runtime implementation at all since runtime will try to merge inner dimensions if they are contiguous. Take the above case for example (arr[0:2][1:2][0:2]):
The product of count and stride for D0 is 4 which is the same as the stride of D1, therefore, runtime just ignores D0.

You can do this patch. But at first, you need to commit the runtime part of the patch that supports it, and the part that introduces stride support.

Created a new patch with the support for stride: https://reviews.llvm.org/D84192.

ABataev mentioned this in D82245: [libomptarget] Add support for target update non-contiguous.Jul 31 2020, 12:46 PM

Revision Contents

Path

Size

clang/

include/

clang/

AST/

OpenMPClause.h

20 lines

lib/

CodeGen/

CGOpenMPRuntime.h

6 lines

CGOpenMPRuntime.cpp

336 lines

Sema/

SemaOpenMP.cpp

56 lines

Serialization/

ASTReader.cpp

34 lines

ASTWriter.cpp

2 lines

test/

OpenMP/

target_update_ast_print.cpp

109 lines

target_update_codegen.cpp

278 lines

target_update_messages.cpp

22 lines

target_update_to_messages.cpp

4 lines

Diff 269961

clang/include/clang/AST/OpenMPClause.h

This file is larger than 256 KB, so syntax highlighting is disabled by default.

	Show All 21 Lines
	#include "clang/AST/NestedNameSpecifier.h"			#include "clang/AST/NestedNameSpecifier.h"
	#include "clang/AST/Stmt.h"			#include "clang/AST/Stmt.h"
	#include "clang/AST/StmtIterator.h"			#include "clang/AST/StmtIterator.h"
	#include "clang/Basic/LLVM.h"			#include "clang/Basic/LLVM.h"
	#include "clang/Basic/OpenMPKinds.h"			#include "clang/Basic/OpenMPKinds.h"
	#include "clang/Basic/SourceLocation.h"			#include "clang/Basic/SourceLocation.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
	#include "llvm/ADT/MapVector.h"			#include "llvm/ADT/MapVector.h"
				#include "llvm/ADT/PointerIntPair.h"
	#include "llvm/ADT/SmallVector.h"			#include "llvm/ADT/SmallVector.h"
	#include "llvm/ADT/iterator.h"			#include "llvm/ADT/iterator.h"
	#include "llvm/ADT/iterator_range.h"			#include "llvm/ADT/iterator_range.h"
	#include "llvm/Frontend/OpenMP/OMPConstants.h"			#include "llvm/Frontend/OpenMP/OMPConstants.h"
	#include "llvm/Frontend/OpenMP/OMPContext.h"			#include "llvm/Frontend/OpenMP/OMPContext.h"
	#include "llvm/Support/Casting.h"			#include "llvm/Support/Casting.h"
	#include "llvm/Support/Compiler.h"			#include "llvm/Support/Compiler.h"
	#include "llvm/Support/TrailingObjects.h"			#include "llvm/Support/TrailingObjects.h"
	▲ Show 20 Lines • Show All 4,694 Lines • ▼ Show 20 Lines
	public:			public:
	/// Class that represents a component of a mappable expression. E.g.			/// Class that represents a component of a mappable expression. E.g.
	/// for an expression S.a, the first component is a declaration reference			/// for an expression S.a, the first component is a declaration reference
	/// expression associated with 'S' and the second is a member expression			/// expression associated with 'S' and the second is a member expression
	/// associated with the field declaration 'a'. If the expression is an array			/// associated with the field declaration 'a'. If the expression is an array
	/// subscript it may not have any associated declaration. In that case the			/// subscript it may not have any associated declaration. In that case the
	/// associated declaration is set to nullptr.			/// associated declaration is set to nullptr.
	class MappableComponent {			class MappableComponent {
	/// Expression associated with the component.			/// Pair of Expression and Non-contiguous pair associated with the
	Expr *AssociatedExpression = nullptr;			/// component.
				llvm::PointerIntPair<Expr *, 1, bool> AssociatedExpressionNonContiguousPr;

	/// Declaration associated with the declaration. If the component does			/// Declaration associated with the declaration. If the component does
	/// not have a declaration (e.g. array subscripts or section), this is set			/// not have a declaration (e.g. array subscripts or section), this is set
	/// to nullptr.			/// to nullptr.
	ValueDecl *AssociatedDeclaration = nullptr;			ValueDecl *AssociatedDeclaration = nullptr;

	public:			public:
	explicit MappableComponent() = default;			explicit MappableComponent() = default;
	explicit MappableComponent(Expr *AssociatedExpression,			explicit MappableComponent(Expr *AssociatedExpression,
	ValueDecl *AssociatedDeclaration)			ValueDecl *AssociatedDeclaration,
				ABataevUnsubmitted Done Reply Inline Actions I would suggest to pass `Expr ` and `bool` as separate parameters here rather than as `PointerIntPair` ABataev:* I would suggest to pass `Expr *` and `bool` as separate parameters here rather than as…
	: AssociatedExpression(AssociatedExpression),			bool IsNonContiguous)
				: AssociatedExpressionNonContiguousPr(AssociatedExpression,
				IsNonContiguous),
	AssociatedDeclaration(			AssociatedDeclaration(
				ABataevUnsubmitted Done Reply Inline Actions I think you can initialize `AssociatedExpressionNonContiguousPr` using just `AssociatedExpressionNonContiguousPr(AssociatedExpression, IsNonContiguous)` form, no? ABataev: I think you can initialize `AssociatedExpressionNonContiguousPr` using just…
	AssociatedDeclaration			AssociatedDeclaration
	? cast<ValueDecl>(AssociatedDeclaration->getCanonicalDecl())			? cast<ValueDecl>(AssociatedDeclaration->getCanonicalDecl())
	: nullptr) {}			: nullptr) {}

	Expr *getAssociatedExpression() const { return AssociatedExpression; }			Expr *getAssociatedExpression() const {
				return AssociatedExpressionNonContiguousPr.getPointer();
				}

				bool isNonContiguous() const {
				ABataevUnsubmitted Done Reply Inline Actions `isNonContiguous()` ABataev: `isNonContiguous()`
				return AssociatedExpressionNonContiguousPr.getInt();
				}

	ValueDecl *getAssociatedDeclaration() const {			ValueDecl *getAssociatedDeclaration() const {
	return AssociatedDeclaration;			return AssociatedDeclaration;
	}			}
	};			};

	// List of components of an expression. This first one is the whole			// List of components of an expression. This first one is the whole
	// expression and the last one is the base expression.			// expression and the last one is the base expression.
	▲ Show 20 Lines • Show All 561 Lines • ▼ Show 20 Lines
	/// In this example directive '#pragma omp target' has clause 'map'			/// In this example directive '#pragma omp target' has clause 'map'
	/// with the variables 'a' and 'b'.			/// with the variables 'a' and 'b'.
	class OMPMapClause final : public OMPMappableExprListClause<OMPMapClause>,			class OMPMapClause final : public OMPMappableExprListClause<OMPMapClause>,
	private llvm::TrailingObjects<			private llvm::TrailingObjects<
	OMPMapClause, Expr , ValueDecl , unsigned,			OMPMapClause, Expr , ValueDecl , unsigned,
	OMPClauseMappableExprCommon::MappableComponent> {			OMPClauseMappableExprCommon::MappableComponent> {
	friend class OMPClauseReader;			friend class OMPClauseReader;
	friend OMPMappableExprListClause;			friend OMPMappableExprListClause;
	friend OMPVarListClause;			friend OMPVarListClause;
				ABataevUnsubmitted Done Reply Inline Actions Why do you need this bool flag? Seems to me, it is set to `true` always if `OpenMP >= 50 && Directive == OMPD_target_update`. Could check it during the codegen rather than introduce this new extra data here? ABataev: Why do you need this bool flag? Seems to me, it is set to `true` always if `OpenMP >= 50 &&…
				cchenAuthorUnsubmitted Done Reply Inline Actions You're right, I shouldn't add bool here since we only need it in OMPToClause and OMPFromClause. I was adding it since I'm assuming they should have the same type for the inherited TrailingObject. cchen: You're right, I shouldn't add bool here since we only need it in OMPToClause and OMPFromClause.
	friend TrailingObjects;			friend TrailingObjects;

	/// Define the sizes of each trailing object array except the last one. This			/// Define the sizes of each trailing object array except the last one. This
	/// is required for TrailingObjects to work properly.			/// is required for TrailingObjects to work properly.
	size_t numTrailingObjects(OverloadToken<Expr *>) const {			size_t numTrailingObjects(OverloadToken<Expr *>) const {
	// There are varlist_size() of expressions, and varlist_size() of			// There are varlist_size() of expressions, and varlist_size() of
	// user-defined mappers.			// user-defined mappers.
	return 2 * varlist_size();			return 2 * varlist_size();
	▲ Show 20 Lines • Show All 2,309 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGOpenMPRuntime.h

Show First 20 Lines • Show All 1,613 Lines • ▼ Show 20 Lines	public:
llvm::Value *SizesArray = nullptr;		llvm::Value *SizesArray = nullptr;
/// The array of map types passed to the runtime library.		/// The array of map types passed to the runtime library.
llvm::Value *MapTypesArray = nullptr;		llvm::Value *MapTypesArray = nullptr;
/// The total number of pointers passed to the runtime library.		/// The total number of pointers passed to the runtime library.
unsigned NumberOfPtrs = 0u;		unsigned NumberOfPtrs = 0u;
/// Map between the a declaration of a capture and the corresponding base		/// Map between the a declaration of a capture and the corresponding base
/// pointer address where the runtime returns the device pointers.		/// pointer address where the runtime returns the device pointers.
llvm::DenseMap<const ValueDecl *, Address> CaptureDeviceAddrMap;		llvm::DenseMap<const ValueDecl *, Address> CaptureDeviceAddrMap;
		/// The array of array of offsets passed to the runtime library.
		SmallVector<SmallVector<llvm::Value *, 4>, 4> Offsets;
		/// The array of array of counts passed to the runtime library.
		SmallVector<SmallVector<llvm::Value *, 4>, 4> Counts;
		/// The array of array of strides passed to the runtime library.
		SmallVector<SmallVector<llvm::Value *, 4>, 4> Strides;

explicit TargetDataInfo() {}		explicit TargetDataInfo() {}
explicit TargetDataInfo(bool RequiresDevicePointerInfo)		explicit TargetDataInfo(bool RequiresDevicePointerInfo)
: RequiresDevicePointerInfo(RequiresDevicePointerInfo) {}		: RequiresDevicePointerInfo(RequiresDevicePointerInfo) {}
/// Clear information about the data arrays.		/// Clear information about the data arrays.
void clearArrayInfo() {		void clearArrayInfo() {
BasePointersArray = nullptr;		BasePointersArray = nullptr;
PointersArray = nullptr;		PointersArray = nullptr;
▲ Show 20 Lines • Show All 816 Lines • Show Last 20 Lines

clang/lib/CodeGen/CGOpenMPRuntime.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 7,037 Lines • ▼ Show 20 Lines	enum OpenMPOffloadMappingFlags : uint64_t {
OMP_MAP_PRIVATE = 0x80,		OMP_MAP_PRIVATE = 0x80,
/// Pass the element to the device by value.		/// Pass the element to the device by value.
OMP_MAP_LITERAL = 0x100,		OMP_MAP_LITERAL = 0x100,
/// Implicit map		/// Implicit map
OMP_MAP_IMPLICIT = 0x200,		OMP_MAP_IMPLICIT = 0x200,
/// Close is a hint to the runtime to allocate memory close to		/// Close is a hint to the runtime to allocate memory close to
/// the target device.		/// the target device.
OMP_MAP_CLOSE = 0x400,		OMP_MAP_CLOSE = 0x400,
		/// Signal that the runtime library should use args as an array of
		/// descriptor_dim pointers and use args_size as dims. Used when we have
		/// non-contiguous list items in target update directive
		OMP_MAP_DESCRIPTOR = 0x100000000000,
		ABataevUnsubmitted Done Reply Inline Actions I'm not sure about the value of this flag. If I do recall it correctly, this value might be used for something different by XL compiler, for example. Maybe use some other value, maybe high bits? It is a kind of service flag, not data mapping attribute, so better to move it to high bits (the bit before OMP_MAP_MEMBER_OF maybe?). ABataev: I'm not sure about the value of this flag. If I do recall it correctly, this value might be…
		cchenAuthorUnsubmitted Done Reply Inline Actions Hi @ABataev, is there any place I can find which value has been used for lower bits (like 0x800, 0x1000)? cchen: Hi @ABataev, is there any place I can find which value has been used for lower bits (like 0x800…
		ABataevUnsubmitted Not Done Reply Inline Actions I rather doubt. You can try to ask @kkwli0 ABataev: I rather doubt. You can try to ask @kkwli0
		kkwli0Unsubmitted Not Done Reply Inline Actions We are using 0x800. I think your current choice should be fine. kkwli0: We are using 0x800. I think your current choice should be fine.
/// The 16 MSBs of the flags indicate whether the entry is member of some		/// The 16 MSBs of the flags indicate whether the entry is member of some
/// struct/class.		/// struct/class.
OMP_MAP_MEMBER_OF = 0xffff000000000000,		OMP_MAP_MEMBER_OF = 0xffff000000000000,
LLVM_MARK_AS_BITMASK_ENUM(/* LargestFlag = */ OMP_MAP_MEMBER_OF),		LLVM_MARK_AS_BITMASK_ENUM(/* LargestFlag = */ OMP_MAP_MEMBER_OF),
};		};

/// Get the offset of the OMP_MAP_MEMBER_OF field.		/// Get the offset of the OMP_MAP_MEMBER_OF field.
static unsigned getFlagMemberOffset() {		static unsigned getFlagMemberOffset() {
Show All 19 Lines	public:
llvm::Value operator() const { return Ptr; }		llvm::Value operator() const { return Ptr; }
const ValueDecl *getDevicePtrDecl() const { return DevPtrDecl; }		const ValueDecl *getDevicePtrDecl() const { return DevPtrDecl; }
void setDevicePtrDecl(const ValueDecl *D) { DevPtrDecl = D; }		void setDevicePtrDecl(const ValueDecl *D) { DevPtrDecl = D; }
};		};

using MapBaseValuesArrayTy = SmallVector<BasePointerInfo, 4>;		using MapBaseValuesArrayTy = SmallVector<BasePointerInfo, 4>;
using MapValuesArrayTy = SmallVector<llvm::Value *, 4>;		using MapValuesArrayTy = SmallVector<llvm::Value *, 4>;
using MapFlagsArrayTy = SmallVector<OpenMPOffloadMappingFlags, 4>;		using MapFlagsArrayTy = SmallVector<OpenMPOffloadMappingFlags, 4>;
		using MapDimArrayTy = SmallVector<uint64_t, 4>;
		using MapNonContiguousArrayTy = SmallVector<MapValuesArrayTy, 4>;

/// Map between a struct and the its lowest & highest elements which have been		/// Map between a struct and the its lowest & highest elements which have been
/// mapped.		/// mapped.
/// [ValueDecl *] --> {LE(FieldIndex, Pointer),		/// [ValueDecl *] --> {LE(FieldIndex, Pointer),
/// HE(FieldIndex, Pointer)}		/// HE(FieldIndex, Pointer)}
struct StructRangeInfoTy {		struct StructRangeInfoTy {
std::pair<unsigned /FieldIndex/, Address /Pointer/> LowestElem = {		std::pair<unsigned /FieldIndex/, Address /Pointer/> LowestElem = {
0, Address::invalid()};		0, Address::invalid()};
Show All 11 Lines	struct MapInfo {
bool ReturnDevicePointer = false;		bool ReturnDevicePointer = false;
bool IsImplicit = false;		bool IsImplicit = false;

MapInfo() = default;		MapInfo() = default;
MapInfo(		MapInfo(
OMPClauseMappableExprCommon::MappableExprComponentListRef Components,		OMPClauseMappableExprCommon::MappableExprComponentListRef Components,
OpenMPMapClauseKind MapType,		OpenMPMapClauseKind MapType,
ArrayRef<OpenMPMapModifierKind> MapModifiers,		ArrayRef<OpenMPMapModifierKind> MapModifiers,
bool ReturnDevicePointer, bool IsImplicit)		bool ReturnDevicePointer, bool IsImplicit)
		ABataevUnsubmitted Done Reply Inline Actions Restore original formatting ABataev: Restore original formatting
: Components(Components), MapType(MapType), MapModifiers(MapModifiers),		: Components(Components), MapType(MapType), MapModifiers(MapModifiers),
ReturnDevicePointer(ReturnDevicePointer), IsImplicit(IsImplicit) {}		ReturnDevicePointer(ReturnDevicePointer), IsImplicit(IsImplicit) {}
};		};

/// If use_device_ptr is used on a pointer which is a struct member and there		/// If use_device_ptr is used on a pointer which is a struct member and there
/// is no map information about it, then emission of that entry is deferred		/// is no map information about it, then emission of that entry is deferred
/// until the whole struct has been processed.		/// until the whole struct has been processed.
struct DeferredDevicePtrEntryTy {		struct DeferredDevicePtrEntryTy {
▲ Show 20 Lines • Show All 99 Lines • ▼ Show 20 Lines	llvm::Value getExprTypeSize(const Expr E) const {
}		}
return CGF.getTypeSize(ExprTy);		return CGF.getTypeSize(ExprTy);
}		}

/// Return the corresponding bits for a given map clause modifier. Add		/// Return the corresponding bits for a given map clause modifier. Add
/// a flag marking the map as a pointer if requested. Add a flag marking the		/// a flag marking the map as a pointer if requested. Add a flag marking the
/// map as the first one of a series of maps that relate to the same map		/// map as the first one of a series of maps that relate to the same map
/// expression.		/// expression.
OpenMPOffloadMappingFlags getMapTypeBits(		OpenMPOffloadMappingFlags
OpenMPMapClauseKind MapType, ArrayRef<OpenMPMapModifierKind> MapModifiers,		getMapTypeBits(OpenMPMapClauseKind MapType,
bool IsImplicit, bool AddPtrFlag, bool AddIsTargetParamFlag) const {		ArrayRef<OpenMPMapModifierKind> MapModifiers, bool IsImplicit,
		bool AddPtrFlag, bool AddIsTargetParamFlag,
		bool IsNonContiguous) const {
OpenMPOffloadMappingFlags Bits =		OpenMPOffloadMappingFlags Bits =
IsImplicit ? OMP_MAP_IMPLICIT : OMP_MAP_NONE;		IsImplicit ? OMP_MAP_IMPLICIT : OMP_MAP_NONE;
switch (MapType) {		switch (MapType) {
case OMPC_MAP_alloc:		case OMPC_MAP_alloc:
case OMPC_MAP_release:		case OMPC_MAP_release:
// alloc and release is the default behavior in the runtime library, i.e.		// alloc and release is the default behavior in the runtime library, i.e.
// if we don't pass any bits alloc/release that is what the runtime is		// if we don't pass any bits alloc/release that is what the runtime is
// going to do. Therefore, we don't need to signal anything for these two		// going to do. Therefore, we don't need to signal anything for these two
Show All 19 Lines	getMapTypeBits(OpenMPMapClauseKind MapType,
if (AddIsTargetParamFlag)		if (AddIsTargetParamFlag)
Bits \|= OMP_MAP_TARGET_PARAM;		Bits \|= OMP_MAP_TARGET_PARAM;
if (llvm::find(MapModifiers, OMPC_MAP_MODIFIER_always)		if (llvm::find(MapModifiers, OMPC_MAP_MODIFIER_always)
!= MapModifiers.end())		!= MapModifiers.end())
Bits \|= OMP_MAP_ALWAYS;		Bits \|= OMP_MAP_ALWAYS;
if (llvm::find(MapModifiers, OMPC_MAP_MODIFIER_close)		if (llvm::find(MapModifiers, OMPC_MAP_MODIFIER_close)
!= MapModifiers.end())		!= MapModifiers.end())
Bits \|= OMP_MAP_CLOSE;		Bits \|= OMP_MAP_CLOSE;
		if (IsNonContiguous)
		Bits \|= OMP_MAP_DESCRIPTOR;
return Bits;		return Bits;
}		}

/// Return true if the provided expression is a final array section. A		/// Return true if the provided expression is a final array section. A
/// final array section, is one whose length can't be proved to be one.		/// final array section, is one whose length can't be proved to be one.
bool isFinalArraySectionExpression(const Expr *E) const {		bool isFinalArraySectionExpression(const Expr *E) const {
const auto *OASE = dyn_cast<OMPArraySectionExpr>(E);		const auto *OASE = dyn_cast<OMPArraySectionExpr>(E);

Show All 31 Lines	bool isFinalArraySectionExpression(const Expr *E) const {
return ConstLength.getSExtValue() != 1;		return ConstLength.getSExtValue() != 1;
}		}

/// Generate the base pointers, section pointers, sizes and map type		/// Generate the base pointers, section pointers, sizes and map type
/// bits for the provided map type, map modifier, and expression components.		/// bits for the provided map type, map modifier, and expression components.
/// \a IsFirstComponent should be set to true if the provided set of		/// \a IsFirstComponent should be set to true if the provided set of
/// components is the first associated with a capture.		/// components is the first associated with a capture.
void generateInfoForComponentList(		void generateInfoForComponentList(
OpenMPMapClauseKind MapType,		OpenMPMapClauseKind MapType, ArrayRef<OpenMPMapModifierKind> MapModifiers,
ArrayRef<OpenMPMapModifierKind> MapModifiers,
OMPClauseMappableExprCommon::MappableExprComponentListRef Components,		OMPClauseMappableExprCommon::MappableExprComponentListRef Components,
MapBaseValuesArrayTy &BasePointers, MapValuesArrayTy &Pointers,		MapBaseValuesArrayTy &BasePointers, MapValuesArrayTy &Pointers,
MapValuesArrayTy &Sizes, MapFlagsArrayTy &Types,		MapValuesArrayTy &Sizes, MapFlagsArrayTy &Types, MapDimArrayTy &Dims,
StructRangeInfoTy &PartialStruct, bool IsFirstComponentList,		StructRangeInfoTy &PartialStruct, bool IsFirstComponentList,
bool IsImplicit,		bool IsImplicit,
ArrayRef<OMPClauseMappableExprCommon::MappableExprComponentListRef>		ArrayRef<OMPClauseMappableExprCommon::MappableExprComponentListRef>
OverlappedElements = llvm::None) const {		OverlappedElements = llvm::None,
		bool IsNonContiguous = false,
		ABataevUnsubmitted Done Reply Inline Actions Can we encapsulate `Dims` into `StructNonContiguousInfo`? ABataev: Can we encapsulate `Dims` into `StructNonContiguousInfo`?
		MapNonContiguousArrayTy *const Offsets = nullptr,
		MapNonContiguousArrayTy *const Counts = nullptr,
		MapNonContiguousArrayTy *const Strides = nullptr) const {
		ABataevUnsubmitted Done Reply Inline Actions I would prefer to pack these 4 params into a single parameter (a struct). Also, can we put `Dims` parameter into the list of the optional parameters? ABataev: I would prefer to pack these 4 params into a single parameter (a struct). Also, can we put…
// The following summarizes what has to be generated for each map and the		// The following summarizes what has to be generated for each map and the
// types below. The generated information is expressed in this order:		// types below. The generated information is expressed in this order:
// base pointer, section pointer, size, flags		// base pointer, section pointer, size, flags
// (to add to the ones that come from the map type and modifier).		// (to add to the ones that come from the map type and modifier).
//		//
// double d;		// double d;
// int i[100];		// int i[100];
// float *p;		// float *p;
▲ Show 20 Lines • Show All 230 Lines • ▼ Show 20 Lines	void generateInfoForComponentList(
// in the component list which is a member expression. Useful when we have a		// in the component list which is a member expression. Useful when we have a
// pointer or a final array section, in which case it is the previous		// pointer or a final array section, in which case it is the previous
// component in the list which tells us whether we have a member expression.		// component in the list which tells us whether we have a member expression.
// E.g. X.f[:]		// E.g. X.f[:]
// While processing the final array section "[:]" it is "f" which tells us		// While processing the final array section "[:]" it is "f" which tells us
// whether we are dealing with a member of a declared struct.		// whether we are dealing with a member of a declared struct.
const MemberExpr *EncounteredME = nullptr;		const MemberExpr *EncounteredME = nullptr;

		// Track for the total number of dimension.
		uint64_t DimSize = 0;

for (; I != CE; ++I) {		for (; I != CE; ++I) {
// If the current component is member of a struct (parent struct) mark it.		// If the current component is member of a struct (parent struct) mark it.
if (!EncounteredME) {		if (!EncounteredME) {
EncounteredME = dyn_cast<MemberExpr>(I->getAssociatedExpression());		EncounteredME = dyn_cast<MemberExpr>(I->getAssociatedExpression());
// If we encounter a PTR_AND_OBJ entry from now on it should be marked		// If we encounter a PTR_AND_OBJ entry from now on it should be marked
// as MEMBER_OF the parent struct.		// as MEMBER_OF the parent struct.
if (EncounteredME)		if (EncounteredME)
ShouldBeMemberOf = true;		ShouldBeMemberOf = true;
}		}

auto Next = std::next(I);		auto Next = std::next(I);

// We need to generate the addresses and sizes if this is the last		// We need to generate the addresses and sizes if this is the last
// component, if the component is a pointer or if it is an array section		// component, if the component is a pointer or if it is an array section
// whose length can't be proved to be one. If this is a pointer, it		// whose length can't be proved to be one. If this is a pointer, it
// becomes the base address for the following components.		// becomes the base address for the following components.

// A final array section, is one whose length can't be proved to be one.		// A final array section, is one whose length can't be proved to be one.
		// If the map item is non-contiguous then we don't treat any array section
		// as final array section.
bool IsFinalArraySection =		bool IsFinalArraySection =
		!IsNonContiguous &&
isFinalArraySectionExpression(I->getAssociatedExpression());		isFinalArraySectionExpression(I->getAssociatedExpression());

		ABataevUnsubmitted Done Reply Inline Actions Better to convert it to `!IsNonContiguous && isFinalArraySectionExpression(I->getAssociatedExpression())`. ABataev: Better to convert it to `!IsNonContiguous && isFinalArraySectionExpression(I…
// Get information on whether the element is a pointer. Have to do a		// Get information on whether the element is a pointer. Have to do a
// special treatment for array sections given that they are built-in		// special treatment for array sections given that they are built-in
// types.		// types.
const auto *OASE =		const auto *OASE =
dyn_cast<OMPArraySectionExpr>(I->getAssociatedExpression());		dyn_cast<OMPArraySectionExpr>(I->getAssociatedExpression());
const auto *OAShE =		const auto *OAShE =
dyn_cast<OMPArrayShapingExpr>(I->getAssociatedExpression());		dyn_cast<OMPArrayShapingExpr>(I->getAssociatedExpression());
const auto *UO = dyn_cast<UnaryOperator>(I->getAssociatedExpression());		const auto *UO = dyn_cast<UnaryOperator>(I->getAssociatedExpression());
const auto *BO = dyn_cast<BinaryOperator>(I->getAssociatedExpression());		const auto *BO = dyn_cast<BinaryOperator>(I->getAssociatedExpression());
bool IsPointer =		bool IsPointer =
OAShE \|\|		OAShE \|\|
(OASE && OMPArraySectionExpr::getBaseOriginalType(OASE)		(OASE && OMPArraySectionExpr::getBaseOriginalType(OASE)
.getCanonicalType()		.getCanonicalType()
->isAnyPointerType()) \|\|		->isAnyPointerType()) \|\|
I->getAssociatedExpression()->getType()->isAnyPointerType();		I->getAssociatedExpression()->getType()->isAnyPointerType();
bool IsNonDerefPointer = IsPointer && !UO && !BO;		bool IsNonDerefPointer = IsPointer && !UO && !BO && !IsNonContiguous;

		if (OASE)
		++DimSize;
		ABataevUnsubmitted Done Reply Inline Actions Use prefix form `++DimSize`. ABataev: Use prefix form `++DimSize`.

if (Next == CE \|\| IsNonDerefPointer \|\| IsFinalArraySection) {		if (Next == CE \|\| IsNonDerefPointer \|\| IsFinalArraySection) {
// If this is not the last component, we expect the pointer to be		// If this is not the last component, we expect the pointer to be
// associated with an array expression or member expression.		// associated with an array expression or member expression.
assert((Next == CE \|\|		assert((Next == CE \|\|
		ABataevUnsubmitted Done Reply Inline Actions Do you really need to count `DimSize` for array shaping operators and array subscript expressions? I don't see tests for it. ABataev: Do you really need to count `DimSize` for array shaping operators and array subscript…
		cchenAuthorUnsubmitted Done Reply Inline Actions You're right, I don't need to count `DimSize` for array shaping and array subscript. cchen: You're right, I don't need to count `DimSize` for array shaping and array subscript.
isa<MemberExpr>(Next->getAssociatedExpression()) \|\|		isa<MemberExpr>(Next->getAssociatedExpression()) \|\|
isa<ArraySubscriptExpr>(Next->getAssociatedExpression()) \|\|		isa<ArraySubscriptExpr>(Next->getAssociatedExpression()) \|\|
isa<OMPArraySectionExpr>(Next->getAssociatedExpression()) \|\|		isa<OMPArraySectionExpr>(Next->getAssociatedExpression()) \|\|
isa<UnaryOperator>(Next->getAssociatedExpression()) \|\|		isa<UnaryOperator>(Next->getAssociatedExpression()) \|\|
isa<BinaryOperator>(Next->getAssociatedExpression())) &&		isa<BinaryOperator>(Next->getAssociatedExpression())) &&
"Unexpected expression");		"Unexpected expression");

Address LB = Address::invalid();		Address LB = Address::invalid();
Show All 34 Lines	for (; I != CE; ++I) {
PartialStruct.HighestElem.first)>::max(),		PartialStruct.HighestElem.first)>::max(),
HB};		HB};
PartialStruct.Base = BP;		PartialStruct.Base = BP;
// Emit data for non-overlapped data.		// Emit data for non-overlapped data.
OpenMPOffloadMappingFlags Flags =		OpenMPOffloadMappingFlags Flags =
OMP_MAP_MEMBER_OF \|		OMP_MAP_MEMBER_OF \|
getMapTypeBits(MapType, MapModifiers, IsImplicit,		getMapTypeBits(MapType, MapModifiers, IsImplicit,
/AddPtrFlag=/false,		/AddPtrFlag=/false,
/AddIsTargetParamFlag=/false);		/AddIsTargetParamFlag=/false, IsNonContiguous);
LB = BP;		LB = BP;
		ABataevUnsubmitted Done Reply Inline Actions No need for parameter name comment here, it is required only if the `true\|false` constants are used ABataev: No need for parameter name comment here, it is required only if the `true\|false` constants are…
llvm::Value *Size = nullptr;		llvm::Value *Size = nullptr;
// Do bitcopy of all non-overlapped structure elements.		// Do bitcopy of all non-overlapped structure elements.
for (OMPClauseMappableExprCommon::MappableExprComponentListRef		for (OMPClauseMappableExprCommon::MappableExprComponentListRef
Component : OverlappedElements) {		Component : OverlappedElements) {
Address ComponentLB = Address::invalid();		Address ComponentLB = Address::invalid();
for (const OMPClauseMappableExprCommon::MappableComponent &MC :		for (const OMPClauseMappableExprCommon::MappableComponent &MC :
Component) {		Component) {
if (MC.getAssociatedDeclaration()) {		if (MC.getAssociatedDeclaration()) {
ComponentLB =		ComponentLB =
CGF.EmitOMPSharedLValue(MC.getAssociatedExpression())		CGF.EmitOMPSharedLValue(MC.getAssociatedExpression())
.getAddress(CGF);		.getAddress(CGF);
Size = CGF.Builder.CreatePtrDiff(		Size = CGF.Builder.CreatePtrDiff(
CGF.EmitCastToVoidPtr(ComponentLB.getPointer()),		CGF.EmitCastToVoidPtr(ComponentLB.getPointer()),
CGF.EmitCastToVoidPtr(LB.getPointer()));		CGF.EmitCastToVoidPtr(LB.getPointer()));
break;		break;
}		}
}		}
BasePointers.push_back(BP.getPointer());		BasePointers.push_back(BP.getPointer());
Pointers.push_back(LB.getPointer());		Pointers.push_back(LB.getPointer());
Sizes.push_back(CGF.Builder.CreateIntCast(Size, CGF.Int64Ty,		Sizes.push_back(CGF.Builder.CreateIntCast(Size, CGF.Int64Ty,
/isSigned=/true));		/isSigned=/true));
Types.push_back(Flags);		Types.push_back(Flags);
		Dims.push_back(IsNonContiguous ? DimSize : 0);
LB = CGF.Builder.CreateConstGEP(ComponentLB, 1);		LB = CGF.Builder.CreateConstGEP(ComponentLB, 1);
}		}
BasePointers.push_back(BP.getPointer());		BasePointers.push_back(BP.getPointer());
Pointers.push_back(LB.getPointer());		Pointers.push_back(LB.getPointer());
Size = CGF.Builder.CreatePtrDiff(		Size = CGF.Builder.CreatePtrDiff(
CGF.EmitCastToVoidPtr(		CGF.EmitCastToVoidPtr(
CGF.Builder.CreateConstGEP(HB, 1).getPointer()),		CGF.Builder.CreateConstGEP(HB, 1).getPointer()),
CGF.EmitCastToVoidPtr(LB.getPointer()));		CGF.EmitCastToVoidPtr(LB.getPointer()));
Sizes.push_back(		Sizes.push_back(
CGF.Builder.CreateIntCast(Size, CGF.Int64Ty, /isSigned=/true));		CGF.Builder.CreateIntCast(Size, CGF.Int64Ty, /isSigned=/true));
Types.push_back(Flags);		Types.push_back(Flags);
		Dims.push_back(IsNonContiguous ? DimSize : 0);
break;		break;
}		}
llvm::Value *Size = getExprTypeSize(I->getAssociatedExpression());		llvm::Value *Size = getExprTypeSize(I->getAssociatedExpression());
if (!IsMemberPointer) {		if (!IsMemberPointer) {
BasePointers.push_back(BP.getPointer());		BasePointers.push_back(BP.getPointer());
Pointers.push_back(LB.getPointer());		Pointers.push_back(LB.getPointer());
Sizes.push_back(		Sizes.push_back(
CGF.Builder.CreateIntCast(Size, CGF.Int64Ty, /isSigned=/true));		CGF.Builder.CreateIntCast(Size, CGF.Int64Ty, /isSigned=/true));
		Dims.push_back(IsNonContiguous ? DimSize : 0);

// We need to add a pointer flag for each map that comes from the		// We need to add a pointer flag for each map that comes from the
// same expression except for the first one. We also need to signal		// same expression except for the first one. We also need to signal
// this map is the first one that relates with the current capture		// this map is the first one that relates with the current capture
// (there is a set of entries for each capture).		// (there is a set of entries for each capture).
OpenMPOffloadMappingFlags Flags = getMapTypeBits(		OpenMPOffloadMappingFlags Flags = getMapTypeBits(
MapType, MapModifiers, IsImplicit,		MapType, MapModifiers, IsImplicit,
!IsExpressionFirstInfo \|\| RequiresReference,		!IsExpressionFirstInfo \|\| RequiresReference,
IsCaptureFirstInfo && !RequiresReference);		IsCaptureFirstInfo && !RequiresReference, IsNonContiguous);

if (!IsExpressionFirstInfo) {		if (!IsExpressionFirstInfo) {
// If we have a PTR_AND_OBJ pair where the OBJ is a pointer as well,		// If we have a PTR_AND_OBJ pair where the OBJ is a pointer as well,
// then we reset the TO/FROM/ALWAYS/DELETE/CLOSE flags.		// then we reset the TO/FROM/ALWAYS/DELETE/CLOSE flags.
		ABataevUnsubmitted Done Reply Inline Actions Same, comment not required ABataev: Same, comment not required
if (IsPointer)		if (IsPointer)
Flags &= ~(OMP_MAP_TO \| OMP_MAP_FROM \| OMP_MAP_ALWAYS \|		Flags &= ~(OMP_MAP_TO \| OMP_MAP_FROM \| OMP_MAP_ALWAYS \|
OMP_MAP_DELETE \| OMP_MAP_CLOSE);		OMP_MAP_DELETE \| OMP_MAP_CLOSE);

if (ShouldBeMemberOf) {		if (ShouldBeMemberOf) {
// Set placeholder value MEMBER_OF=FFFF to indicate that the flag		// Set placeholder value MEMBER_OF=FFFF to indicate that the flag
// should be later updated with the correct value of MEMBER_OF.		// should be later updated with the correct value of MEMBER_OF.
Flags \|= OMP_MAP_MEMBER_OF;		Flags \|= OMP_MAP_MEMBER_OF;
Show All 39 Lines	for (; I != CE; ++I) {
// The pointer becomes the base for the next element.		// The pointer becomes the base for the next element.
if (Next != CE)		if (Next != CE)
BP = LB;		BP = LB;

IsExpressionFirstInfo = false;		IsExpressionFirstInfo = false;
IsCaptureFirstInfo = false;		IsCaptureFirstInfo = false;
}		}
}		}

		if (IsNonContiguous) {
		const ASTContext &Context = CGF.getContext();

		MapValuesArrayTy CurOffsets;
		MapValuesArrayTy CurCounts;
		MapValuesArrayTy CurStrides;
		llvm::Value *CurStride = nullptr;
		SmallVector<llvm::Value *, 4> DimSizes;

		// Collect Size information for each dimension and get the element size as
		// the first Stride. For example, for `int arr[10][10]`, the DimSizes
		// should be [10, 10] and the first stride is 4 btyes.
		for (const auto &Component : Components) {
		ABataevUnsubmitted Done Reply Inline Actions Expand `auto` here to a real type ABataev: Expand `auto` here to a real type
		const Expr *AssocExpr = Component.getAssociatedExpression();
		const auto *OASE = dyn_cast<OMPArraySectionExpr>(AssocExpr);
		ABataevUnsubmitted Done Reply Inline Actions Use range-based loop, if possible. ABataev: Use range-based loop, if possible.
		ABataevUnsubmitted Done Reply Inline Actions Can we have anything else except for array section here? If not, use just `cast`. If yes, use `continue` to simplify complexity: if (!OASE) continue; ... ABataev: Can we have anything else except for array section here? If not, use just `cast`. If yes, use…
		if (OASE) {
		QualType Ty;
		Ty = OMPArraySectionExpr::getBaseOriginalType(OASE->getBase());
		auto *CAT = Context.getAsConstantArrayType(Ty);
		ABataevUnsubmitted Done Reply Inline Actions Do you really need to analyze array subscript expressions here? I though that we should analyze only array sections, no? ABataev: Do you really need to analyze array subscript expressions here? I though that we should analyze…
		auto *VAT = Context.getAsVariableArrayType(Ty);
		// Get element size if CurStrides is empty.
		if (CurStrides.empty()) {
		const Type *ElementType = nullptr;
		uint64_t ElementTypeSize;
		if (CAT) {
		ElementType = CAT->getElementType().getTypePtr();
		ElementTypeSize =
		Context.getTypeSizeInChars(ElementType).getQuantity();
		} else if (VAT) {
		ABataevUnsubmitted Done Reply Inline Actions What if the base is a pointer, not an array? ABataev: What if the base is a pointer, not an array?
		cchenAuthorUnsubmitted Done Reply Inline Actions The `if (ElementType)` condition only push back stride when base is not pointer. I'm now allowing one dimension size to be unknown (pointer as base) and sema has analysis to check if more than one indirection as base. My last codegen test case is for testing pointer as base. cchen: The `if (ElementType)` condition only push back stride when base is not pointer. I'm now…
		ElementType = VAT->getElementType().getTypePtr();
		ElementTypeSize =
		Context.getTypeSizeInChars(ElementType).getQuantity();
		}
		if (ElementType)
		CurStrides.push_back(
		llvm::ConstantInt::get(CGF.Int64Ty, ElementTypeSize));
		}
		// Get dimension value.
		llvm::Value *SizeV = nullptr;
		if (CAT) {
		ABataevUnsubmitted Done Reply Inline Actions If only `CAT` or `VAT` is allowed, then transform this if into: else { assert(VAT&& ...); ABataev: If only `CAT` or `VAT` is allowed, then transform this if into: ``` else { assert(VAT&& ...)…
		llvm::APInt Size = CAT->getSize();
		SizeV = llvm::ConstantInt::get(CGF.SizeTy, Size);
		ABataevUnsubmitted Done Reply Inline Actions Create directly as of `CGF.Int64Ty` type. ABataev: Create directly as of `CGF.Int64Ty` type.
		cchenAuthorUnsubmitted Done Reply Inline Actions Doing this I'll get assertion error in this exact line if on a 32-bits target. cchen: Doing this I'll get assertion error in this exact line if on a 32-bits target.
		ABataevUnsubmitted Done Reply Inline Actions Hmm, why, can you investigate? ABataev: Hmm, why, can you investigate?
		cchenAuthorUnsubmitted Done Reply Inline Actions My comment was not accurate, I've updated it. What I want to convey is that we can only have `CAT, VAT, or pointer` here, since analysis in Sema has a restriction for it. (SemaOpenMP line 16623) cchen: My comment was not accurate, I've updated it. What I want to convey is that we can only have…
		ABataevUnsubmitted Done Reply Inline Actions It does not relate to the comments thread but I got it. Anyway, try to investigate why the compiler crashes if you try to cr4eate a constant ща ]СПАюШте64Ен] directly. ABataev: It does not relate to the comments thread but I got it. Anyway, try to investigate why the…
		cchenAuthorUnsubmitted Done Reply Inline Actions I'll investigate it, thanks. cchen: I'll investigate it, thanks.
		} else if (VAT) {
		const Expr *Size = VAT->getSizeExpr();
		SizeV = CGF.EmitScalarExpr(Size);
		}
		ABataevUnsubmitted Done Reply Inline Actions The code for `SizeV` must be under the control of the next `if`: if (DimSizes.size() < Components.size() - 1) { .... } ABataev: The code for `SizeV` must be under the control of the next `if`: ``` if (DimSizes.size() <…
		cchenAuthorUnsubmitted Done Reply Inline Actions I don't think I understand this one. Why do you remove SizeV in the if condition? cchen: I don't think I understand this one. Why do you remove SizeV in the if condition?
		ABataevUnsubmitted Done Reply Inline Actions This is for `SizeV`. You don't use it if `DimSizes.size() < Components.size() - 1` is `false`, looks like memory leak. ABataev: This is for `SizeV`. You don't use it if `DimSizes.size() < Components.size() - 1` is `false`…
		// We need all the dimension size except for the last dimension.
		assert((VAT \|\| CAT \|\| &Component == &*Components.begin()) &&
		"Should be either ConstantArray or VariableArray if not the "
		"first Component");
		if (SizeV && DimSizes.size() < Components.size() - 1)
		DimSizes.push_back(CGF.Builder.CreateIntCast(SizeV, CGF.Int64Ty,
		ABataevUnsubmitted Done Reply Inline Actions No need for braces here ABataev: No need for braces here
		/IsSigned=/false));
		}
		}

		// We need dimension size to compute stride
		auto DI = DimSizes.begin();

		// Collect info for non-contiguous. Notice that offset, count, and stride
		// are only meaningful for array-section, so we insert a null for anything
		// other than array-section.
		// Also, the size of offset, count, and stride are not the same as
		// pointers, base_pointers, sizes, or dims. Instead, the size of offset,
		// count, and stride are the same as the number of non-contiguous
		// declaration in target update to/from clause.
		for (const auto &Component : Components) {
		ABataevUnsubmitted Done Reply Inline Actions Expand `auto` here to a real type ABataev: Expand `auto` here to a real type
		const Expr *AssocExpr = Component.getAssociatedExpression();
		const auto *OASE = dyn_cast<OMPArraySectionExpr>(AssocExpr);

		if (OASE) {
		// Offset
		ABataevUnsubmitted Done Reply Inline Actions Can we have anything else except for array section here? If not, use just `cast`. If yes, use `continue` to simplify complexity: if (!OASE) continue; ... ABataev: Can we have anything else except for array section here? If not, use just `cast`. If yes, use…
		const Expr *OffsetExpr = nullptr;
		OffsetExpr = OASE->getLowerBound();
		llvm::Value *Offset = nullptr;
		if (!OffsetExpr) {
		ABataevUnsubmitted Done Reply Inline Actions Same, try to use range-based loop, if possible. ABataev: Same, try to use range-based loop, if possible.
		// If offset is absent, then we just set it to zero.
		Offset = llvm::ConstantInt::get(CGF.Int64Ty, 0);
		} else {
		Offset = CGF.Builder.CreateIntCast(CGF.EmitScalarExpr(OffsetExpr),
		ABataevUnsubmitted Done Reply Inline Actions Can we have anything else except for array section here? If not, use just cast. If yes, use continue to simplify complexity: if (!OASE) continue; ... ABataev: Can we have anything else except for array section here? If not, use just cast. If yes, use…
		cchenAuthorUnsubmitted Done Reply Inline Actions Not sure about this one, I've added: if (!OASE) continue; ... cchen: Not sure about this one, I've added: ``` if (!OASE) continue; ...
		CGF.Int64Ty,
		ABataevUnsubmitted Done Reply Inline Actions Same question about array subscript expressions. ABataev: Same question about array subscript expressions.
		ABataevUnsubmitted Done Reply Inline Actions Do you really to pass real offsets here? Can we use pointers instead? ABataev: Do you really to pass real offsets here? Can we use pointers instead?
		cchenAuthorUnsubmitted Done Reply Inline Actions Do you mean I should set the type of Offset to Expr? cchen:* Do you mean I should set the type of Offset to Expr*?
		ABataevUnsubmitted Done Reply Inline Actions Currently, you're passing offsets to the runtime. Can we pass pointers instead? I mean, for `a[b]` you pass `b` to the runtime, can we pass `&a[b]` instead? ABataev: Currently, you're passing offsets to the runtime. Can we pass pointers instead? I mean, for `a…
		cchenAuthorUnsubmitted Done Reply Inline Actions Yes, I'm fine either passing index or passing address, though I'm curious why you're recommending passing address. cchen: Yes, I'm fine either passing index or passing address, though I'm curious why you're…
		ABataevUnsubmitted Done Reply Inline Actions It is going to simplify the codegen. Currently, to get the offset, you need to dig through all the elements of the array section. If, instead, you use the pointers, you would not need to do this and you can rely on something like `CGF.EmitArraySectionLValue()`. At least, I hope so. ABataev: It is going to simplify the codegen. Currently, to get the offset, you need to dig through all…
		cchenAuthorUnsubmitted Done Reply Inline Actions After discussed with my colleagues, I think passing relative offset makes more sense. For a 1-dim array, storing the offset as a pointer could work, but it seems strange to me to store as a pointer when there are 2+ dimensions with multiple disjoint chunks of memory because the pointer can only point to the offset for the first chunk. That is, a pointer would refer to an absolute location in a single chunk, whereas the offset is relative to the start of any chunk. For example: int a[4][4]; #pragma omp target update to(a[1:2][1:2]) This is two disjoint chunks of memory: XXXX XOOX XOOX XXXX The offset for the outer dimension could be store as a pointer, since there is only one instance of that dimension: Dim1: Offset=&a[1] But, the inner dimension is "instantiated" twice, once for each element in the outer dimension. So, there are really two absolute pointers, depending on which instance (element in the outer dimension) you're talking about: Dim2: Offset=&a[1][1] Dim2: Offset=&a[2][1] We could set the policy that the absolute offset would always be expressed as the offset in the first instance, but then wouldn't we need to refer to that location when computing the offset for all of the other instances? That seems unintuitive to me, and potentially complicates the implementation. The relative offset makes a lot more senes to me - for a starting point, what relative offset is needed for each dimension. The starting point for the outermost dimension does require the base address, but all inner dimensions have a variable starting pointer based on which element in the outer dimensions you're currently looking at. cchen: After discussed with my colleagues, I think passing relative offset makes more sense. For a 1…
		/isSigned=/false);
		}
		CurOffsets.push_back(Offset);

		// Count
		const Expr *CountExpr = nullptr;
		if (OASE)
		CountExpr = OASE->getLength();
		llvm::Value *Count = nullptr;
		if (!CountExpr) {
		// If length is absent then we calculate it as (Total length -
		// lower_bound)
		Count = CGF.Builder.CreateNUWSub(*DI, Offset);
		} else {
		Count = CGF.EmitScalarExpr(CountExpr);
		}
		Count =
		CGF.Builder.CreateIntCast(Count, CGF.Int64Ty, /isSigned=/false);
		CurCounts.push_back(Count);

		// Stride = previous stride * previous dimension size
		// Take `int arr[5][10]` and `arr[0:2][0:2]` as an example:
		// Dimension 1 Dimension 0
		// Offset 0 0
		// Count 2 2
		// Stride 40 bytes (4x10) 4 bytes (int)
		if (DI != DimSizes.end()) {
		CurStride = CGF.Builder.CreateNUWMul(CurStrides.back(), *DI++);
		CurStrides.push_back(CurStride);
		}
		ABataevUnsubmitted Done Reply Inline Actions The check is not required, you already checked that the expression must be array section only. ABataev: The check is not required, you already checked that the expression must be array section only.
		}
		}

		Offsets->push_back(CurOffsets);
		Counts->push_back(CurCounts);
		Strides->push_back(CurStrides);
		}
}		}
		ABataevUnsubmitted Done Reply Inline Actions Avoid expressions with some side effects, like `DI++` ABataev:* Avoid expressions with some side effects, like `*DI++`

/// Return the adjusted map modifiers if the declaration a capture refers to		/// Return the adjusted map modifiers if the declaration a capture refers to
/// appears in a first-private clause. This is expected to be used only with		/// appears in a first-private clause. This is expected to be used only with
/// directives that start with 'target'.		/// directives that start with 'target'.
MappableExprsHandler::OpenMPOffloadMappingFlags		MappableExprsHandler::OpenMPOffloadMappingFlags
getMapModifiersForPrivateClauses(const CapturedStmt::Capture &Cap) const {		getMapModifiersForPrivateClauses(const CapturedStmt::Capture &Cap) const {
assert(Cap.capturesVariable() && "Expected capture by reference only!");		assert(Cap.capturesVariable() && "Expected capture by reference only!");

// A first private variable captured by reference will use only the		// A first private variable captured by reference will use only the
		ABataevUnsubmitted Done Reply Inline Actions Use preincrement ABataev: Use preincrement
// 'private ptr' and 'map to' flag. Return the right flags if the captured		// 'private ptr' and 'map to' flag. Return the right flags if the captured
// declaration is known as first-private in this handler.		// declaration is known as first-private in this handler.
if (FirstPrivateDecls.count(Cap.getCapturedVar())) {		if (FirstPrivateDecls.count(Cap.getCapturedVar())) {
if (Cap.getCapturedVar()->getType().isConstant(CGF.getContext()) &&		if (Cap.getCapturedVar()->getType().isConstant(CGF.getContext()) &&
Cap.getCaptureKind() == CapturedStmt::VCK_ByRef)		Cap.getCaptureKind() == CapturedStmt::VCK_ByRef)
return MappableExprsHandler::OMP_MAP_ALWAYS \|		return MappableExprsHandler::OMP_MAP_ALWAYS \|
MappableExprsHandler::OMP_MAP_TO;		MappableExprsHandler::OMP_MAP_TO;
		ABataevUnsubmitted Done Reply Inline Actions Can we merge the functionality in this new function with the existing ones somehow? It is not the best idea to duplicate functionality using copy-paste if any. ABataev: Can we merge the functionality in this new function with the existing ones somehow? It is not…
if (Cap.getCapturedVar()->getType()->isAnyPointerType())		if (Cap.getCapturedVar()->getType()->isAnyPointerType())
return MappableExprsHandler::OMP_MAP_TO \|		return MappableExprsHandler::OMP_MAP_TO \|
MappableExprsHandler::OMP_MAP_PTR_AND_OBJ;		MappableExprsHandler::OMP_MAP_PTR_AND_OBJ;
return MappableExprsHandler::OMP_MAP_PRIVATE \|		return MappableExprsHandler::OMP_MAP_PRIVATE \|
MappableExprsHandler::OMP_MAP_TO;		MappableExprsHandler::OMP_MAP_TO;
}		}
return MappableExprsHandler::OMP_MAP_TO \|		return MappableExprsHandler::OMP_MAP_TO \|
MappableExprsHandler::OMP_MAP_FROM;		MappableExprsHandler::OMP_MAP_FROM;
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	public:
}		}

/// Generate all the base pointers, section pointers, sizes and map		/// Generate all the base pointers, section pointers, sizes and map
/// types for the extracted mappable expressions. Also, for each item that		/// types for the extracted mappable expressions. Also, for each item that
/// relates with a device pointer, a pair of the relevant declaration and		/// relates with a device pointer, a pair of the relevant declaration and
/// index where it occurs is appended to the device pointers info array.		/// index where it occurs is appended to the device pointers info array.
void generateAllInfo(MapBaseValuesArrayTy &BasePointers,		void generateAllInfo(MapBaseValuesArrayTy &BasePointers,
MapValuesArrayTy &Pointers, MapValuesArrayTy &Sizes,		MapValuesArrayTy &Pointers, MapValuesArrayTy &Sizes,
MapFlagsArrayTy &Types) const {		MapFlagsArrayTy &Types, MapDimArrayTy &Dims,
		MapNonContiguousArrayTy &Offsets,
		MapNonContiguousArrayTy &Counts,
		MapNonContiguousArrayTy &Strides) const {
// We have to process the component lists that relate with the same		// We have to process the component lists that relate with the same
// declaration in a single chunk so that we can generate the map flags		// declaration in a single chunk so that we can generate the map flags
// correctly. Therefore, we organize all lists in a map.		// correctly. Therefore, we organize all lists in a map.
llvm::MapVector<const ValueDecl *, SmallVector<MapInfo, 8>> Info;		llvm::MapVector<const ValueDecl *, SmallVector<MapInfo, 8>> Info;

// Helper function to fill the information map for the different supported		// Helper function to fill the information map for the different supported
// clauses.		// clauses.
auto &&InfoGen = [&Info](		auto &&InfoGen = [&Info](
▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	for (const auto &M : Info) {
// associated with a capture, because the mapping flags depend on it.		// associated with a capture, because the mapping flags depend on it.
bool IsFirstComponentList = true;		bool IsFirstComponentList = true;

// Temporary versions of arrays		// Temporary versions of arrays
MapBaseValuesArrayTy CurBasePointers;		MapBaseValuesArrayTy CurBasePointers;
MapValuesArrayTy CurPointers;		MapValuesArrayTy CurPointers;
MapValuesArrayTy CurSizes;		MapValuesArrayTy CurSizes;
MapFlagsArrayTy CurTypes;		MapFlagsArrayTy CurTypes;
		MapDimArrayTy CurDims;
		MapNonContiguousArrayTy CurOffsets;
		MapNonContiguousArrayTy CurCounts;
		MapNonContiguousArrayTy CurStrides;
StructRangeInfoTy PartialStruct;		StructRangeInfoTy PartialStruct;

for (const MapInfo &L : M.second) {		for (const MapInfo &L : M.second) {
assert(!L.Components.empty() &&		assert(!L.Components.empty() &&
"Not expecting declaration with no component lists.");		"Not expecting declaration with no component lists.");

// Remember the current base pointer index.		// Remember the current base pointer index.
unsigned CurrentBasePointersIdx = CurBasePointers.size();		unsigned CurrentBasePointersIdx = CurBasePointers.size();
generateInfoForComponentList(L.MapType, L.MapModifiers, L.Components,		generateInfoForComponentList(L.MapType, L.MapModifiers, L.Components,
CurBasePointers, CurPointers, CurSizes,		CurBasePointers, CurPointers, CurSizes,
CurTypes, PartialStruct,		CurTypes, CurDims, PartialStruct,
IsFirstComponentList, L.IsImplicit);		IsFirstComponentList, L.IsImplicit,
		/OverlappedElements=/llvm::None,
		L.Components.back().isNonContiguous(),
		&CurOffsets, &CurCounts, &CurStrides);

		ABataevUnsubmitted Done Reply Inline Actions Just `generateInfoForComponentList( L.MapType, L.MapModifiers, L.Components, CurBasePointers, CurPointers, CurSizes, CurTypes, CurDims, PartialStruct, IsFirstComponentList, L.IsImplicit, /OverlappedElements=/llvm::None, L.Components.back().isNonContiguous(), &CurOffsets, &CurCounts, &CurStrides);` ABataev: Just `generateInfoForComponentList( L.MapType, L.MapModifiers, L.Components…
// If this entry relates with a device pointer, set the relevant		// If this entry relates with a device pointer, set the relevant
		ABataevUnsubmitted Done Reply Inline Actions No need to add `/OverlappedElements=/llvm::None` here, it is default value. ABataev: No need to add `/OverlappedElements=/llvm::None` here, it is default value.
// declaration and add the 'return pointer' flag.		// declaration and add the 'return pointer' flag.
if (L.ReturnDevicePointer) {		if (L.ReturnDevicePointer) {
assert(CurBasePointers.size() > CurrentBasePointersIdx &&		assert(CurBasePointers.size() > CurrentBasePointersIdx &&
"Unexpected number of mapped base pointers.");		"Unexpected number of mapped base pointers.");

const ValueDecl *RelevantVD =		const ValueDecl *RelevantVD =
L.Components.back().getAssociatedDeclaration();		L.Components.back().getAssociatedDeclaration();
assert(RelevantVD &&		assert(RelevantVD &&
Show All 30 Lines	for (const auto &M : Info) {
emitCombinedEntry(BasePointers, Pointers, Sizes, Types, CurTypes,		emitCombinedEntry(BasePointers, Pointers, Sizes, Types, CurTypes,
PartialStruct);		PartialStruct);

// We need to append the results of this capture to what we already have.		// We need to append the results of this capture to what we already have.
BasePointers.append(CurBasePointers.begin(), CurBasePointers.end());		BasePointers.append(CurBasePointers.begin(), CurBasePointers.end());
Pointers.append(CurPointers.begin(), CurPointers.end());		Pointers.append(CurPointers.begin(), CurPointers.end());
Sizes.append(CurSizes.begin(), CurSizes.end());		Sizes.append(CurSizes.begin(), CurSizes.end());
Types.append(CurTypes.begin(), CurTypes.end());		Types.append(CurTypes.begin(), CurTypes.end());
		Dims.append(CurDims.begin(), CurDims.end());
		Offsets.append(CurOffsets.begin(), CurOffsets.end());
		Counts.append(CurCounts.begin(), CurCounts.end());
		Strides.append(CurStrides.begin(), CurStrides.end());
}		}
}		}

/// Generate all the base pointers, section pointers, sizes and map types for		/// Generate all the base pointers, section pointers, sizes and map types for
/// the extracted map clauses of user-defined mapper.		/// the extracted map clauses of user-defined mapper.
void generateAllInfoForMapper(MapBaseValuesArrayTy &BasePointers,		void generateAllInfoForMapper(MapBaseValuesArrayTy &BasePointers,
MapValuesArrayTy &Pointers,		MapValuesArrayTy &Pointers,
MapValuesArrayTy &Sizes,		MapValuesArrayTy &Sizes,
Show All 33 Lines	for (const auto &M : Info) {
// associated with a capture, because the mapping flags depend on it.		// associated with a capture, because the mapping flags depend on it.
bool IsFirstComponentList = true;		bool IsFirstComponentList = true;

// Temporary versions of arrays		// Temporary versions of arrays
MapBaseValuesArrayTy CurBasePointers;		MapBaseValuesArrayTy CurBasePointers;
MapValuesArrayTy CurPointers;		MapValuesArrayTy CurPointers;
MapValuesArrayTy CurSizes;		MapValuesArrayTy CurSizes;
MapFlagsArrayTy CurTypes;		MapFlagsArrayTy CurTypes;
		MapDimArrayTy CurDims;
		MapNonContiguousArrayTy CurOffsets;
		MapNonContiguousArrayTy CurCounts;
		MapNonContiguousArrayTy CurStrides;
StructRangeInfoTy PartialStruct;		StructRangeInfoTy PartialStruct;

for (const MapInfo &L : M.second) {		for (const MapInfo &L : M.second) {
assert(!L.Components.empty() &&		assert(!L.Components.empty() &&
"Not expecting declaration with no component lists.");		"Not expecting declaration with no component lists.");
generateInfoForComponentList(L.MapType, L.MapModifiers, L.Components,		generateInfoForComponentList(L.MapType, L.MapModifiers, L.Components,
CurBasePointers, CurPointers, CurSizes,		CurBasePointers, CurPointers, CurSizes,
CurTypes, PartialStruct,		CurTypes, CurDims, PartialStruct,
IsFirstComponentList, L.IsImplicit);		IsFirstComponentList, L.IsImplicit);
IsFirstComponentList = false;		IsFirstComponentList = false;
}		}

// If there is an entry in PartialStruct it means we have a struct with		// If there is an entry in PartialStruct it means we have a struct with
// individual members mapped. Emit an extra combined entry.		// individual members mapped. Emit an extra combined entry.
if (PartialStruct.Base.isValid())		if (PartialStruct.Base.isValid()) {
		// Make sure Dims have the same size as BP, P, Sizes, and Types.
		// Put 0 here to make sure that `emitOffloadingArrays` use it
		// to skip processing this one. (OpenMP do not allow non-contigous for
		// declare mapper)
		CurDims.push_back(0);
emitCombinedEntry(BasePointers, Pointers, Sizes, Types, CurTypes,		emitCombinedEntry(BasePointers, Pointers, Sizes, Types, CurTypes,
PartialStruct);		PartialStruct);
		}

// We need to append the results of this capture to what we already have.		// We need to append the results of this capture to what we already have.
BasePointers.append(CurBasePointers.begin(), CurBasePointers.end());		BasePointers.append(CurBasePointers.begin(), CurBasePointers.end());
Pointers.append(CurPointers.begin(), CurPointers.end());		Pointers.append(CurPointers.begin(), CurPointers.end());
Sizes.append(CurSizes.begin(), CurSizes.end());		Sizes.append(CurSizes.begin(), CurSizes.end());
Types.append(CurTypes.begin(), CurTypes.end());		Types.append(CurTypes.begin(), CurTypes.end());
}		}
}		}
▲ Show 20 Lines • Show All 93 Lines • ▼ Show 20 Lines	public:

/// Generate the base pointers, section pointers, sizes and map types		/// Generate the base pointers, section pointers, sizes and map types
/// associated to a given capture.		/// associated to a given capture.
void generateInfoForCapture(const CapturedStmt::Capture *Cap,		void generateInfoForCapture(const CapturedStmt::Capture *Cap,
llvm::Value *Arg,		llvm::Value *Arg,
MapBaseValuesArrayTy &BasePointers,		MapBaseValuesArrayTy &BasePointers,
MapValuesArrayTy &Pointers,		MapValuesArrayTy &Pointers,
MapValuesArrayTy &Sizes, MapFlagsArrayTy &Types,		MapValuesArrayTy &Sizes, MapFlagsArrayTy &Types,
		MapDimArrayTy &Dims,
StructRangeInfoTy &PartialStruct) const {		StructRangeInfoTy &PartialStruct) const {
assert(!Cap->capturesVariableArrayType() &&		assert(!Cap->capturesVariableArrayType() &&
"Not expecting to generate map info for a variable array type!");		"Not expecting to generate map info for a variable array type!");

// We need to know when we generating information for the first component		// We need to know when we generating information for the first component
const ValueDecl *VD = Cap->capturesThis()		const ValueDecl *VD = Cap->capturesThis()
? nullptr		? nullptr
: Cap->getCapturedVar()->getCanonicalDecl();		: Cap->getCapturedVar()->getCanonicalDecl();
▲ Show 20 Lines • Show All 133 Lines • ▼ Show 20 Lines	for (const auto &Pair : OverlappedData) {
OpenMPMapClauseKind MapType;		OpenMPMapClauseKind MapType;
ArrayRef<OpenMPMapModifierKind> MapModifiers;		ArrayRef<OpenMPMapModifierKind> MapModifiers;
bool IsImplicit;		bool IsImplicit;
std::tie(Components, MapType, MapModifiers, IsImplicit) = L;		std::tie(Components, MapType, MapModifiers, IsImplicit) = L;
ArrayRef<OMPClauseMappableExprCommon::MappableExprComponentListRef>		ArrayRef<OMPClauseMappableExprCommon::MappableExprComponentListRef>
OverlappedComponents = Pair.getSecond();		OverlappedComponents = Pair.getSecond();
bool IsFirstComponentList = true;		bool IsFirstComponentList = true;
generateInfoForComponentList(MapType, MapModifiers, Components,		generateInfoForComponentList(MapType, MapModifiers, Components,
BasePointers, Pointers, Sizes, Types,		BasePointers, Pointers, Sizes, Types, Dims,
PartialStruct, IsFirstComponentList,		PartialStruct, IsFirstComponentList,
IsImplicit, OverlappedComponents);		IsImplicit, OverlappedComponents);
}		}
// Go through other elements without overlapped elements.		// Go through other elements without overlapped elements.
bool IsFirstComponentList = OverlappedData.empty();		bool IsFirstComponentList = OverlappedData.empty();
for (const MapData &L : DeclComponentLists) {		for (const MapData &L : DeclComponentLists) {
OMPClauseMappableExprCommon::MappableExprComponentListRef Components;		OMPClauseMappableExprCommon::MappableExprComponentListRef Components;
OpenMPMapClauseKind MapType;		OpenMPMapClauseKind MapType;
ArrayRef<OpenMPMapModifierKind> MapModifiers;		ArrayRef<OpenMPMapModifierKind> MapModifiers;
bool IsImplicit;		bool IsImplicit;
std::tie(Components, MapType, MapModifiers, IsImplicit) = L;		std::tie(Components, MapType, MapModifiers, IsImplicit) = L;
auto It = OverlappedData.find(&L);		auto It = OverlappedData.find(&L);
if (It == OverlappedData.end())		if (It == OverlappedData.end())
generateInfoForComponentList(MapType, MapModifiers, Components,		generateInfoForComponentList(
BasePointers, Pointers, Sizes, Types,		MapType, MapModifiers, Components, BasePointers, Pointers, Sizes,
PartialStruct, IsFirstComponentList,		Types, Dims, PartialStruct, IsFirstComponentList, IsImplicit);
IsImplicit);
IsFirstComponentList = false;		IsFirstComponentList = false;
}		}
}		}

/// Generate the base pointers, section pointers, sizes and map types		/// Generate the base pointers, section pointers, sizes and map types
/// associated with the declare target link variables.		/// associated with the declare target link variables.
void generateInfoForDeclareTargetLink(MapBaseValuesArrayTy &BasePointers,		void generateInfoForDeclareTargetLink(MapBaseValuesArrayTy &BasePointers,
MapValuesArrayTy &Pointers,		MapValuesArrayTy &Pointers,
MapValuesArrayTy &Sizes,		MapValuesArrayTy &Sizes,
MapFlagsArrayTy &Types) const {		MapFlagsArrayTy &Types,
		MapDimArrayTy &Dims) const {
assert(CurDir.is<const OMPExecutableDirective *>() &&		assert(CurDir.is<const OMPExecutableDirective *>() &&
"Expect a executable directive");		"Expect a executable directive");
const auto CurExecDir = CurDir.get<const OMPExecutableDirective >();		const auto CurExecDir = CurDir.get<const OMPExecutableDirective >();
// Map other list items in the map clause which are not captured variables		// Map other list items in the map clause which are not captured variables
// but "declare target link" global variables.		// but "declare target link" global variables.
for (const auto *C : CurExecDir->getClausesOfKind<OMPMapClause>()) {		for (const auto *C : CurExecDir->getClausesOfKind<OMPMapClause>()) {
for (const auto L : C->component_lists()) {		for (const auto L : C->component_lists()) {
if (!L.first)		if (!L.first)
continue;		continue;
const auto *VD = dyn_cast<VarDecl>(L.first);		const auto *VD = dyn_cast<VarDecl>(L.first);
if (!VD)		if (!VD)
continue;		continue;
llvm::Optional<OMPDeclareTargetDeclAttr::MapTypeTy> Res =		llvm::Optional<OMPDeclareTargetDeclAttr::MapTypeTy> Res =
OMPDeclareTargetDeclAttr::isDeclareTargetDeclaration(VD);		OMPDeclareTargetDeclAttr::isDeclareTargetDeclaration(VD);
if (CGF.CGM.getOpenMPRuntime().hasRequiresUnifiedSharedMemory() \|\|		if (CGF.CGM.getOpenMPRuntime().hasRequiresUnifiedSharedMemory() \|\|
!Res \|\| *Res != OMPDeclareTargetDeclAttr::MT_Link)		!Res \|\| *Res != OMPDeclareTargetDeclAttr::MT_Link)
continue;		continue;
StructRangeInfoTy PartialStruct;		StructRangeInfoTy PartialStruct;
generateInfoForComponentList(		generateInfoForComponentList(
C->getMapType(), C->getMapTypeModifiers(), L.second, BasePointers,		C->getMapType(), C->getMapTypeModifiers(), L.second, BasePointers,
Pointers, Sizes, Types, PartialStruct,		Pointers, Sizes, Types, Dims, PartialStruct,
/IsFirstComponentList=/true, C->isImplicit());		/IsFirstComponentList=/true, C->isImplicit());
assert(!PartialStruct.Base.isValid() &&		assert(!PartialStruct.Base.isValid() &&
"No partial structs for declare target link expected.");		"No partial structs for declare target link expected.");
}		}
}		}
}		}

/// Generate the default map information for a given capture \a CI,		/// Generate the default map information for a given capture \a CI,
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	void generateDefaultMapInfo(const CapturedStmt::Capture &CI,

// Add flag stating this is an implicit map.		// Add flag stating this is an implicit map.
if (IsImplicit)		if (IsImplicit)
CurMapTypes.back() \|= OMP_MAP_IMPLICIT;		CurMapTypes.back() \|= OMP_MAP_IMPLICIT;
}		}
};		};
} // anonymous namespace		} // anonymous namespace

/// Emit the arrays used to pass the captures and map information to the		/// Emit the arrays used to pass the captures and map information to the
/// offloading runtime library. If there is no map or capture information,		/// offloading runtime library. If there is no map or capture information,
/// return nullptr by reference.		/// return nullptr by reference.
ABataevUnsubmitted Done Reply Inline Actions Why removed the comment? ABataev: Why removed the comment?
static void		static void
emitOffloadingArrays(CodeGenFunction &CGF,		emitOffloadingArrays(CodeGenFunction &CGF,
MappableExprsHandler::MapBaseValuesArrayTy &BasePointers,		MappableExprsHandler::MapBaseValuesArrayTy &BasePointers,
MappableExprsHandler::MapValuesArrayTy &Pointers,		MappableExprsHandler::MapValuesArrayTy &Pointers,
MappableExprsHandler::MapValuesArrayTy &Sizes,		MappableExprsHandler::MapValuesArrayTy &Sizes,
MappableExprsHandler::MapFlagsArrayTy &MapTypes,		MappableExprsHandler::MapFlagsArrayTy &MapTypes,
CGOpenMPRuntime::TargetDataInfo &Info) {		MappableExprsHandler::MapDimArrayTy &Dims,
		CGOpenMPRuntime::TargetDataInfo &Info,
		bool IsNonContiguous = false) {
		ABataevUnsubmitted Done Reply Inline Actions Do you really need to pass `Dims` here if you have `Dims` data member in `Info` parameter? Why you can't use `Info.Dims` instead? ABataev: Do you really need to pass `Dims` here if you have `Dims` data member in `Info` parameter? Why…
		cchenAuthorUnsubmitted Done Reply Inline Actions I think I haven't added Dims in TargetDataInfo atm, I'll add into it and then use it via Info. cchen: I think I haven't added Dims in TargetDataInfo atm, I'll add into it and then use it via Info.
CodeGenModule &CGM = CGF.CGM;		CodeGenModule &CGM = CGF.CGM;
ASTContext &Ctx = CGF.getContext();		ASTContext &Ctx = CGF.getContext();

// Reset the array information.		// Reset the array information.
Info.clearArrayInfo();		Info.clearArrayInfo();
Info.NumberOfPtrs = BasePointers.size();		Info.NumberOfPtrs = BasePointers.size();

if (Info.NumberOfPtrs) {		if (Info.NumberOfPtrs) {
		ABataevUnsubmitted Done Reply Inline Actions Can we encapsulate these new data into `CGOpenMPRuntime::TargetDataInfo`? ABataev: Can we encapsulate these new data into `CGOpenMPRuntime::TargetDataInfo`?
// Detect if we have any capture size requiring runtime evaluation of the		// Detect if we have any capture size requiring runtime evaluation of the
// size so that a constant array could be eventually used.		// size so that a constant array could be eventually used.
bool hasRuntimeEvaluationCaptureSize = false;		bool hasRuntimeEvaluationCaptureSize = false;
for (llvm::Value *S : Sizes)		for (llvm::Value *S : Sizes)
if (!isa<llvm::Constant>(S)) {		if (!isa<llvm::Constant>(S)) {
hasRuntimeEvaluationCaptureSize = true;		hasRuntimeEvaluationCaptureSize = true;
break;		break;
}		}
Show All 18 Lines	if (hasRuntimeEvaluationCaptureSize) {
Int64Ty, PointerNumAP, nullptr, ArrayType::Normal,		Int64Ty, PointerNumAP, nullptr, ArrayType::Normal,
/IndexTypeQuals=/0);		/IndexTypeQuals=/0);
Info.SizesArray =		Info.SizesArray =
CGF.CreateMemTemp(SizeArrayType, ".offload_sizes").getPointer();		CGF.CreateMemTemp(SizeArrayType, ".offload_sizes").getPointer();
} else {		} else {
// We expect all the sizes to be constant, so we collect them to create		// We expect all the sizes to be constant, so we collect them to create
// a constant array.		// a constant array.
SmallVector<llvm::Constant *, 16> ConstSizes;		SmallVector<llvm::Constant *, 16> ConstSizes;
for (llvm::Value *S : Sizes)		for (unsigned I = 0, E = Sizes.size(); I < E; ++I) {
ConstSizes.push_back(cast<llvm::Constant>(S));		if (IsNonContiguous &&
		(MapTypes[I] & MappableExprsHandler::OMP_MAP_DESCRIPTOR)) {
		ConstSizes.push_back(llvm::ConstantInt::get(CGF.Int64Ty, Dims[I]));
		} else {
		ConstSizes.push_back(cast<llvm::Constant>(Sizes[I]));
		}
		}

auto *SizesArrayInit = llvm::ConstantArray::get(		auto *SizesArrayInit = llvm::ConstantArray::get(
llvm::ArrayType::get(CGM.Int64Ty, ConstSizes.size()), ConstSizes);		llvm::ArrayType::get(CGM.Int64Ty, ConstSizes.size()), ConstSizes);
std::string Name = CGM.getOpenMPRuntime().getName({"offload_sizes"});		std::string Name = CGM.getOpenMPRuntime().getName({"offload_sizes"});
auto *SizesArrayGbl = new llvm::GlobalVariable(		auto *SizesArrayGbl = new llvm::GlobalVariable(
CGM.getModule(), SizesArrayInit->getType(),		CGM.getModule(), SizesArrayInit->getType(),
/isConstant=/true, llvm::GlobalValue::PrivateLinkage,		/isConstant=/true, llvm::GlobalValue::PrivateLinkage,
SizesArrayInit, Name);		SizesArrayInit, Name);
▲ Show 20 Lines • Show All 47 Lines • ▼ Show 20 Lines	for (unsigned I = 0; I < Info.NumberOfPtrs; ++I) {
/Idx1=/I);		/Idx1=/I);
Address SAddr(S, Ctx.getTypeAlignInChars(Int64Ty));		Address SAddr(S, Ctx.getTypeAlignInChars(Int64Ty));
CGF.Builder.CreateStore(		CGF.Builder.CreateStore(
CGF.Builder.CreateIntCast(Sizes[I], CGM.Int64Ty, /isSigned=/true),		CGF.Builder.CreateIntCast(Sizes[I], CGM.Int64Ty, /isSigned=/true),
SAddr);		SAddr);
}		}
}		}
}		}

		if (IsNonContiguous) {
		if (Info.Offsets.empty())
		return;
		ABataevUnsubmitted Done Reply Inline Actions Better just to have something like this: if (!IsNonContiguous \|\| Info.Offsets.empty() \|\| Info.NumberOfPtrs == 0) return; ... ABataev: Better just to have something like this: ``` if (!IsNonContiguous \|\| Info.Offsets.empty() \|\|…

		ASTContext &C = CGF.getContext();
		CodeGenModule &CGM = CGF.CGM;

		// Build an array of struct descriptor_dim and then assign it to
		// offload_args.
		if (Info.NumberOfPtrs) {
		// Build struct descriptor_dim {
		// int64_t offset;
		// int64_t count;
		// int64_t stride
		// };
		QualType Int64Ty =
		C.getIntTypeForBitwidth(/DestWidth=/64, /Signed=/true);
		RecordDecl *RD;
		RD = C.buildImplicitRecord("descriptor_dim");
		RD->startDefinition();
		addFieldToRecordDecl(C, RD, Int64Ty);
		addFieldToRecordDecl(C, RD, Int64Ty);
		addFieldToRecordDecl(C, RD, Int64Ty);
		RD->completeDefinition();
		QualType DimTy = C.getRecordType(RD);

		enum { OffsetFD = 0, CountFD, StrideFD };
		// The reason we need two index variable here is because the size of
		// "Dims" is the same as the size of Components, however, the size of
		// offset, count , and stride is equal to the size of base declaration
		// that is non-contiguous.
		for (unsigned I = 0, L = 0, E = Info.Offsets.size(); I < E; ++I) {
		ABataevUnsubmitted Done Reply Inline Actions Maybe worth it to outline it into a separate function to reduce code size and the complexity of this function? And just call this new function here. ABataev: Maybe worth it to outline it into a separate function to reduce code size and the complexity of…
		if (Dims[I] == 0)
		continue;
		llvm::APInt Size(/numBits=/32, Dims[I]);
		QualType ArrayTy =
		C.getConstantArrayType(DimTy, Size, nullptr, ArrayType::Normal, 0);
		Address DimsAddr = CGF.CreateMemTemp(ArrayTy, "dims");
		for (unsigned II = 0, EE = Dims[I]; II < EE; ++II) {
		unsigned RevIdx = EE - II - 1;
		LValue DimsLVal = CGF.MakeAddrLValue(
		CGF.Builder.CreateConstArrayGEP(DimsAddr, II), DimTy);
		// Offset
		LValue OffsetLVal = CGF.EmitLValueForField(
		DimsLVal, *std::next(RD->field_begin(), OffsetFD));
		CGF.EmitStoreOfScalar(Info.Offsets[L][RevIdx], OffsetLVal);
		// Count
		LValue CountLVal = CGF.EmitLValueForField(
		DimsLVal, *std::next(RD->field_begin(), CountFD));
		CGF.EmitStoreOfScalar(Info.Counts[L][RevIdx], CountLVal);
		// Stride
		LValue StrideLVal = CGF.EmitLValueForField(
		DimsLVal, *std::next(RD->field_begin(), StrideFD));
		CGF.EmitStoreOfScalar(Info.Strides[L][RevIdx], StrideLVal);
		}
		// args[I] = &dims
		Address DAddr = CGF.Builder.CreatePointerBitCastOrAddrSpaceCast(
		DimsAddr, CGM.Int8PtrTy);
		ABataevUnsubmitted Done Reply Inline Actions `C.getTypeAlignInChars(C.VoidPtrTy)`->`CGF.getPointerAlign()` ABataev: `C.getTypeAlignInChars(C.VoidPtrTy)`->`CGF.getPointerAlign()`
		llvm::Value *P = CGF.Builder.CreateConstInBoundsGEP2_32(
		llvm::ArrayType::get(CGM.VoidPtrTy, Info.NumberOfPtrs),
		Info.PointersArray, 0, I);
		Address PAddr(P, C.getTypeAlignInChars(C.VoidPtrTy));
		CGF.Builder.CreateStore(DAddr.getPointer(), PAddr);
		++L;
		}
		}
		}
}		}

/// Emit the arguments to be passed to the runtime library based on the		/// Emit the arguments to be passed to the runtime library based on the
/// arrays of pointers, sizes and map types.		/// arrays of pointers, sizes and map types.
static void emitOffloadingArraysArgument(		static void emitOffloadingArraysArgument(
CodeGenFunction &CGF, llvm::Value *&BasePointersArrayArg,		CodeGenFunction &CGF, llvm::Value *&BasePointersArrayArg,
		ABataevUnsubmitted Done Reply Inline Actions Same question as before - can we merge this functionality with the existing functions? ABataev: Same question as before - can we merge this functionality with the existing functions?
llvm::Value &PointersArrayArg, llvm::Value &SizesArrayArg,		llvm::Value &PointersArrayArg, llvm::Value &SizesArrayArg,
llvm::Value *&MapTypesArrayArg, CGOpenMPRuntime::TargetDataInfo &Info) {		llvm::Value *&MapTypesArrayArg, CGOpenMPRuntime::TargetDataInfo &Info) {
CodeGenModule &CGM = CGF.CGM;		CodeGenModule &CGM = CGF.CGM;
if (Info.NumberOfPtrs) {		if (Info.NumberOfPtrs) {
BasePointersArrayArg = CGF.Builder.CreateConstInBoundsGEP2_32(		BasePointersArrayArg = CGF.Builder.CreateConstInBoundsGEP2_32(
llvm::ArrayType::get(CGM.VoidPtrTy, Info.NumberOfPtrs),		llvm::ArrayType::get(CGM.VoidPtrTy, Info.NumberOfPtrs),
Info.BasePointersArray,		Info.BasePointersArray,
/Idx0=/0, /Idx1=/0);		/Idx0=/0, /Idx1=/0);
▲ Show 20 Lines • Show All 662 Lines • ▼ Show 20 Lines	void CGOpenMPRuntime::emitTargetCall(
auto &&TargetThenGen = [this, &ThenGen, &D, &InputInfo, &MapTypesArray,		auto &&TargetThenGen = [this, &ThenGen, &D, &InputInfo, &MapTypesArray,
&CapturedVars, RequiresOuterTask,		&CapturedVars, RequiresOuterTask,
&CS](CodeGenFunction &CGF, PrePostActionTy &) {		&CS](CodeGenFunction &CGF, PrePostActionTy &) {
// Fill up the arrays with all the captured variables.		// Fill up the arrays with all the captured variables.
MappableExprsHandler::MapBaseValuesArrayTy BasePointers;		MappableExprsHandler::MapBaseValuesArrayTy BasePointers;
MappableExprsHandler::MapValuesArrayTy Pointers;		MappableExprsHandler::MapValuesArrayTy Pointers;
MappableExprsHandler::MapValuesArrayTy Sizes;		MappableExprsHandler::MapValuesArrayTy Sizes;
MappableExprsHandler::MapFlagsArrayTy MapTypes;		MappableExprsHandler::MapFlagsArrayTy MapTypes;
		MappableExprsHandler::MapDimArrayTy Dims;

// Get mappable expression information.		// Get mappable expression information.
MappableExprsHandler MEHandler(D, CGF);		MappableExprsHandler MEHandler(D, CGF);
llvm::DenseMap<llvm::Value , llvm::Value > LambdaPointers;		llvm::DenseMap<llvm::Value , llvm::Value > LambdaPointers;

auto RI = CS.getCapturedRecordDecl()->field_begin();		auto RI = CS.getCapturedRecordDecl()->field_begin();
auto CV = CapturedVars.begin();		auto CV = CapturedVars.begin();
for (CapturedStmt::const_capture_iterator CI = CS.capture_begin(),		for (CapturedStmt::const_capture_iterator CI = CS.capture_begin(),
CE = CS.capture_end();		CE = CS.capture_end();
CI != CE; ++CI, ++RI, ++CV) {		CI != CE; ++CI, ++RI, ++CV) {
MappableExprsHandler::MapBaseValuesArrayTy CurBasePointers;		MappableExprsHandler::MapBaseValuesArrayTy CurBasePointers;
MappableExprsHandler::MapValuesArrayTy CurPointers;		MappableExprsHandler::MapValuesArrayTy CurPointers;
MappableExprsHandler::MapValuesArrayTy CurSizes;		MappableExprsHandler::MapValuesArrayTy CurSizes;
MappableExprsHandler::MapFlagsArrayTy CurMapTypes;		MappableExprsHandler::MapFlagsArrayTy CurMapTypes;
		MappableExprsHandler::MapDimArrayTy CurDims;
MappableExprsHandler::StructRangeInfoTy PartialStruct;		MappableExprsHandler::StructRangeInfoTy PartialStruct;

// VLA sizes are passed to the outlined region by copy and do not have map		// VLA sizes are passed to the outlined region by copy and do not have map
// information associated.		// information associated.
if (CI->capturesVariableArrayType()) {		if (CI->capturesVariableArrayType()) {
CurBasePointers.push_back(*CV);		CurBasePointers.push_back(*CV);
CurPointers.push_back(*CV);		CurPointers.push_back(*CV);
CurSizes.push_back(CGF.Builder.CreateIntCast(		CurSizes.push_back(CGF.Builder.CreateIntCast(
CGF.getTypeSize(RI->getType()), CGF.Int64Ty, /isSigned=/true));		CGF.getTypeSize(RI->getType()), CGF.Int64Ty, /isSigned=/true));
// Copy to the device as an argument. No need to retrieve it.		// Copy to the device as an argument. No need to retrieve it.
CurMapTypes.push_back(MappableExprsHandler::OMP_MAP_LITERAL \|		CurMapTypes.push_back(MappableExprsHandler::OMP_MAP_LITERAL \|
MappableExprsHandler::OMP_MAP_TARGET_PARAM \|		MappableExprsHandler::OMP_MAP_TARGET_PARAM \|
MappableExprsHandler::OMP_MAP_IMPLICIT);		MappableExprsHandler::OMP_MAP_IMPLICIT);
} else {		} else {
// If we have any information in the map clause, we use it, otherwise we		// If we have any information in the map clause, we use it, otherwise we
// just do a default mapping.		// just do a default mapping.
MEHandler.generateInfoForCapture(CI, *CV, CurBasePointers, CurPointers,		MEHandler.generateInfoForCapture(CI, *CV, CurBasePointers, CurPointers,
CurSizes, CurMapTypes, PartialStruct);		CurSizes, CurMapTypes, CurDims,
		PartialStruct);
if (CurBasePointers.empty())		if (CurBasePointers.empty())
MEHandler.generateDefaultMapInfo(CI, RI, CV, CurBasePointers,		MEHandler.generateDefaultMapInfo(CI, RI, CV, CurBasePointers,
CurPointers, CurSizes, CurMapTypes);		CurPointers, CurSizes, CurMapTypes);
// Generate correct mapping for variables captured by reference in		// Generate correct mapping for variables captured by reference in
// lambdas.		// lambdas.
if (CI->capturesVariable())		if (CI->capturesVariable())
MEHandler.generateInfoForLambdaCaptures(		MEHandler.generateInfoForLambdaCaptures(
CI->getCapturedVar(), *CV, CurBasePointers, CurPointers, CurSizes,		CI->getCapturedVar(), *CV, CurBasePointers, CurPointers, CurSizes,
Show All 20 Lines	for (CapturedStmt::const_capture_iterator CI = CS.capture_begin(),
MapTypes.append(CurMapTypes.begin(), CurMapTypes.end());		MapTypes.append(CurMapTypes.begin(), CurMapTypes.end());
}		}
// Adjust MEMBER_OF flags for the lambdas captures.		// Adjust MEMBER_OF flags for the lambdas captures.
MEHandler.adjustMemberOfForLambdaCaptures(LambdaPointers, BasePointers,		MEHandler.adjustMemberOfForLambdaCaptures(LambdaPointers, BasePointers,
Pointers, MapTypes);		Pointers, MapTypes);
// Map other list items in the map clause which are not captured variables		// Map other list items in the map clause which are not captured variables
// but "declare target link" global variables.		// but "declare target link" global variables.
MEHandler.generateInfoForDeclareTargetLink(BasePointers, Pointers, Sizes,		MEHandler.generateInfoForDeclareTargetLink(BasePointers, Pointers, Sizes,
MapTypes);		MapTypes, Dims);

TargetDataInfo Info;		TargetDataInfo Info;
// Fill up the arrays and create the arguments.		// Fill up the arrays and create the arguments.
emitOffloadingArrays(CGF, BasePointers, Pointers, Sizes, MapTypes, Info);		emitOffloadingArrays(CGF, BasePointers, Pointers, Sizes, MapTypes, Dims,
		Info);
emitOffloadingArraysArgument(CGF, Info.BasePointersArray,		emitOffloadingArraysArgument(CGF, Info.BasePointersArray,
Info.PointersArray, Info.SizesArray,		Info.PointersArray, Info.SizesArray,
Info.MapTypesArray, Info);		Info.MapTypesArray, Info);
InputInfo.NumberOfTargetItems = Info.NumberOfPtrs;		InputInfo.NumberOfTargetItems = Info.NumberOfPtrs;
InputInfo.BasePointersArray =		InputInfo.BasePointersArray =
Address(Info.BasePointersArray, CGM.getPointerAlign());		Address(Info.BasePointersArray, CGM.getPointerAlign());
InputInfo.PointersArray =		InputInfo.PointersArray =
Address(Info.PointersArray, CGM.getPointerAlign());		Address(Info.PointersArray, CGM.getPointerAlign());
▲ Show 20 Lines • Show All 585 Lines • ▼ Show 20 Lines	void CGOpenMPRuntime::emitTargetDataCalls(
// closing of the region.		// closing of the region.
auto &&BeginThenGen = [this, &D, Device, &Info,		auto &&BeginThenGen = [this, &D, Device, &Info,
&CodeGen](CodeGenFunction &CGF, PrePostActionTy &) {		&CodeGen](CodeGenFunction &CGF, PrePostActionTy &) {
// Fill up the arrays with all the mapped variables.		// Fill up the arrays with all the mapped variables.
MappableExprsHandler::MapBaseValuesArrayTy BasePointers;		MappableExprsHandler::MapBaseValuesArrayTy BasePointers;
MappableExprsHandler::MapValuesArrayTy Pointers;		MappableExprsHandler::MapValuesArrayTy Pointers;
MappableExprsHandler::MapValuesArrayTy Sizes;		MappableExprsHandler::MapValuesArrayTy Sizes;
MappableExprsHandler::MapFlagsArrayTy MapTypes;		MappableExprsHandler::MapFlagsArrayTy MapTypes;
		MappableExprsHandler::MapDimArrayTy Dims;
		MappableExprsHandler::MapNonContiguousArrayTy Offsets;
		MappableExprsHandler::MapNonContiguousArrayTy Counts;
		MappableExprsHandler::MapNonContiguousArrayTy Strides;

// Get map clause information.		// Get map clause information.
MappableExprsHandler MCHandler(D, CGF);		MappableExprsHandler MCHandler(D, CGF);
MCHandler.generateAllInfo(BasePointers, Pointers, Sizes, MapTypes);		MCHandler.generateAllInfo(BasePointers, Pointers, Sizes, MapTypes, Dims,
		Offsets, Counts, Strides);

		ABataevUnsubmitted Done Reply Inline Actions Better to pass `Info` here directly. ABataev: Better to pass `Info` here directly.
		// Fill up non-contiguous information.
		Info.Offsets = std::move((Offsets));
		Info.Counts = std::move((Counts));
		Info.Strides = std::move((Strides));
		ABataevUnsubmitted Done Reply Inline Actions Better just to pass `Info.Offsets`, `Info.Counts` and `Info.Strides` as arguments to `generateAllInfo()` function and do not create local copies at all. ABataev: Better just to pass `Info.Offsets`, `Info.Counts` and `Info.Strides` as arguments to…

// Fill up the arrays and create the arguments.		// Fill up the arrays and create the arguments.
emitOffloadingArrays(CGF, BasePointers, Pointers, Sizes, MapTypes, Info);		emitOffloadingArrays(CGF, BasePointers, Pointers, Sizes, MapTypes, Dims,
		Info, /IsNonContiguous=/true);

llvm::Value *BasePointersArrayArg = nullptr;		llvm::Value *BasePointersArrayArg = nullptr;
llvm::Value *PointersArrayArg = nullptr;		llvm::Value *PointersArrayArg = nullptr;
llvm::Value *SizesArrayArg = nullptr;		llvm::Value *SizesArrayArg = nullptr;
llvm::Value *MapTypesArrayArg = nullptr;		llvm::Value *MapTypesArrayArg = nullptr;
emitOffloadingArraysArgument(CGF, BasePointersArrayArg, PointersArrayArg,		emitOffloadingArraysArgument(CGF, BasePointersArrayArg, PointersArrayArg,
SizesArrayArg, MapTypesArrayArg, Info);		SizesArrayArg, MapTypesArrayArg, Info);

▲ Show 20 Lines • Show All 217 Lines • ▼ Show 20 Lines	void CGOpenMPRuntime::emitTargetDataStandAloneCall(

auto &&TargetThenGen = [this, &ThenGen, &D, &InputInfo, &MapTypesArray](		auto &&TargetThenGen = [this, &ThenGen, &D, &InputInfo, &MapTypesArray](
CodeGenFunction &CGF, PrePostActionTy &) {		CodeGenFunction &CGF, PrePostActionTy &) {
// Fill up the arrays with all the mapped variables.		// Fill up the arrays with all the mapped variables.
MappableExprsHandler::MapBaseValuesArrayTy BasePointers;		MappableExprsHandler::MapBaseValuesArrayTy BasePointers;
MappableExprsHandler::MapValuesArrayTy Pointers;		MappableExprsHandler::MapValuesArrayTy Pointers;
MappableExprsHandler::MapValuesArrayTy Sizes;		MappableExprsHandler::MapValuesArrayTy Sizes;
MappableExprsHandler::MapFlagsArrayTy MapTypes;		MappableExprsHandler::MapFlagsArrayTy MapTypes;
		MappableExprsHandler::MapDimArrayTy Dims;
		MappableExprsHandler::MapNonContiguousArrayTy Offsets;
		MappableExprsHandler::MapNonContiguousArrayTy Counts;
		MappableExprsHandler::MapNonContiguousArrayTy Strides;

// Get map clause information.		// Get map clause information.
MappableExprsHandler MEHandler(D, CGF);		MappableExprsHandler MEHandler(D, CGF);
MEHandler.generateAllInfo(BasePointers, Pointers, Sizes, MapTypes);		MEHandler.generateAllInfo(BasePointers, Pointers, Sizes, MapTypes, Dims,
		Offsets, Counts, Strides);

TargetDataInfo Info;		TargetDataInfo Info;

		// Fill up non-contiguous information.
		Info.Offsets = std::move((Offsets));
		Info.Counts = std::move((Counts));
		Info.Strides = std::move((Strides));
		ABataevUnsubmitted Done Reply Inline Actions Same, pass the fields as arguments instead. ABataev: Same, pass the fields as arguments instead.

// Fill up the arrays and create the arguments.		// Fill up the arrays and create the arguments.
emitOffloadingArrays(CGF, BasePointers, Pointers, Sizes, MapTypes, Info);		emitOffloadingArrays(CGF, BasePointers, Pointers, Sizes, MapTypes, Dims,
		Info, /IsNonContiguous=/true);
emitOffloadingArraysArgument(CGF, Info.BasePointersArray,		emitOffloadingArraysArgument(CGF, Info.BasePointersArray,
Info.PointersArray, Info.SizesArray,		Info.PointersArray, Info.SizesArray,
Info.MapTypesArray, Info);		Info.MapTypesArray, Info);
InputInfo.NumberOfTargetItems = Info.NumberOfPtrs;		InputInfo.NumberOfTargetItems = Info.NumberOfPtrs;
InputInfo.BasePointersArray =		InputInfo.BasePointersArray =
Address(Info.BasePointersArray, CGM.getPointerAlign());		Address(Info.BasePointersArray, CGM.getPointerAlign());
InputInfo.PointersArray =		InputInfo.PointersArray =
Address(Info.PointersArray, CGM.getPointerAlign());		Address(Info.PointersArray, CGM.getPointerAlign());
▲ Show 20 Lines • Show All 1,624 Lines • Show Last 20 Lines

clang/lib/Sema/SemaOpenMP.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 42 Lines • ▼ Show 20 Lines

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// Stack of data-sharing attributes for variables		// Stack of data-sharing attributes for variables
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static const Expr *checkMapClauseExpressionBase(		static const Expr *checkMapClauseExpressionBase(
Sema &SemaRef, Expr *E,		Sema &SemaRef, Expr *E,
OMPClauseMappableExprCommon::MappableExprComponentList &CurComponents,		OMPClauseMappableExprCommon::MappableExprComponentList &CurComponents,
OpenMPClauseKind CKind, bool NoDiagnose);		OpenMPClauseKind CKind, OpenMPDirectiveKind DKind, bool NoDiagnose);

namespace {		namespace {
/// Default data sharing attributes, which can be applied to directive.		/// Default data sharing attributes, which can be applied to directive.
enum DefaultDataSharingAttributes {		enum DefaultDataSharingAttributes {
DSA_unspecified = 0, /// Data sharing attribute not specified.		DSA_unspecified = 0, /// Data sharing attribute not specified.
DSA_none = 1 << 0, /// Default data sharing attribute 'none'.		DSA_none = 1 << 0, /// Default data sharing attribute 'none'.
DSA_shared = 1 << 1, /// Default data sharing attribute 'shared'.		DSA_shared = 1 << 1, /// Default data sharing attribute 'shared'.
};		};
▲ Show 20 Lines • Show All 3,454 Lines • ▼ Show 20 Lines	if (auto *TE = dyn_cast<CXXThisExpr>(E->getBase()->IgnoreParenCasts())) {
if (DVar.CKind != OMPC_unknown)		if (DVar.CKind != OMPC_unknown)
ImplicitFirstprivate.push_back(E);		ImplicitFirstprivate.push_back(E);
}		}
return;		return;
}		}
if (isOpenMPTargetExecutionDirective(DKind)) {		if (isOpenMPTargetExecutionDirective(DKind)) {
OMPClauseMappableExprCommon::MappableExprComponentList CurComponents;		OMPClauseMappableExprCommon::MappableExprComponentList CurComponents;
if (!checkMapClauseExpressionBase(SemaRef, E, CurComponents, OMPC_map,		if (!checkMapClauseExpressionBase(SemaRef, E, CurComponents, OMPC_map,
		Stack->getCurrentDirective(),
/NoDiagnose=/true))		/NoDiagnose=/true))
return;		return;
const auto *VD = cast<ValueDecl>(		const auto *VD = cast<ValueDecl>(
CurComponents.back().getAssociatedDeclaration()->getCanonicalDecl());		CurComponents.back().getAssociatedDeclaration()->getCanonicalDecl());
if (!Stack->checkMappableExprComponentListsForDecl(		if (!Stack->checkMappableExprComponentListsForDecl(
VD, /CurrentRegionOnly=/true,		VD, /CurrentRegionOnly=/true,
[&CurComponents](		[&CurComponents](
OMPClauseMappableExprCommon::MappableExprComponentListRef		OMPClauseMappableExprCommon::MappableExprComponentListRef
▲ Show 20 Lines • Show All 12,901 Lines • ▼ Show 20 Lines
// but these would be valid:		// but these would be valid:
// r.ArrS[3].Arr[6:7]		// r.ArrS[3].Arr[6:7]
//		//
// r.ArrS[3].x		// r.ArrS[3].x
namespace {		namespace {
class MapBaseChecker final : public StmtVisitor<MapBaseChecker, bool> {		class MapBaseChecker final : public StmtVisitor<MapBaseChecker, bool> {
Sema &SemaRef;		Sema &SemaRef;
OpenMPClauseKind CKind = OMPC_unknown;		OpenMPClauseKind CKind = OMPC_unknown;
		OpenMPDirectiveKind DKind = OMPD_unknown;
OMPClauseMappableExprCommon::MappableExprComponentList &Components;		OMPClauseMappableExprCommon::MappableExprComponentList &Components;
		bool IsNonContiguous = false;
		ABataevUnsubmitted Done Reply Inline Actions Add default initializer ABataev: Add default initializer
bool NoDiagnose = false;		bool NoDiagnose = false;
const Expr *RelevantExpr = nullptr;		const Expr *RelevantExpr = nullptr;
bool AllowUnitySizeArraySection = true;		bool AllowUnitySizeArraySection = true;
bool AllowWholeSizeArraySection = true;		bool AllowWholeSizeArraySection = true;
		bool AllowAnotherPtr = true;
SourceLocation ELoc;		SourceLocation ELoc;
SourceRange ERange;		SourceRange ERange;

void emitErrorMsg() {		void emitErrorMsg() {
// If nothing else worked, this is not a valid map clause expression.		// If nothing else worked, this is not a valid map clause expression.
if (SemaRef.getLangOpts().OpenMP < 50) {		if (SemaRef.getLangOpts().OpenMP < 50) {
SemaRef.Diag(ELoc,		SemaRef.Diag(ELoc,
diag::err_omp_expected_named_var_member_or_array_expression)		diag::err_omp_expected_named_var_member_or_array_expression)
<< ERange;		<< ERange;
} else {		} else {
SemaRef.Diag(ELoc, diag::err_omp_non_lvalue_in_map_or_motion_clauses)		SemaRef.Diag(ELoc, diag::err_omp_non_lvalue_in_map_or_motion_clauses)
<< getOpenMPClauseName(CKind) << ERange;		<< getOpenMPClauseName(CKind) << ERange;
}		}
}		}

public:		public:
bool VisitDeclRefExpr(DeclRefExpr *DRE) {		bool VisitDeclRefExpr(DeclRefExpr *DRE) {
if (!isa<VarDecl>(DRE->getDecl())) {		if (!isa<VarDecl>(DRE->getDecl())) {
emitErrorMsg();		emitErrorMsg();
return false;		return false;
}		}
assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");		assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");
RelevantExpr = DRE;		RelevantExpr = DRE;
// Record the component.		// Record the component.
Components.emplace_back(DRE, DRE->getDecl());		Components.emplace_back(DRE, DRE->getDecl(), IsNonContiguous);
return true;		return true;
}		}

bool VisitMemberExpr(MemberExpr *ME) {		bool VisitMemberExpr(MemberExpr *ME) {
Expr *E = ME;		Expr *E = ME;
Expr *BaseE = ME->getBase()->IgnoreParenCasts();		Expr *BaseE = ME->getBase()->IgnoreParenCasts();

if (isa<CXXThisExpr>(BaseE)) {		if (isa<CXXThisExpr>(BaseE)) {
▲ Show 20 Lines • Show All 55 Lines • ▼ Show 20 Lines	bool VisitMemberExpr(MemberExpr *ME) {
// OpenMP 4.5 [2.15.5.1, map Clause, Restrictions, p.7]		// OpenMP 4.5 [2.15.5.1, map Clause, Restrictions, p.7]
// If a list item is an element of a structure, only the rightmost symbol		// If a list item is an element of a structure, only the rightmost symbol
// of the variable reference can be an array section.		// of the variable reference can be an array section.
//		//
AllowUnitySizeArraySection = false;		AllowUnitySizeArraySection = false;
AllowWholeSizeArraySection = false;		AllowWholeSizeArraySection = false;

// Record the component.		// Record the component.
Components.emplace_back(ME, FD);		Components.emplace_back(ME, FD, IsNonContiguous);
return RelevantExpr \|\| Visit(E);		return RelevantExpr \|\| Visit(E);
}		}

bool VisitArraySubscriptExpr(ArraySubscriptExpr *AE) {		bool VisitArraySubscriptExpr(ArraySubscriptExpr *AE) {
Expr *E = AE->getBase()->IgnoreParenImpCasts();		Expr *E = AE->getBase()->IgnoreParenImpCasts();

if (!E->getType()->isAnyPointerType() && !E->getType()->isArrayType()) {		if (!E->getType()->isAnyPointerType() && !E->getType()->isArrayType()) {
if (!NoDiagnose) {		if (!NoDiagnose) {
Show All 21 Lines	if (const auto *TE = dyn_cast<CXXThisExpr>(E->IgnoreParenCasts())) {
SemaRef.Diag(AE->getIdx()->getExprLoc(),		SemaRef.Diag(AE->getIdx()->getExprLoc(),
diag::note_omp_invalid_subscript_on_this_ptr_map);		diag::note_omp_invalid_subscript_on_this_ptr_map);
}		}
assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");		assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");
RelevantExpr = TE;		RelevantExpr = TE;
}		}

// Record the component - we don't have any declaration associated.		// Record the component - we don't have any declaration associated.
Components.emplace_back(AE, nullptr);		Components.emplace_back(AE, nullptr, IsNonContiguous);

return RelevantExpr \|\| Visit(E);		return RelevantExpr \|\| Visit(E);
}		}

bool VisitOMPArraySectionExpr(OMPArraySectionExpr *OASE) {		bool VisitOMPArraySectionExpr(OMPArraySectionExpr *OASE) {
assert(!NoDiagnose && "Array sections cannot be implicitly mapped.");		assert(!NoDiagnose && "Array sections cannot be implicitly mapped.");
Expr *E = OASE->getBase()->IgnoreParenImpCasts();		Expr *E = OASE->getBase()->IgnoreParenImpCasts();
QualType CurType =		QualType CurType =
Show All 22 Lines	if (AllowWholeSizeArraySection) {
// Any array section is currently allowed. Allowing a whole size array		// Any array section is currently allowed. Allowing a whole size array
// section implies allowing a unity array section as well.		// section implies allowing a unity array section as well.
//		//
// If this array section refers to the whole dimension we can still		// If this array section refers to the whole dimension we can still
// accept other array sections before this one, except if the base is a		// accept other array sections before this one, except if the base is a
// pointer. Otherwise, only unitary sections are accepted.		// pointer. Otherwise, only unitary sections are accepted.
if (NotWhole \|\| IsPointer)		if (NotWhole \|\| IsPointer)
AllowWholeSizeArraySection = false;		AllowWholeSizeArraySection = false;
		} else if (DKind == OMPD_target_update &&
		cchenAuthorUnsubmitted Done Reply Inline Actions @ABataev , I guess you're saying the condition should be `!AllowWholeSizeArraySection && DKind == OMPD_target_update && SemaRef.getLangOpts().OpenMP >= 50`? cchen: @ABataev , I guess you're saying the condition should be `!AllowWholeSizeArraySection && DKind…
		ABataevUnsubmitted Done Reply Inline Actions No, what I want is to try to simplify the code. I see now why do you need this flag. I'm just thinking can we avoid adding this flag to the clause and save some mem space? ABataev: No, what I want is to try to simplify the code. I see now why do you need this flag. I'm just…
		cchenAuthorUnsubmitted Done Reply Inline Actions But we also don't want to do the analysis in codegen I guess? Also, if we emit non-contiguous runtime for every target update call, we need to change tons of stuff (tons of lit tests, runtime implementation, etc...). cchen: But we also don't want to do the analysis in codegen I guess? Also, if we emit non-contiguous…
		ABataevUnsubmitted Done Reply Inline Actions Maybe make it a part of `MappableComponent`, if possible, and put it into `PointerIntPair<Expr , 1, bool> AssociatedExpression;`? ABataev:* Maybe make it a part of `MappableComponent`, if possible, and put it into `PointerIntPair<Expr…
		SemaRef.getLangOpts().OpenMP >= 50) {
		if (IsPointer && !AllowAnotherPtr)
		SemaRef.Diag(ELoc, diag::err_omp_section_length_undefined) << true;
		ABataevUnsubmitted Done Reply Inline Actions Better to use integer value as selectors, not boolean. ABataev: Better to use integer value as selectors, not boolean.
		cchenAuthorUnsubmitted Done Reply Inline Actions The selector for `err_omp_section_length_undefined` is a bool value. (true for unknown bound false for not a array type, so always be true here). Do you mean that I need to create a new kind of diagnosis message here and use integer as selectors? cchen: The selector for `err_omp_section_length_undefined` is a bool value. (true for unknown bound…
		ABataevUnsubmitted Done Reply Inline Actions No, it is an integer, starts from `0` ABataev: No, it is an integer, starts from `0`
		else
		IsNonContiguous = true;
} else if (AllowUnitySizeArraySection && NotUnity) {		} else if (AllowUnitySizeArraySection && NotUnity) {
		ABataevUnsubmitted Done Reply Inline Actions Remove braces here, they are not needed. ABataev: Remove braces here, they are not needed.
// A unity or whole array section is not allowed and that is not		// A unity or whole array section is not allowed and that is not
// compatible with the properties of the current array section.		// compatible with the properties of the current array section.
SemaRef.Diag(		SemaRef.Diag(
ELoc, diag::err_array_section_does_not_specify_contiguous_storage)		ELoc, diag::err_array_section_does_not_specify_contiguous_storage)
<< OASE->getSourceRange();		<< OASE->getSourceRange();
return false;		return false;
}		}

		if (IsPointer)
		AllowAnotherPtr = false;

if (const auto *TE = dyn_cast<CXXThisExpr>(E)) {		if (const auto *TE = dyn_cast<CXXThisExpr>(E)) {
Expr::EvalResult ResultR;		Expr::EvalResult ResultR;
Expr::EvalResult ResultL;		Expr::EvalResult ResultL;
if (!OASE->getLength()->isValueDependent() &&		if (!OASE->getLength()->isValueDependent() &&
OASE->getLength()->EvaluateAsInt(ResultR, SemaRef.getASTContext()) &&		OASE->getLength()->EvaluateAsInt(ResultR, SemaRef.getASTContext()) &&
!ResultR.Val.getInt().isOneValue()) {		!ResultR.Val.getInt().isOneValue()) {
SemaRef.Diag(OASE->getLength()->getExprLoc(),		SemaRef.Diag(OASE->getLength()->getExprLoc(),
diag::err_omp_invalid_map_this_expr);		diag::err_omp_invalid_map_this_expr);
Show All 9 Lines	if (const auto *TE = dyn_cast<CXXThisExpr>(E)) {
SemaRef.Diag(OASE->getLowerBound()->getExprLoc(),		SemaRef.Diag(OASE->getLowerBound()->getExprLoc(),
diag::note_omp_invalid_lower_bound_on_this_ptr_mapping);		diag::note_omp_invalid_lower_bound_on_this_ptr_mapping);
}		}
assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");		assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");
RelevantExpr = TE;		RelevantExpr = TE;
}		}

// Record the component - we don't have any declaration associated.		// Record the component - we don't have any declaration associated.
Components.emplace_back(OASE, nullptr);		Components.emplace_back(OASE, nullptr, false);
		ABataevUnsubmitted Done Reply Inline Actions `/IsNonContiguous=/false` ABataev: `/IsNonContiguous=/false`
return RelevantExpr \|\| Visit(E);		return RelevantExpr \|\| Visit(E);
}		}
bool VisitOMPArrayShapingExpr(OMPArrayShapingExpr *E) {		bool VisitOMPArrayShapingExpr(OMPArrayShapingExpr *E) {
Expr *Base = E->getBase();		Expr *Base = E->getBase();

// Record the component - we don't have any declaration associated.		// Record the component - we don't have any declaration associated.
Components.emplace_back(E, nullptr);		Components.emplace_back(E, nullptr, IsNonContiguous);

return Visit(Base->IgnoreParenImpCasts());		return Visit(Base->IgnoreParenImpCasts());
}		}

bool VisitUnaryOperator(UnaryOperator *UO) {		bool VisitUnaryOperator(UnaryOperator *UO) {
if (SemaRef.getLangOpts().OpenMP < 50 \|\| !UO->isLValue() \|\|		if (SemaRef.getLangOpts().OpenMP < 50 \|\| !UO->isLValue() \|\|
UO->getOpcode() != UO_Deref) {		UO->getOpcode() != UO_Deref) {
emitErrorMsg();		emitErrorMsg();
return false;		return false;
}		}
if (!RelevantExpr) {		if (!RelevantExpr) {
// Record the component if haven't found base decl.		// Record the component if haven't found base decl.
Components.emplace_back(UO, nullptr);		Components.emplace_back(UO, nullptr, false);
		ABataevUnsubmitted Done Reply Inline Actions `/IsNonContiguous=/false` ABataev: `/IsNonContiguous=/false`
}		}
return RelevantExpr \|\| Visit(UO->getSubExpr()->IgnoreParenImpCasts());		return RelevantExpr \|\| Visit(UO->getSubExpr()->IgnoreParenImpCasts());
}		}
bool VisitBinaryOperator(BinaryOperator *BO) {		bool VisitBinaryOperator(BinaryOperator *BO) {
if (SemaRef.getLangOpts().OpenMP < 50 \|\| !BO->getType()->isPointerType()) {		if (SemaRef.getLangOpts().OpenMP < 50 \|\| !BO->getType()->isPointerType()) {
emitErrorMsg();		emitErrorMsg();
return false;		return false;
}		}

// Pointer arithmetic is the only thing we expect to happen here so after we		// Pointer arithmetic is the only thing we expect to happen here so after we
// make sure the binary operator is a pointer type, the we only thing need		// make sure the binary operator is a pointer type, the we only thing need
// to to is to visit the subtree that has the same type as root (so that we		// to to is to visit the subtree that has the same type as root (so that we
// know the other subtree is just an offset)		// know the other subtree is just an offset)
Expr *LE = BO->getLHS()->IgnoreParenImpCasts();		Expr *LE = BO->getLHS()->IgnoreParenImpCasts();
Expr *RE = BO->getRHS()->IgnoreParenImpCasts();		Expr *RE = BO->getRHS()->IgnoreParenImpCasts();
Components.emplace_back(BO, nullptr);		Components.emplace_back(BO, nullptr, false);
assert((LE->getType().getTypePtr() == BO->getType().getTypePtr() \|\|		assert((LE->getType().getTypePtr() == BO->getType().getTypePtr() \|\|
RE->getType().getTypePtr() == BO->getType().getTypePtr()) &&		RE->getType().getTypePtr() == BO->getType().getTypePtr()) &&
"Either LHS or RHS have base decl inside");		"Either LHS or RHS have base decl inside");
if (BO->getType().getTypePtr() == LE->getType().getTypePtr())		if (BO->getType().getTypePtr() == LE->getType().getTypePtr())
return RelevantExpr \|\| Visit(LE);		return RelevantExpr \|\| Visit(LE);
return RelevantExpr \|\| Visit(RE);		return RelevantExpr \|\| Visit(RE);
}		}
bool VisitCXXThisExpr(CXXThisExpr *CTE) {		bool VisitCXXThisExpr(CXXThisExpr *CTE) {
assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");		assert(!RelevantExpr && "RelevantExpr is expected to be nullptr");
RelevantExpr = CTE;		RelevantExpr = CTE;
Components.emplace_back(CTE, nullptr);		Components.emplace_back(CTE, nullptr, IsNonContiguous);
return true;		return true;
}		}
bool VisitStmt(Stmt *) {		bool VisitStmt(Stmt *) {
emitErrorMsg();		emitErrorMsg();
return false;		return false;
}		}
const Expr *getFoundBase() const {		const Expr *getFoundBase() const {
return RelevantExpr;		return RelevantExpr;
}		}
explicit MapBaseChecker(		explicit MapBaseChecker(
Sema &SemaRef, OpenMPClauseKind CKind,		Sema &SemaRef, OpenMPClauseKind CKind, OpenMPDirectiveKind DKind,
OMPClauseMappableExprCommon::MappableExprComponentList &Components,		OMPClauseMappableExprCommon::MappableExprComponentList &Components,
bool NoDiagnose, SourceLocation &ELoc, SourceRange &ERange)		bool NoDiagnose, SourceLocation &ELoc, SourceRange &ERange)
: SemaRef(SemaRef), CKind(CKind), Components(Components),		: SemaRef(SemaRef), CKind(CKind), DKind(DKind), Components(Components),
NoDiagnose(NoDiagnose), ELoc(ELoc), ERange(ERange) {}		NoDiagnose(NoDiagnose), ELoc(ELoc), ERange(ERange) {}
};		};
} // namespace		} // namespace

/// Return the expression of the base of the mappable expression or null if it		/// Return the expression of the base of the mappable expression or null if it
/// cannot be determined and do all the necessary checks to see if the expression		/// cannot be determined and do all the necessary checks to see if the expression
/// is valid as a standalone mappable expression. In the process, record all the		/// is valid as a standalone mappable expression. In the process, record all the
/// components of the expression.		/// components of the expression.
static const Expr *checkMapClauseExpressionBase(		static const Expr *checkMapClauseExpressionBase(
Sema &SemaRef, Expr *E,		Sema &SemaRef, Expr *E,
OMPClauseMappableExprCommon::MappableExprComponentList &CurComponents,		OMPClauseMappableExprCommon::MappableExprComponentList &CurComponents,
OpenMPClauseKind CKind, bool NoDiagnose) {		OpenMPClauseKind CKind, OpenMPDirectiveKind DKind, bool NoDiagnose) {
SourceLocation ELoc = E->getExprLoc();		SourceLocation ELoc = E->getExprLoc();
SourceRange ERange = E->getSourceRange();		SourceRange ERange = E->getSourceRange();
MapBaseChecker Checker(SemaRef, CKind, CurComponents, NoDiagnose, ELoc,		MapBaseChecker Checker(SemaRef, CKind, DKind, CurComponents, NoDiagnose, ELoc,
ERange);		ERange);
if (Checker.Visit(E->IgnoreParens()))		if (Checker.Visit(E->IgnoreParens()))
return Checker.getFoundBase();		return Checker.getFoundBase();
return nullptr;		return nullptr;
}		}

// Return true if expression E associated with value VD has conflicts with other		// Return true if expression E associated with value VD has conflicts with other
// map information.		// map information.
▲ Show 20 Lines • Show All 463 Lines • ▼ Show 20 Lines	for (Expr *RE : MVLI.VarList) {
}		}

OMPClauseMappableExprCommon::MappableExprComponentList CurComponents;		OMPClauseMappableExprCommon::MappableExprComponentList CurComponents;
ValueDecl *CurDeclaration = nullptr;		ValueDecl *CurDeclaration = nullptr;

// Obtain the array or member expression bases if required. Also, fill the		// Obtain the array or member expression bases if required. Also, fill the
// components array with all the components identified in the process.		// components array with all the components identified in the process.
const Expr *BE = checkMapClauseExpressionBase(		const Expr *BE = checkMapClauseExpressionBase(
SemaRef, SimpleExpr, CurComponents, CKind, /NoDiagnose=/false);		SemaRef, SimpleExpr, CurComponents, CKind, DSAS->getCurrentDirective(),
		/NoDiagnose=/false);
if (!BE)		if (!BE)
continue;		continue;

assert(!CurComponents.empty() &&		assert(!CurComponents.empty() &&
"Invalid mappable expression information.");		"Invalid mappable expression information.");

if (const auto *TE = dyn_cast<CXXThisExpr>(BE)) {		if (const auto *TE = dyn_cast<CXXThisExpr>(BE)) {
// Add store "this" pointer to class in DSAStackTy for future checking		// Add store "this" pointer to class in DSAStackTy for future checking
▲ Show 20 Lines • Show All 1,230 Lines • ▼ Show 20 Lines	for (Expr *RefExpr : VarList) {
// We need to add a data sharing attribute for this variable to make sure it		// We need to add a data sharing attribute for this variable to make sure it
// is correctly captured. A variable that shows up in a use_device_ptr has		// is correctly captured. A variable that shows up in a use_device_ptr has
// similar properties of a first private variable.		// similar properties of a first private variable.
DSAStack->addDSA(D, RefExpr->IgnoreParens(), OMPC_firstprivate, Ref);		DSAStack->addDSA(D, RefExpr->IgnoreParens(), OMPC_firstprivate, Ref);

// Create a mappable component for the list item. List items in this clause		// Create a mappable component for the list item. List items in this clause
// only need a component.		// only need a component.
MVLI.VarBaseDeclarations.push_back(D);		MVLI.VarBaseDeclarations.push_back(D);
MVLI.VarComponents.resize(MVLI.VarComponents.size() + 1);		MVLI.VarComponents.resize(MVLI.VarComponents.size() + 1);
MVLI.VarComponents.back().push_back(		MVLI.VarComponents.back().emplace_back(
OMPClauseMappableExprCommon::MappableComponent(SimpleRefExpr, D));		OMPClauseMappableExprCommon::MappableComponent(
		SimpleRefExpr, D,
		/IsNonContiguous=/false));
}		}
		ABataevUnsubmitted Done Reply Inline Actions Use `.emplace_back(SimpleRefExpr, D, false);` ABataev: Use `.emplace_back(SimpleRefExpr, D, false);`
		ABataevUnsubmitted Done Reply Inline Actions `.emplace_back(SimpleRefExpr, D, /IsNonContiguous=/false);` ABataev: `.emplace_back(SimpleRefExpr, D, /IsNonContiguous=/false);`

if (MVLI.ProcessedVarList.empty())		if (MVLI.ProcessedVarList.empty())
return nullptr;		return nullptr;

return OMPUseDevicePtrClause::Create(		return OMPUseDevicePtrClause::Create(
Context, Locs, MVLI.ProcessedVarList, PrivateCopies, Inits,		Context, Locs, MVLI.ProcessedVarList, PrivateCopies, Inits,
MVLI.VarBaseDeclarations, MVLI.VarComponents);		MVLI.VarBaseDeclarations, MVLI.VarComponents);
}		}
Show All 29 Lines	for (Expr *RefExpr : VarList) {
// is correctly captured. A variable that shows up in a use_device_addr has		// is correctly captured. A variable that shows up in a use_device_addr has
// similar properties of a first private variable.		// similar properties of a first private variable.
DSAStack->addDSA(D, RefExpr->IgnoreParens(), OMPC_firstprivate, Ref);		DSAStack->addDSA(D, RefExpr->IgnoreParens(), OMPC_firstprivate, Ref);

// Create a mappable component for the list item. List items in this clause		// Create a mappable component for the list item. List items in this clause
// only need a component.		// only need a component.
MVLI.VarBaseDeclarations.push_back(D);		MVLI.VarBaseDeclarations.push_back(D);
MVLI.VarComponents.emplace_back();		MVLI.VarComponents.emplace_back();
MVLI.VarComponents.back().push_back(		MVLI.VarComponents.back().emplace_back(
OMPClauseMappableExprCommon::MappableComponent(SimpleRefExpr, D));		OMPClauseMappableExprCommon::MappableComponent(
		SimpleRefExpr, D,
		/IsNonContiguous=/false));
}		}
		ABataevUnsubmitted Done Reply Inline Actions `.emplace_back(SimpleRefExpr, D, /IsNonContiguous=/false);` ABataev: `.emplace_back(SimpleRefExpr, D, /IsNonContiguous=/false);`

if (MVLI.ProcessedVarList.empty())		if (MVLI.ProcessedVarList.empty())
return nullptr;		return nullptr;

return OMPUseDeviceAddrClause::Create(Context, Locs, MVLI.ProcessedVarList,		return OMPUseDeviceAddrClause::Create(Context, Locs, MVLI.ProcessedVarList,
MVLI.VarBaseDeclarations,		MVLI.VarBaseDeclarations,
MVLI.VarComponents);		MVLI.VarComponents);
}		}
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	if (DSAStack->checkMappableExprComponentListsForDecl(
Diag(ELoc, diag::err_omp_map_shared_storage) << RefExpr->getSourceRange();		Diag(ELoc, diag::err_omp_map_shared_storage) << RefExpr->getSourceRange();
Diag(ConflictExpr->getExprLoc(), diag::note_used_here)		Diag(ConflictExpr->getExprLoc(), diag::note_used_here)
<< ConflictExpr->getSourceRange();		<< ConflictExpr->getSourceRange();
continue;		continue;
}		}

// Store the components in the stack so that they can be used to check		// Store the components in the stack so that they can be used to check
// against other clauses later on.		// against other clauses later on.
OMPClauseMappableExprCommon::MappableComponent MC(SimpleRefExpr, D);		OMPClauseMappableExprCommon::MappableComponent MC(SimpleRefExpr, D, false);
		ABataevUnsubmitted Done Reply Inline Actions Add a comment for `false` argument with the name of parameter. ABataev: Add a comment for `false` argument with the name of parameter.
DSAStack->addMappableExpressionComponents(		DSAStack->addMappableExpressionComponents(
D, MC, /WhereFoundClauseKind=/OMPC_is_device_ptr);		D, MC, /WhereFoundClauseKind=/OMPC_is_device_ptr);

// Record the expression we've just processed.		// Record the expression we've just processed.
MVLI.ProcessedVarList.push_back(SimpleRefExpr);		MVLI.ProcessedVarList.push_back(SimpleRefExpr);

// Create a mappable component for the list item. List items in this clause		// Create a mappable component for the list item. List items in this clause
// only need a component. We use a null declaration to signal fields in		// only need a component. We use a null declaration to signal fields in
▲ Show 20 Lines • Show All 382 Lines • Show Last 20 Lines

clang/lib/Serialization/ASTReader.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 12,504 Lines • ▼ Show 20 Lines	void OMPClauseReader::VisitOMPMapClause(OMPMapClause *C) {
ListSizes.reserve(TotalLists);		ListSizes.reserve(TotalLists);
for (unsigned i = 0; i < TotalLists; ++i)		for (unsigned i = 0; i < TotalLists; ++i)
ListSizes.push_back(Record.readInt());		ListSizes.push_back(Record.readInt());
C->setComponentListSizes(ListSizes);		C->setComponentListSizes(ListSizes);

SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;		SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;
Components.reserve(TotalComponents);		Components.reserve(TotalComponents);
for (unsigned i = 0; i < TotalComponents; ++i) {		for (unsigned i = 0; i < TotalComponents; ++i) {
Expr *AssociatedExpr = Record.readExpr();		Expr *AssociatedExprPr = Record.readExpr();
auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();		auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();
Components.push_back(OMPClauseMappableExprCommon::MappableComponent(		Components.emplace_back(OMPClauseMappableExprCommon::MappableComponent(
AssociatedExpr, AssociatedDecl));		AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false));
		ABataevUnsubmitted Done Reply Inline Actions `.emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);` ABataev: `.emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);`
		ABataevUnsubmitted Done Reply Inline Actions Still calling an extra constructor here, just `.emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);` ABataev: Still calling an extra constructor here, just `.emplace_back(AssociatedExprPr, AssociatedDecl…
}		}
C->setComponents(Components, ListSizes);		C->setComponents(Components, ListSizes);
}		}

void OMPClauseReader::VisitOMPAllocateClause(OMPAllocateClause *C) {		void OMPClauseReader::VisitOMPAllocateClause(OMPAllocateClause *C) {
C->setLParenLoc(Record.readSourceLocation());		C->setLParenLoc(Record.readSourceLocation());
C->setColonLoc(Record.readSourceLocation());		C->setColonLoc(Record.readSourceLocation());
C->setAllocator(Record.readSubExpr());		C->setAllocator(Record.readSubExpr());
▲ Show 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	void OMPClauseReader::VisitOMPToClause(OMPToClause *C) {
ListSizes.reserve(TotalLists);		ListSizes.reserve(TotalLists);
for (unsigned i = 0; i < TotalLists; ++i)		for (unsigned i = 0; i < TotalLists; ++i)
ListSizes.push_back(Record.readInt());		ListSizes.push_back(Record.readInt());
C->setComponentListSizes(ListSizes);		C->setComponentListSizes(ListSizes);

SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;		SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;
Components.reserve(TotalComponents);		Components.reserve(TotalComponents);
for (unsigned i = 0; i < TotalComponents; ++i) {		for (unsigned i = 0; i < TotalComponents; ++i) {
Expr *AssociatedExpr = Record.readSubExpr();		Expr *AssociatedExprPr = Record.readSubExpr();
		bool IsNonContiguous = Record.readBool();
auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();		auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();
Components.push_back(OMPClauseMappableExprCommon::MappableComponent(		Components.emplace_back(OMPClauseMappableExprCommon::MappableComponent(
AssociatedExpr, AssociatedDecl));		AssociatedExprPr, AssociatedDecl, IsNonContiguous));
		ABataevUnsubmitted Done Reply Inline Actions Same, use `emplace_back()` ABataev: Same, use `emplace_back()`
}		}
		ABataevUnsubmitted Done Reply Inline Actions Just `Components.emplace_back(AssociatedExprPr, AssociatedDecl, IsNonContiguous);` ABataev: Just `Components.emplace_back(AssociatedExprPr, AssociatedDecl, IsNonContiguous);`
C->setComponents(Components, ListSizes);		C->setComponents(Components, ListSizes);
}		}

void OMPClauseReader::VisitOMPFromClause(OMPFromClause *C) {		void OMPClauseReader::VisitOMPFromClause(OMPFromClause *C) {
C->setLParenLoc(Record.readSourceLocation());		C->setLParenLoc(Record.readSourceLocation());
C->setMapperQualifierLoc(Record.readNestedNameSpecifierLoc());		C->setMapperQualifierLoc(Record.readNestedNameSpecifierLoc());
C->setMapperIdInfo(Record.readDeclarationNameInfo());		C->setMapperIdInfo(Record.readDeclarationNameInfo());
auto NumVars = C->varlist_size();		auto NumVars = C->varlist_size();
Show All 29 Lines	void OMPClauseReader::VisitOMPFromClause(OMPFromClause *C) {
ListSizes.reserve(TotalLists);		ListSizes.reserve(TotalLists);
for (unsigned i = 0; i < TotalLists; ++i)		for (unsigned i = 0; i < TotalLists; ++i)
ListSizes.push_back(Record.readInt());		ListSizes.push_back(Record.readInt());
C->setComponentListSizes(ListSizes);		C->setComponentListSizes(ListSizes);

SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;		SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;
Components.reserve(TotalComponents);		Components.reserve(TotalComponents);
for (unsigned i = 0; i < TotalComponents; ++i) {		for (unsigned i = 0; i < TotalComponents; ++i) {
Expr *AssociatedExpr = Record.readSubExpr();		Expr *AssociatedExprPr = Record.readSubExpr();
		bool IsNonContiguous = Record.readBool();
auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();		auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();
Components.push_back(OMPClauseMappableExprCommon::MappableComponent(		Components.emplace_back(OMPClauseMappableExprCommon::MappableComponent(
		ABataevUnsubmitted Done Reply Inline Actions Same, use `emplace_back()` ABataev: Same, use `emplace_back()`
AssociatedExpr, AssociatedDecl));		AssociatedExprPr, AssociatedDecl, IsNonContiguous));
		ABataevUnsubmitted Done Reply Inline Actions `.emplace_back(AssociatedExprPr, AssociatedDecl, IsNonContiguous);` ABataev: `.emplace_back(AssociatedExprPr, AssociatedDecl, IsNonContiguous);`
}		}
C->setComponents(Components, ListSizes);		C->setComponents(Components, ListSizes);
}		}

void OMPClauseReader::VisitOMPUseDevicePtrClause(OMPUseDevicePtrClause *C) {		void OMPClauseReader::VisitOMPUseDevicePtrClause(OMPUseDevicePtrClause *C) {
C->setLParenLoc(Record.readSourceLocation());		C->setLParenLoc(Record.readSourceLocation());
auto NumVars = C->varlist_size();		auto NumVars = C->varlist_size();
auto UniqueDecls = C->getUniqueDeclarationsNum();		auto UniqueDecls = C->getUniqueDeclarationsNum();
Show All 30 Lines	void OMPClauseReader::VisitOMPUseDevicePtrClause(OMPUseDevicePtrClause *C) {
ListSizes.reserve(TotalLists);		ListSizes.reserve(TotalLists);
for (unsigned i = 0; i < TotalLists; ++i)		for (unsigned i = 0; i < TotalLists; ++i)
ListSizes.push_back(Record.readInt());		ListSizes.push_back(Record.readInt());
C->setComponentListSizes(ListSizes);		C->setComponentListSizes(ListSizes);

SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;		SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;
Components.reserve(TotalComponents);		Components.reserve(TotalComponents);
for (unsigned i = 0; i < TotalComponents; ++i) {		for (unsigned i = 0; i < TotalComponents; ++i) {
Expr *AssociatedExpr = Record.readSubExpr();		auto *AssociatedExprPr = Record.readSubExpr();
auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();		auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();
Components.push_back(OMPClauseMappableExprCommon::MappableComponent(		Components.emplace_back(OMPClauseMappableExprCommon::MappableComponent(
		ABataevUnsubmitted Done Reply Inline Actions Same, use `emplace_back()` ABataev: Same, use `emplace_back()`
AssociatedExpr, AssociatedDecl));		AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false));
		ABataevUnsubmitted Done Reply Inline Actions .`emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);` ABataev: .`emplace_back(AssociatedExprPr, AssociatedDecl, /IsNonContiguous=/false);`
}		}
C->setComponents(Components, ListSizes);		C->setComponents(Components, ListSizes);
}		}

void OMPClauseReader::VisitOMPUseDeviceAddrClause(OMPUseDeviceAddrClause *C) {		void OMPClauseReader::VisitOMPUseDeviceAddrClause(OMPUseDeviceAddrClause *C) {
C->setLParenLoc(Record.readSourceLocation());		C->setLParenLoc(Record.readSourceLocation());
auto NumVars = C->varlist_size();		auto NumVars = C->varlist_size();
auto UniqueDecls = C->getUniqueDeclarationsNum();		auto UniqueDecls = C->getUniqueDeclarationsNum();
Show All 24 Lines	for (unsigned i = 0; i < TotalLists; ++i)
ListSizes.push_back(Record.readInt());		ListSizes.push_back(Record.readInt());
C->setComponentListSizes(ListSizes);		C->setComponentListSizes(ListSizes);

SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;		SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;
Components.reserve(TotalComponents);		Components.reserve(TotalComponents);
for (unsigned i = 0; i < TotalComponents; ++i) {		for (unsigned i = 0; i < TotalComponents; ++i) {
Expr *AssociatedExpr = Record.readSubExpr();		Expr *AssociatedExpr = Record.readSubExpr();
auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();		auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();
Components.push_back(OMPClauseMappableExprCommon::MappableComponent(		Components.emplace_back(OMPClauseMappableExprCommon::MappableComponent(
AssociatedExpr, AssociatedDecl));		AssociatedExpr, AssociatedDecl, /IsNonContiguous/ false));
}		}
		ABataevUnsubmitted Done Reply Inline Actions `.emplace_back(AssociatedExpr, AssociatedDecl, /IsNonContiguous/ false);` ABataev: `.emplace_back(AssociatedExpr, AssociatedDecl, /IsNonContiguous/ false);`
C->setComponents(Components, ListSizes);		C->setComponents(Components, ListSizes);
}		}

void OMPClauseReader::VisitOMPIsDevicePtrClause(OMPIsDevicePtrClause *C) {		void OMPClauseReader::VisitOMPIsDevicePtrClause(OMPIsDevicePtrClause *C) {
C->setLParenLoc(Record.readSourceLocation());		C->setLParenLoc(Record.readSourceLocation());
auto NumVars = C->varlist_size();		auto NumVars = C->varlist_size();
auto UniqueDecls = C->getUniqueDeclarationsNum();		auto UniqueDecls = C->getUniqueDeclarationsNum();
auto TotalLists = C->getTotalComponentListNum();		auto TotalLists = C->getTotalComponentListNum();
Show All 24 Lines	for (unsigned i = 0; i < TotalLists; ++i)
ListSizes.push_back(Record.readInt());		ListSizes.push_back(Record.readInt());
C->setComponentListSizes(ListSizes);		C->setComponentListSizes(ListSizes);

SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;		SmallVector<OMPClauseMappableExprCommon::MappableComponent, 32> Components;
Components.reserve(TotalComponents);		Components.reserve(TotalComponents);
for (unsigned i = 0; i < TotalComponents; ++i) {		for (unsigned i = 0; i < TotalComponents; ++i) {
Expr *AssociatedExpr = Record.readSubExpr();		Expr *AssociatedExpr = Record.readSubExpr();
auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();		auto *AssociatedDecl = Record.readDeclAs<ValueDecl>();
Components.push_back(OMPClauseMappableExprCommon::MappableComponent(		Components.emplace_back(OMPClauseMappableExprCommon::MappableComponent(
AssociatedExpr, AssociatedDecl));		AssociatedExpr, AssociatedDecl, /IsNonContiguous=/false));
		ABataevUnsubmitted Done Reply Inline Actions Same, use `emplace_back()` ABataev: Same, use `emplace_back()`
}		}
		ABataevUnsubmitted Done Reply Inline Actions `.emplace_back(AssociatedExpr, AssociatedDecl, /IsNonContiguous=/false));` ABataev: `.emplace_back(AssociatedExpr, AssociatedDecl, /IsNonContiguous=/false));`
C->setComponents(Components, ListSizes);		C->setComponents(Components, ListSizes);
}		}

void OMPClauseReader::VisitOMPNontemporalClause(OMPNontemporalClause *C) {		void OMPClauseReader::VisitOMPNontemporalClause(OMPNontemporalClause *C) {
C->setLParenLoc(Record.readSourceLocation());		C->setLParenLoc(Record.readSourceLocation());
unsigned NumVars = C->varlist_size();		unsigned NumVars = C->varlist_size();
SmallVector<Expr *, 16> Vars;		SmallVector<Expr *, 16> Vars;
Vars.reserve(NumVars);		Vars.reserve(NumVars);
▲ Show 20 Lines • Show All 81 Lines • Show Last 20 Lines

clang/lib/Serialization/ASTWriter.cpp

Show First 20 Lines • Show All 6,575 Lines • ▼ Show 20 Lines	void OMPClauseWriter::VisitOMPToClause(OMPToClause *C) {
for (auto *D : C->all_decls())		for (auto *D : C->all_decls())
Record.AddDeclRef(D);		Record.AddDeclRef(D);
for (auto N : C->all_num_lists())		for (auto N : C->all_num_lists())
Record.push_back(N);		Record.push_back(N);
for (auto N : C->all_lists_sizes())		for (auto N : C->all_lists_sizes())
Record.push_back(N);		Record.push_back(N);
for (auto &M : C->all_components()) {		for (auto &M : C->all_components()) {
Record.AddStmt(M.getAssociatedExpression());		Record.AddStmt(M.getAssociatedExpression());
		Record.writeBool(M.isNonContiguous());
		ABataevUnsubmitted Done Reply Inline Actions There is a member function `writeBool` ABataev: There is a member function `writeBool`
Record.AddDeclRef(M.getAssociatedDeclaration());		Record.AddDeclRef(M.getAssociatedDeclaration());
}		}
}		}

void OMPClauseWriter::VisitOMPFromClause(OMPFromClause *C) {		void OMPClauseWriter::VisitOMPFromClause(OMPFromClause *C) {
Record.push_back(C->varlist_size());		Record.push_back(C->varlist_size());
Record.push_back(C->getUniqueDeclarationsNum());		Record.push_back(C->getUniqueDeclarationsNum());
Record.push_back(C->getTotalComponentListNum());		Record.push_back(C->getTotalComponentListNum());
Record.push_back(C->getTotalComponentsNum());		Record.push_back(C->getTotalComponentsNum());
Record.AddSourceLocation(C->getLParenLoc());		Record.AddSourceLocation(C->getLParenLoc());
Record.AddNestedNameSpecifierLoc(C->getMapperQualifierLoc());		Record.AddNestedNameSpecifierLoc(C->getMapperQualifierLoc());
Record.AddDeclarationNameInfo(C->getMapperIdInfo());		Record.AddDeclarationNameInfo(C->getMapperIdInfo());
for (auto *E : C->varlists())		for (auto *E : C->varlists())
Record.AddStmt(E);		Record.AddStmt(E);
for (auto *E : C->mapperlists())		for (auto *E : C->mapperlists())
Record.AddStmt(E);		Record.AddStmt(E);
for (auto *D : C->all_decls())		for (auto *D : C->all_decls())
Record.AddDeclRef(D);		Record.AddDeclRef(D);
for (auto N : C->all_num_lists())		for (auto N : C->all_num_lists())
Record.push_back(N);		Record.push_back(N);
for (auto N : C->all_lists_sizes())		for (auto N : C->all_lists_sizes())
Record.push_back(N);		Record.push_back(N);
for (auto &M : C->all_components()) {		for (auto &M : C->all_components()) {
Record.AddStmt(M.getAssociatedExpression());		Record.AddStmt(M.getAssociatedExpression());
		Record.writeBool(M.isNonContiguous());
		ABataevUnsubmitted Done Reply Inline Actions Same, use `writeBool` ABataev: Same, use `writeBool`
Record.AddDeclRef(M.getAssociatedDeclaration());		Record.AddDeclRef(M.getAssociatedDeclaration());
}		}
}		}

void OMPClauseWriter::VisitOMPUseDevicePtrClause(OMPUseDevicePtrClause *C) {		void OMPClauseWriter::VisitOMPUseDevicePtrClause(OMPUseDevicePtrClause *C) {
Record.push_back(C->varlist_size());		Record.push_back(C->varlist_size());
Record.push_back(C->getUniqueDeclarationsNum());		Record.push_back(C->getUniqueDeclarationsNum());
Record.push_back(C->getTotalComponentListNum());		Record.push_back(C->getTotalComponentListNum());
▲ Show 20 Lines • Show All 144 Lines • Show Last 20 Lines

clang/test/OpenMP/target_update_ast_print.cpp

	// RUN: %clang_cc1 -verify -fopenmp -fopenmp-version=50 -ast-print %s \| FileCheck %s			// RUN: %clang_cc1 -verify -fopenmp -fopenmp-version=50 -ast-print %s \| FileCheck %s
	// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -x c++ -std=c++11 -emit-pch -o %t %s			// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -x c++ -std=c++11 -emit-pch -o %t %s
	// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -std=c++11 -include-pch %t -fsyntax-only -verify %s -ast-print \| FileCheck %s			// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -std=c++11 -include-pch %t -fsyntax-only -verify %s -ast-print \| FileCheck %s

	// RUN: %clang_cc1 -verify -fopenmp-simd -fopenmp-version=50 -ast-print %s \| FileCheck %s			// RUN: %clang_cc1 -verify -fopenmp-simd -fopenmp-version=50 -ast-print %s \| FileCheck %s
	// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -x c++ -std=c++11 -emit-pch -o %t %s			// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -x c++ -std=c++11 -emit-pch -o %t %s
	// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -std=c++11 -include-pch %t -fsyntax-only -verify %s -ast-print \| FileCheck %s			// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -std=c++11 -include-pch %t -fsyntax-only -verify %s -ast-print \| FileCheck %s

				// RUN: %clang_cc1 -DOMP5 -verify -fopenmp -fopenmp-version=50 -ast-print %s \| FileCheck %s --check-prefix=OMP5
				// RUN: %clang_cc1 -DOMP5 -fopenmp -fopenmp-version=50 -x c++ -std=c++11 -emit-pch -o %t %s
				// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -std=c++11 -include-pch %t -fsyntax-only -verify %s -ast-print \| FileCheck %s --check-prefix=OMP5

				// RUN: %clang_cc1 -DOMP5 -verify -fopenmp-simd -fopenmp-version=50 -ast-print %s \| FileCheck %s --check-prefix=OMP5
				// RUN: %clang_cc1 -DOMP5 -fopenmp-simd -fopenmp-version=50 -x c++ -std=c++11 -emit-pch -o %t %s
				// RUN: %clang_cc1 -DOMP5 -fopenmp-simd -fopenmp-version=50 -std=c++11 -include-pch %t -fsyntax-only -verify %s -ast-print \| FileCheck %s --check-prefix=OMP5
	// expected-no-diagnostics			// expected-no-diagnostics

	#ifndef HEADER			#ifndef HEADER
	#define HEADER			#define HEADER

	void foo() {}			void foo() {}

	template <class T, class U>			template <class T, class U>
	T foo(T targ, U uarg) {			T foo(T targ, U uarg) {
	static T a, *p;			static T a, *p;
	U b;			U b;
	int l;			int l;
	#pragma omp target update to(([a][targ])p, a) if(l>5) device(l) nowait depend(inout:l)			#pragma omp target update to(([a][targ])p, a) if(l>5) device(l) nowait depend(inout:l)

	#pragma omp target update from(b, ([a][targ])p) if(l<5) device(l-1) nowait depend(inout:l)			#pragma omp target update from(b, ([a][targ])p) if(l<5) device(l-1) nowait depend(inout:l)

				#ifdef OMP5
				U marr[10][10][10];
				#pragma omp target update to(marr[2] [0:2] [0:2])

				#pragma omp target update from(marr[2] [0:2] [0:2])

				#pragma omp target update to(marr[:] [0:2] [0:2])

				#pragma omp target update from(marr[:] [0:2] [0:2])

				#pragma omp target update to(marr[:][:l] [l:])

				#pragma omp target update from(marr[:][:l] [l:])

				#pragma omp target update to(marr[:2][:1][:])

				#pragma omp target update from(marr[:2][:1][:])

				#pragma omp target update to(marr[:2][:][:1])

				#pragma omp target update from(marr[:2][:][:1])

				#pragma omp target update to(marr[:2][:] [1:])

				#pragma omp target update from(marr[:2][:] [1:])

				#pragma omp target update to(marr[:1] [3:2][:2])

				#pragma omp target update from(marr[:1] [3:2][:2])

				#pragma omp target update to(marr[:1][:2][0])

				#pragma omp target update from(marr[:1][:2][0])

				// OMP5: marr[10][10][10];
				// OMP5-NEXT: #pragma omp target update to(marr[2][0:2][0:2])
				// OMP5-NEXT: #pragma omp target update from(marr[2][0:2][0:2])
				// OMP5-NEXT: #pragma omp target update to(marr[:][0:2][0:2])
				// OMP5-NEXT: #pragma omp target update from(marr[:][0:2][0:2])
				// OMP5-NEXT: #pragma omp target update to(marr[:][:l][l:])
				// OMP5-NEXT: #pragma omp target update from(marr[:][:l][l:])
				// OMP5-NEXT: #pragma omp target update to(marr[:2][:1][:])
				// OMP5-NEXT: #pragma omp target update from(marr[:2][:1][:])
				// OMP5-NEXT: #pragma omp target update to(marr[:2][:][:1])
				// OMP5-NEXT: #pragma omp target update from(marr[:2][:][:1])
				// OMP5-NEXT: #pragma omp target update to(marr[:2][:][1:])
				// OMP5-NEXT: #pragma omp target update from(marr[:2][:][1:])
				// OMP5-NEXT: #pragma omp target update to(marr[:1][3:2][:2])
				// OMP5-NEXT: #pragma omp target update from(marr[:1][3:2][:2])
				// OMP5-NEXT: #pragma omp target update to(marr[:1][:2][0])
				// OMP5-NEXT: #pragma omp target update from(marr[:1][:2][0])
				#endif

	return a + targ + (T)b;			return a + targ + (T)b;
	}			}
	// CHECK: static T a, *p;			// CHECK: static T a, *p;
	// CHECK-NEXT: U b;			// CHECK-NEXT: U b;
	// CHECK-NEXT: int l;
	// CHECK-NEXT: #pragma omp target update to(([a][targ])p,a) if(l > 5) device(l) nowait depend(inout : l){{$}}
	// CHECK-NEXT: #pragma omp target update from(b,([a][targ])p) if(l < 5) device(l - 1) nowait depend(inout : l)
	// CHECK: static int a, *p;
	// CHECK-NEXT: float b;
	// CHECK-NEXT: int l;
	// CHECK-NEXT: #pragma omp target update to(([a][targ])p,a) if(l > 5) device(l) nowait depend(inout : l)
	// CHECK-NEXT: #pragma omp target update from(b,([a][targ])p) if(l < 5) device(l - 1) nowait depend(inout : l)
	// CHECK: static char a, *p;
	// CHECK-NEXT: float b;
	// CHECK-NEXT: int l;
	// CHECK-NEXT: #pragma omp target update to(([a][targ])p,a) if(l > 5) device(l) nowait depend(inout : l)
	// CHECK-NEXT: #pragma omp target update from(b,([a][targ])p) if(l < 5) device(l - 1) nowait depend(inout : l)

	int main(int argc, char **argv) {			int main(int argc, char **argv) {
	static int a;			static int a;
	int n;			int n;
	float f;			float f;

	// CHECK: static int a;			// CHECK: static int a;
	// CHECK-NEXT: int n;			// CHECK-NEXT: int n;
	// CHECK-NEXT: float f;			// CHECK-NEXT: float f;
	#pragma omp target update to(a) if(f>0.0) device(n) nowait depend(in:n)			#pragma omp target update to(a) if(f>0.0) device(n) nowait depend(in:n)
	// CHECK-NEXT: #pragma omp target update to(a) if(f > 0.) device(n) nowait depend(in : n)			// CHECK-NEXT: #pragma omp target update to(a) if(f > 0.) device(n) nowait depend(in : n)
	#pragma omp target update from(f) if(f<0.0) device(n+1) nowait depend(in:n)			#pragma omp target update from(f) if(f<0.0) device(n+1) nowait depend(in:n)
	// CHECK-NEXT: #pragma omp target update from(f) if(f < 0.) device(n + 1) nowait depend(in : n)			// CHECK-NEXT: #pragma omp target update from(f) if(f < 0.) device(n + 1) nowait depend(in : n)

				#ifdef OMP5
				float marr[10][10][10];
				// OMP5: marr[10][10][10];
				#pragma omp target update to(marr[2] [0:2] [0:2])
				// OMP5-NEXT: #pragma omp target update to(marr[2][0:2][0:2])
				#pragma omp target update from(marr[2] [0:2] [0:2])
				// OMP5-NEXT: #pragma omp target update from(marr[2][0:2][0:2])
				#pragma omp target update to(marr[:] [0:2] [0:2])
				// OMP5-NEXT: #pragma omp target update to(marr[:][0:2][0:2])
				#pragma omp target update from(marr[:] [0:2] [0:2])
				// OMP5-NEXT: #pragma omp target update from(marr[:][0:2][0:2])
				#pragma omp target update to(marr[:][:n] [n:])
				// OMP5: #pragma omp target update to(marr[:][:n][n:])
				#pragma omp target update from(marr[:2][:1][:])
				// OMP5-NEXT: #pragma omp target update from(marr[:2][:1][:])
				#pragma omp target update to(marr[:2][:][:1])
				// OMP5-NEXT: #pragma omp target update to(marr[:2][:][:1])
				#pragma omp target update from(marr[:2][:][:1])
				// OMP5-NEXT: #pragma omp target update from(marr[:2][:][:1])
				#pragma omp target update to(marr[:2][:] [1:])
				// OMP5-NEXT: #pragma omp target update to(marr[:2][:][1:])
				#pragma omp target update from(marr[:2][:] [1:])
				// OMP5-NEXT: #pragma omp target update from(marr[:2][:][1:])
				#pragma omp target update to(marr[:1] [3:2][:2])
				// OMP5-NEXT: #pragma omp target update to(marr[:1][3:2][:2])
				#pragma omp target update from(marr[:1] [3:2][:2])
				// OMP5-NEXT: #pragma omp target update from(marr[:1][3:2][:2])
				#pragma omp target update to(marr[:1][:2][0])
				// OMP5-NEXT: #pragma omp target update to(marr[:1][:2][0])
				#pragma omp target update from(marr[:1][:2][0])
				// OMP5-NEXT: #pragma omp target update from(marr[:1][:2][0])
				#endif

	return foo(argc, f) + foo(argv[0][0], f) + a;			return foo(argc, f) + foo(argv[0][0], f) + a;
	}			}

	#endif			#endif

clang/test/OpenMP/target_update_codegen.cpp

Show First 20 Lines • Show All 1,054 Lines • ▼ Show 20 Lines	void array_shaping(float *f, int sa) {
// CK18-64-DAG: [[SZ1]] = mul nuw i64 4, %{{.+}}		// CK18-64-DAG: [[SZ1]] = mul nuw i64 4, %{{.+}}
// CK18-32-DAG: [[SIZE]] = sext i32 [[SZ1:%.+]] to i64		// CK18-32-DAG: [[SIZE]] = sext i32 [[SZ1:%.+]] to i64
// CK18-32-DAG: [[SZ1]] = mul nuw i32 [[SZ2:%.+]], 5		// CK18-32-DAG: [[SZ1]] = mul nuw i32 [[SZ2:%.+]], 5
// CK18-32-DAG: [[SZ2]] = mul nuw i32 4, %{{.+}}		// CK18-32-DAG: [[SZ2]] = mul nuw i32 4, %{{.+}}
#pragma omp target update from(([sa][5])f)		#pragma omp target update from(([sa][5])f)
}		}

#endif		#endif

		///==========================================================================///
		// RUN: %clang_cc1 -DCK19 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK19 --check-prefix CK19-64
		// RUN: %clang_cc1 -DCK19 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK19 --check-prefix CK19-64
		// RUN: %clang_cc1 -DCK19 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK19 --check-prefix CK19-32
		// RUN: %clang_cc1 -DCK19 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK19 --check-prefix CK19-32

		// RUN: %clang_cc1 -DCK19 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK19 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK19 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK19 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// SIMD-ONLY19-NOT: {{__kmpc\|__tgt}}
		#ifdef CK19

		// CK19: [[STRUCT_DESCRIPTOR:%.+]] = type { i64, i64, i64 }

		// CK19: [[MSIZE:@.+]] = {{.+}}constant [1 x i64] [i64 3]
		// CK19: [[MTYPE:@.+]] = {{.+}}constant [1 x i64] [i64 17592186044449]

		// CK19-LABEL: _Z3foo
		void foo(int arg) {
		int arr[3][4][5];

		// CK19: [[DIMS:%.+]] = alloca [3 x [[STRUCT_DESCRIPTOR]]],
		// CK19: [[ARRAY_IDX:%.+]] = getelementptr inbounds [3 x [4 x [5 x i32]]], [3 x [4 x [5 x i32]]]* [[ARR:%.+]], {{.+}} 0, {{.+}} 0
		// CK19: [[ARRAY_DECAY:%.+]] = getelementptr inbounds [4 x [5 x i32]], [4 x [5 x i32]]* [[ARRAY_IDX]], {{.+}} 0, {{.+}} 0
		// CK19: [[ARRAY_IDX_1:%.+]] = getelementptr inbounds [5 x i32], [5 x i32]* [[ARRAY_DECAY]], {{.+}}
		// CK19: [[ARRAY_DECAY_2:%.+]] = getelementptr inbounds [5 x i32], [5 x i32]* [[ARRAY_IDX_1]], {{.+}} 0, {{.+}} 0
		// CK19: [[ARRAY_IDX_3:%.+]] = getelementptr inbounds {{.+}}, {{.+}}* [[ARRAY_DECAY_2]], {{.+}} 1
		// CK19: [[LEN:%.+]] = sub nuw i64 4, [[ARG_ADDR:%.+]]
		// CK19: [[BP0:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* [[BP:%.+]], i{{.+}} 0, i{{.+}} 0
		// CK19: [[P0:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* [[P:%.+]], i{{.+}} 0, i{{.+}} 0
		// CK19: [[DIM_1:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 0
		// CK19: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 0
		// CK19: store i64 0, i64* [[OFFSET]],
		// CK19: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 1
		// CK19: store i64 2, i64* [[COUNT]],
		// CK19: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 2
		// CK19: store i64 80, i64* [[STRIDE]],
		// CK19: [[DIM_2:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 1
		// CK19: [[OFFSET_2:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 0
		// CK19: store i64 [[ARG:%.+]], i64* [[OFFSET_2]],
		// CK19: [[COUNT_2:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 1
		// CK19: store i64 [[LEN]], i64* [[COUNT_2]],
		// CK19: [[STRIDE_2:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 2
		// CK19: store i64 20, i64* [[STRIDE_2]],
		// CK19: [[DIM_3:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 2
		// CK19: [[OFFSET_3:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 0
		// CK19: store i64 1, i64* [[OFFSET_3]],
		// CK19: [[COUNT_3:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 1
		// CK19: store i64 4, i64* [[COUNT_3]],
		// CK19: [[STRIDE_3:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 2
		// CK19: store i64 4, i64* [[STRIDE_3]],

		// CK19-DAG: call void @__tgt_target_data_update(i64 -1, i32 1, i8 [[GEPBP:%.+]], i8 [[GEPP:%.+]], {{.+}}getelementptr {{.+}}[1 x i{{.+}}]* [[MSIZE]], {{.+}}getelementptr {{.+}}[1 x i{{.+}}]* [[MTYPE]]{{.+}})
		// CK19-DAG: [[GEPBP]] = getelementptr inbounds {{.+}}[[BP]]
		// CK19-DAG: [[GEPP]] = getelementptr inbounds {{.+}}[[P:%[^,]+]]
		// CK19-DAG: [[PC0:%.+]] = bitcast [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]] to i8*
		// CK19-DAG: [[PTRS:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* %.offload_ptrs, i32 0, i32 0
		// CK19-DAG: store i8* [[PC0]], i8** [[PTRS]],

		#pragma omp target update to(arr [0:2] [arg:] [1:4])
		{ ++arg; }
		}

		#endif
		///==========================================================================///
		// RUN: %clang_cc1 -DCK20 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK20 --check-prefix CK20-64
		// RUN: %clang_cc1 -DCK20 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK20 --check-prefix CK20-64
		// RUN: %clang_cc1 -DCK20 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK20 --check-prefix CK20-32
		// RUN: %clang_cc1 -DCK20 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK20 --check-prefix CK20-32

		// RUN: %clang_cc1 -DCK20 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK20 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK20 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK20 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// SIMD-ONLY19-NOT: {{__kmpc\|__tgt}}
		#ifdef CK20

		struct ST {
		int a;
		double *b;
		};

		// CK20: [[STRUCT_ST:%.+]] = type { i32, double* }
		// CK20: [[STRUCT_DESCRIPTOR:%.+]] = type { i64, i64, i64 }

		// CK20: [[MSIZE:@.+]] = {{.+}}constant [1 x i64] [i64 2]
		// CK20: [[MTYPE:@.+]] = {{.+}}constant [1 x i64] [i64 17592186044449]

		// CK20-LABEL: _Z3foo
		void foo(int arg) {
		ST arr[3][4];
		// CK20: [[DIMS:%.+]] = alloca [2 x [[STRUCT_DESCRIPTOR]]],
		// CK20: [[ARRAY_IDX:%.+]] = getelementptr inbounds [3 x [4 x [[STRUCT_ST]]]], [3 x [4 x [[STRUCT_ST]]]]* [[ARR:%.+]], {{.+}} 0, {{.+}} 0
		// CK20: [[ARRAY_DECAY:%.+]] = getelementptr inbounds [4 x [[STRUCT_ST]]], [4 x [[STRUCT_ST]]]* [[ARRAY_IDX]], {{.+}} 0, {{.+}} 0
		// CK20: [[ARRAY_IDX_1:%.+]] = getelementptr inbounds [[STRUCT_ST]], [[STRUCT_ST]]* [[ARRAY_DECAY]], {{.+}}
		// CK20: [[BP0:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* [[BP:%.+]], {{.+}} 0, {{.+}} 0
		// CK20: [[BPC:%.+]] = bitcast i8 [[BP0]] to [3 x [4 x [[STRUCT_ST]]]]
		// CK20: store [3 x [4 x [[STRUCT_ST]]]]* [[ARR]], [3 x [4 x [[STRUCT_ST]]]]** [[BPC]],
		// CK20: [[P0:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* [[P:%.+]], {{.+}} 0, {{.+}} 0
		// CK20: [[PC:%.+]] = bitcast i8 [[P0]] to [[STRUCT_ST]]
		// CK20: store [[STRUCT_ST]]* [[ARRAY_IDX_1]], [[STRUCT_ST]]** [[PC]],
		// CK20: [[DIM_1:%.+]] = getelementptr inbounds [2 x [[STRUCT_DESCRIPTOR]]], [2 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 0
		// CK20: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 0
		// CK20: store i64 0, i64* [[OFFSET]],
		// CK20: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 1
		// CK20: store i64 2, i64* [[COUNT]],
		// CK20: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 2
		// CK20: store i64 {{32\|64}}, i64* [[STRIDE]],
		// CK20: [[DIM_2:%.+]] = getelementptr inbounds [2 x [[STRUCT_DESCRIPTOR]]], [2 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 1
		// CK20: [[OFFSET_2:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 0
		// CK20: store i64 1, i64* [[OFFSET_2]],
		// CK20: [[COUNT_2:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 1
		// CK20: store i64 4, i64* [[COUNT_2]],
		// CK20: [[STRIDE_2:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 2
		// CK20: store i64 {{8\|16}}, i64* [[STRIDE_2]],
		// CK20-DAG: call void @__tgt_target_data_update(i64 -1, i32 1, i8 [[GEPBP:%.+]], i8 [[GEPP:%.+]], {{.+}}getelementptr {{.+}}[1 x i{{.+}}]* [[MSIZE]], {{.+}}getelementptr {{.+}}[1 x i{{.+}}]* [[MTYPE]]{{.+}})
		// CK20-DAG: [[GEPBP]] = getelementptr inbounds {{.+}}[[BP]]
		// CK20-DAG: [[GEPP]] = getelementptr inbounds {{.+}}[[P:%[^,]+]]
		// CK20-DAG: [[PC0:%.+]] = bitcast [2 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]] to i8*
		// CK20-DAG: [[PTRS:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* %.offload_ptrs, i32 0, i32 0
		// CK20-DAG: store i8* [[PC0]], i8** [[PTRS]],

		#pragma omp target update to(arr [0:2] [1:4])
		{ ++arg; }
		}

		#endif
		///==========================================================================///
		// RUN: %clang_cc1 -DCK21 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK21 --check-prefix CK21-64
		// RUN: %clang_cc1 -DCK21 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK21 --check-prefix CK21-64
		// RUN: %clang_cc1 -DCK21 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK21 --check-prefix CK21-32
		// RUN: %clang_cc1 -DCK21 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK21 --check-prefix CK21-32

		// RUN: %clang_cc1 -DCK21 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK21 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK21 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK21 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// SIMD-ONLY19-NOT: {{__kmpc\|__tgt}}
		#ifdef CK21

		// CK21: [[STRUCT_ST:%.+]] = type { [10 x [10 x [10 x double*]]] }
		// CK21: [[STRUCT_DESCRIPTOR:%.+]] = type { i64, i64, i64 }

		// CK21: [[MTYPE:@.+]] = {{.+}}constant [2 x i64] [i64 32, i64 299067162755073]

		struct ST {
		double *dptr[10][10][10];

		// CK21: _ZN2ST3fooEv
		void foo() {
		// CK21: [[DIMS:%.+]] = alloca [3 x [[STRUCT_DESCRIPTOR]]],
		// CK21: [[ARRAY_IDX:%.+]] = getelementptr inbounds [10 x [10 x [10 x double]]], [10 x [10 x [10 x double]]]* [[DPTR:%.+]], {{.+}} 0, {{.+}} 0
		// CK21: [[ARRAY_DECAY:%.+]] = getelementptr inbounds [10 x [10 x double]], [10 x [10 x double]]* [[ARRAY_IDX]], {{.+}} 0, {{.+}} 0
		// CK21: [[ARRAY_IDX_1:%.+]] = getelementptr inbounds [10 x double], [10 x double]* [[ARRAY_DECAY]], {{.+}} 1
		// CK21: [[ARRAY_DECAY_2:%.+]] = getelementptr inbounds [10 x double], [10 x double]* [[ARRAY_IDX_1]], {{.+}} 0, {{.+}} 0
		// CK21: [[ARRAY_IDX_3:%.+]] = getelementptr inbounds {{.+}}, {{.+}}* [[ARRAY_DECAY_2]], {{.+}} 0
		// CK21: [[BP0:%.+]] = getelementptr inbounds [2 x i8], [2 x i8]* [[BP:%.+]], {{.+}} 0, {{.+}} 0
		// CK21: [[P0:%.+]] = getelementptr inbounds [2 x i8], [2 x i8]* [[P:%.+]], i{{.+}} 0, i{{.+}} 0
		// CK21: [[DIM_1:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 0
		// CK21: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 0
		// CK21: store i64 0, i64* [[OFFSET]],
		// CK21: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 1
		// CK21: store i64 2, i64* [[COUNT]],
		// CK21: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 2
		// CK21: store i64 {{400\|800}}, i64* [[STRIDE]],
		// CK21: [[DIM_2:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 1
		// CK21: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 0
		// CK21: store i64 1, i64* [[OFFSET]],
		// CK21: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 1
		// CK21: store i64 3, i64* [[COUNT]],
		// CK21: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 2
		// CK21: store i64 {{40\|80}}, i64* [[STRIDE]],
		// CK21: [[DIM_3:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 2
		// CK21: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 0
		// CK21: store i64 0, i64* [[OFFSET]],
		// CK21: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 1
		// CK21: store i64 4, i64* [[COUNT]],
		// CK21: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 2
		// CK21: store i64 {{4\|8}}, i64* [[STRIDE]],
		// CK21-DAG: call void @__tgt_target_data_update(i64 -1, i32 2, i8 [[GEPBP:%.+]], i8 [[GEPP:%.+]], i{{.+}}* [[GEPSZ:%.+]], {{.+}}getelementptr {{.+}}[2 x i{{.+}}]* [[MTYPE]]{{.+}})
		// CK21-DAG: [[GEPBP]] = getelementptr inbounds {{.+}}[[BP]]
		// CK21-DAG: [[GEPP]] = getelementptr inbounds {{.+}}[[P:%[^,]+]]
		// CK21-DAG: [[PC0:%.+]] = bitcast [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]] to i8*
		// CK21-DAG: [[PTRS:%.+]] = getelementptr inbounds [2 x i8], [2 x i8]* %.offload_ptrs, i32 0, i32 0
		// CK21-DAG: store i8* [[PC0]], i8** [[PTRS]],
		#pragma omp target update to(dptr [0:2] [1:3] [0:4])
		}
		};

		void bar() {
		ST st;
		st.foo();
		}

		#endif
		///==========================================================================///
		// RUN: %clang_cc1 -DCK22 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK22 --check-prefix CK22-64
		// RUN: %clang_cc1 -DCK22 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK22 --check-prefix CK22-64
		// RUN: %clang_cc1 -DCK22 -verify -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck %s --check-prefix CK22 --check-prefix CK22-32
		// RUN: %clang_cc1 -DCK22 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck %s --check-prefix CK22 --check-prefix CK22-32

		// RUN: %clang_cc1 -DCK22 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK22 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -std=c++11 -triple powerpc64le-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=powerpc64le-ibm-linux-gnu -x c++ -triple powerpc64le-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK22 -verify -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -emit-llvm %s -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// RUN: %clang_cc1 -DCK22 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -std=c++11 -triple i386-unknown-unknown -emit-pch -o %t %s
		// RUN: %clang_cc1 -fopenmp-simd -fopenmp-version=50 -fopenmp-targets=i386-pc-linux-gnu -x c++ -triple i386-unknown-unknown -std=c++11 -include-pch %t -verify %s -emit-llvm -o - \| FileCheck --check-prefix SIMD-ONLY19 %s
		// SIMD-ONLY19-NOT: {{__kmpc\|__tgt}}
		#ifdef CK22

		// CK22: [[STRUCT_DESCRIPTOR:%.+]] = type { i64, i64, i64 }

		// CK22: [[MSIZE:@.+]] = {{.+}}constant [1 x i64] [i64 3]
		// CK22: [[MTYPE:@.+]] = {{.+}}constant [1 x i64] [i64 17592186044449]

		struct ST {
		// CK22: _ZN2ST3fooEPA10_Pi
		void foo(int *arr[5][10]) {
		// CK22: [[DIMS:%.+]] = alloca [3 x [[STRUCT_DESCRIPTOR]]],
		// CK22: [[ARRAY_IDX:%.+]] = getelementptr inbounds [10 x i32], [10 x i32]* [[ARR:%.+]], {{.+}} 0
		// CK22: [[ARRAY_DECAY:%.+]] = getelementptr inbounds [10 x i32], [10 x i32]* [[ARRAY_IDX]], {{.+}} 0, {{.+}} 0
		// CK22: [[ARRAY_IDX_2:%.+]] = getelementptr inbounds i32, i32* [[ARRAY_DECAY:%.+]], {{.+}} 1
		// CK22: [[BP0:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* [[BP:%.+]], {{.+}} 0, {{.+}} 0
		// CK22: [[P0:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* [[P:%.+]], i{{.+}} 0, i{{.+}} 0
		// CK22: [[DIM_1:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 0
		// CK22: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 0
		// CK22: store i64 0, i64* [[OFFSET]],
		// CK22: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 1
		// CK22: store i64 2, i64* [[COUNT]],
		// CK22: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_1]], {{.+}} 0, {{.+}} 2
		// CK22: store i64 {{200\|400}}, i64* [[STRIDE]],
		// CK22: [[DIM_2:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 1
		// CK22: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 0
		// CK22: store i64 1, i64* [[OFFSET]],
		// CK22: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 1
		// CK22: store i64 3, i64* [[COUNT]],
		// CK22: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_2]], {{.+}} 0, {{.+}} 2
		// CK22: store i64 {{40\|80}}, i64* [[STRIDE]],
		// CK22: [[DIM_3:%.+]] = getelementptr inbounds [3 x [[STRUCT_DESCRIPTOR]]], [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]], {{.+}} 0, {{.+}} 2
		// CK22: [[OFFSET:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 0
		// CK22: store i64 0, i64* [[OFFSET]],
		// CK22: [[COUNT:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 1
		// CK22: store i64 4, i64* [[COUNT]],
		// CK22: [[STRIDE:%.+]] = getelementptr inbounds [[STRUCT_DESCRIPTOR]], [[STRUCT_DESCRIPTOR]]* [[DIM_3]], {{.+}} 0, {{.+}} 2
		// CK22: store i64 {{4\|8}}, i64* [[STRIDE]],
		// CK22-DAG: call void @__tgt_target_data_update(i64 -1, i32 1, i8 [[GEPBP:%.+]], i8 [[GEPP:%.+]], {{.+}}getelementptr {{.+}}[1 x i{{.+}}]* [[MSIZE]], {{.+}}getelementptr {{.+}}[1 x i{{.+}}]* [[MTYPE]]{{.+}})
		// CK22-DAG: [[GEPBP]] = getelementptr inbounds {{.+}}[[BP]]
		// CK22-DAG: [[GEPP]] = getelementptr inbounds {{.+}}[[P:%[^,]+]]
		// CK22-DAG: [[PC0:%.+]] = bitcast [3 x [[STRUCT_DESCRIPTOR]]]* [[DIMS]] to i8*
		// CK22-DAG: [[PTRS:%.+]] = getelementptr inbounds [1 x i8], [1 x i8]* %.offload_ptrs, i32 0, i32 0
		// CK22-DAG: store i8* [[PC0]], i8** [[PTRS]],
		#pragma omp target update to(arr [0:2] [1:3] [0:4])
		}
		};

		void bar() {
		ST st;
		int *arr[5][10];
		st.foo(arr);
		}

		#endif
#endif		#endif

clang/test/OpenMP/target_update_messages.cpp

// RUN: %clang_cc1 -verify -fopenmp -ferror-limit 100 %s -Wuninitialized		// RUN: %clang_cc1 -verify=expected,le45 -fopenmp -ferror-limit 100 %s -Wuninitialized
		// RUN: %clang_cc1 -verify=expected,le50 -fopenmp -fopenmp-version=50 -ferror-limit 100 %s -Wuninitialized

// RUN: %clang_cc1 -verify -fopenmp-simd -ferror-limit 100 %s -Wuninitialized		// RUN: %clang_cc1 -verify=expected,le45 -fopenmp-simd -ferror-limit 100 %s -Wuninitialized
		// RUN: %clang_cc1 -verify=expected,le50 -fopenmp-simd -fopenmp-version=50 -ferror-limit 100 %s -Wuninitialized

void xxx(int argc) {		void xxx(int argc) {
int x; // expected-note {{initialize the variable 'x' to silence this warning}}		int x; // expected-note {{initialize the variable 'x' to silence this warning}}
#pragma omp target update to(x)		#pragma omp target update to(x)
argc = x; // expected-warning {{variable 'x' is uninitialized when used here}}		argc = x; // expected-warning {{variable 'x' is uninitialized when used here}}
}		}

void foo() {		void foo() {
Show All 19 Lines	int main(int argc, char **argv) {
#pragma omp target update to(m) [ // expected-warning {{extra tokens at the end of '#pragma omp target update' are ignored}}		#pragma omp target update to(m) [ // expected-warning {{extra tokens at the end of '#pragma omp target update' are ignored}}
#pragma omp target update to(m) ] // expected-warning {{extra tokens at the end of '#pragma omp target update' are ignored}}		#pragma omp target update to(m) ] // expected-warning {{extra tokens at the end of '#pragma omp target update' are ignored}}
#pragma omp target update to(m) ) // expected-warning {{extra tokens at the end of '#pragma omp target update' are ignored}}		#pragma omp target update to(m) ) // expected-warning {{extra tokens at the end of '#pragma omp target update' are ignored}}

#pragma omp target update from(m) allocate(m) // expected-error {{unexpected OpenMP clause 'allocate' in directive '#pragma omp target update'}}		#pragma omp target update from(m) allocate(m) // expected-error {{unexpected OpenMP clause 'allocate' in directive '#pragma omp target update'}}
{		{
foo();		foo();
}		}

		double marr[10][5][10];
		#pragma omp target update to(marr [0:] [2:4] [1:2]) // le45-error {{array section does not specify contiguous storage}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
		{}
		#pragma omp target update from(marr [0:] [2:4] [1:2]) // le45-error {{array section does not specify contiguous storage}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}

		int arr[4][3][2][1];
		#pragma omp target update to(arr [0:2] [2:4][:2][1]) // le45-error {{array section does not specify contiguous storage}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
		{}
		#pragma omp target update from(arr [0:2] [2:4][:2][1]) // le45-error {{array section does not specify contiguous storage}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}

		double ***dptr;
		#pragma omp target update to(dptr [0:2] [2:4] [1:2]) // le45-error {{array section does not specify contiguous storage}} le50-error 2 {{section length is unspecified and cannot be inferred because subscripted value is an array of unknown bound}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
		{}
		#pragma omp target update from(dptr [0:2] [2:4] [1:2]) // le45-error {{array section does not specify contiguous storage}} le50-error 2 {{section length is unspecified and cannot be inferred because subscripted value is an array of unknown bound}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}

return tmain(argc, argv);		return tmain(argc, argv);
}		}

clang/test/OpenMP/target_update_to_messages.cpp

Show First 20 Lines • Show All 73 Lines • ▼ Show 20 Lines	#pragma omp target update to(*this) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
{}		{}
#pragma omp target update to(*(this->ptr)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}		#pragma omp target update to(*(this->ptr)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
#pragma omp target update to(*(this->S->i+this->S->p)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}		#pragma omp target update to(*(this->S->i+this->S->p)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
#pragma omp target update to(*(this->S->i+this->S->s6[0].pp)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}		#pragma omp target update to(*(this->S->i+this->S->s6[0].pp)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
#pragma omp target update to(*(a+this->ptr)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}		#pragma omp target update to(*(a+this->ptr)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
#pragma omp target update to(((this->ptr)+a+this->ptr)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}		#pragma omp target update to(((this->ptr)+a+this->ptr)) // le45-error {{expected expression containing only member accesses and/or array sections based on named variables}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
#pragma omp target update to((this+this)) // expected-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}} expected-error {{invalid operands to binary expression ('S8 ' and 'S8 *')}}		#pragma omp target update to((this+this)) // expected-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}} expected-error {{invalid operands to binary expression ('S8 ' and 'S8 *')}}
{}		{}

		double marr[10][5][10];
		#pragma omp target update to(marr [0:] [2:4] [1:2]) // le45-error {{array section does not specify contiguous storage}} le45-error {{expected at least one 'to' clause or 'from' clause specified to '#pragma omp target update'}}
		{}
}		}
};		};

S3 h;		S3 h;
#pragma omp threadprivate(h) // expected-note 2 {{defined as threadprivate or thread local}}		#pragma omp threadprivate(h) // expected-note 2 {{defined as threadprivate or thread local}}

typedef int from;		typedef int from;

▲ Show 20 Lines • Show All 45 Lines • ▼ Show 20 Lines
#pragma omp target update to(x, s7.s6[:5].aa[:6]) // expected-error {{OpenMP array section is not allowed here}}		#pragma omp target update to(x, s7.s6[:5].aa[:6]) // expected-error {{OpenMP array section is not allowed here}}
#pragma omp target update to(s7.p[:10])		#pragma omp target update to(s7.p[:10])
#pragma omp target update to(x, s7.bfa) // expected-error {{bit fields cannot be used to specify storage in a 'to' clause}}		#pragma omp target update to(x, s7.bfa) // expected-error {{bit fields cannot be used to specify storage in a 'to' clause}}
#pragma omp target update to(x, s7.p[:]) // expected-error {{section length is unspecified and cannot be inferred because subscripted value is not an array}}		#pragma omp target update to(x, s7.p[:]) // expected-error {{section length is unspecified and cannot be inferred because subscripted value is not an array}}
#pragma omp target data map(to: s7.i)		#pragma omp target data map(to: s7.i)
{		{
#pragma omp target update to(s7.x)		#pragma omp target update to(s7.x)
}		}
return 0;		return 0;
		ABataevUnsubmitted Done Reply Inline Actions Delete this extra line ABataev: Delete this extra line
}		}

int main(int argc, char **argv) {		int main(int argc, char **argv) {
const int d = 5;		const int d = 5;
const int da[5] = { 0 };		const int da[5] = { 0 };
S4 e(4);		S4 e(4);
S5 g(5);		S5 g(5);
int i, t[20];		int i, t[20];
▲ Show 20 Lines • Show All 62 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP5.0] map item can be non-contiguous for target updateAbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 269961

clang/include/clang/AST/OpenMPClause.h

clang/lib/CodeGen/CGOpenMPRuntime.h

clang/lib/CodeGen/CGOpenMPRuntime.cpp

clang/lib/Sema/SemaOpenMP.cpp

clang/lib/Serialization/ASTReader.cpp

clang/lib/Serialization/ASTWriter.cpp

clang/test/OpenMP/target_update_ast_print.cpp

clang/test/OpenMP/target_update_codegen.cpp

clang/test/OpenMP/target_update_messages.cpp

clang/test/OpenMP/target_update_to_messages.cpp

[OpenMP5.0] map item can be non-contiguous for target update
AbandonedPublic