This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/test/OpenMP/
-
test/
-
OpenMP/
-
cancel_codegen.cpp
-
cancellation_point_codegen.cpp
-
debug-info-complex-byval.cpp
-
debug-info-openmp-array.cpp
-
declare_target_codegen_globalization.cpp
-
distribute_codegen.cpp
-
distribute_firstprivate_codegen.cpp
-
distribute_lastprivate_codegen.cpp
-
distribute_parallel_for_codegen.cpp
-
distribute_parallel_for_firstprivate_codegen.cpp
-
distribute_parallel_for_if_codegen.cpp
-
distribute_parallel_for_lastprivate_codegen.cpp
-
distribute_parallel_for_num_threads_codegen.cpp
-
distribute_parallel_for_private_codegen.cpp
-
distribute_parallel_for_proc_bind_codegen.cpp
-
distribute_parallel_for_reduction_task_codegen.cpp
-
distribute_parallel_for_simd_codegen.cpp
-
distribute_parallel_for_simd_firstprivate_codegen.cpp
-
distribute_parallel_for_simd_if_codegen.cpp
-
distribute_parallel_for_simd_lastprivate_codegen.cpp
-
distribute_parallel_for_simd_num_threads_codegen.cpp
-
distribute_parallel_for_simd_private_codegen.cpp
-
distribute_parallel_for_simd_proc_bind_codegen.cpp
-
distribute_private_codegen.cpp
-
distribute_simd_codegen.cpp
-
distribute_simd_firstprivate_codegen.cpp
-
distribute_simd_lastprivate_codegen.cpp
-
distribute_simd_private_codegen.cpp
-
distribute_simd_reduction_codegen.cpp
-
for_firstprivate_codegen.cpp
-
for_lastprivate_codegen.cpp
-
for_linear_codegen.cpp
-
for_private_codegen.cpp
-
for_reduction_codegen.cpp
-
for_reduction_codegen_UDR.cpp
-
for_reduction_task_codegen.cpp
-
master_taskloop_in_reduction_codegen.cpp
-
master_taskloop_simd_in_reduction_codegen.cpp
-
nvptx_allocate_codegen.cpp
-
nvptx_data_sharing.cpp
-
nvptx_distribute_parallel_generic_mode_codegen.cpp
-
nvptx_lambda_capturing.cpp
-
nvptx_multi_target_parallel_codegen.cpp
-
nvptx_nested_parallel_codegen.cpp
-
nvptx_parallel_codegen.cpp
-
nvptx_parallel_for_codegen.cpp
-
nvptx_target_codegen.cpp
-
nvptx_target_parallel_codegen.cpp
-
nvptx_target_parallel_num_threads_codegen.cpp
-
nvptx_target_parallel_reduction_codegen_tbaa_PR46146.cpp
-
nvptx_target_teams_codegen.cpp
-
nvptx_target_teams_distribute_codegen.cpp
-
nvptx_target_teams_distribute_parallel_for_codegen.cpp
-
nvptx_target_teams_distribute_parallel_for_generic_mode_codegen.cpp
-
nvptx_target_teams_distribute_parallel_for_simd_codegen.cpp
-
nvptx_teams_codegen.cpp
-
nvptx_teams_reduction_codegen.cpp
-
openmp_win_codegen.cpp
-
ordered_codegen.cpp
-
parallel_codegen.cpp
-
parallel_copyin_codegen.cpp
-
parallel_firstprivate_codegen.cpp
-
parallel_for_codegen.cpp
-
parallel_for_lastprivate_conditional.cpp
-
parallel_for_linear_codegen.cpp
-
parallel_for_reduction_task_codegen.cpp
-
parallel_if_codegen.cpp
-
parallel_master_codegen.cpp
-
parallel_master_reduction_task_codegen.cpp
-
parallel_master_taskloop_codegen.cpp
-
parallel_master_taskloop_lastprivate_codegen.cpp
-
parallel_master_taskloop_simd_codegen.cpp
-
parallel_master_taskloop_simd_lastprivate_codegen.cpp
-
parallel_private_codegen.cpp
-
parallel_reduction_codegen.cpp
-
parallel_reduction_task_codegen.cpp
-
parallel_sections_codegen.cpp
-
parallel_sections_reduction_task_codegen.cpp
-
sections_firstprivate_codegen.cpp
-
sections_lastprivate_codegen.cpp
-
sections_private_codegen.cpp
-
sections_reduction_codegen.cpp
-
sections_reduction_task_codegen.cpp
-
single_codegen.cpp
-
single_firstprivate_codegen.cpp
-
single_private_codegen.cpp
-
target_codegen_global_capture.cpp
-
target_map_codegen_03.cpp
-
target_parallel_codegen.cpp
-
target_parallel_debug_codegen.cpp
-
target_parallel_for_codegen.cpp
-
target_parallel_for_debug_codegen.cpp
-
target_parallel_for_reduction_task_codegen.cpp
-
target_parallel_for_simd_codegen.cpp
-
target_parallel_if_codegen.cpp
-
target_parallel_num_threads_codegen.cpp
-
target_parallel_reduction_task_codegen.cpp
-
target_teams_codegen.cpp
-
target_teams_distribute_codegen.cpp
-
target_teams_distribute_collapse_codegen.cpp
-
target_teams_distribute_dist_schedule_codegen.cpp
-
target_teams_distribute_firstprivate_codegen.cpp
-
target_teams_distribute_lastprivate_codegen.cpp
-
target_teams_distribute_parallel_for_codegen.cpp
-
target_teams_distribute_parallel_for_collapse_codegen.cpp
-
target_teams_distribute_parallel_for_dist_schedule_codegen.cpp
-
target_teams_distribute_parallel_for_firstprivate_codegen.cpp
-
target_teams_distribute_parallel_for_if_codegen.cpp
-
target_teams_distribute_parallel_for_lastprivate_codegen.cpp
-
target_teams_distribute_parallel_for_order_codegen.cpp
-
target_teams_distribute_parallel_for_private_codegen.cpp
-
target_teams_distribute_parallel_for_proc_bind_codegen.cpp
-
target_teams_distribute_parallel_for_reduction_codegen.cpp
-
target_teams_distribute_parallel_for_reduction_task_codegen.cpp
-
target_teams_distribute_parallel_for_schedule_codegen.cpp
-
target_teams_distribute_parallel_for_simd_codegen.cpp
-
target_teams_distribute_parallel_for_simd_collapse_codegen.cpp
-
target_teams_distribute_parallel_for_simd_dist_schedule_codegen.cpp
-
target_teams_distribute_parallel_for_simd_firstprivate_codegen.cpp
-
target_teams_distribute_parallel_for_simd_if_codegen.cpp
-
target_teams_distribute_parallel_for_simd_lastprivate_codegen.cpp
-
target_teams_distribute_parallel_for_simd_private_codegen.cpp
-
target_teams_distribute_parallel_for_simd_proc_bind_codegen.cpp
-
target_teams_distribute_parallel_for_simd_reduction_codegen.cpp
-
target_teams_distribute_parallel_for_simd_schedule_codegen.cpp
-
target_teams_distribute_private_codegen.cpp
-
target_teams_distribute_reduction_codegen.cpp
-
target_teams_distribute_simd_codegen.cpp
-
target_teams_distribute_simd_collapse_codegen.cpp
-
target_teams_distribute_simd_dist_schedule_codegen.cpp
-
target_teams_distribute_simd_firstprivate_codegen.cpp
-
target_teams_distribute_simd_lastprivate_codegen.cpp
-
target_teams_distribute_simd_private_codegen.cpp
-
target_teams_distribute_simd_reduction_codegen.cpp
-
target_teams_map_codegen.cpp
-
target_teams_num_teams_codegen.cpp
-
target_teams_thread_limit_codegen.cpp
-
task_codegen.cpp
-
task_if_codegen.cpp
-
task_in_reduction_codegen.cpp
-
taskloop_in_reduction_codegen.cpp
-
taskloop_simd_in_reduction_codegen.cpp
-
teams_codegen.cpp
-
teams_distribute_codegen.cpp
-
teams_distribute_collapse_codegen.cpp
-
teams_distribute_dist_schedule_codegen.cpp
-
teams_distribute_firstprivate_codegen.cpp
-
teams_distribute_lastprivate_codegen.cpp
-
teams_distribute_parallel_for_codegen.cpp
-
teams_distribute_parallel_for_collapse_codegen.cpp
-
teams_distribute_parallel_for_copyin_codegen.cpp
-
teams_distribute_parallel_for_dist_schedule_codegen.cpp
-
teams_distribute_parallel_for_firstprivate_codegen.cpp
-
teams_distribute_parallel_for_if_codegen.cpp
-
teams_distribute_parallel_for_lastprivate_codegen.cpp
-
teams_distribute_parallel_for_num_threads_codegen.cpp
-
teams_distribute_parallel_for_private_codegen.cpp
-
teams_distribute_parallel_for_proc_bind_codegen.cpp
-
teams_distribute_parallel_for_reduction_codegen.cpp
-
teams_distribute_parallel_for_reduction_task_codegen.cpp
-
teams_distribute_parallel_for_schedule_codegen.cpp
-
teams_distribute_parallel_for_simd_codegen.cpp
-
teams_distribute_parallel_for_simd_collapse_codegen.cpp
-
teams_distribute_parallel_for_simd_dist_schedule_codegen.cpp
-
teams_distribute_parallel_for_simd_firstprivate_codegen.cpp
-
teams_distribute_parallel_for_simd_if_codegen.cpp
-
teams_distribute_parallel_for_simd_lastprivate_codegen.cpp
-
teams_distribute_parallel_for_simd_num_threads_codegen.cpp
-
teams_distribute_parallel_for_simd_private_codegen.cpp
-
teams_distribute_parallel_for_simd_proc_bind_codegen.cpp
-
teams_distribute_parallel_for_simd_reduction_codegen.cpp
-
teams_distribute_parallel_for_simd_schedule_codegen.cpp
-
teams_distribute_private_codegen.cpp
-
teams_distribute_reduction_codegen.cpp
-
teams_distribute_simd_codegen.cpp
-
teams_distribute_simd_collapse_codegen.cpp
-
teams_distribute_simd_dist_schedule_codegen.cpp
-
teams_distribute_simd_firstprivate_codegen.cpp
-
teams_distribute_simd_lastprivate_codegen.cpp
-
teams_distribute_simd_private_codegen.cpp
-
teams_distribute_simd_reduction_codegen.cpp
-
teams_firstprivate_codegen.cpp
-
teams_private_codegen.cpp
-
tile_codegen.cpp
-
vla_crash.c

Differential D101849

[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks
ClosedPublic

Authored by ggeorgakoudis on May 4 2021, 12:48 PM.

Download Raw Diff

Details

Reviewers

jdoerfert

Commits

rG207b08a9130b: [OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks
rG956cae2f09b2: [OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks

Summary

This patch refactors a subset of Clang OpenMP tests, generating checklines using the update_cc_test_checks script. This refactoring facilitates updating the Clang OpenMP code generation codebase by automating test generation.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ggeorgakoudis created this revision.May 4 2021, 12:48 PM

Herald added subscribers: jfb, guansong, yaxunl. · View Herald TranscriptMay 4 2021, 12:48 PM

ggeorgakoudis requested review of this revision.May 4 2021, 12:48 PM

Herald added a reviewer: jdoerfert. · View Herald TranscriptMay 4 2021, 12:48 PM

Herald added a project: Restricted Project. · View Herald Transcript

Herald added subscribers: cfe-commits, sstefan1. · View Herald Transcript

ggeorgakoudis edited the summary of this revision. (Show Details)May 4 2021, 12:54 PM

LG, thanks for making this happen.

This revision is now accepted and ready to land.May 4 2021, 3:26 PM

This revision was landed with ongoing or failed builds.May 4 2021, 4:59 PM

Closed by commit rG956cae2f09b2: [OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks (authored by ggeorgakoudis). · Explain Why

This revision was automatically updated to reflect the committed changes.

ggeorgakoudis added a commit: rG956cae2f09b2: [OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks.

ggeorgakoudis added a reverting change: rGf016c06abb1d: Revert "[OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks".May 4 2021, 5:13 PM

ggeorgakoudis reopened this revision.May 4 2021, 5:16 PM

This revision is now accepted and ready to land.May 4 2021, 5:16 PM

Update tests

More updates to tests

This revision was landed with ongoing or failed builds.May 5 2021, 8:09 PM

Closed by commit rG207b08a9130b: [OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks (authored by ggeorgakoudis). · Explain Why

This revision was automatically updated to reflect the committed changes.

ggeorgakoudis added a commit: rG207b08a9130b: [OpenMP][NFC] Refactor Clang OpenMP tests using update_cc_test_checks.

Harbormaster completed remote builds in B102861: Diff 343210.May 5 2021, 9:22 PM

@ggeorgakoudis Clang::parallel_for_codegen.cpp is failing on some of our bots:
https://lab.llvm.org/buildbot/#/builders/43/builds/5751
https://lab.llvm.org/buildbot/#/builders/107/builds/7585

The failing test line is:

// RUN: %clang_cc1 -verify -triple x86_64-apple-darwin10 -O1 -fopenmp-simd -emit-llvm %s -o - | FileCheck %s --check-prefix=CHECK10

It passes if you have the X86 backend enabled but fails if you do not. Our bots usually have only Arm or AArch64 enabled.

When you have the X86 backend:

; Function Attrs: nofree norecurse nosync nounwind mustprogress
define void @_Z14static_chunkedPfS_S_S_(float* nocapture %a, float* nocapture readonly %b, float* nocapture readonly %c, float* nocapture readonly %d) local_unnamed_addr #1 {
entry:
  %arrayidx.0 = getelementptr inbounds float, float* %b, i64 131071
  %arrayidx2.0 = getelementptr inbounds float, float* %c, i64 131071
  %arrayidx4.0 = getelementptr inbounds float, float* %d, i64 131071
  %arrayidx7.0 = getelementptr inbounds float, float* %a, i64 131071
  %indvars.iv.next.0 = add nuw nsw i64 131071, 127
  br label %for.body

for.cond.cleanup:                                 ; preds = %for.body
  ret void

for.body:                                         ; preds = %for.body.for.body_crit_edge, %entry
  %indvars.iv.next.phi = phi i64 [ %indvars.iv.next.0, %entry ], [ %indvars.iv.next.1, %for.body.for.body_crit_edge ]
  %arrayidx7.phi = phi float* [ %arrayidx7.0, %entry ], [ %arrayidx7.1, %for.body.for.body_crit_edge ]
  %arrayidx4.phi = phi float* [ %arrayidx4.0, %entry ], [ %arrayidx4.1, %for.body.for.body_crit_edge ]
  %arrayidx2.phi = phi float* [ %arrayidx2.0, %entry ], [ %arrayidx2.1, %for.body.for.body_crit_edge ]
  %arrayidx.phi = phi float* [ %arrayidx.0, %entry ], [ %arrayidx.1, %for.body.for.body_crit_edge ]
  %0 = load float, float* %arrayidx.phi, align 4, !tbaa !2
  %1 = load float, float* %arrayidx2.phi, align 4, !tbaa !2
  %mul = fmul float %0, %1
  %2 = load float, float* %arrayidx4.phi, align 4, !tbaa !2
  %mul5 = fmul float %mul, %2
  store float %mul5, float* %arrayidx7.phi, align 4, !tbaa !2
  %3 = trunc i64 %indvars.iv.next.phi to i32
  %cmp = icmp sgt i32 %3, -1
  br i1 %cmp, label %for.body.for.body_crit_edge, label %for.cond.cleanup, !llvm.loop !10

for.body.for.body_crit_edge:                      ; preds = %for.body
  %arrayidx.1 = getelementptr inbounds float, float* %b, i64 %indvars.iv.next.phi
  %arrayidx2.1 = getelementptr inbounds float, float* %c, i64 %indvars.iv.next.phi
  %arrayidx4.1 = getelementptr inbounds float, float* %d, i64 %indvars.iv.next.phi
  %arrayidx7.1 = getelementptr inbounds float, float* %a, i64 %indvars.iv.next.phi
  %indvars.iv.next.1 = add nuw nsw i64 %indvars.iv.next.phi, 127
  br label %for.body
}

When you don't:

; Function Attrs: nofree norecurse nosync nounwind mustprogress
define void @_Z14static_chunkedPfS_S_S_(float* nocapture %a, float* nocapture readonly %b, float* nocapture readonly %c, float* nocapture readonly %d) local_unnamed_addr #1 {
entry:
  br label %for.body

for.cond.cleanup:                                 ; preds = %for.body
  ret void

for.body:                                         ; preds = %entry, %for.body
  %indvars.iv = phi i64 [ 131071, %entry ], [ %indvars.iv.next, %for.body ]
  %arrayidx = getelementptr inbounds float, float* %b, i64 %indvars.iv
  %0 = load float, float* %arrayidx, align 4, !tbaa !2
  %arrayidx2 = getelementptr inbounds float, float* %c, i64 %indvars.iv
  %1 = load float, float* %arrayidx2, align 4, !tbaa !2
  %mul = fmul float %0, %1
  %arrayidx4 = getelementptr inbounds float, float* %d, i64 %indvars.iv
  %2 = load float, float* %arrayidx4, align 4, !tbaa !2
  %mul5 = fmul float %mul, %2
  %arrayidx7 = getelementptr inbounds float, float* %a, i64 %indvars.iv
  store float %mul5, float* %arrayidx7, align 4, !tbaa !2
  %indvars.iv.next = add nuw nsw i64 %indvars.iv, 127
  %3 = trunc i64 %indvars.iv.next to i32
  %cmp = icmp sgt i32 %3, -1
  br i1 %cmp, label %for.body, label %for.cond.cleanup, !llvm.loop !10
}

This is one of a few differences.

Either way this seems like a bug given that clang usually doesn't require the llvm backend for a particular target. (which is why things like the target parser are always enabled) But I'm new to OpenMP so please correct me if not.

DavidSpickett mentioned this in rGe4b790c5e365: [OpenMP] Temporarily require X86 target for parallel_for_codegen.cpp test.May 6 2021, 7:17 AM

I've required X86 target for this test to get our bots green again.

In D101849#2741980, @DavidSpickett wrote:

I've required X86 target for this test to get our bots green again.

We can have that as a workaround sure.

Either way this seems like a bug given that clang usually doesn't require the llvm backend for a particular target.

I agree, something is amiss.

We will probably remove the fopenmp-simd check lines again now, but this is still something we might want to investigate, non-determinism has the tendency to come back and bite you.

In D101849#2741980, @DavidSpickett wrote:

I've required X86 target for this test to get our bots green again.

Thanks David for the workaround. I agree with Johannes's comments below. It looks like a bug.

// RUN: %clang_cc1 -verify -triple x86_64-apple-darwin10 -O1 -fopenmp-simd -emit-llvm %s -o - | FileCheck %s --check-prefix=CHECK10

Is there a good reason to run this with -O1? Doing so makes it super sensitive to which llvm passes run and now this test fails for us now with x86 as well.

In D101849#2764703, @mikerice wrote:

// RUN: %clang_cc1 -verify -triple x86_64-apple-darwin10 -O1 -fopenmp-simd -emit-llvm %s -o - | FileCheck %s --check-prefix=CHECK10

Is there a good reason to run this with -O1? Doing so makes it super sensitive to which llvm passes run and now this test fails for us now with x86 as well.

One of the O1 was introduced by 6e8248fdad5fc59306beb286a3089fe401460826 and I cannot really tell why we would need it.
@ABataev do you remember if the O1 was needed?
If not I'd suggest to remove the O1 run line or add -disable-llvm-optzns to the run line.
If it is needed, we can have a simple parallel for test with manual check lines.

In D101849#2764798, @jdoerfert wrote:

In D101849#2764703, @mikerice wrote:

// RUN: %clang_cc1 -verify -triple x86_64-apple-darwin10 -O1 -fopenmp-simd -emit-llvm %s -o - | FileCheck %s --check-prefix=CHECK10

Is there a good reason to run this with -O1? Doing so makes it super sensitive to which llvm passes run and now this test fails for us now with x86 as well.

One of the O1 was introduced by 6e8248fdad5fc59306beb286a3089fe401460826 and I cannot really tell why we would need it.
@ABataev do you remember if the O1 was needed?
If not I'd suggest to remove the O1 run line or add -disable-llvm-optzns to the run line.
If it is needed, we can have a simple parallel for test with manual check lines.

No, it is not required. Most probably, needed to simplify test checks, nothing else.

In D101849#2764825, @ABataev wrote:

No, it is not required. Most probably, needed to simplify test checks, nothing else.

Thanks. I'd like to remove the "REQUIRES: x86-registered-target", the -O1 for CHECK6,10, and regenerate the CHECK lines. Unfortunately I am seeing this mangling issue (https://bugs.llvm.org/show_bug.cgi?id=49767) when running the script. @ggeorgakoudis, how did you get past this to generate the CHECK lines in your change?

In D101849#2766411, @mikerice wrote:

In D101849#2764825, @ABataev wrote:

No, it is not required. Most probably, needed to simplify test checks, nothing else.

Thanks. I'd like to remove the "REQUIRES: x86-registered-target", the -O1 for CHECK6,10, and regenerate the CHECK lines. Unfortunately I am seeing this mangling issue (https://bugs.llvm.org/show_bug.cgi?id=49767) when running the script. @ggeorgakoudis, how did you get past this to generate the CHECK lines in your change?

The workaround is to check if the assertion would trigger and just return from the function instead.

In D101849#2766411, @mikerice wrote:

In D101849#2764825, @ABataev wrote:

No, it is not required. Most probably, needed to simplify test checks, nothing else.

Thanks. I'd like to remove the "REQUIRES: x86-registered-target", the -O1 for CHECK6,10, and regenerate the CHECK lines. Unfortunately I am seeing this mangling issue (https://bugs.llvm.org/show_bug.cgi?id=49767) when running the script. @ggeorgakoudis, how did you get past this to generate the CHECK lines in your change?