Page MenuHomePhabricator

dhruvachak (Dhruva Chakrabarti)
User

Projects

User does not belong to any projects.

User Details

User Since
Mar 15 2021, 11:51 AM (106 w, 3 d)

Recent Activity

Fri, Mar 24

dhruvachak accepted D145831: [OpenMP][libomptarget] Add support for critical regions in AMD GPU device offloading.

LG. All of my concerns have been resolved.

Fri, Mar 24, 11:12 AM · Restricted Project, Restricted Project

Wed, Mar 22

dhruvachak added inline comments to D124652: [OpenMP] [OMPT] [amdgpu] [5/8] Implemented device init/fini/load callbacks.
Wed, Mar 22, 11:25 AM · Restricted Project, Restricted Project
dhruvachak added inline comments to D124652: [OpenMP] [OMPT] [amdgpu] [5/8] Implemented device init/fini/load callbacks.
Wed, Mar 22, 9:52 AM · Restricted Project, Restricted Project

Tue, Mar 21

dhruvachak added inline comments to D124070: [OpenMP] [OMPT] [amdgpu] [4/8] Implemented callback registration in nextgen plugins.
Tue, Mar 21, 10:25 PM · Restricted Project, Restricted Project
dhruvachak added a comment to D145831: [OpenMP][libomptarget] Add support for critical regions in AMD GPU device offloading.

OK then as written this is definitely going to blow up on us. We shouldn't implement the general purpose lock API if it deadlocks unless called in a very specific situation.

Probably best to emit the CAS in line as part of the IR transform, but otherwise we could add more runtime functions specific to critical. Uses of the general purpose omp_lock should be a compile time error on platforms that can't do it (it's unfortunate that lock returns void), but until then builtin_trap at least looks clearer when debugging than deadlock.

Tue, Mar 21, 4:17 PM · Restricted Project, Restricted Project

Mon, Mar 20

dhruvachak added inline comments to D145831: [OpenMP][libomptarget] Add support for critical regions in AMD GPU device offloading.
Mon, Mar 20, 7:13 PM · Restricted Project, Restricted Project

Sun, Mar 19

dhruvachak added inline comments to D145831: [OpenMP][libomptarget] Add support for critical regions in AMD GPU device offloading.
Sun, Mar 19, 4:44 PM · Restricted Project, Restricted Project

Fri, Mar 17

dhruvachak committed rG1c9ec74e3f2a: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead… (authored by dhruvachak).
[Clang][OpenMP] Insert alloca for kernel args at function entry block instead…
Fri, Mar 17, 1:36 PM · Restricted Project, Restricted Project, Restricted Project
dhruvachak closed D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..
Fri, Mar 17, 1:36 PM · Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..

Removed attributes (as original) from clang test OpenMP/target_map_codegen_hold.cpp.

Fri, Mar 17, 9:20 AM · Restricted Project, Restricted Project, Restricted Project

Thu, Mar 16

dhruvachak added inline comments to D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..
Thu, Mar 16, 11:22 PM · Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..

Fixed clang test OpenMP/target_map_codegen_hold.cpp.

Thu, Mar 16, 11:17 PM · Restricted Project, Restricted Project, Restricted Project
dhruvachak added inline comments to D145831: [OpenMP][libomptarget] Add support for critical regions in AMD GPU device offloading.
Thu, Mar 16, 10:16 AM · Restricted Project, Restricted Project

Mon, Mar 13

dhruvachak updated the diff for D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..

Fixed LIT test failure, added new clang test OpenMP/bug60602.cpp.

Mon, Mar 13, 10:38 PM · Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the summary of D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..
Mon, Mar 13, 12:04 AM · Restricted Project, Restricted Project, Restricted Project

Sun, Mar 12

dhruvachak updated the diff for D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..

Addressed comment. Using the alloca insert point for kernel args alloca.

Sun, Mar 12, 11:59 PM · Restricted Project, Restricted Project, Restricted Project

Fri, Mar 10

dhruvachak added a reviewer for D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point.: RaviNarayanaswamy.
Fri, Mar 10, 12:09 PM · Restricted Project, Restricted Project, Restricted Project
dhruvachak retitled D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point. from Insert alloca for kernel args at function entry block instead of the launch point. to [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..
Fri, Mar 10, 11:54 AM · Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..

No tests updated?

Fri, Mar 10, 11:50 AM · Restricted Project, Restricted Project, Restricted Project
dhruvachak added reviewers for D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point.: doru1004, carlo.bertolli, jhuber6, tianshilei1992, ABataev, jplehr, ronlieb.
Fri, Mar 10, 11:48 AM · Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..

TODOs: Update existing LIT tests, add a new one.

Fri, Mar 10, 11:46 AM · Restricted Project, Restricted Project, Restricted Project
dhruvachak requested review of D145820: [Clang][OpenMP] Insert alloca for kernel args at function entry block instead of the launch point..
Fri, Mar 10, 11:44 AM · Restricted Project, Restricted Project, Restricted Project

Wed, Mar 8

dhruvachak added a comment to D127372: [OpenMP] [OMPT] [8/8] Added lit tests for OMPT target callbacks.

Just to reiterate what I think this does is: It adds tests which currently ensure that the call backs can not be registered (as it should).
The tests would then be updated for the other patches, when these call backs can be registered, right?

Wed, Mar 8, 1:59 PM · Restricted Project, Restricted Project
dhruvachak added a comment to D127372: [OpenMP] [OMPT] [8/8] Added lit tests for OMPT target callbacks.

ping. @jhuber6 @dreachem

Wed, Mar 8, 11:08 AM · Restricted Project, Restricted Project

Fri, Mar 3

dhruvachak added a reviewer for D127372: [OpenMP] [OMPT] [8/8] Added lit tests for OMPT target callbacks: jhuber6.
Fri, Mar 3, 9:12 AM · Restricted Project, Restricted Project
dhruvachak updated subscribers of D127372: [OpenMP] [OMPT] [8/8] Added lit tests for OMPT target callbacks.

@jhuber6 Is this patch ok to land? This will provide some test coverage to the existing OMPT target functionality, the device-independent part. As plugin-specific changes land, these tests will have to be updated.

Fri, Mar 3, 9:12 AM · Restricted Project, Restricted Project

Feb 27 2023

dhruvachak added a comment to D127372: [OpenMP] [OMPT] [8/8] Added lit tests for OMPT target callbacks.

@mhalk Can this be landed on top of trunk today? I would be interested in landing this one on top of what we have today so that we get some test coverage as part of "make check". You will need to update the tests to match the current code and update the tests as patches 4-7 land.

Feb 27 2023, 11:01 AM · Restricted Project, Restricted Project

Feb 10 2023

dhruvachak added inline comments to D124070: [OpenMP] [OMPT] [amdgpu] [4/8] Implemented callback registration in nextgen plugins.
Feb 10 2023, 9:31 AM · Restricted Project, Restricted Project

Feb 9 2023

dhruvachak updated the summary of D124070: [OpenMP] [OMPT] [amdgpu] [4/8] Implemented callback registration in nextgen plugins.
Feb 9 2023, 9:11 AM · Restricted Project, Restricted Project

Jan 17 2023

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

After rebasing on top of main today and regenerating all the auto-update clang tests, here are the test results. The AST tests have to be updated manually as Johannes mentioned earlier. I haven't looked at the other clang test failures.

Jan 17 2023, 9:52 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D102107: [OpenMP] Codegen aggregate for outlined function captures.

Rebased.

Jan 17 2023, 9:43 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Dec 22 2022

dhruvachak added a comment to D140591: [OpenMP] Solve potential VERSION script error w/ OMPT symbols.

I don't know why the problem is showing up now. ompt_start_tool has been there for a long time. While libomp_ompt_connect is new, it is unconditionally defined.

Dec 22 2022, 10:13 PM · Restricted Project, Restricted Project

Dec 12 2022

dhruvachak added inline comments to D139208: [OpenMP][libomptarget][AMDGPU] lock/unlock (pin/unpin) mechanism in libomptarget amdgpu plugin (API and implementation).
Dec 12 2022, 10:02 AM · Restricted Project, Restricted Project

Dec 9 2022

dhruvachak added a comment to D133463: [OpenMP] Device Time Profile.

Can you please describe at a high level what data "Device Time Profile" is supposed to collect and how you are collecting it? Is there a specification somewhere for this functionality?

Dec 9 2022, 11:02 PM · Restricted Project, Restricted Project, Restricted Project

Dec 8 2022

dhruvachak committed rGaa4c0f116c93: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget (authored by dhruvachak).
[OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget
Dec 8 2022, 11:43 AM · Restricted Project, Restricted Project
dhruvachak closed D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.
Dec 8 2022, 11:43 AM · Restricted Project, Restricted Project

Dec 7 2022

dhruvachak added a comment to D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

@tianshilei1992 I tested builds w/ and w/o -DLIBOMPTARGET_ENABLE_DEBUG=ON. Any concern if I land the updated patch?

Dec 7 2022, 5:12 PM · Restricted Project, Restricted Project

Dec 2 2022

dhruvachak updated the diff for D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

Fixed build failure with LIBOMPTARGET_ENABLE_DEBUG.

Dec 2 2022, 11:56 PM · Restricted Project, Restricted Project

Dec 1 2022

dhruvachak reopened D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.
Dec 1 2022, 11:30 PM · Restricted Project, Restricted Project
dhruvachak added a comment to D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

I can repro the failure with -DLIBOMPTARGET_ENABLE_DEBUG=ON added. I will update the patch.

Dec 1 2022, 11:29 PM · Restricted Project, Restricted Project
dhruvachak added a comment to D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

Looks like this patch breaks the build.

Dec 1 2022, 10:04 PM · Restricted Project, Restricted Project
dhruvachak added a reverting change for rG2b234ce3f07c: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget: rG4763e877f720: Revert "[OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget".
Dec 1 2022, 10:02 PM · Restricted Project, Restricted Project
dhruvachak committed rG4763e877f720: Revert "[OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget" (authored by dhruvachak).
Revert "[OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget"
Dec 1 2022, 10:02 PM · Restricted Project, Restricted Project
dhruvachak added a reverting change for D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget: rG4763e877f720: Revert "[OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget".
Dec 1 2022, 10:02 PM · Restricted Project, Restricted Project
dhruvachak committed rG2b234ce3f07c: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget (authored by dhruvachak).
[OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget
Dec 1 2022, 4:07 PM · Restricted Project, Restricted Project
dhruvachak closed D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.
Dec 1 2022, 4:06 PM · Restricted Project, Restricted Project
dhruvachak added a comment to D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

Anyone else has any comment before I land this patch?

Dec 1 2022, 12:15 PM · Restricted Project, Restricted Project
dhruvachak added inline comments to D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.
Dec 1 2022, 11:29 AM · Restricted Project, Restricted Project
dhruvachak updated the diff for D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

Variable name changed.

Dec 1 2022, 11:27 AM · Restricted Project, Restricted Project

Nov 29 2022

dhruvachak updated the diff for D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

Rebased. Please review.

Nov 29 2022, 12:24 AM · Restricted Project, Restricted Project

Nov 15 2022

dhruvachak committed rG5b67bce787e4: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT… (authored by dhruvachak).
[OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT…
Nov 15 2022, 2:22 PM · Restricted Project, Restricted Project
dhruvachak closed D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..
Nov 15 2022, 2:22 PM · Restricted Project, Restricted Project
dhruvachak updated the diff for D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..

Rebased.

Nov 15 2022, 2:20 PM · Restricted Project, Restricted Project

Nov 14 2022

dhruvachak added a comment to D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..

Make sure the new files in libomptarget match the LLVM style for the variables and filenames. Otherwise looks file

Nov 14 2022, 11:16 PM · Restricted Project, Restricted Project
dhruvachak updated the diff for D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..

Changed new file name from ompt-connector.h to ompt_connector.h.

Nov 14 2022, 11:09 PM · Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Turns out a rebase on top of trunk had ~200 test conflicts. During my last update in Sep, I had resolved all of the clang test conflicts and failures, there were only llvm test failures.

Nov 14 2022, 10:51 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D102107: [OpenMP] Codegen aggregate for outlined function captures.

Rebased.

Nov 14 2022, 10:30 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Nov 11 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

@dhruvachak Do you still need help updating the LLVM tests?

Nov 11 2022, 5:54 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Nov 4 2022

dhruvachak updated the diff for D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..

Fixed issues reported in https://github.com/llvm/llvm-project/issues/58721

Nov 4 2022, 3:20 PM · Restricted Project, Restricted Project
dhruvachak reopened D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..
Nov 4 2022, 3:16 PM · Restricted Project, Restricted Project

Nov 3 2022

dhruvachak updated the summary of D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..
Nov 3 2022, 7:19 PM · Restricted Project, Restricted Project

Nov 1 2022

dhruvachak added a comment to D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..

This patch had to be reverted because of a link failure https://github.com/llvm/llvm-project/issues/58721

Nov 1 2022, 9:08 AM · Restricted Project, Restricted Project
dhruvachak added a reverting change for rGf94c2679cb1d: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT…: rG88e557cbc940: Revert "[OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT….
Nov 1 2022, 9:00 AM · Restricted Project, Restricted Project
dhruvachak committed rG88e557cbc940: Revert "[OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT… (authored by dhruvachak).
Revert "[OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT…
Nov 1 2022, 9:00 AM · Restricted Project, Restricted Project
dhruvachak added a reverting change for D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries.: rG88e557cbc940: Revert "[OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT….
Nov 1 2022, 9:00 AM · Restricted Project, Restricted Project

Oct 31 2022

dhruvachak updated the summary of D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.
Oct 31 2022, 2:48 PM · Restricted Project, Restricted Project
dhruvachak updated the diff for D123974: [OpenMP] [OMPT] [3/8] Implemented callback registration in libomptarget.

Rebased

Oct 31 2022, 2:46 PM · Restricted Project, Restricted Project
dhruvachak committed rGf94c2679cb1d: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT… (authored by dhruvachak).
[OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT…
Oct 31 2022, 10:34 AM · Restricted Project, Restricted Project
dhruvachak closed D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..
Oct 31 2022, 10:33 AM · Restricted Project, Restricted Project

Oct 28 2022

dhruvachak updated the diff for D123572: [OpenMP] [OMPT] [2/8] Implemented a connector for communication of OMPT callbacks between libraries..

Rebased

Oct 28 2022, 5:31 PM · Restricted Project, Restricted Project

Sep 30 2022

dhruvachak committed rG667af48179cd: [OpenMP] [OMPT] [1/8] Create separate categories for host, device, [no]emi… (authored by dhruvachak).
[OpenMP] [OMPT] [1/8] Create separate categories for host, device, [no]emi…
Sep 30 2022, 5:47 PM · Restricted Project, Restricted Project
dhruvachak closed D123429: [OpenMP] [OMPT] [1/8] Create separate categories for host, device, [no]emi events.
Sep 30 2022, 5:46 PM · Restricted Project, Restricted Project
dhruvachak updated the diff for D123429: [OpenMP] [OMPT] [1/8] Create separate categories for host, device, [no]emi events.

Rebase

Sep 30 2022, 10:08 AM · Restricted Project, Restricted Project

Sep 23 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Updated llvm tests. The following 3 tests still fail:

LLVM :: Transforms/OpenMP/spmdization_constant_prop.ll
LLVM :: Transforms/OpenMP/spmdization_guarding_two_reaching_kernels.ll
LLVM :: Transforms/OpenMP/spmdization_remarks.ll

@jdoerfert @jhuber6
I updated the LLVM tests except one, Transforms/OpenMP/spmdization_constant_prop.ll. There is no C source snippet in there. Can you help as to how to update it? Please review the diffs for all the updated LLVM tests as well.

In addition, the other 2 tests above fail even after updating. Looks like something is wrong. Can you help as to how to fix them?

Similar to clang tests, we are seeing remarks differences. We already decided to file an issue (after this patch lands) and look at them after-the-fact.

Did you recreate the tests from the C snipped? That is probably not a good idea. We should modify the IR. If we start with C code we can't do it like this anyway. I mean:

  • the IR is totally different,
  • the debug info is missing,
  • lots of unrelated metadata,
  • part of the device runtime was merged in,
  • ...
Sep 23 2022, 6:09 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Updated llvm tests. The following 3 tests still fail:

LLVM :: Transforms/OpenMP/spmdization_constant_prop.ll
LLVM :: Transforms/OpenMP/spmdization_guarding_two_reaching_kernels.ll
LLVM :: Transforms/OpenMP/spmdization_remarks.ll
Sep 23 2022, 1:51 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the summary of D102107: [OpenMP] Codegen aggregate for outlined function captures.
Sep 23 2022, 1:44 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D102107: [OpenMP] Codegen aggregate for outlined function captures.

Updated llvm tests. The following 3 tests still fail:

Sep 23 2022, 1:42 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak reopened D102107: [OpenMP] Codegen aggregate for outlined function captures.

This patch was reverted.

Sep 23 2022, 1:41 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Sep 14 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

I reverted this commit while I fix the failing tests.

Sep 14 2022, 8:16 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a reverting change for rG7539e9cf811e: [OpenMP] Codegen aggregate for outlined function captures: rG839ac62c5085: Revert "[OpenMP] Codegen aggregate for outlined function captures".
Sep 14 2022, 8:10 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak committed rG839ac62c5085: Revert "[OpenMP] Codegen aggregate for outlined function captures" (authored by dhruvachak).
Revert "[OpenMP] Codegen aggregate for outlined function captures"
Sep 14 2022, 8:10 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a reverting change for D102107: [OpenMP] Codegen aggregate for outlined function captures: rG839ac62c5085: Revert "[OpenMP] Codegen aggregate for outlined function captures".
Sep 14 2022, 8:10 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

check-llvm fails bunch of test for me


Failed Tests (12):

LLVM :: Transforms/OpenMP/custom_state_machines.ll
LLVM :: Transforms/OpenMP/custom_state_machines_remarks.ll
LLVM :: Transforms/OpenMP/get_hardware_num_threads_in_block_fold.ll
LLVM :: Transforms/OpenMP/gpu_state_machine_function_ptr_replacement.ll
LLVM :: Transforms/OpenMP/is_spmd_exec_mode_fold.ll
LLVM :: Transforms/OpenMP/parallel_level_fold.ll
LLVM :: Transforms/OpenMP/spmdization.ll
LLVM :: Transforms/OpenMP/spmdization_assumes.ll
LLVM :: Transforms/OpenMP/spmdization_constant_prop.ll
LLVM :: Transforms/OpenMP/spmdization_guarding.ll
LLVM :: Transforms/OpenMP/spmdization_guarding_two_reaching_kernels.ll
LLVM :: Transforms/OpenMP/spmdization_remarks.ll
Sep 14 2022, 7:28 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak committed rG7539e9cf811e: [OpenMP] Codegen aggregate for outlined function captures (authored by ggeorgakoudis).
[OpenMP] Codegen aggregate for outlined function captures
Sep 14 2022, 5:55 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak closed D102107: [OpenMP] Codegen aggregate for outlined function captures.
Sep 14 2022, 5:55 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D102107: [OpenMP] Codegen aggregate for outlined function captures.

Rebased

Sep 14 2022, 5:47 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Aug 3 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Pointing out the recent changes at the corresponding source locations.

Aug 3 2022, 4:36 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

@jdoerfert Attached are the device IR files, generated with -save-temps.

Aug 3 2022, 10:57 AM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Jul 28 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

All changes from my end are in. Please review.

Jul 28 2022, 12:04 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

@jdoerfert With this patch, additional remarks are being generated. Please check whether the new OMP121 remarks in the following tests are OK.

Jul 28 2022, 11:59 AM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Rebased on top of a recent commit. Both check-clang and check-openmp (on amdgpu) pass.

Jul 28 2022, 11:56 AM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D102107: [OpenMP] Codegen aggregate for outlined function captures.

Regenerated clang tests, make check-clang passes

Jul 28 2022, 11:52 AM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Jul 26 2022

dhruvachak added a comment to D123429: [OpenMP] [OMPT] [1/8] Create separate categories for host, device, [no]emi events.

ping

Jul 26 2022, 11:37 AM · Restricted Project, Restricted Project

Jul 15 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.
Jul 15 2022, 12:37 AM · Restricted Project, Restricted Project, Restricted Project, Restricted Project

Jul 8 2022

dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

make check-openmp passes on amdgpu. Need to check on nvptx.

Jul 8 2022, 7:43 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Results from "make check-clang":

Jul 8 2022, 7:39 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added inline comments to D102107: [OpenMP] Codegen aggregate for outlined function captures.
Jul 8 2022, 6:04 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added inline comments to D102107: [OpenMP] Codegen aggregate for outlined function captures.
Jul 8 2022, 6:02 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak added a comment to D102107: [OpenMP] Codegen aggregate for outlined function captures.

Thanks. I followed the above steps and regenerated a couple of the AST tests but they still fail. Perhaps I am missing some options?

I currently have a handful of clang test failures where regen did not work. I am going to update the patch, post the current test results, and we can figure out how to regen the rest before we land this patch.

So, generate check lines for new tests in a separate patch first.

Jul 8 2022, 5:58 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project
dhruvachak updated the diff for D102107: [OpenMP] Codegen aggregate for outlined function captures.

Fixed opaque pointer miscompile.
Added alloc_aggregate_arg entry point to OpenMPOpt SPMD list.
Fixed nocapture attribute of kmpc_alloc_aggregate_arg,
Added align attribute for call to
kmpc_alloc_shared.
Updated (most) failing clang tests.

Jul 8 2022, 5:53 PM · Restricted Project, Restricted Project, Restricted Project, Restricted Project