Page MenuHomePhabricator

Please use GitHub pull requests for new patches. Phabricator shutdown timeline

Maetveis (Mészáros Gergely)
User

Projects

User does not belong to any projects.

User Details

User Since
Jul 19 2022, 11:51 AM (61 w, 4 d)

Recent Activity

May 11 2023

Maetveis accepted D150282: [Driver] -ftime-trace: derive trace file names from -o and -dumpdir.

I don't see anything in here that would block later extension for supporting offloading, so LGTM from my POV.

May 11 2023, 3:24 PM · Restricted Project, Restricted Project
Maetveis added a comment to D150282: [Driver] -ftime-trace: derive trace file names from -o and -dumpdir.

LGTM, but I don't feel like I have the experience to "formally" approve. I left a nice-to have suggestion too, but feel free to ignore, if you feel its out of scope.

May 11 2023, 7:20 AM · Restricted Project, Restricted Project

Jan 21 2023

Maetveis abandoned D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

Fixed by D142249.

Jan 21 2023, 4:19 AM · Restricted Project, Restricted Project
Maetveis accepted D142249: [openmp] Workaround for HSA in issue 60119.
Jan 21 2023, 3:53 AM · Restricted Project, Restricted Project
Maetveis requested changes to D142249: [openmp] Workaround for HSA in issue 60119.
Jan 21 2023, 12:28 AM · Restricted Project, Restricted Project
Maetveis accepted D142249: [openmp] Workaround for HSA in issue 60119.

Looks good, also works for my original program, thank you! Just one nit and a question:

Jan 21 2023, 12:10 AM · Restricted Project, Restricted Project

Jan 20 2023

Maetveis added inline comments to D142249: [openmp] Workaround for HSA in issue 60119.
Jan 20 2023, 1:40 PM · Restricted Project, Restricted Project
Maetveis added a comment to D142249: [openmp] Workaround for HSA in issue 60119.

This change avoids the deadlock, but now library registration happens before (during) the device runtimes are loaded essentially skipping the registration.

Jan 20 2023, 1:35 PM · Restricted Project, Restricted Project
Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

I'm moderately surprised that ctors are called when dlopen opens something that is already open. Hopefully that's not platform specific.

Jan 20 2023, 4:57 AM · Restricted Project, Restricted Project
Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

We have provisional consensus that this is a bug in HSA. It shouldn't be calling dlopen on various processes it finds in the address space and a patch to stop it doing that is in progress.

Jan 20 2023, 1:46 AM · Restricted Project, Restricted Project

Jan 19 2023

Maetveis updated the diff for D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

Replace thread local with class member + atomic thread Id. Also factor out the delayed handling into member functions.
The reason the thread id checks are necessary is 1. avoiding race conditions. 2. not returning before the library is registered in threads that are not doing loadRtls.

Jan 19 2023, 2:32 AM · Restricted Project, Restricted Project

Jan 18 2023

Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

I think you're missing some of the changes, I should have added a loop around the registration of the library to the plugins that goes over DelayedDescs and registers each, maybe it did not upload right?

Jan 18 2023, 9:49 AM · Restricted Project, Restricted Project
Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

This is too complex and we should not need static thread local stuff at all.
Why don't we set the flag that we loaded libomptarget early. If we end up loading it again we should just not initialize it. All we really want is to reduce the scope of the lock guarding the init flag, no?

Yes but the problem is that we didn't finish initializing it yet when the second call comes in, we're still inside the initialization of the amdgpu plugin so that one and any others after it didn't finish initialization yet. I.e. with your proposal the inner call would skip the "second" initialization then try to register the library to partially initialized plugins.

I don't see it. In either scheme the "inner/second" initialization of libomptarget will not actually initialize anything. You also return for the inner call and you will never come back, or how would it ever come back? Even if you do now, why would we. The second initialization is not necessary/wanted.

Jan 18 2023, 9:19 AM · Restricted Project, Restricted Project
Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

This is too complex and we should not need static thread local stuff at all.
Why don't we set the flag that we loaded libomptarget early. If we end up loading it again we should just not initialize it. All we really want is to reduce the scope of the lock guarding the init flag, no?

Jan 18 2023, 9:01 AM · Restricted Project, Restricted Project
Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

That's a really surprising thing for HSA to be doing. It looks like it's crawling all dynamic libraries that happen to be in the host process address space, but that seems unlikely to be necessary.

Jan 18 2023, 8:09 AM · Restricted Project, Restricted Project
Maetveis added a comment to D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

So, this occurs when we load the plugin shared library?

Jan 18 2023, 6:58 AM · Restricted Project, Restricted Project
Maetveis updated the summary of D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.
Jan 18 2023, 4:29 AM · Restricted Project, Restricted Project
Maetveis updated the diff for D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.

Add test.

Jan 18 2023, 4:22 AM · Restricted Project, Restricted Project
Maetveis requested review of D142008: [libomptarget] Delay recursive shared library registrations until offload RTLs are loaded.
Jan 18 2023, 4:19 AM · Restricted Project, Restricted Project

Oct 25 2022

Maetveis added a comment to D131469: [Clang] change default storing path of `-ftime-trace`.

I'm sorry that I didn't see the comments until now because of some personal work... Thank you for your kindness and suppot, I will improve this part of work according to your idea in D133662 : )

Oct 25 2022, 12:21 PM · Restricted Project, Restricted Project

Sep 18 2022

Maetveis requested changes to D131469: [Clang] change default storing path of `-ftime-trace`.

As discussed with @jamieschmeiser on D133662, I have left suggestions regarding the approach I took for handling -o and passing the option to the jobs.

Sep 18 2022, 12:20 PM · Restricted Project, Restricted Project
Maetveis added a comment to D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.

I had an email exchange with @dongjunduo about this situation. He was a GSoC student that @Whitney and I were mentoring for the past summer. He agrees that your approach is cleaner. There appears to be two parts to your work. First, you implemented the determining and passing of the options differently, and secondly, you improved the handling of off-loading and system specific file handling. Based on your earlier response, we proposed to him the following and he agrees that it seems appropriate. Could you please add comments to https://reviews.llvm.org/D131469 and he will work with you to change his code to reflect searching for -o and using the virtual functions. Then, if @MaskRay agrees, he can land his code and finish up his GSoC work. You can then add your extensions of off-loading and file-handling. Is this acceptable to you?

Sep 18 2022, 11:21 AM · Restricted Project, Restricted Project

Sep 15 2022

Maetveis updated the diff for D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.

Remove inline elements from the map. Add tests. With the tests I now consider this complete, so removed the draft from the title.

Sep 15 2022, 2:07 PM · Restricted Project, Restricted Project

Sep 13 2022

Maetveis added a comment to D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.

I'm a little confused as to what is being proposed here. Is this building on D131469 or is it an alternative? It seems that there are portions of the code from D131469 included in these changes, which implies that you are building on it. I think this patch is premature in that the other patch has not yet landed (@MaskRay has asked for revisions that @dongjunduo has made but is waiting for review). When that patch has landed, this could be reposted based on those changes.

Sep 13 2022, 1:10 PM · Restricted Project, Restricted Project

Sep 11 2022

Maetveis retitled D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs from [Clang] Change -ftime-trace storing path and support multiple compilation jobs to [Clang] WIP: Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 6:58 AM · Restricted Project, Restricted Project
Maetveis added reviewers for D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs: jamieschmeiser, Whitney, steven_wu, MaskRay.
Sep 11 2022, 2:46 AM · Restricted Project, Restricted Project
Maetveis updated the summary of D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 2:41 AM · Restricted Project, Restricted Project
Maetveis updated the summary of D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 2:35 AM · Restricted Project, Restricted Project
Maetveis updated the summary of D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 2:35 AM · Restricted Project, Restricted Project
Maetveis updated the summary of D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 2:34 AM · Restricted Project, Restricted Project
Maetveis updated the summary of D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 2:34 AM · Restricted Project, Restricted Project
Maetveis requested review of D133662: [Clang] Change -ftime-trace storing path and support multiple compilation jobs.
Sep 11 2022, 2:34 AM · Restricted Project, Restricted Project

Aug 10 2022

Maetveis added a comment to D130108: git-clang-format: format index not worktree when using --staged.

Sorry, I somehow missed the big obvious checkbox to mark the comments, fixed now.
I don't have commit access could you commit if the last change seems okay to you?

Aug 10 2022, 11:37 AM · Restricted Project, Restricted Project, Restricted Project
Maetveis updated the diff for D130108: git-clang-format: format index not worktree when using --staged.
Aug 10 2022, 11:32 AM · Restricted Project, Restricted Project, Restricted Project

Aug 9 2022

Maetveis updated the diff for D130108: git-clang-format: format index not worktree when using --staged.

Address formatting and stylistic comments.

Aug 9 2022, 1:34 PM · Restricted Project, Restricted Project, Restricted Project

Aug 8 2022

Maetveis added a comment to D130108: git-clang-format: format index not worktree when using --staged.

So trying to understand the problem statement, is it:

  1. you have some files staged
  2. and then you have to change them locally (review comment)
  3. but forget to git add them,
  4. you run git clang-format --staged
  5. and it formats the new file (not added) based on the line numbers that are changing in the already staged file, which could be a different range to the original (is that correct?)
  6. You think you've formatted the new file correctly (but likely missed a line or two)
  7. You perform git add (as you now notice you've missed adding it)
  8. You commit, but a commit hook tells you that you didn't clang format. (or worse still you are able to commit and push but you are pushing a badly formatted file, potentially)

is that correct?

Aug 8 2022, 10:39 AM · Restricted Project, Restricted Project, Restricted Project

Aug 6 2022

Maetveis added a comment to D130108: git-clang-format: format index not worktree when using --staged.

Friendly Ping.

Aug 6 2022, 4:41 AM · Restricted Project, Restricted Project, Restricted Project

Jul 29 2022

Maetveis updated the diff for D130108: git-clang-format: format index not worktree when using --staged.

Updated to add missing comma after 'git ls-tree'. It broke normal (non staged) operation.

Jul 29 2022, 2:57 AM · Restricted Project, Restricted Project, Restricted Project
Maetveis added a comment to D130108: git-clang-format: format index not worktree when using --staged.
Jul 29 2022, 1:36 AM · Restricted Project, Restricted Project, Restricted Project

Jul 19 2022

Maetveis requested review of D130108: git-clang-format: format index not worktree when using --staged.
Jul 19 2022, 11:59 AM · Restricted Project, Restricted Project, Restricted Project