User Details
- User Since
- Aug 20 2022, 9:07 AM (40 w, 6 d)
Apr 12 2023
Apr 3 2023
Test added
Test updated.
This should work 🤞
Test updated.
Apr 2 2023
Test fixed
Mar 30 2023
Test with filter clause added
Mar 24 2023
Adding masked support to the device
Mar 23 2023
Generated functions included in the test
The parallel masked directive is emitted as a combined directive that consists of two implicit directives: 'parallel' with 'masked' directive.
Mar 20 2023
Patch updated to the trunk version.
Changes compared to the last patch:
- the HasDataTransfer flag was added.
- the functions AsyncInfoTy *get() was added to both AsyncInfoTy and TaskAsyncInfoWrapperTy
Feb 8 2023
Feb 7 2023
Jan 7 2023
Invalid state of FnAA is now considered
Minor changes
Jan 6 2023
This new patch addresses comments from reviewers
Jan 5 2023
Jan 4 2023
Dec 15 2022
ping
Dec 6 2022
ping
Nov 21 2022
This patch addresses comments from reviewers:
- Env var name is LIBOMPTARGET_INTRA_THREAD_ASYNC
- The pattern to get and destroy the AsyncInfoTy object changed.
- A DenseMap is used instead of the std::map.
Oct 28 2022
Oct 26 2022
Ping
Oct 21 2022
Oct 20 2022
Oct 19 2022
This new patch addresses all previous comments from reviewers
Oct 18 2022
ping
Sep 26 2022
This new version has the following changes:
Sep 19 2022
Documentation added
Sep 13 2022
diff updated.
- The AsyncInfoManager is now part of the omptarget.h and not device.h
- Global variable AIM was added in interface.cpp
- Flag was added to the Synchronization in asyncinfo destructor
Sep 5 2022
Aug 20 2022
As long as the RTL of the device provides a queue mechanism to execute target tasks sequentially, the dependencies among different tasks will be respected, for NVIDIA GPUs the queue mechanism is the stream. This patch, in contrast with the current OpenMP offloading implementation, launches the execution of tasks into the same stream.