User Details
- User Since
- May 4 2020, 11:17 AM (150 w, 6 d)
Today
Yesterday
The generator expressions should only be used for NVPTX.
Fri, Mar 24
LG, might make sense to move these off the device anyway. Fewer things to fail. I'm assuming these all pass as you've written them.
Updating to simply add an entirely new attribute again. The existing
CUDAGlobal attribute does what we want, but it's also highly coupled with the
CUDA language. This made it pretty much impossible to find a way to re-use it
without breaking existing functionality. The amount of code duplicated is
minimal and this is required to be able to emit a callable kernel targeting
NVPTX directly. I'd like to use this for my ongoing GPU libc project so I'd
appreciate someone looking at this again.
LG as long as we can fix it later once the bot picks it up.
Landing this because my build has been broken for the past few hours.
Thu, Mar 23
@tra would it be possible to go to the earlier version that simply duplicated a slight amount of logic to introduce the new and separate attribute nvptx_kernel? Overloading CUDA's device attribute is problematic because it's used and checked in several different contexts. I'd like to be able to simplify this code https://github.com/llvm/llvm-project/blob/main/libc/startup/gpu/nvptx/start.cpp.
This was committed externally and I forgot to link it up.
Checking allocation return values.
Return nullptr early if the allocation returns null.
Forgot to check errors on the AMD implementation.
Addressing comments.
Moving device copying functions into a common utility.
Wed, Mar 22
If you include Fixes: <bug url> in the commit message it'll automatically close it by the way.
Tue, Mar 21
LG
We should have a clang test as well
Hopefully this works.
Mon, Mar 20
I can commandeer tomorrow
Fri, Mar 17
Remove whitespace
Changing message
Addressing comments
Updating, error message now looks like this.
We should only set the special libc.utils.gpu.loader target once for the target under test.
Thu, Mar 16
LGTM, it's much simpler for now since Flang doesn't support CUDA, HIP, OpenCL, OpenMP, etc.
Addressing comments
Wed, Mar 15
Merging the dependency list and addressing comments. I'll try to remove the other dependencies by picking them up from the startup in a later patch.