- User Since
- May 14 2014, 8:22 AM (370 w, 5 d)
Jun 19 2018
Jun 18 2018
May 29 2018
I am not quite familiar with Clang driver set up, I will add Greg for more comments. But I have hacked one the latest YKT tree to support simple AMDGCN path the same way as NVPTX. The last patch is here
May 18 2018
May 15 2018
Can we double check this LLVM IR backward compatible thing? My understanding is that it is not that good, at least for the major versions.
May 4 2018
Update as suggested.
May 3 2018
Update name as suggested.
May 2 2018
Apr 27 2018
Apr 26 2018
Apr 25 2018
Use PRIu64 for chuck
With the proper position of include file, I can use PRId64 now.
Apr 20 2018
Apr 19 2018
I checked again on the format you suggested. To use it, you need #include <inttypes.h> For example the macro was defined like this in my version of the inttypes.h
Apr 15 2018
Apr 13 2018
update as suggested (except PRId64)
Apr 12 2018
Ok, let me abandon this one and think on this again.
Apr 11 2018
update according to suggestion, thanks.
Apr 10 2018
Apr 9 2018
I can see the advantage of dynamic testing. But to test everything dynamically maybe tedious. For bc file handling, I consider this is more of a project need to embed LLVM itself, using find_package is more suitable. http://llvm.org/docs/CMake.html#embedding
Apr 8 2018
I see. I was trying to set the bc file utilities using find_pacakge(llvm...) instead of using COMPILER_ID and common it to an upper level as we search things like elf and ffi.
Apr 5 2018
That is fine. I don't think this conflicts with what you said?
This is only for compiler and linker used for building bclib. It is not the default build compiler.
Apr 3 2018
I see. We compared the change in (4.0?) and after, realized the extern shared is turned off. We reenabled (basically allow it to pass parsing) and everything is expected.
Apr 2 2018
My search got the same conclusion as Greg did, I don't think -fcuda-rdc will change the IR code, except allowing multiple compilation units as specified in the options.td
Apr 1 2018
this fix I had is similar to 751ae2ab8, which is exactly for this.
Mar 28 2018
I mean using -DLIBOMPTARGET_NVPTX_ENABLE_BCLIB=1 on cmake command line, to build the bc libraries. Did you use that flag?
Jan 12 2018
Dec 11 2017
Dec 8 2017
Here is a general comment, we should follow Alex's suggestion to make a generic GPU device and extend that to NVPTX and AMDGCN. My current idea is:
Here is a question, do we require that the alloca size to be compile time constant?
Dec 7 2017
Jul 19 2017
Jun 7 2017
Just to share what I have locally. I have OFFLOAD_DEBUG=1 for the plugin side debug messages, and DEVICE_DEBUG=1 for the deviceRTL side messages.
Mar 15 2017
Mar 14 2017
Thanks, update as suggested.
Mar 13 2017
Mar 10 2017
As suggested, mv the dir to the empty dir down below
Mar 9 2017
Use ElfNote instead of AMD_PT_NOTE as suggested.
Mar 8 2017
add a test
Feb 17 2016
Should we mention that the runtime can be built "in-tree" as well? This simplifies users experience to access OpenMP support.
May 28 2014
I found the patch does not work with the latest trunk. Could you please rebase, or provide a version number the patch is based on?
BTW, I am using git mirror, it will be appropriated to show me how to locate the base. Are we using git log? I did not find any tags...