- User Since
- Dec 15 2015, 11:31 AM (213 w, 5 d)
Dec 19 2016
Dec 2 2016
Dec 1 2016
- Switch to more specific error
- "Support" ASAN in CudaToolChain
Before this patch, the following command would fail:
Oct 31 2016
Hi grosser. Sorry for not including the motivation in the commit message. During the review (https://reviews.llvm.org/D25701) I added a comment at the beginning about the motivation for the change in response to the same question from hfinkel. I won't repeat the details here. Instead, I'll just provide the previous link and summarize by saying that the old kernel launch model didn't work with templated CUDA kernels, so I decided not to keep it, but it could return later (hopefully in a more general form).
Oct 27 2016
- Default DeviceIndex for getSymbolMemory
Oct 25 2016
I'm contacting Tanya Lattner to make sure I have set the hooks up correctly for this documentation to be generated and published automatically by the standard LLVM doc scripts. I will wait to check this in until I hear back.
- Add ctors for Expected(Expected<U>)
- Respond to jlebar's comments 2016-10-24
Oct 24 2016
In addition to responding to jlebar's posted comments, I also removed the acxxel::getPlatform function and replaced it with the two functions acxxel::getCUDAPlatform and acxxel::getOpenCLPlatform. I also added a comment to explain that the CUDA and OpenCL platforms are available out of the box with Acxxel, but that other platforms can be created as well. The old acxxel::getPlatform function made it confusing to think about how to add a new platform because it seemed like the new platform should also be registered somehow to put in on equal footing with CUDA and OpenCL. I hope this new design will be clearer in this aspect.
- Remove asserts in OpenCL example
- Respond to jlebar's OpenCL, util comments
Thanks for the review!
Oct 21 2016
- Early exit if not Failure.ShouldFix
Oct 19 2016
- Fix deleted Span container constructor
- Respond to jlebar's comments on cuda_acxxel.cpp
Oct 18 2016
- Remove fixed_vector.h
Latest patch responds to jlebar's comments on acxxel.cpp and does a couple of other things.
- Removes old Platform::getContext function. It used to be used for launching OpenCL kernels, but is not needed now.
- Cleans up a bunch of minor documentation stuff.
- Remove unused Platform::getContext function
- Documentation fixes
- Respond to jlebar's comments on acxxel.cpp
In my latest patch I responded to jlebar's comments about error handling. The new model in this patch is to have each Stream own its own error state, as was done in StreamExecutor. There is now a function to query the state of the Stream, and all the enqueuing functions that used to return Status now return Stream& instead. This means the fluent Stream launching interface is back as it was in StreamExecutor. Maybe we'll keep the name StreamExecutor for this new thing instead of calling it Acxxel in the end, but we'll keep it as Acxxel for now, at least to distinguish it from the old StreamExecutor code.
- New error handling in stream
- Reorganize kernel launch code
- Move enqueueEvent to Stream
- Respond to jlebar's comments 2
- Respond to jlebar's comments
- Keep track of first error status per thread
Oct 17 2016
We've decided to come at this problem from a different angle, so I'm abandoning this revision.
Adding arron.ballman as a reviewer as alexfh seems to be on leave for a few weeks.
Oct 10 2016
I just found and fixed another bug in this patch. Before, I wasn't using the spelling location for the fixit hint. This meant that a macro argument that was expanded to two locations, for example, would have the same fixit hint applied to it twice. My new test case verifies that this does not happen anymore.
- Prevent multiple fixes for macro expansion
I found a bug in my first patch that I have fixed now. I was trying to iterate over the source range by using SourceLocation::getLocWithOffset, but I realized that doesn't work, so I removed it and went back to the original method of checking SourceRange.getBegin().isMacroID() and SourceRange.getEnd().isMacroID().
- Return to original checking for macro in range
alexfh, sorry if you are not the right person to review this change. I based my choice on this history of this file.
Sep 27 2016
Sep 26 2016
Sep 15 2016
- Comment on dyn-shared-memory arg efficiency
- Convert framework library names
Sep 14 2016
- streamexecutor-config report CUDA lib
- Use CMake's standard FindCUDA
- Respond to review comments