Details
Details
- Reviewers
jdoerfert ronlieb dpalermo dhruvachak
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
Comment Actions
Low priority, posting it so I don't forget about it. Would remove the only reviewer but phab automatically re-adds you.
openmp/libomptarget/deviceRTLs/amdgcn/src/target_impl.h | ||
---|---|---|
48 | As a performance optimisation, this is probably in the noise. However it will eliminate all the warp32 vs wave64 differences in the deviceRTL, making gfx10 a useful datapoint for debugging works on nvptx and fails on amdgpu. That is, if gfx10 works, it suggests the bug is in wave size. If it fails, it suggests the bug is not in wave size. |
As a performance optimisation, this is probably in the noise.
However it will eliminate all the warp32 vs wave64 differences in the deviceRTL, making gfx10 a useful datapoint for debugging works on nvptx and fails on amdgpu. That is, if gfx10 works, it suggests the bug is in wave size. If it fails, it suggests the bug is not in wave size.