This is an archive of the discontinued LLVM Phabricator instance.

[libomptarget][amdgpu] Fix truncation error for partial wavefront
ClosedPublic

Authored by JonChesterfield on May 13 2021, 8:50 AM.

Details

Summary

[libomptarget][amdgpu] Fix truncation error for partial wavefront

The partial barrier implementation involves one wavefront resetting and N-1
waiting. This change future proofs against launching with a number of threads
that is not a multiple of the wavefront size.

Diff Detail

Event Timeline

JonChesterfield requested review of this revision.May 13 2021, 8:50 AM
Herald added a project: Restricted Project. · View Herald TranscriptMay 13 2021, 8:50 AM
jdoerfert accepted this revision.May 13 2021, 9:03 AM
This revision is now accepted and ready to land.May 13 2021, 9:03 AM
This revision was landed with ongoing or failed builds.May 13 2021, 9:32 AM
This revision was automatically updated to reflect the committed changes.