This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Use stricter bounds for workitem builtins
Abandoned (Public)

Authored by arsenm on Oct 10 2017, 3:21 PM.

Event Timeline

arsenm created this revision. Oct 10 2017, 3:21 PM
This revision is now accepted and ready to land. Oct 11 2017, 11:31 PM
arsenm updated this revision to Diff 119209. Oct 16 2017, 1:54 PM

Use 1024 for OpenCL

t-tye added inline comments. Oct 16 2017, 8:42 PM
include/clang/Basic/TargetInfo.h, line 1060

Is this specifically tied to OpenCL, or is it the target's maximum supported work-group size? Other languages besides OpenCL have the notion of work-groups, so it would seem better to generalize this.

lib/Basic/Targets/AMDGPU.h, lines 73–74

You probably want to update this comment, as all runtimes are moving to support the full capabilities of the hardware, which is a maximum work-group size of 1024 for all current AMDGCN targets.
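
To illustrate the two comments above, here is a minimal sketch of a language-independent work-group limit and the work-item ID bound it implies. Every name in it (TargetSketch, IdRange, workItemIdRange, AMDGCNSketch) is hypothetical and is not taken from the patch or from clang's TargetInfo; only the 1024 figure comes from this review.

```cpp
#include <cstdint>

// Hypothetical stand-in for a target description; NOT the clang TargetInfo API.
struct TargetSketch {
  // Assumed: the target's maximum work-group size, independent of the
  // source language (OpenCL or otherwise).
  std::uint32_t MaxWorkGroupSize;
};

// Half-open range [Lo, Hi) of values a work-item ID query may return.
struct IdRange {
  std::uint32_t Lo;
  std::uint32_t Hi;
};

// A work-item ID in any dimension is always smaller than the work-group
// size, so the target maximum is a valid upper bound for every kernel.
constexpr IdRange workItemIdRange(const TargetSketch &T) {
  return IdRange{0, T.MaxWorkGroupSize};
}

// 1024 is the value quoted in this review for current AMDGCN targets.
constexpr TargetSketch AMDGCNSketch{1024};

static_assert(workItemIdRange(AMDGCNSketch).Hi == 1024,
              "work-item IDs fall in [0, 1024)");
```

Keeping the limit on the target rather than on a language option would let any front end with work-groups derive the same bound.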

arsenm abandoned this revision. Apr 1 2020, 1:23 PM