This is an archive of the discontinued LLVM Phabricator instance.

[HIP] Change default --gpu-max-threads-per-block value to 1024
ClosedPublic

Authored by yaxunl on Mar 25 2020, 11:54 AM.

Details

Diff Detail

Event Timeline

yaxunl created this revision.Mar 25 2020, 11:54 AM
b-sumner added inline comments.Mar 25 2020, 12:36 PM
clang/lib/CodeGen/TargetInfo.cpp
8825

I'd like to see the word default, e.g. OpenCLDefaultMaxWorkGroupSize, used more since that is what this is about. Ideally the option would have been named gpu-default-max-threads-per-block, but I suppose I can see why it was shortened.

yaxunl updated this revision to Diff 252661.Mar 25 2020, 1:47 PM

change variable names

yaxunl marked 2 inline comments as done.Mar 25 2020, 1:49 PM
yaxunl added inline comments.
clang/lib/CodeGen/TargetInfo.cpp
8825

changed variable names.

Thanks. This looks fine to me.

tra accepted this revision.Mar 30 2020, 9:51 AM
This revision is now accepted and ready to land.Mar 30 2020, 9:51 AM
This revision was automatically updated to reflect the committed changes.
yaxunl marked an inline comment as done.
Herald added a project: Restricted Project. · View Herald TranscriptJun 3 2020, 8:14 AM