This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU - Add diagnostic for compiling modules with AMD HSA OS type and GFX 6 arch
AbandonedPublic

Authored by pvellien on Nov 25 2020, 10:17 AM.

Details

Summary

Bail out from compiling modules for GFX6 + AMD HSA OS type as HSA is not supported for SI ASICs. Currently gfx6+hsa setup crashing during ISel for global load/stores due to lack of FLAT instructions. This patch add a check to report error when modules are compiled with -mtriple=amdgcn-amd-amdhsa -mcpu=gfx600 and exit from compilation pipeline.

Diff Detail

Event Timeline

pvellien created this revision.Nov 25 2020, 10:17 AM
Herald added projects: Restricted Project, Restricted Project. · View Herald TranscriptNov 25 2020, 10:17 AM
pvellien requested review of this revision.Nov 25 2020, 10:17 AM

You need to add a new test for this new error.

llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
134

"do not support". I would also drop "(SI)" from the message. Maybe even better just "GFX6 does not support AMD HSA".

201

Please keep original formatting.

llvm/test/CodeGen/AMDGPU/directive-amdgcn-target.ll
1

You probably just need to change triple for these targets, not just drop them from the test.

llvm/test/CodeGen/AMDGPU/lower-kernargs-si-mesa.ll
2

There are no such checks?

pvellien updated this revision to Diff 308015.Nov 27 2020, 4:06 AM

Updated with stanislav comments

t-tye requested changes to this revision.Nov 27 2020, 9:43 AM
t-tye added inline comments.
llvm/docs/AMDGPUUsage.rst
2109–2112

This is not the right place for mentioning this. The Processor table would likely be a better place. It should be in terms of supporting the amdhsa ABI.

llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
62–72

I am not clear what this function is doing. It seems to be returning a generation unrelated to to the actual target generation to satisfy the one place it is called. If the target is not SEA_ISLANDS it seems incorrect to be returning SEA_ISLANDS. If this function is doing something special for the one place it is called perhaps it should be expanded there?

134

Make the message include the full target triple text so the user understands how to resolve the issue. For example:

The target triple %s is not supported: the processor %s does not support the amdhsa OS

Do the r600 targets also produce a similar error message?

Is this really the right test? My understanding is that the issue is not that gfx60x does not support the amdhsa OS, but that it does not use the FLAT address space.

My understanding is that the current problem is that FLAT instructions are being used for the GLOBAL address space accesses. The use of FLAT instructions for the global address space was introduced after gfx60x was initially being supported on amdhsa. Originally BUFFER instructions that use an SRD that has a 0 base and are marked as addr64 where used for GLOBAL address space accesses. This was changed to use FLAT instructions due to later targets dropping the SRD addr64 support. I suspect it is that change that broke gfx60x as there were no tests to catch it.

So the real fix seems to find that change and make the code still use use BUFFER instructions for gfxx60x and FLAT instructions for gfx70x+. The tests can then be updated to test gfx60x for amdhsa but to omit the FLAT address space tests. The error would then indicate that the gfx60x does not support the FLAT address space (and that is not conditional on the OS). The documentation in AMDGPUUsage can state that gfx60x does not support the FLAT address space in the Address Space section. The Processor table can add a column for processor characteristics and mention that the gfx60x targets do not support the FLAT address space.

This revision now requires changes to proceed.Nov 27 2020, 9:43 AM
pvellien added inline comments.Nov 27 2020, 10:40 AM
llvm/docs/AMDGPUUsage.rst
2109–2112

Thanks for your feedback, I will update this

llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
134

Previously in the internal review process it mentioned that gfx60x does not support HSA and agreed to add a diagnostic to report that GFX6 do not support HSA OS type, @rampitec mentioned that SI ASICs cannot support HSA because we can't able to map memory on SI as HSA requires so the user will just have weird runtime failures. But based on your comment it seems like we have to use MUBUF instructions for -mtriple=amdgcn-amd-amdhsa -mcpu=gfx60x combination and use FLAT instructions for -mtriple=amdgcn-amd-amdhsa -mcpu=gfx70x+. Is my understanding correct? If the compiler emits the MUBUF instructions for global address space accesses, it is still required to produce the error msg?

t-tye added inline comments.Nov 27 2020, 11:36 AM
llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
134

In the early days of implementing HSA I believe we were bringing up on gfx6. It could not support all HSA features, but it did function with the parts it could support. So I was suggesting we restore the code to support what it did originally. That would mean using MUBUF for the GLOBAL address space like it used to do (is that code still present?).

The compiler can then report errors for the features it cannot support, which in this case is it cannot support instruction selection of the GENERIC address space on gfx6.

If you could find the commit that switched to using FLAT instructions to access the GLOBAL address space that will likely provide the necessary information to decide the best thing to do for this issue.

pvellien added inline comments.Nov 27 2020, 11:50 AM
llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
134

I code to select MUBUF instructions for Global address space is still present, In fact my first patch for this issue is to generate MUBUF instructions instead of reporting error. But it got rejected due to SI ASICs do not support HSA.
This is the patch https://reviews.llvm.org/D15543 which switched to using FLAT instructions for global.
So whether the new approach is to off FlatForGlobal flag for -mtriple=amdgcn-amd-amdhsa -mcpu=gfx60x combination and generate MUBUF instructions instead. It would be very much to know what is the expectation :) whether we wait for @rampitec for a comment
Btw, thanks a lot for your feedback.

rampitec added inline comments.Dec 7 2020, 1:01 PM
llvm/lib/Target/AMDGPU/AMDGPUSubtarget.cpp
62–72

It is probably better just print an error if amdhsa is requested but no compatible -mcpu specified (e.g. when we have generic/tahiti etc).

134

We can try to use MUBUF for global, but we certainly cannot support HSA and/or generic pointers on SI.

pvellien abandoned this revision.Dec 24 2020, 2:24 AM

This change is wrong, the different patch is landed in llvm to handle global address space access in gfx60x for HSA Os. So closing it.