This is an archive of the discontinued LLVM Phabricator instance.

[OpenMP] Increase opportunity for parallel kernel launch in AMDGPUs: add multiple HSA queues per device in plugin
ClosedPublic

Authored by carlo.bertolli on Dec 14 2021, 5:47 PM.

Details

Summary

This patch extends the AMDGPU plugin for OpenMP target offloading from using a single HSA queue per device to using multiple queues per device (four in this patch). This allows multiple host threads to submit kernel launches to the same GPU concurrently.
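
The summary does not say how a launching thread picks one of the four queues. As a minimal sketch only, assuming a round-robin policy, a per-device pool could look like the following. `MockQueue`, `DeviceQueuePool`, and `next()` are hypothetical names for illustration and are not taken from the patch; a real plugin would hold HSA queue handles rather than the stand-in struct used here.

```cpp
#include <array>
#include <atomic>
#include <cstddef>

// Hypothetical stand-in for an HSA queue handle. This sketch does not
// call into the HSA runtime at all.
struct MockQueue {
  int id;
};

// Pool of queues for one device. The size of four matches the patch
// summary; the round-robin selection policy is an assumption.
class DeviceQueuePool {
public:
  static constexpr std::size_t NumQueues = 4;

  DeviceQueuePool() {
    for (std::size_t i = 0; i < NumQueues; ++i)
      queues_[i].id = static_cast<int>(i);
  }

  // Safe to call from several host threads at once: fetch_add hands each
  // caller a distinct ticket, which is then mapped onto a queue index.
  MockQueue &next() {
    std::size_t ticket = next_.fetch_add(1, std::memory_order_relaxed);
    return queues_[ticket % NumQueues];
  }

private:
  std::array<MockQueue, NumQueues> queues_;
  std::atomic<std::size_t> next_{0};
};
```

With one such pool per device, each launching thread would call `next()` and submit its kernel to the returned queue, so launches from different threads are spread across the queues instead of all going through a single one.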

Event Timeline

carlo.bertolli created this revision. Dec 14 2021, 5:47 PM
carlo.bertolli requested review of this revision. Dec 14 2021, 5:47 PM
JonChesterfield accepted this revision. Dec 15 2021, 12:35 AM

LG, thanks!

This revision is now accepted and ready to land. Dec 15 2021, 12:35 AM
Herald added a project: Restricted Project. Dec 15 2021, 7:35 AM