Page MenuHomePhabricator

[libomptarget] [amdgpu] Change default number of teams per computation unit
Needs ReviewPublic

Authored by dhruvachak on Mar 19 2021, 7:00 PM.

Details

Summary

This patch is related to https://reviews.llvm.org/D98832. Based on discussions there, I decided to separate out the teams default as this patch. This change is to increase the number of teams per computation unit so as to provide more wavefronts for hiding latency. This change improves performance for some programs, including 20-50% for some Stream benchmarks.

Diff Detail

Event Timeline

dhruvachak created this revision.Mar 19 2021, 7:00 PM
dhruvachak requested review of this revision.Mar 19 2021, 7:00 PM
Herald added a project: Restricted Project. · View Herald TranscriptMar 19 2021, 7:00 PM