Page MenuHomePhabricator

[OpenMP] Manually unroll the argument copy loop
AcceptedPublic

Authored by jdoerfert on Sep 2 2021, 9:16 AM.

Details

Summary

The unroll pragma did not properly work as the loop bound was not known
when we optimize the runtime and we then added a "unroll disable"
metadata which prevented unrolling later when the bounds were known.
For now we manually unroll to make sure up to 16 elements are handled
nicely. This helps optimizations to look through the argument passing.

Diff Detail