This is an archive of the discontinued LLVM Phabricator instance.

[AMDGPU] Fix latency for implicit vcc_lo operands on GFX10 wave32
ClosedPublic

Authored by foad on Oct 22 2021, 7:39 AM.

Details

Summary

As described in the comment, the way we change vcc to vcc_lo in these
operands confuses addPhysRegDataDeps into treating them as implicit
pseudo operands. Fix this by setting the correct latency from the
SchedModel after addPhysRegDataDeps wrongly set it to 0.

Diff Detail

Event Timeline

foad created this revision.Oct 22 2021, 7:39 AM
foad requested review of this revision.Oct 22 2021, 7:39 AM
Herald added a project: Restricted Project. · View Herald TranscriptOct 22 2021, 7:39 AM

Really we should have wave32 variants of all instructions

foad added a comment.Oct 22 2021, 8:48 AM

Really we should have wave32 variants of all instructions

I agree, but I'd still like to be able to fix minor scheduling problems without tackling that fundamental issue.

rampitec accepted this revision.Oct 22 2021, 10:17 AM
This revision is now accepted and ready to land.Oct 22 2021, 10:17 AM
This revision was landed with ongoing or failed builds.Oct 22 2021, 12:15 PM
This revision was automatically updated to reflect the committed changes.