This is an archive of the discontinued LLVM Phabricator instance.

AMDGPU: Add pass to cleanup DAG SALU/VALU messes
AbandonedPublic

Authored by arsenm on Aug 3 2017, 4:09 PM.

Details

Reviewers
tstellar
rampitec
Summary

This is a pass for selecting instructions which
should be patterns, if we were aware of whether
it's a scalar or vector operation.

For now select v_cvt_pk_u16_u32 with it. This should
be enabled for SDWA subtargets, but that requires more
work and test updates.

Diff Detail

Event Timeline

arsenm created this revision.Aug 3 2017, 4:09 PM
rampitec edited edge metadata.Aug 3 2017, 9:09 PM

The actual HW is SDWA enabled, so usability is limited, even though can be extended.
Anyway, please consider D35267, which may be a more general approach.

arsenm added a comment.Aug 4 2017, 5:08 PM

The actual HW is SDWA enabled, so usability is limited, even though can be extended.
Anyway, please consider D35267, which may be a more general approach.

What do you mean? SDWA was a new in VI feature. This also needs to be more precise and check if the source supports writing an SDWA output etc.

arsenm abandoned this revision.Feb 22 2019, 7:02 AM