Teach convertToThreeAddress to use the V_FMA_F16_gfx9 pseudo (i.e. the
standard instruction in GFX9 onwards) instead of V_FMA_F16 (the legacy
pseudo for GFX8 compatibility, which is no longer supported in GFX10).
This follows the example of macToMad in SIFoldOperands.
Details
Details
- Reviewers
arsenm rampitec - Commits
- rG54fc9eb9b313: [AMDGPU] Use v_fma_f16 on GFX10
Diff Detail
Diff Detail
- Repository
- rG LLVM Github Monorepo
Unit Tests
Unit Tests
Time | Test | |
---|---|---|
60 ms | x64 debian > LLVM.Bindings/Go::go.test |
Event Timeline
Comment Actions
We should probably rename the pseudos:
- V_FMA_F16_gfx9 -> V_FMA_F16
- V_FMA_F16 -> V_FMA_F16_vi
and the same for the other 16-bit ones that were renamed in GFX9.
clang-format: please reformat the code