HomePhabricator

[AMDGPU] Fix DPP combiner

Description

[AMDGPU] Fix DPP combiner

Fixed issue with identity values and other cases, f32/f16 identity values to be added later. fma/mac instructions is disabled for now.
Test is fully reworked, added comments. Other fixes:

  1. dpp move with uses and old reg initializer should be in the same BB.
  2. bound_ctrl:0 is only considered when bank_mask and row_mask are fully enabled (0xF). Othervise the old register value is checked for identity.
  3. Added add, subrev, and, or instructions to the old folding function.
  4. Kill flag is cleared for the src0 (DPP register) as it may be copied into more than one user.

Differential revision: https://reviews.llvm.org/D55444

Details

Committed
vpykhtinJan 9 2019, 5:43 AM
Differential Revision
D55444: AMDGPU: Fix DPP combiner
Parents
rL350720: [clangd] Add a test for SignatureHelp on dynamic index.
Branches
Unknown
Tags
Unknown