Details
- Reviewers
arsenm - Commits
- rGfa592180b3f4: [AMDGPU] Add more llc tests for 48-bit mul generation.
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
llvm/test/CodeGen/AMDGPU/mul_int24.ll | ||
---|---|---|
184–186 | We should have been able to eliminate these shifts since the high bits are known unused. Maybe we need a SimplifyDemandedBits combine on the mul24/mulhi24 sources? |
llvm/test/CodeGen/AMDGPU/mul_uint24-amdgcn.ll | ||
---|---|---|
581–582 | This seems to show that we've done a simplify-demanded-bits for the mul (because it is using the original v0 and v2) but not for the mul_hi (which is using the ANDed v1 and v3). Maybe AMDGPUTargetLowering::performIntrinsicWOChainCombine needs to call simplifyMul24 for the new mulhi intrinsics? |
We should have been able to eliminate these shifts since the high bits are known unused. Maybe we need a SimplifyDemandedBits combine on the mul24/mulhi24 sources?