We define mov/update dpp intrinsics as overloaded but do not
support i64, which is a practically useful type. Fix the
selection and lowering.
Details
Details
Diff Detail
Diff Detail
Event Timeline
llvm/lib/Target/AMDGPU/SIInstructions.td | ||
---|---|---|
1869–1875 | Why not do the split here? Why treat it as a post-RA pseudo? At latest I would have expected this to be expanded in FinalizeISel |
Comment Actions
GCNDPPCombiner can split the new pseudo and then handle the split.
Post-RA split is needed anyway since combining is an optimization.
Tests are updated to handle case w/o optimization.
llvm/lib/Target/AMDGPU/SIInstructions.td | ||
---|---|---|
1869–1875 | I still don't see the point of the pseudo. You can just emit the reg_sequence directly here? |
Why not do the split here? Why treat it as a post-RA pseudo? At latest I would have expected this to be expanded in FinalizeISel