Differential D49255
AMDGPU: Split wide vectors of i16/f16 into 32-bit regs on calls
Closed. Public. Authored by arsenm on Jul 12 2018, 10:37 AM.
Details
Summary
This improves code for the same reasons as scalarizing 32-bit element vectors.
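To make the register accounting concrete, here is a minimal sketch. It is not the code from this diff. It assumes that two 16-bit elements are packed into each 32-bit register, so a vector of N i16/f16 elements needs ceil(N/2) registers, while wider elements take one or more registers each. The helper name `numRegsForVectorArg` is hypothetical.

```cpp
// Hypothetical helper, not taken from SIISelLowering.cpp.
// Assumption: 16-bit elements are packed two per 32-bit register;
// any other element width is scalarized, with each element rounded
// up to a whole number of 32-bit registers.
unsigned numRegsForVectorArg(unsigned numElts, unsigned eltBits) {
  if (eltBits == 16)
    return (numElts + 1) / 2;  // ceil(numElts / 2), odd tail gets its own register
  return numElts * ((eltBits + 31) / 32);
}
```

Under these assumptions a `<4 x half>` argument would take two registers and a `<3 x i16>` argument would also take two, with the last register only half used.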
Event Timeline
Herald added subscribers: t-tye, tpr, dstuttard and 4 others. Jul 12 2018, 10:37 AM
arsenm added a parent revision: D49254: AMDGPU: Scalarize vector argument types to calls. Jul 12 2018, 10:37 AM
This revision is now accepted and ready to land. Jul 12 2018, 10:48 AM
Revision Contents
Diff 155215
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/call-argument-types.ll
test/CodeGen/AMDGPU/mad-mix-lo.ll
test/CodeGen/AMDGPU/mul.i16.ll