Try to access pieces 4 bytes at a time. This helps
various hasOneUse extract_vector_elt combines, such
as load width reductions.
Avoids test regressions in a future commit.
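
As a rough illustration of what "access pieces 4 bytes at a time" can mean, the standalone sketch below models the arithmetic on plain integers: view a vector of 8-bit or 16-bit elements as 32-bit words, pick the word that holds the requested element, then shift and truncate. It is not the code from SIISelLowering.cpp. The helper names `packToDwords` and `extractViaDword` are hypothetical, and the layout (element 0 in the low bits of word 0) is an assumption made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: pack sub-dword elements into 32-bit words.
// Assumed layout: element 0 occupies the low bits of word 0.
std::vector<uint32_t> packToDwords(const std::vector<uint32_t>& elts,
                                   unsigned eltBits) {
    assert((eltBits == 8 || eltBits == 16) && "sketch covers 8/16-bit elements only");
    assert((elts.size() * eltBits) % 32 == 0 && "total size must be a multiple of 32 bits");
    std::vector<uint32_t> dwords(elts.size() * eltBits / 32, 0);
    const uint32_t mask = (1u << eltBits) - 1;
    for (std::size_t i = 0; i < elts.size(); ++i) {
        const std::size_t bitIndex = i * eltBits;
        dwords[bitIndex / 32] |= (elts[i] & mask) << (bitIndex % 32);
    }
    return dwords;
}

// Hypothetical helper: read element `idx` through the 32-bit view.
// The three steps loosely mirror: extract the containing 32-bit element,
// shift right by the leftover bit offset, truncate to the element width.
uint32_t extractViaDword(const std::vector<uint32_t>& dwords,
                         unsigned idx, unsigned eltBits) {
    const unsigned bitIndex = idx * eltBits;
    const unsigned dwordIdx = bitIndex / 32;   // which 4-byte piece to read
    const unsigned leftover = bitIndex % 32;   // bit offset inside that piece
    assert(dwordIdx < dwords.size() && "index out of range");
    const uint32_t shifted = dwords[dwordIdx] >> leftover;
    return shifted & ((1u << eltBits) - 1);    // keep only the element's bits
}
```

For example, with eight 8-bit elements, element 5 sits at bit 40, so the sketch reads word 1 and shifts right by 8. When several small extracts land in the same word, they all reduce to the same single 32-bit access, which is the kind of sharing a load width reduction can then exploit.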
Differential D48126
AMDGPU: Add combine for short vector extract_vector_elts
Closed · Public
Authored by arsenm on Jun 13 2018, 7:31 AM.
Event Timeline
Herald added subscribers: t-tye, tpr, dstuttard and 4 others. (Jun 13 2018, 7:31 AM)
This revision is now accepted and ready to land. (Jun 13 2018, 11:23 AM)
Revision Contents
Diff 151161
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/extract_vector_elt-f16.ll
test/CodeGen/AMDGPU/extract_vector_elt-i16.ll
test/CodeGen/AMDGPU/extract_vector_elt-i8.ll