4-byte alignment is good enough in all address spaces,
so exit early for that case.
This fixes overly conservative handling of stack accesses
of larger types.
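The early exit described above can be sketched roughly as follows. This is a standalone illustration, not the code from the diff: the `AddrSpace` enum, the function name `allowsMisalignedAccessSketch`, and the conservative fallback for sub-dword alignment are all assumptions made for the example.

```cpp
// Hypothetical address-space tags for illustration only; these are not
// the enumerators used by the AMDGPU backend.
enum class AddrSpace { Global, Local, Private };

// Sketch of a misaligned-access query. Returns true if an access with the
// given alignment (in bytes) is treated as legal.
inline bool allowsMisalignedAccessSketch(AddrSpace as, unsigned alignBytes) {
  // Early exit: 4-byte alignment is sufficient regardless of the address
  // space or the size of the accessed type.
  if (alignBytes >= 4)
    return true;

  // Placeholder for the sub-dword-aligned cases (assumption): a real hook
  // would consult the address space and subtarget features here. This
  // sketch simply rejects them.
  (void)as;
  return false;
}
```

With a check of this shape, a large type on the stack (private address space) that is only 4-byte aligned is accepted immediately rather than falling into a per-address-space path.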
Differential D29601
AMDGPU: Simplify allowsMisalignedMemoryAccesses
Abandoned, Public
Authored by arsenm on Feb 6 2017, 1:20 PM.
Event Timeline
arsenm retitled this revision from "AMDGPU: Simplify isLegalAddressingMode" to "AMDGPU: Simplify allowsMisalignedMemoryAccesses". Feb 6 2017, 1:21 PM
Missed test update. This changes the vectorizer to start combining private accesses, which will need to be split up later. This isn't exactly ideal, but I think it is correct from an alignment perspective. It might be desirable to add a different hook for the vectorizer to check.
Revision Contents
Diff 87345
lib/Target/AMDGPU/SIISelLowering.cpp
test/CodeGen/AMDGPU/unaligned-load-store.ll
test/Transforms/LoadStoreVectorizer/AMDGPU/merge-stores-private.ll