This is an archive of the discontinued LLVM Phabricator instance.

[SROA] Check typeSizeEqualsStoreSize in isVectorPromotionViable
ClosedPublic

Authored by bjope on Sep 16 2022, 6:02 AM.


Details

Reviewers

MatzeB
efriedma
A-Wadhwani

Commits

rG3f08d248c44c: [SROA] Check typeSizeEqualsStoreSize in isVectorPromotionViable

Summary

Commit de3445e0ef15c4209 (https://reviews.llvm.org/D132096) changed
isVectorPromotionViable to basically do the following:

// Create Vector with size of V, and each element of type Ty
...
uint64_t ElementSize = DL.getTypeStoreSizeInBits(Ty).getFixedSize();
uint64_t VectorSize = DL.getTypeSizeInBits(V).getFixedSize();
...
VectorType *VTy = VectorType::get(Ty, VectorSize / ElementSize, false);

Not quite sure why it uses the TypeStoreSize for the ElementSize,
but the new vector only matches the old vector in size when the
TypeStoreSize equals the TypeSize for Ty.
Therefore this patch adds a typeSizeEqualsStoreSize check as yet
another condition for allowing the new type as a promotion
candidate.
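
The size mismatch described above can be reproduced with plain arithmetic. The following is a minimal sketch, not the SROA code: ScalarTy and candidateVectorBits are hypothetical stand-ins for the DataLayout queries, 8-bit bytes are assumed, and the 128-bit vector size is an assumption based on the <2 x i64> store in @test15.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for a scalar type; not an LLVM class.
struct ScalarTy {
  uint64_t SizeInBits; // models DL.getTypeSizeInBits(Ty)

  // Models DL.getTypeStoreSizeInBits(Ty), assuming 8-bit bytes:
  // the bit size rounded up to a whole number of bytes.
  uint64_t storeSizeInBits() const { return (SizeInBits + 7) / 8 * 8; }

  // Models DL.typeSizeEqualsStoreSize(Ty).
  bool sizeEqualsStoreSize() const { return SizeInBits == storeSizeInBits(); }
};

// Returns the total size in bits of the candidate vector built from
// VectorSize / ElementSize elements of type Ty, or 0 if no candidate
// is created. Vectors are bit-packed, so the total size is the element
// count times the element's type size (not its store size).
uint64_t candidateVectorBits(ScalarTy Ty, uint64_t VectorSize, bool WithFix) {
  if (WithFix && !Ty.sizeEqualsStoreSize())
    return 0; // the guard added by this patch
  uint64_t ElementSize = Ty.storeSizeInBits();
  if (ElementSize == VectorSize || VectorSize % ElementSize != 0)
    return 0;
  uint64_t NumElts = VectorSize / ElementSize;
  return NumElts * Ty.SizeInBits;
}
```

With Ty modeled as i1 (1 bit, store size 8 bits) and a 128-bit vector, the unguarded calculation yields 16 elements of 1 bit each, so the candidate is 16 bits wide rather than 128. With the guard, no candidate is created for i1, while i32 still produces a 128-bit candidate.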

Without this fix the new @test15 test would fail with an assert
like this:

opt: ../lib/Transforms/Scalar/SROA.cpp:1966:

auto isVectorPromotionViable(llvm::sroa::Partition &,
                             const llvm::DataLayout &)
     ::(anonymous class)::operator()(llvm::VectorType *,
                                     llvm::VectorType *) const:
Assertion `DL.getTypeSizeInBits(RHSTy).getFixedSize() ==
           DL.getTypeSizeInBits(LHSTy).getFixedSize() &&
           "Cannot have vector types of different sizes!"' failed.

...
#8 isVectorPromotionViable(...)::$_10::operator()...
#9 llvm::SROAPass::rewritePartition(...)
#10 llvm::SROAPass::splitAlloca(...)
#11 llvm::SROAPass::runOnAlloca(...)
#12 llvm::SROAPass::runImpl(...)
#13 llvm::SROAPass::run(...)

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bjope created this revision. Sep 16 2022, 6:02 AM

Herald added a project: Restricted Project. Sep 16 2022, 6:02 AM

Herald added subscribers: ctetreau, hiraditya, tschuett.

bjope requested review of this revision. Sep 16 2022, 6:02 AM

Herald added a project: Restricted Project. Sep 16 2022, 6:02 AM

Harbormaster completed remote builds in B187114: Diff 460714. Sep 16 2022, 7:18 AM

Adding missing CHECKS to the new test case.

Harbormaster completed remote builds in B187144: Diff 460750. Sep 16 2022, 8:49 AM

Thanks for catching this. I would like someone with more experience in SROA than me to take a look, in case there's something I've missed, though.

bjope added inline comments. Sep 19 2022, 8:19 AM

llvm/lib/Transforms/Scalar/SROA.cpp
Line 1936: I should probably add an explanatory code comment here (assuming this solution is accepted).

Background on why I used the typeSizeEqualsStoreSize check: the idea is to make a check similar to the one at line 1859 ("While the definition of LLVM vectors is bitpacked, we don't support sizes that aren't byte sized."), although that isn't exactly what typeSizeEqualsStoreSize tests. I'm not sure whether it would be better to check for "byte sized". It might work as well, but hard-coding another "8" here would complicate things for our out-of-tree target with 16-bit bytes.

Anyhow, for the "VectorSize / ElementSize" calculation below to produce a vector with the same size as the original vector, I think typeSizeEqualsStoreSize needs to hold. Alternatively, we could change the ElementSize calculation to use getTypeSizeInBits instead of getTypeStoreSizeInBits, but then I suspect the transform might be unsafe when typeSizeEqualsStoreSize doesn't hold. So checking typeSizeEqualsStoreSize seemed like a defensive way to avoid the corner cases without sacrificing the normal cases that we want to handle (and that are regression tested).
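
To illustrate the difference between a hard-coded "byte sized" test and a store-size comparison, here is a minimal sketch under stated assumptions. The function names are hypothetical, the byte width is passed as a parameter purely for illustration (upstream LLVM assumes 8-bit bytes), and the 16-bit case is only a guess at how a target like the one mentioned in the comment might behave.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of a store size: the bit size rounded up to a
// whole number of bytes, where ByteBits is the byte width in bits.
uint64_t modelStoreSizeInBits(uint64_t SizeInBits, uint64_t ByteBits) {
  return (SizeInBits + ByteBits - 1) / ByteBits * ByteBits;
}

// Hypothetical model of typeSizeEqualsStoreSize for a given byte width.
bool modelSizeEqualsStoreSize(uint64_t SizeInBits, uint64_t ByteBits) {
  return SizeInBits == modelStoreSizeInBits(SizeInBits, ByteBits);
}

// A "byte sized" test with the byte width hard-coded to 8.
bool isByteSizedHardcoded8(uint64_t SizeInBits) {
  return SizeInBits % 8 == 0;
}
```

In this model the two tests agree for every size when bytes are 8 bits wide. With 16-bit bytes they diverge: an 8-bit type passes the hard-coded test, but its modeled store size is 16 bits, so the store-size comparison rejects it.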

Seems sensible. Thanks for catching and fixing. LGTM

This revision is now accepted and ready to land. Sep 20 2022, 1:12 PM

Closed by commit rG3f08d248c44c: [SROA] Check typeSizeEqualsStoreSize in isVectorPromotionViable (authored by bjope). Sep 21 2022, 1:07 AM

This revision was automatically updated to reflect the committed changes.

bjope added a commit: rG3f08d248c44c: [SROA] Check typeSizeEqualsStoreSize in isVectorPromotionViable.

MatzeB mentioned this in D132096: [SROA] Create additional vector type candidates based on store and load slices. Sep 23 2022, 9:17 AM

dyung added a reverting change: rG0a7f4e03a9a1: Revert "[SROA] Check typeSizeEqualsStoreSize in isVectorPromotionViable". Sep 23 2022, 12:24 PM

zhuhan0 mentioned this in D143225: [SROA] Create additional vector type candidates based on store and load slices. Feb 2 2023, 3:56 PM

zhuhan0 mentioned this in rGf9c2a341b94c: [SROA] Create additional vector type candidates based on store and load slices. Mar 8 2023, 12:01 PM

Revision Contents

Path	Size
llvm/lib/Transforms/Scalar/SROA.cpp	2 lines
llvm/test/Transforms/SROA/vector-promotion.ll	16 lines

Diff 461807

llvm/lib/Transforms/Scalar/SROA.cpp

  for (const Slice &S : P) {
  ...
    if (LoadInst *LI = dyn_cast<LoadInst>(S.getUse()->getUser()))
      Ty = LI->getType();
    else if (StoreInst *SI = dyn_cast<StoreInst>(S.getUse()->getUser()))
      Ty = SI->getValueOperand()->getType();
    else
      continue;
    if (isa<VectorType>(Ty))
      continue;
+   if (!DL.typeSizeEqualsStoreSize(Ty))
+     continue;
    // Create Vector with size of V, and each element of type Ty
    VectorType *V = CandidateTys[0];
    uint64_t ElementSize = DL.getTypeStoreSizeInBits(Ty).getFixedSize();
    uint64_t VectorSize = DL.getTypeSizeInBits(V).getFixedSize();
    if ((ElementSize != VectorSize) && (VectorSize % ElementSize == 0)) {
      VectorType *VTy = VectorType::get(Ty, VectorSize / ElementSize, false);
      CandidateTys.push_back(VTy);
      if (CommonEltTy != Ty)
  ...

llvm/test/Transforms/SROA/vector-promotion.ll

  entry:
  ...
    %x.tmp4 = getelementptr inbounds i32, ptr %x.cast, i64 3
    %d = load i32, ptr %x.tmp4

    %add = add i32 %a, %b
    %add1 = add i32 %c, %d
    %add2 = add i32 %add, %add1
    ret i32 %add2
  }
+
+ ; This used to hit an assert after commit de3445e0ef15c4.
+ ; Added as regression test to verify that we handle this without crashing.
+ define i1 @test15() {
+ ; CHECK-LABEL: @test15(
+ ; CHECK-NEXT: [[A_SROA_0:%.*]] = alloca <2 x i64>, align 32
+ ; CHECK-NEXT: store <2 x i64> <i64 0, i64 -1>, ptr [[A_SROA_0]], align 32
+ ; CHECK-NEXT: [[A_SROA_0_0_A_SROA_0_0_L:%.*]] = load i1, ptr [[A_SROA_0]], align 32
+ ; CHECK-NEXT: ret i1 [[A_SROA_0_0_A_SROA_0_0_L]]
+ ;
+   %a = alloca <8 x i32>
+   store <2 x i64> <i64 0, i64 -1>, ptr %a
+   %l = load i1, ptr %a, align 1
+   ret i1 %l
+
+ }