This is an archive of the discontinued LLVM Phabricator instance.


Differential D144434

[VPlan] Handle invariant GEPs in isUniformAfterVectorization.
ClosedPublic

Authored by fhahn on Feb 20 2023, 3:08 PM.


Details

Reviewers

Ayal
gilr
rengolin

Commits

rG1a28b9bce75d: [VPlan] Handle invariant GEPs in isUniformAfterVectorization.

Summary

This fixes a crash caused by Legal treating a scalable GEP as invariant
while isUniformAfterVectorization does not handle GEPs.

Fixes https://github.com/llvm/llvm-project/issues/60831.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

	Time	Test
	3,010 ms	x64 debian > LLVM.Transforms/LoopVectorize/AArch64::sve-inv-store.ll
	60,050 ms	x64 debian > MLIR.Examples/standalone::test.toy
	60,060 ms	x64 debian > ThreadSanitizer-x86_64.ThreadSanitizer-x86_64::restore_stack.cpp
	60,080 ms	x64 debian > libFuzzer.libFuzzer::fuzzer-leak.test
	60,070 ms	x64 debian > libFuzzer.libFuzzer::minimize_crash.test
		View Full Test Results (7 Failed)

Event Timeline

fhahn created this revision. · Feb 20 2023, 3:08 PM

Herald added a project: Restricted Project. · Feb 20 2023, 3:08 PM

Herald added subscribers: StephenFan, tschuett, psnobl and 3 others.

fhahn requested review of this revision. · Feb 20 2023, 3:08 PM

Herald added a project: Restricted Project. · Feb 20 2023, 3:08 PM

Herald added subscribers: pcwang-thead, alextsao1999, vkmr.

Harbormaster completed remote builds in B214863: Diff 498969. · Feb 20 2023, 4:02 PM

fhahn mentioned this in D144491: [VPlan] Use isUniformAfterVec in VPReplicateRecipe::execute. · Feb 21 2023, 7:35 AM

fhahn added a parent revision: D144491: [VPlan] Use isUniformAfterVec in VPReplicateRecipe::execute. · Feb 21 2023, 7:37 AM

Generate check lines for test.

I think there's a deeper logic issue here. I'm pretty sure this is a bug I introduced, so, sorry!

LoopAccessInfo::isUniform uses SCEV's notion of loop invariant. This covers both things which are currently loop invariant and things whose computation is loop invariant (e.g. uniform).

As you noticed, VPReplicateRecipe::execute uses only whether the recipe was defined. This is essentially a proxy for whether an instruction's *placement* was loop invariant. This explicitly *does not* reason about computation. (I think... this part of things I'm less confident in.)

I think the basic issue here is that a Recipe can correspond to the isUniform Value without itself being uniform-per-part or UniformAfterVectorization. So, I think your change is patching over a bug, not fixing one.
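
A minimal sketch of the placement-versus-computation distinction described above. ToyValue and both functions are hypothetical names made up for illustration; they are not LLVM APIs. The sketch assumes every in-loop value is a pure function of its operands, which real IR does not guarantee (loads, calls, and so on).

```cpp
#include <cassert>
#include <vector>

// Hypothetical toy IR node (an assumption for illustration, not an LLVM class).
struct ToyValue {
  bool DefinedInLoop;  // where the value is placed
  bool IsInductionVar; // true if the value changes on every iteration
  std::vector<const ToyValue *> Operands;
};

// Placement-based check: only asks where the value is defined.
bool isPlacedOutsideLoop(const ToyValue &V) { return !V.DefinedInLoop; }

// Computation-based check: a value is invariant if it is defined outside the
// loop, or if it is not an induction variable and all operands are invariant.
bool computesLoopInvariant(const ToyValue &V) {
  if (!V.DefinedInLoop)
    return true;
  if (V.IsInductionVar)
    return false;
  for (const ToyValue *Op : V.Operands) {
    assert(Op && "operand must be non-null");
    if (!computesLoopInvariant(*Op))
      return false;
  }
  return true;
}
```

A GEP placed inside the loop whose operands are all defined outside it fails the placement check but passes the computation check, which is the kind of mismatch the comment above is pointing at.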

Harbormaster completed remote builds in B215024: Diff 499174. · Feb 21 2023, 9:29 AM

This patch does not solve the issue completely.
However, I don't have enough expertise to comment on what needs to be fixed.

A simplified test case that still breaks the loop vectorizer even after applying this candidate patch is as follows:

test_loop2.ll

target datalayout = "e-m:e-i8:8:32-i16:16:32-i64:64-i128:128-n32:64-S128"
target triple = "aarch64-unknown-linux-gnu"

define dso_local fastcc void @_test_loop2(i64 %init, i64 %n, i8* %store_dest) unnamed_addr #0 {
  br label %1

.critedge5:                                       ; preds = %1
  ret void

1:                                                ; preds = %0, %1
  %2 = phi i64 [ %7, %1 ], [ %init, %0 ]
  %3 = sub nsw i64 %n, %2
  %4 = trunc i64 %n to i8
  %5 = add i64 %2, %3
  %6 = getelementptr i8, i8* %store_dest, i64 %5
  store i8 %4, i8* %6, align 1
  %7 = add nsw i64 %2, 1
  %8 = icmp sle i64 %7, 1000
  br i1 %8, label %1, label %.critedge5
}

attributes #0 = { "target-features"="+crc,+crypto,+fp-armv8,+neon,+sve,+v8.1a,+v8.2a,+v8.3a,+v8.4a,+v8a" }

The issue can be reproduced with the command:

opt -passes=loop-vectorize test_loop2.ll
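
One possible reading of why this reproducer is interesting (an interpretation, not something stated in the comment): the GEP index `%5` is computed as `%2 + (%n - %2)`, which always equals `%n`, so the store address does not change across iterations even though `%5` is built from the varying induction variable `%2`. The sketch below checks that arithmetic. The function names are hypothetical, and it models the IR with wrapping unsigned arithmetic, ignoring the `nsw` flags.

```cpp
#include <cstdint>

// Models %3 = sub i64 %n, %2 and %5 = add i64 %2, %3 from the reproducer.
// Unsigned arithmetic is used so that wraparound is well defined in C++.
std::uint64_t storeOffset(std::uint64_t N, std::uint64_t IV) {
  std::uint64_t Diff = N - IV; // %3
  return IV + Diff;            // %5
}

// Returns true if the offset equals N for every IV in [Init, Last].
// Assumes Last < UINT64_MAX so the loop terminates.
bool offsetIsInvariant(std::uint64_t N, std::uint64_t Init,
                       std::uint64_t Last) {
  for (std::uint64_t IV = Init; IV <= Last; ++IV)
    if (storeOffset(N, IV) != N)
      return false;
  return true;
}
```

An analysis that simplifies the expression may conclude the address is invariant, while a check that only asks whether each GEP operand is itself uniform would not, since `%5` is defined in terms of `%2`.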

fhahn mentioned this in rG07e5f57df4bf: [LV] Add tests for #60831. · Apr 28 2023, 2:42 AM

fhahn mentioned this in rG55903151a2a5: [VPlan] Use isUniformAfterVec in VPReplicateRecipe::execute. · May 19 2023, 10:15 AM

In D144434#4141906, @reames wrote:

I think there's a deeper logic issue here. I'm pretty sure this is a bug I introduced, so, sorry!

LoopAccessInfo::isUniform uses SCEV's notion of loop invariant. This covers both things which are currently loop invariant and things whose computation is loop invariant (e.g. uniform).

As you noticed, VPReplicateRecipe::execute uses only whether the recipe was defined. This is essentially a proxy for whether an instruction's *placement* was loop invariant. This explicitly *does not* reason about computation. (I think... this part of things I'm less confident in.)

At the moment, isUniformAfterVectorization reasons about more than just placement by using information about uniform operands, but relies on uniforms being represented as uniform VPReplicateRecipes. I think the underlying issue is that we don't classify the GEP as uniform correctly. I put up D150991 to use Legal::isUniform to bring the reasoning during classification in line with isUniformMemOp which is used in other places.

In D144434#4149935, @kartcq wrote:

This patch does not solve the issue completely.
However, I don't have enough expertise to comment on what needs to be fixed.

A simplified test case that still breaks the loop vectorizer even after applying this candidate patch is as follows

Thanks, I added the new test in 07e5f57df4bf.

This looks good to me. @reames, @kartcq - ok with you too?

llvm/lib/Transforms/Vectorize/VPlan.h
2710	Propagating uniformity forwards on-demand from operands is fine, for GEPs or other non-phi recipes, where not cached as a result of SCEV/Divergence Analysis. Should this function still be "inline"? The "AfterVectorization" suffix should be dropped in favor of "onlyFirstLaneUsed()" above (independent of this patch).
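
For illustration, the on-demand forward propagation discussed here can be sketched with a hypothetical toy recipe model. ToyKind, ToyRecipe, and isUniformToy are made-up names and not part of VPlan; the sketch only mirrors the shape of the check in this patch: live-ins are uniform, replicate recipes carry their own flag, and a widened GEP is uniform only if all of its operands are.

```cpp
#include <algorithm>
#include <cassert>
#include <vector>

// Hypothetical recipe kinds (assumptions for illustration only).
enum class ToyKind { LiveIn, Replicate, WidenGEP, Widen };

struct ToyRecipe {
  ToyKind Kind;
  bool ReplicateIsUniform; // only meaningful for ToyKind::Replicate
  std::vector<const ToyRecipe *> Operands;
};

// Toy analogue of the patched check: uniformity of a widened GEP is derived
// on demand from the uniformity of its operands.
bool isUniformToy(const ToyRecipe *R) {
  assert(R && "recipe must be non-null");
  if (R->Kind == ToyKind::LiveIn)
    return true; // defined outside the vector region
  if (R->Kind == ToyKind::Replicate)
    return R->ReplicateIsUniform;
  if (R->Kind == ToyKind::WidenGEP)
    return std::all_of(R->Operands.begin(), R->Operands.end(), isUniformToy);
  return false; // any other recipe is conservatively non-uniform
}
```

In this toy model, a GEP over two live-ins is reported as uniform, while a GEP whose index comes from any other widened recipe is not, regardless of what that recipe actually computes.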

This revision is now accepted and ready to land. · May 23 2023, 1:48 PM

I'm fine with this.

Closed by commit rG1a28b9bce75d: [VPlan] Handle invariant GEPs in isUniformAfterVectorization. (authored by fhahn). · May 30 2023, 7:53 AM

This revision was automatically updated to reflect the committed changes.

fhahn added a commit: rG1a28b9bce75d: [VPlan] Handle invariant GEPs in isUniformAfterVectorization.

fhahn marked an inline comment as done. · May 30 2023, 8:17 AM

fhahn added inline comments.

llvm/lib/Transforms/Vectorize/VPlan.h
2710	Will move into VPlan.cpp now that this has grown and rename accordingly.

Revision Contents

Path	Size
llvm/lib/Transforms/Vectorize/VPlan.h	2 lines
llvm/test/Transforms/LoopVectorize/AArch64/sve-inv-store.ll	17 lines

Diff 498969

llvm/lib/Transforms/Vectorize/VPlan.h

 inline bool isUniformAfterVectorization(VPValue *VPV) {
   // A value defined outside the vector region must be uniform after
   // vectorization inside a vector region.
   if (VPV->isDefinedOutsideVectorRegions())
     return true;
   VPRecipeBase *Def = VPV->getDefiningRecipe();
   assert(Def && "Must have definition for value defined inside vector region");
   if (auto Rep = dyn_cast<VPReplicateRecipe>(Def))
     return Rep->isUniform();
+  if (auto *GEP = dyn_cast<VPWidenGEPRecipe>(Def))
+    return all_of(GEP->operands(), isUniformAfterVectorization);
   return false;
 }
 } // end namespace vputils

 } // end namespace llvm

 #endif // LLVM_TRANSFORMS_VECTORIZE_VPLAN_H

llvm/test/Transforms/LoopVectorize/AArch64/sve-inv-store.ll

 for.inc:                                          ; preds = %for.body, %if.then
   %inc = add nuw nsw i64 %i.09, 1
   %exitcond.not = icmp eq i64 %inc, %N
   br i1 %exitcond.not, label %for.end, label %for.body, !llvm.loop !0

 for.end:                                          ; preds = %for.inc, %entry
   ret void
 }

+; Test case for PR60831.
+define void @test_invar_gep(i8* %store_dest) #0 {
+entry:
+  br label %loop
+
+loop:
+  %iv = phi i64 [ 0, %entry ], [ %iv.next, %loop ]
+  %gep.invar = getelementptr i8, i8* %store_dest, i64 0
+  store i64 %iv, i8* %gep.invar, align 1
+  %iv.next = add nsw i64 %iv, 1
+  %ec = icmp eq i64 %iv.next, 100
+  br i1 %ec, label %exit, label %loop, !llvm.loop !0
+
+exit:
+  ret void
+}
+
 attributes #0 = { "target-features"="+neon,+sve" vscale_range(1, 16) }

 !0 = distinct !{!0, !1, !2, !3, !4, !5}
 !1 = !{!"llvm.loop.mustprogress"}
 !2 = !{!"llvm.loop.vectorize.width", i32 4}
 !3 = !{!"llvm.loop.vectorize.scalable.enable", i1 true}
 !4 = !{!"llvm.loop.vectorize.enable", i1 true}
 !5 = !{!"llvm.loop.interleave.count", i32 1}