Diff 51504

lib/Transforms/Scalar/LoopUnrollPass.cpp

Show First 20 Lines • Show All 633 Lines • ▼ Show 20 Lines	if (!AllowPartial && !CountSetExplicitly) {
return false;		return false;
}		}
if (UP.PartialThreshold != NoThreshold &&		if (UP.PartialThreshold != NoThreshold &&
UnrolledSize > UP.PartialThreshold) {		UnrolledSize > UP.PartialThreshold) {
// Reduce unroll count to be modulo of TripCount for partial unrolling.		// Reduce unroll count to be modulo of TripCount for partial unrolling.
Count = (std::max(UP.PartialThreshold, 3u) - 2) / (LoopSize - 2);		Count = (std::max(UP.PartialThreshold, 3u) - 2) / (LoopSize - 2);
while (Count != 0 && TripCount % Count != 0)		while (Count != 0 && TripCount % Count != 0)
Count--;		Count--;
		if (Count <= 1 \|\| (LoopSize - 2) * Count + 2 > UP.PartialThreshold) {
		// If there is no Count that is modulo of TripCount or we still
		// exceed threshold, set Count to largest power-of-two factor that
		// satisfies the threshold limit.
		Count = (std::max(UP.PartialThreshold, 3u)-2) / (LoopSize-2);
		UnrolledSize = (LoopSize - 2) * Count + 2;
		while (Count != 0 && UnrolledSize > UP.PartialThreshold) {
		Count >>= 1;
		UnrolledSize = (LoopSize - 2) * Count + 2;
		}
		mzolotukhinUnsubmitted Not Done Reply Inline Actions Do I understand it correctly that with this change we start to unroll every loop if it's possible at all (some of them with remainder)? If that's correct, have you measured performance, compile time, and code size impact of this change? Also, it might make sense to limit it to O3. mzolotukhin: Do I understand it correctly that with this change we start to unroll every loop if it's…
		evstupacAuthorUnsubmitted Not Done Reply Inline Actions The change allows unroll only for loops that satisfy threshold limit. There are not much cases where this hits. On spec2000 I've got almost all build same or <0.1% code size changes. Without the changes it happens that the same loop with unknown bounds get unrolled, but with constant no. For example when we have prime TripCount we'll not found Count that is modulo of TripCount. for (i = 0; i < 17; i++) I don't see any reason why we should restrict unroll in the case if we are in threshold limits. As for remainder - there is no such by default. As we know remainder TripCount (it is constant) we can jump into the middle of the loop first. The other point is that now we check threshold limit only at entrance. So potentially we can find unroll factor which exceed threshold limit. evstupac: The change allows unroll only for loops that satisfy threshold limit. There are not much cases…
		}
}		}
} else if (Unrolling == Runtime) {		} else if (Unrolling == Runtime) {
if (!AllowRuntime && !CountSetExplicitly) {		if (!AllowRuntime && !CountSetExplicitly) {
DEBUG(dbgs() << " will not try to unroll loop with runtime trip count "		DEBUG(dbgs() << " will not try to unroll loop with runtime trip count "
<< "-unroll-runtime not given\n");		<< "-unroll-runtime not given\n");
return false;		return false;
}		}

▲ Show 20 Lines • Show All 165 Lines • Show Last 20 Lines

test/Transforms/LoopUnroll/partial-unroll-const-bounds.ll

				; RUN: opt < %s -S -unroll-threshold=20 -loop-unroll -unroll-allow-partial \| FileCheck %s

				mzolotukhinUnsubmitted Not Done Reply Inline Actions You probably don't want to run `opt ... -O2` in this test. O2 will run the entire optimization pipeline, while we only want loop-unroll. mzolotukhin: You probably don't want to run `opt ... -O2` in this test. O2 will run the entire optimization…
				evstupacAuthorUnsubmitted Not Done Reply Inline Actions "-O2" makes CHECK statements easier. However I agree it is not required here. I'll fix the test. evstupac: "-O2" makes CHECK statements easier. However I agree it is not required here. I'll fix the test.
				mzolotukhinUnsubmitted Not Done Reply Inline Actions You could add some specific passes after yours. You probably need just something like `-dce -instcombine -simplifycfg` - some tests do this. Adding the entire "-O2" might introduce undesired side effects from e.g. running loop-unroll twice. mzolotukhin: You could add some specific passes after yours. You probably need just something like `-dce…
				; The Loop TripCount is 9. However unroll factors 3 or 9 exceed given threshold.
				; The test checks that we choose a smaller, power-of-two, unroll count and do not give up on unrolling.

				; CHECK: for.body:
				; CHECK: store
				; CHECK: for.body.1:
				; CHECK: store

				define void @foo(i32* nocapture %a, i32* nocapture readonly %b) nounwind uwtable {
				entry:
				br label %for.body

				for.body: ; preds = %for.body, %entry
				%indvars.iv = phi i64 [ 1, %entry ], [ %indvars.iv.next, %for.body ]
				%arrayidx = getelementptr inbounds i32, i32* %b, i64 %indvars.iv
				%ld = load i32, i32* %arrayidx, align 4
				%idxprom1 = sext i32 %ld to i64
				%arrayidx2 = getelementptr inbounds i32, i32* %a, i64 %idxprom1
				mzolotukhinUnsubmitted Not Done Reply Inline Actions Please pass the test through `opt -instnamer` to replace names like %0 with some symbolic names. These digit names make it harder to change the test in future. Also, you probably could only remove almost all instructions from the body to make the test smaller. If you run only loop-unroll, nothing will optimize that to `ret void`, so it would be fine. mzolotukhin: Please pass the test through `opt -instnamer` to replace names like %0 with some symbolic names.
				evstupacAuthorUnsubmitted Not Done Reply Inline Actions Ok. Will update. evstupac: Ok. Will update.
				%st = trunc i64 %indvars.iv to i32
				store i32 %st, i32* %arrayidx2, align 4
				%indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
				%exitcond = icmp eq i64 %indvars.iv.next, 10
				br i1 %exitcond, label %for.end, label %for.body

				for.end: ; preds = %for.body
				ret void
				}

This is an archive of the discontinued LLVM Phabricator instance.

Unroll of loops with constant bounds
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 51504

lib/Transforms/Scalar/LoopUnrollPass.cpp

test/Transforms/LoopUnroll/partial-unroll-const-bounds.ll

This is an archive of the discontinued LLVM Phabricator instance.

Unroll of loops with constant boundsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 51504

lib/Transforms/Scalar/LoopUnrollPass.cpp

test/Transforms/LoopUnroll/partial-unroll-const-bounds.ll

Unroll of loops with constant bounds
ClosedPublic