This is an archive of the discontinued LLVM Phabricator instance.


Differential D89821

[SCEV] Match 'zext (trunc A to iB) to iY' as URem.
ClosedPublic

Authored by fhahn on Oct 20 2020, 1:41 PM.


Details

Reviewers

reames
mkazantsev
sanjoy.google
lebedev.ri

Commits

rG88d6421e4c43: [SCEV] Match 'zext (trunc A to iB) to iY' as URem.

Summary

URem operations with constant power-of-2 second operands are modeled as
'zext (trunc A to iB) to iY' in SCEV, so matchURem should recognize that
form as well. This patch on its own has very little impact (e.g. no changes in
CodeGen for MultiSource/SPEC2000/SPEC2006 on X86 -O3 -flto), but I'll
soon post follow-up patches that make use of it to more accurately
determine the trip multiple.
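
The equivalence behind the pattern can be checked with plain integers. The following is a minimal sketch in standard C++, not LLVM code: the helper names are made up for illustration, and i32 for A and iY with i8 for iB are assumed widths. Truncating to 8 bits keeps the low 8 bits and zero-extending fills the upper bits with zeros, which gives the same value as A urem 2^8.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: models 'zext (trunc A to i8) to i32'.
// The narrowing cast keeps the low 8 bits; converting the unsigned 8-bit
// value back to 32 bits zero-fills the upper bits.
inline std::uint32_t zextTrunc8(std::uint32_t A) {
  return static_cast<std::uint32_t>(static_cast<std::uint8_t>(A));
}

// Hypothetical helper: models 'A urem 256', i.e. A urem 2^8.
inline std::uint32_t uremPow2_8(std::uint32_t A) { return A % 256u; }

// Returns true if both forms agree on a handful of sample values.
inline bool formsAgree() {
  const std::uint32_t Samples[] = {0u,   1u,          255u,       256u,
                                   257u, 0xDEADBEEFu, 0xFFFFFFFFu};
  for (std::uint32_t A : Samples)
    if (zextTrunc8(A) != uremPow2_8(A))
      return false;
  return true;
}
```

Calling `formsAgree()` compares the two forms on the sample values only; it illustrates the identity rather than proving it.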

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
110 ms	linux > Polly.ScopInfo/NonAffine::non-affine-loop-condition-dependent-access_3.ll

Event Timeline

fhahn created this revision. Oct 20 2020, 1:41 PM

Herald added a project: Restricted Project. Oct 20 2020, 1:41 PM

Herald added subscribers: pengfei, hiraditya.

fhahn requested review of this revision. Oct 20 2020, 1:41 PM

I would expect that you should be able to showcase the effect with opt -analyze -scalar-evolution-based test:
https://godbolt.org/z/xddz8d (because there's zext(A % B) --> zext(A) % zext(B) matchURem()-driven fold)

llvm/lib/Analysis/ScalarEvolution.cpp
12757	What if `Expr` isn't a `zext` itself, but it is used by a zext (which is the case in the single user of this method)?
12760–12761	What if we had `zext(zext(trunc))`, so we end up with a wider type than A was?
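
The fold mentioned above, zext(A % B) --> zext(A) % zext(B), holds because an unsigned remainder is never larger than its dividend, so widening before or after the operation yields the same value. A small sketch in standard C++ (hypothetical helper names; widening i32 to i64 is chosen only for illustration; B must be non-zero):

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper: zext(A % B). The remainder is taken at 32 bits and
// the result is then widened to 64 bits. B must be non-zero.
inline std::uint64_t zextOfURem(std::uint32_t A, std::uint32_t B) {
  return static_cast<std::uint64_t>(A % B);
}

// Hypothetical helper: zext(A) % zext(B). Both operands are widened to
// 64 bits first and the remainder is taken at 64 bits. B must be non-zero.
inline std::uint64_t uremOfZext(std::uint32_t A, std::uint32_t B) {
  return static_cast<std::uint64_t>(A) % static_cast<std::uint64_t>(B);
}
```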

Harbormaster completed remote builds in B75757: Diff 299457. Oct 20 2020, 2:24 PM

lebedev.ri added a reviewer: lebedev.ri. Oct 20 2020, 11:41 PM

mkazantsev added inline comments. Oct 23 2020, 3:37 AM

llvm/lib/Analysis/ScalarEvolution.cpp
12767	I think `getPowerOf2` would be a useful utility function in SCEV. Consider factoring out (maybe as separate patch).
12768	`1u << X` will overflow and become zero if `X > 32`. Consider using APInt.
llvm/unittests/Analysis/ScalarEvolutionTest.cpp
67	static?

mkazantsev added inline comments. Oct 23 2020, 3:39 AM

llvm/unittests/Analysis/ScalarEvolutionTest.cpp
1334	Could you please make a 64-bit version of this test to catch bug with `1u << size` if it happens?

Address comments: use APInt instead of unsigned for the shift, handle different zext sizes, add test with divisor > 1 << 32.

Thanks!

In D89821#2342783, @lebedev.ri wrote:

I would expect that you should be able to showcase the effect with opt -analyze -scalar-evolution-based test:
https://godbolt.org/z/xddz8d (because there's zext(A % B) --> zext(A) % zext(B) matchURem()-driven fold)

Not sure if that's possible, because the expression is already in the form zext (trunc ... to iX) to iY and I think if there was an outer zext, it would be already folded into the inner zext.

llvm/lib/Analysis/ScalarEvolution.cpp
12757	Hm, I'm not sure what to do about that case. I think we need the zext (trunc) combo to match that.
12760–12761	Originally the code had a check to ensure the type of the starting expression matched the type of A, but I now updated things to do a ZExt on demand to make A match the type of Expr.
12768	Thanks, updated to use APInt and added a test where the LHS is > 1 << 32.
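
The width problem raised on line 12768 can be reproduced outside LLVM. In C++, `1u` is a 32-bit `unsigned` on the usual targets, and shifting it by 32 or more bits is undefined behavior, so the original expression cannot represent divisors such as 2^33. The sketch below uses only standard C++; `powerOf2` is a hypothetical helper that stands in for the APInt-based fix rather than reproducing it, and it is not the SCEV utility suggested in the review.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical helper: returns 2^Bits computed in a 64-bit type, or
// std::nullopt when the result would not fit in 64 bits.
inline std::optional<std::uint64_t> powerOf2(unsigned Bits) {
  if (Bits >= 64)
    return std::nullopt; // shifting a 64-bit value by >= 64 is undefined
  return std::uint64_t{1} << Bits;
}
```

An arbitrary-precision type such as APInt avoids the 64-bit limit as well, which is why the patch switched to it.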

make matchURem static in test class

Harbormaster completed remote builds in B76332: Diff 300552. Oct 25 2020, 1:25 PM

Harbormaster completed remote builds in B76331: Diff 300551. Oct 25 2020, 1:30 PM

Looks good, thanks!

llvm/lib/Analysis/ScalarEvolution.cpp
12762	`{}` not needed
12763	`if (auto *Trunc = ...`

This revision is now accepted and ready to land. Oct 25 2020, 9:14 PM

This revision was landed with ongoing or failed builds. Oct 29 2020, 3:48 AM

Closed by commit rG88d6421e4c43: [SCEV] Match 'zext (trunc A to iB) to iY' as URem. (authored by fhahn).

This revision was automatically updated to reflect the committed changes.

fhahn added a commit: rG88d6421e4c43: [SCEV] Match 'zext (trunc A to iB) to iY' as URem.

Thanks! Adjusted the comments in the committed version

Revision Contents

Path	Size
llvm/lib/Analysis/ScalarEvolution.cpp	16 lines
llvm/unittests/Analysis/ScalarEvolutionTest.cpp	35 lines

Diff 299457

llvm/lib/Analysis/ScalarEvolution.cpp

(Preceding lines not shown; context: for (auto &I : *BB) {)

       OS.indent(Depth) << "[PSE]" << I << ":\n";
       OS.indent(Depth + 2) << *Expr << "\n";
       OS.indent(Depth + 2) << "--> " << *II->second.second << "\n";
     }
 }

 // Match the mathematical pattern A - (A / B) * B, where A and B can be
-// arbitrary expressions.
+// arbitrary expressions. Also match zext (trunc A to iB) to iY, which is used
+// for URem with constant power-of-2 second operands.
 // It's not always easy, as A and B can be folded (imagine A is X / 2, and B is
 // 4, A / B becomes X / 8).
 bool ScalarEvolution::matchURem(const SCEV *Expr, const SCEV *&LHS,
    [lebedev.ri] What if `Expr` isn't a `zext` itself, but it is used by a zext (which is the case in the single user of this method)?
    [fhahn, author] Hm, I'm not sure what to do about that case. I think we need the zext (trunc) combo to match that.
                                 const SCEV *&RHS) {
+  // Try to match 'zext (trunc A to iB) to iY', which is used
+  // for URem with constant power-of-2 second operands. Make sure the size of
+  // the operand A matches the size of the whole expressions.
    [lebedev.ri] What if we had `zext(zext(trunc))`, so we end up with a wider type than A was?
    [fhahn, author] Originally the code had a check to ensure the type of the starting expression matched the type of A, but I now updated things to do a ZExt on demand to make A match the type of Expr.
+  if (const auto *ZExt = dyn_cast<SCEVZeroExtendExpr>(Expr)) {
    [mkazantsev] `{}` not needed
+    const auto *Trunc = dyn_cast<SCEVTruncateExpr>(ZExt->getOperand(0));
    [mkazantsev] `if (auto *Trunc = ...`
+    if (Trunc && getTypeSizeInBits(Trunc->getOperand()->getType()) ==
+                     getTypeSizeInBits(Expr->getType())) {
+      LHS = Trunc->getOperand();
+      RHS = getConstant(Expr->getType(),
    [mkazantsev] I think `getPowerOf2` would be a useful utility function in SCEV. Consider factoring out (maybe as separate patch).
+                        1u << getTypeSizeInBits(Trunc->getType()));
    [mkazantsev] `1u << X` will overflow and become zero if `X > 32`. Consider using APInt.
    [fhahn, author] Thanks, updated to use APInt and added a test where the LHS is > 1 << 32.
+      return true;
+    }
+  }
   const auto *Add = dyn_cast<SCEVAddExpr>(Expr);
   if (Add == nullptr || Add->getNumOperands() != 2)
     return false;

   const SCEV *A = Add->getOperand(1);
   const auto *Mul = dyn_cast<SCEVMulExpr>(Add->getOperand(0));

   if (Mul == nullptr)

(Remaining lines not shown.)

llvm/unittests/Analysis/ScalarEvolutionTest.cpp

(Preceding lines not shown; context: void runWithSE()

     Test(F, LI, SE);
   }

   static Optional<APInt> computeConstantDifference(ScalarEvolution &SE,
                                                    const SCEV *LHS,
                                                    const SCEV *RHS) {
     return SE.computeConstantDifference(LHS, RHS);
   }

+  bool matchURem(ScalarEvolution &SE, const SCEV *Expr, const SCEV *&LHS,
    [mkazantsev] static?
+                 const SCEV *&RHS) {
+    return SE.matchURem(Expr, LHS, RHS);
+  }
 };

 TEST_F(ScalarEvolutionsTest, SCEVUnknownRAUW) {
   FunctionType *FTy = FunctionType::get(Type::getVoidTy(Context),
                                         std::vector<Type *>(), false);
   Function *F = Function::Create(FTy, Function::ExternalLinkage, "f", M);
   BasicBlock *BB = BasicBlock::Create(Context, "entry", F);
   ReturnInst::Create(Context, nullptr, BB);

(Intervening lines not shown; context: runWithSE(*M, "foo", [](Function &F, LoopInfo &LI, ScalarEvolution &SE) {)

     ASSERT_TRUE(Guarded);
     EXPECT_TRUE(
         SE.isBasicBlockEntryGuardedByCond(Guarded, ICmpInst::ICMP_ULT, Y, X));
     EXPECT_TRUE(
         SE.isBasicBlockEntryGuardedByCond(Guarded, ICmpInst::ICMP_UGT, X, Y));
   });
 }

+TEST_F(ScalarEvolutionsTest, MatchURem) {
+  LLVMContext C;
+  SMDiagnostic Err;
+  std::unique_ptr<Module> M = parseAssemblyString(
+      "target datalayout = \"e-m:e-p:32:32-f64:32:64-f80:32-n8:16:32-S128\" "
+      " "
+      "define void @test(i32 %a, i32 %b) {"
+      "entry: "
+      " %rem1 = urem i32 %a, 2"
+      " %rem2 = urem i32 %a, 5"
+      " %rem3 = urem i32 %a, %b"
    [mkazantsev] Could you please make a 64-bit version of this test to catch the bug with `1u << size` if it happens?
+      " ret void "
+      "} ",
+      Err, C);
+
+  assert(M && "Could not parse module?");
+  assert(!verifyModule(*M) && "Must have been well formed!");
+
+  runWithSE(*M, "test", [&](Function &F, LoopInfo &LI, ScalarEvolution &SE) {
+    for (auto *N : {"rem1", "rem2", "rem3"}) {
    [Lint: Pre-merge checks] clang-tidy: warning: 'auto *N' can be declared as 'const auto *N' [llvm-qualified-auto]
+      auto *URemI = getInstructionByName(F, N);
+      auto *S = SE.getSCEV(URemI);
    [Lint: Pre-merge checks] clang-tidy: warning: 'auto *S' can be declared as 'const auto *S' [llvm-qualified-auto]
+      const SCEV *LHS, *RHS;
+      EXPECT_TRUE(matchURem(SE, S, LHS, RHS));
+      EXPECT_EQ(LHS, SE.getSCEV(URemI->getOperand(0)));
+      EXPECT_EQ(RHS, SE.getSCEV(URemI->getOperand(1)));
+    }
+  });
+}
+
 } // end namespace llvm