This is an archive of the discontinued LLVM Phabricator instance.

[JumpThreading] Don't limit the type of an operand
ClosedPublic

Authored by aqjune on Jul 30 2020, 7:52 AM.

Download Raw Diff

Details

Reviewers

efriedma
nikic
lebedev.ri

Commits

rG6f97103b561c: [JumpThreading] Don't limit the type of an operand

Summary

Compared to the optimized code with branch conditions never frozen,
limiting the type of freeze's operand causes generation of suboptimal code in
some cases.
I would like to suggest removing the constraint, as this patch does.
If the number of freeze instructions becomes significant, this can be revisited.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aqjune created this revision.Jul 30 2020, 7:52 AM

Herald added a project: Restricted Project. · View Herald TranscriptJul 30 2020, 7:52 AM

Herald added subscribers: llvm-commits, hiraditya. · View Herald Transcript

aqjune requested review of this revision.Jul 30 2020, 7:52 AM

aqjune added a parent revision: D84940: [JumpThreading] Conditionally freeze its condition when unfolding select.Jul 30 2020, 7:53 AM

Harbormaster completed remote builds in B66399: Diff 281931.Jul 30 2020, 8:48 AM

efriedma added inline comments.Jul 30 2020, 12:40 PM

llvm/lib/Transforms/Scalar/JumpThreading.cpp
678	Oh, hmm, I misintepreted this comment, and didn't read the code carefully enough. The "if" is actually a heuristic that suppresses potential optimizations. Do you have any idea what the actual compile-time impact would be if we just recursed over all casts?

aqjune added inline comments.Jul 31 2020, 1:34 AM

llvm/lib/Transforms/Scalar/JumpThreading.cpp

678

I ran a test, and actually it brought a slight speedup when compiled with -O3 (without LTO):

+-----------------------------------------------+-------+-------+------------+
|                   unit:sec.                   | base  | cast  | speedup(%) |
+-----------------------------------------------+-------+-------+------------+
| CTMark/7zip/7zip-benchmark.test               | 90.18 | 89.83 | 0.39%      |
| CTMark/Bullet/bullet.test                     | 63.11 | 62.92 | 0.31%      |
| CTMark/ClamAV/clamscan.test                   | 27.00 | 26.98 | 0.05%      |
| CTMark/SPASS/SPASS.test                       | 26.27 | 26.14 | 0.52%      |
| CTMark/consumer-typeset/consumer-typeset.test | 19.73 | 19.74 | -0.02%     |
| CTMark/kimwitu++/kc.test                      | 26.60 | 26.51 | 0.37%      |
| CTMark/lencod/lencod.test                     | 34.64 | 34.65 | -0.03%     |
| CTMark/mafft/pairlocalalign.test              | 16.24 | 16.24 | 0.00%      |
| CTMark/sqlite3/sqlite3.test                   | 24.61 | 24.68 | -0.27%     |
| CTMark/tramp3d-v4/tramp3d-v4.test             | 49.35 | 49.24 | 0.22%      |
+-----------------------------------------------+-------+-------+------------+

Would it be a right direction if I remove this condition in a separate patch?

lebedev.ri added inline comments.Jul 31 2020, 3:59 AM

llvm/lib/Transforms/Scalar/JumpThreading.cpp
678	Compile time != run time

aqjune added inline comments.Jul 31 2020, 4:25 AM

llvm/lib/Transforms/Scalar/JumpThreading.cpp
678	The table depicts compile time. Compilation becomes slightly faster (or almost equivalent, assuming that they are errors) if the conditions are removed, interestingly.

lebedev.ri added inline comments.Jul 31 2020, 6:16 AM

llvm/lib/Transforms/Scalar/JumpThreading.cpp
678	http://llvm-compile-time-tracker.com/compare.php?from=03116a9f8c2fc98577e153083aaf9b6a701ab8f9&to=503a232ea5a630ddacde3ba01498776b61c4c8d4&stat=instructions So it doesn't look like the cost is too great

LGTM

llvm/lib/Transforms/Scalar/JumpThreading.cpp
678	Sure, we can do it in a separate patch.
698	Maybe tweak this comment, if we're going to drop the check for cmpinst.

This revision is now accepted and ready to land.Aug 3 2020, 1:48 PM

Update the comment

aqjune marked an inline comment as done.Aug 4 2020, 12:21 AM

This revision was landed with ongoing or failed builds.Aug 4 2020, 12:22 AM

Closed by commit rG6f97103b561c: [JumpThreading] Don't limit the type of an operand (authored by aqjune). · Explain Why

This revision was automatically updated to reflect the committed changes.

aqjune added a commit: rG6f97103b561c: [JumpThreading] Don't limit the type of an operand.

Harbormaster completed remote builds in B66881: Diff 282811.Aug 4 2020, 12:57 AM

aqjune mentioned this in D85188: [JumpThreading] Remove cast's constraint.Aug 4 2020, 2:28 AM

aqjune mentioned this in rGe734e8286b4b: [JumpThreading] Remove cast's constraint.Aug 4 2020, 3:09 AM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Scalar/

JumpThreading.cpp

4 lines

test/

Transforms/

JumpThreading/

freeze.ll

16 lines

Diff 282812

llvm/lib/Transforms/Scalar/JumpThreading.cpp

Show First 20 Lines • Show All 669 Lines • ▼ Show 20 Lines	for (unsigned i = 0, e = PN->getNumIncomingValues(); i != e; ++i) {
Result.emplace_back(KC, PN->getIncomingBlock(i));		Result.emplace_back(KC, PN->getIncomingBlock(i));
}		}
}		}

return !Result.empty();		return !Result.empty();
}		}

// Handle Cast instructions. Only see through Cast when the source operand is		// Handle Cast instructions. Only see through Cast when the source operand is
// PHI, Cmp, or Freeze to save the compilation time.		// PHI, Cmp, or Freeze to save the compilation time.
		efriedmaUnsubmitted Not Done Reply Inline Actions Oh, hmm, I misintepreted this comment, and didn't read the code carefully enough. The "if" is actually a heuristic that suppresses potential optimizations. Do you have any idea what the actual compile-time impact would be if we just recursed over all casts? efriedma: Oh, hmm, I misintepreted this comment, and didn't read the code carefully enough. The "if" is…
		aqjuneAuthorUnsubmitted Done Reply Inline Actions I ran a test, and actually it brought a slight speedup when compiled with -O3 (without LTO): +-----------------------------------------------+-------+-------+------------+ \| unit:sec. \| base \| cast \| speedup(%) \| +-----------------------------------------------+-------+-------+------------+ \| CTMark/7zip/7zip-benchmark.test \| 90.18 \| 89.83 \| 0.39% \| \| CTMark/Bullet/bullet.test \| 63.11 \| 62.92 \| 0.31% \| \| CTMark/ClamAV/clamscan.test \| 27.00 \| 26.98 \| 0.05% \| \| CTMark/SPASS/SPASS.test \| 26.27 \| 26.14 \| 0.52% \| \| CTMark/consumer-typeset/consumer-typeset.test \| 19.73 \| 19.74 \| -0.02% \| \| CTMark/kimwitu++/kc.test \| 26.60 \| 26.51 \| 0.37% \| \| CTMark/lencod/lencod.test \| 34.64 \| 34.65 \| -0.03% \| \| CTMark/mafft/pairlocalalign.test \| 16.24 \| 16.24 \| 0.00% \| \| CTMark/sqlite3/sqlite3.test \| 24.61 \| 24.68 \| -0.27% \| \| CTMark/tramp3d-v4/tramp3d-v4.test \| 49.35 \| 49.24 \| 0.22% \| +-----------------------------------------------+-------+-------+------------+ Would it be a right direction if I remove this condition in a separate patch? aqjune: I ran a test, and actually it brought a slight speedup when compiled with -O3 (without LTO)…
		lebedev.riUnsubmitted Not Done Reply Inline Actions Compile time != run time lebedev.ri: Compile time != run time
		aqjuneAuthorUnsubmitted Done Reply Inline Actions The table depicts compile time. Compilation becomes slightly faster (or almost equivalent, assuming that they are errors) if the conditions are removed, interestingly. aqjune: The table depicts compile time. Compilation becomes slightly faster (or almost equivalent…
		lebedev.riUnsubmitted Not Done Reply Inline Actions http://llvm-compile-time-tracker.com/compare.php?from=03116a9f8c2fc98577e153083aaf9b6a701ab8f9&to=503a232ea5a630ddacde3ba01498776b61c4c8d4&stat=instructions So it doesn't look like the cost is too great lebedev.ri: http://llvm-compile-time-tracker.com/compare.php?
		efriedmaUnsubmitted Not Done Reply Inline Actions Sure, we can do it in a separate patch. efriedma: Sure, we can do it in a separate patch.
if (CastInst *CI = dyn_cast<CastInst>(I)) {		if (CastInst *CI = dyn_cast<CastInst>(I)) {
Value *Source = CI->getOperand(0);		Value *Source = CI->getOperand(0);
if (!isa<PHINode>(Source) && !isa<CmpInst>(Source) &&		if (!isa<PHINode>(Source) && !isa<CmpInst>(Source) &&
!isa<FreezeInst>(Source))		!isa<FreezeInst>(Source))
return false;		return false;
ComputeValueKnownInPredecessorsImpl(Source, BB, Result, Preference,		ComputeValueKnownInPredecessorsImpl(Source, BB, Result, Preference,
RecursionSet, CxtI);		RecursionSet, CxtI);
if (Result.empty())		if (Result.empty())
return false;		return false;

// Convert the known values.		// Convert the known values.
for (auto &R : Result)		for (auto &R : Result)
R.first = ConstantExpr::getCast(CI->getOpcode(), R.first, CI->getType());		R.first = ConstantExpr::getCast(CI->getOpcode(), R.first, CI->getType());

return true;		return true;
}		}

// Handle Freeze instructions, in a manner similar to Cast.
if (FreezeInst *FI = dyn_cast<FreezeInst>(I)) {		if (FreezeInst *FI = dyn_cast<FreezeInst>(I)) {
Value *Source = FI->getOperand(0);		Value *Source = FI->getOperand(0);
if (!isa<PHINode>(Source) && !isa<CmpInst>(Source) &&
!isa<CastInst>(Source))
return false;
ComputeValueKnownInPredecessorsImpl(Source, BB, Result, Preference,		ComputeValueKnownInPredecessorsImpl(Source, BB, Result, Preference,
		efriedmaUnsubmitted Done Reply Inline Actions Maybe tweak this comment, if we're going to drop the check for cmpinst. efriedma: Maybe tweak this comment, if we're going to drop the check for cmpinst.
RecursionSet, CxtI);		RecursionSet, CxtI);

erase_if(Result, [](auto &Pair) {		erase_if(Result, [](auto &Pair) {
return !isGuaranteedNotToBeUndefOrPoison(Pair.first);		return !isGuaranteedNotToBeUndefOrPoison(Pair.first);
});		});

return !Result.empty();		return !Result.empty();
}		}
▲ Show 20 Lines • Show All 2,292 Lines • Show Last 20 Lines

llvm/test/Transforms/JumpThreading/freeze.ll

Show First 20 Lines • Show All 79 Lines • ▼ Show 20 Lines	T2:
ret i32 %B		ret i32 %B

F2:		F2:
ret i32 %B		ret i32 %B
}		}

define i32 @test1_cast2(i1 %cond) {		define i32 @test1_cast2(i1 %cond) {
; CHECK-LABEL: @test1_cast2(		; CHECK-LABEL: @test1_cast2(
; CHECK-NEXT: br i1 [[COND:%.]], label [[MERGE_THREAD:%.]], label [[MERGE:%.*]]		; CHECK-NEXT: br i1 [[COND:%.]], label [[T2:%.]], label [[F2:%.*]]
; CHECK: Merge.thread:
; CHECK-NEXT: [[V1:%.*]] = call i32 @f1()
; CHECK-NEXT: br label [[T2:%.*]]
; CHECK: Merge:
; CHECK-NEXT: [[V2:%.*]] = call i32 @f2()
; CHECK-NEXT: [[A0_FR:%.*]] = freeze i32 0
; CHECK-NEXT: [[A_FR:%.*]] = trunc i32 [[A0_FR]] to i1
; CHECK-NEXT: br i1 [[A_FR]], label [[T2]], label [[F2:%.*]]
; CHECK: T2:		; CHECK: T2:
; CHECK-NEXT: [[B5:%.*]] = phi i32 [ [[V1]], [[MERGE_THREAD]] ], [ [[V2]], [[MERGE]] ]		; CHECK-NEXT: [[V1:%.*]] = call i32 @f1()
; CHECK-NEXT: call void @f3()		; CHECK-NEXT: call void @f3()
; CHECK-NEXT: ret i32 [[B5]]		; CHECK-NEXT: ret i32 [[V1]]
; CHECK: F2:		; CHECK: F2:
		; CHECK-NEXT: [[V2:%.*]] = call i32 @f2()
		; CHECK-NEXT: [[A0_FR:%.*]] = freeze i32 0
; CHECK-NEXT: ret i32 [[V2]]		; CHECK-NEXT: ret i32 [[V2]]
;		;
br i1 %cond, label %T1, label %F1		br i1 %cond, label %T1, label %F1

T1:		T1:
%v1 = call i32 @f1()		%v1 = call i32 @f1()
br label %Merge		br label %Merge

▲ Show 20 Lines • Show All 94 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[JumpThreading] Don't limit the type of an operandClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 282812

llvm/lib/Transforms/Scalar/JumpThreading.cpp

llvm/test/Transforms/JumpThreading/freeze.ll

[JumpThreading] Don't limit the type of an operand
ClosedPublic