This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/lib/Transforms/Scalar/
-
lib/
-
Transforms/
-
Scalar/
-
LoopRotation.cpp

Differential D83663

[LoopRotate] Bump rotation-max-header-size to 64
AbandonedPublic

Authored by lebedev.ri on Jul 13 2020, 3:00 AM.

Download Raw Diff

Details

Reviewers

reames
aprantl
asbirlea
fedor.sergeev

Summary

Currently, rotation-max-header-size is at 16, and it was always that way since the beginning (rL35714).
It is not very obvious (to me?) what reasoning is behind this threshold,
is it there to cap the maximal amount of work to be done to rotate the loop,
to cap the amount of instruction number growth, other?

But i think, threshold of 16 could use adjustment.

In particular, i've identified these loops here as not being rotated,
because they have more instructions than allowed (16 < x < 64, ~50).
That, in turn, prevents vectorizer from dealing with them,
which is not optimal since they should be vectorizable.

On my benchmarks, i'm not seeing any perf impact from this change.
And yes, those loops in question still aren't being vectorized, there are more issues.

llvm-compile-time-tracker is also not very unhappy with this:
http://llvm-compile-time-tracker.com/compare.php?from=07c4c7e7959b7fd09830bbdf4dcd533e98aa45ab&to=b40c8ea61a4bea615f2b3e1bbd0eaa67b7a13b44&stat=instructions
It clearly affects those tests, because

there's pretty small +0.01 .. +0.05% compile-time regression
this causes, on average, +0.03% size-text increase

So i'm wondering, what is the procedure here, would this be a reasonable change?

Diff Detail

Repository: rG LLVM Github Monorepo

Unit TestsFailed

	Time	Test
	450 ms	linux > SanitizerCommon-asan-x86_64-Linux.Linux::Unknown Unit Message ("")
	280 ms	linux > SanitizerCommon-lsan-x86_64-Linux.Linux::Unknown Unit Message ("")
	370 ms	linux > SanitizerCommon-msan-x86_64-Linux.Linux::Unknown Unit Message ("")
	460 ms	linux > SanitizerCommon-tsan-x86_64-Linux.Linux::Unknown Unit Message ("")
	340 ms	linux > SanitizerCommon-ubsan-x86_64-Linux.Linux::Unknown Unit Message ("")

Event Timeline

lebedev.ri created this revision.Jul 13 2020, 3:00 AM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 13 2020, 3:00 AM

there's pretty small (+0.01 .. +0.05% compile-time regression)

this causes, on average, +0.03% size-text increase

So i'm wondering, what is the procedure here, would this be a reasonable change?

I think the cap is mostly there to guard against unnecessary code size increases.

It would be interesting to see what the impact on code size would be with -Os across more than the ctmark tests. I think for -Oz we already force the header size to 0, but I am not sure if it would be great to add another -Os threshold.

lebedev.ri edited the summary of this revision. (Show Details)Jul 13 2020, 3:25 AM

Harbormaster failed remote builds in B63929: Diff 277354!Jul 13 2020, 3:58 AM

In D83663#2146895, @fhahn wrote:

there's pretty small (+0.01 .. +0.05% compile-time regression)

this causes, on average, +0.03% size-text increase

So i'm wondering, what is the procedure here, would this be a reasonable change?

I think the cap is mostly there to guard against unnecessary code size increases.

Aha.

It would be interesting to see what the impact on code size would be with -Os across more than the ctmark tests. I think for -Oz we already force the header size to 0, but I am not sure if it would be great to add another -Os threshold.

I did indeed think about -Os/-Oz cases, and i indeed suspect this might not be beneficial for them.
I'm fully okay with keeping 16 for -Os.

But sadly i'm rather unfamiliar wit them, so i'm not sure i'm in position to actually assess the effects for -Os.

I'm not 100% sure, but possibly D84108 will fix most of the motivational code patterns.

lebedev.ri abandoned this revision.Jan 17 2022, 2:41 PM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Scalar/

LoopRotation.cpp

2 lines

Diff 277354

llvm/lib/Transforms/Scalar/LoopRotation.cpp

	Show All 24 Lines
	#include "llvm/Transforms/Scalar/LoopPassManager.h"			#include "llvm/Transforms/Scalar/LoopPassManager.h"
	#include "llvm/Transforms/Utils/LoopRotationUtils.h"			#include "llvm/Transforms/Utils/LoopRotationUtils.h"
	#include "llvm/Transforms/Utils/LoopUtils.h"			#include "llvm/Transforms/Utils/LoopUtils.h"
	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "loop-rotate"			#define DEBUG_TYPE "loop-rotate"

	static cl::opt<unsigned> DefaultRotationThreshold(			static cl::opt<unsigned> DefaultRotationThreshold(
	"rotation-max-header-size", cl::init(16), cl::Hidden,			"rotation-max-header-size", cl::init(64), cl::Hidden,
	cl::desc("The default maximum header size for automatic loop rotation"));			cl::desc("The default maximum header size for automatic loop rotation"));

	LoopRotatePass::LoopRotatePass(bool EnableHeaderDuplication)			LoopRotatePass::LoopRotatePass(bool EnableHeaderDuplication)
	: EnableHeaderDuplication(EnableHeaderDuplication) {}			: EnableHeaderDuplication(EnableHeaderDuplication) {}

	PreservedAnalyses LoopRotatePass::run(Loop &L, LoopAnalysisManager &AM,			PreservedAnalyses LoopRotatePass::run(Loop &L, LoopAnalysisManager &AM,
	LoopStandardAnalysisResults &AR,			LoopStandardAnalysisResults &AR,
	LPMUpdater &) {			LPMUpdater &) {
	▲ Show 20 Lines • Show All 86 Lines • Show Last 20 Lines