This is an archive of the discontinued LLVM Phabricator instance.

[Refactor] Replace RegionPasses by FunctionPasses
Abandoned · Public

Authored by jdoerfert on Mar 1 2015, 10:55 AM.

Details

Summary
The main change is the switch from region passes to function passes.
This saves compile time because we no longer have to query the
TempScopInfo for each region. However, due to the changed interface
and the now explicit iteration over the regions of a function, other
adjustments were made as well.
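
For orientation, here is a minimal sketch of the overall shape of such a pass (not the actual patch): a function pass that walks the region tree itself instead of being scheduled once per region. It is written against the legacy pass manager, pass registration is omitted, and the class name and processRegion() are made up for this sketch.

// Minimal sketch, legacy pass manager; processRegion() is illustrative only.
#include "llvm/ADT/SmallVector.h"
#include "llvm/Analysis/RegionInfo.h"
#include "llvm/IR/Function.h"
#include "llvm/Pass.h"

using namespace llvm;

namespace {
struct RegionWalkSketch : public FunctionPass {
  static char ID;
  RegionWalkSketch() : FunctionPass(ID) {}

  bool runOnFunction(Function &F) override {
    // Instead of the pass manager invoking us once per region, we iterate
    // over all regions of the function ourselves.
    RegionInfo &RI = getAnalysis<RegionInfoPass>().getRegionInfo();
    SmallVector<Region *, 8> Worklist;
    Worklist.push_back(RI.getTopLevelRegion());
    while (!Worklist.empty()) {
      Region *R = Worklist.pop_back_val();
      processRegion(*R); // hypothetical per-region work, e.g. build a SCoP
      for (const auto &SubRegion : *R)
        Worklist.push_back(SubRegion.get());
    }
    return false; // analysis only, the IR is not modified
  }

  void processRegion(Region &R) { /* ... */ }

  void getAnalysisUsage(AnalysisUsage &AU) const override {
    AU.addRequired<RegionInfoPass>();
    AU.setPreservesAll();
  }
};
char RegionWalkSketch::ID = 0;
} // end anonymous namespace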

Diff Detail

Event Timeline

jdoerfert updated this revision to Diff 20961.Mar 1 2015, 10:55 AM
jdoerfert retitled this revision from to [Refactor] Replace RegionPasses by FunctionPasses.
jdoerfert updated this object.
jdoerfert added subscribers: Restricted Project, Unknown Object (MLST).
jdoerfert added inline comments.Mar 1 2015, 11:07 AM
lib/Analysis/ScopInfo.cpp
2014–2015

A Scops.pop_back() is missing here.

2024–2025

This line needs to be removed in order to loop again.

jdoerfert updated this revision to Diff 20963.Mar 1 2015, 11:17 AM

Small fixes

jdoerfert updated this revision to Diff 20974.Mar 1 2015, 7:38 PM

Allow multiple SCoPs in ScopPasses + fix tests

I will cut this commit into pieces (as far as possible) but I wanted to get some initial feedback.

grosser edited edge metadata.Mar 3 2015, 12:39 AM

Hi Johannes,

what is the motivation and the impact of this change? The only motivation you give in the commit message is compile time. Did you measure any significant improvements here? I personally think there are very good reasons to change/improve our pass infrastructure, but I am surprised that compile time is the one and only reason motivating your change.

I see two main reasons to work on the pass infrastructure:

  1. Make it work with Chandler's new pass manager

Chandler's new pass manager uses caching to keep multiple analysis results available. I believe that when we change our pass infrastructure, we should make sure it will work with Chandler's new pass manager. Besides his last PassManager talk, commits such as https://llvm.org/svn/llvm-project/llvm/trunk@226560 show the idea of caching results (a minimal illustration of this caching idea follows after this list).

  2. Fix the LoopInfo/RegionInfo misconception

We currently assume that ScopPasses on different regions do not affect each other. However, they do affect each other: code-generating one scop may invalidate the next scop. Hence, we have some hacks in place to detect these invalidated scops. The (implicit) pass-order change your patch brings does not seem to improve this in any way.
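
As a concrete (if anachronistic) illustration of the caching idea under point 1, here is a minimal sketch using the new-pass-manager API as it looks in current LLVM; it only demonstrates the concept and is not part of this patch.

#include "llvm/Analysis/LoopInfo.h"
#include "llvm/IR/Function.h"
#include "llvm/IR/PassManager.h"

using namespace llvm;

// Analyses are queried through an analysis manager: the first query computes
// and caches the result; later queries return the cached object until a
// transformation invalidates it.
void useCachedLoopInfo(Function &F, FunctionAnalysisManager &FAM) {
  LoopInfo &LI = FAM.getResult<LoopAnalysis>(F); // computed once, then cached
  for (Loop *L : LI)
    (void)L; // e.g. inspect the top-level loops
}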

I am not saying your patch should solve those two issues, but maybe now is a good time to reason about them and at least make sure these changes do not negatively affect them (or possibly even help to solve some of them).

I added a couple of first comments to your patch, but before going further I would like to understand your intentions.

Cheers,
Tobias

include/polly/DependenceInfo.h
1 ↗(On Diff #20974)

Needs update, if renamed.

24 ↗(On Diff #20974)

Needs update, if renamed.

140 ↗(On Diff #20974)

The idea of introducing an analysis result for each individual Scop is very much in line with Chandler's new analysis infrastructure. However, to me it seems you only went halfway by leaving most functions on the pass itself. Looking at this, introducing per-scop dependence objects seems to be an almost independent change.

include/polly/LinkAllPasses.h
31 ↗(On Diff #20974)

I would personally not perform such renaming as part this patch, as it causes noise all over the place.

lib/Analysis/DependenceInfo.cpp
1 ↗(On Diff #20974)

Needs an update if you want to perform the renaming.

351 ↗(On Diff #20974)

These are a lot of D.* calls. Would it make sense to make all of this functionality part of the Dependences object/class?

453 ↗(On Diff #20974)

I have the feeling that passing the scop to each of these functions unnecessarily complicates the interface. Could we not ask once for the dependences object of this scop and then work with it?

Hey Tobias,

let me give you a short answer now and a more detailed one later:

  • Compile time: We perform much better on large tests with many regions. Here are some examples after one lnt run for a release build (change in compile time, old -> new times in seconds):

    MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4: -89.42% (345.5400 -> 36.5533)
    SingleSource/Benchmarks/Misc-C++-EH/spirit: -78.71% (33.4433 -> 7.1200)
    MultiSource/Applications/kimwitu++/kc: -71.26% (77.1700 -> 22.1766)
    MultiSource/Benchmarks/MiBench/consumer-typeset/consumer-typeset: -67.55% (65.2370 -> 21.1667)
    SingleSource/UnitTests/DefaultInitDynArrays: -50.00% (0.0200 -> 0.0100)
    SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding: -42.39% (8.6267 -> 4.9700)
    MultiSource/Applications/sqlite3/sqlite3: -42.16% (60.4734 -> 34.9800)
    MultiSource/Benchmarks/ASCI_Purple/SMG2000/smg2000: -39.79% (19.6970 -> 11.8601)
    SingleSource/Regression/C++/EH/inlined_cleanup: -38.34% (0.0433 -> 0.0267)
    MultiSource/Applications/JM/ldecod/ldecod: -37.42% (23.0533 -> 14.4267)
    MultiSource/Applications/oggenc/oggenc: -33.81% (24.6900 -> 16.3433)
    SingleSource/Benchmarks/Stanford/Treesort: -31.79% (0.0733 -> 0.0500)
    MultiSource/Applications/lemon/lemon: -30.80% (2.8467 -> 1.9700)
    SingleSource/Benchmarks/Shootout/strcat: -30.72% (0.0433 -> 0.0300)
    MultiSource/Applications/JM/lencod/lencod: -28.96% (39.8598 -> 28.3167)
    MultiSource/Benchmarks/PAQ8p/paq8p: -26.53% (4.6367 -> 3.4067)

    Note that for a debug build I more or less always hit the 500s timeout on some of these benchmarks with the old pass system! I admit the changes (especially in the DependenceInfo pass) can be made even more efficient, but that is an easy patch I can write afterwards.
  • I don't know enough about the new pass manager to say how this change will affect it.
  • SCoPs affecting each other is, at least in my opinion, a design misconception. We should make the code generation aware of that (e.g., do not split the exit block of a region at 3 different locations in Polly!). However, this change passes lnt and all unit tests, and we detect exactly the same number of SCoPs; that is at least a hint that we can make function passes work. With scalar/PHI code generation and without independent blocks and code prepare, the impact of one SCoP on another should be even smaller.

I hope you reconsider this change, at least after I provide more accurate
numbers.

Best regards,

Johannes

Hey Tobias,

I attached 2 lnt reports to this mail. The first (report_region.json)
was created using the current polly/master. The second
(report_function.json) was created with the region->function pass patch.
Note that this patch does not yet realize its full potential; however, over the 3 runs I measured I got:

Performance Regressions  16
Performance Improvements  159
Unchanged Tests   817
Total Tests   992

The execution-time changes, on the other hand (only 8 in total), should be jitter.

The point I am trying to make here is that 3 runs with the region pass manager take

2:35:27 (3 runs region passes)

but 3 runs with the function pass manager only

2:07:48 (3 runs function passes)

hence we save ~18% of the lnt time (9,327s vs. 7,668s, a reduction of about 17.8%).

To be precise, we save compile time "only" for larger functions with small SCoP
coverage but we do not pay for it in any other case.

Does this convince you that region passes have a bad impact on compile time?

Best regards,

Johannes
zinob edited edge metadata.Mar 4 2015, 10:18 AM

Johannes and Tobias,

We just discovered an issue with compile time in the region pass manager, in particular in release builds where names are not preserved:

lib/Analysis/RegionPass.cpp:86:
dumpPassInfo(P, EXECUTION_MSG, ON_REGION_MSG, CurrentRegion->getNameStr());

getNameStr() is very expensive because LLVM constructs a new name. The compile-time issue you guys are seeing might be due to this.

Our fix right now is to pass an empty string in release builds. Toby, if you think this is an acceptable short-term solution, Sanjin can submit a patch?
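
For illustration, one possible shape of such a guard inside RGPassManager::runOnFunction (a sketch only, assuming PMDataManager's isPassDebuggingExecutionsOrMore() is usable in this context; not necessarily the patch that was eventually committed):

if (isPassDebuggingExecutionsOrMore()) {
  // Region::getNameStr() materializes names for unnamed basic blocks, which
  // is what makes it expensive in release builds where names are stripped.
  dumpPassInfo(P, EXECUTION_MSG, ON_REGION_MSG, CurrentRegion->getNameStr());
}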

-Zino

Ok we will post it shortly.

Zino

The patch Zino refers to was committed in 231485. It did not have a large impact on our LNT performance builders (which do not seem to even see the slowdown we try to address here). However, on my laptop I was able to reproduce this performance issue in a cmake release build and Zino's patch fixes it at least for tramp3d-v4 nicely. Johannes, could you check if there is still a performance issue that needs to be addressed?

lib/Transform/ScheduleOptimizer.cpp
58

Why are these renamed?

521

Is this rename intentional?

The patch Zino refers to was committed in 231485. It did not have a large impact on our LNT performance builders (which do not seem to even see the slowdown we try to address here). However, on my laptop I was able to reproduce this performance issue in a cmake release build and Zino's patch fixes it at least for tramp3d-v4 nicely. Johannes, could you check if there is still a performance issue that needs to be addressed?

How can the buildbots compile e.g. tramp3d in 9 sec when for me, with lnt [1,2], it takes >40 sec for one of the source files alone? This is not new, but they perform that well all the time... I'm puzzled here...

[1] --mllvm=-polly --cflag=-O3 in the command-line options of the lnt runtest command

(Polly is linked into clang/opt)

[2] Alternatively, the command with Polly basically disabled (via -polly-only-func):

/home/johannes/projects/polly/llvm-build/bin/clang++ -fno-exceptions -I/home/johannes/mysandbox/nt/test-2015-03-01_14-37-50/sample-0/MultiSource/Benchmarks/tramp3d-v4 -I/home/johannes/repos/llvm-test-suite/MultiSource/Benchmarks/tramp3d-v4 -I/home/johannes/repos/llvm-test-suite/include -I../../../include -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -DNDEBUG -O3 -mllvm -polly -m64 -fomit-frame-pointer -c /home/johannes/repos/llvm-test-suite/MultiSource/Benchmarks/tramp3d-v4/tramp3d-v4.cpp -o /dev/null  -mllvm -polly-only-func=dsadsasadsa -mllvm -polly-allow-nonaffine-branches=false

The patch Zino refers to was committed in 231485. It did not have a large impact on our LNT performance builders (which do not seem to even see the slowdown we try to address here). However, on my laptop I was able to reproduce this performance issue in a cmake release build and Zino's patch fixes it at least for tramp3d-v4 nicely. Johannes, could you check if there is still a performance issue that needs to be addressed?

How can the buildbots compile e.g. tramp3d in 9 sec when for me, with lnt [1,2], it takes >40 sec for one of the source files alone? This is not new, but they perform that well all the time... I'm puzzled here...

Looking at the buildbot history, it seems we never had compile-times below 35 seconds. Can you remember where you got these 9 seconds from?

http://llvm.org/perf/db_default/v4/nts/graph?highlight_run=27014&plot.1349=23.1349.1

Johannes, I also wonder what you plan to do with the patch here. It seems the original compile-time issues that it was meant to fix have been resolved. Are you still planning to submit this patch for other reasons, or should we close this review for now?

Looking at the buildbot history, it seems we never had compile-times below 35 seconds. Can you remember where you got these 9 seconds from?

http://llvm.org/perf/db_default/v4/nts/graph?highlight_run=27014&plot.1349=23.1349.1

My comment is over 2 months old. I cannot remember where I got this number from. Maybe I misread something, or I used a number from a local lnt run.

Johannes, I also wonder what you plan to do with the patch here. It seems the original compile-time issues that it was meant to fix have been resolved. Are you still planning to submit this patch for other reasons, or should we close this review for now?

I'll close this as there doesn't seem to be any interest in it.

jdoerfert abandoned this revision.May 20 2015, 7:05 AM