This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Analysis/
-
llvm/
-
Analysis/
3/4
DependenceAnalysis.h
-
lib/Analysis/
-
Analysis/
8/8
DependenceAnalysis.cpp
-
test/
-
Analysis/DependenceAnalysis/
-
DependenceAnalysis/
-
PreliminaryNoValidityCheckFixedSize.ll
-
SimpleSIVNoValidityCheckFixedSize.ll
-
Transforms/LoopInterchange/
-
LoopInterchange/
1/1
currentLimitation.ll
-
loop-interchange-optimization-remarks.ll
-
profitability.ll

Differential D72178

[DA] Delinearization of fixed-size multi-dimensional arrays
ClosedPublic

Authored by bmahjour on Jan 3 2020, 1:14 PM.

Download Raw Diff

Details

Reviewers

sebpop
fhahn
Meinersbur
dmgreen
hfinkel
grosser
etiotto
bollu

Commits

rG1b811ff8a935: [DA] Delinearization of fixed-size multi-dimensional arrays

Summary

Currently the dependence analysis in LLVM is unable to compute accurate dependence vectors for multi-dimensional fixed size arrays. For example:

#define N 1024
#define M 2048
void foo(int a[N][M]) {                                                                                                                                                                                                                                                                                                                                                                                                               
  for (long i = 0; i < N-1; ++i)                                                                                                                                                                                      
    for (long j = 2; j < M; ++j)                                                                                                                                                                                      
      a[i][j] = a[i+1][j-2];

gives the following output for the dependence between the load of a[i+1][j-2] and the store into a[i][j]:

da analyze - anti [< >]!

While the direction vectors are correct, no dependence distances are computed, as we expect:

da analyze - anti [1 -2]!

This is mainly because the delinearization algorithm in scalar evolution relies on parametric terms to be present in the access functions. In the case of fixed size arrays such parametric terms are not present, but we can use the indexes from GEP instructions to recover the subscripts for each dimension of the arrays.

It appears that https://reviews.llvm.org/D35430 removed the capability to look at GEP instructions early in 2018. The justification for that change appears to be related to the concern over subscripts overlapping into next array dimensions in languages like C/C++. Please note that https://reviews.llvm.org/D62610 added a debug option to address this concern.

Removing support for fixed-size array delinearization resulted in over-pissimization of DependenceAnalysis and reduction of test coverage for both DA and LoopInterchange.

I've also noticed that polly tries to recover fixed-size array subscripts for dependence testing as well. In this patch I add support for fixed-size array delinearization (using the same mechanism that polly uses) under the option introduced in D62610. Two follow up revisions succeed this patch:

In D73995 I've refactored the polly code and moved it to scalar-evolution to make it reuseable by both polly and DA.
In D73998 I've renamed the option to reflect the fact that it controls more than just the delinearization validity checks.

Quite a few DA and loop interchange test cases improve as a result of this change, and it makes writing tests for new and existing transformations easier.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

bmahjour created this revision.Jan 3 2020, 1:14 PM

Herald added a reviewer: bollu. · View Herald TranscriptJan 3 2020, 1:14 PM

Herald added subscribers: llvm-commits, • wuzish, hiraditya. · View Herald Transcript

ping

Could people on the review list please take a look at this patch? I would also appreciate suggestions for who should be a reviewer instead (or in addition) to the current list of reviewers, if there are any.

I think it would be good to split this up into the 3 distinct parts you mention in the description:

re-introduce the checks
rename option
improve logic

llvm/lib/Analysis/DependenceAnalysis.cpp
120	What is the rational for renaming the option? Enabling the option is unsafe and the new name makes it sound a lot more harmless than the original one. Also I think we should retain the wording in the description stating why it is unsafe.
llvm/test/Analysis/DependenceAnalysis/Separability.ll
2 ↗	(On Diff #236114)	Not sure if it's the best idea to enable this option for a bunch of tests, as we miss test coverage for the code path actually enabled by default. I think the runs with -da-assume-inrange-subscripts should be an addition, rather than replacing the existing one.

bmahjour marked 2 inline comments as done.Feb 3 2020, 11:34 AM

bmahjour added inline comments.

llvm/lib/Analysis/DependenceAnalysis.cpp
120	I renamed it because the option is now being used to control more than just the delinearization validity checks. Now it also controls whether fixed size array delinearization happens or not. I can add the words of caution from the old description back.
llvm/test/Analysis/DependenceAnalysis/Separability.ll
2 ↗	(On Diff #236114)	Unfortunately that will cause dual maintenance. IMHO the loss of coverage suffered as a result of disabling delinearization for the old tests are more concerning than missing coverage on inaccurate results of the current default. Having said that I see your point in that we may want to make sure the current default results don't get any more inaccurate than they already are. I really don't see a point in dual maintaining the loop interchange tests though, given that the transform is off by default and there is no point in testing that it fails due to data dependence when there is no real data dependence. Agreed?

@fhahn As per your request I've reduced this patch to just adding support for delinearization of fixed-size arrays. Also restored the default tests and added new ones for when the option is enabled. Two more patches are created to 1. do the refactoring (D73995) 2. rename the option (D73998).

bmahjour added a child revision: D73995: [NFC] [DA] Refactoring getIndexExpressionsFromGEP .Feb 4 2020, 2:29 PM

bmahjour added a child revision: D73998: [DA] renaming the -da-disable-delinearization-checks option.Feb 4 2020, 2:36 PM

bmahjour edited the summary of this revision. (Show Details)Feb 4 2020, 2:42 PM

ping

Meinersbur added inline comments.Feb 17 2020, 12:51 PM

llvm/include/llvm/Analysis/DependenceAnalysis.h
932–949	Could you add some more details that it tries to derive the dimensions from GEP and `tryDelinearizeParametricSize` tries to do delinearization? I mean, that's the reason why these functions are separate.
llvm/lib/Analysis/DependenceAnalysis.cpp
3276–3327	Please add a comment why this is bailing out here. We could also check for GEP's inrange modifier and require it unless `DisableDelinearizationChecks` is set.
3311–3314	I think this has been split into D73995.
3354–3355	[style] LLVM style does not use almost-always-auto
3435	[style] Don't evaluate size() in every iteration
llvm/test/Analysis/DependenceAnalysis/Separability.ll
2 ↗	(On Diff #236114)	I second @fhahn's remark. IMHO regression tests for something unsafe does not happen (here: delinearization without proven safety) are even more important. Regarding maintenance: I too wish that we had something more maintainable than FileCheck.
llvm/test/Transforms/LoopInterchange/currentLimitation.ll
1–3	I think we should continue to also check that interchange does not happen without `-da-disable-delinearization-checks`. Could you add a separate RUN line with another `-check-prefix=`? This also applies to other test cases.

From the description

Removing support for fixed-size array delinearization resulted in over-pissimization of DependenceAnalysis and reduction of test coverage for both DA and LoopInterchange.

I'm not sure it over-pessimized DA. The change was required for correctness.

llvm/test/Analysis/DependenceAnalysis/Separability.ll
2 ↗	(On Diff #236114)	I think it might also be beneficial to just extract the interesting cases into a separate test file, rather than adding just a few additional lines to the existing tests.

bmahjour marked an inline comment as done.Feb 19 2020, 1:06 PM

bmahjour added inline comments.

llvm/lib/Analysis/DependenceAnalysis.cpp
3276–3327	We could also check for GEP's inrange modifier and require it unless DisableDelinearizationChecks is set. We could, however I'd suggest we consider that as a separate patch. There are some peculiarities with GEP's inrange that I need to understand better. In particular, it's not clear to me why the syntax allows `inrange` to appear before multiple indexes, but the `getInRangeIndex()` API only allows a single index to be retrieved. If I understand this conversation https://reviews.llvm.org/D22793?id=65626#inline-194586 correctly, then we can only try to delinearize GEPs when `SrcGEP->getInRangeIndex().getValue() == SrcGEP->getNumIndices()`. Is that true?

Meinersbur added a subscriber: pcc.Feb 19 2020, 1:46 PM

Meinersbur added inline comments.

llvm/lib/Analysis/DependenceAnalysis.cpp
3276–3327	We could, however I'd suggest we consider that as a separate patch. Agreed. Just something to keep in mind when renaming `-da-disable-delinearization-checks` that might not totally disable non-linear GEPs unless set. the getInRangeIndex() API only allows a single index to be retrieved. Yes, and it contradicts the LLVM reference for `inrange`: https://llvm.org/docs/LangRef.html#id231 . It also seems to be allowed for constant expressions currently. <result> = getelementptr inbounds <ty>, <ty>* <ptrval>{, [inrange] <ty> <idx>}* ... if the load or store would access memory outside of the bounds of the element selected by the index marked as inrange. For reference, here is the inline comment from @pcc: No more than one. The rationale is that inrange on a given index will be at least as restrictive as inrange on any earlier index, so there's no point in allowing more than one. I'll document this in the langref. This seems not to have been documented. However, there are other motivations than @pcc had in mind (vtables) and some may allow non-inrange indices after an inrange one. Furthermore, I think it is problematic if we'd implicitly interpret all subscripts after an inrange subscript as well.

bmahjour removed a child revision: D73995: [NFC] [DA] Refactoring getIndexExpressionsFromGEP .Feb 24 2020, 2:47 PM

bmahjour added a parent revision: D73995: [NFC] [DA] Refactoring getIndexExpressionsFromGEP .

Addressed all review comments.

LGTM

llvm/include/llvm/Analysis/DependenceAnalysis.h
930	Is `checkSubscript` supposed to be public now?

This revision is now accepted and ready to land.Feb 25 2020, 3:38 PM

Harbormaster failed remote builds in B47251: Diff 246585!Feb 25 2020, 3:45 PM

bmahjour marked 2 inline comments as done.Feb 26 2020, 6:16 AM

bmahjour added inline comments.

llvm/include/llvm/Analysis/DependenceAnalysis.h
930	No, it is still private. The `private` keyword above was redundant, which is why I removed it.

fix issues found by clang-tidy and clang-format.

Harbormaster completed remote builds in B47337: Diff 246777.Feb 26 2020, 12:08 PM

Meinersbur added inline comments.Feb 26 2020, 9:13 PM

llvm/include/llvm/Analysis/DependenceAnalysis.h
930	OK, I see.

Closed by commit rG1b811ff8a935: [DA] Delinearization of fixed-size multi-dimensional arrays (authored by bmahjour). · Explain WhyFeb 27 2020, 9:51 AM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

DependenceAnalysis.h

20 lines

lib/

Analysis/

DependenceAnalysis.cpp

150 lines

test/

Analysis/

DependenceAnalysis/

PreliminaryNoValidityCheckFixedSize.ll

106 lines

SimpleSIVNoValidityCheckFixedSize.ll

120 lines

Transforms/

LoopInterchange/

currentLimitation.ll

7 lines

loop-interchange-optimization-remarks.ll

41 lines

profitability.ll

10 lines

Diff 246585

llvm/include/llvm/Analysis/DependenceAnalysis.h

Show First 20 Lines • Show All 918 Lines • ▼ Show 20 Lines	const SCEV addToCoefficient(const SCEV Expr,
const Loop *TargetLoop,		const Loop *TargetLoop,
const SCEV *Value) const;		const SCEV *Value) const;

/// updateDirection - Update direction vector entry		/// updateDirection - Update direction vector entry
/// based on the current constraint.		/// based on the current constraint.
void updateDirection(Dependence::DVEntry &Level,		void updateDirection(Dependence::DVEntry &Level,
const Constraint &CurConstraint) const;		const Constraint &CurConstraint) const;

		/// Given a linear access function, tries to recover subscripts
		/// for each dimension of the array element access.
bool tryDelinearize(Instruction Src, Instruction Dst,		bool tryDelinearize(Instruction Src, Instruction Dst,
SmallVectorImpl<Subscript> &Pair);		SmallVectorImpl<Subscript> &Pair);

private:		/// Tries to delinearize access function for a fixed size multi-dimensional
MeinersburUnsubmitted Done Reply Inline Actions Is `checkSubscript` supposed to be public now? Meinersbur: Is `checkSubscript` supposed to be public now?
bmahjourAuthorUnsubmitted Done Reply Inline Actions No, it is still private. The `private` keyword above was redundant, which is why I removed it. bmahjour: No, it is still private. The `private` keyword above was redundant, which is why I removed it.
MeinersburUnsubmitted Not Done Reply Inline Actions OK, I see. Meinersbur: OK, I see.
		/// array, by deriving subscripts from GEP instructions. Returns true upon
		/// success and false otherwise.
		bool tryDelinearizeFixedSize(Instruction Src, Instruction Dst,
		const SCEV *SrcAccessFn,
		const SCEV *DstAccessFn,
		SmallVectorImpl<const SCEV *> &SrcSubscripts,
		SmallVectorImpl<const SCEV *> &DstSubscripts);

		/// Tries to delinearize access function for a multi-dimensional array with
		/// symbolic runtime sizes.
		/// Returns true upon success and false otherwise.
		bool tryDelinearizeParametricSize(
		Instruction Src, Instruction Dst, const SCEV *SrcAccessFn,
		const SCEV DstAccessFn, SmallVectorImpl<const SCEV > &SrcSubscripts,
		SmallVectorImpl<const SCEV *> &DstSubscripts);

/// checkSubscript - Helper function for checkSrcSubscript and		/// checkSubscript - Helper function for checkSrcSubscript and
		MeinersburUnsubmitted Done Reply Inline Actions Could you add some more details that it tries to derive the dimensions from GEP and `tryDelinearizeParametricSize` tries to do delinearization? I mean, that's the reason why these functions are separate. Meinersbur: Could you add some more details that it tries to derive the dimensions from GEP and…
/// checkDstSubscript to avoid duplicate code		/// checkDstSubscript to avoid duplicate code
bool checkSubscript(const SCEV Expr, const Loop LoopNest,		bool checkSubscript(const SCEV Expr, const Loop LoopNest,
SmallBitVector &Loops, bool IsSrc);		SmallBitVector &Loops, bool IsSrc);
}; // class DependenceInfo		}; // class DependenceInfo

/// AnalysisPass to compute dependence information in a function		/// AnalysisPass to compute dependence information in a function
class DependenceAnalysis : public AnalysisInfoMixin<DependenceAnalysis> {		class DependenceAnalysis : public AnalysisInfoMixin<DependenceAnalysis> {
public:		public:
▲ Show 20 Lines • Show All 42 Lines • Show Last 20 Lines

llvm/lib/Analysis/DependenceAnalysis.cpp

Show First 20 Lines • Show All 111 Lines • ▼ Show 20 Lines	Delinearize("da-delinearize", cl::init(true), cl::Hidden, cl::ZeroOrMore,
cl::desc("Try to delinearize array references."));		cl::desc("Try to delinearize array references."));
static cl::opt<bool> DisableDelinearizationChecks(		static cl::opt<bool> DisableDelinearizationChecks(
"da-disable-delinearization-checks", cl::init(false), cl::Hidden,		"da-disable-delinearization-checks", cl::init(false), cl::Hidden,
cl::ZeroOrMore,		cl::ZeroOrMore,
cl::desc(		cl::desc(
"Disable checks that try to statically verify validity of "		"Disable checks that try to statically verify validity of "
"delinearized subscripts. Enabling this option may result in incorrect "		"delinearized subscripts. Enabling this option may result in incorrect "
"dependence vectors for languages that allow the subscript of one "		"dependence vectors for languages that allow the subscript of one "
"dimension to underflow or overflow into another dimension."));		"dimension to underflow or overflow into another dimension."));
		fhahnUnsubmitted Done Reply Inline Actions What is the rational for renaming the option? Enabling the option is unsafe and the new name makes it sound a lot more harmless than the original one. Also I think we should retain the wording in the description stating why it is unsafe. fhahn: What is the rational for renaming the option? Enabling the option is unsafe and the new name…
		bmahjourAuthorUnsubmitted Done Reply Inline Actions I renamed it because the option is now being used to control more than just the delinearization validity checks. Now it also controls whether fixed size array delinearization happens or not. I can add the words of caution from the old description back. bmahjour: I renamed it because the option is now being used to control more than just the delinearization…

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// basics		// basics

DependenceAnalysis::Result		DependenceAnalysis::Result
DependenceAnalysis::run(Function &F, FunctionAnalysisManager &FAM) {		DependenceAnalysis::run(Function &F, FunctionAnalysisManager &FAM) {
auto &AA = FAM.getResult<AAManager>(F);		auto &AA = FAM.getResult<AAManager>(F);
auto &SE = FAM.getResult<ScalarEvolutionAnalysis>(F);		auto &SE = FAM.getResult<ScalarEvolutionAnalysis>(F);
▲ Show 20 Lines • Show All 3,130 Lines • ▼ Show 20 Lines
/// this function flattens the nested recurrences into separate recurrences		/// this function flattens the nested recurrences into separate recurrences
/// for each loop level.		/// for each loop level.
bool DependenceInfo::tryDelinearize(Instruction Src, Instruction Dst,		bool DependenceInfo::tryDelinearize(Instruction Src, Instruction Dst,
SmallVectorImpl<Subscript> &Pair) {		SmallVectorImpl<Subscript> &Pair) {
assert(isLoadOrStore(Src) && "instruction is not load or store");		assert(isLoadOrStore(Src) && "instruction is not load or store");
assert(isLoadOrStore(Dst) && "instruction is not load or store");		assert(isLoadOrStore(Dst) && "instruction is not load or store");
Value *SrcPtr = getLoadStorePointerOperand(Src);		Value *SrcPtr = getLoadStorePointerOperand(Src);
Value *DstPtr = getLoadStorePointerOperand(Dst);		Value *DstPtr = getLoadStorePointerOperand(Dst);

Loop *SrcLoop = LI->getLoopFor(Src->getParent());		Loop *SrcLoop = LI->getLoopFor(Src->getParent());
Loop *DstLoop = LI->getLoopFor(Dst->getParent());		Loop *DstLoop = LI->getLoopFor(Dst->getParent());
		const SCEV *SrcAccessFn = SE->getSCEVAtScope(SrcPtr, SrcLoop);
		const SCEV *DstAccessFn = SE->getSCEVAtScope(DstPtr, DstLoop);
		const SCEVUnknown *SrcBase =
		dyn_cast<SCEVUnknown>(SE->getPointerBase(SrcAccessFn));
		const SCEVUnknown *DstBase =
		dyn_cast<SCEVUnknown>(SE->getPointerBase(DstAccessFn));

		if (!SrcBase \|\| !DstBase \|\| SrcBase != DstBase)
		return false;

		SmallVector<const SCEV *, 4> SrcSubscripts, DstSubscripts;

		if (!tryDelinearizeFixedSize(Src, Dst, SrcAccessFn, DstAccessFn,
		SrcSubscripts, DstSubscripts) &&
		!tryDelinearizeParametricSize(Src, Dst, SrcAccessFn, DstAccessFn,
		SrcSubscripts, DstSubscripts))
		return false;

		int size = SrcSubscripts.size();
		LLVM_DEBUG({
		dbgs() << "\nSrcSubscripts: ";
		for (int i = 0; i < size; i++)
		dbgs() << *SrcSubscripts[i];
		dbgs() << "\nDstSubscripts: ";
		for (int i = 0; i < size; i++)
		dbgs() << *DstSubscripts[i];
		});

		// The delinearization transforms a single-subscript MIV dependence test into
		// a multi-subscript SIV dependence test that is easier to compute. So we
		// resize Pair to contain as many pairs of subscripts as the delinearization
		// has found, and then initialize the pairs following the delinearization.
		Pair.resize(size);
		for (int i = 0; i < size; ++i) {
		Pair[i].Src = SrcSubscripts[i];
		Pair[i].Dst = DstSubscripts[i];
		unifySubscriptType(&Pair[i]);
		}

		return true;
		}

// Below code mimics the code in Delinearization.cpp		bool DependenceInfo::tryDelinearizeFixedSize(
const SCEV *SrcAccessFn =		Instruction Src, Instruction Dst, const SCEV *SrcAccessFn,
SE->getSCEVAtScope(SrcPtr, SrcLoop);		const SCEV DstAccessFn, SmallVectorImpl<const SCEV > &SrcSubscripts,
const SCEV *DstAccessFn =		SmallVectorImpl<const SCEV *> &DstSubscripts) {
		MeinersburUnsubmitted Done Reply Inline Actions I think this has been split into D73995. Meinersbur: I think this has been split into D73995.
SE->getSCEVAtScope(DstPtr, DstLoop);
		// In general we cannot safely assume that the subscripts recovered from GEPs
		// are in the range of values defined for their corresponding array
		// dimensions. For example some C language usage/interpretation make it
		// impossible to verify this at compile-time. As such we give up here unless
		// we can assume that the subscripts do not overlap into neighboring
		// dimensions and that the number of dimensions matches the number of
		// subscripts being recovered.
		if (!DisableDelinearizationChecks)
		return false;

		Value *SrcPtr = getLoadStorePointerOperand(Src);
		Value *DstPtr = getLoadStorePointerOperand(Dst);
		MeinersburUnsubmitted Done Reply Inline Actions Please add a comment why this is bailing out here. We could also check for GEP's inrange modifier and require it unless `DisableDelinearizationChecks` is set. Meinersbur: Please add a comment why this is bailing out here. We could also check for GEP's [[ https…
		bmahjourAuthorUnsubmitted Done Reply Inline Actions We could also check for GEP's inrange modifier and require it unless DisableDelinearizationChecks is set. We could, however I'd suggest we consider that as a separate patch. There are some peculiarities with GEP's inrange that I need to understand better. In particular, it's not clear to me why the syntax allows `inrange` to appear before multiple indexes, but the `getInRangeIndex()` API only allows a single index to be retrieved. If I understand this conversation https://reviews.llvm.org/D22793?id=65626#inline-194586 correctly, then we can only try to delinearize GEPs when `SrcGEP->getInRangeIndex().getValue() == SrcGEP->getNumIndices()`. Is that true? bmahjour: > We could also check for GEP's inrange modifier and require it unless…
		MeinersburUnsubmitted Done Reply Inline Actions We could, however I'd suggest we consider that as a separate patch. Agreed. Just something to keep in mind when renaming `-da-disable-delinearization-checks` that might not totally disable non-linear GEPs unless set. the getInRangeIndex() API only allows a single index to be retrieved. Yes, and it contradicts the LLVM reference for `inrange`: https://llvm.org/docs/LangRef.html#id231 . It also seems to be allowed for constant expressions currently. <result> = getelementptr inbounds <ty>, <ty>* <ptrval>{, [inrange] <ty> <idx>}* ... if the load or store would access memory outside of the bounds of the element selected by the index marked as inrange. For reference, here is the inline comment from @pcc: No more than one. The rationale is that inrange on a given index will be at least as restrictive as inrange on any earlier index, so there's no point in allowing more than one. I'll document this in the langref. This seems not to have been documented. However, there are other motivations than @pcc had in mind (vtables) and some may allow non-inrange indices after an inrange one. Furthermore, I think it is problematic if we'd implicitly interpret all subscripts after an inrange subscript as well. Meinersbur: > We could, however I'd suggest we consider that as a separate patch. Agreed. Just something…
const SCEVUnknown *SrcBase =		const SCEVUnknown *SrcBase =
dyn_cast<SCEVUnknown>(SE->getPointerBase(SrcAccessFn));		dyn_cast<SCEVUnknown>(SE->getPointerBase(SrcAccessFn));
const SCEVUnknown *DstBase =		const SCEVUnknown *DstBase =
dyn_cast<SCEVUnknown>(SE->getPointerBase(DstAccessFn));		dyn_cast<SCEVUnknown>(SE->getPointerBase(DstAccessFn));
		assert(SrcBase && DstBase && SrcBase == DstBase &&
		"expected src and dst scev unknowns to be equal");

if (!SrcBase \|\| !DstBase \|\| SrcBase != DstBase)		// Check the simple case where the array dimensions are fixed size.
		auto *SrcGEP = dyn_cast<GetElementPtrInst>(SrcPtr);
		auto *DstGEP = dyn_cast<GetElementPtrInst>(DstPtr);
		if (!SrcGEP \|\| !DstGEP)
		return false;

		SmallVector<int, 4> SrcSizes, DstSizes;
		SE->getIndexExpressionsFromGEP(SrcGEP, SrcSubscripts, SrcSizes);
		SE->getIndexExpressionsFromGEP(DstGEP, DstSubscripts, DstSizes);

		// Check that the two size arrays are non-empty and equal in length and
		// value.
		if (SrcSizes.empty() \|\| SrcSubscripts.size() <= 1 \|\| SrcSizes.size() != DstSizes.size() \|\|
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - if (SrcSizes.empty() \|\| SrcSubscripts.size() <= 1 \|\| SrcSizes.size() != DstSizes.size() \|\| + if (SrcSizes.empty() \|\| SrcSubscripts.size() <= 1 \|\| + SrcSizes.size() != DstSizes.size() \|\| Lint: Pre-merge checks: clang-format: please reformat the code ``` - if (SrcSizes.empty() \|\| SrcSubscripts.size() <= 1…
		!std::equal(SrcSizes.begin(), SrcSizes.end(), DstSizes.begin())) {
		SrcSubscripts.clear();
		DstSubscripts.clear();
		return false;
		}

		Value *SrcBasePtr = SrcGEP->getOperand(0);
		Value *DstBasePtr = DstGEP->getOperand(0);
		MeinersburUnsubmitted Done Reply Inline Actions [style] LLVM style does not use almost-always-auto Meinersbur: [style] LLVM style does not use almost-always-auto
		while (auto *PCast = dyn_cast<BitCastInst>(SrcBasePtr))
		SrcBasePtr = PCast->getOperand(0);
		while (auto *PCast = dyn_cast<BitCastInst>(DstBasePtr))
		DstBasePtr = PCast->getOperand(0);

		// Check that for identical base pointers we do not miss index offsets
		// that have been added before this GEP is applied.
		if (SrcBasePtr == SrcBase->getValue() &&
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - if (SrcBasePtr == SrcBase->getValue() && - DstBasePtr == DstBase->getValue()) { + if (SrcBasePtr == SrcBase->getValue() && DstBasePtr == DstBase->getValue()) { Lint: Pre-merge checks: clang-format: please reformat the code ``` - if (SrcBasePtr == SrcBase->getValue() &&…
		DstBasePtr == DstBase->getValue()) {
		assert(SrcSubscripts.size() == DstSubscripts.size() &&
		SrcSubscripts.size() == SrcSizes.size() + 1 &&
		"Expected equal number of entries in the list of sizes and "
		"subscripts.");
		LLVM_DEBUG({
		dbgs() << "Delinearized subscripts of fixed-size array\n"
		<< "SrcGEP:" << *SrcGEP << "\n"
		<< "DstGEP:" << *DstGEP << "\n";
		});
		return true;
		}

		SrcSubscripts.clear();
		DstSubscripts.clear();
return false;		return false;
		}

		bool DependenceInfo::tryDelinearizeParametricSize(
		Instruction Src, Instruction Dst, const SCEV *SrcAccessFn,
		const SCEV DstAccessFn, SmallVectorImpl<const SCEV > &SrcSubscripts,
		SmallVectorImpl<const SCEV *> &DstSubscripts) {

		Value *SrcPtr = getLoadStorePointerOperand(Src);
		Value *DstPtr = getLoadStorePointerOperand(Dst);
		const SCEVUnknown *SrcBase =
		dyn_cast<SCEVUnknown>(SE->getPointerBase(SrcAccessFn));
		const SCEVUnknown *DstBase =
		dyn_cast<SCEVUnknown>(SE->getPointerBase(DstAccessFn));
		assert(SrcBase && DstBase && SrcBase == DstBase &&
		"expected src and dst scev unknowns to be equal");

const SCEV *ElementSize = SE->getElementSize(Src);		const SCEV *ElementSize = SE->getElementSize(Src);
if (ElementSize != SE->getElementSize(Dst))		if (ElementSize != SE->getElementSize(Dst))
return false;		return false;

const SCEV *SrcSCEV = SE->getMinusSCEV(SrcAccessFn, SrcBase);		const SCEV *SrcSCEV = SE->getMinusSCEV(SrcAccessFn, SrcBase);
const SCEV *DstSCEV = SE->getMinusSCEV(DstAccessFn, DstBase);		const SCEV *DstSCEV = SE->getMinusSCEV(DstAccessFn, DstBase);

const SCEVAddRecExpr *SrcAR = dyn_cast<SCEVAddRecExpr>(SrcSCEV);		const SCEVAddRecExpr *SrcAR = dyn_cast<SCEVAddRecExpr>(SrcSCEV);
const SCEVAddRecExpr *DstAR = dyn_cast<SCEVAddRecExpr>(DstSCEV);		const SCEVAddRecExpr *DstAR = dyn_cast<SCEVAddRecExpr>(DstSCEV);
if (!SrcAR \|\| !DstAR \|\| !SrcAR->isAffine() \|\| !DstAR->isAffine())		if (!SrcAR \|\| !DstAR \|\| !SrcAR->isAffine() \|\| !DstAR->isAffine())
return false;		return false;

// First step: collect parametric terms in both array references.		// First step: collect parametric terms in both array references.
SmallVector<const SCEV *, 4> Terms;		SmallVector<const SCEV *, 4> Terms;
SE->collectParametricTerms(SrcAR, Terms);		SE->collectParametricTerms(SrcAR, Terms);
SE->collectParametricTerms(DstAR, Terms);		SE->collectParametricTerms(DstAR, Terms);

// Second step: find subscript sizes.		// Second step: find subscript sizes.
SmallVector<const SCEV *, 4> Sizes;		SmallVector<const SCEV *, 4> Sizes;
SE->findArrayDimensions(Terms, Sizes, ElementSize);		SE->findArrayDimensions(Terms, Sizes, ElementSize);

// Third step: compute the access functions for each subscript.		// Third step: compute the access functions for each subscript.
SmallVector<const SCEV *, 4> SrcSubscripts, DstSubscripts;
SE->computeAccessFunctions(SrcAR, SrcSubscripts, Sizes);		SE->computeAccessFunctions(SrcAR, SrcSubscripts, Sizes);
SE->computeAccessFunctions(DstAR, DstSubscripts, Sizes);		SE->computeAccessFunctions(DstAR, DstSubscripts, Sizes);

// Fail when there is only a subscript: that's a linearized access function.		// Fail when there is only a subscript: that's a linearized access function.
if (SrcSubscripts.size() < 2 \|\| DstSubscripts.size() < 2 \|\|		if (SrcSubscripts.size() < 2 \|\| DstSubscripts.size() < 2 \|\|
SrcSubscripts.size() != DstSubscripts.size())		SrcSubscripts.size() != DstSubscripts.size())
return false;		return false;

int size = SrcSubscripts.size();		size_t size = SrcSubscripts.size();

// Statically check that the array bounds are in-range. The first subscript we		// Statically check that the array bounds are in-range. The first subscript we
// don't have a size for and it cannot overflow into another subscript, so is		// don't have a size for and it cannot overflow into another subscript, so is
// always safe. The others need to be 0 <= subscript[i] < bound, for both src		// always safe. The others need to be 0 <= subscript[i] < bound, for both src
// and dst.		// and dst.
// FIXME: It may be better to record these sizes and add them as constraints		// FIXME: It may be better to record these sizes and add them as constraints
// to the dependency checks.		// to the dependency checks.
if (!DisableDelinearizationChecks)		if (!DisableDelinearizationChecks)
for (int i = 1; i < size; ++i) {		for (size_t i = 1; i < size; ++i) {
		MeinersburUnsubmitted Done Reply Inline Actions [style] Don't evaluate size() in every iteration Meinersbur: [style] [[ https://llvm.org/docs/CodingStandards.html#don-t-evaluate-end-every-time-through-a…
if (!isKnownNonNegative(SrcSubscripts[i], SrcPtr))		if (!isKnownNonNegative(SrcSubscripts[i], SrcPtr))
return false;		return false;

if (!isKnownLessThan(SrcSubscripts[i], Sizes[i - 1]))		if (!isKnownLessThan(SrcSubscripts[i], Sizes[i - 1]))
return false;		return false;

if (!isKnownNonNegative(DstSubscripts[i], DstPtr))		if (!isKnownNonNegative(DstSubscripts[i], DstPtr))
return false;		return false;

if (!isKnownLessThan(DstSubscripts[i], Sizes[i - 1]))		if (!isKnownLessThan(DstSubscripts[i], Sizes[i - 1]))
return false;		return false;
}		}

LLVM_DEBUG({
dbgs() << "\nSrcSubscripts: ";
for (int i = 0; i < size; i++)
dbgs() << *SrcSubscripts[i];
dbgs() << "\nDstSubscripts: ";
for (int i = 0; i < size; i++)
dbgs() << *DstSubscripts[i];
});

// The delinearization transforms a single-subscript MIV dependence test into
// a multi-subscript SIV dependence test that is easier to compute. So we
// resize Pair to contain as many pairs of subscripts as the delinearization
// has found, and then initialize the pairs following the delinearization.
Pair.resize(size);
for (int i = 0; i < size; ++i) {
Pair[i].Src = SrcSubscripts[i];
Pair[i].Dst = DstSubscripts[i];
unifySubscriptType(&Pair[i]);
}

return true;		return true;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef NDEBUG		#ifndef NDEBUG
// For debugging purposes, dump a small bit vector to dbgs().		// For debugging purposes, dump a small bit vector to dbgs().
static void dumpSmallBitVector(SmallBitVector &BV) {		static void dumpSmallBitVector(SmallBitVector &BV) {
▲ Show 20 Lines • Show All 637 Lines • Show Last 20 Lines

llvm/test/Analysis/DependenceAnalysis/PreliminaryNoValidityCheckFixedSize.ll

This file was added.

				; RUN: opt < %s -disable-output "-passes=print<da>" -aa-pipeline=basic-aa 2>&1 \
				; RUN: -da-disable-delinearization-checks \| FileCheck %s

				target datalayout = "e-p:64:64:64-i1:8:8-i8:8:8-i16:16:16-i32:32:32-i64:64:64-f32:32:32-f64:64:64-v64:64:64-v128:128:128-a0:0:64-s0:64:64-f80:128:128-n8:16:32:64-S128"
				target triple = "x86_64-apple-macosx10.6.0"

				;; for (long int i = 0; i < n; i++) {
				;; for (long int j = 0; j < n; j++) {
				;; for (long int k = 0; k < n; k++) {
				;; A[i][j][k] = i;
				;; }
				;; for (long int k = 0; k < n; k++) {
				;; *B++ = A[i + 3][j + 2][k + 1];

				define void @p2(i64 %n, [100 x [100 x i64]]* %A, i64* %B) nounwind uwtable ssp {
				entry:
				%cmp10 = icmp sgt i64 %n, 0
				br i1 %cmp10, label %for.cond1.preheader.preheader, label %for.end26

				; CHECK-LABEL: p2
				; CHECK: da analyze - none!
				; CHECK: da analyze - flow [-3 -2]!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - none!
				; CHECK: da analyze - confused!
				; CHECK: da analyze - output [* * *]!

				for.cond1.preheader.preheader: ; preds = %entry
				br label %for.cond1.preheader

				for.cond1.preheader: ; preds = %for.cond1.preheader.preheader, %for.inc24
				%B.addr.012 = phi i64* [ %B.addr.1.lcssa, %for.inc24 ], [ %B, %for.cond1.preheader.preheader ]
				%i.011 = phi i64 [ %inc25, %for.inc24 ], [ 0, %for.cond1.preheader.preheader ]
				%cmp26 = icmp sgt i64 %n, 0
				br i1 %cmp26, label %for.cond4.preheader.preheader, label %for.inc24

				for.cond4.preheader.preheader: ; preds = %for.cond1.preheader
				br label %for.cond4.preheader

				for.cond4.preheader: ; preds = %for.cond4.preheader.preheader, %for.inc21
				%B.addr.18 = phi i64* [ %B.addr.2.lcssa, %for.inc21 ], [ %B.addr.012, %for.cond4.preheader.preheader ]
				%j.07 = phi i64 [ %inc22, %for.inc21 ], [ 0, %for.cond4.preheader.preheader ]
				%cmp51 = icmp sgt i64 %n, 0
				br i1 %cmp51, label %for.body6.preheader, label %for.cond10.loopexit

				for.body6.preheader: ; preds = %for.cond4.preheader
				br label %for.body6

				for.body6: ; preds = %for.body6.preheader, %for.body6
				%k.02 = phi i64 [ %inc, %for.body6 ], [ 0, %for.body6.preheader ]
				%arrayidx8 = getelementptr inbounds [100 x [100 x i64]], [100 x [100 x i64]]* %A, i64 %i.011, i64 %j.07, i64 %k.02
				store i64 %i.011, i64* %arrayidx8, align 8
				%inc = add nsw i64 %k.02, 1
				%exitcond13 = icmp ne i64 %inc, %n
				br i1 %exitcond13, label %for.body6, label %for.cond10.loopexit.loopexit

				for.cond10.loopexit.loopexit: ; preds = %for.body6
				br label %for.cond10.loopexit

				for.cond10.loopexit: ; preds = %for.cond10.loopexit.loopexit, %for.cond4.preheader
				%cmp113 = icmp sgt i64 %n, 0
				br i1 %cmp113, label %for.body12.preheader, label %for.inc21

				for.body12.preheader: ; preds = %for.cond10.loopexit
				br label %for.body12

				for.body12: ; preds = %for.body12.preheader, %for.body12
				%k9.05 = phi i64 [ %inc19, %for.body12 ], [ 0, %for.body12.preheader ]
				%B.addr.24 = phi i64* [ %incdec.ptr, %for.body12 ], [ %B.addr.18, %for.body12.preheader ]
				%add = add nsw i64 %k9.05, 1
				%add13 = add nsw i64 %j.07, 2
				%add14 = add nsw i64 %i.011, 3
				%arrayidx17 = getelementptr inbounds [100 x [100 x i64]], [100 x [100 x i64]]* %A, i64 %add14, i64 %add13, i64 %add
				%0 = load i64, i64* %arrayidx17, align 8
				%incdec.ptr = getelementptr inbounds i64, i64* %B.addr.24, i64 1
				store i64 %0, i64* %B.addr.24, align 8
				%inc19 = add nsw i64 %k9.05, 1
				%exitcond = icmp ne i64 %inc19, %n
				br i1 %exitcond, label %for.body12, label %for.inc21.loopexit

				for.inc21.loopexit: ; preds = %for.body12
				%scevgep = getelementptr i64, i64* %B.addr.18, i64 %n
				br label %for.inc21

				for.inc21: ; preds = %for.inc21.loopexit, %for.cond10.loopexit
				%B.addr.2.lcssa = phi i64* [ %B.addr.18, %for.cond10.loopexit ], [ %scevgep, %for.inc21.loopexit ]
				%inc22 = add nsw i64 %j.07, 1
				%exitcond14 = icmp ne i64 %inc22, %n
				br i1 %exitcond14, label %for.cond4.preheader, label %for.inc24.loopexit

				for.inc24.loopexit: ; preds = %for.inc21
				%B.addr.2.lcssa.lcssa = phi i64* [ %B.addr.2.lcssa, %for.inc21 ]
				br label %for.inc24

				for.inc24: ; preds = %for.inc24.loopexit, %for.cond1.preheader
				%B.addr.1.lcssa = phi i64* [ %B.addr.012, %for.cond1.preheader ], [ %B.addr.2.lcssa.lcssa, %for.inc24.loopexit ]
				%inc25 = add nsw i64 %i.011, 1
				%exitcond15 = icmp ne i64 %inc25, %n
				br i1 %exitcond15, label %for.cond1.preheader, label %for.end26.loopexit

				for.end26.loopexit: ; preds = %for.inc24
				br label %for.end26

				for.end26: ; preds = %for.end26.loopexit, %entry
				ret void
				}

llvm/test/Analysis/DependenceAnalysis/SimpleSIVNoValidityCheckFixedSize.ll

This file was added.

				; RUN: opt < %s -disable-output -passes="print<da>" \
				; RUN: -da-disable-delinearization-checks 2>&1 \| FileCheck %s
				; RUN: opt < %s -da -analyze -da-disable-delinearization-checks \| FileCheck %s

				; CHECK-LABEL: t1
				; CHECK: da analyze - none!
				; CHECK: da analyze - consistent anti [1 -2]!
				; CHECK: da analyze - none!

				;; #define N 1024
				;; #define M 2048
				;; void t1(int a[N][M]) {
				;; for (int i = 0; i < N-1; ++i)
				;; for (int j = 2; j < M; ++j)
				;; a[i][j] = a[i+1][j-2];
				;; }

				define void @t1([2048 x i32]* %a) {
				entry:
				br label %for.body

				for.body: ; preds = %entry, %for.inc11
				%indvars.iv4 = phi i64 [ 0, %entry ], [ %indvars.iv.next5, %for.inc11 ]
				br label %for.body4

				for.body4: ; preds = %for.body, %for.body4
				%indvars.iv = phi i64 [ 2, %for.body ], [ %indvars.iv.next, %for.body4 ]
				%0 = add nuw nsw i64 %indvars.iv4, 1
				%1 = add nsw i64 %indvars.iv, -2
				%arrayidx6 = getelementptr inbounds [2048 x i32], [2048 x i32]* %a, i64 %0, i64 %1
				%2 = load i32, i32* %arrayidx6, align 4
				%arrayidx10 = getelementptr inbounds [2048 x i32], [2048 x i32]* %a, i64 %indvars.iv4, i64 %indvars.iv
				store i32 %2, i32* %arrayidx10, align 4
				%indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
				%exitcond = icmp ne i64 %indvars.iv.next, 2048
				br i1 %exitcond, label %for.body4, label %for.inc11

				for.inc11: ; preds = %for.body4
				%indvars.iv.next5 = add nuw nsw i64 %indvars.iv4, 1
				%exitcond7 = icmp ne i64 %indvars.iv.next5, 1023
				br i1 %exitcond7, label %for.body, label %for.end13

				for.end13: ; preds = %for.inc11
				ret void
				}


				; CHECK-LABEL: t2
				; CHECK: da analyze - none!
				; CHECK: da analyze - consistent anti [1 -2 0 -3 2]!
				; CHECK: da analyze - none!

				;; #define N 1024
				;; #define M 2048
				;; void t2(int a[][N][N][N][M]) {
				;; for (int i1 = 0; i1 < N-1; ++i1)
				;; for (int i2 = 2; i2 < N; ++i2)
				;; for (int i3 = 0; i3 < N; ++i3)
				;; for (int i4 = 3; i4 < N; ++i4)
				;; for (int i5 = 0; i5 < M-2; ++i5)
				;; a[i1][i2][i3][i4][i5] = a[i1+1][i2-2][i3][i4-3][i5+2];
				;; }

				define void @t2([1024 x [1024 x [1024 x [2048 x i32]]]]* %a) {
				entry:
				br label %for.body

				for.body: ; preds = %entry, %for.inc46
				%indvars.iv18 = phi i64 [ 0, %entry ], [ %indvars.iv.next19, %for.inc46 ]
				br label %for.body4

				for.body4: ; preds = %for.body, %for.inc43
				%indvars.iv14 = phi i64 [ 2, %for.body ], [ %indvars.iv.next15, %for.inc43 ]
				br label %for.body8

				for.body8: ; preds = %for.body4, %for.inc40
				%indvars.iv11 = phi i64 [ 0, %for.body4 ], [ %indvars.iv.next12, %for.inc40 ]
				br label %for.body12

				for.body12: ; preds = %for.body8, %for.inc37
				%indvars.iv7 = phi i64 [ 3, %for.body8 ], [ %indvars.iv.next8, %for.inc37 ]
				br label %for.body16

				for.body16: ; preds = %for.body12, %for.body16
				%indvars.iv = phi i64 [ 0, %for.body12 ], [ %indvars.iv.next, %for.body16 ]
				%0 = add nuw nsw i64 %indvars.iv18, 1
				%1 = add nsw i64 %indvars.iv14, -2
				%2 = add nsw i64 %indvars.iv7, -3
				%3 = add nuw nsw i64 %indvars.iv, 2
				%arrayidx26 = getelementptr inbounds [1024 x [1024 x [1024 x [2048 x i32]]]], [1024 x [1024 x [1024 x [2048 x i32]]]]* %a, i64 %0, i64 %1, i64 %indvars.iv11, i64 %2, i64 %3
				%4 = load i32, i32* %arrayidx26, align 4
				%arrayidx36 = getelementptr inbounds [1024 x [1024 x [1024 x [2048 x i32]]]], [1024 x [1024 x [1024 x [2048 x i32]]]]* %a, i64 %indvars.iv18, i64 %indvars.iv14, i64 %indvars.iv11, i64 %indvars.iv7, i64 %indvars.iv
				store i32 %4, i32* %arrayidx36, align 4
				%indvars.iv.next = add nuw nsw i64 %indvars.iv, 1
				%exitcond = icmp ne i64 %indvars.iv.next, 2046
				br i1 %exitcond, label %for.body16, label %for.inc37

				for.inc37: ; preds = %for.body16
				%indvars.iv.next8 = add nuw nsw i64 %indvars.iv7, 1
				%exitcond10 = icmp ne i64 %indvars.iv.next8, 1024
				br i1 %exitcond10, label %for.body12, label %for.inc40

				for.inc40: ; preds = %for.inc37
				%indvars.iv.next12 = add nuw nsw i64 %indvars.iv11, 1
				%exitcond13 = icmp ne i64 %indvars.iv.next12, 1024
				br i1 %exitcond13, label %for.body8, label %for.inc43

				for.inc43: ; preds = %for.inc40
				%indvars.iv.next15 = add nuw nsw i64 %indvars.iv14, 1
				%exitcond17 = icmp ne i64 %indvars.iv.next15, 1024
				br i1 %exitcond17, label %for.body4, label %for.inc46

				for.inc46: ; preds = %for.inc43
				%indvars.iv.next19 = add nuw nsw i64 %indvars.iv18, 1
				%exitcond21 = icmp ne i64 %indvars.iv.next19, 1023
				br i1 %exitcond21, label %for.body, label %for.end48

				for.end48: ; preds = %for.inc46
				ret void
				}

llvm/test/Transforms/LoopInterchange/currentLimitation.ll

	; RUN: opt < %s -basicaa -loop-interchange -pass-remarks-missed='loop-interchange' \			; RUN: opt < %s -basicaa -loop-interchange -pass-remarks-missed='loop-interchange' \
	; RUN: -pass-remarks-output=%t -verify-loop-info -verify-dom-info -S \| FileCheck -check-prefix=IR %s			; RUN: -pass-remarks-output=%t -verify-loop-info -verify-dom-info -S \| FileCheck -check-prefix=IR %s
	; RUN: FileCheck --input-file=%t %s			; RUN: FileCheck --input-file=%t %s
				MeinersburUnsubmitted Done Reply Inline Actions I think we should continue to also check that interchange does not happen without `-da-disable-delinearization-checks`. Could you add a separate RUN line with another `-check-prefix=`? This also applies to other test cases. Meinersbur: I think we should continue to also check that interchange does not happen without `-da-disable…

				; RUN: opt < %s -basicaa -loop-interchange -pass-remarks-missed='loop-interchange' \
				; RUN: -da-disable-delinearization-checks -pass-remarks-output=%t \
				; RUN: -verify-loop-info -verify-dom-info -S \| FileCheck -check-prefix=IR %s
				; RUN: FileCheck --check-prefix=DELIN --input-file=%t %s

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	@A = common global [100 x [100 x i32]] zeroinitializer			@A = common global [100 x [100 x i32]] zeroinitializer
	@B = common global [100 x [100 x [100 x i32]]] zeroinitializer			@B = common global [100 x [100 x [100 x i32]]] zeroinitializer
	@C = common global [100 x [100 x i64]] zeroinitializer			@C = common global [100 x [100 x i64]] zeroinitializer

	;;--------------------------------------Test case 01------------------------------------			;;--------------------------------------Test case 01------------------------------------
	;; [FIXME] This loop though valid is currently not interchanged due to the limitation that we cannot split the inner loop latch due to multiple use of inner induction			;; [FIXME] This loop though valid is currently not interchanged due to the limitation that we cannot split the inner loop latch due to multiple use of inner induction
	;; variable.(used to increment the loop counter and to access A[j+1][i+1]			;; variable.(used to increment the loop counter and to access A[j+1][i+1]
	;; for(int i=0;i<N-1;i++)			;; for(int i=0;i<N-1;i++)
	;; for(int j=1;j<N-1;j++)			;; for(int j=1;j<N-1;j++)
	;; A[j+1][i+1] = A[j+1][i+1] + k;			;; A[j+1][i+1] = A[j+1][i+1] + k;

	; FIXME: Currently fails because of DA changes.
	; IR-LABEL: @interchange_01			; IR-LABEL: @interchange_01
	; IR-NOT: split			; IR-NOT: split

	; CHECK: Name: Dependence			; CHECK: Name: Dependence
				; DELIN: Name: UnsupportedInsBetweenInduction
	; CHECK-NEXT: Function: interchange_01			; CHECK-NEXT: Function: interchange_01

	define void @interchange_01(i32 %k, i32 %N) {			define void @interchange_01(i32 %k, i32 %N) {
	entry:			entry:
	%sub = add nsw i32 %N, -1			%sub = add nsw i32 %N, -1
	%cmp26 = icmp sgt i32 %N, 1			%cmp26 = icmp sgt i32 %N, 1
	br i1 %cmp26, label %for.cond1.preheader.lr.ph, label %for.end17			br i1 %cmp26, label %for.cond1.preheader.lr.ph, label %for.end17

	▲ Show 20 Lines • Show All 66 Lines • Show Last 20 Lines

llvm/test/Transforms/LoopInterchange/loop-interchange-optimization-remarks.ll

	; Test optimization remarks generated by the LoopInterchange pass.			; Test optimization remarks generated by the LoopInterchange pass.
	;			;
	; RUN: opt < %s -basicaa -loop-interchange -verify-dom-info -verify-loop-info \			; RUN: opt < %s -basicaa -loop-interchange -verify-dom-info -verify-loop-info \
	; RUN: -pass-remarks-output=%t -pass-remarks-missed='loop-interchange' \			; RUN: -pass-remarks-output=%t -pass-remarks-missed='loop-interchange' \
	; RUN: -pass-remarks='loop-interchange' -S			; RUN: -pass-remarks='loop-interchange' -S
	; RUN: cat %t \| FileCheck %s			; RUN: cat %t \| FileCheck %s

				; RUN: opt < %s -basicaa -loop-interchange -verify-dom-info -verify-loop-info \
				; RUN: -pass-remarks-output=%t -pass-remarks-missed='loop-interchange' \
				; RUN: -pass-remarks='loop-interchange' -S -da-disable-delinearization-checks
				; RUN: cat %t \| FileCheck --check-prefix=DELIN %s

	@A = common global [100 x [100 x i32]] zeroinitializer			@A = common global [100 x [100 x i32]] zeroinitializer
	@B = common global [100 x [100 x i32]] zeroinitializer			@B = common global [100 x [100 x i32]] zeroinitializer
	@C = common global [100 x i32] zeroinitializer			@C = common global [100 x i32] zeroinitializer

	;;---------------------------------------Test case 01---------------------------------			;;---------------------------------------Test case 01---------------------------------
	;; Loops interchange is not profitable.			;; Loops interchange is not profitable.
	;; for(int i=1;i<N;i++)			;; for(int i=1;i<N;i++)
	;; for(int j=1;j<N;j++)			;; for(int j=1;j<N;j++)
	Show All 40 Lines
	; CHECK: --- !Missed			; CHECK: --- !Missed
	; CHECK-NEXT: Pass: loop-interchange			; CHECK-NEXT: Pass: loop-interchange
	; CHECK-NEXT: Name: Dependence			; CHECK-NEXT: Name: Dependence
	; CHECK-NEXT: Function: test01			; CHECK-NEXT: Function: test01
	; CHECK-NEXT: Args:			; CHECK-NEXT: Args:
	; CHECK-NEXT: - String: Cannot interchange loops due to dependences.			; CHECK-NEXT: - String: Cannot interchange loops due to dependences.
	; CHECK-NEXT: ...			; CHECK-NEXT: ...

				; DELIN: --- !Missed
				; DELIN-NEXT: Pass: loop-interchange
				; DELIN-NEXT: Name: InterchangeNotProfitable
				; DELIN-NEXT: Function: test01
				; DELIN-NEXT: Args:
				; DELIN-NEXT: - String: 'Interchanging loops is too costly (cost='
				; DELIN-NEXT: - Cost: '2'
				; DELIN-NEXT: - String: ', threshold='
				; DELIN-NEXT: - Threshold: '0'
				; DELIN-NEXT: - String: ') and it does not improve parallelism.'
				; DELIN-NEXT: ...

	;;--------------------------------------Test case 02------------------------------------			;;--------------------------------------Test case 02------------------------------------
	;; [FIXME] This loop though valid is currently not interchanged due to the			;; [FIXME] This loop though valid is currently not interchanged due to the
	;; limitation that we cannot split the inner loop latch due to multiple use of inner induction			;; limitation that we cannot split the inner loop latch due to multiple use of inner induction
	;; variable.(used to increment the loop counter and to access A[j+1][i+1]			;; variable.(used to increment the loop counter and to access A[j+1][i+1]
	;; for(int i=0;i<N-1;i++)			;; for(int i=0;i<N-1;i++)
	;; for(int j=1;j<N-1;j++)			;; for(int j=1;j<N-1;j++)
	;; A[j+1][i+1] = A[j+1][i+1] + k;			;; A[j+1][i+1] = A[j+1][i+1] + k;

	Show All 36 Lines
	; CHECK: --- !Missed			; CHECK: --- !Missed
	; CHECK-NEXT: Pass: loop-interchange			; CHECK-NEXT: Pass: loop-interchange
	; CHECK-NEXT: Name: Dependence			; CHECK-NEXT: Name: Dependence
	; CHECK-NEXT: Function: test02			; CHECK-NEXT: Function: test02
	; CHECK-NEXT: Args:			; CHECK-NEXT: Args:
	; CHECK-NEXT: - String: Cannot interchange loops due to dependences.			; CHECK-NEXT: - String: Cannot interchange loops due to dependences.
	; CHECK-NEXT: ...			; CHECK-NEXT: ...

				; DELIN: --- !Missed
				; DELIN-NEXT: Pass: loop-interchange
				; DELIN-NEXT: Name: UnsupportedInsBetweenInduction
				; DELIN-NEXT: Function: test02
				; DELIN-NEXT: Args:
				; DELIN-NEXT: - String: Found unsupported instruction between induction variable increment and branch.
				; DELIN-NEXT: ...

	;;-----------------------------------Test case 03-------------------------------			;;-----------------------------------Test case 03-------------------------------
	;; Test to make sure we can handle output dependencies.			;; Test to make sure we can handle output dependencies.
	;;			;;
	;; for (int i = 0; i < 2; ++i)			;; for (int i = 0; i < 2; ++i)
	;; for(int j = 0; j < 3; ++j) {			;; for(int j = 0; j < 3; ++j) {
	;; A[j][i] = i;			;; A[j][i] = i;
	;; A[j][i+1] = j;			;; A[j][i+1] = j;
	;; }			;; }
	Show All 32 Lines
	; CHECK: --- !Missed			; CHECK: --- !Missed
	; CHECK-NEXT: Pass: loop-interchange			; CHECK-NEXT: Pass: loop-interchange
	; CHECK-NEXT: Name: Dependence			; CHECK-NEXT: Name: Dependence
	; CHECK-NEXT: Function: test03			; CHECK-NEXT: Function: test03
	; CHECK-NEXT: Args:			; CHECK-NEXT: Args:
	; CHECK-NEXT: - String: Cannot interchange loops due to dependences.			; CHECK-NEXT: - String: Cannot interchange loops due to dependences.
	; CHECK-NEXT: ...			; CHECK-NEXT: ...

				; DELIN: --- !Passed
				; DELIN-NEXT: Pass: loop-interchange
				; DELIN-NEXT: Name: Interchanged
				; DELIN-NEXT: Function: test03
				; DELIN-NEXT: Args:
				; DELIN-NEXT: - String: Loop interchanged with enclosing loop.
				; DELIN-NEXT: ...

	;;--------------------------------------Test case 04-------------------------------------			;;--------------------------------------Test case 04-------------------------------------
	;; Loops not tightly nested are not interchanged			;; Loops not tightly nested are not interchanged
	;; for(int j=0;j<N;j++) {			;; for(int j=0;j<N;j++) {
	;; B[j] = j+k;			;; B[j] = j+k;
	;; for(int i=0;i<N;i++)			;; for(int i=0;i<N;i++)
	;; A[j][i] = A[j][i]+B[j];			;; A[j][i] = A[j][i]+B[j];
	;; }			;; }

	Show All 38 Lines

	; CHECK: --- !Missed			; CHECK: --- !Missed
	; CHECK-NEXT: Pass: loop-interchange			; CHECK-NEXT: Pass: loop-interchange
	; CHECK-NEXT: Name: Dependence			; CHECK-NEXT: Name: Dependence
	; CHECK-NEXT: Function: test04			; CHECK-NEXT: Function: test04
	; CHECK-NEXT: Args:			; CHECK-NEXT: Args:
	; CHECK-NEXT: - String: Cannot interchange loops due to dependences.			; CHECK-NEXT: - String: Cannot interchange loops due to dependences.
	; CHECK-NEXT: ...			; CHECK-NEXT: ...

				; DELIN: --- !Missed
				; DELIN-NEXT: Pass: loop-interchange
				; DELIN-NEXT: Name: NotTightlyNested
				; DELIN-NEXT: Function: test04
				; DELIN-NEXT: Args:
				; DELIN-NEXT: - String: Cannot interchange loops because they are not tightly nested.
				; DELIN-NEXT: ...

llvm/test/Transforms/LoopInterchange/profitability.ll

	; RUN: opt < %s -loop-interchange -pass-remarks-output=%t -verify-dom-info -verify-loop-info \			; RUN: opt < %s -loop-interchange -pass-remarks-output=%t -verify-dom-info -verify-loop-info \
	; RUN: -pass-remarks=loop-interchange -pass-remarks-missed=loop-interchange			; RUN: -pass-remarks=loop-interchange -pass-remarks-missed=loop-interchange
	; RUN: FileCheck -input-file %t %s			; RUN: FileCheck -input-file %t %s

				; RUN: opt < %s -loop-interchange -pass-remarks-output=%t -verify-dom-info -verify-loop-info \
				; RUN: -pass-remarks=loop-interchange -pass-remarks-missed=loop-interchange \
				; RUN: -da-disable-delinearization-checks
				; RUN: FileCheck --check-prefix=DELIN -input-file %t %s

	;; We test profitability model in these test cases.			;; We test profitability model in these test cases.

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	@A = common global [100 x [100 x i32]] zeroinitializer			@A = common global [100 x [100 x i32]] zeroinitializer
	@B = common global [100 x [100 x i32]] zeroinitializer			@B = common global [100 x [100 x i32]] zeroinitializer

	;;---------------------------------------Test case 01---------------------------------			;;---------------------------------------Test case 01---------------------------------
	;; Loops interchange will result in code vectorization and hence profitable. Check for interchange.			;; Loops interchange will result in code vectorization and hence profitable. Check for interchange.
	;; for(int i=1;i<100;i++)			;; for(int i=1;i<100;i++)
	;; for(int j=1;j<100;j++)			;; for(int j=1;j<100;j++)
	;; A[j][i] = A[j - 1][i] + B[j][i];			;; A[j][i] = A[j - 1][i] + B[j][i];
	;; FIXME: DA misses this case after D35430

	; CHECK: Name: Dependence			; CHECK: Name: Dependence
	; CHECK-NEXT: Function: interchange_01			; CHECK-NEXT: Function: interchange_01

				; DELIN: Name: Interchanged
				; DELIN-NEXT: Function: interchange_01

	define void @interchange_01() {			define void @interchange_01() {
	entry:			entry:
	br label %for2.preheader			br label %for2.preheader

	for2.preheader:			for2.preheader:
	%i30 = phi i64 [ 1, %entry ], [ %i.next31, %for1.inc14 ]			%i30 = phi i64 [ 1, %entry ], [ %i.next31, %for1.inc14 ]
	br label %for2			br label %for2

	▲ Show 20 Lines • Show All 136 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[DA] Delinearization of fixed-size multi-dimensional arraysClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 246585

llvm/include/llvm/Analysis/DependenceAnalysis.h

llvm/lib/Analysis/DependenceAnalysis.cpp

llvm/test/Analysis/DependenceAnalysis/PreliminaryNoValidityCheckFixedSize.ll

llvm/test/Analysis/DependenceAnalysis/SimpleSIVNoValidityCheckFixedSize.ll

llvm/test/Transforms/LoopInterchange/currentLimitation.ll

llvm/test/Transforms/LoopInterchange/loop-interchange-optimization-remarks.ll

llvm/test/Transforms/LoopInterchange/profitability.ll

[DA] Delinearization of fixed-size multi-dimensional arrays
ClosedPublic