This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/lib/Transforms/Scalar/
-
lib/
-
Transforms/
-
Scalar/
2/7
LoopInterchange.cpp

Differential D136277

[LoopInterchange] Simplify DepMatrix to a dependency vector.
AbandonedPublic

Authored by Meinersbur on Oct 19 2022, 11:08 AM.

Download Raw Diff

Details

Reviewers

bmahjour
fhahn
congzhe

Group Reviewers

Restricted Project

Summary

It is not necessary to store all pairwise dependencies when it is sufficient to know which kind of dependencies occured.

This might have some behavioral effects such as when two dependencies can be interchange on their own but combined into a single loop nest they can not. Example:
Dependence 1: [<,=,>] (carried by outer loop)
Dependence 2: [=,<,>] (carried by inner loop)
when combined:
[<=,<=,>]
which would not be considered interchangeable.

In any case, all of our current tests pass after this change and is significantly more efficient (still quadratic in the number of memory instructions, because we have to query DA for each pair)

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

Meinersbur created this revision.Oct 19 2022, 11:08 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 19 2022, 11:08 AM

Herald added a subscriber: hiraditya. · View Herald Transcript

Meinersbur requested review of this revision.Oct 19 2022, 11:08 AM

Herald added a project: Restricted Project. · View Herald TranscriptOct 19 2022, 11:08 AM

Herald added a subscriber: llvm-commits. · View Herald Transcript

Harbormaster completed remote builds in B193052: Diff 468975.Oct 19 2022, 12:26 PM

This would collapse two dimensions of the dependency matrix into one without providing a way to recover the individual dependency vectors, and as you pointed out in the description it could make some interchangeable cases look illegal. Another example is if we have [S, >, = ] and [S, <, =], we can safely interchange the two inner loops (by factoring out the S from the beginning of both vectors), but after merging them we get [S, <>, = ] and now the inner two loops can no longer be interchanged.

Is the size of dependency matrix proving to be problematic for certain workloads? If so, maybe looking at some concrete examples can help solve the problem in a more effective way (eg. considering sparse data structures or some sort of compression/decompression techniques).

On a different note, I think we can make the code more elegant and robust by encapsulating all the dependency matrix analysis inside a class. For example:

class DepMatrix {
public:
  /// Enumerates the possible kind of dependency in a dependence vector.
  enum class DepKind : char {
    NEG, 
    EQ,  
    POS,
    S,
    STAR
  };

  /// Represents a row of the dependency matrix.
  using DepVector = std::vector<DepKind>;
  ...

  /// Update the dependence matrix by exchanging two columns.
  void interchangeColumns(const Loop &A, const Loop &B);
  
  bool isPositive(int row);
  ...
private:
  /// The dependency matrix is represented by a vector of dependency vectors.
  std::vector<DepVector> DM;
};

llvm/lib/Transforms/Scalar/LoopInterchange.cpp
84	why does this need to be reduced so much?

This revision now requires changes to proceed.Nov 1 2022, 11:21 AM

congzhe added inline comments.Nov 1 2022, 12:59 PM

llvm/lib/Transforms/Scalar/LoopInterchange.cpp
154	typo?
179	Please correct me if I'm wrong - IIUC `Scalar` is considered a type of dependence that prevents interchange?
208	If Scalar is considered a type of dependence that prevents interchange, would `isAnydirectional()` be the following? return ((F & LT) && (F & GT)) \|\| (F & Scalar);

Meinersbur edited the summary of this revision. (Show Details)Nov 1 2022, 2:42 PM

Just one more question on the summary: when combining the two dependencies [<,=] and [=,<] that generated [<=,<=], I think we could still interchange the combined dependency? [<=,<=] is non-negative and swapping the two elements <= and <= gives a non-negative dependency. Thus we are not converting a positive dependency to a negative dependency, hence not violating the dependency rules for a valid interchange. Perhaps with the current validDepInterchange() we cannot interchange [<=,<=], but like I mentioned in the past meetings I'm rewriting validDepInterchange() to make the legality decisions be purely based on the signs of the dependence vectors before/after interchange. With the rewritten validDepInterchange() I think we can interchange [<=,<=].

My apologies for not being able to post the patch that rewrites validDepInterchange() sooner. We are actively working on it recently and now it seems there is a real need for this patch. We'll post the patch as soon as we can.

In D136277#3899782, @bmahjour wrote:

Is the size of dependency matrix proving to be problematic for certain workloads?

I am not aware of concrete problems, I mostly felt that it would not be necessary to always go through the entire list of pairwise dependencies and instead could be summarized since which instructions induce the dependencies is irrelevant.

However, due to experimenting with it I now understand that there indeed is a difference than what is done now, although I am not sure whether that situation is sufficiently common.

In D136277#3899782, @bmahjour wrote:
On a different note, I think we can make the code more elegant and robust by encapsulating all the dependency matrix analysis inside a class. For example:
class DepMatrix {
....

Could we call it something else than a matrix? In a matrix, I'd expect the indices have some relevance, here the outer dimension is an unordered collection (DenseSet<SmallString<4>>? That would also de-duplicate identical dependence vectors and thus less work later. Or just a flat array SmallVector<char> managed by DepMatrix). Also, std::vector of std::vectors is not the most elegant, esp when resizing.

llvm/lib/Transforms/Scalar/LoopInterchange.cpp
84	Without this patch, `MaxMemInstrCount` does not compare to the number of instructions, but the number of instruction pairs (that have a dependency). That is, the constant name and the comment above it are inconsistent. Since in worst case, there can be a quadratic number of pairs, I just took the square of it.
179	The original code explicitly ignores scalar types (`DepMatrix[Row][i] != 'S'`)

congzhe added inline comments.Nov 1 2022, 6:05 PM

llvm/lib/Transforms/Scalar/LoopInterchange.cpp
179	Right, but I remember earlier this year when discussing a few related bugs we reached to a conclusion that it is not correct to ignore scalar types (in other words the current code is wrong)? Anyways I'm rewriting the whole `validDepInterchange()` so it may not be too big of a concern for now.

Meinersbur edited the summary of this revision. (Show Details)Nov 2 2022, 8:11 AM

Abandon as discussed in the LoopWG call. It coarsens the analysis when compile time wasn't even a problem.

Revision Contents

Path

Size

llvm/

lib/

Transforms/

Scalar/

LoopInterchange.cpp

146 lines

Diff 468975

llvm/lib/Transforms/Scalar/LoopInterchange.cpp

Show First 20 Lines • Show All 59 Lines • ▼ Show 20 Lines

static cl::opt<int> LoopInterchangeCostThreshold( static cl::opt<int> LoopInterchangeCostThreshold(

"loop-interchange-threshold", cl::init(0), cl::Hidden, "loop-interchange-threshold", cl::init(0), cl::Hidden,

cl::desc("Interchange if you gain more than this number")); cl::desc("Interchange if you gain more than this number"));

namespace { namespace {

using LoopVector = SmallVector<Loop *, 8>; using LoopVector = SmallVector<Loop *, 8>;

// Synchronize with Dependence::DVEntry::LT,EQ,GT from DependenceAnalysis

enum DepFlags {

LT = 1 << 0,

EQ = 1 << 1,

GT = 1 << 2,

Scalar = 1 << 3,

Ignore = 1 << 4,

LLVM_MARK_AS_BITMASK_ENUM(/*LargestValue=*/Ignore)

};

// TODO: Check if we can use a sparse matrix here. // TODO: Check if we can use a sparse matrix here.

using CharMatrix = std::vector<std::vector<char>>; using CharMatrix = std::vector<DepFlags>;

} // end anonymous namespace } // end anonymous namespace

// Maximum number of dependencies that can be handled in the dependency matrix. // Maximum number of dependencies that can be handled in the dependency matrix.

static const unsigned MaxMemInstrCount = 100; static const unsigned MaxMemInstrCount = 10;

bmahjourUnsubmitted

Not Done

why does this need to be reduced so much?

bmahjour: why does this need to be reduced so much?

MeinersburAuthorUnsubmitted

Done

Without this patch, MaxMemInstrCount does not compare to the number of instructions, but the number of instruction pairs (that have a dependency). That is, the constant name and the comment above it are inconsistent.

Since in worst case, there can be a quadratic number of pairs, I just took the square of it.

Meinersbur: Without this patch, `MaxMemInstrCount` does not compare to the number of instructions, but the…

// Maximum loop depth supported. // Maximum loop depth supported.

static const unsigned MaxLoopNestDepth = 10; static const unsigned MaxLoopNestDepth = 10;

#ifdef DUMP_DEP_MATRICIES

static void printDepMatrix(CharMatrix &DepMatrix) {

for (auto &Row : DepMatrix) {

for (auto D : Row)

LLVM_DEBUG(dbgs() << D << " ");

LLVM_DEBUG(dbgs() << "\n");

}

#endif

static bool populateDependencyMatrix(CharMatrix &DepMatrix, unsigned Level, static bool populateDependencyMatrix(CharMatrix &DepMatrix, unsigned Level,

Loop *L, DependenceInfo *DI, Loop *L, DependenceInfo *DI,

ScalarEvolution *SE) { ScalarEvolution *SE) {

using ValueVector = SmallVector<Value *, 16>; using ValueVector = SmallVector<Value *, 16>;

ValueVector MemInstr; ValueVector MemInstr;

// For each block. // For each block.

Show All 12 Lines for (Instruction &I : *BB) {

MemInstr.push_back(&I); MemInstr.push_back(&I);

} }

LLVM_DEBUG(dbgs() << "Found " << MemInstr.size() LLVM_DEBUG(dbgs() << "Found " << MemInstr.size()

<< " Loads and Stores to analyze\n"); << " Loads and Stores to analyze\n");

ValueVector::iterator I, IE, J, JE; if (MemInstr.size() > MaxMemInstrCount) {

LLVM_DEBUG(dbgs() << "Cannot handle more than " << MaxMemInstrCount

<< " dependencies inside loop\n");

return false;

}

DepMatrix.resize(Level);

for (unsigned II = 0; II < Level; ++II)

DepMatrix[II] = static_cast<DepFlags>(0);

ValueVector::iterator I, IE, J, JE;

for (I = MemInstr.begin(), IE = MemInstr.end(); I != IE; ++I) { for (I = MemInstr.begin(), IE = MemInstr.end(); I != IE; ++I) {

for (J = I, JE = MemInstr.end(); J != JE; ++J) { for (J = I, JE = MemInstr.end(); J != JE; ++J) {

std::vector<char> Dep;

Instruction *Src = cast<Instruction>(*I); Instruction *Src = cast<Instruction>(*I);

Instruction *Dst = cast<Instruction>(*J); Instruction *Dst = cast<Instruction>(*J);

// Ignore Input dependencies. // Ignore Input dependencies.

if (isa<LoadInst>(Src) && isa<LoadInst>(Dst)) if (isa<LoadInst>(Src) && isa<LoadInst>(Dst))

continue; continue;

// Track Output, Flow, and Anti dependencies. // Track Output, Flow, and Anti dependencies.

if (auto D = DI->depends(Src, Dst, true)) { if (auto D = DI->depends(Src, Dst, true)) {

assert(D->isOrdered() && "Expected an output, flow or anti dep."); assert(D->isOrdered() && "Expected an output, flow or anti dep.");

// If the direction vector is negative, normalize it to // If the direction vector is negative, normalize it to

// make it non-negative. // make it non-negative.

if (D->normalize(SE)) if (D->normalize(SE))

LLVM_DEBUG(dbgs() << "Negative dependence vector normalized.\n"); LLVM_DEBUG(dbgs() << "Negative dependence vector normalized.\n");

LLVM_DEBUG(StringRef DepType = LLVM_DEBUG(StringRef DepType =

D->isFlow() ? "flow" : D->isAnti() ? "anti" : "output"; D->isFlow() ? "flow" : D->isAnti() ? "anti" : "output";

dbgs() << "Found " << DepType dbgs() << "Found " << DepType

<< " dependency between Src and Dst\n" << " dependency between Src and Dst\n"

<< " Src:" << *Src << "\n Dst:" << *Dst << '\n'); << " Src:" << *Src << "\n Dst:" << *Dst << '\n');

unsigned Levels = D->getLevels(); unsigned Levels = D->getLevels();

char Direction; unsigned II = 0;

for (unsigned II = 1; II <= Levels; ++II) { for (; II < Levels; ++II) {

if (D->isScalar(II)) { DepFlags &Lvl = DepMatrix[II];

Direction = 'S'; if (D->isScalar(II + 1)) {

Dep.push_back(Direction); Lvl |= Scalar;

} else { } else {

unsigned Dir = D->getDirection(II); // DepFlags needs lower 3 bits of DepFlags for be the same as used

congzheUnsubmitted

Not Done

} else {

- // DepFlags needs lower 3 bits of DepFlags for be the same as used

+ // DepFlags needs lower 3 bits of DepFlags to be the same as used

// by DA.

typo?

congzhe: typo?

if (Dir == Dependence::DVEntry::LT || // by DA.

Dir == Dependence::DVEntry::LE) Lvl |= static_cast<DepFlags>(D->getDirection(II + 1));

Direction = '<';

else if (Dir == Dependence::DVEntry::GT ||

Dir == Dependence::DVEntry::GE)

Direction = '>';

else if (Dir == Dependence::DVEntry::EQ)

Direction = '=';

else

Direction = '*';

Dep.push_back(Direction);

} }

while (Dep.size() != Level) { for (; II < Level; ++II)

Dep.push_back('I'); DepMatrix[II] |= Ignore;

}

DepMatrix.push_back(Dep);

if (DepMatrix.size() > MaxMemInstrCount) {

LLVM_DEBUG(dbgs() << "Cannot handle more than " << MaxMemInstrCount

<< " dependencies inside loop\n");

return false;

}

} }

return true; return true;

} }

// A loop is moved from index 'from' to an index 'to'. Update the Dependence // A loop is moved from index 'from' to an index 'to'. Update the Dependence

// matrix by exchanging the two columns. // matrix by exchanging the two columns.

static void interChangeDependencies(CharMatrix &DepMatrix, unsigned FromIndx, static void interChangeDependencies(CharMatrix &DepMatrix, unsigned FromIndx,

unsigned ToIndx) { unsigned ToIndx) {

for (unsigned I = 0, E = DepMatrix.size(); I < E; ++I) std::swap(DepMatrix[ToIndx], DepMatrix[FromIndx]);

std::swap(DepMatrix[I][ToIndx], DepMatrix[I][FromIndx]);

} }

// Checks if no dependence exist in the dependency matrix in Row before Column. // Checks if no dependence exist in the dependency matrix in Row before Column.

static bool containsNoDependence(CharMatrix &DepMatrix, unsigned Row, static bool containsNoDependence(CharMatrix &DepMatrix, unsigned Column) {

unsigned Column) {

for (unsigned i = 0; i < Column; ++i) { for (unsigned i = 0; i < Column; ++i) {

if (DepMatrix[Row][i] != '=' && DepMatrix[Row][i] != 'S' && DepFlags Lvl = DepMatrix[i];

DepMatrix[Row][i] != 'I') if (Lvl & LT || Lvl & GT)

congzheUnsubmitted

Not Done

DepFlags Lvl = DepMatrix[i];

- if (Lvl & LT || Lvl & GT)

+ if (Lvl & LT || Lvl & GT || Lvl & Scalar)

return false;

Please correct me if I'm wrong - IIUC Scalar is considered a type of dependence that prevents interchange?

congzhe: Please correct me if I'm wrong - IIUC `Scalar` is considered a type of dependence that prevents…

MeinersburAuthorUnsubmitted

Done

The original code explicitly ignores scalar types (DepMatrix[Row][i] != 'S')

Meinersbur: The original code explicitly ignores scalar types (`DepMatrix[Row][i] != 'S'`)

congzheUnsubmitted

Not Done

Right, but I remember earlier this year when discussing a few related bugs we reached to a conclusion that it is not correct to ignore scalar types (in other words the current code is wrong)? Anyways I'm rewriting the whole validDepInterchange() so it may not be too big of a concern for now.

congzhe: Right, but I remember earlier this year when discussing a few related bugs we reached to a…

return false; return false;

} }

return true; return true;

} }

static bool validDepInterchange(CharMatrix &DepMatrix, unsigned Row, static bool validDepInterchange(CharMatrix &DepMatrix, unsigned OuterLoopId,

unsigned OuterLoopId, char InnerDep, DepFlags InnerDep, DepFlags OuterDep) {

char OuterDep) {

if (InnerDep == OuterDep) if (InnerDep == OuterDep)

return true; return true;

// It is legal to interchange if and only if after interchange no row has a // It is legal to interchange if and only if after interchange no row has a

// '>' direction as the leftmost non-'='. // '>' direction as the leftmost non-'='.

if (InnerDep & GT) {

if (InnerDep == '=' || InnerDep == 'S' || InnerDep == 'I')

return true;

if (InnerDep == '<')

return true;

if (InnerDep == '>') {

// If OuterLoopId represents outermost loop then interchanging will make the // If OuterLoopId represents outermost loop then interchanging will make the

// 1st dependency as '>' // 1st dependency as '>'

if (OuterLoopId == 0) if (OuterLoopId == 0)

return false; return false;

// If all dependencies before OuterloopId are '=','S'or 'I'. Then // If all dependencies before OuterloopId are '=','S'or 'I'. Then

// interchanging will result in this row having an outermost non '=' // interchanging will result in this row having an outermost non '='

// dependency of '>' // dependency of '>'

if (!containsNoDependence(DepMatrix, Row, OuterLoopId)) if (!containsNoDependence(DepMatrix, OuterLoopId))

return true; return true;

} }

return false; return true;

} }

static bool isAnydirectional(DepFlags F) { return (F & LT) && (F & GT); }

congzheUnsubmitted

Not Done

If Scalar is considered a type of dependence that prevents interchange, would isAnydirectional() be the following?

return ((F & LT) && (F & GT)) || (F & Scalar);

congzhe: If Scalar is considered a type of dependence that prevents interchange, would `isAnydirectional…

// Checks if it is legal to interchange 2 loops. // Checks if it is legal to interchange 2 loops.

// [Theorem] A permutation of the loops in a perfect nest is legal if and only // [Theorem] A permutation of the loops in a perfect nest is legal if and only

// if the direction matrix, after the same permutation is applied to its // if the direction matrix, after the same permutation is applied to its

// columns, has no ">" direction as the leftmost non-"=" direction in any row. // columns, has no ">" direction as the leftmost non-"=" direction in any row.

static bool isLegalToInterChangeLoops(CharMatrix &DepMatrix, static bool isLegalToInterChangeLoops(CharMatrix &DepMatrix,

unsigned InnerLoopId, unsigned InnerLoopId,

unsigned OuterLoopId) { unsigned OuterLoopId) {

unsigned NumRows = DepMatrix.size();

// For each row check if it is valid to interchange. // For each row check if it is valid to interchange.

for (unsigned Row = 0; Row < NumRows; ++Row) { DepFlags InnerDep = DepMatrix[InnerLoopId];

char InnerDep = DepMatrix[Row][InnerLoopId]; DepFlags OuterDep = DepMatrix[OuterLoopId];

char OuterDep = DepMatrix[Row][OuterLoopId]; if (isAnydirectional(InnerDep) || isAnydirectional(OuterDep))

if (InnerDep == '*' || OuterDep == '*')

return false; return false;

if (!validDepInterchange(DepMatrix, Row, OuterLoopId, InnerDep, OuterDep)) if (!validDepInterchange(DepMatrix, OuterLoopId, InnerDep, OuterDep))

return false; return false;

}

return true; return true;

} }

static void populateWorklist(Loop &L, LoopVector &LoopList) { static void populateWorklist(Loop &L, LoopVector &LoopList) {

LLVM_DEBUG(dbgs() << "Calling populateWorklist on Func: " LLVM_DEBUG(dbgs() << "Calling populateWorklist on Func: "

<< L.getHeader()->getParent()->getName() << " Loop: %" << L.getHeader()->getParent()->getName() << " Loop: %"

<< L.getHeader()->getName() << '\n'); << L.getHeader()->getName() << '\n');

assert(LoopList.empty() && "LoopList should initially be empty!"); assert(LoopList.empty() && "LoopList should initially be empty!");

▲ Show 20 Lines • Show All 210 Lines • ▼ Show 20 Lines bool processLoopList(SmallVectorImpl<Loop *> &LoopList) {

CharMatrix DependencyMatrix; CharMatrix DependencyMatrix;

Loop *OuterMostLoop = *(LoopList.begin()); Loop *OuterMostLoop = *(LoopList.begin());

if (!populateDependencyMatrix(DependencyMatrix, LoopNestDepth, if (!populateDependencyMatrix(DependencyMatrix, LoopNestDepth,

OuterMostLoop, DI, SE)) { OuterMostLoop, DI, SE)) {

LLVM_DEBUG(dbgs() << "Populating dependency matrix failed\n"); LLVM_DEBUG(dbgs() << "Populating dependency matrix failed\n");

return false; return false;

} }

#ifdef DUMP_DEP_MATRICIES

LLVM_DEBUG(dbgs() << "Dependence before interchange\n");

printDepMatrix(DependencyMatrix);

#endif

// Get the Outermost loop exit. // Get the Outermost loop exit.

BasicBlock *LoopNestExit = OuterMostLoop->getExitBlock(); BasicBlock *LoopNestExit = OuterMostLoop->getExitBlock();

if (!LoopNestExit) { if (!LoopNestExit) {

LLVM_DEBUG(dbgs() << "OuterMostLoop needs an unique exit block"); LLVM_DEBUG(dbgs() << "OuterMostLoop needs an unique exit block");

return false; return false;

} }

Show All 23 Lines for (unsigned j = SelecLoopId; j > 0; j--) {

bool Interchanged = processLoop(LoopList[i], LoopList[i - 1], i, i - 1, bool Interchanged = processLoop(LoopList[i], LoopList[i - 1], i, i - 1,

DependencyMatrix, CostMap); DependencyMatrix, CostMap);

if (!Interchanged) if (!Interchanged)

continue; continue;

// Loops interchanged, update LoopList accordingly. // Loops interchanged, update LoopList accordingly.

std::swap(LoopList[i - 1], LoopList[i]); std::swap(LoopList[i - 1], LoopList[i]);

// Update the DependencyMatrix // Update the DependencyMatrix

interChangeDependencies(DependencyMatrix, i, i - 1); interChangeDependencies(DependencyMatrix, i, i - 1);

#ifdef DUMP_DEP_MATRICIES

LLVM_DEBUG(dbgs() << "Dependence after interchange\n");

printDepMatrix(DependencyMatrix);

#endif

ChangedPerIter |= Interchanged; ChangedPerIter |= Interchanged;

Changed |= Interchanged; Changed |= Interchanged;

} }

// Early abort if there was no interchange during an entire round of // Early abort if there was no interchange during an entire round of

// moving loops outwards. // moving loops outwards.

if (!ChangedPerIter) if (!ChangedPerIter)

break; break;

} }

return Changed; return Changed;

} }

bool processLoop(Loop *InnerLoop, Loop *OuterLoop, unsigned InnerLoopId, bool processLoop(Loop *InnerLoop, Loop *OuterLoop, unsigned InnerLoopId,

unsigned OuterLoopId, unsigned OuterLoopId, CharMatrix &DependencyMatrix,

std::vector<std::vector<char>> &DependencyMatrix,

const DenseMap<const Loop *, unsigned> &CostMap) { const DenseMap<const Loop *, unsigned> &CostMap) {

LLVM_DEBUG(dbgs() << "Processing InnerLoopId = " << InnerLoopId LLVM_DEBUG(dbgs() << "Processing InnerLoopId = " << InnerLoopId

<< " and OuterLoopId = " << OuterLoopId << "\n"); << " and OuterLoopId = " << OuterLoopId << "\n");

LoopInterchangeLegality LIL(OuterLoop, InnerLoop, SE, ORE); LoopInterchangeLegality LIL(OuterLoop, InnerLoop, SE, ORE);

if (!LIL.canInterchangeLoops(InnerLoopId, OuterLoopId, DependencyMatrix)) { if (!LIL.canInterchangeLoops(InnerLoopId, OuterLoopId, DependencyMatrix)) {

LLVM_DEBUG(dbgs() << "Not interchanging loops. Cannot prove legality.\n"); LLVM_DEBUG(dbgs() << "Not interchanging loops. Cannot prove legality.\n");

return false; return false;

} }

▲ Show 20 Lines • Show All 572 Lines • ▼ Show 20 Lines

} }

static bool isProfitableForVectorization(unsigned InnerLoopId, static bool isProfitableForVectorization(unsigned InnerLoopId,

unsigned OuterLoopId, unsigned OuterLoopId,

CharMatrix &DepMatrix) { CharMatrix &DepMatrix) {

// TODO: Improve this heuristic to catch more cases. // TODO: Improve this heuristic to catch more cases.

// If the inner loop is loop independent or doesn't carry any dependency it is // If the inner loop is loop independent or doesn't carry any dependency it is

// profitable to move this to outer position. // profitable to move this to outer position.

for (auto &Row : DepMatrix) { if (DepMatrix[InnerLoopId] & ~(Scalar | Ignore))

if (Row[InnerLoopId] != 'S' && Row[InnerLoopId] != 'I')

return false; return false;

// TODO: We need to improve this heuristic. // TODO: We need to improve this heuristic.

if (Row[OuterLoopId] != '=') if (DepMatrix[OuterLoopId] & ~EQ)

return false; return false;

}

// If outer loop has dependence and inner loop is loop independent then it is // If outer loop has dependence and inner loop is loop independent then it is

// profitable to interchange to enable parallelism. // profitable to interchange to enable parallelism.

// If there are no dependences, interchanging will not improve anything. // If there are no dependences, interchanging will not improve anything.

return !DepMatrix.empty(); return DepMatrix[InnerLoopId] != 0;

} }

bool LoopInterchangeProfitability::isProfitable( bool LoopInterchangeProfitability::isProfitable(

const Loop *InnerLoop, const Loop *OuterLoop, unsigned InnerLoopId, const Loop *InnerLoop, const Loop *OuterLoop, unsigned InnerLoopId,

unsigned OuterLoopId, CharMatrix &DepMatrix, unsigned OuterLoopId, CharMatrix &DepMatrix,

const DenseMap<const Loop *, unsigned> &CostMap) { const DenseMap<const Loop *, unsigned> &CostMap) {

// TODO: Remove the legacy cost model. // TODO: Remove the legacy cost model.

▲ Show 20 Lines • Show All 616 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[LoopInterchange] Simplify DepMatrix to a dependency vector.AbandonedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 468975

llvm/lib/Transforms/Scalar/LoopInterchange.cpp

[LoopInterchange] Simplify DepMatrix to a dependency vector.
AbandonedPublic