This is an archive of the discontinued LLVM Phabricator instance.

Differential D116393

[MLIR] DataFlowAnalysis: Use a queue to maintain the worklist
ClosedPublic

Authored by vaivaswatha on Dec 29 2021, 10:20 PM.

Details

Reviewers

rriddle

Commits

rG2c384c377276: [MLIR][DataFlowAnalysis] Use a queue to maintain the worklist

Summary

Since the analysis is described as suitable for forward data-flow analysis, maintaining the worklist as a queue brings block visits closer to RPO ordering, so the fixpoint is reached earlier.

For example,

// mlir-opt -mlir-disable-threading -allow-unregistered-dialect -pass-pipeline="builtin.func(sccp)" sccp-iterations.mlir
func @simple_control_flow(%arg0 : i32, %arg1 : i1) -> i32 {
  cond_br %arg1, ^bb1, ^bb3

^bb1:
  %1 = arith.constant 1 : i32
  br ^bb2(%1 : i32)

^bb3:
  %3 = arith.constant 2 : i32
  br ^bb2(%3 : i32)

^bb2(%arg : i32):
  %2 = arith.constant 3 : i32
  %4 = arith.addi %arg, %2 : i32
  return %4 : i32
}

With the current stack-based worklist, running SCCP on this code results in the following visits (I modified visitOperation in SCCP.cpp to print the operation being visited):

(SCCP): Visiting %c2_i32 = arith.constant 2 : i32
(SCCP): Visiting %c3_i32 = arith.constant 3 : i32
(SCCP): Visiting %1 = arith.addi %0, %c3_i32 : i32
(SCCP): Visiting %c1_i32 = arith.constant 1 : i32
(SCCP): Visiting %1 = arith.addi %0, %c3_i32 : i32
module  {
  func @simple_control_flow(%arg0: i32, %arg1: i1) -> i32 {
    %c3_i32 = arith.constant 3 : i32
    %c2_i32 = arith.constant 2 : i32
    %c1_i32 = arith.constant 1 : i32
    cond_br %arg1, ^bb1, ^bb2
  ^bb1:  // pred: ^bb0
    br ^bb3(%c1_i32 : i32)
  ^bb2:  // pred: ^bb0
    br ^bb3(%c2_i32 : i32)
  ^bb3(%0: i32):  // 2 preds: ^bb1, ^bb2
    %1 = arith.addi %0, %c3_i32 : i32
    return %1 : i32
  }
}

But with the proposed change to use a queue instead, the visits are:

(SCCP): Visiting %c1_i32 = arith.constant 1 : i32
(SCCP): Visiting %c2_i32 = arith.constant 2 : i32
(SCCP): Visiting %c3_i32 = arith.constant 3 : i32
(SCCP): Visiting %1 = arith.addi %0, %c3_i32 : i32
module  {
  func @simple_control_flow(%arg0: i32, %arg1: i1) -> i32 {
    %c3_i32 = arith.constant 3 : i32
    %c2_i32 = arith.constant 2 : i32
    %c1_i32 = arith.constant 1 : i32
    cond_br %arg1, ^bb1, ^bb2
  ^bb1:  // pred: ^bb0
    br ^bb3(%c1_i32 : i32)
  ^bb2:  // pred: ^bb0
    br ^bb3(%c2_i32 : i32)
  ^bb3(%0: i32):  // 2 preds: ^bb1, ^bb2
    %1 = arith.addi %0, %c3_i32 : i32
    return %1 : i32
  }
}

In the long run, we could explore more complex strategies (not to mention the simple improvement of checking whether a node is already in the queue before inserting it), as described in "Iterative Data-flow Analysis - Revisited" (Keith D. Cooper, Timothy J. Harvey, and Ken Kennedy).

I'm not sure what the right way to add a test for this is. Perhaps add a statistic that counts how many times an Operation is visited, print it, and CHECK for it in the unit test?

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

vaivaswatha created this revision. Dec 29 2021, 10:20 PM

Herald added subscribers: sdasgup3, wenzhicui, wrengr and 19 others. Dec 29 2021, 10:20 PM

vaivaswatha requested review of this revision. Dec 29 2021, 10:20 PM

Herald added a project: Restricted Project. Dec 29 2021, 10:20 PM

Herald added subscribers: stephenneuendorffer, nicolasvasilache.

Harbormaster completed remote builds in B140963: Diff 396603. Dec 29 2021, 10:32 PM

Hmmm, I'm slightly fine with submitting this without an explicit test given that it is changing a data structure (and not a crazy feature/behavior change) and seems relatively straightforward. Do you mind adding in the logging that you added (potentially as a separate commit)? It would save time for future debugging. Ideally we could CHECK the debug output, but can't think of any examples of that offhand.

mlir/lib/Analysis/DataFlowAnalysis.cpp
9	System includes should go after the others: https://llvm.org/docs/CodingStandards.html#include-style

This revision is now accepted and ready to land. Jan 4 2022, 2:34 PM

Incorporating review comments.

@rriddle I've added the debug statement I had locally. If this looks fine to you, I'll go ahead and push the change.

Harbormaster completed remote builds in B141610: Diff 397460. Jan 4 2022, 10:49 PM

Closed by commit rG2c384c377276: [MLIR][DataFlowAnalysis] Use a queue to maintain the worklist (authored by vaivaswatha). Jan 5 2022, 8:27 PM

This revision was automatically updated to reflect the committed changes.

vaivaswatha added a commit: rG2c384c377276: [MLIR][DataFlowAnalysis] Use a queue to maintain the worklist.

Revision Contents

Path	Size
mlir/lib/Analysis/DataFlowAnalysis.cpp	30 lines
mlir/lib/Transforms/SCCP.cpp	6 lines

Diff 397782

mlir/lib/Analysis/DataFlowAnalysis.cpp

//===- DataFlowAnalysis.cpp -----------------------------------------------===//		//===- DataFlowAnalysis.cpp -----------------------------------------------===//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "mlir/Analysis/DataFlowAnalysis.h"		#include "mlir/Analysis/DataFlowAnalysis.h"
#include "mlir/IR/Operation.h"		#include "mlir/IR/Operation.h"
#include "mlir/Interfaces/CallInterfaces.h"		#include "mlir/Interfaces/CallInterfaces.h"
#include "mlir/Interfaces/ControlFlowInterfaces.h"		#include "mlir/Interfaces/ControlFlowInterfaces.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"

		#include <queue>

using namespace mlir;		using namespace mlir;
using namespace mlir::detail;		using namespace mlir::detail;

namespace {		namespace {
/// This class contains various state used when computing the lattice elements		/// This class contains various state used when computing the lattice elements
/// of a callable operation.		/// of a callable operation.
class CallableLatticeState {		class CallableLatticeState {
public:		public:
[137 lines not shown]	private:
template <typename ValuesT>		template <typename ValuesT>
void markAllPessimisticFixpoint(ValuesT values) {		void markAllPessimisticFixpoint(ValuesT values) {
for (auto value : values)		for (auto value : values)
markPessimisticFixpoint(value);		markPessimisticFixpoint(value);
}		}
template <typename ValuesT>		template <typename ValuesT>
void markAllPessimisticFixpoint(Operation *op, ValuesT values) {		void markAllPessimisticFixpoint(Operation *op, ValuesT values) {
markAllPessimisticFixpoint(values);		markAllPessimisticFixpoint(values);
opWorklist.push_back(op);		opWorklist.push(op);
}		}
template <typename ValuesT>		template <typename ValuesT>
void markAllPessimisticFixpointAndVisitUsers(ValuesT values) {		void markAllPessimisticFixpointAndVisitUsers(ValuesT values) {
for (auto value : values) {		for (auto value : values) {
AbstractLatticeElement &lattice = analysis.getLatticeElement(value);		AbstractLatticeElement &lattice = analysis.getLatticeElement(value);
if (lattice.markPessimisticFixpoint() == ChangeResult::Change)		if (lattice.markPessimisticFixpoint() == ChangeResult::Change)
visitUsers(value);		visitUsers(value);
}		}
[13 lines not shown]	private:

/// The set of blocks that are known to execute, or are intrinsically live.		/// The set of blocks that are known to execute, or are intrinsically live.
SmallPtrSet<Block *, 16> executableBlocks;		SmallPtrSet<Block *, 16> executableBlocks;

/// The set of control flow edges that are known to execute.		/// The set of control flow edges that are known to execute.
DenseSet<std::pair<Block *, Block *>> executableEdges;		DenseSet<std::pair<Block *, Block *>> executableEdges;

/// A worklist containing blocks that need to be processed.		/// A worklist containing blocks that need to be processed.
SmallVector<Block *, 64> blockWorklist;		std::queue<Block *> blockWorklist;

/// A worklist of operations that need to be processed.		/// A worklist of operations that need to be processed.
SmallVector<Operation *, 64> opWorklist;		std::queue<Operation *> opWorklist;

/// The callable operations that have their argument/result state tracked.		/// The callable operations that have their argument/result state tracked.
DenseMap<Operation *, CallableLatticeState> callableLatticeState;		DenseMap<Operation *, CallableLatticeState> callableLatticeState;

/// A map between a call operation and the resolved symbol callable. This		/// A map between a call operation and the resolved symbol callable. This
/// avoids re-resolving symbol references during propagation. Value based		/// avoids re-resolving symbol references during propagation. Value based
/// callables are trivial to resolve, so they can be done in-place.		/// callables are trivial to resolve, so they can be done in-place.
DenseMap<Operation *, Operation *> callToSymbolCallable;		DenseMap<Operation *, Operation *> callToSymbolCallable;
[14 lines not shown]	for (Region &region : op->getRegions()) {
markEntryBlockExecutable(&region, /*markPessimisticFixpoint=*/true);		markEntryBlockExecutable(&region, /*markPessimisticFixpoint=*/true);
}		}
initializeSymbolCallables(op);		initializeSymbolCallables(op);
}		}

void ForwardDataFlowSolver::solve() {		void ForwardDataFlowSolver::solve() {
while (!blockWorklist.empty() || !opWorklist.empty()) {		while (!blockWorklist.empty() || !opWorklist.empty()) {
// Process any operations in the op worklist.		// Process any operations in the op worklist.
while (!opWorklist.empty())		while (!opWorklist.empty()) {
visitUsers(*opWorklist.pop_back_val());		Operation *nextOp = opWorklist.front();
		opWorklist.pop();
		visitUsers(*nextOp);
		}

// Process any blocks in the block worklist.		// Process any blocks in the block worklist.
while (!blockWorklist.empty())		while (!blockWorklist.empty()) {
visitBlock(blockWorklist.pop_back_val());		Block *nextBlock = blockWorklist.front();
		blockWorklist.pop();
		visitBlock(nextBlock);
		}
}		}
}		}

void ForwardDataFlowSolver::initializeSymbolCallables(Operation *op) {		void ForwardDataFlowSolver::initializeSymbolCallables(Operation *op) {
// Initialize the set of symbol callables that can have their state tracked.		// Initialize the set of symbol callables that can have their state tracked.
// This tracks which symbol callable operations we can propagate within and		// This tracks which symbol callable operations we can propagate within and
// out of.		// out of.
auto walkFn = [&](Operation *symTable, bool allUsesVisible) {		auto walkFn = [&](Operation *symTable, bool allUsesVisible) {
[117 lines not shown]	void ForwardDataFlowSolver::visitOperation(Operation *op) {
// If all of the results of this operation are already resolved, bail out		// If all of the results of this operation are already resolved, bail out
// early.		// early.
auto isAtFixpointFn = [&](Value value) { return isAtFixpoint(value); };		auto isAtFixpointFn = [&](Value value) { return isAtFixpoint(value); };
if (llvm::all_of(op->getResults(), isAtFixpointFn))		if (llvm::all_of(op->getResults(), isAtFixpointFn))
return;		return;

// Visit the current operation.		// Visit the current operation.
if (analysis.visitOperation(op, operandLattices) == ChangeResult::Change)		if (analysis.visitOperation(op, operandLattices) == ChangeResult::Change)
opWorklist.push_back(op);		opWorklist.push(op);

// `visitOperation` is required to define all of the result lattices.		// `visitOperation` is required to define all of the result lattices.
assert(llvm::none_of(		assert(llvm::none_of(
op->getResults(),		op->getResults(),
[&](Value value) {		[&](Value value) {
return analysis.getLatticeElement(value).isUninitialized();		return analysis.getLatticeElement(value).isUninitialized();
}) &&		}) &&
"expected `visitOperation` to define all result lattices");		"expected `visitOperation` to define all result lattices");
[92 lines not shown]	if (!region) {
continue;		continue;

// Mark the results outside of the input range as having reached the		// Mark the results outside of the input range as having reached the
// pessimistic fixpoint.		// pessimistic fixpoint.
// TODO: This isn't exactly ideal. There may be situations in which a		// TODO: This isn't exactly ideal. There may be situations in which a
// region operation can provide information for certain results that		// region operation can provide information for certain results that
// aren't part of the control flow.		// aren't part of the control flow.
if (succArgs.size() != results.size()) {		if (succArgs.size() != results.size()) {
opWorklist.push_back(parentOp);		opWorklist.push(parentOp);
if (succArgs.empty()) {		if (succArgs.empty()) {
markAllPessimisticFixpoint(results);		markAllPessimisticFixpoint(results);
continue;		continue;
}		}

unsigned firstResIdx = succArgs[0].cast<OpResult>().getResultNumber();		unsigned firstResIdx = succArgs[0].cast<OpResult>().getResultNumber();
markAllPessimisticFixpoint(results.take_front(firstResIdx));		markAllPessimisticFixpoint(results.take_front(firstResIdx));
markAllPessimisticFixpoint(		markAllPessimisticFixpoint(
[219 lines not shown]	if (!region->empty()) {
return markBlockExecutable(&region->front());		return markBlockExecutable(&region->front());
}		}
return ChangeResult::NoChange;		return ChangeResult::NoChange;
}		}

ChangeResult ForwardDataFlowSolver::markBlockExecutable(Block *block) {		ChangeResult ForwardDataFlowSolver::markBlockExecutable(Block *block) {
bool marked = executableBlocks.insert(block).second;		bool marked = executableBlocks.insert(block).second;
if (marked)		if (marked)
blockWorklist.push_back(block);		blockWorklist.push(block);
return marked ? ChangeResult::Change : ChangeResult::NoChange;		return marked ? ChangeResult::Change : ChangeResult::NoChange;
}		}

bool ForwardDataFlowSolver::isBlockExecutable(Block *block) const {		bool ForwardDataFlowSolver::isBlockExecutable(Block *block) const {
return executableBlocks.count(block);		return executableBlocks.count(block);
}		}

void ForwardDataFlowSolver::markEdgeExecutable(Block *from, Block *to) {		void ForwardDataFlowSolver::markEdgeExecutable(Block *from, Block *to) {
[19 lines not shown]	bool ForwardDataFlowSolver::isAtFixpoint(Value value) const {
if (auto *lattice = analysis.lookupLatticeElement(value))		if (auto *lattice = analysis.lookupLatticeElement(value))
return lattice->isAtFixpoint();		return lattice->isAtFixpoint();
return false;		return false;
}		}

void ForwardDataFlowSolver::join(Operation *owner, AbstractLatticeElement &to,		void ForwardDataFlowSolver::join(Operation *owner, AbstractLatticeElement &to,
const AbstractLatticeElement &from) {		const AbstractLatticeElement &from) {
if (to.join(from) == ChangeResult::Change)		if (to.join(from) == ChangeResult::Change)
opWorklist.push_back(owner);		opWorklist.push(owner);
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// AbstractLatticeElement		// AbstractLatticeElement
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

AbstractLatticeElement::~AbstractLatticeElement() = default;		AbstractLatticeElement::~AbstractLatticeElement() = default;

[32 lines not shown]

mlir/lib/Transforms/SCCP.cpp

	[17 lines not shown]
	#include "mlir/Analysis/DataFlowAnalysis.h"			#include "mlir/Analysis/DataFlowAnalysis.h"
	#include "mlir/IR/Builders.h"			#include "mlir/IR/Builders.h"
	#include "mlir/IR/Dialect.h"			#include "mlir/IR/Dialect.h"
	#include "mlir/Interfaces/ControlFlowInterfaces.h"			#include "mlir/Interfaces/ControlFlowInterfaces.h"
	#include "mlir/Interfaces/SideEffectInterfaces.h"			#include "mlir/Interfaces/SideEffectInterfaces.h"
	#include "mlir/Pass/Pass.h"			#include "mlir/Pass/Pass.h"
	#include "mlir/Transforms/FoldUtils.h"			#include "mlir/Transforms/FoldUtils.h"
	#include "mlir/Transforms/Passes.h"			#include "mlir/Transforms/Passes.h"
				#include "llvm/Support/Debug.h"

				#define DEBUG_TYPE "sccp"

	using namespace mlir;			using namespace mlir;

	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	// SCCP Analysis			// SCCP Analysis
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	namespace {			namespace {
	[31 lines not shown]

	struct SCCPAnalysis : public ForwardDataFlowAnalysis<SCCPLatticeValue> {			struct SCCPAnalysis : public ForwardDataFlowAnalysis<SCCPLatticeValue> {
	using ForwardDataFlowAnalysis<SCCPLatticeValue>::ForwardDataFlowAnalysis;			using ForwardDataFlowAnalysis<SCCPLatticeValue>::ForwardDataFlowAnalysis;
	~SCCPAnalysis() override = default;			~SCCPAnalysis() override = default;

	ChangeResult			ChangeResult
	visitOperation(Operation *op,			visitOperation(Operation *op,
	ArrayRef<LatticeElement<SCCPLatticeValue> *> operands) final {			ArrayRef<LatticeElement<SCCPLatticeValue> *> operands) final {

				LLVM_DEBUG(llvm::dbgs() << "SCCP: Visiting operation: " << *op << "\n");

	// Don't try to simulate the results of a region operation as we can't			// Don't try to simulate the results of a region operation as we can't
	// guarantee that folding will be out-of-place. We don't allow in-place			// guarantee that folding will be out-of-place. We don't allow in-place
	// folds as the desire here is for simulated execution, and not general			// folds as the desire here is for simulated execution, and not general
	// folding.			// folding.
	if (op->getNumRegions())			if (op->getNumRegions())
	return markAllPessimisticFixpoint(op->getResults());			return markAllPessimisticFixpoint(op->getResults());

	SmallVector<Attribute> constantOperands(			SmallVector<Attribute> constantOperands(
	[174 lines not shown]