This is an archive of the discontinued LLVM Phabricator instance.


Differential D15945

Add option to enable/disable LiveDebugValues pass.
Abandoned Public

Authored by tvvikram on Jan 6 2016, 9:34 PM.

Details

Reviewers

kcc
samsonov
aprantl

Summary

It is 'on' by default.

Event Timeline

tvvikram updated this revision to Diff 44190. Jan 6 2016, 9:34 PM

tvvikram retitled this revision to "Add option to enable/disable LiveDebugValues pass."

tvvikram updated this object.

tvvikram added reviewers: aprantl, samsonov, kcc.

please make it false by default since the phase is not really working (compilation times out)

Can we actually add an equivalent clang debug tuning option called "-fvar-tracking" similar to the gcc one here as well?

-eric

In D15945#321093, @kcc wrote:

please make it false by default since the phase is not really working (compilation times out)

I kept it 'on' as it works in most of the cases. I will try to submit a patch by EoD that fixes the issue.

I noticed that with a.ll, the test case from https://llvm.org/bugs/show_bug.cgi?id=26055, far too many DBG_VALUE instructions are inserted. The statistics show more than 6 lakh (600,000) insertions:
616584 live-debug-values - Number of DBG_VALUE instructions inserted

One reason is the large number of basic blocks in the function. With so many blocks, convergence takes a very long time. I considered skipping functions with too many basic blocks, but instead I have added a simple heuristic that stops iterating after a fixed number of iterations. The current maximum is arbitrary (chosen to be 10).
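The heuristic described above can be pictured as a fixpoint loop with an iteration cap. The following is only a minimal sketch under stated assumptions, not the code from the patch: `Graph`, `Result`, `solveWithCap` and `MaxIterations` are hypothetical names, and the dataflow problem is a toy "reachable from block 0" fact rather than variable locations.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a function's CFG: Succs[B] lists the successors
// of block B.
using Graph = std::vector<std::vector<std::size_t>>;

struct Result {
  std::vector<bool> Reached; // toy dataflow fact, one per block
  unsigned Iterations;       // sweeps actually performed
  bool Converged;            // false if the cap stopped the loop early
};

// Propagate a toy "reachable from block 0" fact by sweeping all blocks in
// index order. Stop after MaxIterations sweeps even if facts still change.
Result solveWithCap(const Graph &Succs, unsigned MaxIterations) {
  assert(!Succs.empty() && "graph needs an entry block");
  Result R{std::vector<bool>(Succs.size(), false), 0, false};
  R.Reached[0] = true;
  while (R.Iterations < MaxIterations) {
    bool Changed = false;
    for (std::size_t B = 0; B < Succs.size(); ++B) {
      if (!R.Reached[B])
        continue;
      for (std::size_t S : Succs[B]) {
        if (!R.Reached[S]) {
          R.Reached[S] = true;
          Changed = true;
        }
      }
    }
    ++R.Iterations;
    if (!Changed) {
      R.Converged = true;
      break;
    }
  }
  return R;
}
```

In this toy, a cap that fires early leaves some facts unpropagated (`Converged` is false), which illustrates the trade-off such a heuristic makes: bounded compile time in exchange for a possibly incomplete result.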

In D15945#321100, @echristo wrote:

Can we actually add an equivalent clang debug tuning option called "-fvar-tracking" similar to the gcc one here as well?

-eric

+1!

Note that the test case in https://llvm.org/bugs/show_bug.cgi?id=26055 is produced from auto-generated code, so its BB structure should be very regular, and you can come up with a rather precise estimate of how many passes you will need to converge. Does the current bound (10) make compile time reasonable for that case?

-fvar-tracking: yes, please! It would be nice to be able to toggle this pass from frontend.

I have a number of comments regarding the code as well, but I would rather see quick fixes (or a revert) committed ASAP: this is hurting us pretty badly, as we can't build several projects with optimization and debug info enabled.

In D15945#321360, @samsonov wrote:

Note that the test case in https://llvm.org/bugs/show_bug.cgi?id=26055 is produced from auto-generated code, so its BB structure should be very regular, and you can come up with a rather precise estimate of how many passes you will need to converge. Does the current bound (10) make compile time reasonable for that case?

Yes. The compile time is now almost equal (~0.8 sec on my machine).

Okay, I'm confused.

(Note: I helped write var-tracking for GCC).

Let's ignore the inefficient data structure usage here (i.e. appending lists instead of using constant-time splices, using lists instead of hash maps, etc.).

The reason your convergence is slow is, AFAICT, that you are processing the dataflow problem in an arbitrary basic block order.

You should be walking blocks in reverse postorder, which is the optimal order for this problem, and will significantly speed up convergence.
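To illustrate the point about visiting order, here is a small self-contained sketch (not LLVM code) that computes a reverse postorder by depth-first search and counts how many full sweeps a toy forward problem needs under different orders. The names `Graph`, `reversePostOrder` and `sweepsToConverge` are hypothetical, and the toy problem is simple reachability rather than variable locations.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical CFG stand-in: Succs[B] lists the successors of block B.
using Graph = std::vector<std::vector<std::size_t>>;

// Depth-first search that records each block after all blocks reachable
// through its successors have been recorded (postorder).
static void postorderVisit(const Graph &Succs, std::size_t B,
                           std::vector<bool> &Seen,
                           std::vector<std::size_t> &Post) {
  Seen[B] = true;
  for (std::size_t S : Succs[B])
    if (!Seen[S])
      postorderVisit(Succs, S, Seen, Post);
  Post.push_back(B);
}

// Reverse postorder of the blocks reachable from Entry.
std::vector<std::size_t> reversePostOrder(const Graph &Succs,
                                          std::size_t Entry) {
  assert(Entry < Succs.size() && "entry block out of range");
  std::vector<bool> Seen(Succs.size(), false);
  std::vector<std::size_t> Post;
  postorderVisit(Succs, Entry, Seen, Post);
  std::reverse(Post.begin(), Post.end());
  return Post;
}

// Count full sweeps over Order until a toy forward "reachable from Entry"
// fact stops changing. The final, change-free sweep is counted too.
unsigned sweepsToConverge(const Graph &Succs,
                          const std::vector<std::size_t> &Order,
                          std::size_t Entry) {
  std::vector<bool> Reached(Succs.size(), false);
  Reached[Entry] = true;
  unsigned Sweeps = 0;
  bool Changed = true;
  while (Changed) {
    Changed = false;
    for (std::size_t B : Order) {
      if (!Reached[B])
        continue;
      for (std::size_t S : Succs[B]) {
        if (!Reached[S]) {
          Reached[S] = true;
          Changed = true;
        }
      }
    }
    ++Sweeps;
  }
  return Sweeps;
}
```

On an acyclic graph, reverse postorder visits every block after all of its predecessors, so a forward problem settles in one sweep plus one confirming sweep. An unfavourable order can need roughly one sweep per block on a long chain, and loops add further sweeps in either case.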

I'm also going to take a pass at writing comments on the algorithmic issues here.

• dberlin added inline comments. Jan 8 2016, 4:20 PM

lib/CodeGen/LiveDebugValues.cpp
299	Can you please explain, precisely, what this accomplishes? Besides badly needing a comment explaining what is going on, it is really complicated, and if I'm reading it right, really slow (N^2 or worse) for the data structures you have here.
323	This looks like another N^2 part. You should either be keeping these things in a hash map or, at the very least, an ordered set so that you can do this fast.
390	Please use a SmallPtrSet<BasicBlock *, 16> Visited to track whether you visited a block or not; don't go searching through the worklist.
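The data-structure suggestions in these inline comments could look roughly like the sketch below. It is an illustration under assumptions, not the reviewer's or the author's code: variable locations and blocks are represented by plain `int` ids, `intersectWith` and `Worklist` are hypothetical names, and `std::unordered_set` stands in for the LLVM containers (such as `SmallPtrSet`) mentioned in the review.

```cpp
#include <cassert>
#include <deque>
#include <list>
#include <unordered_set>

// Hypothetical stand-in: each variable location is identified by an int.
using LocList = std::list<int>;

// Keep only the entries of In that also appear in PredOut (set intersection).
// Building the hash set costs O(|PredOut|); each lookup is O(1) on average,
// instead of a linear search of PredOut for every entry of In.
void intersectWith(LocList &In, const LocList &PredOut) {
  std::unordered_set<int> Present(PredOut.begin(), PredOut.end());
  In.remove_if([&](int Loc) { return Present.count(Loc) == 0; });
}

// Worklist of block ids with a side set that records which ids are currently
// queued, so "already present?" needs no scan of the queue.
struct Worklist {
  std::deque<int> Queue;
  std::unordered_set<int> OnList;

  // Returns true if Block was newly queued, false if it was already queued.
  bool push(int Block) {
    if (!OnList.insert(Block).second)
      return false;
    Queue.push_back(Block);
    return true;
  }

  // Removes and returns the oldest queued block.
  int pop() {
    assert(!Queue.empty() && "pop from empty worklist");
    int Block = Queue.front();
    Queue.pop_front();
    OnList.erase(Block);
    return Block;
  }
};
```

Both helpers replace a linear search inside a loop with an average constant-time lookup, which is the N^2 concern raised on lines 299, 323 and 390.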

• dberlin added inline comments. Jan 8 2016, 4:35 PM

lib/CodeGen/LiveDebugValues.cpp

(on "bool OLChanged;") There is no reason for this to be a global variable.
Please just make your transfer functions return bools and | them together.

(on "bool MBBJoined;") Ditto.
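A minimal sketch of what "return bools and | them together" might look like, using toy transfer steps over a set of `int` ids. The function names echo the ones in the patch, but the signatures and bodies here are hypothetical simplifications, not the pass's real interface.

```cpp
#include <cassert>
#include <set>

// Hypothetical stand-in for the per-block set of open variable locations.
using LocSet = std::set<int>;

// Each toy transfer step reports whether it modified Locs.
bool transferRegisterDef(LocSet &Locs, int Killed) {
  return Locs.erase(Killed) > 0;
}

bool transferDebugValue(LocSet &Locs, int Added) {
  return Locs.insert(Added).second;
}

// Combine the results with |= so that every step always runs and any change
// is reported to the caller, without shared mutable flags.
bool transfer(LocSet &Locs, int Killed, int Added) {
  bool Changed = false;
  Changed |= transferRegisterDef(Locs, Killed);
  Changed |= transferDebugValue(Locs, Added);
  return Changed;
}
```

Using `|=` rather than `||` matters here: `||` would short-circuit and skip the second step whenever the first one reported a change.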

374

You really need to be iterating in reverse postorder.

One trivial way to do that is to use the ordering and two worklists.

// Min-heaps, so that blocks come out in increasing RPO number.
std::priority_queue<unsigned int, std::vector<unsigned int>,
                    std::greater<unsigned int>> worklist;
std::priority_queue<unsigned int, std::vector<unsigned int>,
                    std::greater<unsigned int>> pending;

DenseMap<MachineBasicBlock *, unsigned int> BBToOrder;
DenseMap<unsigned int, MachineBasicBlock *> OrderToBB;

ReversePostOrderTraversal<MachineBasicBlock *> RPOT(entryblock);
unsigned int RPONumber = 0;
for (auto I = RPOT.begin(), E = RPOT.end(); I != E; ++I) {
  BBToOrder[*I] = RPONumber;
  OrderToBB[RPONumber] = *I;
  worklist.push(RPONumber);
  ++RPONumber;
}

while (changed) {
  // Go through worklist, using OrderToBB[N] to get the BB for each number N.
  // Put the necessary successors on pending, using BBToOrder.
  std::swap(worklist, pending);
  // Clear pending.
}
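The two-worklist scheme can be tried out without LLVM. The sketch below is a hypothetical, standard-library-only illustration: it assumes the blocks are already numbered in reverse postorder (so a block's index is its RPO number and block 0 is the entry), uses a toy reachability fact in place of variable locations, and `propagate`, `MinHeap` and `OnPending` are names invented for the example.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <functional>
#include <queue>
#include <utility>
#include <vector>

// Hypothetical CFG stand-in: Succs[B] lists the successors of block B.
using Graph = std::vector<std::vector<std::size_t>>;

// Min-heap: the smallest RPO number is popped first.
using MinHeap = std::priority_queue<std::size_t, std::vector<std::size_t>,
                                    std::greater<std::size_t>>;

// Propagates a toy "reached" fact along CFG edges, visiting blocks in RPO
// order within each round. Returns the total number of block visits.
std::size_t propagate(const Graph &Succs, std::vector<bool> &Reached) {
  assert(Reached.size() == Succs.size() && "one fact per block expected");
  MinHeap Worklist, Pending;
  std::vector<bool> OnPending(Succs.size(), false);
  for (std::size_t B = 0; B < Succs.size(); ++B)
    Worklist.push(B);

  std::size_t Visits = 0;
  while (!Worklist.empty()) {
    // Drain the current round in increasing RPO number.
    while (!Worklist.empty()) {
      std::size_t B = Worklist.top();
      Worklist.pop();
      ++Visits;
      if (!Reached[B])
        continue;
      for (std::size_t S : Succs[B]) {
        if (Reached[S])
          continue;
        Reached[S] = true;
        // The successor's fact changed: schedule it for the next round.
        if (!OnPending[S]) {
          OnPending[S] = true;
          Pending.push(S);
        }
      }
    }
    // Worklist is empty here, so after the swap Pending is empty as well.
    std::swap(Worklist, Pending);
    std::fill(OnPending.begin(), OnPending.end(), false);
  }
  return Visits;
}
```

Because the drained worklist is what gets swapped into `Pending`, no separate clear is needed in this variant. Only blocks whose input changed are revisited in later rounds, so the total work tracks the number of actual changes rather than the number of blocks times the number of rounds.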

Thanks @dberlin for the suggestions and for working on it (http://reviews.llvm.org/D16039).

Revision Contents

Path                               Size
lib/CodeGen/LiveDebugValues.cpp    7 lines

Diff 44190

lib/CodeGen/LiveDebugValues.cpp

[31 lines not shown]
  #include "llvm/Target/TargetSubtargetInfo.h"
  #include <deque>
  #include <list>

  using namespace llvm;

  #define DEBUG_TYPE "live-debug-values"

+ static cl::opt<bool>
+     EnableLDV("live-debug-values", cl::init(true),
+               cl::desc("Enable the live debug values pass"), cl::Hidden);
+
  STATISTIC(NumInserted, "Number of DBG_VALUE instructions inserted");

  namespace {

  class LiveDebugValues : public MachineFunctionPass {

  private:
    const TargetRegisterInfo *TRI;
[23 lines not shown; enclosing context: struct VarLoc {]
      VarLoc(DebugVariable _var, const MachineInstr *_mi) : Var(_var), MI(_mi) {}

      bool operator==(const VarLoc &V) const;
    };

    typedef std::list<VarLoc> VarLocList;
    typedef SmallDenseMap<const MachineBasicBlock *, VarLocList> VarLocInMBB;

    bool OLChanged; // OutgoingLocs got changed for this bb.
>> dberlin (inline): There is no reason for this to be a global variable. Please just make your transfer functions return bools and | them together.
    bool MBBJoined; // The MBB was joined.

>> dberlin (inline): Ditto.
    void transferDebugValue(MachineInstr &MI, VarLocList &OpenRanges);
    void transferRegisterDef(MachineInstr &MI, VarLocList &OpenRanges);
    void transferTerminatorInst(MachineInstr &MI, VarLocList &OpenRanges,
                                VarLocInMBB &OutLocs);
    void transfer(MachineInstr &MI, VarLocList &OpenRanges, VarLocInMBB &OutLocs);

    void join(MachineBasicBlock &MBB, VarLocInMBB &OutLocs, VarLocInMBB &InLocs);

[197 lines not shown; enclosing context: for (auto p : MBB.predecessors()) {]
      // Just copy over the Out locs to incoming locs for the first predecessor.
      if (p == *MBB.pred_begin()) {
        InLocsT = OL->second;
        continue;
      }

      // Join with this predecessor.
      VarLocList &VLL = OL->second;
      InLocsT.erase(std::remove_if(InLocsT.begin(), InLocsT.end(),
>> dberlin (inline): Can you please explain, precisely, what this accomplishes? Besides badly needing a comment explaining what is going on, it is really complicated, and if i'm reading it right, really slow (N^2 or worse) for the datastructures you have here.
                                   [&](VarLoc &ILT) {
                                     return (std::find_if(VLL.begin(), VLL.end(),
                                                          [&](const VarLoc &V) {
                                                            return (ILT == V);
                                                          }) == VLL.end());
                                   }),
                    InLocsT.end());
    }

    if (InLocsT.empty())
      return;

    if (InLocs.find(&MBB) == InLocs.end()) {
      // Create space for new Incoming locs entries.
      VarLocList VLL;
      InLocs.insert(std::make_pair(&MBB, std::move(VLL)));
    }
    auto IL = InLocs.find(&MBB);
    assert(IL != InLocs.end());
    VarLocList &ILL = IL->second;

    // Insert DBG_VALUE instructions, if not already inserted.
    for (auto ILT : InLocsT) {
      if (std::find_if(ILL.begin(), ILL.end(), [&](const VarLoc &I) {
>> dberlin (inline): This looks like another N^2 part. You either should be keeping these things in a hash map or at the very least, and ordered set so that you can do this fast.
            return (ILT == I);
          }) == ILL.end()) {
        // This VarLoc is not found in InLocs i.e. it is not yet inserted. So, a
        // new range is started for the var from the mbb's beginning by inserting
        // a new DBG_VALUE. transfer() will end this range however appropriate.
        const MachineInstr *DMI = ILT.MI;
        MachineInstr *MI =
            BuildMI(MBB, MBB.instr_begin(), DMI->getDebugLoc(), DMI->getDesc(),
[34 lines not shown; enclosing context: bool LiveDebugValues::ExtendRanges(MachineFunction &MF) {]

    // Construct a worklist of MBBs.
    for (auto &MBB : MF)
      BBWorklist.push_back(&MBB);

    // Perform join() and transfer() using the worklist until the ranges converge
    // Ranges have converged when the worklist is empty.
    while (!BBWorklist.empty()) {
      MachineBasicBlock *MBB = BBWorklist.front();
>> dberlin (inline): You really need to be iterating in reverse postorder. One trivial way to do that is to use the ordering and two worklists. (The full comment, with its code sketch, is in the event timeline above under line 374.)
      BBWorklist.pop_front();

      join(*MBB, OutLocs, InLocs);

      if (MBBJoined) {
        Changed = true;
        for (auto &MI : *MBB)
          transfer(MI, OpenRanges, OutLocs);
        DEBUG(printVarLocInMBB(OutLocs, "OutLocs after propagating", dbgs()));
        DEBUG(printVarLocInMBB(InLocs, "InLocs after propagating", dbgs()));

        if (OLChanged) {
          OLChanged = false;
          for (auto s : MBB->successors())
            if (std::find(BBWorklist.begin(), BBWorklist.end(), s) ==
                BBWorklist.end()) // add if not already present.
>> dberlin (inline): Please use a SmallPtrSet<BasicBlock *, 16> Visited to to track whether you visited a block or not, don't go searching through the worklist.
              BBWorklist.push_back(s);
        }
      }
    }
    DEBUG(printVarLocInMBB(OutLocs, "Final OutLocs", dbgs()));
    DEBUG(printVarLocInMBB(InLocs, "Final InLocs", dbgs()));
    return Changed;
  }

  bool LiveDebugValues::runOnMachineFunction(MachineFunction &MF) {
+   if (!EnableLDV)
+     return false;
+
    TRI = MF.getSubtarget().getRegisterInfo();
    TII = MF.getSubtarget().getInstrInfo();

    bool Changed = false;

    Changed |= ExtendRanges(MF);

    return Changed;
  }
