This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/tools/llvm-mca/Views/
-
tools/
-
llvm-mca/
-
Views/
-
InstructionInfoView.h
6/7
InstructionInfoView.cpp
-
SummaryView.h
-
SummaryView.cpp

Differential D86177

[llvm-mca][NFC] Separate calculation of display data from its display in the summary and instruction info views
ClosedPublic

Authored by wolfgangp on Aug 18 2020, 4:08 PM.

Download Raw Diff

Details

Reviewers

andreadb
RKSimon
lebedev.ri

Summary

This is a preparatory patch for PR47227. We want to separate the calculation of display data from its display for the MCA views with the goal to then alternatively generate JSON (serialized) output.

We move the calculations to methods by the name of collectData() before displaying the data.

Diff Detail

Event Timeline

wolfgangp created this revision.Aug 18 2020, 4:08 PM

Herald added a project: Restricted Project. · View Herald TranscriptAug 18 2020, 4:08 PM

Herald added a subscriber: gbedwell. · View Herald Transcript

wolfgangp requested review of this revision.Aug 18 2020, 4:08 PM

Before going with JSON, i think it will be important to establish the stability guarantees:
there shouldn't be any stability guarantees, neither on the data nor on the structure.

llvm/tools/llvm-mca/Views/InstructionInfoView.cpp
29	These should be default member-initializers
29	This is the only place with `IIVDVec`, let's just inline it?
43–45	auto I : zip(IIVD, Source) const InstructionInfoViewData& IIVDEntry = std::get<0>(I) const MCInst &Inst = std::get<1>(I)
100	Just take `MutableArrayRef`.
102	auto I : zip(Source, IIVD) const MCInst &Inst = std::get<0>(I); InstructionInfoViewData& IIVDEntry = std::get<1>(I);

Nice!

Thanks Wolfgang! I always wanted to add a structured output to llvm-mca.
Thanks for working on this.

I also agree with the idea of splitting the process in two stages: a preliminar stage to collect the data from a view, a second (and final) stage where the data collected during the first stage is properly structured and printed out.

I agree with Roman in that we should not guarantee stability of data and/or structure.
Also, changes to the output structure should always be advertised (for example, by adding a line in the release notes), so that people are always aware of it.

That being said, it is not impossible for most (if not all) default views to guarantee stability of data (but not structure). After all, default views rarely (if ever) change in practice.
However, to guarantee data stability, we need some form of versioning and ideally an "auto-upgrade" functionality too (to convert/map data from an older structure to elements of the new structure). The way how I see it is that there is no need to implement this now. We can always add more guarantees in the future if we really think it is worth it.

About the patch:
Roman has already pointed out what can be improved in the patch (foreach loops / default initializer / etc.). I have nothing else to add on it.
I like your design and the general direction. I will be happy to accept this patch once comments from Roman are addressed.

Thanks,
Andrea

Addressed review comments:

Using MutableArrayRef
Using range-based for loops with zip
Using default initializers with InstructionInfoViewData

lebedev.ri added inline comments.Aug 19 2020, 1:56 PM

llvm/tools/llvm-mca/Views/InstructionInfoView.cpp
43–46	Ok, then auto I : enumerate(zip(IIVD, Source)) const InstructionInfoViewData &IIVDEntry = std::get<0>(I.value()); CE.getEncoding(I.index())

In D86177#2225748, @andreadb wrote:

Nice!

Thank you Andrea!

I agree with Roman in that we should not guarantee stability of data and/or structure.
Also, changes to the output structure should always be advertised (for example, by adding a line in the release notes), so that people are always aware of it.

I must confess I am not very informed about the issues regarding stability. Doesn't data stability depend on the scheduling model? I don't see how llvm-mca could guarantee it if that changes.
Or maybe I'm just confused about this. If you or Roman could briefly outline what's implied by data stability I'd appreciate it. Regarding structure stability a migration strategy would imply that llvm-mca would have to be able to read an older version JSON and update it to a later version, no?

I'll submit the follow-on patch to get started and you can educate me further on stability.

That being said, it is not impossible for most (if not all) default views to guarantee stability of data (but not structure). After all, default views rarely (if ever) change in practice.
However, to guarantee data stability, we need some form of versioning and ideally an "auto-upgrade" functionality too (to convert/map data from an older structure to elements of the new structure). The way how I see it is that there is no need to implement this now. We can always add more guarantees in the future if we really think it is worth it.

<snip>

wolfgang

llvm/tools/llvm-mca/Views/InstructionInfoView.cpp
29	I'm planning one more use. Would you still prefer using the explicit type?

In D86177#2227126, @wolfgangp wrote:

In D86177#2225748, @andreadb wrote:

Nice!

Thank you Andrea!

I agree with Roman in that we should not guarantee stability of data and/or structure.
Also, changes to the output structure should always be advertised (for example, by adding a line in the release notes), so that people are always aware of it.

I must confess I am not very informed about the issues regarding stability. Doesn't data stability depend on the scheduling model? I don't see how llvm-mca could guarantee it if that changes.

What I was trying to say is that the structure of the json report may change over time.

Future versions of mca may decide to structure data differently. Since we don't guarantee a stable structure for the json file, this may introduce incompatibilites in downstream tools which instead expect/assume a fixed/unchanging layout.

One way to solve this issue is by adding a version string to the json file, with the idea that each version string corresponds to a different structure of the json report.
It doesn't solve the incompatibility issue, but it allow tools to quickly identify incompatible files.

My other point was about the "stability of the data" (specifically: data generated by the default llvm-mca views).
Default mca views are not likely to change in future. So we can assume that the data generated by those views will always be the same. What could change is how data is layout in the final json report; Since the layout is not fixed, data may be moved to different objects/used by different name-value pairs etc.

If data doesn't change, and the only thing that changes is how data is mapped to the json structure, then it is possible to write an automated tool (e.g. an auto-upgrade script) which maps the data from an older report to the new json report structure. This again would probably require the presence of a string version.

In conclusion: if we really want to, in future we can always introduce a version string and write a simple auto-upgrade with simple rewriting rules for a few basic views.
However, yy personal opinion is that we should not worry about this (at least, not now).

Addressed review comment:

using enumerate() instead of an explicit index variable

wolfgangp marked an inline comment as done.Aug 21 2020, 8:57 AM

This revision is now accepted and ready to land.Aug 21 2020, 8:59 AM

LGTM

Not sure why this review wasn't automatically closed.
This was committed back in August as cf6adecd6a8718ee2737ca55e4cd938364b984cc

Revision Contents

Path

Size

llvm/

tools/

llvm-mca/

Views/

InstructionInfoView.h

14 lines

InstructionInfoView.cpp

83 lines

SummaryView.h

15 lines

SummaryView.cpp

40 lines

Diff 286416

llvm/tools/llvm-mca/Views/InstructionInfoView.h

	Show All 30 Lines
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#ifndef LLVM_TOOLS_LLVM_MCA_INSTRUCTIONINFOVIEW_H			#ifndef LLVM_TOOLS_LLVM_MCA_INSTRUCTIONINFOVIEW_H
	#define LLVM_TOOLS_LLVM_MCA_INSTRUCTIONINFOVIEW_H			#define LLVM_TOOLS_LLVM_MCA_INSTRUCTIONINFOVIEW_H

	#include "Views/View.h"			#include "Views/View.h"
	#include "llvm/ADT/ArrayRef.h"			#include "llvm/ADT/ArrayRef.h"
				#include "llvm/ADT/SmallVector.h"
	#include "llvm/MC/MCInst.h"			#include "llvm/MC/MCInst.h"
	#include "llvm/MC/MCInstPrinter.h"			#include "llvm/MC/MCInstPrinter.h"
	#include "llvm/MC/MCInstrInfo.h"			#include "llvm/MC/MCInstrInfo.h"
	#include "llvm/MC/MCSubtargetInfo.h"			#include "llvm/MC/MCSubtargetInfo.h"
	#include "llvm/MCA/CodeEmitter.h"			#include "llvm/MCA/CodeEmitter.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"

	#define DEBUG_TYPE "llvm-mca"			#define DEBUG_TYPE "llvm-mca"

	namespace llvm {			namespace llvm {
	namespace mca {			namespace mca {

	/// A view that prints out generic instruction information.			/// A view that prints out generic instruction information.
	class InstructionInfoView : public View {			class InstructionInfoView : public View {
	const llvm::MCSubtargetInfo &STI;			const llvm::MCSubtargetInfo &STI;
	const llvm::MCInstrInfo &MCII;			const llvm::MCInstrInfo &MCII;
	CodeEmitter &CE;			CodeEmitter &CE;
	bool PrintEncodings;			bool PrintEncodings;
	llvm::ArrayRef<llvm::MCInst> Source;			llvm::ArrayRef<llvm::MCInst> Source;
	llvm::MCInstPrinter &MCIP;			llvm::MCInstPrinter &MCIP;

				struct InstructionInfoViewData {
				unsigned NumMicroOpcodes;
				unsigned Latency;
				Optional<double> RThroughput;
				bool mayLoad;
				bool mayStore;
				bool hasUnmodeledSideEffects;
				};
				using IIVDVec = SmallVector<InstructionInfoViewData, 16>;

				/// Place the data into the array of InstructionInfoViewData IIVD.
				void collectData(IIVDVec &IIVD) const;

	public:			public:
	InstructionInfoView(const llvm::MCSubtargetInfo &ST,			InstructionInfoView(const llvm::MCSubtargetInfo &ST,
	const llvm::MCInstrInfo &II, CodeEmitter &C,			const llvm::MCInstrInfo &II, CodeEmitter &C,
	bool ShouldPrintEncodings, llvm::ArrayRef<llvm::MCInst> S,			bool ShouldPrintEncodings, llvm::ArrayRef<llvm::MCInst> S,
	llvm::MCInstPrinter &IP)			llvm::MCInstPrinter &IP)
	: STI(ST), MCII(II), CE(C), PrintEncodings(ShouldPrintEncodings),			: STI(ST), MCII(II), CE(C), PrintEncodings(ShouldPrintEncodings),
	Source(S), MCIP(IP) {}			Source(S), MCIP(IP) {}

	void printView(llvm::raw_ostream &OS) const override;			void printView(llvm::raw_ostream &OS) const override;
	};			};
	} // namespace mca			} // namespace mca
	} // namespace llvm			} // namespace llvm

	#endif			#endif

llvm/tools/llvm-mca/Views/InstructionInfoView.cpp

	Show All 14 Lines
	#include "llvm/Support/FormattedStream.h"			#include "llvm/Support/FormattedStream.h"

	namespace llvm {			namespace llvm {
	namespace mca {			namespace mca {

	void InstructionInfoView::printView(raw_ostream &OS) const {			void InstructionInfoView::printView(raw_ostream &OS) const {
	std::string Buffer;			std::string Buffer;
	raw_string_ostream TempStream(Buffer);			raw_string_ostream TempStream(Buffer);
	const MCSchedModel &SM = STI.getSchedModel();

	std::string Instruction;			std::string Instruction;
	raw_string_ostream InstrStream(Instruction);			raw_string_ostream InstrStream(Instruction);

				if (!Source.size())
				return;

				IIVDVec IIVD(Source.size(), {0, 0, 0.0, false, false, false});
				lebedev.riUnsubmitted Done Reply Inline Actions These should be default member-initializers lebedev.ri: These should be default member-initializers
				lebedev.riUnsubmitted Not Done Reply Inline Actions This is the only place with `IIVDVec`, let's just inline it? lebedev.ri: This is the only place with `IIVDVec`, let's just inline it?
				wolfgangpAuthorUnsubmitted Done Reply Inline Actions I'm planning one more use. Would you still prefer using the explicit type? wolfgangp: I'm planning one more use. Would you still prefer using the explicit type?
				collectData(IIVD);

	TempStream << "\n\nInstruction Info:\n";			TempStream << "\n\nInstruction Info:\n";
	TempStream << "[1]: #uOps\n[2]: Latency\n[3]: RThroughput\n"			TempStream << "[1]: #uOps\n[2]: Latency\n[3]: RThroughput\n"
	<< "[4]: MayLoad\n[5]: MayStore\n[6]: HasSideEffects (U)\n";			<< "[4]: MayLoad\n[5]: MayStore\n[6]: HasSideEffects (U)\n";
	if (PrintEncodings) {			if (PrintEncodings) {
	TempStream << "[7]: Encoding Size\n";			TempStream << "[7]: Encoding Size\n";
	TempStream << "\n[1] [2] [3] [4] [5] [6] [7] "			TempStream << "\n[1] [2] [3] [4] [5] [6] [7] "
	<< "Encodings: Instructions:\n";			<< "Encodings: Instructions:\n";
	} else {			} else {
	TempStream << "\n[1] [2] [3] [4] [5] [6] Instructions:\n";			TempStream << "\n[1] [2] [3] [4] [5] [6] Instructions:\n";
	}			}

	for (unsigned I = 0, E = Source.size(); I < E; ++I) {			for (auto IIVDIt = IIVD.begin(); IIVDIt != IIVD.end(); ++IIVDIt) {
	const MCInst &Inst = Source[I];			InstructionInfoViewData IIVDEntry = *IIVDIt;
	const MCInstrDesc &MCDesc = MCII.get(Inst.getOpcode());			unsigned I = std::distance(IIVD.begin(), IIVDIt);
				lebedev.riUnsubmitted Done Reply Inline Actions auto I : zip(IIVD, Source) const InstructionInfoViewData& IIVDEntry = std::get<0>(I) const MCInst &Inst = std::get<1>(I) lebedev.ri: auto I : zip(IIVD, Source) const InstructionInfoViewData& IIVDEntry = std::get<0>(I) const…

	// Obtain the scheduling class information from the instruction.
	unsigned SchedClassID = MCDesc.getSchedClass();
	unsigned CPUID = SM.getProcessorID();

	// Try to solve variant scheduling classes.
	while (SchedClassID && SM.getSchedClassDesc(SchedClassID)->isVariant())
	SchedClassID = STI.resolveVariantSchedClass(SchedClassID, &Inst, CPUID);

	const MCSchedClassDesc &SCDesc = *SM.getSchedClassDesc(SchedClassID);
	unsigned NumMicroOpcodes = SCDesc.NumMicroOps;
	unsigned Latency = MCSchedModel::computeInstrLatency(STI, SCDesc);
	// Add extra latency due to delays in the forwarding data paths.
	Latency += MCSchedModel::getForwardingDelayCycles(
	STI.getReadAdvanceEntries(SCDesc));
	Optional<double> RThroughput =
	MCSchedModel::getReciprocalThroughput(STI, SCDesc);

				lebedev.riUnsubmitted Done Reply Inline Actions Ok, then auto I : enumerate(zip(IIVD, Source)) const InstructionInfoViewData &IIVDEntry = std::get<0>(I.value()); CE.getEncoding(I.index()) lebedev.ri: Ok, then auto I : enumerate(zip(IIVD, Source)) const InstructionInfoViewData &IIVDEntry = std…
	TempStream << ' ' << NumMicroOpcodes << " ";			TempStream << ' ' << IIVDEntry.NumMicroOpcodes << " ";
	if (NumMicroOpcodes < 10)			if (IIVDEntry.NumMicroOpcodes < 10)
	TempStream << " ";			TempStream << " ";
	else if (NumMicroOpcodes < 100)			else if (IIVDEntry.NumMicroOpcodes < 100)
	TempStream << ' ';			TempStream << ' ';
	TempStream << Latency << " ";			TempStream << IIVDEntry.Latency << " ";
	if (Latency < 10)			if (IIVDEntry.Latency < 10)
	TempStream << " ";			TempStream << " ";
	else if (Latency < 100)			else if (IIVDEntry.Latency < 100)
	TempStream << ' ';			TempStream << ' ';

	if (RThroughput.hasValue()) {			if (IIVDEntry.RThroughput.hasValue()) {
	double RT = RThroughput.getValue();			double RT = IIVDEntry.RThroughput.getValue();
	TempStream << format("%.2f", RT) << ' ';			TempStream << format("%.2f", RT) << ' ';
	if (RT < 10.0)			if (RT < 10.0)
	TempStream << " ";			TempStream << " ";
	else if (RT < 100.0)			else if (RT < 100.0)
	TempStream << ' ';			TempStream << ' ';
	} else {			} else {
	TempStream << " - ";			TempStream << " - ";
	}			}
	TempStream << (MCDesc.mayLoad() ? " * " : " ");			TempStream << (IIVDEntry.mayLoad ? " * " : " ");
	TempStream << (MCDesc.mayStore() ? " * " : " ");			TempStream << (IIVDEntry.mayStore ? " * " : " ");
	TempStream << (MCDesc.hasUnmodeledSideEffects() ? " U " : " ");			TempStream << (IIVDEntry.hasUnmodeledSideEffects ? " U " : " ");

	if (PrintEncodings) {			if (PrintEncodings) {
	StringRef Encoding(CE.getEncoding(I));			StringRef Encoding(CE.getEncoding(I));
	unsigned EncodingSize = Encoding.size();			unsigned EncodingSize = Encoding.size();
	TempStream << " " << EncodingSize			TempStream << " " << EncodingSize
	<< (EncodingSize < 10 ? " " : " ");			<< (EncodingSize < 10 ? " " : " ");
	TempStream.flush();			TempStream.flush();
	formatted_raw_ostream FOS(TempStream);			formatted_raw_ostream FOS(TempStream);
	for (unsigned i = 0, e = Encoding.size(); i != e; ++i)			for (unsigned i = 0, e = Encoding.size(); i != e; ++i)
	FOS << format("%02x ", (uint8_t)Encoding[i]);			FOS << format("%02x ", (uint8_t)Encoding[i]);
	FOS.PadToColumn(30);			FOS.PadToColumn(30);
	FOS.flush();			FOS.flush();
	}			}

				const MCInst &Inst = Source[I];
	MCIP.printInst(&Inst, 0, "", STI, InstrStream);			MCIP.printInst(&Inst, 0, "", STI, InstrStream);
	InstrStream.flush();			InstrStream.flush();

	// Consume any tabs or spaces at the beginning of the string.			// Consume any tabs or spaces at the beginning of the string.
	StringRef Str(Instruction);			StringRef Str(Instruction);
	Str = Str.ltrim();			Str = Str.ltrim();
	TempStream << Str << '\n';			TempStream << Str << '\n';
	Instruction = "";			Instruction = "";
	}			}

	TempStream.flush();			TempStream.flush();
	OS << Buffer;			OS << Buffer;
	}			}

				void InstructionInfoView::collectData(IIVDVec &IIVD) const {
				lebedev.riUnsubmitted Done Reply Inline Actions Just take `MutableArrayRef`. lebedev.ri: Just take `MutableArrayRef`.
				const MCSchedModel &SM = STI.getSchedModel();
				for (unsigned I = 0, E = Source.size(); I < E; ++I) {
				lebedev.riUnsubmitted Done Reply Inline Actions auto I : zip(Source, IIVD) const MCInst &Inst = std::get<0>(I); InstructionInfoViewData& IIVDEntry = std::get<1>(I); lebedev.ri: auto I : zip(Source, IIVD) const MCInst &Inst = std::get<0>(I); InstructionInfoViewData&…
				InstructionInfoViewData IIVDEntry;
				const MCInst &Inst = Source[I];
				const MCInstrDesc &MCDesc = MCII.get(Inst.getOpcode());

				// Obtain the scheduling class information from the instruction.
				unsigned SchedClassID = MCDesc.getSchedClass();
				unsigned CPUID = SM.getProcessorID();

				// Try to solve variant scheduling classes.
				while (SchedClassID && SM.getSchedClassDesc(SchedClassID)->isVariant())
				SchedClassID = STI.resolveVariantSchedClass(SchedClassID, &Inst, CPUID);

				const MCSchedClassDesc &SCDesc = *SM.getSchedClassDesc(SchedClassID);
				IIVDEntry.NumMicroOpcodes = SCDesc.NumMicroOps;
				IIVDEntry.Latency = MCSchedModel::computeInstrLatency(STI, SCDesc);
				// Add extra latency due to delays in the forwarding data paths.
				IIVDEntry.Latency += MCSchedModel::getForwardingDelayCycles(
				STI.getReadAdvanceEntries(SCDesc));
				IIVDEntry.RThroughput = MCSchedModel::getReciprocalThroughput(STI, SCDesc);
				IIVDEntry.mayLoad = MCDesc.mayLoad();
				IIVDEntry.mayStore = MCDesc.mayStore();
				IIVDEntry.hasUnmodeledSideEffects = MCDesc.hasUnmodeledSideEffects();
				IIVD[I] = IIVDEntry;
				}
				}
	} // namespace mca.			} // namespace mca.
	} // namespace llvm			} // namespace llvm

llvm/tools/llvm-mca/Views/SummaryView.h

Show All 40 Lines	class SummaryView : public View {
const llvm::MCSchedModel &SM;		const llvm::MCSchedModel &SM;
llvm::ArrayRef<llvm::MCInst> Source;		llvm::ArrayRef<llvm::MCInst> Source;
const unsigned DispatchWidth;		const unsigned DispatchWidth;
unsigned LastInstructionIdx;		unsigned LastInstructionIdx;
unsigned TotalCycles;		unsigned TotalCycles;
// The total number of micro opcodes contributed by a block of instructions.		// The total number of micro opcodes contributed by a block of instructions.
unsigned NumMicroOps;		unsigned NumMicroOps;

		struct DisplayValues {
		unsigned Instructions;
		unsigned Iterations;
		unsigned TotalInstructions;
		unsigned TotalCycles;
		unsigned DispatchWidth;
		unsigned TotalUOps;
		double IPC;
		double UOpsPerCycle;
		double BlockRThroughput;
		};

// For each processor resource, this vector stores the cumulative number of		// For each processor resource, this vector stores the cumulative number of
// resource cycles consumed by the analyzed code block.		// resource cycles consumed by the analyzed code block.
llvm::SmallVector<unsigned, 8> ProcResourceUsage;		llvm::SmallVector<unsigned, 8> ProcResourceUsage;

// Each processor resource is associated with a so-called processor resource		// Each processor resource is associated with a so-called processor resource
// mask. This vector allows to correlate processor resource IDs with processor		// mask. This vector allows to correlate processor resource IDs with processor
// resource masks. There is exactly one element per each processor resource		// resource masks. There is exactly one element per each processor resource
// declared by the scheduling model.		// declared by the scheduling model.
llvm::SmallVector<uint64_t, 8> ProcResourceMasks;		llvm::SmallVector<uint64_t, 8> ProcResourceMasks;

// Used to map resource indices to actual processor resource IDs.		// Used to map resource indices to actual processor resource IDs.
llvm::SmallVector<unsigned, 8> ResIdx2ProcResID;		llvm::SmallVector<unsigned, 8> ResIdx2ProcResID;

// Compute the reciprocal throughput for the analyzed code block.		// Compute the reciprocal throughput for the analyzed code block.
// The reciprocal block throughput is computed as the MAX between:		// The reciprocal block throughput is computed as the MAX between:
// - NumMicroOps / DispatchWidth		// - NumMicroOps / DispatchWidth
// - Total Resource Cycles / #Units (for every resource consumed).		// - Total Resource Cycles / #Units (for every resource consumed).
double getBlockRThroughput() const;		double getBlockRThroughput() const;

		/// Compute the data we want to print out in the object DV.
		void collectData(DisplayValues &DV) const;

public:		public:
SummaryView(const llvm::MCSchedModel &Model, llvm::ArrayRef<llvm::MCInst> S,		SummaryView(const llvm::MCSchedModel &Model, llvm::ArrayRef<llvm::MCInst> S,
unsigned Width);		unsigned Width);

void onCycleEnd() override { ++TotalCycles; }		void onCycleEnd() override { ++TotalCycles; }
void onEvent(const HWInstructionEvent &Event) override;		void onEvent(const HWInstructionEvent &Event) override;
void printView(llvm::raw_ostream &OS) const override;		void printView(llvm::raw_ostream &OS) const override;
};		};

} // namespace mca		} // namespace mca
} // namespace llvm		} // namespace llvm

#endif		#endif

llvm/tools/llvm-mca/Views/SummaryView.cpp

Show First 20 Lines • Show All 57 Lines • ▼ Show 20 Lines	for (const std::pair<uint64_t, ResourceUsage> &RU : Desc.Resources) {
if (RU.second.size()) {		if (RU.second.size()) {
unsigned ProcResID = ResIdx2ProcResID[getResourceStateIndex(RU.first)];		unsigned ProcResID = ResIdx2ProcResID[getResourceStateIndex(RU.first)];
ProcResourceUsage[ProcResID] += RU.second.size();		ProcResourceUsage[ProcResID] += RU.second.size();
}		}
}		}
}		}

void SummaryView::printView(raw_ostream &OS) const {		void SummaryView::printView(raw_ostream &OS) const {
unsigned Instructions = Source.size();
unsigned Iterations = (LastInstructionIdx / Instructions) + 1;
unsigned TotalInstructions = Instructions * Iterations;
unsigned TotalUOps = NumMicroOps * Iterations;
double IPC = (double)TotalInstructions / TotalCycles;
double UOpsPerCycle = (double)TotalUOps / TotalCycles;
double BlockRThroughput = computeBlockRThroughput(
SM, DispatchWidth, NumMicroOps, ProcResourceUsage);

std::string Buffer;		std::string Buffer;
raw_string_ostream TempStream(Buffer);		raw_string_ostream TempStream(Buffer);
TempStream << "Iterations: " << Iterations;		DisplayValues DV;
TempStream << "\nInstructions: " << TotalInstructions;
TempStream << "\nTotal Cycles: " << TotalCycles;		collectData(DV);
TempStream << "\nTotal uOps: " << TotalUOps << '\n';		TempStream << "Iterations: " << DV.Iterations;
TempStream << "\nDispatch Width: " << DispatchWidth;		TempStream << "\nInstructions: " << DV.TotalInstructions;
		TempStream << "\nTotal Cycles: " << DV.TotalCycles;
		TempStream << "\nTotal uOps: " << DV.TotalUOps << '\n';
		TempStream << "\nDispatch Width: " << DV.DispatchWidth;
TempStream << "\nuOps Per Cycle: "		TempStream << "\nuOps Per Cycle: "
<< format("%.2f", floor((UOpsPerCycle * 100) + 0.5) / 100);		<< format("%.2f", floor((DV.UOpsPerCycle * 100) + 0.5) / 100);
TempStream << "\nIPC: "		TempStream << "\nIPC: "
<< format("%.2f", floor((IPC * 100) + 0.5) / 100);		<< format("%.2f", floor((DV.IPC * 100) + 0.5) / 100);
TempStream << "\nBlock RThroughput: "		TempStream << "\nBlock RThroughput: "
<< format("%.1f", floor((BlockRThroughput * 10) + 0.5) / 10)		<< format("%.1f", floor((DV.BlockRThroughput * 10) + 0.5) / 10)
<< '\n';		<< '\n';
TempStream.flush();		TempStream.flush();
OS << Buffer;		OS << Buffer;
}		}

		void SummaryView::collectData(DisplayValues &DV) const {
		DV.Instructions = Source.size();
		DV.Iterations = (LastInstructionIdx / DV.Instructions) + 1;
		DV.TotalInstructions = DV.Instructions * DV.Iterations;
		DV.TotalCycles = TotalCycles;
		DV.DispatchWidth = DispatchWidth;
		DV.TotalUOps = NumMicroOps * DV.Iterations;
		DV.UOpsPerCycle = (double)DV.TotalUOps / TotalCycles;
		DV.IPC = (double)DV.TotalInstructions / TotalCycles;
		DV.BlockRThroughput = computeBlockRThroughput(SM, DispatchWidth, NumMicroOps,
		ProcResourceUsage);
		}
} // namespace mca.		} // namespace mca.
} // namespace llvm		} // namespace llvm