This is an archive of the discontinued LLVM Phabricator instance.

[GCOV] Emit the writeout function as nested loops of global data.
ClosedPublic

Authored by chandlerc on May 2 2018, 5:12 AM.

Details

Reviewers

wmi
echristo
dblaikie

Commits

rG71c3a3fac521: [GCOV] Emit the writeout function as nested loops of global data.
rL331407: [GCOV] Emit the writeout function as nested loops of global data.

Summary

Prior to this change, LLVM would in some cases emit *massive* writeout
functions with tens of thousands of function calls in straight-line
code. This is a very wasteful way to represent what are fundamentally
loops, and it creates a number of scalability issues. Among other things,
register allocating these calls is extremely expensive. While D46127 makes this
less severe, we'll still run into scaling issues eventually, if not
in compile time then in code size.

Now the pass builds up global data structures modeling the inputs to
these functions, and simply loops over the data structures calling the
relevant functions with those values. This ensures that the code size is
fixed and only the data size grows with larger amounts of coverage data.
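
As a rough illustration of the table-driven shape described above, here is a minimal plain-C++ sketch. It is not the pass itself: the struct names mirror the types in the diff, but `CallLog`, `writeout`, `runExample`, the file name, and the version string are hypothetical, and the runtime hooks are replaced by simple counters so the sketch runs without LLVM.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the llvm_gcda_* runtime hooks: it only counts
// how often each hook would have been called.
struct CallLog {
  int StartFile = 0;
  int EmitFunction = 0;
  int EmitArcs = 0;
  int SummaryInfo = 0;
  int EndFile = 0;
  uint64_t TotalArcs = 0;
};

// Plain-C++ mirrors of the per-call argument records stored in globals.
struct StartFileArgs {
  const char *Filename;
  const char *Version;
  uint32_t CfgChecksum;
};
struct EmitFunctionArgs {
  uint32_t Ident;
  const char *Name; // may be null when function names are not emitted
  uint32_t FuncChecksum;
  uint8_t UseCfgChecksum;
  uint32_t CfgChecksum;
};
struct EmitArcsArgs {
  uint32_t NumArcs;
  uint64_t *Counters;
};
struct FileInfo {
  StartFileArgs Start;
  int32_t NumCounters;
  const EmitFunctionArgs *Functions; // NumCounters entries
  const EmitArcsArgs *Arcs;          // NumCounters entries
};

// Fixed-size code: two nested loops walk the tables. Only the tables grow
// as more coverage data is added.
void writeout(const FileInfo *Files, int32_t NumFiles, CallLog &Log) {
  assert(NumFiles > 0 && "caller is expected to bail out when empty");
  for (int32_t I = 0; I < NumFiles; ++I) {
    const FileInfo &File = Files[I];
    ++Log.StartFile; // would call the start-file hook with File.Start
    for (int32_t J = 0; J < File.NumCounters; ++J) {
      ++Log.EmitFunction; // would call the emit-function hook
      ++Log.EmitArcs;     // would call the emit-arcs hook
      Log.TotalArcs += File.Arcs[J].NumArcs;
    }
    ++Log.SummaryInfo;
    ++Log.EndFile;
  }
}

// Example data: one file with two functions (names borrowed from the test).
CallLog runExample() {
  static uint64_t FooCounters[2];
  static uint64_t BazCounters[12];
  static const EmitFunctionArgs Functions[] = {
      {0, "foo", 0, 0, 0},
      {1, "baz", 0, 0, 0},
  };
  static const EmitArcsArgs Arcs[] = {
      {2, FooCounters},
      {12, BazCounters},
  };
  static const FileInfo Files[] = {
      {{"example.gcda", "vers", 0}, 2, Functions, Arcs},
  };
  CallLog Log;
  writeout(Files, 1, Log);
  return Log;
}
```

Adding a function to the example only adds one row to each table; `writeout` itself stays the same size, which is the property the summary is after.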

A trivial change to IRBuilder is included to make it easier to build
the constants that make up the global data.

Diff Detail

Repository

rL LLVM

Build Status

Buildable 17628
Build 17628: arc lint + arc unit

Event Timeline

chandlerc created this revision. May 2 2018, 5:12 AM

Herald added subscribers: hiraditya, mcrosier, sanjoy. May 2 2018, 5:12 AM

chandlerc added a reviewer: dblaikie. May 2 2018, 5:31 AM

Can you add a new test case testing the new emitted code patterns?

The generated code looks great. We may want to do a similar thing in @__llvm_gcov_flush. It currently emits a series of memset calls in straight-line code, one for each entry in CountersBySP:
call void @llvm.memset.p0i8.i64(i8* align 16 bitcast ([2 x i64]* @__llvm_gcov_ctr to i8*), i8 0, i64 16, i1 false)
call void @llvm.memset.p0i8.i64(i8* align 16 bitcast ([12 x i64]* @__llvm_gcov_ctr.6 to i8*), i8 0, i64 96, i1 false)
call void @llvm.memset.p0i8.i64(i8* align 16 bitcast ([2 x i64]* @__llvm_gcov_ctr.7 to i8*), i8 0, i64 16, i1 false)
...

Some minor/idle comments

llvm/lib/Transforms/Instrumentation/GCOVProfiling.cpp
913	Maybe worth using 'zip' here to iterate the CUNodes (it does support iterators rather than using getOperand(i))? Might be easier, I guess, if llvm::seq supported more defaults (like defaulting to int, starting at zero, and unbounded (no need to specify an end value if the thing you zip it with is constrained anyway))
973	Is this tested? (the test case only seems to have function_args) (also I'm not sure I really understand this bit of the code - it's creating a global variable with a fixed name, but it's doing so inside the loop over all CUs? (would this create duplicate GVs? Do they somehow get renamed to be unique?) Is this code assuming there will only be one non-module-skeleton CU?)
llvm/test/Transforms/GCOVProfiling/function-numbering.ll
22–88	I'd expect this test should probably test the contents of the gcov_writeout function? Nothing else appears to be testing that code? (given that you changed it significantly & no other tests need updating)

Updated based on review feedback.

In D46357#1085440, @wmi wrote:

The generated code looks great. We may want to do a similar thing in @__llvm_gcov_flush. It currently emits a series of memset calls in straight-line code, one for each entry in CountersBySP:
call void @llvm.memset.p0i8.i64(i8* align 16 bitcast ([2 x i64]* @__llvm_gcov_ctr to i8*), i8 0, i64 16, i1 false)
call void @llvm.memset.p0i8.i64(i8* align 16 bitcast ([12 x i64]* @__llvm_gcov_ctr.6 to i8*), i8 0, i64 96, i1 false)
call void @llvm.memset.p0i8.i64(i8* align 16 bitcast ([2 x i64]* @__llvm_gcov_ctr.7 to i8*), i8 0, i64 16, i1 false)
...

Sure, but let's do that in a follow-up? It isn't hitting scaling limits nearly as rapidly from my testing.
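
For reference, the follow-up discussed here could take the same table-driven shape. The sketch below is only an illustration under that assumption and is not part of this patch: `CounterArray`, `resetCounters`, and `exampleResetsEverything` are hypothetical names, and the array sizes simply echo the memset calls quoted above.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>

// Hypothetical record: one entry per counter array that a flush must zero.
struct CounterArray {
  uint64_t *Data;
  size_t NumElements;
};

// Zero every counter array by walking a table, instead of emitting one
// memset call per array in straight-line code.
void resetCounters(const CounterArray *Arrays, size_t NumArrays) {
  for (size_t I = 0; I < NumArrays; ++I) {
    assert(Arrays[I].Data && "table entries must point at real storage");
    std::memset(Arrays[I].Data, 0, Arrays[I].NumElements * sizeof(uint64_t));
  }
}

// Example: three arrays of 2, 12, and 2 counters, filled with nonzero
// values, then reset through the table.
bool exampleResetsEverything() {
  uint64_t A[2] = {1, 2};
  uint64_t B[12];
  for (uint64_t &V : B)
    V = 7;
  uint64_t C[2] = {3, 4};
  const CounterArray Table[] = {{A, 2}, {B, 12}, {C, 2}};
  resetCounters(Table, 3);
  for (uint64_t V : A)
    if (V != 0)
      return false;
  for (uint64_t V : B)
    if (V != 0)
      return false;
  for (uint64_t V : C)
    if (V != 0)
      return false;
  return true;
}
```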

In D46357#1085373, @davidxl wrote:

Can you add a new test case testing the new emitted code patterns?

Yep (see my response to dblaikie's comment below too).

llvm/lib/Transforms/Instrumentation/GCOVProfiling.cpp
913	I'd like to do this in a follow-up change if I can? I already did more surgery here than I probably should have. Happy to iterate some on cleaner loops though.
973	I think we always have both? The test only looks at one. I honestly don't understand the contents of this well enough to write a test. I now test the writeout which at least clearly shows that the structure ends up being sound here. Regarding the global variable -- I was assuming they get renamed like SSA values. Their names aren't significant as they don't have linkage. But I can just make the names unique constructively, as that seems cleaner anyways.
llvm/test/Transforms/GCOVProfiling/function-numbering.ll
22–88	Yeah, I was trying to avoid adding large amounts of new testing but the testing seems super bad here... I've added a full test of the writeout function now.

Harbormaster completed remote builds in B17628: Diff 144939. May 2 2018, 2:58 PM

Seems good to me - but feel free to wait for Wei's OK too if you like, of course.

This revision is now accepted and ready to land. May 2 2018, 3:23 PM

LGTM. Thanks!

Closed by commit rL331407: [GCOV] Emit the writeout function as nested loops of global data. (authored by chandlerc). May 2 2018, 3:28 PM

This revision was automatically updated to reflect the committed changes.

Revision Contents

Path	Size
llvm/include/llvm/IR/IRBuilder.h	17 lines
llvm/lib/Transforms/Instrumentation/GCOVProfiling.cpp	221 lines
llvm/test/Transforms/GCOVProfiling/function-numbering.ll	69 lines

Diff 144939

llvm/include/llvm/IR/IRBuilder.h

... 1,470 lines not shown ...	Value *CreateConstInBoundsGEP2_64(Value *Ptr, uint64_t Idx0, uint64_t Idx1,
return Insert(GetElementPtrInst::CreateInBounds(nullptr, Ptr, Idxs), Name);		return Insert(GetElementPtrInst::CreateInBounds(nullptr, Ptr, Idxs), Name);
}		}

Value *CreateStructGEP(Type *Ty, Value *Ptr, unsigned Idx,		Value *CreateStructGEP(Type *Ty, Value *Ptr, unsigned Idx,
const Twine &Name = "") {		const Twine &Name = "") {
return CreateConstInBoundsGEP2_32(Ty, Ptr, 0, Idx, Name);		return CreateConstInBoundsGEP2_32(Ty, Ptr, 0, Idx, Name);
}		}

		Value *CreateStructGEP(Value *Ptr, unsigned Idx, const Twine &Name = "") {
		return CreateConstInBoundsGEP2_32(nullptr, Ptr, 0, Idx, Name);
		}

/// Same as CreateGlobalString, but return a pointer with "i8*" type		/// Same as CreateGlobalString, but return a pointer with "i8*" type
/// instead of a pointer to array of i8.		/// instead of a pointer to array of i8.
Value *CreateGlobalStringPtr(StringRef Str, const Twine &Name = "",		Constant *CreateGlobalStringPtr(StringRef Str, const Twine &Name = "",
unsigned AddressSpace = 0) {		unsigned AddressSpace = 0) {
GlobalVariable *gv = CreateGlobalString(Str, Name, AddressSpace);		GlobalVariable *GV = CreateGlobalString(Str, Name, AddressSpace);
Value *zero = ConstantInt::get(Type::getInt32Ty(Context), 0);		Constant *Zero = ConstantInt::get(Type::getInt32Ty(Context), 0);
Value *Args[] = { zero, zero };		Constant *Indices[] = {Zero, Zero};
return CreateInBoundsGEP(gv->getValueType(), gv, Args, Name);		return ConstantExpr::getInBoundsGetElementPtr(GV->getValueType(), GV,
		Indices);
}		}

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Instruction creation methods: Cast/Conversion Operators		// Instruction creation methods: Cast/Conversion Operators
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//

Value *CreateTrunc(Value *V, Type *DestTy, const Twine &Name = "") {		Value *CreateTrunc(Value *V, Type *DestTy, const Twine &Name = "") {
return CreateCast(Instruction::Trunc, V, DestTy, Name);		return CreateCast(Instruction::Trunc, V, DestTy, Name);
... 621 lines not shown ...

llvm/lib/Transforms/Instrumentation/GCOVProfiling.cpp

... 11 lines not shown ...
// to records the edges between blocks that run and emit a complementary "gcda"		// to records the edges between blocks that run and emit a complementary "gcda"
// file on exit.		// file on exit.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/Hashing.h"		#include "llvm/ADT/Hashing.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
		#include "llvm/ADT/Sequence.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/StringExtras.h"		#include "llvm/ADT/StringExtras.h"
#include "llvm/ADT/StringMap.h"		#include "llvm/ADT/StringMap.h"
#include "llvm/ADT/UniqueVector.h"		#include "llvm/ADT/UniqueVector.h"
#include "llvm/Analysis/EHPersonalities.h"		#include "llvm/Analysis/EHPersonalities.h"
#include "llvm/IR/DebugInfo.h"		#include "llvm/IR/DebugInfo.h"
#include "llvm/IR/DebugLoc.h"		#include "llvm/IR/DebugLoc.h"
#include "llvm/IR/IRBuilder.h"		#include "llvm/IR/IRBuilder.h"
#include "llvm/IR/InstIterator.h"		#include "llvm/IR/InstIterator.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include "llvm/Transforms/Instrumentation/GCOVProfiler.h"
#include "llvm/Transforms/Instrumentation.h"		#include "llvm/Transforms/Instrumentation.h"
		#include "llvm/Transforms/Instrumentation/GCOVProfiler.h"
#include "llvm/Transforms/Utils/ModuleUtils.h"		#include "llvm/Transforms/Utils/ModuleUtils.h"
#include <algorithm>		#include <algorithm>
#include <memory>		#include <memory>
#include <string>		#include <string>
#include <utility>		#include <utility>
using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "insert-gcov-profiling"		#define DEBUG_TYPE "insert-gcov-profiling"
... 833 lines not shown ...	Function *GCOVProfiler::insertCounterWriteout(
IRBuilder<> Builder(BB);		IRBuilder<> Builder(BB);

Constant *StartFile = getStartFileFunc();		Constant *StartFile = getStartFileFunc();
Constant *EmitFunction = getEmitFunctionFunc();		Constant *EmitFunction = getEmitFunctionFunc();
Constant *EmitArcs = getEmitArcsFunc();		Constant *EmitArcs = getEmitArcsFunc();
Constant *SummaryInfo = getSummaryInfoFunc();		Constant *SummaryInfo = getSummaryInfoFunc();
Constant *EndFile = getEndFileFunc();		Constant *EndFile = getEndFileFunc();

NamedMDNode *CU_Nodes = M->getNamedMetadata("llvm.dbg.cu");		NamedMDNode *CUNodes = M->getNamedMetadata("llvm.dbg.cu");
if (CU_Nodes) {		if (!CUNodes) {
for (unsigned i = 0, e = CU_Nodes->getNumOperands(); i != e; ++i) {		Builder.CreateRetVoid();
auto *CU = cast<DICompileUnit>(CU_Nodes->getOperand(i));		return WriteoutF;
		}

		// Collect the relevant data into a large constant data structure that we can
		// walk to write out everything.
		StructType *StartFileCallArgsTy = StructType::create(
		{Builder.getInt8PtrTy(), Builder.getInt8PtrTy(), Builder.getInt32Ty()});
		StructType *EmitFunctionCallArgsTy = StructType::create(
		{Builder.getInt32Ty(), Builder.getInt8PtrTy(), Builder.getInt32Ty(),
		Builder.getInt8Ty(), Builder.getInt32Ty()});
		StructType *EmitArcsCallArgsTy = StructType::create(
		{Builder.getInt32Ty(), Builder.getInt64Ty()->getPointerTo()});
		StructType *FileInfoTy =
		StructType::create({StartFileCallArgsTy, Builder.getInt32Ty(),
		EmitFunctionCallArgsTy->getPointerTo(),
		EmitArcsCallArgsTy->getPointerTo()});

		Constant *Zero32 = Builder.getInt32(0);

		SmallVector<Constant *, 8> FileInfos;
		for (int i : llvm::seq<int>(0, CUNodes->getNumOperands())) {
		dblaikie (inline comment, not marked Done): Maybe worth using 'zip' here to iterate the CUNodes (it does support iterators rather than using getOperand(i))? Might be easier, I guess, if llvm::seq supported more defaults (like defaulting to int, starting at zero, and unbounded (no need to specify an end value if the thing you zip it with is constrained anyway))
		chandlerc (author, inline reply): I'd like to do this in a follow-up change if I can? I already did more surgery here than I probably should have. Happy to iterate some on cleaner loops though.
		auto *CU = cast<DICompileUnit>(CUNodes->getOperand(i));

// Skip module skeleton (and module) CUs.		// Skip module skeleton (and module) CUs.
if (CU->getDWOId())		if (CU->getDWOId())
continue;		continue;

std::string FilenameGcda = mangleName(CU, GCovFileType::GCDA);		std::string FilenameGcda = mangleName(CU, GCovFileType::GCDA);
uint32_t CfgChecksum = FileChecksums.empty() ? 0 : FileChecksums[i];		uint32_t CfgChecksum = FileChecksums.empty() ? 0 : FileChecksums[i];
Builder.CreateCall(StartFile,		auto *StartFileCallArgs = ConstantStruct::get(
{Builder.CreateGlobalStringPtr(FilenameGcda),		StartFileCallArgsTy, {Builder.CreateGlobalStringPtr(FilenameGcda),
Builder.CreateGlobalStringPtr(ReversedVersion),		Builder.CreateGlobalStringPtr(ReversedVersion),
Builder.getInt32(CfgChecksum)});		Builder.getInt32(CfgChecksum)});
for (unsigned j = 0, e = CountersBySP.size(); j != e; ++j) {
		SmallVector<Constant *, 8> EmitFunctionCallArgsArray;
		SmallVector<Constant *, 8> EmitArcsCallArgsArray;
		for (int j : llvm::seq<int>(0, CountersBySP.size())) {
auto *SP = cast_or_null<DISubprogram>(CountersBySP[j].second);		auto *SP = cast_or_null<DISubprogram>(CountersBySP[j].second);
uint32_t FuncChecksum = Funcs.empty() ? 0 : Funcs[j]->getFuncChecksum();		uint32_t FuncChecksum = Funcs.empty() ? 0 : Funcs[j]->getFuncChecksum();
Builder.CreateCall(		EmitFunctionCallArgsArray.push_back(ConstantStruct::get(
EmitFunction,		EmitFunctionCallArgsTy,
{Builder.getInt32(j),		{Builder.getInt32(j),
Options.FunctionNamesInData		Options.FunctionNamesInData
? Builder.CreateGlobalStringPtr(getFunctionName(SP))		? Builder.CreateGlobalStringPtr(getFunctionName(SP))
: Constant::getNullValue(Builder.getInt8PtrTy()),		: Constant::getNullValue(Builder.getInt8PtrTy()),
Builder.getInt32(FuncChecksum),		Builder.getInt32(FuncChecksum),
Builder.getInt8(Options.UseCfgChecksum),		Builder.getInt8(Options.UseCfgChecksum),
Builder.getInt32(CfgChecksum)});		Builder.getInt32(CfgChecksum)}));

GlobalVariable *GV = CountersBySP[j].first;		GlobalVariable *GV = CountersBySP[j].first;
unsigned Arcs =		unsigned Arcs = cast<ArrayType>(GV->getValueType())->getNumElements();
cast<ArrayType>(GV->getValueType())->getNumElements();		EmitArcsCallArgsArray.push_back(ConstantStruct::get(
Builder.CreateCall(EmitArcs, {Builder.getInt32(Arcs),		EmitArcsCallArgsTy,
Builder.CreateConstGEP2_64(GV, 0, 0)});		{Builder.getInt32(Arcs),
		ConstantExpr::getInBoundsGetElementPtr(
		GV->getValueType(), GV,
		makeArrayRef<Constant *>({Zero32, Zero32}))}));
		}
		// Create global arrays for the two emit calls.
		int CountersSize = CountersBySP.size();
		assert(CountersSize == (int)EmitFunctionCallArgsArray.size() &&
		"Mismatched array size!");
		assert(CountersSize == (int)EmitArcsCallArgsArray.size() &&
		"Mismatched array size!");
		auto *EmitFunctionCallArgsArrayTy =
		ArrayType::get(EmitFunctionCallArgsTy, CountersSize);
		auto *EmitFunctionCallArgsArrayGV = new GlobalVariable(
		M, EmitFunctionCallArgsArrayTy, /*isConstant*/ true,
		GlobalValue::InternalLinkage,
		ConstantArray::get(EmitFunctionCallArgsArrayTy,
		EmitFunctionCallArgsArray),
		Twine("__llvm_internal_gcov_emit_function_args.") + Twine(i));
		auto *EmitArcsCallArgsArrayTy =
		ArrayType::get(EmitArcsCallArgsTy, CountersSize);
		EmitFunctionCallArgsArrayGV->setUnnamedAddr(
		GlobalValue::UnnamedAddr::Global);
		auto *EmitArcsCallArgsArrayGV = new GlobalVariable(
		M, EmitArcsCallArgsArrayTy, /*isConstant*/ true,
		GlobalValue::InternalLinkage,
		ConstantArray::get(EmitArcsCallArgsArrayTy, EmitArcsCallArgsArray),
		Twine("__llvm_internal_gcov_emit_arcs_args.") + Twine(i));
		dblaikie (inline comment, marked Done): Is this tested? (the test case only seems to have function_args) (also I'm not sure I really understand this bit of the code - it's creating a global variable with a fixed name, but it's doing so inside the loop over all CUs? (would this create duplicate GVs? Do they somehow get renamed to be unique?) Is this code assuming there will only be one non-module-skeleton CU?)
		chandlerc (author, inline reply): I think we always have both? The test only looks at one. I honestly don't understand the contents of this well enough to write a test. I now test the writeout which at least clearly shows that the structure ends up being sound here. Regarding the global variable -- I was assuming they get renamed like SSA values. Their names aren't significant as they don't have linkage. But I can just make the names unique constructively, as that seems cleaner anyways.
		EmitArcsCallArgsArrayGV->setUnnamedAddr(GlobalValue::UnnamedAddr::Global);

		FileInfos.push_back(ConstantStruct::get(
		FileInfoTy,
		{StartFileCallArgs, Builder.getInt32(CountersSize),
		ConstantExpr::getInBoundsGetElementPtr(
		EmitFunctionCallArgsArrayTy, EmitFunctionCallArgsArrayGV,
		makeArrayRef<Constant *>({Zero32, Zero32})),
		ConstantExpr::getInBoundsGetElementPtr(
		EmitArcsCallArgsArrayTy, EmitArcsCallArgsArrayGV,
		makeArrayRef<Constant *>({Zero32, Zero32}))}));
}		}

		// If we didn't find anything to actually emit, bail on out.
		if (FileInfos.empty()) {
		Builder.CreateRetVoid();
		return WriteoutF;
		}

		// To simplify code, we cap the number of file infos we write out to fit
		// easily in a 32-bit signed integer. This gives consistent behavior between
		// 32-bit and 64-bit systems without requiring (potentially very slow) 64-bit
		// operations on 32-bit systems. It also seems unreasonable to try to handle
		// more than 2 billion files.
		if ((int64_t)FileInfos.size() > (int64_t)INT_MAX)
		FileInfos.resize(INT_MAX);

		// Create a global for the entire data structure so we can walk it more
		// easily.
		auto *FileInfoArrayTy = ArrayType::get(FileInfoTy, FileInfos.size());
		auto *FileInfoArrayGV = new GlobalVariable(
		M, FileInfoArrayTy, /*isConstant*/ true, GlobalValue::InternalLinkage,
		ConstantArray::get(FileInfoArrayTy, FileInfos),
		"__llvm_internal_gcov_emit_file_info");
		FileInfoArrayGV->setUnnamedAddr(GlobalValue::UnnamedAddr::Global);

		// Create the CFG for walking this data structure.
		auto *FileLoopHeader =
		BasicBlock::Create(*Ctx, "file.loop.header", WriteoutF);
		auto *CounterLoopHeader =
		BasicBlock::Create(*Ctx, "counter.loop.header", WriteoutF);
		auto *FileLoopLatch = BasicBlock::Create(*Ctx, "file.loop.latch", WriteoutF);
		auto *ExitBB = BasicBlock::Create(*Ctx, "exit", WriteoutF);

		// We always have at least one file, so just branch to the header.
		Builder.CreateBr(FileLoopHeader);

		// The index into the files structure is our loop induction variable.
		Builder.SetInsertPoint(FileLoopHeader);
		PHINode *IV =
		Builder.CreatePHI(Builder.getInt32Ty(), /*NumReservedValues*/ 2);
		IV->addIncoming(Builder.getInt32(0), BB);
		auto *FileInfoPtr =
		Builder.CreateInBoundsGEP(FileInfoArrayGV, {Builder.getInt32(0), IV});
		auto *StartFileCallArgsPtr = Builder.CreateStructGEP(FileInfoPtr, 0);
		Builder.CreateCall(
		StartFile,
		{Builder.CreateLoad(Builder.CreateStructGEP(StartFileCallArgsPtr, 0)),
		Builder.CreateLoad(Builder.CreateStructGEP(StartFileCallArgsPtr, 1)),
		Builder.CreateLoad(Builder.CreateStructGEP(StartFileCallArgsPtr, 2))});
		auto *NumCounters =
		Builder.CreateLoad(Builder.CreateStructGEP(FileInfoPtr, 1));
		auto *EmitFunctionCallArgsArray =
		Builder.CreateLoad(Builder.CreateStructGEP(FileInfoPtr, 2));
		auto *EmitArcsCallArgsArray =
		Builder.CreateLoad(Builder.CreateStructGEP(FileInfoPtr, 3));
		auto *EnterCounterLoopCond =
		Builder.CreateICmpSLT(Builder.getInt32(0), NumCounters);
		Builder.CreateCondBr(EnterCounterLoopCond, CounterLoopHeader, FileLoopLatch);

		Builder.SetInsertPoint(CounterLoopHeader);
		auto *JV = Builder.CreatePHI(Builder.getInt32Ty(), /*NumReservedValues*/ 2);
		JV->addIncoming(Builder.getInt32(0), FileLoopHeader);
		auto *EmitFunctionCallArgsPtr =
		Builder.CreateInBoundsGEP(EmitFunctionCallArgsArray, {JV});
		Builder.CreateCall(
		EmitFunction,
		{Builder.CreateLoad(Builder.CreateStructGEP(EmitFunctionCallArgsPtr, 0)),
		Builder.CreateLoad(Builder.CreateStructGEP(EmitFunctionCallArgsPtr, 1)),
		Builder.CreateLoad(Builder.CreateStructGEP(EmitFunctionCallArgsPtr, 2)),
		Builder.CreateLoad(Builder.CreateStructGEP(EmitFunctionCallArgsPtr, 3)),
		Builder.CreateLoad(
		Builder.CreateStructGEP(EmitFunctionCallArgsPtr, 4))});
		auto *EmitArcsCallArgsPtr =
		Builder.CreateInBoundsGEP(EmitArcsCallArgsArray, {JV});
		Builder.CreateCall(
		EmitArcs,
		{Builder.CreateLoad(Builder.CreateStructGEP(EmitArcsCallArgsPtr, 0)),
		Builder.CreateLoad(Builder.CreateStructGEP(EmitArcsCallArgsPtr, 1))});
		auto *NextJV = Builder.CreateAdd(JV, Builder.getInt32(1));
		auto *CounterLoopCond = Builder.CreateICmpSLT(NextJV, NumCounters);
		Builder.CreateCondBr(CounterLoopCond, CounterLoopHeader, FileLoopLatch);
		JV->addIncoming(NextJV, CounterLoopHeader);

		Builder.SetInsertPoint(FileLoopLatch);
Builder.CreateCall(SummaryInfo, {});		Builder.CreateCall(SummaryInfo, {});
Builder.CreateCall(EndFile, {});		Builder.CreateCall(EndFile, {});
}		auto *NextIV = Builder.CreateAdd(IV, Builder.getInt32(1));
}		auto *FileLoopCond =
		Builder.CreateICmpSLT(NextIV, Builder.getInt32(FileInfos.size()));
		Builder.CreateCondBr(FileLoopCond, FileLoopHeader, ExitBB);
		IV->addIncoming(NextIV, FileLoopLatch);

		Builder.SetInsertPoint(ExitBB);
Builder.CreateRetVoid();		Builder.CreateRetVoid();

return WriteoutF;		return WriteoutF;
}		}

void GCOVProfiler::insertIndirectCounterIncrement() {		void GCOVProfiler::insertIndirectCounterIncrement() {
Function *Fn =		Function *Fn =
cast<Function>(GCOVProfiler::getIncrementIndirectCounterFunc());		cast<Function>(GCOVProfiler::getIncrementIndirectCounterFunc());
Fn->setUnnamedAddr(GlobalValue::UnnamedAddr::Global);		Fn->setUnnamedAddr(GlobalValue::UnnamedAddr::Global);
Fn->setLinkage(GlobalValue::InternalLinkage);		Fn->setLinkage(GlobalValue::InternalLinkage);
... 87 lines not shown ...

llvm/test/Transforms/GCOVProfiling/function-numbering.ll

	... 13 lines not shown ...
	; RUN: llvm-cov gcov -n -dump %t/function-numbering.gcno 2>&1 | FileCheck --check-prefix GCNO %s			; RUN: llvm-cov gcov -n -dump %t/function-numbering.gcno 2>&1 | FileCheck --check-prefix GCNO %s

	target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:o-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-apple-macosx10.10.0"			target triple = "x86_64-apple-macosx10.10.0"

	; GCDA: @[[FOO:[0-9]+]] = private unnamed_addr constant [4 x i8] c"foo\00"			; GCDA: @[[FOO:[0-9]+]] = private unnamed_addr constant [4 x i8] c"foo\00"
	; GCDA-NOT: @{{[0-9]+}} = private unnamed_addr constant .* c"bar\00"			; GCDA-NOT: @{{[0-9]+}} = private unnamed_addr constant .* c"bar\00"
	; GCDA: @[[BAZ:[0-9]+]] = private unnamed_addr constant [4 x i8] c"baz\00"			; GCDA: @[[BAZ:[0-9]+]] = private unnamed_addr constant [4 x i8] c"baz\00"
	; GCDA: define internal void @__llvm_gcov_writeout()			; GCDA: @__llvm_internal_gcov_emit_function_args.0 = internal unnamed_addr constant
	; GCDA: call void @llvm_gcda_emit_function(i32 0, i8* getelementptr inbounds ([4 x i8], [4 x i8]* @[[FOO]]			; GCDA-SAME: { i32 0, i8* getelementptr inbounds ([4 x i8], [4 x i8]* @[[FOO]]
	; GCDA: call void @llvm_gcda_emit_function(i32 1, i8* getelementptr inbounds ([4 x i8], [4 x i8]* @[[BAZ]]			; GCDA-SAME: { i32 1, i8* getelementptr inbounds ([4 x i8], [4 x i8]* @[[BAZ]]
				;
				; GCDA-LABEL: define internal void @__llvm_gcov_writeout() {{.*}} {
				; GCDA-NEXT: entry:
				; GCDA-NEXT: br label %[[FILE_LOOP_HEADER:.*]]
				;
				; GCDA: [[FILE_LOOP_HEADER]]:
				; GCDA-NEXT: %[[IV:.*]] = phi i32 [ 0, %entry ], [ %[[NEXT_IV:.*]], %[[FILE_LOOP_LATCH:.*]] ]
				; GCDA-NEXT: %[[FILE_INFO:.*]] = getelementptr inbounds {{.*}}, {{.*}}* @__llvm_internal_gcov_emit_file_info, i32 0, i32 %[[IV]]
				; GCDA-NEXT: %[[START_FILE_ARGS:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[FILE_INFO]], i32 0, i32 0
				; GCDA-NEXT: %[[START_FILE_ARG_0_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[START_FILE_ARGS]], i32 0, i32 0
				; GCDA-NEXT: %[[START_FILE_ARG_0:.*]] = load i8*, i8** %[[START_FILE_ARG_0_PTR]]
				; GCDA-NEXT: %[[START_FILE_ARG_1_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[START_FILE_ARGS]], i32 0, i32 1
				; GCDA-NEXT: %[[START_FILE_ARG_1:.*]] = load i8*, i8** %[[START_FILE_ARG_1_PTR]]
				; GCDA-NEXT: %[[START_FILE_ARG_2_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[START_FILE_ARGS]], i32 0, i32 2
				; GCDA-NEXT: %[[START_FILE_ARG_2:.*]] = load i32, i32* %[[START_FILE_ARG_2_PTR]]
				; GCDA-NEXT: call void @llvm_gcda_start_file(i8* %[[START_FILE_ARG_0]], i8* %[[START_FILE_ARG_1]], i32 %[[START_FILE_ARG_2]])
				; GCDA-NEXT: %[[NUM_COUNTERS_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[FILE_INFO]], i32 0, i32 1
				; GCDA-NEXT: %[[NUM_COUNTERS:.*]] = load i32, i32* %[[NUM_COUNTERS_PTR]]
				; GCDA-NEXT: %[[EMIT_FUN_ARGS_ARRAY_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[FILE_INFO]], i32 0, i32 2
				; GCDA-NEXT: %[[EMIT_FUN_ARGS_ARRAY:.*]] = load {{.*}}*, {{.*}}** %[[EMIT_FUN_ARGS_ARRAY_PTR]]
				; GCDA-NEXT: %[[EMIT_ARCS_ARGS_ARRAY_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[FILE_INFO]], i32 0, i32 3
				; GCDA-NEXT: %[[EMIT_ARCS_ARGS_ARRAY:.*]] = load {{.*}}*, {{.*}}** %[[EMIT_ARCS_ARGS_ARRAY_PTR]]
				; GCDA-NEXT: %[[ENTER_COUNTER_LOOP_COND:.*]] = icmp slt i32 0, %[[NUM_COUNTERS]]
				; GCDA-NEXT: br i1 %[[ENTER_COUNTER_LOOP_COND]], label %[[COUNTER_LOOP:.*]], label %[[FILE_LOOP_LATCH]]
				;
				; GCDA: [[COUNTER_LOOP]]:
				; GCDA-NEXT: %[[JV:.*]] = phi i32 [ 0, %[[FILE_LOOP_HEADER]] ], [ %[[NEXT_JV:.*]], %[[COUNTER_LOOP]] ]
				; GCDA-NEXT: %[[EMIT_FUN_ARGS:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_FUN_ARGS_ARRAY]], i32 %[[JV]]
				; GCDA-NEXT: %[[EMIT_FUN_ARG_0_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_FUN_ARGS]], i32 0, i32 0
				; GCDA-NEXT: %[[EMIT_FUN_ARG_0:.*]] = load i32, i32* %[[EMIT_FUN_ARG_0_PTR]]
				; GCDA-NEXT: %[[EMIT_FUN_ARG_1_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_FUN_ARGS]], i32 0, i32 1
				; GCDA-NEXT: %[[EMIT_FUN_ARG_1:.*]] = load i8*, i8** %[[EMIT_FUN_ARG_1_PTR]]
				; GCDA-NEXT: %[[EMIT_FUN_ARG_2_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_FUN_ARGS]], i32 0, i32 2
				; GCDA-NEXT: %[[EMIT_FUN_ARG_2:.*]] = load i32, i32* %[[EMIT_FUN_ARG_2_PTR]]
				; GCDA-NEXT: %[[EMIT_FUN_ARG_3_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_FUN_ARGS]], i32 0, i32 3
				; GCDA-NEXT: %[[EMIT_FUN_ARG_3:.*]] = load i8, i8* %[[EMIT_FUN_ARG_3_PTR]]
				; GCDA-NEXT: %[[EMIT_FUN_ARG_4_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_FUN_ARGS]], i32 0, i32 4
				; GCDA-NEXT: %[[EMIT_FUN_ARG_4:.*]] = load i32, i32* %[[EMIT_FUN_ARG_4_PTR]]
				; GCDA-NEXT: call void @llvm_gcda_emit_function(i32 %[[EMIT_FUN_ARG_0]],
				; GCDA-SAME: i8* %[[EMIT_FUN_ARG_1]],
				; GCDA-SAME: i32 %[[EMIT_FUN_ARG_2]],
				; GCDA-SAME: i8 %[[EMIT_FUN_ARG_3]],
				; GCDA-SAME: i32 %[[EMIT_FUN_ARG_4]])
				; GCDA-NEXT: %[[EMIT_ARCS_ARGS:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_ARCS_ARGS_ARRAY]], i32 %[[JV]]
				; GCDA-NEXT: %[[EMIT_ARCS_ARG_0_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_ARCS_ARGS]], i32 0, i32 0
				; GCDA-NEXT: %[[EMIT_ARCS_ARG_0:.*]] = load i32, i32* %[[EMIT_ARCS_ARG_0_PTR]]
				; GCDA-NEXT: %[[EMIT_ARCS_ARG_1_PTR:.*]] = getelementptr inbounds {{.*}}, {{.*}}* %[[EMIT_ARCS_ARGS]], i32 0, i32 1
				; GCDA-NEXT: %[[EMIT_ARCS_ARG_1:.*]] = load i64*, i64** %[[EMIT_ARCS_ARG_1_PTR]]
				; GCDA-NEXT: call void @llvm_gcda_emit_arcs(i32 %[[EMIT_ARCS_ARG_0]],
				; GCDA-SAME: i64* %[[EMIT_ARCS_ARG_1]])
				; GCDA-NEXT: %[[NEXT_JV]] = add i32 %[[JV]], 1
				; GCDA-NEXT: %[[COUNTER_LOOP_COND:.*]] = icmp slt i32 %[[NEXT_JV]], %[[NUM_COUNTERS]]
				; GCDA-NEXT: br i1 %[[COUNTER_LOOP_COND]], label %[[COUNTER_LOOP]], label %[[FILE_LOOP_LATCH]]
				;
				; GCDA: [[FILE_LOOP_LATCH]]:
				; GCDA-NEXT: call void @llvm_gcda_summary_info()
				; GCDA-NEXT: call void @llvm_gcda_end_file()
				; GCDA-NEXT: %[[NEXT_IV]] = add i32 %[[IV]], 1
				; GCDA-NEXT: %[[FILE_LOOP_COND:.*]] = icmp slt i32 %[[NEXT_IV]], 1
				; GCDA-NEXT: br i1 %[[FILE_LOOP_COND]], label %[[FILE_LOOP_HEADER]], label %[[EXIT:.*]]
				;
				; GCDA: [[EXIT]]:
				; GCDA-NEXT: ret void

				dblaikie (inline comment, marked Done): I'd expect this test should probably test the contents of the gcov_writeout function? Nothing else appears to be testing that code? (given that you changed it significantly & no other tests need updating)
				chandlerc (author, inline reply): Yeah, I was trying to avoid adding large amounts of new testing but the testing seems super bad here... I've added a full test of the writeout function now.
	; GCNO: == foo (0) @			; GCNO: == foo (0) @
	; GCNO-NOT: == bar ({{[0-9]+}}) @			; GCNO-NOT: == bar ({{[0-9]+}}) @
	; GCNO: == baz (1) @			; GCNO: == baz (1) @

	define void @foo() !dbg !4 {			define void @foo() !dbg !4 {
	ret void, !dbg !12			ret void, !dbg !12
	}			}

	... 27 lines not shown ...