This is an archive of the discontinued LLVM Phabricator instance.


Differential D141446

[llvm-profdata] Add option to cap profile output size
Closed, Public

Authored by huangjd on Jan 10 2023, 4:45 PM.


Details

Reviewers

davidxl
xur
kazu
ellis
gulfem
snehasish

Commits

rG79971d0d771a: [llvm-profdata] Add option to cap profile output size
rG48f163b889a8: [llvm-profdata] Add option to cap profile output size
rGc268f850a299: Fix to D139603(reverted) - moved size check to unit test so that it is cross…

Summary

D139603 (add option to llvm-profdata to reduce output profile size) contains test cases that are not cross-platform. This revision moves those tests into a unit test and makes sure the feature is callable from the LLVM library.

Diff Detail

Repository: rG LLVM Github Monorepo

Unit Tests: Failed

Time	Test
60,190 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/non-overloaded::vloxseg.c
60,220 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/non-overloaded::vluxseg.c
60,200 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/overloaded::vloxseg.c
60,210 ms	x64 debian > Clang.CodeGen/RISCV/rvv-intrinsics-autogenerated/policy/overloaded::vluxseg.c
10 ms	x64 debian > LLVM-Unit.tools/llvm-profdata/_/LLVMProfdataTests/TestOutputSizeLimit::TestOutputSizeLimitBinary

Full test results: 7 Failed

Event Timeline

huangjd created this revision. Jan 10 2023, 4:45 PM

Herald added a project: Restricted Project. Jan 10 2023, 4:45 PM

Herald added a subscriber: hiraditya.

huangjd requested review of this revision. Jan 10 2023, 4:45 PM

Herald added a project: Restricted Project. Jan 10 2023, 4:45 PM

Herald added a subscriber: llvm-commits.

lgtm with some comments.

llvm/lib/ProfileData/SampleProfWriter.cpp
100	This will become unused in non-DEBUG build, maybe add the workaround here https://reviews.llvm.org/rG9f4a9d3f44501fa755eb71fe855e15cf0e59e8b8 Or wrap this in LLVM_DEBUG too. Also IterationCount below as mentioned in this comment on the previous revision: https://reviews.llvm.org/D139603#inline-1365026
llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp
42	I think we should replace report_fatal_error with std::error_code EC = ReaderOrErr.getError(); ASSERT_FALSE(EC) << EC.message().c_str(); to let gtest handle the failure cleanly.
70	Perhaps TestWriteWithSizeLimit instead of TestOutputSizeLimit1 is a little more informative?
72	This should probably be EXPECT_LE. The rationale is explained here https://stackoverflow.com/a/2565309

This revision is now accepted and ready to land. Jan 10 2023, 6:01 PM

Harbormaster completed remote builds in B206965: Diff 488034. Jan 10 2023, 10:09 PM

Updating D141446: Fix to D139603(reverted) - moved size check to unit test so that it is cross-platform

Harbormaster completed remote builds in B207230: Diff 488407. Jan 11 2023, 3:58 PM

Updating D141446: Fix to D139603(reverted) - moved size check to unit test so that it is cross-platform

This revision was landed with ongoing or failed builds. Jan 11 2023, 4:42 PM

Closed by commit rGc268f850a299: Fix to D139603(reverted) - moved size check to unit test so that it is cross… (authored by huangjd).

This revision was automatically updated to reflect the committed changes.

huangjd added a commit: rGc268f850a299: Fix to D139603(reverted) - moved size check to unit test so that it is cross….

Harbormaster completed remote builds in B207232: Diff 488409. Jan 11 2023, 6:22 PM

Hi @huangjd ,

check-llvm fails with the following linker errors:

FAILED: unittests/tools/llvm-profdata/LLVMProfdataTests 
: && /usr/bin/c++ -fPIC -fno-semantic-interposition -fvisibility-inlines-hidden -Werror=date-time -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wno-missing-field-initializers -pedantic -Wno-long-long -Wimplicit-fallthrough -Wno-maybe-uninitialized -Wno-class-memaccess -Wno-redundant-move -Wno-pessimizing-move -Wno-noexcept-type -Wdelete-non-virtual-dtor -Wsuggest-override -Wno-comment -Wno-misleading-indentation -Wctad-maybe-unsupported -fdiagnostics-color -ffunction-sections -fdata-sections -O3 -DNDEBUG -fuse-ld=gold     -Wl,--gc-sections unittests/tools/llvm-profdata/CMakeFiles/LLVMProfdataTests.dir/OutputSizeLimitTest.cpp.o -o unittests/tools/llvm-profdata/LLVMProfdataTests  -Wl,-rpath,/home/buildbot/worker/as-builder-7/llvm-nvptx-nvidia-ubuntu/build/lib  lib/libLLVMProfileData.so.16git  lib/libllvm_gtest_main.so.16git  lib/libLLVMTestingSupport.so.16git  lib/libllvm_gtest.so.16git  lib/libLLVMSupport.so.16git  -Wl,-rpath-link,/home/buildbot/worker/as-builder-7/llvm-nvptx-nvidia-ubuntu/build/lib && :
unittests/tools/llvm-profdata/CMakeFiles/LLVMProfdataTests.dir/OutputSizeLimitTest.cpp.o:OutputSizeLimitTest.cpp:function TestOutputSizeLimit_TestOutputSizeLimit1_Test::TestBody() [clone .localalias]: error: undefined reference to 'llvm::LLVMContext::LLVMContext()'
unittests/tools/llvm-profdata/CMakeFiles/LLVMProfdataTests.dir/OutputSizeLimitTest.cpp.o:OutputSizeLimitTest.cpp:function TestOutputSizeLimit_TestOutputSizeLimit1_Test::TestBody() [clone .localalias]: error: undefined reference to 'llvm::LLVMContext::~LLVMContext()'
collect2: error: ld returned 1 exit status

https://lab.llvm.org/buildbot/#/builders/234/builds/2655

Would you take a look?

This unit-test based test also fails on non-linux, see e.g. http://45.33.8.238/macm1/52570/step_11.txt

I think that's because writeWithSizeLimitInternal calls Strategy->Erase(StringBuffer.size()); and that erases an element from an unordered_map, and it depends on the C++ standard library if an "important" function name gets removed. Depending on that, the error is either too_large (which the test expects) or truncated_name_table (which it doesn't).

Please take a look and revert for now if it takes a while to fix. If you do end up reverting, it'd be cool if you could revert 6be251352e6b4d9708a1b7b7b146ea199342de22 in the same commit (and reland it when you reland).
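As an illustration of the ordering concern raised above, here is a minimal sketch of pruning that does not depend on hash-table iteration order. It assumes a toy map from function name to total sample count; `ToyProfileMap` and `pruneColdest` are hypothetical names, not LLVM APIs, and this is not necessarily the fix that later landed.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <string>
#include <unordered_map>
#include <utility>
#include <vector>

// Hypothetical stand-in for a profile map: function name -> total samples.
using ToyProfileMap = std::unordered_map<std::string, uint64_t>;

// Erase the NumToRemove coldest functions. Ties on sample count are broken by
// name, so the outcome is the same for any unordered_map implementation.
inline void pruneColdest(ToyProfileMap &Map, size_t NumToRemove) {
  std::vector<std::pair<std::string, uint64_t>> Sorted(Map.begin(), Map.end());
  std::sort(Sorted.begin(), Sorted.end(),
            [](const std::pair<std::string, uint64_t> &A,
               const std::pair<std::string, uint64_t> &B) {
              if (A.second != B.second)
                return A.second < B.second; // coldest first
              return A.first < B.first;     // deterministic tie-break
            });
  // Never try to remove more entries than exist.
  NumToRemove = std::min(NumToRemove, Sorted.size());
  for (size_t I = 0; I < NumToRemove; ++I)
    Map.erase(Sorted[I].first);
}
```

With two functions tied at the lowest count, the one whose name sorts first is always the one removed, whatever order the container happens to iterate in.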

llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp
38	https://github.com/google/googletest/blob/main/docs/advanced.md#propagating-fatal-failures FYI

We are also seeing an assertion failure in the added unit test on our internal Windows builder similar to what this bot is hitting:

https://lab.llvm.org/buildbot/#/builders/231/builds/7174

[ RUN      ] TestOutputSizeLimit.TestOutputSizeLimit1
LLVMProfdataTests: /home/buildbots/ppc64be-clang-test-suite/clang-ppc64be-test-suite/llvm-project/llvm/lib/ProfileData/SampleProfWriter.cpp:85: virtual void llvm::sampleprof::DefaultFunctionPruningStrategy::Erase(size_t): Assertion `NumToRemove <= SortedFunctions.size()' failed.
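For context, the assertion in this log sits in the default pruning strategy's Erase(). The following is a minimal sketch of the estimate it computes, modelled on the Erase() body shown in this revision's diff. `estimateNumToRemove` is a hypothetical free function, and the clamp is an addition of this sketch rather than part of the diff. It assumes FunctionCount >= 1 and CurrentOutputSize > 0.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>

// Estimate how many functions to drop so that the next write is likely to fit.
// The target is to keep about FunctionCount * (Limit / CurrentSize)^2 entries.
inline size_t estimateNumToRemove(size_t FunctionCount, size_t OutputSizeLimit,
                                  size_t CurrentOutputSize) {
  double D = static_cast<double>(OutputSizeLimit) / CurrentOutputSize;
  size_t NewSize = static_cast<size_t>(std::round(FunctionCount * D * D));
  // Clamp (sketch-only safeguard) so the subtraction below cannot wrap around.
  if (NewSize > FunctionCount)
    NewSize = FunctionCount;
  size_t NumToRemove = FunctionCount - NewSize;
  // Always make progress: remove at least one function per iteration.
  return NumToRemove < 1 ? 1 : NumToRemove;
}
```

For example, with 1000 functions and an output twice the limit, D is 0.5, the target is 250 functions, and the sketch asks for 750 removals.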

This bot is still broken https://lab.llvm.org/buildbot/#/builders/168/builds/11266

vitalybuka added a reverting change: rGc37694817a59: Revert "Fix to D139603(reverted) - moved size check to unit test so that it is…. Jan 11 2023, 11:25 PM

vitalybuka reopened this revision. Jan 11 2023, 11:26 PM

This revision is now accepted and ready to land. Jan 11 2023, 11:26 PM

vitalybuka mentioned this in rG55e69d1f16a4: Revert "[gn] port c268f850a299". Jan 11 2023, 11:31 PM

Updated unit test

Harbormaster completed remote builds in B208376: Diff 489997. Jan 17 2023, 6:49 PM

I have tested the latest changes on llvm-nvptx-nvidia-ubuntu (https://lab.llvm.org/buildbot/#/builders/234) builder locally and the unit tests get built and run successfully.
Thank you @huangjd.

Updated the unit test; it could previously crash on macOS

Harbormaster completed remote builds in B209966: Diff 492241. Jan 25 2023, 3:24 PM

Refactored to fix bugs on non-Linux platforms

Moved all the tests into the unit test because:

Output size limit doesn't require the preservation of specific functions if a different pruning strategy is used in the future, so CHECK-NEXT is too restrictive. In the unit test, each rewritten sample is checked against the original sample to confirm that all its fields are identical.

Windows uses '\r\n' line endings, so the behavior of the output size limit in text mode is different (more functions are pruned than on Linux for the same limit because of the extra character per line). This breaks the lit test.
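To make the line-ending point concrete, here is a small sketch of how the on-disk size of a text profile can differ from its in-memory size when every "\n" is written as "\r\n". `countNewlines` and `sizeWithCRLF` are hypothetical helpers for illustration only. The revision itself tracks a line count inside the writer rather than rescanning the buffer.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Count the '\n' characters in an in-memory text buffer.
inline size_t countNewlines(const std::string &Buffer) {
  size_t N = 0;
  for (char C : Buffer)
    if (C == '\n')
      ++N;
  return N;
}

// Estimated size on disk if each "\n" is translated to "\r\n" when written:
// one extra byte per line.
inline size_t sizeWithCRLF(const std::string &Buffer) {
  return Buffer.size() + countNewlines(Buffer);
}
```

A three-line profile therefore needs three more bytes on disk under this translation, which can push an output that fits the limit in memory over the limit once written.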

Write once for many tests with repetitive output

Harbormaster completed remote builds in B211059: Diff 493715. Jan 31 2023, 2:46 PM

cleanup

use llvm::unittest::TempFile instead

Harbormaster completed remote builds in B211106: Diff 493781. Jan 31 2023, 7:20 PM

@snehasish Please re-review since I introduced a major change to the code.

lgtm

llvm/lib/ProfileData/SampleProfWriter.cpp
111	Is this leaking the raw_svector_ostream objects? Can we rewrite this as the following? raw_svector_ostream OS(StringBuffer); OutputStream.reset(&OS); Also, it might be worth using a separate OutputStream object within the loop scope so that we don't have to worry about the swap before and after and the lifetimes of the raw_svector_ostream objects.
llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp
71	I think these parameter names should be capitalized per the coding standards: https://llvm.org/docs/CodingStandards.html Also consider moving this to the implementation of FunctionSamples, since it seems generally useful to have operator== implemented?
173	VAR_RETURN_IF_ERROR can be used here?
197	EXPECT_THAT_EXPECTED is probably better to continue with other test cases rather than aborting the test on the first failure?

huangjd added inline comments. Feb 6 2023, 5:07 PM

llvm/lib/ProfileData/SampleProfWriter.cpp
111	unique_ptr.reset(new obj) is the correct usage. https://en.cppreference.com/w/cpp/memory/unique_ptr/reset

snehasish added inline comments. Feb 7 2023, 9:12 AM

llvm/lib/ProfileData/SampleProfWriter.cpp
111	Thanks, knowing the type makes it clear! Perhaps that is additional motivation for a separate, local OutputStream var?
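The ownership point in this exchange can be checked with a small sketch. `CountingStream` is a hypothetical type that counts live instances and stands in for raw_svector_ostream. The loop mirrors the reset(new ...) pattern in the diff and shows that each reset destroys the previously owned object.

```cpp
#include <cassert>
#include <memory>

// Hypothetical stream type that tracks how many instances are alive.
struct CountingStream {
  static int &live() {
    static int N = 0;
    return N;
  }
  CountingStream() { ++live(); }
  ~CountingStream() { --live(); }
  CountingStream(const CountingStream &) = delete;
  CountingStream &operator=(const CountingStream &) = delete;
};

// Re-create the owned stream on every iteration and report the largest number
// of live objects observed after each reset.
inline int maxLiveAfterReset(int Iterations) {
  std::unique_ptr<CountingStream> Stream;
  int MaxLive = 0;
  for (int I = 0; I < Iterations; ++I) {
    Stream.reset(new CountingStream()); // deletes the previous object, if any
    if (CountingStream::live() > MaxLive)
      MaxLive = CountingStream::live();
  }
  return MaxLive; // Stream's destructor releases the last object on return
}
```

By contrast, calling reset(&OS) on a stack-allocated object would make the unique_ptr later call delete on memory that was never allocated with new, which is undefined behavior.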

huangjd added inline comments. Feb 7 2023, 1:42 PM

llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp
197	The callee returns an error only if there's something wrong with I/O or profile reading, which is not expected to happen at all, so ASSERT is used properly here. This test checks the correctness of the sample map after size reduction, which is checked with EXPECT_EQ

Cleanup unit test

Harbormaster completed remote builds in B212495: Diff 495678. Feb 7 2023, 6:10 PM

This revision was landed with ongoing or failed builds. Feb 7 2023, 6:19 PM

Closed by commit rG48f163b889a8: [llvm-profdata] Add option to cap profile output size (authored by huangjd).

This revision was automatically updated to reflect the committed changes.

huangjd added a commit: rG48f163b889a8: [llvm-profdata] Add option to cap profile output size.

huangjd added a reverting change: rG981218e0f88c: Revert "[llvm-profdata] Add option to cap profile output size". Feb 7 2023, 6:30 PM

huangjd reopened this revision. Feb 7 2023, 7:01 PM

This revision is now accepted and ready to land. Feb 7 2023, 7:01 PM

Updated tests because of an API change from main

Harbormaster completed remote builds in B212519: Diff 495711. Feb 7 2023, 7:57 PM

This revision was landed with ongoing or failed builds. Feb 8 2023, 2:22 PM

Closed by commit rG79971d0d771a: [llvm-profdata] Add option to cap profile output size (authored by huangjd).

This revision was automatically updated to reflect the committed changes.

huangjd added a commit: rG79971d0d771a: [llvm-profdata] Add option to cap profile output size.

MaskRay mentioned this in rG4415e2c66ae3: [CMake] Fix -DBUILD_SHARED_LIBS=on builds after D141446. Feb 8 2023, 3:52 PM

Revision Contents

Path	Size
llvm/include/llvm/ProfileData/SampleProfWriter.h	71 lines
llvm/lib/ProfileData/SampleProfWriter.cpp	152 lines
llvm/tools/llvm-profdata/llvm-profdata.cpp	15 lines
llvm/unittests/tools/CMakeLists.txt	1 line
llvm/unittests/tools/llvm-profdata/CMakeLists.txt	12 lines
llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp	222 lines

Diff 493781

llvm/include/llvm/ProfileData/SampleProfWriter.h

[29 lines hidden]	enum SectionLayout {
DefaultLayout,		DefaultLayout,
// The layout splits profile with context information from profile without		// The layout splits profile with context information from profile without
// context information. When Thinlto is enabled, ThinLTO postlink phase only		// context information. When Thinlto is enabled, ThinLTO postlink phase only
// has to load profile with context information and can skip the other part.		// has to load profile with context information and can skip the other part.
CtxSplitLayout,		CtxSplitLayout,
NumOfLayout,		NumOfLayout,
};		};

		/// When writing a profile with size limit, user may want to use a different
		/// strategy to reduce function count other than dropping functions with fewest
		/// samples first. In this case a class implementing the same interfaces should
		/// be provided to SampleProfileWriter::writeWithSizeLimit().
		class FunctionPruningStrategy {
		protected:
		SampleProfileMap &ProfileMap;
		size_t OutputSizeLimit;

		public:
		/// \p ProfileMap A reference to the original profile map. It will be modified
		/// by Erase().
		/// \p OutputSizeLimit Size limit in bytes of the output profile. This is
		/// necessary to estimate how many functions to remove.
		FunctionPruningStrategy(SampleProfileMap &ProfileMap, size_t OutputSizeLimit)
		: ProfileMap(ProfileMap), OutputSizeLimit(OutputSizeLimit) {}

		virtual ~FunctionPruningStrategy() = default;

		/// SampleProfileWriter::writeWithSizeLimit() calls this after every write
		/// iteration if the output size still exceeds the limit. This function
		/// should erase some functions from the profile map so that the writer tries
		/// to write the profile again with fewer functions. At least 1 entry from the
		/// profile map must be erased.
		///
		/// \p CurrentOutputSize Number of bytes in the output if current profile map
		/// is written.
		virtual void Erase(size_t CurrentOutputSize) = 0;
		};

		class DefaultFunctionPruningStrategy : public FunctionPruningStrategy {
		std::vector<NameFunctionSamples> SortedFunctions;

		public:
		DefaultFunctionPruningStrategy(SampleProfileMap &ProfileMap,
		size_t OutputSizeLimit);

		/// In this default implementation, functions with fewest samples are dropped
		/// first. Since the exact size of the output cannot be easily calculated due
		/// to compression, we use a heuristic to remove as many functions as
		/// necessary but not too many, aiming to minimize the number of write
		/// iterations.
		/// Empirically, functions with larger total sample count contain linearly
		/// more sample entries, meaning it takes linearly more space to write them.
		/// The cumulative length is therefore quadratic if all functions are sorted
		/// by total sample count.
		/// TODO: Find better heuristic.
		void Erase(size_t CurrentOutputSize) override;
		};

/// Sample-based profile writer. Base class.		/// Sample-based profile writer. Base class.
class SampleProfileWriter {		class SampleProfileWriter {
public:		public:
virtual ~SampleProfileWriter() = default;		virtual ~SampleProfileWriter() = default;

/// Write sample profiles in \p S.		/// Write sample profiles in \p S.
///		///
/// \returns status code of the file update operation.		/// \returns status code of the file update operation.
virtual std::error_code writeSample(const FunctionSamples &S) = 0;		virtual std::error_code writeSample(const FunctionSamples &S) = 0;

/// Write all the sample profiles in the given map of samples.		/// Write all the sample profiles in the given map of samples.
///		///
/// \returns status code of the file update operation.		/// \returns status code of the file update operation.
virtual std::error_code write(const SampleProfileMap &ProfileMap);		virtual std::error_code write(const SampleProfileMap &ProfileMap);

		/// Write sample profiles up to given size limit, using the pruning strategy
		/// to drop some functions if necessary.
		///
		/// \returns status code of the file update operation.
		template <typename FunctionPruningStrategy = DefaultFunctionPruningStrategy>
		std::error_code writeWithSizeLimit(SampleProfileMap &ProfileMap,
		size_t OutputSizeLimit) {
		FunctionPruningStrategy Strategy(ProfileMap, OutputSizeLimit);
		return writeWithSizeLimitInternal(ProfileMap, OutputSizeLimit, &Strategy);
		}

raw_ostream &getOutputStream() { return *OutputStream; }		raw_ostream &getOutputStream() { return *OutputStream; }

/// Profile writer factory.		/// Profile writer factory.
///		///
/// Create a new file writer based on the value of \p Format.		/// Create a new file writer based on the value of \p Format.
static ErrorOr<std::unique_ptr<SampleProfileWriter>>		static ErrorOr<std::unique_ptr<SampleProfileWriter>>
create(StringRef Filename, SampleProfileFormat Format);		create(StringRef Filename, SampleProfileFormat Format);

[13 lines hidden]	SampleProfileWriter(std::unique_ptr<raw_ostream> &OS)
: OutputStream(std::move(OS)) {}		: OutputStream(std::move(OS)) {}

/// Write a file header for the profile file.		/// Write a file header for the profile file.
virtual std::error_code writeHeader(const SampleProfileMap &ProfileMap) = 0;		virtual std::error_code writeHeader(const SampleProfileMap &ProfileMap) = 0;

// Write function profiles to the profile file.		// Write function profiles to the profile file.
virtual std::error_code writeFuncProfiles(const SampleProfileMap &ProfileMap);		virtual std::error_code writeFuncProfiles(const SampleProfileMap &ProfileMap);

		std::error_code writeWithSizeLimitInternal(SampleProfileMap &ProfileMap,
		size_t OutputSizeLimit,
		FunctionPruningStrategy *Strategy);

		/// For writeWithSizeLimit in text mode, each newline takes 1 additional byte
		/// on Windows when actually written to the file, but not written to a memory
		/// buffer. This needs to be accounted for when rewriting the profile.
		size_t LineCount;

/// Output stream where to emit the profile to.		/// Output stream where to emit the profile to.
std::unique_ptr<raw_ostream> OutputStream;		std::unique_ptr<raw_ostream> OutputStream;

/// Profile summary.		/// Profile summary.
std::unique_ptr<ProfileSummary> Summary;		std::unique_ptr<ProfileSummary> Summary;

/// Compute summary for this profile.		/// Compute summary for this profile.
void computeSummary(const SampleProfileMap &ProfileMap);		void computeSummary(const SampleProfileMap &ProfileMap);

/// Profile format.		/// Profile format.
SampleProfileFormat Format = SPF_None;		SampleProfileFormat Format = SPF_None;
};		};

/// Sample-based profile writer (text format).		/// Sample-based profile writer (text format).
class SampleProfileWriterText : public SampleProfileWriter {		class SampleProfileWriterText : public SampleProfileWriter {
public:		public:
std::error_code writeSample(const FunctionSamples &S) override;		std::error_code writeSample(const FunctionSamples &S) override;

protected:		protected:
SampleProfileWriterText(std::unique_ptr<raw_ostream> &OS)		SampleProfileWriterText(std::unique_ptr<raw_ostream> &OS)
: SampleProfileWriter(OS), Indent(0) {}		: SampleProfileWriter(OS), Indent(0) {}

std::error_code writeHeader(const SampleProfileMap &ProfileMap) override {		std::error_code writeHeader(const SampleProfileMap &ProfileMap) override {
		LineCount = 0;
return sampleprof_error::success;		return sampleprof_error::success;
}		}

private:		private:
/// Indent level to use when writing.		/// Indent level to use when writing.
///		///
/// This is used when printing inlined callees.		/// This is used when printing inlined callees.
unsigned Indent;		unsigned Indent;
[286 lines hidden]

llvm/lib/ProfileData/SampleProfWriter.cpp

[24 lines hidden]
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"
#include "llvm/Support/EndianStream.h"		#include "llvm/Support/EndianStream.h"
#include "llvm/Support/ErrorOr.h"		#include "llvm/Support/ErrorOr.h"
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/LEB128.h"		#include "llvm/Support/LEB128.h"
#include "llvm/Support/MD5.h"		#include "llvm/Support/MD5.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include <algorithm>		#include <algorithm>
		#include <cmath>
#include <cstdint>		#include <cstdint>
#include <memory>		#include <memory>
#include <set>		#include <set>
#include <system_error>		#include <system_error>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

		#define DEBUG_TYPE "llvm-profdata"

using namespace llvm;		using namespace llvm;
using namespace sampleprof;		using namespace sampleprof;

		namespace llvm {
		namespace support {
		namespace endian {
		namespace {

		// Adapter class to llvm::support::endian::Writer for pwrite().
		struct SeekableWriter {
		raw_pwrite_stream &OS;
		endianness Endian;
		SeekableWriter(raw_pwrite_stream &OS, endianness Endian)
		: OS(OS), Endian(Endian) {}

		template <typename ValueType>
		void pwrite(ValueType Val, size_t Offset) {
		std::string StringBuf;
		raw_string_ostream SStream(StringBuf);
		Writer(SStream, Endian).write(Val);
		OS.pwrite(StringBuf.data(), StringBuf.size(), Offset);
		}
		};

		} // namespace
		} // namespace endian
		} // namespace support
		} // namespace llvm

		DefaultFunctionPruningStrategy::DefaultFunctionPruningStrategy(
		SampleProfileMap &ProfileMap, size_t OutputSizeLimit)
		: FunctionPruningStrategy(ProfileMap, OutputSizeLimit) {
		sortFuncProfiles(ProfileMap, SortedFunctions);
		}

		void DefaultFunctionPruningStrategy::Erase(size_t CurrentOutputSize) {
		double D = (double)OutputSizeLimit / CurrentOutputSize;
		size_t NewSize = (size_t)round(ProfileMap.size() * D * D);
		size_t NumToRemove = ProfileMap.size() - NewSize;
		if (NumToRemove < 1)
		NumToRemove = 1;

		assert(NumToRemove <= SortedFunctions.size());
		llvm::for_each(
		llvm::make_range(SortedFunctions.begin() + SortedFunctions.size() -
		NumToRemove,
		SortedFunctions.end()),
		[&](const NameFunctionSamples &E) { ProfileMap.erase(E.first); });
		SortedFunctions.resize(SortedFunctions.size() - NumToRemove);
		}

		std::error_code SampleProfileWriter::writeWithSizeLimitInternal(
		SampleProfileMap &ProfileMap, size_t OutputSizeLimit,
		FunctionPruningStrategy *Strategy) {
		if (OutputSizeLimit == 0)
		return write(ProfileMap);

		size_t OriginalFunctionCount = ProfileMap.size();

		std::unique_ptr<raw_ostream> OriginalOutputStream;
		OutputStream.swap(OriginalOutputStream);

		size_t IterationCount = 0;
		size_t TotalSize;

		SmallVector<char> StringBuffer;
		do {
		StringBuffer.clear();
		OutputStream.reset(new raw_svector_ostream(StringBuffer));
		if (std::error_code EC = write(ProfileMap))
		return EC;

		TotalSize = StringBuffer.size();
		// On Windows every "\n" is actually written as "\r\n" to disk but not to
		// memory buffer, this difference should be added when considering the total
		// output size.
		#ifdef _WIN32
		if (Format == SPF_Text)
		TotalSize += LineCount;
		#endif
		if (TotalSize <= OutputSizeLimit)
		break;

		Strategy->Erase(TotalSize);
		IterationCount++;
		} while (ProfileMap.size() != 0);

		if (ProfileMap.size() == 0)
		return sampleprof_error::too_large;

		OutputStream.swap(OriginalOutputStream);
		OutputStream->write(StringBuffer.data(), StringBuffer.size());
		LLVM_DEBUG(dbgs() << "Profile originally has " << OriginalFunctionCount
		<< " functions, reduced to " << ProfileMap.size() << " in "
		<< IterationCount << " iterations\n");
		// Silence warning on Release build.
		(void)OriginalFunctionCount;
		(void)IterationCount;
		return sampleprof_error::success;
		}

std::error_code		std::error_code
SampleProfileWriter::writeFuncProfiles(const SampleProfileMap &ProfileMap) {		SampleProfileWriter::writeFuncProfiles(const SampleProfileMap &ProfileMap) {
std::vector<NameFunctionSamples> V;		std::vector<NameFunctionSamples> V;
sortFuncProfiles(ProfileMap, V);		sortFuncProfiles(ProfileMap, V);
for (const auto &I : V) {		for (const auto &I : V) {
if (std::error_code EC = writeSample(*I.second))		if (std::error_code EC = writeSample(*I.second))
return EC;		return EC;
}		}
[60 lines hidden]	std::error_code SampleProfileWriterExtBinaryBase::addNewSection(
}		}
SecHdrTable.push_back({Type, Entry.Flags, SectionStart - FileStart,		SecHdrTable.push_back({Type, Entry.Flags, SectionStart - FileStart,
OutputStream->tell() - SectionStart, LayoutIdx});		OutputStream->tell() - SectionStart, LayoutIdx});
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code		std::error_code
SampleProfileWriterExtBinaryBase::write(const SampleProfileMap &ProfileMap) {		SampleProfileWriterExtBinaryBase::write(const SampleProfileMap &ProfileMap) {
		// When calling write on a different profile map, existing states should be
		// cleared.
		NameTable.clear();
		CSNameTable.clear();
		SecHdrTable.clear();

if (std::error_code EC = writeHeader(ProfileMap))		if (std::error_code EC = writeHeader(ProfileMap))
return EC;		return EC;

std::string LocalBuf;		std::string LocalBuf;
LocalBufStream = std::make_unique<raw_string_ostream>(LocalBuf);		LocalBufStream = std::make_unique<raw_string_ostream>(LocalBuf);
if (std::error_code EC = writeSections(ProfileMap))		if (std::error_code EC = writeSections(ProfileMap))
return EC;		return EC;

[345 lines hidden]	std::error_code SampleProfileWriterText::writeSample(const FunctionSamples &S) {
if (FunctionSamples::ProfileIsCS)		if (FunctionSamples::ProfileIsCS)
OS << "[" << S.getContext().toString() << "]:" << S.getTotalSamples();		OS << "[" << S.getContext().toString() << "]:" << S.getTotalSamples();
else		else
OS << S.getName() << ":" << S.getTotalSamples();		OS << S.getName() << ":" << S.getTotalSamples();

if (Indent == 0)		if (Indent == 0)
OS << ":" << S.getHeadSamples();		OS << ":" << S.getHeadSamples();
OS << "\n";		OS << "\n";
		LineCount++;

SampleSorter<LineLocation, SampleRecord> SortedSamples(S.getBodySamples());		SampleSorter<LineLocation, SampleRecord> SortedSamples(S.getBodySamples());
for (const auto &I : SortedSamples.get()) {		for (const auto &I : SortedSamples.get()) {
LineLocation Loc = I->first;		LineLocation Loc = I->first;
const SampleRecord &Sample = I->second;		const SampleRecord &Sample = I->second;
OS.indent(Indent + 1);		OS.indent(Indent + 1);
if (Loc.Discriminator == 0)		if (Loc.Discriminator == 0)
OS << Loc.LineOffset << ": ";		OS << Loc.LineOffset << ": ";
else		else
OS << Loc.LineOffset << "." << Loc.Discriminator << ": ";		OS << Loc.LineOffset << "." << Loc.Discriminator << ": ";

OS << Sample.getSamples();		OS << Sample.getSamples();

for (const auto &J : Sample.getSortedCallTargets())		for (const auto &J : Sample.getSortedCallTargets())
OS << " " << J.first << ":" << J.second;		OS << " " << J.first << ":" << J.second;
OS << "\n";		OS << "\n";
		LineCount++;
}		}

SampleSorter<LineLocation, FunctionSamplesMap> SortedCallsiteSamples(		SampleSorter<LineLocation, FunctionSamplesMap> SortedCallsiteSamples(
S.getCallsiteSamples());		S.getCallsiteSamples());
Indent += 1;		Indent += 1;
for (const auto &I : SortedCallsiteSamples.get())		for (const auto &I : SortedCallsiteSamples.get())
for (const auto &FS : I->second) {		for (const auto &FS : I->second) {
LineLocation Loc = I->first;		LineLocation Loc = I->first;
const FunctionSamples &CalleeSamples = FS.second;		const FunctionSamples &CalleeSamples = FS.second;
OS.indent(Indent);		OS.indent(Indent);
if (Loc.Discriminator == 0)		if (Loc.Discriminator == 0)
OS << Loc.LineOffset << ": ";		OS << Loc.LineOffset << ": ";
else		else
OS << Loc.LineOffset << "." << Loc.Discriminator << ": ";		OS << Loc.LineOffset << "." << Loc.Discriminator << ": ";
if (std::error_code EC = writeSample(CalleeSamples))		if (std::error_code EC = writeSample(CalleeSamples))
return EC;		return EC;
}		}
Indent -= 1;		Indent -= 1;

if (FunctionSamples::ProfileIsProbeBased) {		if (FunctionSamples::ProfileIsProbeBased) {
OS.indent(Indent + 1);		OS.indent(Indent + 1);
OS << "!CFGChecksum: " << S.getFunctionHash() << "\n";		OS << "!CFGChecksum: " << S.getFunctionHash() << "\n";
		LineCount++;
}		}

if (S.getContext().getAllAttributes()) {		if (S.getContext().getAllAttributes()) {
OS.indent(Indent + 1);		OS.indent(Indent + 1);
OS << "!Attributes: " << S.getContext().getAllAttributes() << "\n";		OS << "!Attributes: " << S.getContext().getAllAttributes() << "\n";
		LineCount++;
}		}

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code		std::error_code
SampleProfileWriterBinary::writeContextIdx(const SampleContext &Context) {		SampleProfileWriterBinary::writeContextIdx(const SampleContext &Context) {
assert(!Context.hasContext() && "cs profile is not supported");		assert(!Context.hasContext() && "cs profile is not supported");
[69 lines hidden]	std::error_code SampleProfileWriterBinary::writeNameTable() {
}		}
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterCompactBinary::writeFuncOffsetTable() {		std::error_code SampleProfileWriterCompactBinary::writeFuncOffsetTable() {
auto &OS = *OutputStream;		auto &OS = *OutputStream;

// Fill the slot remembered by TableOffset with the offset of FuncOffsetTable.		// Fill the slot remembered by TableOffset with the offset of FuncOffsetTable.
auto &OFS = static_cast<raw_fd_ostream &>(OS);
uint64_t FuncOffsetTableStart = OS.tell();		uint64_t FuncOffsetTableStart = OS.tell();
if (OFS.seek(TableOffset) == (uint64_t)-1)
return sampleprof_error::ostream_seek_unsupported;
support::endian::Writer Writer(*OutputStream, support::little);
Writer.write(FuncOffsetTableStart);
if (OFS.seek(FuncOffsetTableStart) == (uint64_t)-1)
return sampleprof_error::ostream_seek_unsupported;
		support::endian::SeekableWriter Writer(static_cast<raw_pwrite_stream &>(OS),
		support::little);
		Writer.pwrite(FuncOffsetTableStart, TableOffset);

// Write out the table size.		// Write out the table size.
encodeULEB128(FuncOffsetTable.size(), OS);		encodeULEB128(FuncOffsetTable.size(), OS);

// Write out FuncOffsetTable.		// Write out FuncOffsetTable.
for (auto Entry : FuncOffsetTable) {		for (auto Entry : FuncOffsetTable) {
if (std::error_code EC = writeNameIdx(Entry.first))		if (std::error_code EC = writeNameIdx(Entry.first))
return EC;		return EC;
encodeULEB128(Entry.second, OS);		encodeULEB128(Entry.second, OS);
}		}
		FuncOffsetTable.clear();
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterCompactBinary::writeNameTable() {		std::error_code SampleProfileWriterCompactBinary::writeNameTable() {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
std::set<StringRef> V;		std::set<StringRef> V;
stablizeNameTable(NameTable, V);		stablizeNameTable(NameTable, V);

[11 lines elided]	SampleProfileWriterBinary::writeMagicIdent(SampleProfileFormat Format) {
// Write file magic identifier.		// Write file magic identifier.
encodeULEB128(SPMagic(Format), OS);		encodeULEB128(SPMagic(Format), OS);
encodeULEB128(SPVersion(), OS);		encodeULEB128(SPVersion(), OS);
return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code		std::error_code
SampleProfileWriterBinary::writeHeader(const SampleProfileMap &ProfileMap) {		SampleProfileWriterBinary::writeHeader(const SampleProfileMap &ProfileMap) {
		// When calling write on a different profile map, existing names should be
		// cleared.
		NameTable.clear();

writeMagicIdent(Format);		writeMagicIdent(Format);

computeSummary(ProfileMap);		computeSummary(ProfileMap);
if (auto EC = writeSummary())		if (auto EC = writeSummary())
return EC;		return EC;

// Generate the name table for all the functions referenced in the profile.		// Generate the name table for all the functions referenced in the profile.
for (const auto &I : ProfileMap) {		for (const auto &I : ProfileMap) {
[24 lines elided]	for (uint32_t i = 0; i < SectionHdrLayout.size(); i++) {
Writer.write(static_cast<uint64_t>(-1));		Writer.write(static_cast<uint64_t>(-1));
Writer.write(static_cast<uint64_t>(-1));		Writer.write(static_cast<uint64_t>(-1));
Writer.write(static_cast<uint64_t>(-1));		Writer.write(static_cast<uint64_t>(-1));
Writer.write(static_cast<uint64_t>(-1));		Writer.write(static_cast<uint64_t>(-1));
}		}
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeSecHdrTable() {		std::error_code SampleProfileWriterExtBinaryBase::writeSecHdrTable() {
auto &OFS = static_cast<raw_fd_ostream &>(*OutputStream);
uint64_t Saved = OutputStream->tell();

// Set OutputStream to the location saved in SecHdrTableOffset.
if (OFS.seek(SecHdrTableOffset) == (uint64_t)-1)
return sampleprof_error::ostream_seek_unsupported;
support::endian::Writer Writer(*OutputStream, support::little);

assert(SecHdrTable.size() == SectionHdrLayout.size() &&		assert(SecHdrTable.size() == SectionHdrLayout.size() &&
"SecHdrTable entries doesn't match SectionHdrLayout");		"SecHdrTable entries doesn't match SectionHdrLayout");
SmallVector<uint32_t, 16> IndexMap(SecHdrTable.size(), -1);		SmallVector<uint32_t, 16> IndexMap(SecHdrTable.size(), -1);
for (uint32_t TableIdx = 0; TableIdx < SecHdrTable.size(); TableIdx++) {		for (uint32_t TableIdx = 0; TableIdx < SecHdrTable.size(); TableIdx++) {
IndexMap[SecHdrTable[TableIdx].LayoutIndex] = TableIdx;		IndexMap[SecHdrTable[TableIdx].LayoutIndex] = TableIdx;
}		}

// Write the section header table in the order specified in		// Write the section header table in the order specified in
// SectionHdrLayout. SectionHdrLayout specifies the sections		// SectionHdrLayout. SectionHdrLayout specifies the sections
// order in which profile reader expect to read, so the section		// order in which profile reader expect to read, so the section
// header table should be written in the order in SectionHdrLayout.		// header table should be written in the order in SectionHdrLayout.
// Note that the section order in SecHdrTable may be different		// Note that the section order in SecHdrTable may be different
// from the order in SectionHdrLayout, for example, SecFuncOffsetTable		// from the order in SectionHdrLayout, for example, SecFuncOffsetTable
// needs to be computed after SecLBRProfile (the order in SecHdrTable),		// needs to be computed after SecLBRProfile (the order in SecHdrTable),
// but it needs to be read before SecLBRProfile (the order in		// but it needs to be read before SecLBRProfile (the order in
// SectionHdrLayout). So we use IndexMap above to switch the order.		// SectionHdrLayout). So we use IndexMap above to switch the order.
		support::endian::SeekableWriter Writer(
		static_cast<raw_pwrite_stream &>(*OutputStream), support::little);
for (uint32_t LayoutIdx = 0; LayoutIdx < SectionHdrLayout.size();		for (uint32_t LayoutIdx = 0; LayoutIdx < SectionHdrLayout.size();
LayoutIdx++) {		LayoutIdx++) {
assert(IndexMap[LayoutIdx] < SecHdrTable.size() &&		assert(IndexMap[LayoutIdx] < SecHdrTable.size() &&
"Incorrect LayoutIdx in SecHdrTable");		"Incorrect LayoutIdx in SecHdrTable");
auto Entry = SecHdrTable[IndexMap[LayoutIdx]];		auto Entry = SecHdrTable[IndexMap[LayoutIdx]];
Writer.write(static_cast<uint64_t>(Entry.Type));		Writer.pwrite(static_cast<uint64_t>(Entry.Type),
Writer.write(static_cast<uint64_t>(Entry.Flags));		SecHdrTableOffset + 4 * LayoutIdx * sizeof(uint64_t));
Writer.write(static_cast<uint64_t>(Entry.Offset));		Writer.pwrite(static_cast<uint64_t>(Entry.Flags),
Writer.write(static_cast<uint64_t>(Entry.Size));		SecHdrTableOffset + (4 * LayoutIdx + 1) * sizeof(uint64_t));
		Writer.pwrite(static_cast<uint64_t>(Entry.Offset),
		SecHdrTableOffset + (4 * LayoutIdx + 2) * sizeof(uint64_t));
		Writer.pwrite(static_cast<uint64_t>(Entry.Size),
		SecHdrTableOffset + (4 * LayoutIdx + 3) * sizeof(uint64_t));
}		}

// Reset OutputStream.
if (OFS.seek(Saved) == (uint64_t)-1)
return sampleprof_error::ostream_seek_unsupported;

return sampleprof_error::success;		return sampleprof_error::success;
}		}

std::error_code SampleProfileWriterExtBinaryBase::writeHeader(		std::error_code SampleProfileWriterExtBinaryBase::writeHeader(
const SampleProfileMap &ProfileMap) {		const SampleProfileMap &ProfileMap) {
auto &OS = *OutputStream;		auto &OS = *OutputStream;
FileStart = OS.tell();		FileStart = OS.tell();
writeMagicIdent(Format);		writeMagicIdent(Format);
[160 lines elided]
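
Note on the writeSecHdrTable change above: the old code seeks to the reserved table, writes sequentially, then seeks back; the new code writes each field at a computed offset and never moves the stream position. The following is a minimal standalone sketch of that pattern, not the LLVM implementation. It assumes a hypothetical in-memory ByteBuffer in place of raw_pwrite_stream, and the helper names reserveTable and patchEntry are invented for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a stream that supports positional writes.
struct ByteBuffer {
  std::vector<uint8_t> Bytes;

  // Append a 64-bit value in little-endian order; returns its offset.
  uint64_t append(uint64_t V) {
    uint64_t Off = Bytes.size();
    for (int I = 0; I < 8; ++I)
      Bytes.push_back(static_cast<uint8_t>(V >> (8 * I)));
    return Off;
  }

  // Overwrite 8 bytes at Off. The append position is not affected.
  void pwrite(uint64_t V, uint64_t Off) {
    assert(Off + 8 <= Bytes.size() && "patch must land in reserved space");
    for (int I = 0; I < 8; ++I)
      Bytes[Off + I] = static_cast<uint8_t>(V >> (8 * I));
  }

  // Read back a little-endian 64-bit value at Off.
  uint64_t read(uint64_t Off) const {
    uint64_t V = 0;
    for (int I = 0; I < 8; ++I)
      V |= static_cast<uint64_t>(Bytes[Off + I]) << (8 * I);
    return V;
  }
};

// Four 64-bit fields per entry, mirroring Type/Flags/Offset/Size in the diff.
struct SectionEntry {
  uint64_t Type, Flags, Offset, Size;
};

// Reserve N placeholder entries filled with all-ones; returns the table offset.
uint64_t reserveTable(ByteBuffer &B, size_t N) {
  uint64_t Start = B.Bytes.size();
  for (size_t I = 0; I < 4 * N; ++I)
    B.append(~0ULL);
  return Start;
}

// Patch the entry at LayoutIdx using the same offset arithmetic as the diff:
// TableOffset + (4 * LayoutIdx + FieldIdx) * sizeof(uint64_t).
void patchEntry(ByteBuffer &B, uint64_t TableOffset, size_t LayoutIdx,
                const SectionEntry &E) {
  const uint64_t Fields[4] = {E.Type, E.Flags, E.Offset, E.Size};
  for (size_t F = 0; F < 4; ++F)
    B.pwrite(Fields[F], TableOffset + (4 * LayoutIdx + F) * sizeof(uint64_t));
}
```

With a table reserved at offset 8, patchEntry(B, 8, 1, {1, 2, 3, 4}) overwrites bytes 40 through 71 and leaves the append position untouched, so there is no saved offset to restore and no seek that can fail.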

llvm/tools/llvm-profdata/llvm-profdata.cpp

[960 lines elided]

static void		static void
mergeSampleProfile(const WeightedFileVector &Inputs, SymbolRemapper *Remapper,		mergeSampleProfile(const WeightedFileVector &Inputs, SymbolRemapper *Remapper,
StringRef OutputFilename, ProfileFormat OutputFormat,		StringRef OutputFilename, ProfileFormat OutputFormat,
StringRef ProfileSymbolListFile, bool CompressAllSections,		StringRef ProfileSymbolListFile, bool CompressAllSections,
bool UseMD5, bool GenPartialProfile, bool GenCSNestedProfile,		bool UseMD5, bool GenPartialProfile, bool GenCSNestedProfile,
bool SampleMergeColdContext, bool SampleTrimColdContext,		bool SampleMergeColdContext, bool SampleTrimColdContext,
bool SampleColdContextFrameDepth, FailureMode FailMode,		bool SampleColdContextFrameDepth, FailureMode FailMode,
bool DropProfileSymbolList) {		bool DropProfileSymbolList, size_t OutputSizeLimit) {
using namespace sampleprof;		using namespace sampleprof;
SampleProfileMap ProfileMap;		SampleProfileMap ProfileMap;
SmallVector<std::unique_ptr<sampleprof::SampleProfileReader>, 5> Readers;		SmallVector<std::unique_ptr<sampleprof::SampleProfileReader>, 5> Readers;
LLVMContext Context;		LLVMContext Context;
sampleprof::ProfileSymbolList WriterList;		sampleprof::ProfileSymbolList WriterList;
std::optional<bool> ProfileIsProbeBased;		std::optional<bool> ProfileIsProbeBased;
std::optional<bool> ProfileIsCS;		std::optional<bool> ProfileIsCS;
for (const auto &Input : Inputs) {		for (const auto &Input : Inputs) {
[76 lines elided]	if (std::error_code EC = WriterOrErr.getError())
exitWithErrorCode(EC, OutputFilename);		exitWithErrorCode(EC, OutputFilename);

auto Writer = std::move(WriterOrErr.get());		auto Writer = std::move(WriterOrErr.get());
// WriterList will have StringRef refering to string in Buffer.		// WriterList will have StringRef refering to string in Buffer.
// Make sure Buffer lives as long as WriterList.		// Make sure Buffer lives as long as WriterList.
auto Buffer = getInputFileBuf(ProfileSymbolListFile);		auto Buffer = getInputFileBuf(ProfileSymbolListFile);
handleExtBinaryWriter(*Writer, OutputFormat, Buffer.get(), WriterList,		handleExtBinaryWriter(*Writer, OutputFormat, Buffer.get(), WriterList,
CompressAllSections, UseMD5, GenPartialProfile);		CompressAllSections, UseMD5, GenPartialProfile);
if (std::error_code EC = Writer->write(ProfileMap))
		// If OutputSizeLimit is 0 (default), it is the same as write().
		if (std::error_code EC =
		Writer->writeWithSizeLimit(ProfileMap, OutputSizeLimit))
exitWithErrorCode(std::move(EC));		exitWithErrorCode(std::move(EC));
}		}

static WeightedFile parseWeightedFile(const StringRef &WeightedFilename) {		static WeightedFile parseWeightedFile(const StringRef &WeightedFilename) {
StringRef WeightStr, FileName;		StringRef WeightStr, FileName;
std::tie(WeightStr, FileName) = WeightedFilename.split(',');		std::tie(WeightStr, FileName) = WeightedFilename.split(',');

uint64_t Weight;		uint64_t Weight;
[126 lines elided]	static int merge_main(int argc, const char *argv[]) {
cl::opt<bool> SampleTrimColdContext(		cl::opt<bool> SampleTrimColdContext(
"sample-trim-cold-context", cl::init(false), cl::Hidden,		"sample-trim-cold-context", cl::init(false), cl::Hidden,
cl::desc(		cl::desc(
"Trim context sample profiles whose count is below cold threshold"));		"Trim context sample profiles whose count is below cold threshold"));
cl::opt<uint32_t> SampleColdContextFrameDepth(		cl::opt<uint32_t> SampleColdContextFrameDepth(
"sample-frame-depth-for-cold-context", cl::init(1),		"sample-frame-depth-for-cold-context", cl::init(1),
cl::desc("Keep the last K frames while merging cold profile. 1 means the "		cl::desc("Keep the last K frames while merging cold profile. 1 means the "
"context-less base profile"));		"context-less base profile"));
		cl::opt<size_t> OutputSizeLimit(
		"output-size-limit", cl::init(0), cl::Hidden,
		cl::desc("Trim cold functions until profile size is below specified "
		"limit in bytes. This uses a heursitic and functions may be "
		"excessively trimmed"));
cl::opt<bool> GenPartialProfile(		cl::opt<bool> GenPartialProfile(
"gen-partial-profile", cl::init(false), cl::Hidden,		"gen-partial-profile", cl::init(false), cl::Hidden,
cl::desc("Generate a partial profile (only meaningful for -extbinary)"));		cl::desc("Generate a partial profile (only meaningful for -extbinary)"));
cl::opt<std::string> SupplInstrWithSample(		cl::opt<std::string> SupplInstrWithSample(
"supplement-instr-with-sample", cl::init(""), cl::Hidden,		"supplement-instr-with-sample", cl::init(""), cl::Hidden,
cl::desc("Supplement an instr profile with sample profile, to correct "		cl::desc("Supplement an instr profile with sample profile, to correct "
"the profile unrepresentativeness issue. The sample "		"the profile unrepresentativeness issue. The sample "
"profile is the input of the flag. Output will be in instr "		"profile is the input of the flag. Output will be in instr "
[70 lines elided]	if (ProfileKind == instr)
mergeInstrProfile(WeightedInputs, DebugInfoFilename, Remapper.get(),		mergeInstrProfile(WeightedInputs, DebugInfoFilename, Remapper.get(),
OutputFilename, OutputFormat, OutputSparse, NumThreads,		OutputFilename, OutputFormat, OutputSparse, NumThreads,
FailureMode, ProfiledBinary);		FailureMode, ProfiledBinary);
else		else
mergeSampleProfile(		mergeSampleProfile(
WeightedInputs, Remapper.get(), OutputFilename, OutputFormat,		WeightedInputs, Remapper.get(), OutputFilename, OutputFormat,
ProfileSymbolListFile, CompressAllSections, UseMD5, GenPartialProfile,		ProfileSymbolListFile, CompressAllSections, UseMD5, GenPartialProfile,
GenCSNestedProfile, SampleMergeColdContext, SampleTrimColdContext,		GenCSNestedProfile, SampleMergeColdContext, SampleTrimColdContext,
SampleColdContextFrameDepth, FailureMode, DropProfileSymbolList);		SampleColdContextFrameDepth, FailureMode, DropProfileSymbolList,
		OutputSizeLimit);
return 0;		return 0;
}		}

/// Computer the overlap b/w profile BaseFilename and profile TestFilename.		/// Computer the overlap b/w profile BaseFilename and profile TestFilename.
static void overlapInstrProfile(const std::string &BaseFilename,		static void overlapInstrProfile(const std::string &BaseFilename,
const std::string &TestFilename,		const std::string &TestFilename,
const OverlapFuncFilters &FuncFilter,		const OverlapFuncFilters &FuncFilter,
raw_fd_ostream &OS, bool IsCS) {		raw_fd_ostream &OS, bool IsCS) {
[1,715 lines elided]
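
Note on output-size-limit: the option description above says cold functions are trimmed until the profile is below the limit, and that 0 behaves like a plain write(). The real heuristic lives in writeWithSizeLimit, which is not visible in this excerpt. The sketch below is only an illustration of the general idea under stated assumptions: ToyProfile, serializedSize and trimToLimit are hypothetical names, and the "name:count" size model is made up.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <string>
#include <utility>
#include <vector>

// Hypothetical simplified profile: function name -> total sample count.
using ToyProfile = std::map<std::string, uint64_t>;

// Made-up size model: one text line "name:count\n" per function.
size_t serializedSize(const ToyProfile &P) {
  size_t N = 0;
  for (const auto &KV : P)
    N += KV.first.size() + 1 + std::to_string(KV.second).size() + 1;
  return N;
}

// Drop the coldest functions one at a time until the profile fits in Limit.
// Limit == 0 means "no limit". Returns false if nothing could be kept, which
// loosely corresponds to the too_large case handled in the unit test.
bool trimToLimit(ToyProfile &P, size_t Limit) {
  if (Limit == 0)
    return true;

  // Order functions by sample count, coldest first.
  std::vector<std::pair<uint64_t, std::string>> ByCount;
  for (const auto &KV : P)
    ByCount.emplace_back(KV.second, KV.first);
  std::sort(ByCount.begin(), ByCount.end());

  size_t Next = 0;
  while (serializedSize(P) > Limit && Next < ByCount.size())
    P.erase(ByCount[Next++].second);
  return !P.empty();
}
```

Using the three top-level functions from Input1 in the unit test (main, _Z3bari, _Z3fooi), a limit of 30 bytes under this toy size model drops only _Z3fooi, the function with the fewest samples.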

llvm/unittests/tools/CMakeLists.txt

	if(LLVM_TARGETS_TO_BUILD MATCHES "X86")			if(LLVM_TARGETS_TO_BUILD MATCHES "X86")
	add_subdirectory(			add_subdirectory(
	llvm-cfi-verify			llvm-cfi-verify
	)			)
	endif()			endif()

	add_subdirectory(			add_subdirectory(
	llvm-exegesis			llvm-exegesis
	)			)
				add_subdirectory(llvm-profdata)
	add_subdirectory(llvm-profgen)			add_subdirectory(llvm-profgen)
	add_subdirectory(llvm-mca)			add_subdirectory(llvm-mca)

llvm/unittests/tools/llvm-profdata/CMakeLists.txt

This file was added.

				set(LLVM_LINK_COMPONENTS
				ProfileData
				Support
				)

				add_llvm_unittest(LLVMProfdataTests
				OutputSizeLimitTest.cpp
				)

				target_link_libraries(LLVMProfdataTests PRIVATE LLVMTestingSupport)

				set_property(TARGET LLVMProfdataTests PROPERTY FOLDER "Tests/UnitTests/ToolTests")

llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp

This file was added.

				//===- llvm/unittests/tools/llvm-profdata/OutputSizeLimitTest.cpp ---------===//
				//
				// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
				// See https://llvm.org/LICENSE.txt for license information.
				// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
				//
				//===----------------------------------------------------------------------===//

				#include "llvm/ProfileData/SampleProfReader.h"
				#include "llvm/ProfileData/SampleProfWriter.h"
				#include "llvm/Support/FileSystem.h"
				#include "llvm/Testing/Support/Error.h"
				#include "gtest/gtest.h"

				using namespace llvm;
				using llvm::unittest::TempFile;

				std::string Input1 = R"(main:184019:0
				4: 534
				4.2: 534
				5: 1075
				5.1: 1075
				6: 2080
				7: 534
				9: 2064 _Z3bari:1471 _Z3fooi:631
				10: inline1:1000
				1: 1000
				10: inline2:2000
				1: 2000
				_Z3bari:20301:1437
				1: 1437
				_Z3fooi:7711:610
				1: 610)";

				const char EmptyProfile[18] = "\xff\xe5\xd0\xb1\xf4\xc9\x94\xa8\x53\x67";

				/// sys::fs and SampleProf mix Error and error_code, making an adapter class
				/// to keep code elegant.
				[Inline comment] thakis: FYI: https://github.com/google/googletest/blob/main/docs/advanced.md#propagating-fatal-failures
				template <typename T> class ExpectedErrorOr : public Expected<T> {
				public:
				ExpectedErrorOr(T &&Obj) : Expected<T>(Obj) {}

				[Inline comment] snehasish: I think we should replace report_fatal_error with `std::error_code EC = ReaderOrErr.getError(); ASSERT_FALSE(EC) << EC.message().c_str();` to let gtest handle the failure cleanly.
				ExpectedErrorOr(std::error_code EC) : Expected<T>(errorCodeToError(EC)) {}

				ExpectedErrorOr(Error &&E) : Expected<T>(std::move(E)) {}

				template <typename U>
				ExpectedErrorOr(ErrorOr<U> &&E)
				: Expected<T>(errorCodeToError(E.getError())) {}

				template <typename U>
				ExpectedErrorOr(Expected<U> &&E) : Expected<T>(E.takeError()) {}
				};

				#define DEF_VAR_RETURN_IF_ERROR(Var, Value) \
				auto Var##OrErr = Value; \
				if (!Var##OrErr) \
				return Var##OrErr; \
				auto Var = std::move(Var##OrErr.get())

				#define VAR_RETURN_IF_ERROR(Var, Value) \
				Var##OrErr = Value; \
				if (!Var##OrErr) \
				return Var##OrErr; \
				Var = std::move(Var##OrErr.get())

				#define RETURN_IF_ERROR(Value) \
				if (auto E = Value) \
				return std::move(E)

				[Inline comment] snehasish: Perhaps TestWriteWithSizeLimit instead of TestOutputSizeLimit1 is a little more informative?
				static bool operator==(const FunctionSamples &a, const FunctionSamples &b) {
				[Inline comment] snehasish: I think these parameter names should be capitalized according to the coding standards: https://llvm.org/docs/CodingStandards.html. Also consider moving this into the implementation of FunctionSamples, since it seems generally useful to have operator== implemented.
				const BodySampleMap &BodySamplesA = a.getBodySamples();
				[Inline comment] snehasish: This should probably be EXPECT_LE. The rationale is explained here: https://stackoverflow.com/a/2565309
				const BodySampleMap &BodySamplesB = b.getBodySamples();

				const CallsiteSampleMap &CallsiteSamplesA = a.getCallsiteSamples();
				const CallsiteSampleMap &CallsiteSamplesB = b.getCallsiteSamples();

				if (a.getTotalSamples() != b.getTotalSamples() ||
				BodySamplesA.size() != BodySamplesB.size() ||
				CallsiteSamplesA.size() != CallsiteSamplesB.size())
				return false;

				for (auto &BodySampleA : BodySamplesA) {
				auto BodySampleB = BodySamplesB.find(BodySampleA.first);
				if (BodySampleB == BodySamplesB.end() ||
				BodySampleA.second.getSamples() != BodySampleB->second.getSamples() ||
				BodySampleA.second.getCallTargets() !=
				BodySampleB->second.getCallTargets())
				return false;
				}

				for (auto &CallsiteA : CallsiteSamplesA) {
				auto CallsiteB = CallsiteSamplesB.find(CallsiteA.first);
				if (CallsiteB == CallsiteSamplesB.end())
				return false;

				auto FunctionsA = CallsiteA.second;
				auto FunctionsB = CallsiteB->second;
				if (FunctionsA.size() != FunctionsB.size())
				return false;
				for (auto &FunctionA : FunctionsA) {
				auto FunctionB = FunctionsB.find(FunctionA.first);
				if (FunctionB == FunctionsB.end() ||
				!(FunctionA.second == FunctionB->second))
				return false;
				}
				}
				return true;
				}

				/// The main testing routine. After rewriting profiles with size limit, check
				/// the following:
				/// 1. The file size of the new profile is within the size limit.
				/// 2. The new profile is a subset of the old profile, and the content of every
				/// sample in the new profile is unchanged.
				/// Note that even though by default samples with fewest total count are dropped
				/// first, this is not a requirement. Samples can be dropped by any order.
				static ExpectedErrorOr<void *> RunTest(StringRef Input, size_t SizeLimit,
				SampleProfileFormat Format) {
				// Read Input profile.
				LLVMContext Context;
				auto InputBuffer = MemoryBuffer::getMemBuffer(Input);
				DEF_VAR_RETURN_IF_ERROR(Reader,
				SampleProfileReader::create(InputBuffer, Context));
				RETURN_IF_ERROR(Reader->read());
				SampleProfileMap OldProfiles = Reader->getProfiles();

				// Rewrite it to a temp file with size limit.
				TempFile Temp("profile", "afdo");
				bool isEmpty = false;
				{
				DEF_VAR_RETURN_IF_ERROR(Writer,
				SampleProfileWriter::create(Temp.path(), Format));
				std::error_code EC = Writer->writeWithSizeLimit(OldProfiles, SizeLimit);
				// too_large means no sample could be written because SizeLimit is too
				// small. Otherwise any other error code indicates unexpected failure.
				if (EC == sampleprof_error::too_large)
				isEmpty = true;
				else if (EC)
				return EC;
				}

				// Read the temp file to get new profiles. Use the default empty profile if
				// temp file was not written because size limit is too small.
				SampleProfileMap NewProfiles;
				InputBuffer = MemoryBuffer::getMemBuffer(StringRef(EmptyProfile, 17));
				DEF_VAR_RETURN_IF_ERROR(NewReader,
				SampleProfileReader::create(InputBuffer, Context));
				if (!isEmpty) {
				VAR_RETURN_IF_ERROR(
				NewReader, SampleProfileReader::create(Temp.path().str(), Context));
				RETURN_IF_ERROR(NewReader->read());
				NewProfiles = NewReader->getProfiles();
				}

				// Check temp file is actually within size limit.
				uint64_t FileSize;
				RETURN_IF_ERROR(sys::fs::file_size(Temp.path(), FileSize));
				EXPECT_LE(FileSize, SizeLimit);

				// For compact binary format, function names are stored as MD5, so we cannot
				// directly match the samples of the new profile with the old profile. A
				// simple way is to convert the old profile to compact binary format and read
				// it back
				if (Format == llvm::sampleprof::SPF_Compact_Binary) {
				TempFile CompBinary("compbinary", "afdo");
				{
				DEF_VAR_RETURN_IF_ERROR(
				Writer, SampleProfileWriter::create(
				CompBinary.path(), llvm::sampleprof::SPF_Compact_Binary));
				RETURN_IF_ERROR(Writer->write(OldProfiles));
				}
				ReaderOrErr = SampleProfileReader::create(CompBinary.path().str(), Context);
				[Inline comment] snehasish: VAR_RETURN_IF_ERROR can be used here?
				if (!ReaderOrErr)
				return ReaderOrErr;
				Reader = std::move(ReaderOrErr.get());
				RETURN_IF_ERROR(Reader->read());
				OldProfiles = Reader->getProfiles();
				}

				// For every sample in the new profile, confirm it is in the old profile and
				// unchanged.
				for (auto Sample : NewProfiles) {
				auto FindResult = OldProfiles.find(Sample.first);
				EXPECT_NE(FindResult, OldProfiles.end());
				if (FindResult != OldProfiles.end()) {
				EXPECT_EQ(Sample.second.getHeadSamples(),
				FindResult->second.getHeadSamples());
				EXPECT_TRUE(Sample.second == FindResult->second);
				}
				}
				return nullptr;
				}

				TEST(TestOutputSizeLimit, TestOutputSizeLimitExtBinary) {
				for (size_t OutputSizeLimit : {490, 489, 488, 475, 474, 459, 400})
				ASSERT_THAT_EXPECTED(
				[Inline comment] snehasish: EXPECT_THAT_EXPECTED is probably better to continue with other test cases rather than aborting the test on the first failure?
				[Inline comment] huangjd (author): The callee returns an error only if there's something wrong with I/O or profile reading, which is not expected to happen at all, so ASSERT is used properly here. This test checks the correctness of the sample map after size reduction, which is checked with EXPECT_EQ.
				RunTest(Input1, OutputSizeLimit, llvm::sampleprof::SPF_Ext_Binary),
				Succeeded());
				}

				TEST(TestOutputSizeLimit, TestOutputSizeLimitBinary) {
				for (size_t OutputSizeLimit : {250, 249, 248, 237, 236, 223, 200})
				ASSERT_THAT_EXPECTED(
				RunTest(Input1, OutputSizeLimit, llvm::sampleprof::SPF_Binary),
				Succeeded());
				}

				TEST(TestOutputSizeLimit, TestOutputSizeLimitCompBinary) {
				for (size_t OutputSizeLimit : {277, 276, 275, 264, 263, 250, 200})
				ASSERT_THAT_EXPECTED(
				RunTest(Input1, OutputSizeLimit, llvm::sampleprof::SPF_Compact_Binary),
				Succeeded());
				}

				TEST(TestOutputSizeLimit, TestOutputSizeLimitText) {
				for (size_t OutputSizeLimit :
				{229, 228, 227, 213, 212, 211, 189, 188, 187, 186, 150})
				ASSERT_THAT_EXPECTED(
				RunTest(Input1, OutputSizeLimit, llvm::sampleprof::SPF_Text),
				Succeeded());
				}
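
Note on the RunTest checks: the doc comment above lists two properties, that the rewritten file fits within the limit and that every sample in the new profile exists unchanged in the old one, with no requirement on which samples are dropped. The second property can be sketched without LLVM types. This is an illustration only; ToyProfileMap and isUnchangedSubset are hypothetical names, and a single count stands in for the full FunctionSamples comparison done by operator== in the test.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>

// Hypothetical simplified profile: function name -> total sample count.
using ToyProfileMap = std::map<std::string, uint64_t>;

// True if every entry of New is present in Old with an identical value.
// Which entries are missing from New is deliberately not checked, matching
// the test's statement that samples may be dropped in any order.
bool isUnchangedSubset(const ToyProfileMap &New, const ToyProfileMap &Old) {
  for (const auto &KV : New) {
    auto It = Old.find(KV.first);
    if (It == Old.end() || It->second != KV.second)
      return false;
  }
  return true;
}
```

An empty new profile passes trivially, which matches how the test treats the too_large case: it compares an empty map against the original.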

Revision Contents

Diff 493781