
[Support] Make line-number cache robust against access patterns.
ClosedPublic

Authored by graydon on Mar 28 2018, 3:39 PM.

Details

Summary

The LLVM SourceMgr class (which is used indirectly by Swift, though not Clang)
has a routine for looking up line numbers of SMLocs. This routine uses a
shared, special-purpose cache that handles exactly one access pattern
efficiently: looking up the line number of an SMLoc that points into the same
buffer as the last query made to the SourceMgr, at a location in the buffer at
or ahead of the last query.
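The degenerate behaviour described above can be sketched as follows. This is a minimal illustration with made-up names, not the actual LLVM code: the single-entry cache only helps when a query lands in the same buffer at or after the previous one; anything else falls back to a linear rescan from the start of the buffer.

```cpp
// Sketch of the old shared-cache fast path (illustrative names only).
// The cache helps only when the query is in the same buffer as the
// last one, at or after the last query's pointer; every other access
// pattern rescans the buffer from the beginning.
struct LineCache {
  unsigned LastBufferID = 0;
  const char *LastQueryPtr = nullptr;
  unsigned LastLineNo = 1;
};

unsigned findLineNumber(LineCache &C, unsigned BufID,
                        const char *BufStart, const char *Loc) {
  const char *Scan = BufStart;
  unsigned Line = 1;
  if (C.LastBufferID == BufID && C.LastQueryPtr <= Loc) {
    Scan = C.LastQueryPtr; // fast path: resume from the previous query
    Line = C.LastLineNo;
  }
  for (; Scan != Loc; ++Scan) // slow path: count '\n' from wherever we start
    if (*Scan == '\n')
      ++Line;
  C.LastBufferID = BufID;
  C.LastQueryPtr = Loc;
  C.LastLineNo = Line;
  return Line;
}
```

An out-of-order query (a `Loc` before `LastQueryPtr`) misses the fast path every time, which is exactly the pathology the patch removes.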

When this works it's fine, but when it fails it's catastrophic for performance:
one recent out-of-order access from a Swift utility routine ran for tens of
seconds, spending 99% of its time repeatedly scanning buffers for '\n'.

This change removes the shared cache from the SourceMgr and installs a new
cache in each SrcBuffer. The per-SrcBuffer caches are also "full", in the sense
that rather than caching a single last-query pointer, they cache _all_ the
line-ending offsets, in a binary-searchable array, such that once it's
populated (on first access), all subsequent access patterns run at the same
speed.
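A minimal sketch of the per-buffer "full" cache described above (illustrative names, not the actual SrcBuffer API): populate a sorted vector of line-start offsets on first access, then binary-search it for every subsequent query.

```cpp
// Sketch of the per-SrcBuffer cache (names are illustrative). On first
// lookup we record the offset just past every '\n'; after that, any
// access pattern is answered by a binary search in O(log n).
#include <algorithm>
#include <cstddef>
#include <vector>

struct SrcBufferSketch {
  const char *Start;
  std::size_t Size;
  std::vector<std::size_t> OffsetCache; // start offset of each line after line 1

  unsigned getLineNumber(const char *Loc) {
    if (OffsetCache.empty())
      for (std::size_t I = 0; I != Size; ++I)
        if (Start[I] == '\n')
          OffsetCache.push_back(I + 1);
    std::size_t Off = static_cast<std::size_t>(Loc - Start);
    // The number of line starts at or before Off, plus one, is the line number.
    auto It = std::upper_bound(OffsetCache.begin(), OffsetCache.end(), Off);
    return static_cast<unsigned>(It - OffsetCache.begin()) + 1;
  }
};
```

Because the vector is sorted by construction, forward, backward, and random access patterns all cost the same after the one-time population pass.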

Performance measurements I've done show this is actually a little faster on
real codebases (though only by a few fractions of a percent). Memory usage is
up by a few tens to hundreds of bytes per SrcBuffer that has a line lookup done
on it; I've attempted to minimize this by dynamically selecting the integer
size used when storing offset arrays. But the main motive here is to make the
pathological cases impossible: the ones we don't always see, that show up by
surprise when there is an out-of-order access pattern.

Diff Detail

Repository
rL LLVM

Event Timeline

graydon created this revision. Mar 28 2018, 3:39 PM

That's potentially a lot more memory usage, since the various vectors never get freed. Have you measured what the effect on memory usage is on a large Swift project with diagnostics in many files?

> That's potentially a lot more memory usage, since the various vectors never get freed. Have you measured what the effect on memory usage is on a large Swift project with diagnostics in many files?

I've not been able to measure any memory difference when building local projects here (rusage maxrss fluctuates by a few tens of KB anyway, due to goodness knows what nondeterminism). Keep in mind we only fill the vector on a lookup, so most buffers will never be populated. And for those that do: assuming a file is (say) 1,000 lines long, it'll cost 8KB to index. Even if we indexed a thousand such files -- a diagnostic in every file of a large project! -- we'd only be talking 8MB.

I'll keep trying a few other cases and post back.

jordan_rose added a comment. Edited Mar 29 2018, 10:03 AM

> And for those that do: assuming a file is (say) 1,000 lines long, it'll cost 8KB to index. Even if we indexed a thousand such files -- a diagnostic in every file of a large project! -- we'd only be talking 8MB.

I'm not sure 1000 lines is a reasonable assumption, but the orders of magnitude probably do still work out. Okay, I'm less concerned, and this looks okay.

I did a more precise measurement: on a medium-sized module (802 files, 200 kloc), when we're doing -emit-module for a whole module (which turns out to map _every file_ in order to get doc-comment locations when emitting module docs), the normal compilation uses 191MB RSS and these maps account for 3.3MB of that (1.7%). The largest map is 32KB; the median is 4KB. Not trivial, but not huge.

However, in those cases it also represents a more noticeable speedup: 271s -> 267s (and 925bn instructions -> 918bn).

Given that the majority of the files are quite small -- certainly smaller than 2^64 bits! -- I've made a variant that uses variable-size offsets (8-, 16-, 32-, or 64-bit) rather than pointers. That variant uses only 956KB for the indexes in that test case (with 200 kloc / 191MB RSS). I think that's about as memory-cheap as I can make it using this approach.
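The variable-width selection graydon describes might look something like this sketch (a hypothetical helper, not the actual patch): pick the narrowest unsigned type whose range covers the buffer size, so a small file pays one or two bytes per stored offset instead of eight.

```cpp
// Sketch of dynamic offset-width selection (illustrative, not the
// exact LLVM code). Offsets into a buffer of `Size` bytes fit in the
// smallest unsigned integer type whose max value covers `Size`.
#include <cstdint>
#include <limits>

// Returns the number of bytes per stored offset for a buffer of Size bytes.
inline unsigned offsetWidthFor(std::uint64_t Size) {
  if (Size <= std::numeric_limits<std::uint8_t>::max())
    return 1;
  if (Size <= std::numeric_limits<std::uint16_t>::max())
    return 2;
  if (Size <= std::numeric_limits<std::uint32_t>::max())
    return 4;
  return 8;
}
```

The interesting boundaries are at 255/256 and 65535/65536 bytes, which is why the reviewer's later request for 255-, 256-, and 257-byte buffer tests is a natural fit.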

graydon updated this revision to Diff 140387. Mar 30 2018, 12:05 AM

Modification to use variable unit-size offset vectors rather than pointer vectors.

graydon updated this revision to Diff 140388. Mar 30 2018, 12:08 AM

Fix comments.

graydon edited the summary of this revision. Mar 30 2018, 12:11 AM
graydon edited the summary of this revision.

That seems about as clever as possible -- anything more and it would definitely be overboard. Can you add tests with 255-, 256-, and 257-byte buffers, then? With and without newlines as the last character, and testing the just-past-the-end pointer in addition to something in-bounds?

There's commentary in lib/MC/MCParser/AsmParser.cpp about the ungraceful degradation in SourceMgr's cache. Would this help or simplify what AsmParser is doing?

graydon updated this revision to Diff 140714. Apr 2 2018, 5:23 PM
graydon edited the summary of this revision.

Update to add more tests around boundary conditions of buffer-map expansion.

> There's commentary in lib/MC/MCParser/AsmParser.cpp about the ungraceful degradation in SourceMgr's cache. Would this help or simplify what AsmParser is doing?

I believe this should eliminate the behaviour those comments describe (and thus remove the need for the code), though I feel a bit under-qualified tinkering with that code myself. Would you like me to try, or shall I leave it to the owners of that file?

> There's commentary in lib/MC/MCParser/AsmParser.cpp about the ungraceful degradation in SourceMgr's cache. Would this help or simplify what AsmParser is doing?

> I believe this should eliminate the behaviour those comments describe (and thus remove the need for the code), though I feel a bit under-qualified tinkering with that code myself. Would you like me to try, or shall I leave it to the owners of that file?

If you could have a go at it, that would be great. It's fine to do it as a follow-up NFC patch. Or file a bug and cc me. It's not really my area either but I keep running across that comment and it has always bothered me, maybe enough to do something now.

graydon updated this revision to Diff 141239. Apr 5 2018, 4:43 PM
  • [AsmParser] Remove code that compensated for formerly-slow SrcMgr.FindLineNumber()

> There's commentary in lib/MC/MCParser/AsmParser.cpp about the ungraceful degradation in SourceMgr's cache. Would this help or simplify what AsmParser is doing?

> I believe this should eliminate the behaviour those comments describe (and thus remove the need for the code), though I feel a bit under-qualified tinkering with that code myself. Would you like me to try, or shall I leave it to the owners of that file?

> If you could have a go at it, that would be great. It's fine to do it as a follow-up NFC patch. Or file a bug and cc me. It's not really my area either, but I keep running across that comment and it has always bothered me, maybe enough to do something now.

OK, after a little staring it was clear this was just a small change. I made a synthetic test with 10,000 line directives and built it with -c -g, both with and without the change, and saw only a 0.1% increase in instructions retired / a 1ms increase in time (out of 300ms). So I figure this is probably acceptable?

Removing code with effectively no performance or functionality penalty is a beautiful thing. That part LGTM.
@jordan_rose still needs to sign off, I think.

jordan_rose accepted this revision. Apr 6 2018, 9:52 AM

Ah, yes, Graydon's convinced me that the change will work and will not regress memory usage terribly, and the new tests look good.

This revision is now accepted and ready to land. Apr 6 2018, 9:52 AM
This revision was automatically updated to reflect the committed changes.