This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/lib/Basic/
-
lib/
-
Basic/
-
SourceManager.cpp

Differential D97320

Use a fast path when initializing LineOffsetMapping
ClosedPublic

Authored by serge-sans-paille on Feb 23 2021, 11:53 AM.

Download Raw Diff

Details

Reviewers

nikic
lattner

Commits

rG80e8efd563fd: Use a fast path when initializing LineOffsetMapping

Summary

Use the fact that the number of line break is lower than printable characters to
guide the optimization process. Also use a fuzzy test that catches both \n and
\r in a single check to speedup the computation.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

serge-sans-paille created this revision.Feb 23 2021, 11:53 AM

Herald added a subscriber: dexonsmith. · View Herald TranscriptFeb 23 2021, 11:53 AM

serge-sans-paille requested review of this revision.Feb 23 2021, 11:53 AM

Harbormaster completed remote builds in B90455: Diff 325861.Feb 23 2021, 1:59 PM

Interesting optimization! How much speedup does this provide?

In D97320#2585515, @lattner wrote:

Interesting optimization! How much speedup does this provide?

When running perf stat -e instructions ./bin/clang -O0 -fsyntax-only sqlite3.c -o/dev/null -w before / after the patch, I get:

1,929,024,335 instructions before
1,905,625,961 instructions after

That's roughly a 1% speedup ;-)

Nice, it would be worth checking to see if something like this (duplicating the ++I) would be faster or if it codegens the same:

while (I < BufLen) {

// Use a fast check to catch both newlines
if (LLVM_UNLIKELY(Buf[I] > std::max('\n', '\r'))) {
  ++I;
  continue;
}
other stuff

}

This revision is now accepted and ready to land.Feb 27 2021, 9:28 PM

Same generated assembly:

┌─→movzbl 0x0(%r13,%rbp,1),%ecx
│  cmp    $0xd,%cl
│↑ jbe    61
│  add    $0x1,%ebp
├──cmp    %rbp,%rbx
└──ja     110

This revision was landed with ongoing or failed builds.Mar 1 2021, 1:23 AM

Closed by commit rG80e8efd563fd: Use a fast path when initializing LineOffsetMapping (authored by serge-sans-paille). · Explain Why

This revision was automatically updated to reflect the committed changes.

serge-sans-paille added a commit: rG80e8efd563fd: Use a fast path when initializing LineOffsetMapping.

Herald added a project: Restricted Project. · View Herald TranscriptMar 1 2021, 1:23 AM

Herald added a subscriber: cfe-commits. · View Herald Transcript

Nice, thanks!

Revision Contents

Path

Size

clang/

lib/

Basic/

SourceManager.cpp

17 lines

Diff 327049

clang/lib/Basic/SourceManager.cpp

Show First 20 Lines • Show All 1,264 Lines • ▼ Show 20 Lines	LineOffsetMapping LineOffsetMapping::get(llvm::MemoryBufferRef Buffer,
// Line #1 starts at char 0.		// Line #1 starts at char 0.
LineOffsets.push_back(0);		LineOffsets.push_back(0);

const unsigned char Buf = (const unsigned char )Buffer.getBufferStart();		const unsigned char Buf = (const unsigned char )Buffer.getBufferStart();
const unsigned char End = (const unsigned char )Buffer.getBufferEnd();		const unsigned char End = (const unsigned char )Buffer.getBufferEnd();
const std::size_t BufLen = End - Buf;		const std::size_t BufLen = End - Buf;
unsigned I = 0;		unsigned I = 0;
while (I < BufLen) {		while (I < BufLen) {
		// Use a fast check to catch both newlines
		if (LLVM_UNLIKELY(Buf[I] <= std::max('\n', '\r'))) {
if (Buf[I] == '\n') {		if (Buf[I] == '\n') {
LineOffsets.push_back(I + 1);		LineOffsets.push_back(I + 1);
} else if (Buf[I] == '\r') {		} else if (Buf[I] == '\r') {
// If this is \r\n, skip both characters.		// If this is \r\n, skip both characters.
if (I + 1 < BufLen && Buf[I + 1] == '\n')		if (I + 1 < BufLen && Buf[I + 1] == '\n')
++I;		++I;
LineOffsets.push_back(I + 1);		LineOffsets.push_back(I + 1);
}		}
		}
++I;		++I;
}		}

return LineOffsetMapping(LineOffsets, Alloc);		return LineOffsetMapping(LineOffsets, Alloc);
}		}

LineOffsetMapping::LineOffsetMapping(ArrayRef<unsigned> LineOffsets,		LineOffsetMapping::LineOffsetMapping(ArrayRef<unsigned> LineOffsets,
llvm::BumpPtrAllocator &Alloc)		llvm::BumpPtrAllocator &Alloc)
▲ Show 20 Lines • Show All 933 Lines • Show Last 20 Lines