This is an archive of the discontinued LLVM Phabricator instance.

[Support] use larger character set for creating unique filenames
Needs ReviewPublic

Authored by inglorion on Jul 31 2018, 8:16 PM.

Download Raw Diff

Details

Reviewers

rnk
pcc
zturner

Summary

This changes the character set we use to create unique filenames from
16 characters to 36, which allows more unique filenames to be
generated for the same name length.

Diff Detail

Build Status

Buildable 20944
Build 20944: arc lint + arc unit

Event Timeline

inglorion created this revision.Jul 31 2018, 8:16 PM

Herald added a subscriber: hiraditya. · View Herald TranscriptJul 31 2018, 8:16 PM

inglorion added a parent revision: D50126: [Support] fix TempFile infinite loop and permission denied errors.Jul 31 2018, 8:17 PM

The inspiration for this is http://crbug.com/856635, where we're seeing the same file name generated multiple times during a single run.

inglorion mentioned this in D50126: [Support] fix TempFile infinite loop and permission denied errors.Jul 31 2018, 8:44 PM

It would be an extra line of code, but I think this would be clearer (with
even less chance of collision) if you just choose a random number in [0,35]
and map it to [0-9][a-z]. The bit twiddling and array indexing seems like
unnecessary cleverness and someone could accidentally come along and break
it if they weren’t careful.

Yeah. On top of the cost of getting a number from the cryptographically secure random number generator, I don't think using mod instead of and will cost that much. I'll make the change. We could add a few more characters ("-_@", probably others), but I think digits+letters is already a good improvement to what we currently have, without having to worry about file systems not liking some characters.

Use 36-character set instead of 32-character set, per @zturner's suggestion.

I also realized that this would make the TempFileCollisions test flaky
(about 2% failure rate), so I modified it to get the expected failure
rate down below 1 per million.

inglorion edited the summary of this revision. (Show Details)Aug 1 2018, 1:15 PM

In D50127#1183838, @inglorion wrote:

The inspiration for this is http://crbug.com/856635, where we're seeing the same file name generated multiple times during a single run.

https://bugs.chromium.org/p/chromium/issues/detail?id=856635#c10 sounds like we only saw this if the pattern was made smaller. Are we sure this change fixes an actual problem?

In D50127#1185981, @thakis wrote:

https://bugs.chromium.org/p/chromium/issues/detail?id=856635#c10 sounds like we only saw this if the pattern was made smaller. Are we sure this change fixes an actual problem?

The failure in that bug does not reproduce on my local machine, but when I make the pattern smaller, I end up with a failure that looks a lot like it. For reference, with the old 16-character set and models of length 6, we have a 50%+ chance of collision after 5000 tempfiles or so. We easily generate that many in a ThinLTO build of Chromium. So it seems at least plausible that this is the same problem.

Aside from that, I think this is just a good thing to do. We're effectively increasing the probability that a name supposed to be unique is indeed unique, without making file names longer or consuming more entropy from the random number generator.

Revision Contents

Path

Size

clang/

test/

Analysis/

diagnostics/

plist-multi-file.c

2 lines

llvm/

lib/

Support/

Path.cpp

3 lines

unittests/

Support/

Path.cpp

4 lines

Diff 158456

clang/test/Analysis/diagnostics/plist-multi-file.c

	Show First 20 Lines • Show All 193 Lines • ▼ Show 20 Lines
	// CHECK-NEXT: <key>location</key>			// CHECK-NEXT: <key>location</key>
	// CHECK-NEXT: <dict>			// CHECK-NEXT: <dict>
	// CHECK-NEXT: <key>line</key><integer>2</integer>			// CHECK-NEXT: <key>line</key><integer>2</integer>
	// CHECK-NEXT: <key>col</key><integer>8</integer>			// CHECK-NEXT: <key>col</key><integer>8</integer>
	// CHECK-NEXT: <key>file</key><integer>1</integer>			// CHECK-NEXT: <key>file</key><integer>1</integer>
	// CHECK-NEXT: </dict>			// CHECK-NEXT: </dict>
	// CHECK-NEXT: <key>HTMLDiagnostics_files</key>			// CHECK-NEXT: <key>HTMLDiagnostics_files</key>
	// CHECK-NEXT: <array>			// CHECK-NEXT: <array>
	// CHECK-NEXT: <string>report-{{([0-9a-f]{6})}}.html</string>			// CHECK-NEXT: <string>report-{{([0-9a-v]{6})}}.html</string>
	// CHECK-NEXT: </array>			// CHECK-NEXT: </array>
	// CHECK-NEXT: </dict>			// CHECK-NEXT: </dict>
	// CHECK-NEXT: </array>			// CHECK-NEXT: </array>

llvm/lib/Support/Path.cpp

Show First 20 Lines • Show All 192 Lines • ▼ Show 20 Lines	createUniqueEntity(const Twine &Model, int &ResultFD,
// Limit the number of attempts we make, so that we don't infinite loop when		// Limit the number of attempts we make, so that we don't infinite loop when
// we run out of filenames that fit the model.		// we run out of filenames that fit the model.
std::error_code EC;		std::error_code EC;
for (int Retries = 128; Retries > 0; --Retries) {		for (int Retries = 128; Retries > 0; --Retries) {
// Replace '%' with random chars.		// Replace '%' with random chars.
for (unsigned i = 0, e = ModelStorage.size(); i != e; ++i) {		for (unsigned i = 0, e = ModelStorage.size(); i != e; ++i) {
if (ModelStorage[i] == '%')		if (ModelStorage[i] == '%')
ResultPath[i] =		ResultPath[i] =
"0123456789abcdef"[sys::Process::GetRandomNumber() & 15];		"0123456789abcdefghijklmnopqrstuv"[sys::Process::GetRandomNumber() &
		31];
}		}

// Try to open + create the file.		// Try to open + create the file.
switch (Type) {		switch (Type) {
case FS_File: {		case FS_File: {
EC = sys::fs::openFileForReadWrite(Twine(ResultPath.begin()), ResultFD,		EC = sys::fs::openFileForReadWrite(Twine(ResultPath.begin()), ResultFD,
sys::fs::CD_CreateNew, Flags, Mode);		sys::fs::CD_CreateNew, Flags, Mode);
if (EC) {		if (EC) {
▲ Show 20 Lines • Show All 1,032 Lines • Show Last 20 Lines

llvm/unittests/Support/Path.cpp

Show First 20 Lines • Show All 697 Lines • ▼ Show 20 Lines	if (T) {
return true;		return true;
} else {		} else {
logAllUnhandledErrors(T.takeError(), errs(),		logAllUnhandledErrors(T.takeError(), errs(),
"Failed to create temporary file: ");		"Failed to create temporary file: ");
return false;		return false;
}		}
};		};

// We should be able to create exactly 16 temporary files.		// We should be able to create exactly 32 temporary files.
for (int i = 0; i < 16; ++i)		for (int i = 0; i < 32; ++i)
EXPECT_TRUE(TryCreateTempFile());		EXPECT_TRUE(TryCreateTempFile());
EXPECT_FALSE(TryCreateTempFile());		EXPECT_FALSE(TryCreateTempFile());

for (fs::TempFile &T : TempFiles)		for (fs::TempFile &T : TempFiles)
cantFail(T.discard());		cantFail(T.discard());
}		}

TEST_F(FileSystemTest, CreateDir) {		TEST_F(FileSystemTest, CreateDir) {
▲ Show 20 Lines • Show All 1,008 Lines • Show Last 20 Lines