This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
6/9
FileSystem.h
-
lib/Support/
-
Support/
-
Unix/
4/9
Path.inc
-
Windows/
3/4
Path.inc
-
unittests/Support/
-
Support/
8/18
Path.cpp

Differential D78896

[Support] Add file lock/unlock functions
ClosedPublic

Authored by sepavloff on Apr 26 2020, 11:32 PM.

Download Raw Diff

Details

Reviewers

rnk
MaskRay
labath
sammccall
krytarowski
amccarth

Commits

rG536736995bf5: [Support] Add file lock/unlock functions
rGf51bc4fb60fb: [Support] Add file lock/unlock functions

Summary

New functions lockFile and unlockFile implement simple file locking.
They lock or unlock entire file. This must be enough to support
simulataneous writes to log files in parallel builds.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

sepavloff created this revision.Apr 26 2020, 11:32 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 26 2020, 11:32 PM

Herald added a subscriber: hiraditya. · View Herald Transcript

sepavloff added a child revision: D78897: [Support] raw_fd_ostream can lock file before write.Apr 26 2020, 11:47 PM

Harbormaster failed remote builds in B54740: Diff 260217!Apr 27 2020, 12:29 AM

labath added inline comments.Apr 27 2020, 5:36 AM

llvm/include/llvm/Support/FileSystem.h
1140–1141	this would be nicer as a std::chrono type.
1145	Should this be called `tryLockFile`, or maybe even `tryLockFileFor` (to mirror e.g. std::timed_mutex::try_lock_for) ?
llvm/unittests/Support/Path.cpp
2046–2047	Have you considered using flock(2) instead of F_SETLK? That might give you semantics which are a bit saner and a bit closer to what happens on windows (though file locking is always weird on posix systems)...

Updated patch according to reviewer's notes

sepavloff marked 5 inline comments as done.Apr 28 2020, 12:48 AM

sepavloff added inline comments.

llvm/include/llvm/Support/FileSystem.h
1140–1141	Agree.
1145	Make sense. Changed to `tryLockFile`.
llvm/unittests/Support/Path.cpp
2046–2047	IIUC, `flock` is not a POSIX call. GLIBC implements it on top of `fcntl`. The implementation also contains vague statement that it represents different mechanism on 4BSD: https://github.com/bminor/glibc/blob/92954ffa5a5662fbfde14febd7e5dcc358c85470/sysdeps/posix/flock.c#L18 . So I would refrain from using it, as the code must work on Linux, MacOS and *BSD. POSIX calls looks more portable.

Harbormaster failed remote builds in B54930: Diff 260555!Apr 28 2020, 1:34 AM

Looping in Kamil for his knowledge of "obscure" operating systems and other historical trivia.

However, I am still wondering (as I've alluded to in the other review) if there isn't a simpler way to guarantee atomic appends to a "log" file.

llvm/unittests/Support/Path.cpp
2046–2047	You're right that it is not a posix call -- I did not realize that. However, a brief search seems to indicate that all major operating systems (I tried linux, mac, openbsd, freebsd, netbsd) do have this function. I'm not sure in what situation is the glibc function you linked to used (glibc build system is very opaque to me), but it is definitely not used on linux, as linux kernel has first class support for this via SYS_flock. I'd expect the BSDs (that includes macs) to do the same, as they document flock as behaving differently than fcntl locks. You're right that fcntl locks are more portable on paper, but I am not really sure that is true in practice. OTOH, I am sure that the fcntl lock semantics are very weird. One example is given in the bsd man pages: ... This semantic means that applica- tions must be aware of any files that a subroutine library may access. For example if an application for updating the password file locks the password file database while making the update, and then calls getpwnam(3) to retrieve a record, the lock will be lost because getpwnam(3) opens, reads, and closes the password database. The database close will release all locks that the process has associated with the database, even if the library routine never requested a lock on the data- base. I also have very bad memories of trying to use this function, because the deadlock detection algorithm used can create false positives in multithreaded applications.

sepavloff added a child revision: D79066: [Support] Class to facilitate file locking.Apr 28 2020, 10:35 PM

If the goal is to synchronize writes to a highly contended log file, would it be better (and feasible) to have the individual threads/processes write timestamped output to separate streams that can be merged after-the-fact?

llvm/lib/Support/Windows/Path.inc
1271	`::Sleep(1)` sleeps for _at least_ one millisecond, but possibly for much longer. The default tick frequency on Windows is about 16 ms. (Many apps boost the system's timer frequency to 1 ms, but that's not universal and not recommended except for short periods when an app needs to display real-time media.) Sleeping too long once isn't a big deal. But Counter increments by 1 ms each time through the loop, regardless of how long it actually took, so this is likely to sleep too long many times. If the user requests a 10 ms timeout, they could actually wait 160 ms (in some near-worst case scenario). If this tracked the actual time elapsed, it probably would never be worse than 16 ms. You can use chrono's high resolution clock or Windows APIs like ::GetTickCount or ::QueryPerformanceCounter to find out how long the thread actually slept.

Updated patch

fcntl was replaced for flock on unix,
std::chrono was used to check timeout,
unit test was enabled to unix as well.

sepavloff marked 2 inline comments as done.Apr 30 2020, 2:54 AM

sepavloff added inline comments.

llvm/lib/Support/Windows/Path.inc
1271	Indeed, `Timeout` was in fact a number of attempts. Reworked this place using `std::chrono`.
llvm/unittests/Support/Path.cpp
2046–2047	After some hesitation I implemented your idea to use `flock` instead of `fcntl`. The concern was problems with locking on NFS shares, but it seems this was an issue for old implementation. Using `flock` allows to enable unit test for unix as well.

Harbormaster failed remote builds in B55274: Diff 261166!Apr 30 2020, 3:37 AM

Thanks for the timeout fix on Windows. I'm happy now, but I'm not qualified to approve other parts of this patch.

In D78896#2010324, @amccarth wrote:

If the goal is to synchronize writes to a highly contended log file, would it be better (and feasible) to have the individual threads/processes write timestamped output to separate streams that can be merged after-the-fact?

Such way is possible, but expensive. There must be a tool that collects these separate reports. And this tool must somehow know about build process so that it could pick up right files. Of course, it can pick up every report file in a build tree, but it this case somebody must clean the tree prior to build with getting statistics and this is a source of human errors. Such way is OK if there is some integrated system and running make is made by it and not by the user.

Instead, ability of compiler to log child process statistics to given file provide simple and fast way to obtain resource consumption data, which may be used not only by compiler developer but also by testers or other persons who need such data.

This looks good to me too. Just some stylistic comments inline.

llvm/lib/Support/Unix/Path.inc
34	add space before `<`
1055–1063	You could reduce nesting here by flipping the condition: `if (flock(...) == 0) return std::error_code();`
llvm/lib/Support/Windows/Path.inc
1267	No `else` after `return`.

labath added inline comments.May 4 2020, 1:30 AM

llvm/include/llvm/Support/FileSystem.h
1136–1137	Given the very different locking semantics on different OSs, it would be good to explicitly mention what this functions does (or does not) promise. Something along the lines of that this is guaranteed to work only if other processes also try to lock the file the same way, and that it is unspecified whether holding a lock prevents another process from modifying the said file.
llvm/lib/Support/Windows/Path.inc
1285	same here.
llvm/unittests/Support/Path.cpp
2045	Maybe a test where the second lock is taken on a separate thread, and the first lock is released while the second thread is waiting for it to become available?
2046–2047	Thanks. We can always revisit this if it turns out to be an issue somewhere.

Updated patch

Small format corrections,
Added test on locking in threads.

sepavloff marked 7 inline comments as done.May 5 2020, 6:14 AM

sepavloff added inline comments.

llvm/unittests/Support/Path.cpp
2045	Added test `lockFileThread`.

Harbormaster failed remote builds in B55779: Diff 262090!May 5 2020, 6:58 AM

labath added inline comments.May 5 2020, 7:09 AM

llvm/include/llvm/Support/FileSystem.h
1136	Is the "advisory" part really true on windows? My impression is that is not true (at least not in the sense that posix uses this term). WriteFile documentation says: If part of the file is locked by another process and the write operation overlaps the locked portion, WriteFile fails.
1138	I find this "may change it" part very ambiguous. Did you mean that the process may assume that no other process changes that file (because the operating system guarantees that) or that the process must assume that no other process will change it (because there is no mechanism in the OS to prevent it). Note that if my thoughts on WriteFile above are true, then neither of the two interpretations are correct, and all we can say is something like. "Attempts to lock the file by other processes will fail/block, but the caller should not assume that the file cannot be modified by uncooperating processes who access it without locking."
llvm/unittests/Support/Path.cpp
2080–2135	I get the impression this code is much more complicated then needed. There's a lot of synchronization going on but it still does not guarantee the that the file is unlocked while the other thread is inside the `tryLock` call (my goal was to get coverage for the while loop). How about something like: EC = fs::tryLockFile(FD1); ASSERT_NO_ERROR(EC); EC = fs::tryLockFile(FD2); ASSERT_ERROR(EC); std::thread LockThread([&] { EC2 = fs::tryLockFile(FD2, std::chrono::minutes(1)); }); std::this_thread::sleep_for(std::chrono::seconds(1)); EC = fs::unlockFile(FD1); ASSERT_NO_ERROR(EC); LockThread.join(); ASSERT_NO_ERROR(EC2); EC = fs::unlockFile(FD2); ASSERT_NO_ERROR(EC); It still does not guarantee that the other thread is inside the `tryLockFile` call, but it comes as close as we can get, and it avoids all the condition_variable overhead.
2089	I don't think this is a good use of auto per the coding standards http://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable.

Updated patch

Added blocking lock function,
Reorganized unit tests.

Harbormaster failed remote builds in B56154: Diff 262861!May 8 2020, 8:33 AM

krytarowski added inline comments.May 11 2020, 5:01 AM

llvm/unittests/Support/Path.cpp
2046–2047	flock() is implemented as a system call on BSDs since 1983, this means that it is pretty universal.

sepavloff mentioned this in D79066: [Support] Class to facilitate file locking.May 20 2020, 11:19 PM

I think this patch is mostly ok, but I'm still waiting on responses to two comments I made earlier.

llvm/include/llvm/Support/FileSystem.h
1138	It would be nice to get this clarified as well..
llvm/unittests/Support/Path.cpp
2080–2135	Waiting on a response to this. I still feel that this can be organized in a much simpler way without using so many explicit synchronization primitives.

Updated patch

Fixed documentation for tryLockFile,
Added comments to unit test.

sepavloff marked 2 inline comments as done.May 21 2020, 5:17 AM

sepavloff added inline comments.

llvm/include/llvm/Support/FileSystem.h
1138	I changed the wording in this point.
llvm/unittests/Support/Path.cpp
2080–2135	I put a diagram explaining what the test does. Actually there are two events, which ensures that the first attempt to lock file occurs in Thread2 after Thread1 locks the file but before it releases it. So the two calls to `tryLockFile` checks both cases, successful and unsuccessful. Each event requires a mutex and a condvar, so we have 4 synchronization objects. Simpler variants do not guarantee checking the both cases.

Harbormaster failed remote builds in B57514: Diff 265477!May 21 2020, 6:26 AM

vvereschaka added a subscriber: vvereschaka.May 26 2020, 10:51 AM

labath added inline comments.May 28 2020, 9:02 AM

llvm/unittests/Support/Path.cpp
2080–2135	I believe that each of these events requires synchronization, but a condition variable is not the only way to achieve that. Starting and joining a thread is also a form of synchronization, and it is much simpler (and something you have to do anyway. So, instead of starting a thread which, as a first order of business blocks on a condition variable, you could just delay starting the thread until such a time that the condition would be satisfied. Basically -- remove `cv.wait_for` and replace `cv.notify` with the creation of the thread object. Then, instead of waiting for the other thread to unblock you, you can just `join` it. For the second condition variable, you just create a fresh thread again. Having the thread bodies be smaller would make the code much easier to follow, and it's not like we have to worry about the performance overhead of creating a bunch of small threads here...

Updated unit test

sepavloff marked an inline comment as done.May 29 2020, 3:02 AM

sepavloff added inline comments.

llvm/unittests/Support/Path.cpp
2080–2135	Starting the second thread inside the first instead of waiting event indeed makes the test more compact, I rewrote the test accordingly. As for the second lock try, which happens after the file is unlocked, using new thread makes the logic more obscure. I moved synchronization stuff into the new class `Event`, it must make the test shorter and clearer.

Harbormaster failed remote builds in B58392: Diff 267154!May 29 2020, 4:51 AM

labath added inline comments.May 29 2020, 5:15 AM

llvm/unittests/Support/Path.cpp
2080–2135	Creating the Event class does make it a bit better, but I still maintain that this test is too complicated for what it really tests. Take a look at the following test: TEST_F(FileSystemTest, lockFileThread) { #if LLVM_ENABLE_THREADS int FD1, FD2; SmallString<64> TempPath; ASSERT_NO_ERROR(fs::createTemporaryFile("test", "temp", FD1, TempPath)); FileRemover Cleanup(TempPath); ASSERT_NO_ERROR(fs::openFileForReadWrite(TempPath, FD2, fs::CD_OpenExisting, fs::OF_Append)); ASSERT_NO_ERROR(fs::tryLockFile(FD1)); ASSERT_ERROR(fs::tryLockFile(FD2)); std::future<std::error_code> Future = std::async(std::launch::async, [&] { return fs::tryLockFile(FD2, std::chrono::seconds(5)); }); ASSERT_NO_ERROR(fs::unlockFile(FD1)); ASSERT_NO_ERROR(Future.get()); fs::unlockFile(FD2); ASSERT_NO_ERROR(fs::tryLockFile(FD1)); ASSERT_ERROR(fs::tryLockFile(FD2)); Future = std::async(std::launch::async, [&] { return fs::lockFile(FD2); }); ASSERT_NO_ERROR(fs::unlockFile(FD1)); ASSERT_NO_ERROR(Future.get()); fs::unlockFile(FD2); #endif } It tests the same thing as the test you wrote -- I obtained by applying series of semantics-preserving simplifications to it. This included fairly simple things like: inlining replacing patterns like std::thread(foo).join() with direct calls to `foo` moving code which does not block outside of a thread -- e.g. asserting that a lock attempt fails does not need to be done in a separate thread because it does not block. Only the blocking calls do. replacing a thread consisting of a single expression with a call to `std::async` removing unused variables produced by all of this However, the end result is a test which is about 3 times shorter than the original (28 lines vs 88), and it's almost linear -- each parallel section is only three lines long. I think it'd be pretty hard to argue that this is not more readable than the original.

sepavloff marked an inline comment as done.May 31 2020, 9:57 PM

sepavloff added inline comments.

llvm/unittests/Support/Path.cpp
2080–2135	Thank you very much for the code and detailed explanations! The way your code checks the result of `fs::tryLockFile` means it relies on particular sequence of statement execution. For the test to be successful the main thread after the creation of a separate thread must continue execution and execute `fs::unlockFile`. In this case when the separate thread starts, it sees unlocked file. It is the most probable case, but not the single. If rescheduling of the main thread occurs after thread creation but before execution of `fs::unlockFile` or there is a core ready to execute the new thread, this test will fail. There is no guarantee of ordering statement execution in different threads unless synchronization objects are used.

labath added inline comments.Jun 1 2020, 2:11 AM

llvm/unittests/Support/Path.cpp
2080–2135	I'm sorry, but I am unable to follow this line of reasoning. You're talking about this block of code, right? std::future<std::error_code> Future = std::async(std::launch::async, [&] { return fs::tryLockFile(FD2, std::chrono::seconds(5)); }); ASSERT_NO_ERROR(fs::unlockFile(FD1)); ASSERT_NO_ERROR(Future.get()); fs::unlockFile(FD2); Before the `std::async` statement, FD1 is locked, FD2 is unlocked. After it, there are two actions that can execute in arbitrary order (or concurrently): `fs::unlockFile(FD1)` on the main thread and `fs::tryLockFile(FD2, std::chrono::seconds(5))` on the "async" thread. If the `unlockFile` executes first, it will unlock the file, and the subsequent `tryLockFile` will immediately succeed. If `tryLockFile` is scheduled first, then it will get a lock failure and will start to wait. While it waits the main thread will get scheduled, unlock the file, and then the `tryLockFile` will succeed again. If an evil scheduler decides to not schedule the main thread for five seconds, then `tryLockFile` will fail, but there's nothing we can do about that except increase the timeout. This is the exact same situation that can happen with condition variables: // thread 2 ... DoUnlockEvent.signal(); if (UseBlockingCall) ECT2b = fs::lockFile(FD2); else ECT2b = fs::tryLockFile(FD2, std::chrono::seconds(5)); ... // thread 1 ECT1a = fs::tryLockFile(FD1); if (ECT1a) return; auto Thread2 = std::thread(Thread2Body); DoUnlockEvent.wait(); ECT1b = fs::unlockFile(FD1); ... After thread1 is unblocked by `DoUnlockEvent.signal();`, we again have two runnable threads (the `if` statement on thread 2, and the `fs::unlockFile(FD1)` call on thread 1) and it is up to the scheduler to determine their order. It's not true that there are no synchronization objects here. We have `std::future` and the thread object contained within. Creation of the future (via std::async) establishes a happens-before relationship between the actions taken before `std::async` is called on the main thread, and the body of the async thread. This is exactly what happens with `DoUnlockEvent.signal()` and `DoUnlockEvent.wait()`. And calling `future::get` establishes a happens-before relationship between the body of the async thread and the code that comes after the `get` call. That's exactly what would happen with `Thread2.join()` in your example. My point is that "launching a async thread" is a simpler way of synchronizing than "waiting on a cv", and "getting a future" is simpler than "joining a thread which 'returns' a result through a global variable".

Updated unit test

sepavloff marked an inline comment as done.Jun 2 2020, 8:13 AM

sepavloff added inline comments.

llvm/unittests/Support/Path.cpp
2080–2135	Sorry, I didn't notice that you use operation with enough long timeout. In this case no flaky behavior should be observed. I updated the unit test. Thank you very much!

Awesome, let's get this out the door.

This revision is now accepted and ready to land.Jun 2 2020, 9:51 AM

Harbormaster failed remote builds in B58773: Diff 267893!Jun 2 2020, 9:52 AM

Closed by commit rGf51bc4fb60fb: [Support] Add file lock/unlock functions (authored by sepavloff). · Explain WhyJun 2 2020, 10:25 PM

This revision was automatically updated to reflect the committed changes.

This patch broke the Solaris buildbots (Builder clang-solaris11-sparcv9 Build #5494, Builder clang-solaris11-amd64 Build #4477):

[24/2656] Building CXX object lib/Support/CMakeFiles/LLVMSupport.dir/Path.cpp.o
FAILED: lib/Support/CMakeFiles/LLVMSupport.dir/Path.cpp.o 
/usr/gcc/7/bin/c++  -DGTEST_HAS_RTTI=0 -D_DEBUG -D_FILE_OFFSET_BITS=64 -D__STDC_CONSTANT_MACROS -D__STDC_FORMAT_MACROS -D__STDC_LIMIT_MACROS -Ilib/Support -I/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support -Iinclude -I/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/include -I/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/include/llvm/Support/Solaris -fPIC -fvisibility-inlines-hidden -Werror=date-time -Wall -Wextra -Wno-unused-parameter -Wwrite-strings -Wcast-qual -Wno-missing-field-initializers -pedantic -Wno-long-long -Wimplicit-fallthrough -Wno-maybe-uninitialized -Wno-noexcept-type -Wdelete-non-virtual-dtor -Wno-comment -fdiagnostics-color -ffunction-sections -fdata-sections -O3    -UNDEBUG -std=c++14  -fno-exceptions -fno-rtti -MD -MT lib/Support/CMakeFiles/LLVMSupport.dir/Path.cpp.o -MF lib/Support/CMakeFiles/LLVMSupport.dir/Path.cpp.o.d -o lib/Support/CMakeFiles/LLVMSupport.dir/Path.cpp.o -c /opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Path.cpp
In file included from /opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Path.cpp:1151:0:
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc: In function ‘std::error_code llvm::sys::fs::tryLockFile(int, std::chrono::milliseconds)’:
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1055:21: error: ‘LOCK_EX’ was not declared in this scope
     if (::flock(FD, LOCK_EX | LOCK_NB) == 0)
                     ^~~~~~~
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1055:21: note: suggested alternative: ‘LOCK_HELD’
     if (::flock(FD, LOCK_EX | LOCK_NB) == 0)
                     ^~~~~~~
                     LOCK_HELD
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1055:31: error: ‘LOCK_NB’ was not declared in this scope
     if (::flock(FD, LOCK_EX | LOCK_NB) == 0)
                               ^~~~~~~
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1055:31: note: suggested alternative: ‘_CLOCK_T’
     if (::flock(FD, LOCK_EX | LOCK_NB) == 0)
                               ^~~~~~~
                               _CLOCK_T
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc: In function ‘std::error_code llvm::sys::fs::lockFile(int)’:
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1068:19: error: ‘LOCK_EX’ was not declared in this scope
   if (::flock(FD, LOCK_EX) == 0)
                   ^~~~~~~
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1068:19: note: suggested alternative: ‘LOCK_HELD’
   if (::flock(FD, LOCK_EX) == 0)
                   ^~~~~~~
                   LOCK_HELD
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc: In function ‘std::error_code llvm::sys::fs::unlockFile(int)’:
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1074:19: error: ‘LOCK_UN’ was not declared in this scope
   if (::flock(FD, LOCK_UN) == -1)
                   ^~~~~~~
/opt/llvm-buildbot/home/solaris11-amd64/clang-solaris11-amd64/llvm/llvm/lib/Support/Unix/Path.inc:1074:19: note: suggested alternative: ‘LONG_MIN’
   if (::flock(FD, LOCK_UN) == -1)
                   ^~~~~~~
                   LONG_MIN

As detailed e.g. in x/sys/unix: Solaris lacks Flock (plus related constants) #11113, Solaris
doesn't support the non-standard flock.

Reverted: https://reviews.llvm.org/rG8577595e03faf740ee0cfae1bbb2d0ff6f4e4516.

Because flock is unsupported on Solaris, this patch has been rewritten on top of fcntl facility. It also required unit test to be rewritten as well.

If there is no objections, I will commit the new patch in couple of days.

This revision is now accepted and ready to land.Jun 7 2020, 9:48 PM

Updated patch

flock call was replaced by fcntl in hope that this change would enable this facility on Solaris OS, where flock is unavailable.
As locks created by fcntl work on per-process basis, unit test based on multithreding does not work. A new unit test where locks are obtained in different processes was added.

If nobody will argue in couple of days, this patch will be committed.

MaskRay added inline comments.Jun 7 2020, 10:01 PM

llvm/lib/Support/Unix/Path.inc
1066	I feel uneasy with usleep(1000). Why is it needed?

Harbormaster failed remote builds in B59425: Diff 269104!Jun 7 2020, 11:25 PM

It's a pitty about flock&solaris. It's a much saner api, and a lot closer to the windows implementation.

I guess we have to stick to fcntl then. The new version looks good. I just think it would be good to acknowledge the weirdness of the flock api in the method documentation. Maybe something like "Care should be taken when using this function in a multithreaded context, as it may not prevent other threads in the same process from obtaining a lock on the same file, even if they are using a different file descriptor."

llvm/lib/Support/Unix/Path.inc
1066	That's because the os provides no mechanism to wait for a lock to become available for a given amount of time (well.... one could use `SIGALRM`s to achieve that, but those come with problems of their own..) Using exponential backoff for this might be a good idea, but I also think that can wait until this becomes a problem.
llvm/unittests/Support/ProgramTest.cpp
369 ↗	(On Diff #269104)	Although this will work in this particular case, using Twines in this way is very dangerous (this is _almost_ the same as the one that http://llvm.org/docs/ProgrammersManual.html#llvm-adt-twine-h warns about). As performance is definitely not relevant here, it would be better to just use strings.

MaskRay added inline comments.Jun 8 2020, 8:42 AM

llvm/lib/Support/Unix/Path.inc
1066	The standard `try_*` (std::mutex::try_lock, pthread_mutex_trylock, etc) return immediately upon a failure. The backoff strategy should be done by the application to avoid the misnomer.

MaskRay requested changes to this revision.Jun 8 2020, 8:42 AM

This revision now requires changes to proceed.Jun 8 2020, 8:42 AM

Updated patch

Rebased,
Added note about using in multithreaded environment,
Used std::string instead of Twine in unit test.

In D78896#2079169, @labath wrote:

It's a pitty about flock&solaris. It's a much saner api, and a lot closer to the windows implementation.

I guess we have to stick to fcntl then. The new version looks good. I just think it would be good to acknowledge the weirdness of the flock api in the method documentation. Maybe something like "Care should be taken when using this function in a multithreaded context, as it may not prevent other threads in the same process from obtaining a lock on the same file, even if they are using a different file descriptor."

Added the note in the description of the function in FileSystem.h.

llvm/lib/Support/Unix/Path.inc
1066	If the function is called as `tryLockFile(FD)` it behaves exactly as the standard `try_` functions, it returns immediately. The backoff strategy should be done by the application to avoid the misnomer. This use case is often and even standard library provides functions that attempt to do an operation in specified time (_for). As the only expected usage of file locks is writing to log file by concurrent processes and it requires repeating the operation if unsuccessful, so integrations of the wait loop into the service function seems natural.

Harbormaster failed remote builds in B59584: Diff 269429!Jun 9 2020, 2:09 AM

labath added inline comments.Jun 9 2020, 2:41 AM

llvm/lib/Support/Unix/Path.inc
1066	@MaskRay, I'm not sure how to interpret your comment -- would your concerns be addressed if this function was renamed to `tryLockFileFor` (and deleting the default argument value) ? Because I too think that would be a better name for this function (though I don't feel very strongly about that). `tryLockFile` could then be implemented as `tryLockFileFor(seconds(0))`, or not implemented at all, if it's not needed. As for the retries, I agree with @sepavloff, that it makes sense to implement them here. The way I see it the "retries" are an implementation detail and the function as a whole still behaves like the other `try_lock_for` functions. E.g. description of `std::timed_mutex::try_lock_for` says: "tries to lock the mutex, returns if the mutex has been unavailable for the specified timeout duration" -- that is still true. The main difference is that the function returns an error_code instead of a bool, but there's not much that can be done about that, as this operation can fail for a lot of other reasons. I think this is sort similar to `std::atomic<T>::fetch_add`. E.g., arm64 does not have the equivalent of the `lock add` instruction, so this function compiles to a `ldaxr+stlxr` loop, which attempts the update several times until successful -- the caller does not have to retry manually. (And down at the microarchitectural level, even `lock add` probably uses retries&timeouts to implement the functionality.)

MaskRay added inline comments.Jun 9 2020, 9:44 AM

llvm/lib/Support/Unix/Path.inc
1066	Adding both `try` and `for` to the name looks good to me. POSIX uses `timedlock` which may be considered as well.

Updated patch

tryLockFile was renamed to tryLockFileWithTimeout,
added new function tryLockFile, which returns immediately.

In D78896#2079169, @labath wrote:

It's a pitty about flock&solaris. It's a much saner api, and a lot closer to the windows implementation.

I guess we have to stick to fcntl then. The new version looks good. I just think it would be good to acknowledge the weirdness of the flock api in the method documentation. Maybe something like "Care should be taken when using this function in a multithreaded context, as it may not prevent other threads in the same process from obtaining a lock on the same file, even if they are using a different file descriptor."

Added the note in the description of the function in FileSystem.h.

llvm/lib/Support/Unix/Path.inc
1066	Using `for` is convenient for standard functions, because they accept single argument of type `std::chrono::duration`, so an expression like: f.wait_for(2500ms) looks self-documented. In the case of `tryLockFile` it becomes something like: tryLockFileFor(FileHandle, 2500ms) which is pretty ugly. I renamed `tryLockFile` to `tryLockFileWithTimeout`, which must be unambiguous, although long. Also a function `tryLock` was added, that does not wait and return immediately.

Forgotten changes

Harbormaster failed remote builds in B59750: Diff 269754!Jun 10 2020, 2:41 AM

Harbormaster failed remote builds in B59753: Diff 269760!Jun 10 2020, 3:14 AM

Updated patch

Changed name of the function tryLockFileWithTimeout to tryLockFile. This
name is shorter thus more readable. Expression like tryLockFile(SomeFile)
looks natural, tryLockFile(SomeFile, 2500ms) also is understandable without
documentation. Setting default timeout value to zero is natural, as a single
attempt to lock, which is associted with try* functions, can be considered
as a particular case of very short timeout.

Harbormaster failed remote builds in B60949: Diff 271931!Jun 19 2020, 2:07 AM

Any feedback?

Rebased patch

Harbormaster completed remote builds in B63690: Diff 276917.Jul 10 2020, 12:19 AM

@amccarth Could you please review Windows part of this patch? Thank you.

As I noted on April 30, I'm OK with the Windows portions of this. I didn't explicitly "Accept" because I didn't want to pre-empt the concerns of the other reviewers.

I see that @MaskRay is still marked as requesting revisions.

In D78896#2176413, @amccarth wrote:

As I noted on April 30, I'm OK with the Windows portions of this. I didn't explicitly "Accept" because I didn't want to pre-empt the concerns of the other reviewers.

I see that @MaskRay is still marked as requesting revisions.

Sorry for trouble, the thread is so long that I missed your review.
Thank you!

MaskRay accepted this revision.Jul 27 2020, 11:59 PM

This revision is now accepted and ready to land.Jul 27 2020, 11:59 PM

Closed by commit rG536736995bf5: [Support] Add file lock/unlock functions (authored by sepavloff). · Explain WhyJul 28 2020, 2:45 AM

This revision was automatically updated to reflect the committed changes.

sepavloff added a commit: rG536736995bf5: [Support] Add file lock/unlock functions.

Revision Contents

Path

Size

llvm/

include/

llvm/

Support/

FileSystem.h

33 lines

lib/

Support/

Unix/

Path.inc

29 lines

Windows/

Path.inc

37 lines

unittests/

Support/

Path.cpp

108 lines

Diff 267154

llvm/include/llvm/Support/FileSystem.h

	Show First 20 Lines • Show All 1,125 Lines • ▼ Show 20 Lines
	/// of the opened file, and that path is stored in this			/// of the opened file, and that path is stored in this
	/// location.			/// location.
	/// @returns a platform-specific file descriptor if \a Name has been opened,			/// @returns a platform-specific file descriptor if \a Name has been opened,
	/// otherwise an error object.			/// otherwise an error object.
	Expected<file_t>			Expected<file_t>
	openNativeFileForRead(const Twine &Name, OpenFlags Flags = OF_None,			openNativeFileForRead(const Twine &Name, OpenFlags Flags = OF_None,
	SmallVectorImpl<char> *RealPath = nullptr);			SmallVectorImpl<char> *RealPath = nullptr);

				/// Try to locks the file during the specified time.
				///
				/// This function implements advisory locking on entire file. If it returns
				labathUnsubmitted Not Done Reply Inline Actions Is the "advisory" part really true on windows? My impression is that is not true (at least not in the sense that posix uses this term). WriteFile documentation says: If part of the file is locked by another process and the write operation overlaps the locked portion, WriteFile fails. labath: Is the "advisory" part really true on windows? My impression is that is not true (at least not…
				/// <em>errc::success</em>, the file is locked by the calling process. Until the
				labathUnsubmitted Done Reply Inline Actions Given the very different locking semantics on different OSs, it would be good to explicitly mention what this functions does (or does not) promise. Something along the lines of that this is guaranteed to work only if other processes also try to lock the file the same way, and that it is unspecified whether holding a lock prevents another process from modifying the said file. labath: Given the very different locking semantics on different OSs, it would be good to explicitly…
				/// process unlocks the file by calling \a unlockFile, all attempts to lock the
				labathUnsubmitted Not Done Reply Inline Actions I find this "may change it" part very ambiguous. Did you mean that the process may assume that no other process changes that file (because the operating system guarantees that) or that the process must assume that no other process will change it (because there is no mechanism in the OS to prevent it). Note that if my thoughts on WriteFile above are true, then neither of the two interpretations are correct, and all we can say is something like. "Attempts to lock the file by other processes will fail/block, but the caller should not assume that the file cannot be modified by uncooperating processes who access it without locking." labath: I find this "may change it" part very ambiguous. Did you mean that the process may assume that…
				labathUnsubmitted Not Done Reply Inline Actions It would be nice to get this clarified as well.. labath: It would be nice to get this clarified as well..
				sepavloffAuthorUnsubmitted Done Reply Inline Actions I changed the wording in this point. sepavloff: I changed the wording in this point.
				/// same file will fail/block. The process that locked the file may assume that
				/// none of other processes read or write this file, provided that all processes
				/// lock the file prior to accessing its content.
				labathUnsubmitted Done Reply Inline Actions this would be nicer as a std::chrono type. labath: this would be nicer as a std::chrono type.
				sepavloffAuthorUnsubmitted Done Reply Inline Actions Agree. sepavloff: Agree.
				///
				/// @param File The descriptor representing the file to lock.
				/// @param Timeout Time in milliseconds that the process should wait before
				/// reporting lock failure. Zero value means try to get lock only
				labathUnsubmitted Done Reply Inline Actions Should this be called `tryLockFile`, or maybe even `tryLockFileFor` (to mirror e.g. std::timed_mutex::try_lock_for) ? labath: Should this be called `tryLockFile`, or maybe even `tryLockFileFor` (to mirror e.g. [[ https…
				sepavloffAuthorUnsubmitted Done Reply Inline Actions Make sense. Changed to `tryLockFile`. sepavloff: Make sense. Changed to `tryLockFile`.
				/// once.
				/// @returns errc::success if lock is successfully obtained,
				/// errc::no_lock_available if the file cannot be locked, or platform-specific
				/// error_code otherwise.
				std::error_code
				tryLockFile(int FD,
				std::chrono::milliseconds Timeout = std::chrono::milliseconds(0));

				/// Lock the file.
				///
				/// This function acts as @ref tryLockFile(int,std::chrono::milliseconds) but it
				/// waits infinitely.
				std::error_code lockFile(int FD);

				/// Unlock the file.
				///
				/// @param File The descriptor representing the file to unlock.
				/// @returns errc::success if lock is successfully released or platform-specific
				/// error_code otherwise.
				std::error_code unlockFile(int FD);

	/// @brief Close the file object. This should be used instead of ::close for			/// @brief Close the file object. This should be used instead of ::close for
	/// portability. On error, the caller should assume the file is closed, as is			/// portability. On error, the caller should assume the file is closed, as is
	/// the case for Process::SafelyCloseFileDescriptor			/// the case for Process::SafelyCloseFileDescriptor
	///			///
	/// @param F On input, this is the file to close. On output, the file is			/// @param F On input, this is the file to close. On output, the file is
	/// set to kInvalidFile.			/// set to kInvalidFile.
	///			///
	/// @returns An error code if closing the file failed. Typically, an error here			/// @returns An error code if closing the file failed. Typically, an error here
	▲ Show 20 Lines • Show All 303 Lines • Show Last 20 Lines

llvm/lib/Support/Unix/Path.inc

Show All 25 Lines
#endif		#endif
#ifdef HAVE_UNISTD_H		#ifdef HAVE_UNISTD_H
#include <unistd.h>		#include <unistd.h>
#endif		#endif
#ifdef HAVE_SYS_MMAN_H		#ifdef HAVE_SYS_MMAN_H
#include <sys/mman.h>		#include <sys/mman.h>
#endif		#endif

		#include <sys/file.h>
		labathUnsubmitted Done Reply Inline Actions add space before `<` labath: add space before `<`
#include <dirent.h>		#include <dirent.h>
#include <pwd.h>		#include <pwd.h>

#ifdef __APPLE__		#ifdef __APPLE__
#include <mach-o/dyld.h>		#include <mach-o/dyld.h>
#include <sys/attr.h>		#include <sys/attr.h>
#include <copyfile.h>		#include <copyfile.h>
#elif defined(__FreeBSD__)		#elif defined(__FreeBSD__)
▲ Show 20 Lines • Show All 1,000 Lines • ▼ Show 20 Lines	#else
ssize_t NumRead =		ssize_t NumRead =
sys::RetryAfterSignal(-1, ::read, FD, Buf.data(), Buf.size());		sys::RetryAfterSignal(-1, ::read, FD, Buf.data(), Buf.size());
#endif		#endif
if (NumRead == -1)		if (NumRead == -1)
return errorCodeToError(std::error_code(errno, std::generic_category()));		return errorCodeToError(std::error_code(errno, std::generic_category()));
return NumRead;		return NumRead;
}		}

		std::error_code tryLockFile(int FD, std::chrono::milliseconds Timeout) {
		auto Start = std::chrono::steady_clock::now();
		auto End = Start + Timeout;
		do {
		if (::flock(FD, LOCK_EX \| LOCK_NB) == 0)
		return std::error_code();
		int Error = errno;
		if (Error == EWOULDBLOCK) {
		usleep(1000);
		continue;
		}
		return std::error_code(Error, std::generic_category());
		} while (std::chrono::steady_clock::now() < End);
		labathUnsubmitted Done Reply Inline Actions You could reduce nesting here by flipping the condition: `if (flock(...) == 0) return std::error_code();` labath: You could reduce nesting here by flipping the condition: `if (flock(...) == 0) return std…
		return make_error_code(errc::no_lock_available);
		}

		MaskRayUnsubmitted Not Done Reply Inline Actions I feel uneasy with usleep(1000). Why is it needed? MaskRay: I feel uneasy with usleep(1000). Why is it needed?
		labathUnsubmitted Not Done Reply Inline Actions That's because the os provides no mechanism to wait for a lock to become available for a given amount of time (well.... one could use `SIGALRM`s to achieve that, but those come with problems of their own..) Using exponential backoff for this might be a good idea, but I also think that can wait until this becomes a problem. labath: That's because the os provides no mechanism to wait for a lock to become available for a given…
		MaskRayUnsubmitted Not Done Reply Inline Actions The standard `try_` (std::mutex::try_lock, pthread_mutex_trylock, etc) return immediately upon a failure. The backoff strategy should be done by the application to avoid the misnomer. MaskRay:* The standard `try_*` (std::mutex::try_lock, pthread_mutex_trylock, etc) return immediately upon…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions If the function is called as `tryLockFile(FD)` it behaves exactly as the standard `try_` functions, it returns immediately. The backoff strategy should be done by the application to avoid the misnomer. This use case is often and even standard library provides functions that attempt to do an operation in specified time (_for). As the only expected usage of file locks is writing to log file by concurrent processes and it requires repeating the operation if unsuccessful, so integrations of the wait loop into the service function seems natural. sepavloff: If the function is called as `tryLockFile(FD)` it behaves exactly as the standard `try_*`…
		labathUnsubmitted Not Done Reply Inline Actions @MaskRay, I'm not sure how to interpret your comment -- would your concerns be addressed if this function was renamed to `tryLockFileFor` (and deleting the default argument value) ? Because I too think that would be a better name for this function (though I don't feel very strongly about that). `tryLockFile` could then be implemented as `tryLockFileFor(seconds(0))`, or not implemented at all, if it's not needed. As for the retries, I agree with @sepavloff, that it makes sense to implement them here. The way I see it the "retries" are an implementation detail and the function as a whole still behaves like the other `try_lock_for` functions. E.g. description of `std::timed_mutex::try_lock_for` says: "tries to lock the mutex, returns if the mutex has been unavailable for the specified timeout duration" -- that is still true. The main difference is that the function returns an error_code instead of a bool, but there's not much that can be done about that, as this operation can fail for a lot of other reasons. I think this is sort similar to `std::atomic<T>::fetch_add`. E.g., arm64 does not have the equivalent of the `lock add` instruction, so this function compiles to a `ldaxr+stlxr` loop, which attempts the update several times until successful -- the caller does not have to retry manually. (And down at the microarchitectural level, even `lock add` probably uses retries&timeouts to implement the functionality.) labath: @MaskRay, I'm not sure how to interpret your comment -- would your concerns be addressed if…
		MaskRayUnsubmitted Not Done Reply Inline Actions Adding both `try` and `for` to the name looks good to me. POSIX uses `timedlock` which may be considered as well. MaskRay: Adding both `try` and `for` to the name looks good to me. POSIX uses `timedlock` which may be…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Using `for` is convenient for standard functions, because they accept single argument of type `std::chrono::duration`, so an expression like: f.wait_for(2500ms) looks self-documented. In the case of `tryLockFile` it becomes something like: tryLockFileFor(FileHandle, 2500ms) which is pretty ugly. I renamed `tryLockFile` to `tryLockFileWithTimeout`, which must be unambiguous, although long. Also a function `tryLock` was added, that does not wait and return immediately. sepavloff: Using `for` is convenient for standard functions, because they accept single argument of type…
		std::error_code lockFile(int FD) {
		if (::flock(FD, LOCK_EX) == 0)
		return std::error_code();
		return std::error_code(errno, std::generic_category());
		}

		std::error_code unlockFile(int FD) {
		if (::flock(FD, LOCK_UN) == -1)
		return std::error_code(errno, std::generic_category());
		return std::error_code();
		}

std::error_code closeFile(file_t &F) {		std::error_code closeFile(file_t &F) {
file_t TmpF = F;		file_t TmpF = F;
F = kInvalidFile;		F = kInvalidFile;
return Process::SafelyCloseFileDescriptor(TmpF);		return Process::SafelyCloseFileDescriptor(TmpF);
}		}

template <typename T>		template <typename T>
static std::error_code remove_directories_impl(const T &Entry,		static std::error_code remove_directories_impl(const T &Entry,
▲ Show 20 Lines • Show All 193 Lines • Show Last 20 Lines

llvm/lib/Support/Windows/Path.inc

Show First 20 Lines • Show All 1,249 Lines • ▼ Show 20 Lines	Expected<size_t> readNativeFileSlice(file_t FileHandle,
MutableArrayRef<char> Buf,		MutableArrayRef<char> Buf,
uint64_t Offset) {		uint64_t Offset) {
OVERLAPPED Overlapped = {};		OVERLAPPED Overlapped = {};
Overlapped.Offset = uint32_t(Offset);		Overlapped.Offset = uint32_t(Offset);
Overlapped.OffsetHigh = uint32_t(Offset >> 32);		Overlapped.OffsetHigh = uint32_t(Offset >> 32);
return readNativeFileImpl(FileHandle, Buf, &Overlapped);		return readNativeFileImpl(FileHandle, Buf, &Overlapped);
}		}

		std::error_code tryLockFile(int FD, std::chrono::milliseconds Timeout) {
		DWORD Flags = LOCKFILE_EXCLUSIVE_LOCK \| LOCKFILE_FAIL_IMMEDIATELY;
		OVERLAPPED OV = {0};
		file_t File = convertFDToNativeFile(FD);
		auto Start = std::chrono::steady_clock::now();
		auto End = Start + Timeout;
		do {
		if (::LockFileEx(File, Flags, 0, MAXDWORD, MAXDWORD, &OV))
		return std::error_code();
		DWORD Error = ::GetLastError();
		labathUnsubmitted Done Reply Inline Actions No `else` after `return`. labath: No `else` after `return`.
		if (Error == ERROR_LOCK_VIOLATION) {
		::Sleep(1);
		continue;
		}
		amccarthUnsubmitted Not Done Reply Inline Actions `::Sleep(1)` sleeps for _at least_ one millisecond, but possibly for much longer. The default tick frequency on Windows is about 16 ms. (Many apps boost the system's timer frequency to 1 ms, but that's not universal and not recommended except for short periods when an app needs to display real-time media.) Sleeping too long once isn't a big deal. But Counter increments by 1 ms each time through the loop, regardless of how long it actually took, so this is likely to sleep too long many times. If the user requests a 10 ms timeout, they could actually wait 160 ms (in some near-worst case scenario). If this tracked the actual time elapsed, it probably would never be worse than 16 ms. You can use chrono's high resolution clock or Windows APIs like ::GetTickCount or ::QueryPerformanceCounter to find out how long the thread actually slept. amccarth: `::Sleep(1)` sleeps for _at least_ one millisecond, but possibly for much longer. The default…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Indeed, `Timeout` was in fact a number of attempts. Reworked this place using `std::chrono`. sepavloff: Indeed, `Timeout` was in fact a number of attempts. Reworked this place using `std::chrono`.
		return mapWindowsError(Error);
		} while (std::chrono::steady_clock::now() < End);
		return mapWindowsError(ERROR_LOCK_VIOLATION);
		}

		std::error_code lockFile(int FD) {
		DWORD Flags = LOCKFILE_EXCLUSIVE_LOCK;
		OVERLAPPED OV = {0};
		file_t File = convertFDToNativeFile(FD);
		if (::LockFileEx(File, Flags, 0, MAXDWORD, MAXDWORD, &OV))
		return std::error_code();
		DWORD Error = ::GetLastError();
		return mapWindowsError(Error);
		}
		labathUnsubmitted Done Reply Inline Actions same here. labath: same here.

		std::error_code unlockFile(int FD) {
		OVERLAPPED OV = { 0 };
		file_t File = convertFDToNativeFile(FD);
		if (::UnlockFileEx(File, 0, MAXDWORD, MAXDWORD, &OV))
		return std::error_code();
		return mapWindowsError(::GetLastError());
		}

std::error_code closeFile(file_t &F) {		std::error_code closeFile(file_t &F) {
file_t TmpF = F;		file_t TmpF = F;
F = kInvalidFile;		F = kInvalidFile;
if (!::CloseHandle(TmpF))		if (!::CloseHandle(TmpF))
return mapWindowsError(::GetLastError());		return mapWindowsError(::GetLastError());
return std::error_code();		return std::error_code();
}		}

▲ Show 20 Lines • Show All 239 Lines • Show Last 20 Lines

llvm/unittests/Support/Path.cpp

//===- llvm/unittest/Support/Path.cpp - Path tests ------------------------===//		//===- llvm/unittest/Support/Path.cpp - Path tests ------------------------===//
		Lint: Lint Inline Actions clang-format suggested style edits found: Lint: Lint: clang-format suggested style edits found:
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/Support/Path.h"		#include "llvm/Support/Path.h"
Show All 24 Lines
#include <winerror.h>		#include <winerror.h>
#endif		#endif

#ifdef LLVM_ON_UNIX		#ifdef LLVM_ON_UNIX
#include <pwd.h>		#include <pwd.h>
#include <sys/stat.h>		#include <sys/stat.h>
#endif		#endif

		#include <condition_variable>
		#include <mutex>
		#include <thread>

using namespace llvm;		using namespace llvm;
using namespace llvm::sys;		using namespace llvm::sys;

#define ASSERT_NO_ERROR(x) \		#define ASSERT_NO_ERROR(x) \
if (std::error_code ASSERT_NO_ERROR_ec = x) { \		if (std::error_code ASSERT_NO_ERROR_ec = x) { \
SmallString<128> MessageStorage; \		SmallString<128> MessageStorage; \
raw_svector_ostream Message(MessageStorage); \		raw_svector_ostream Message(MessageStorage); \
Message << #x ": did not return errc::success.\n" \		Message << #x ": did not return errc::success.\n" \
▲ Show 20 Lines • Show All 1,983 Lines • ▼ Show 20 Lines	TEST_F(FileSystemTest, widenPath) {

// Check the removal of "dots".		// Check the removal of "dots".
Input = ShareName + DirName + "\\.\\foo\\.\\.." + FileName;		Input = ShareName + DirName + "\\.\\foo\\.\\.." + FileName;
ASSERT_NO_ERROR(windows::widenPath(Input, Result));		ASSERT_NO_ERROR(windows::widenPath(Input, Result));
EXPECT_EQ(Result, Expected);		EXPECT_EQ(Result, Expected);
}		}
#endif		#endif

		TEST_F(FileSystemTest, lockFile) {
		labathUnsubmitted Done Reply Inline Actions Maybe a test where the second lock is taken on a separate thread, and the first lock is released while the second thread is waiting for it to become available? labath: Maybe a test where the second lock is taken on a separate thread, and the first lock is…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Added test `lockFileThread`. sepavloff: Added test `lockFileThread`.
		int FD1, FD2;
		SmallString<64> TempPath;
		labathUnsubmitted Not Done Reply Inline Actions Have you considered using flock(2) instead of F_SETLK? That might give you semantics which are a bit saner and a bit closer to what happens on windows (though file locking is always weird on posix systems)... labath: Have you considered using flock(2) instead of F_SETLK? That might give you semantics which are…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions IIUC, `flock` is not a POSIX call. GLIBC implements it on top of `fcntl`. The implementation also contains vague statement that it represents different mechanism on 4BSD: https://github.com/bminor/glibc/blob/92954ffa5a5662fbfde14febd7e5dcc358c85470/sysdeps/posix/flock.c#L18 . So I would refrain from using it, as the code must work on Linux, MacOS and BSD. POSIX calls looks more portable. sepavloff:* IIUC, `flock` is not a POSIX call. GLIBC implements it on top of `fcntl`. The implementation…
		labathUnsubmitted Not Done Reply Inline Actions You're right that it is not a posix call -- I did not realize that. However, a brief search seems to indicate that all major operating systems (I tried linux, mac, openbsd, freebsd, netbsd) do have this function. I'm not sure in what situation is the glibc function you linked to used (glibc build system is very opaque to me), but it is definitely not used on linux, as linux kernel has first class support for this via SYS_flock. I'd expect the BSDs (that includes macs) to do the same, as they document flock as behaving differently than fcntl locks. You're right that fcntl locks are more portable on paper, but I am not really sure that is true in practice. OTOH, I am sure that the fcntl lock semantics are very weird. One example is given in the bsd man pages: ... This semantic means that applica- tions must be aware of any files that a subroutine library may access. For example if an application for updating the password file locks the password file database while making the update, and then calls getpwnam(3) to retrieve a record, the lock will be lost because getpwnam(3) opens, reads, and closes the password database. The database close will release all locks that the process has associated with the database, even if the library routine never requested a lock on the data- base. I also have very bad memories of trying to use this function, because the deadlock detection algorithm used can create false positives in multithreaded applications. labath: You're right that it is not a posix call -- I did not realize that. However, a brief search…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions After some hesitation I implemented your idea to use `flock` instead of `fcntl`. The concern was problems with locking on NFS shares, but it seems this was an issue for old implementation. Using `flock` allows to enable unit test for unix as well. sepavloff: After some hesitation I implemented your idea to use `flock` instead of `fcntl`. The concern…
		krytarowskiUnsubmitted Not Done Reply Inline Actions flock() is implemented as a system call on BSDs since 1983, this means that it is pretty universal. krytarowski: flock() is implemented as a system call on BSDs since 1983, this means that it is pretty…
		labathUnsubmitted Not Done Reply Inline Actions Thanks. We can always revisit this if it turns out to be an issue somewhere. labath: Thanks. We can always revisit this if it turns out to be an issue somewhere.
		ASSERT_NO_ERROR(fs::createTemporaryFile("test", "temp", FD1, TempPath));
		FileRemover Cleanup(TempPath);
		ASSERT_NO_ERROR(fs::openFileForReadWrite(TempPath, FD2, fs::CD_OpenExisting,
		fs::OF_Append));
		ASSERT_NO_ERROR(fs::tryLockFile(FD1));

		ASSERT_EQ(errc::no_lock_available,
		fs::tryLockFile(FD2, std::chrono::milliseconds(5)));
		ASSERT_NO_ERROR(fs::unlockFile(FD1));
		ASSERT_NO_ERROR(fs::tryLockFile(FD2));
		ASSERT_NO_ERROR(fs::unlockFile(FD2));
		}

		namespace {
		class Event {
		std::mutex M;
		std::condition_variable CV;
		bool Signaling = false;
		public:
		void reset() { Signaling = false; }
		void wait() {
		std::unique_lock<std::mutex> Lock(M);
		if (!Signaling)
		CV.wait_for(Lock, std::chrono::seconds(1), [&] { return Signaling; });
		}
		void signal() {
		std::unique_lock<std::mutex> Lock(M);
		Signaling = true;
		CV.notify_all();
		}
		};
		}

		TEST_F(FileSystemTest, lockFileThread) {
		#if LLVM_ENABLE_THREADS
		int FD1, FD2;
		SmallString<64> TempPath;
		ASSERT_NO_ERROR(fs::createTemporaryFile("test", "temp", FD1, TempPath));
		FileRemover Cleanup(TempPath);
		ASSERT_NO_ERROR(fs::openFileForReadWrite(TempPath, FD2, fs::CD_OpenExisting,
		fs::OF_Append));

		labathUnsubmitted Not Done Reply Inline Actions I don't think this is a good use of auto per the coding standards http://llvm.org/docs/CodingStandards.html#use-auto-type-deduction-to-make-code-more-readable. labath: I don't think this is a good use of auto per the coding standards <http://llvm.
		// Threads execute the following sequence:
		//
		// T1 started
		// lock file
		// start T2
		// wait T2 started
		// \| try locking file (failure)
		// \| / unblock T1
		// unlock file lock file
		// unlock file

		Event DoUnlockEvent;

		std::error_code ECT2a, ECT2b;
		bool UseBlockingCall = false;
		const auto Thread2Body = [&]() {
		ECT2a = fs::tryLockFile(FD2);
		if (!ECT2a)
		return;
		DoUnlockEvent.signal();
		if (UseBlockingCall)
		ECT2b = fs::lockFile(FD2);
		else
		ECT2b = fs::tryLockFile(FD2, std::chrono::seconds(5));
		if (ECT2b)
		return;
		fs::unlockFile(FD2);
		};

		std::error_code ECT1a, ECT1b;
		const auto Thread1Body = [&]() {
		ECT1a = fs::tryLockFile(FD1);
		if (ECT1a)
		return;
		auto Thread2 = std::thread(Thread2Body);
		DoUnlockEvent.wait();
		ECT1b = fs::unlockFile(FD1);
		Thread2.join();
		};

		auto Thread1 = std::thread(Thread1Body);
		Thread1.join();
		ASSERT_NO_ERROR(ECT1a);
		ASSERT_NO_ERROR(ECT1b);
		ASSERT_ERROR(ECT2a);
		ASSERT_NO_ERROR(ECT2b);
		labathUnsubmitted Not Done Reply Inline Actions I get the impression this code is much more complicated then needed. There's a lot of synchronization going on but it still does not guarantee the that the file is unlocked while the other thread is inside the `tryLock` call (my goal was to get coverage for the while loop). How about something like: EC = fs::tryLockFile(FD1); ASSERT_NO_ERROR(EC); EC = fs::tryLockFile(FD2); ASSERT_ERROR(EC); std::thread LockThread([&] { EC2 = fs::tryLockFile(FD2, std::chrono::minutes(1)); }); std::this_thread::sleep_for(std::chrono::seconds(1)); EC = fs::unlockFile(FD1); ASSERT_NO_ERROR(EC); LockThread.join(); ASSERT_NO_ERROR(EC2); EC = fs::unlockFile(FD2); ASSERT_NO_ERROR(EC); It still does not guarantee that the other thread is inside the `tryLockFile` call, but it comes as close as we can get, and it avoids all the condition_variable overhead. labath: I get the impression this code is much more complicated then needed. There's a lot of…
		labathUnsubmitted Not Done Reply Inline Actions Waiting on a response to this. I still feel that this can be organized in a much simpler way without using so many explicit synchronization primitives. labath: Waiting on a response to this. I still feel that this can be organized in a much simpler way…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions I put a diagram explaining what the test does. Actually there are two events, which ensures that the first attempt to lock file occurs in Thread2 after Thread1 locks the file but before it releases it. So the two calls to `tryLockFile` checks both cases, successful and unsuccessful. Each event requires a mutex and a condvar, so we have 4 synchronization objects. Simpler variants do not guarantee checking the both cases. sepavloff: I put a diagram explaining what the test does. Actually there are two events, which ensures…
		labathUnsubmitted Not Done Reply Inline Actions I believe that each of these events requires synchronization, but a condition variable is not the only way to achieve that. Starting and joining a thread is also a form of synchronization, and it is much simpler (and something you have to do anyway. So, instead of starting a thread which, as a first order of business blocks on a condition variable, you could just delay starting the thread until such a time that the condition would be satisfied. Basically -- remove `cv.wait_for` and replace `cv.notify` with the creation of the thread object. Then, instead of waiting for the other thread to unblock you, you can just `join` it. For the second condition variable, you just create a fresh thread again. Having the thread bodies be smaller would make the code much easier to follow, and it's not like we have to worry about the performance overhead of creating a bunch of small threads here... labath: I believe that each of these events requires synchronization, but a condition variable is not…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Starting the second thread inside the first instead of waiting event indeed makes the test more compact, I rewrote the test accordingly. As for the second lock try, which happens after the file is unlocked, using new thread makes the logic more obscure. I moved synchronization stuff into the new class `Event`, it must make the test shorter and clearer. sepavloff: Starting the second thread inside the first instead of waiting event indeed makes the test more…
		labathUnsubmitted Not Done Reply Inline Actions Creating the Event class does make it a bit better, but I still maintain that this test is too complicated for what it really tests. Take a look at the following test: TEST_F(FileSystemTest, lockFileThread) { #if LLVM_ENABLE_THREADS int FD1, FD2; SmallString<64> TempPath; ASSERT_NO_ERROR(fs::createTemporaryFile("test", "temp", FD1, TempPath)); FileRemover Cleanup(TempPath); ASSERT_NO_ERROR(fs::openFileForReadWrite(TempPath, FD2, fs::CD_OpenExisting, fs::OF_Append)); ASSERT_NO_ERROR(fs::tryLockFile(FD1)); ASSERT_ERROR(fs::tryLockFile(FD2)); std::future<std::error_code> Future = std::async(std::launch::async, [&] { return fs::tryLockFile(FD2, std::chrono::seconds(5)); }); ASSERT_NO_ERROR(fs::unlockFile(FD1)); ASSERT_NO_ERROR(Future.get()); fs::unlockFile(FD2); ASSERT_NO_ERROR(fs::tryLockFile(FD1)); ASSERT_ERROR(fs::tryLockFile(FD2)); Future = std::async(std::launch::async, [&] { return fs::lockFile(FD2); }); ASSERT_NO_ERROR(fs::unlockFile(FD1)); ASSERT_NO_ERROR(Future.get()); fs::unlockFile(FD2); #endif } It tests the same thing as the test you wrote -- I obtained by applying series of semantics-preserving simplifications to it. This included fairly simple things like: inlining replacing patterns like std::thread(foo).join() with direct calls to `foo` moving code which does not block outside of a thread -- e.g. asserting that a lock attempt fails does not need to be done in a separate thread because it does not block. Only the blocking calls do. replacing a thread consisting of a single expression with a call to `std::async` removing unused variables produced by all of this However, the end result is a test which is about 3 times shorter than the original (28 lines vs 88), and it's almost linear -- each parallel section is only three lines long. I think it'd be pretty hard to argue that this is not more readable than the original. labath: Creating the Event class does make it a bit better, but I still maintain that this test is too…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Thank you very much for the code and detailed explanations! The way your code checks the result of `fs::tryLockFile` means it relies on particular sequence of statement execution. For the test to be successful the main thread after the creation of a separate thread must continue execution and execute `fs::unlockFile`. In this case when the separate thread starts, it sees unlocked file. It is the most probable case, but not the single. If rescheduling of the main thread occurs after thread creation but before execution of `fs::unlockFile` or there is a core ready to execute the new thread, this test will fail. There is no guarantee of ordering statement execution in different threads unless synchronization objects are used. sepavloff: Thank you very much for the code and detailed explanations! The way your code checks the…
		labathUnsubmitted Not Done Reply Inline Actions I'm sorry, but I am unable to follow this line of reasoning. You're talking about this block of code, right? std::future<std::error_code> Future = std::async(std::launch::async, [&] { return fs::tryLockFile(FD2, std::chrono::seconds(5)); }); ASSERT_NO_ERROR(fs::unlockFile(FD1)); ASSERT_NO_ERROR(Future.get()); fs::unlockFile(FD2); Before the `std::async` statement, FD1 is locked, FD2 is unlocked. After it, there are two actions that can execute in arbitrary order (or concurrently): `fs::unlockFile(FD1)` on the main thread and `fs::tryLockFile(FD2, std::chrono::seconds(5))` on the "async" thread. If the `unlockFile` executes first, it will unlock the file, and the subsequent `tryLockFile` will immediately succeed. If `tryLockFile` is scheduled first, then it will get a lock failure and will start to wait. While it waits the main thread will get scheduled, unlock the file, and then the `tryLockFile` will succeed again. If an evil scheduler decides to not schedule the main thread for five seconds, then `tryLockFile` will fail, but there's nothing we can do about that except increase the timeout. This is the exact same situation that can happen with condition variables: // thread 2 ... DoUnlockEvent.signal(); if (UseBlockingCall) ECT2b = fs::lockFile(FD2); else ECT2b = fs::tryLockFile(FD2, std::chrono::seconds(5)); ... // thread 1 ECT1a = fs::tryLockFile(FD1); if (ECT1a) return; auto Thread2 = std::thread(Thread2Body); DoUnlockEvent.wait(); ECT1b = fs::unlockFile(FD1); ... After thread1 is unblocked by `DoUnlockEvent.signal();`, we again have two runnable threads (the `if` statement on thread 2, and the `fs::unlockFile(FD1)` call on thread 1) and it is up to the scheduler to determine their order. It's not true that there are no synchronization objects here. We have `std::future` and the thread object contained within. Creation of the future (via std::async) establishes a happens-before relationship between the actions taken before `std::async` is called on the main thread, and the body of the async thread. This is exactly what happens with `DoUnlockEvent.signal()` and `DoUnlockEvent.wait()`. And calling `future::get` establishes a happens-before relationship between the body of the async thread and the code that comes after the `get` call. That's exactly what would happen with `Thread2.join()` in your example. My point is that "launching a async thread" is a simpler way of synchronizing than "waiting on a cv", and "getting a future" is simpler than "joining a thread which 'returns' a result through a global variable". labath: I'm sorry, but I am unable to follow this line of reasoning. You're talking about this block of…
		sepavloffAuthorUnsubmitted Done Reply Inline Actions Sorry, I didn't notice that you use operation with enough long timeout. In this case no flaky behavior should be observed. I updated the unit test. Thank you very much! sepavloff: Sorry, I didn't notice that you use operation with enough long timeout. In this case no flaky…

		UseBlockingCall = true;
		DoUnlockEvent.reset();
		auto Thread1a = std::thread(Thread1Body);
		Thread1a.join();
		ASSERT_NO_ERROR(ECT1a);
		ASSERT_NO_ERROR(ECT1b);
		ASSERT_ERROR(ECT2a);
		ASSERT_NO_ERROR(ECT2b);

		#endif
		}

} // anonymous namespace		} // anonymous namespace

This is an archive of the discontinued LLVM Phabricator instance.

[Support] Add file lock/unlock functionsClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 267154

llvm/include/llvm/Support/FileSystem.h

llvm/lib/Support/Unix/Path.inc

llvm/lib/Support/Windows/Path.inc

llvm/unittests/Support/Path.cpp

[Support] Add file lock/unlock functions
ClosedPublic