This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
2/5
FileSystem.h
-
MemoryBuffer.h
-
lib/
-
LTO/
-
Caching.cpp
-
LTOModule.cpp
-
ThinLTOCodeGenerator.cpp
-
Object/
-
ArchiveWriter.cpp
-
Support/
-
FileOutputBuffer.cpp
-
MemoryBuffer.cpp
-
Unix/
-
Path.inc
-
VirtualFileSystem.cpp
-
Windows/
6/12
Path.inc
-
XRay/
-
InstrumentationMap.cpp
-
Profile.cpp
-
Trace.cpp
-
tools/llvm-xray/
-
llvm-xray/
-
xray-fdr-dump.cpp
-
unittests/Support/
-
Support/
-
MemoryBufferTest.cpp
-
Path.cpp
-
ReplaceFileTest.cpp

Differential D63453

[Support] Move llvm::MemoryBuffer to sys::fs::file_t
ClosedPublic

Authored by rnk on Jun 17 2019, 11:51 AM.

Download Raw Diff

Details

Reviewers

aganea
Bigcheese

Commits

rGcc418a3af45a: [Support] Move llvm::MemoryBuffer to sys::fs::file_t
rL365588: [Support] Move llvm::MemoryBuffer to sys::fs::file_t

Summary

On Windows, Posix integer file descriptors are a compatibility layer
over native file handles provided by the C runtime. There is a hard
limit on the maximum number of file descriptors that a process can open,
and the limit is 8192. LLD typically doesn't run into this limit because
it opens input files, maps them into memory, and then immediately closes
the file descriptor. This prevents it from running out of FDs.

For various reasons, I'd like to open handles to every input file and
keep them open during linking. That requires migrating MemoryBuffer over
to taking open native file handles instead of integer FDs.

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 33576
Build 33575: arc lint + arc unit

Event Timeline

rnk created this revision.Jun 17 2019, 11:51 AM

Herald added a project: Restricted Project. · View Herald TranscriptJun 17 2019, 11:51 AM

Herald added subscribers: dang, dexonsmith, steven_wu and 2 others. · View Herald Transcript

Harbormaster completed remote builds in B33499: Diff 205143.Jun 17 2019, 11:51 AM

aganea added inline comments.Jun 17 2019, 2:56 PM

llvm/include/llvm/Support/FileSystem.h
938	Same question as below, why not just keep `openNativeFile`?
963	Why not use `file_t` everywhere? Do we still need POSIX FD's in user code? I find it error-prone having to juggle between the two.
llvm/lib/Support/Windows/Path.inc
738	A bit unrelated, but worth mentioning: I've noticed that `status()` (and thus `getStatus()`) came up in profiles as slower-than-they-should-be. Each call to `status()` issues 5 kernel calls and opens the file by the same occasion. In comparaison, MSVC uses the file metadata through the `FindFirstFile/FindNextFile` API, which is a lot faster. When using precompiled headers, this makes a difference because Clang calls `status()` on every file referenced by the PCH, whereas MSVC simply iterates through folders (for a given CPP using precompiled headers, MSVC was taking ~250 ms to compile, and Clang about ~1200 ms). Clang has some level of caching, but even at that it goes through the `status()` call. One of the ideas was maybe to hide a directory caching mechanism behind this `status()` API instead of letting it go directly to the OS layer.
1211	Just keep the API below that "works" all the time? (`readNativeFileSlice`)

Where do you want to keep these open? The default OS limit on open files for OSX is 256.

In D63453#1547446, @Bigcheese wrote:

Where do you want to keep these open? The default OS limit on open files for OSX is 256.

In COFF LLD.

By the way, that is... disturbingly low. How is this not already a huge problem? Is this why clang doesn't keep open FDs to the header files it maps? That in itself seemed like a bug to me. It invites TOCTOU issues. Users pretty routinely race with the compiler while saving files in their editors.

Right now COFF LLD tries to mmap files asynchronously or in parallel, and that adds a lot of complexity. If I remove that complexity, then I should be able to open each input once, one at a time, get the inode / unique id, close the file if already open, map it, and continue.

llvm/include/llvm/Support/FileSystem.h
963	No reason, other than that there's a lot of code to migrate. The refactoring is also made slightly more difficult by the move to use `llvm::Error` instead of `std::error_code`.
llvm/lib/Support/Windows/Path.inc
738	I think @silvas was telling me more or less the same thing based on his experience at Sony. He had a similar suggestion. At this point, I think it would be best to add the cache to the VirtualFileSystem layer, since then the cache won't be global.
1211	Are you suggesting simplifying the code by forwarding the call with offset 0, or reducing the API to one entry point? Stream devices like pipes don't support seeking or pread, so to change the public API, I would need to make the offset optional, which I think would be less clear. Perhaps I can simplify this Windows specific code to delegate to the slice version, but that was my concern.

aganea added inline comments.Jun 18 2019, 10:43 AM

llvm/include/llvm/Support/FileSystem.h
963	Do you think that's something that should be done? (not having FDs anymore)
llvm/lib/Support/Windows/Path.inc
738	`FileSystemStatCache` seems to do that (?) But then a new `sys::fs` API would be needed to go through the NTFS directory metadata instead of the current call to `sys::fs::status()`.
1211	I meant forward to a common implementation for both `readNativeFile` and `readNativeFileSlice`

common ReadFile code

Harbormaster completed remote builds in B33576: Diff 205406.Jun 18 2019, 11:34 AM

rnk added inline comments.Jun 18 2019, 11:34 AM

llvm/include/llvm/Support/FileSystem.h
963	Yes.
llvm/lib/Support/Windows/Path.inc
1211	Sure, done.

In D63453#1548672, @rnk wrote:

In D63453#1547446, @Bigcheese wrote:

Where do you want to keep these open? The default OS limit on open files for OSX is 256.

In COFF LLD.

FWIW, we do use lld-link to cross-compile from Darwin to Windows, and it would be awesome if that kept working :)

aganea added inline comments.Jun 18 2019, 7:28 PM

llvm/lib/Support/Windows/Path.inc

1211

Sorry for not being clearer. I was thinking more along the lines of:

static std::error_code readNativeFileImpl(file_t FileHandle,
                                          MutableArrayRef<char> Buf,
                                          size_t *BytesRead, size_t Offset) {
  char *BufPtr = Buf.data();
  size_t BytesLeft = Buf.size();

  if (BytesRead)
    *BytesRead = 0;

  while (BytesLeft) {
    uint64_t CurOff = Buf.size() - BytesLeft + Offset;
    OVERLAPPED Overlapped{};
    Overlapped.Offset = uint32_t(CurOff);
    Overlapped.OffsetHigh = uint32_t(uint64_t(CurOff) >> 32);

    // ReadFile can only read 2GB at a time.
    DWORD BytesToRead32 = std::min(1 << 31, BytesToRead);
    DWORD BytesRead32 = 0;
    bool Success =
        ::ReadFile(FileHandle, BufPtr, BytesToRead32, &BytesRead32, Overlap);
    if (!Success) {
      DWORD Err = ::GetLastError();
      // Pipe EOF is not an error.
      if (Err != ERROR_BROKEN_PIPE)
        return mapWindowsError(Err);
    }
    if (BytesRead)
      *BytesRead += BytesRead32;

    // Once we reach EOF, zero the remaining bytes in the buffer.
    if (BytesRead32 == 0) {
      memset(BufPtr, 0, BytesLeft);
      break;
    }
    BytesLeft -= BytesRead32;
    BufPtr += BytesRead32;
  }
  return std::error_code();
}

std::error_code readNativeFile(file_t FileHandle, MutableArrayRef<char> Buf,
                               size_t *BytesRead) {
  return readNativeFileImpl(FileHandle, Buf, BytesRead, /*Offset*/ 0);
}

std::error_code readNativeFileSlice(file_t FileHandle,
                                    MutableArrayRef<char> Buf, size_t Offset) {
  return readNativeFileImpl(FileHandle, Buf, /*BytesRead*/ nullptr, Offset);
}

The doc says:

For an hFile that supports byte offsets, if you use this parameter you must specify a byte offset at which to start reading from the file or device. This offset is specified by setting the Offset and OffsetHigh members of the OVERLAPPED structure. For an hFile that does not support byte offsets, Offset and OffsetHigh are ignored.

So I think we can use OVERLAPPED with pipes as long as we don't write anything in there.
It'd be interesting to also test that 2 GB read limit :-) My whole point was, if that limit still exists, it should not leak outside of this implementation. Meaning that users should not have to loop to read files larger than 2 GB.

In D63453#1549349, @smeenai wrote:

FWIW, we do use lld-link to cross-compile from Darwin to Windows, and it would be awesome if that kept working :)

I promise not to regress that functionality. :)

Still, I don't think that's a good reason not to do this change, although I certainly have less motivation now.

llvm/lib/Support/Windows/Path.inc
1211	The Unix implementation doesn't loop, so it can theoretically return with a short read. I think if I make this one loop, I should make the Unix one loop, and then promise the caller to block until the requested number of bytes are read or EOF is hit. However, because this API doesn't prescribe any particular kind of buffer, the caller typically has to loop anyway to allocate more memory for the next read. I did it that way to make it as compatible with `read` as possible for easy migration. So, given that short reads theoretically exist on Unix, do you think the code sharing (beyond what we have already) for Windows is worth adding the inconsistency in behavior between Windows and Unix?

In D63453#1550865, @rnk wrote:
Still, I don't think that's a good reason not to do this change, although I certainly have less motivation now.

I think this is a good change, and it's going into the right direction, regardless of what you plan on doing in LLD. LGTM!

llvm/lib/Support/Windows/Path.inc
1211	The API behavior should not diverge between Windows and Unix, I think we both agree on this. However I find a bit awkward to serve short reads, when we could serve complete reads, as the user requested in the first place (in the worst case, the caller expects a full read anyway for `readNativeFile` isn't it?). But perhaps this is OT? Let's leave it the way you did it, and discuss this later if you wish?

This revision is now accepted and ready to land.Jun 19 2019, 2:15 PM

I got back to work and found some time, so now I have easy access to Linux and Windows, and should be able to finish this.

llvm/lib/Support/Windows/Path.inc
1211	I guess it's OK if the Windows API guarantees that there are no short reads. If it ever becomes a problem, we can teach the Unix one to loop until EOF.

rnk marked an inline comment as done.Jul 9 2019, 5:30 PM

rnk added inline comments.

llvm/lib/Support/Windows/Path.inc
1211	I tried the loop you wrote, but it seems to interfere with the outer loop for reading from pipes until EOF. I think we should stick with this for now.

Closed by commit rL365588: [Support] Move llvm::MemoryBuffer to sys::fs::file_t (authored by rnk). · Explain WhyJul 9 2019, 5:34 PM

This revision was automatically updated to reflect the committed changes.

Herald added a subscriber: kristina. · View Herald TranscriptJul 9 2019, 5:34 PM

labath added a subscriber: labath.Aug 16 2019, 6:18 AM

labath added inline comments.

llvm/trunk/lib/Support/Windows/Path.inc
1258–1262 ↗	(On Diff #208854)	BTW, I've was looking at this code in context of D66344, and I have found this quirk of this API to be very surprising and unexpected. Zeroing the remainder of the buffer may be the right behavior for `MemoryBuffer::getFileSlice` (though I'm not even convinced of that), but I definitely wouldn't expect that to happen in `readNativeFileSlice`. I think that an additional `size_t *BytesRead` read argument would be more natural here (and consistent with `readNativeFile`). What would you say to a patch which adds a BytesRead argument to this function and moves the zeroing code to the MemoryBuffer class (currently the only user of this function)?

aganea added inline comments.Aug 16 2019, 7:56 AM

llvm/trunk/lib/Support/Windows/Path.inc
1258–1262 ↗	(On Diff #208854)	Sounds good to me! `ErrorOr<size_t> readNativeFileSlice(..)` ? (and same thing for `readNativeFile` ?)

labath added inline comments.Aug 20 2019, 5:14 AM

llvm/trunk/lib/Support/Windows/Path.inc
1258–1262 ↗	(On Diff #208854)	Cool. I've created D66471 for that. I've went for `Expected<size_t>` instead of `ErrorOr`, as (IIUC) the plan is to eventually get rid of `ErrorOr`..

Revision Contents

Path

Size

llvm/

include/

llvm/

Support/

FileSystem.h

68 lines

MemoryBuffer.h

4 lines

lib/

LTO/

Caching.cpp

9 lines

LTOModule.cpp

3 lines

ThinLTOCodeGenerator.cpp

7 lines

Object/

ArchiveWriter.cpp

13 lines

Support/

FileOutputBuffer.cpp

3 lines

MemoryBuffer.cpp

87 lines

Unix/

Path.inc

49 lines

VirtualFileSystem.cpp

27 lines

Windows/

Path.inc

82 lines

XRay/

InstrumentationMap.cpp

3 lines

Profile.cpp

3 lines

Trace.cpp

3 lines

tools/

llvm-xray/

xray-fdr-dump.cpp

11 lines

unittests/

Support/

MemoryBufferTest.cpp

10 lines

Path.cpp

8 lines

ReplaceFileTest.cpp

8 lines

Diff 205406

llvm/include/llvm/Support/FileSystem.h

Show First 20 Lines • Show All 642 Lines • ▼ Show 20 Lines
/// @returns errc::success if result has been successfully set, otherwise a		/// @returns errc::success if result has been successfully set, otherwise a
/// platform-specific error_code.		/// platform-specific error_code.
std::error_code status(const Twine &path, file_status &result,		std::error_code status(const Twine &path, file_status &result,
bool follow = true);		bool follow = true);

/// A version for when a file descriptor is already available.		/// A version for when a file descriptor is already available.
std::error_code status(int FD, file_status &Result);		std::error_code status(int FD, file_status &Result);

		#ifdef _WIN32
		/// A version for when a file descriptor is already available.
		std::error_code status(file_t FD, file_status &Result);
		#endif

/// Set file permissions.		/// Set file permissions.
///		///
/// @param Path File to set permissions on.		/// @param Path File to set permissions on.
/// @param Permissions New file permissions.		/// @param Permissions New file permissions.
/// @returns errc::success if the permissions were successfully set, otherwise		/// @returns errc::success if the permissions were successfully set, otherwise
/// a platform-specific error_code.		/// a platform-specific error_code.
/// @note On Windows, all permissions except *_write are ignored. Using any of		/// @note On Windows, all permissions except *_write are ignored. Using any of
/// owner_write, group_write, or all_write will make the file writable.		/// owner_write, group_write, or all_write will make the file writable.
▲ Show 20 Lines • Show All 266 Lines • ▼ Show 20 Lines
/// is stored in this location. Otherwise, this is set to -1.		/// is stored in this location. Otherwise, this is set to -1.
/// @param Disp Value specifying the existing-file behavior.		/// @param Disp Value specifying the existing-file behavior.
/// @param Access Value specifying whether to open the file in read, write, or		/// @param Access Value specifying whether to open the file in read, write, or
/// read-write mode.		/// read-write mode.
/// @param Flags Additional flags.		/// @param Flags Additional flags.
/// @param Mode The access permissions of the file, represented in octal.		/// @param Mode The access permissions of the file, represented in octal.
/// @returns errc::success if \a Name has been opened, otherwise a		/// @returns errc::success if \a Name has been opened, otherwise a
/// platform-specific error_code.		/// platform-specific error_code.
std::error_code openFile(const Twine &Name, int &ResultFD,		std::error_code openFile(const Twine &Name, int &ResultFD,
		aganeaUnsubmitted Not Done Reply Inline Actions Same question as below, why not just keep `openNativeFile`? aganea: Same question as below, why not just keep `openNativeFile`?
CreationDisposition Disp, FileAccess Access,		CreationDisposition Disp, FileAccess Access,
OpenFlags Flags, unsigned Mode = 0666);		OpenFlags Flags, unsigned Mode = 0666);

/// @brief Opens a file with the specified creation disposition, access mode,		/// @brief Opens a file with the specified creation disposition, access mode,
/// and flags and returns a platform-specific file object.		/// and flags and returns a platform-specific file object.
///		///
/// The caller is responsible for closing the file object once they are		/// The caller is responsible for closing the file object once they are
/// finished with it.		/// finished with it.
///		///
/// @param Name The path of the file to open, relative or absolute.		/// @param Name The path of the file to open, relative or absolute.
/// @param Disp Value specifying the existing-file behavior.		/// @param Disp Value specifying the existing-file behavior.
/// @param Access Value specifying whether to open the file in read, write, or		/// @param Access Value specifying whether to open the file in read, write, or
/// read-write mode.		/// read-write mode.
/// @param Flags Additional flags.		/// @param Flags Additional flags.
/// @param Mode The access permissions of the file, represented in octal.		/// @param Mode The access permissions of the file, represented in octal.
/// @returns errc::success if \a Name has been opened, otherwise a		/// @returns errc::success if \a Name has been opened, otherwise a
/// platform-specific error_code.		/// platform-specific error_code.
Expected<file_t> openNativeFile(const Twine &Name, CreationDisposition Disp,		Expected<file_t> openNativeFile(const Twine &Name, CreationDisposition Disp,
FileAccess Access, OpenFlags Flags,		FileAccess Access, OpenFlags Flags,
unsigned Mode = 0666);		unsigned Mode = 0666);

		/// Converts from a Posix file descriptor number to a native file handle.
		/// On Windows, this retreives the underlying handle. On non-Windows, this is a
		/// no-op.
		file_t convertFDToNativeFile(int FD);
		aganeaUnsubmitted Not Done Reply Inline Actions Why not use `file_t` everywhere? Do we still need POSIX FD's in user code? I find it error-prone having to juggle between the two. aganea: Why not use `file_t` everywhere? Do we still need POSIX FD's in user code? I find it error…
		rnkAuthorUnsubmitted Done Reply Inline Actions No reason, other than that there's a lot of code to migrate. The refactoring is also made slightly more difficult by the move to use `llvm::Error` instead of `std::error_code`. rnk: No reason, other than that there's a lot of code to migrate. The refactoring is also made…
		aganeaUnsubmitted Not Done Reply Inline Actions Do you think that's something that should be done? (not having FDs anymore) aganea: Do you think that's something that should be done? (not having FDs anymore)
		rnkAuthorUnsubmitted Done Reply Inline Actions Yes. rnk: Yes.

		#ifndef _WIN32
		inline file_t convertFDToNativeFile(int FD) { return FD; }
		#endif

		/// Return an open handle to standard in. On Unix, this is typically FD 0.
		/// Returns kInvalidFile when the stream is closed.
		file_t getStdinHandle();

		/// Return an open handle to standard out. On Unix, this is typically FD 1.
		/// Returns kInvalidFile when the stream is closed.
		file_t getStdoutHandle();

		/// Return an open handle to standard error. On Unix, this is typically FD 2.
		/// Returns kInvalidFile when the stream is closed.
		file_t getStderrHandle();

		/// Reads \p Buf.size() bytes from \p FileHandle into \p Buf. The number of
		/// bytes actually read is returned in \p BytesRead. On Unix, this is equivalent
		/// to `*BytesRead = ::read(FD, Buf.data(), Buf.size())`, with error reporting.
		/// BytesRead will contain zero when reaching EOF.
		///
		/// @param FileHandle File to read from.
		/// @param Buf Buffer to read into.
		/// @param BytesRead Output parameter of the number of bytes read.
		/// @returns The error, if any, or errc::success.
		std::error_code readNativeFile(file_t FileHandle, MutableArrayRef<char> Buf,
		size_t *BytesRead);

		/// Reads \p Buf.size() bytes from \p FileHandle at offset \p Offset into \p
		/// Buf. If 'pread' is available, this will use that, otherwise it will use
		/// 'lseek'. Bytes requested beyond the end of the file will be zero
		/// initialized.
		///
		/// @param FileHandle File to read from.
		/// @param Buf Buffer to read into.
		/// @param Offset Offset into the file at which the read should occur.
		/// @returns The error, if any, or errc::success.
		std::error_code readNativeFileSlice(file_t FileHandle,
		MutableArrayRef<char> Buf, size_t Offset);

/// @brief Opens the file with the given name in a write-only or read-write		/// @brief Opens the file with the given name in a write-only or read-write
/// mode, returning its open file descriptor. If the file does not exist, it		/// mode, returning its open file descriptor. If the file does not exist, it
/// is created.		/// is created.
///		///
/// The caller is responsible for closing the file descriptor once they are		/// The caller is responsible for closing the file descriptor once they are
/// finished with it.		/// finished with it.
///		///
/// @param Name The path of the file to open, relative or absolute.		/// @param Name The path of the file to open, relative or absolute.
▲ Show 20 Lines • Show All 103 Lines • ▼ Show 20 Lines
/// location.		/// location.
/// @returns a platform-specific file descriptor if \a Name has been opened,		/// @returns a platform-specific file descriptor if \a Name has been opened,
/// otherwise an error object.		/// otherwise an error object.
Expected<file_t>		Expected<file_t>
openNativeFileForRead(const Twine &Name, OpenFlags Flags = OF_None,		openNativeFileForRead(const Twine &Name, OpenFlags Flags = OF_None,
SmallVectorImpl<char> *RealPath = nullptr);		SmallVectorImpl<char> *RealPath = nullptr);

/// @brief Close the file object. This should be used instead of ::close for		/// @brief Close the file object. This should be used instead of ::close for
/// portability.		/// portability. On error, the caller should assume the file is closed, as is
		/// the case for Process::SafelyCloseFileDescriptor
///		///
/// @param F On input, this is the file to close. On output, the file is		/// @param F On input, this is the file to close. On output, the file is
/// set to kInvalidFile.		/// set to kInvalidFile.
void closeFile(file_t &F);		///
		/// @returns An error code if closing the file failed. Typically, an error here
		/// means that the filesystem may have failed to perform some buffered writes.
		std::error_code closeFile(file_t &F);

std::error_code getUniqueID(const Twine Path, UniqueID &Result);		std::error_code getUniqueID(const Twine Path, UniqueID &Result);

/// Get disk space usage information.		/// Get disk space usage information.
///		///
/// Note: Users must be careful about "Time Of Check, Time Of Use" kind of bug.		/// Note: Users must be careful about "Time Of Check, Time Of Use" kind of bug.
/// Note: Windows reports results according to the quota allocated to the user.		/// Note: Windows reports results according to the quota allocated to the user.
///		///
Show All 13 Lines	enum mapmode {
priv ///< May modify via data, but changes are lost on destruction.		priv ///< May modify via data, but changes are lost on destruction.
};		};

private:		private:
/// Platform-specific mapping state.		/// Platform-specific mapping state.
size_t Size;		size_t Size;
void *Mapping;		void *Mapping;
#ifdef _WIN32		#ifdef _WIN32
void *FileHandle;		sys::fs::file_t FileHandle;
#endif		#endif
mapmode Mode;		mapmode Mode;

std::error_code init(int FD, uint64_t Offset, mapmode Mode);		std::error_code init(sys::fs::file_t FD, uint64_t Offset, mapmode Mode);

public:		public:
mapped_file_region() = delete;		mapped_file_region() = delete;
mapped_file_region(mapped_file_region&) = delete;		mapped_file_region(mapped_file_region&) = delete;
mapped_file_region &operator =(mapped_file_region&) = delete;		mapped_file_region &operator =(mapped_file_region&) = delete;

/// \param fd An open file descriptor to map. mapped_file_region takes		/// \param fd An open file descriptor to map. Does not take ownership of fd.
/// ownership if closefd is true. It must have been opended in the correct		mapped_file_region(sys::fs::file_t fd, mapmode mode, size_t length, uint64_t offset,
/// mode.
mapped_file_region(int fd, mapmode mode, size_t length, uint64_t offset,
std::error_code &ec);		std::error_code &ec);

~mapped_file_region();		~mapped_file_region();

size_t size() const;		size_t size() const;
char *data() const;		char *data() const;

/// Get a const view of the data. Modifying this memory has undefined		/// Get a const view of the data. Modifying this memory has undefined
▲ Show 20 Lines • Show All 251 Lines • Show Last 20 Lines

llvm/include/llvm/Support/MemoryBuffer.h

Show First 20 Lines • Show All 84 Lines • ▼ Show 20 Lines	public:
/// look like a regular file but have 0 size (e.g. /proc/cpuinfo on Linux).		/// look like a regular file but have 0 size (e.g. /proc/cpuinfo on Linux).
static ErrorOr<std::unique_ptr<MemoryBuffer>>		static ErrorOr<std::unique_ptr<MemoryBuffer>>
getFileAsStream(const Twine &Filename);		getFileAsStream(const Twine &Filename);

/// Given an already-open file descriptor, map some slice of it into a		/// Given an already-open file descriptor, map some slice of it into a
/// MemoryBuffer. The slice is specified by an \p Offset and \p MapSize.		/// MemoryBuffer. The slice is specified by an \p Offset and \p MapSize.
/// Since this is in the middle of a file, the buffer is not null terminated.		/// Since this is in the middle of a file, the buffer is not null terminated.
static ErrorOr<std::unique_ptr<MemoryBuffer>>		static ErrorOr<std::unique_ptr<MemoryBuffer>>
getOpenFileSlice(int FD, const Twine &Filename, uint64_t MapSize,		getOpenFileSlice(sys::fs::file_t FD, const Twine &Filename, uint64_t MapSize,
int64_t Offset, bool IsVolatile = false);		int64_t Offset, bool IsVolatile = false);

/// Given an already-open file descriptor, read the file and return a		/// Given an already-open file descriptor, read the file and return a
/// MemoryBuffer.		/// MemoryBuffer.
///		///
/// \param IsVolatile Set to true to indicate that the contents of the file		/// \param IsVolatile Set to true to indicate that the contents of the file
/// can change outside the user's control, e.g. when libclang tries to parse		/// can change outside the user's control, e.g. when libclang tries to parse
/// while the user is editing/updating the file or if the file is on an NFS.		/// while the user is editing/updating the file or if the file is on an NFS.
static ErrorOr<std::unique_ptr<MemoryBuffer>>		static ErrorOr<std::unique_ptr<MemoryBuffer>>
getOpenFile(int FD, const Twine &Filename, uint64_t FileSize,		getOpenFile(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,
bool RequiresNullTerminator = true, bool IsVolatile = false);		bool RequiresNullTerminator = true, bool IsVolatile = false);

/// Open the specified memory range as a MemoryBuffer. Note that InputData		/// Open the specified memory range as a MemoryBuffer. Note that InputData
/// must be null terminated if RequiresNullTerminator is true.		/// must be null terminated if RequiresNullTerminator is true.
static std::unique_ptr<MemoryBuffer>		static std::unique_ptr<MemoryBuffer>
getMemBuffer(StringRef InputData, StringRef BufferName = "",		getMemBuffer(StringRef InputData, StringRef BufferName = "",
bool RequiresNullTerminator = true);		bool RequiresNullTerminator = true);

▲ Show 20 Lines • Show All 175 Lines • Show Last 20 Lines

llvm/lib/LTO/Caching.cpp

Show All 38 Lines	return [=](unsigned Task, StringRef Key) -> AddStreamFn {
sys::path::append(EntryPath, CacheDirectoryPath, "llvmcache-" + Key);		sys::path::append(EntryPath, CacheDirectoryPath, "llvmcache-" + Key);
// First, see if we have a cache hit.		// First, see if we have a cache hit.
int FD;		int FD;
SmallString<64> ResultPath;		SmallString<64> ResultPath;
std::error_code EC = sys::fs::openFileForRead(		std::error_code EC = sys::fs::openFileForRead(
Twine(EntryPath), FD, sys::fs::OF_UpdateAtime, &ResultPath);		Twine(EntryPath), FD, sys::fs::OF_UpdateAtime, &ResultPath);
if (!EC) {		if (!EC) {
ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =
MemoryBuffer::getOpenFile(FD, EntryPath,		MemoryBuffer::getOpenFile(sys::fs::convertFDToNativeFile(FD),
		EntryPath,
/FileSize/ -1,		/FileSize/ -1,
/RequiresNullTerminator/ false);		/RequiresNullTerminator/ false);
close(FD);		close(FD);
if (MBOrErr) {		if (MBOrErr) {
AddBuffer(Task, std::move(*MBOrErr));		AddBuffer(Task, std::move(*MBOrErr));
return AddStreamFn();		return AddStreamFn();
}		}
EC = MBOrErr.getError();		EC = MBOrErr.getError();
Show All 25 Lines	struct CacheStream : NativeObjectStream {
Task(Task) {}		Task(Task) {}

~CacheStream() {		~CacheStream() {
// Make sure the stream is closed before committing it.		// Make sure the stream is closed before committing it.
OS.reset();		OS.reset();

// Open the file first to avoid racing with a cache pruner.		// Open the file first to avoid racing with a cache pruner.
ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =
MemoryBuffer::getOpenFile(TempFile.FD, TempFile.TmpName,		MemoryBuffer::getOpenFile(
/FileSize/ -1,		sys::fs::convertFDToNativeFile(TempFile.FD), TempFile.TmpName,
/RequiresNullTerminator/ false);		/FileSize=/-1, /RequiresNullTerminator=/false);
if (!MBOrErr)		if (!MBOrErr)
report_fatal_error(Twine("Failed to open new cache file ") +		report_fatal_error(Twine("Failed to open new cache file ") +
TempFile.TmpName + ": " +		TempFile.TmpName + ": " +
MBOrErr.getError().message() + "\n");		MBOrErr.getError().message() + "\n");

// On POSIX systems, this will atomically replace the destination if		// On POSIX systems, this will atomically replace the destination if
// it already exists. We try to emulate this on Windows, but this may		// it already exists. We try to emulate this on Windows, but this may
// fail with a permission denied error (for example, if the destination		// fail with a permission denied error (for example, if the destination
▲ Show 20 Lines • Show All 49 Lines • Show Last 20 Lines

llvm/lib/LTO/LTOModule.cpp

Show First 20 Lines • Show All 124 Lines • ▼ Show 20 Lines	LTOModule::createFromOpenFile(LLVMContext &Context, int fd, StringRef path,
return createFromOpenFileSlice(Context, fd, path, size, 0, options);		return createFromOpenFileSlice(Context, fd, path, size, 0, options);
}		}

ErrorOr<std::unique_ptr<LTOModule>>		ErrorOr<std::unique_ptr<LTOModule>>
LTOModule::createFromOpenFileSlice(LLVMContext &Context, int fd, StringRef path,		LTOModule::createFromOpenFileSlice(LLVMContext &Context, int fd, StringRef path,
size_t map_size, off_t offset,		size_t map_size, off_t offset,
const TargetOptions &options) {		const TargetOptions &options) {
ErrorOr<std::unique_ptr<MemoryBuffer>> BufferOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> BufferOrErr =
MemoryBuffer::getOpenFileSlice(fd, path, map_size, offset);		MemoryBuffer::getOpenFileSlice(sys::fs::convertFDToNativeFile(fd), path,
		map_size, offset);
if (std::error_code EC = BufferOrErr.getError()) {		if (std::error_code EC = BufferOrErr.getError()) {
Context.emitError(EC.message());		Context.emitError(EC.message());
return EC;		return EC;
}		}
std::unique_ptr<MemoryBuffer> Buffer = std::move(BufferOrErr.get());		std::unique_ptr<MemoryBuffer> Buffer = std::move(BufferOrErr.get());
return makeLTOModule(Buffer->getMemBufferRef(), options, Context,		return makeLTOModule(Buffer->getMemBufferRef(), options, Context,
/* ShouldBeLazy */ false);		/* ShouldBeLazy */ false);
}		}
▲ Show 20 Lines • Show All 535 Lines • Show Last 20 Lines

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

Show First 20 Lines • Show All 349 Lines • ▼ Show 20 Lines	ErrorOr<std::unique_ptr<MemoryBuffer>> tryLoadingBuffer() {
if (EntryPath.empty())		if (EntryPath.empty())
return std::error_code();		return std::error_code();
int FD;		int FD;
SmallString<64> ResultPath;		SmallString<64> ResultPath;
std::error_code EC = sys::fs::openFileForRead(		std::error_code EC = sys::fs::openFileForRead(
Twine(EntryPath), FD, sys::fs::OF_UpdateAtime, &ResultPath);		Twine(EntryPath), FD, sys::fs::OF_UpdateAtime, &ResultPath);
if (EC)		if (EC)
return EC;		return EC;
ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr = MemoryBuffer::getOpenFile(
MemoryBuffer::getOpenFile(FD, EntryPath,		sys::fs::convertFDToNativeFile(FD), EntryPath,
/FileSize/ -1,		/FileSize=/-1, /RequiresNullTerminator=/false);
/RequiresNullTerminator/ false);
close(FD);		close(FD);
return MBOrErr;		return MBOrErr;
}		}

// Cache the Produced object file		// Cache the Produced object file
void write(const MemoryBuffer &OutputBuffer) {		void write(const MemoryBuffer &OutputBuffer) {
if (EntryPath.empty())		if (EntryPath.empty())
return;		return;
▲ Show 20 Lines • Show All 714 Lines • Show Last 20 Lines

llvm/lib/Object/ArchiveWriter.cpp

Show First 20 Lines • Show All 68 Lines • ▼ Show 20 Lines	if (!Deterministic) {
M.Perms = AccessModeOrErr.get();		M.Perms = AccessModeOrErr.get();
}		}
return std::move(M);		return std::move(M);
}		}

Expected<NewArchiveMember> NewArchiveMember::getFile(StringRef FileName,		Expected<NewArchiveMember> NewArchiveMember::getFile(StringRef FileName,
bool Deterministic) {		bool Deterministic) {
sys::fs::file_status Status;		sys::fs::file_status Status;
int FD;		auto FDOrErr = sys::fs::openNativeFileForRead(FileName);
if (auto EC = sys::fs::openFileForRead(FileName, FD))		if (!FDOrErr)
return errorCodeToError(EC);		return FDOrErr.takeError();
assert(FD != -1);		sys::fs::file_t FD = *FDOrErr;
		assert(FD != sys::fs::kInvalidFile);

if (auto EC = sys::fs::status(FD, Status))		if (auto EC = sys::fs::status(FD, Status))
return errorCodeToError(EC);		return errorCodeToError(EC);

// Opening a directory doesn't make sense. Let it fail.		// Opening a directory doesn't make sense. Let it fail.
// Linux cannot open directories with open(2), although		// Linux cannot open directories with open(2), although
// cygwin and *bsd can.		// cygwin and *bsd can.
if (Status.type() == sys::fs::file_type::directory_file)		if (Status.type() == sys::fs::file_type::directory_file)
return errorCodeToError(make_error_code(errc::is_a_directory));		return errorCodeToError(make_error_code(errc::is_a_directory));

ErrorOr<std::unique_ptr<MemoryBuffer>> MemberBufferOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> MemberBufferOrErr =
MemoryBuffer::getOpenFile(FD, FileName, Status.getSize(), false);		MemoryBuffer::getOpenFile(FD, FileName, Status.getSize(), false);
if (!MemberBufferOrErr)		if (!MemberBufferOrErr)
return errorCodeToError(MemberBufferOrErr.getError());		return errorCodeToError(MemberBufferOrErr.getError());

if (close(FD) != 0)		if (auto EC = sys::fs::closeFile(FD))
return errorCodeToError(std::error_code(errno, std::generic_category()));		return errorCodeToError(EC);

NewArchiveMember M;		NewArchiveMember M;
M.Buf = std::move(*MemberBufferOrErr);		M.Buf = std::move(*MemberBufferOrErr);
M.MemberName = M.Buf->getBufferIdentifier();		M.MemberName = M.Buf->getBufferIdentifier();
if (!Deterministic) {		if (!Deterministic) {
M.ModTime = std::chrono::time_point_cast<std::chrono::seconds>(		M.ModTime = std::chrono::time_point_cast<std::chrono::seconds>(
Status.getLastModificationTime());		Status.getLastModificationTime());
M.UID = Status.getUser();		M.UID = Status.getUser();
▲ Show 20 Lines • Show All 524 Lines • Show Last 20 Lines

llvm/lib/Support/FileOutputBuffer.cpp

Show First 20 Lines • Show All 141 Lines • ▼ Show 20 Lines	if (auto EC = fs::resize_file(File.FD, Size)) {
consumeError(File.discard());		consumeError(File.discard());
return errorCodeToError(EC);		return errorCodeToError(EC);
}		}
#endif		#endif

// Mmap it.		// Mmap it.
std::error_code EC;		std::error_code EC;
auto MappedFile = llvm::make_unique<fs::mapped_file_region>(		auto MappedFile = llvm::make_unique<fs::mapped_file_region>(
File.FD, fs::mapped_file_region::readwrite, Size, 0, EC);		fs::convertFDToNativeFile(File.FD), fs::mapped_file_region::readwrite,
		Size, 0, EC);

// mmap(2) can fail if the underlying filesystem does not support it.		// mmap(2) can fail if the underlying filesystem does not support it.
// If that happens, we fall back to in-memory buffer as the last resort.		// If that happens, we fall back to in-memory buffer as the last resort.
if (EC) {		if (EC) {
consumeError(File.discard());		consumeError(File.discard());
return createInMemoryBuffer(Path, Size, Mode);		return createInMemoryBuffer(Path, Size, Mode);
}		}

Show All 37 Lines

llvm/lib/Support/MemoryBuffer.cpp

Show First 20 Lines • Show All 176 Lines • ▼ Show 20 Lines	static uint64_t getLegalMapSize(uint64_t Len, uint64_t Offset) {
return Len + (Offset - getLegalMapOffset(Offset));		return Len + (Offset - getLegalMapOffset(Offset));
}		}

const char *getStart(uint64_t Len, uint64_t Offset) {		const char *getStart(uint64_t Len, uint64_t Offset) {
return MFR.const_data() + (Offset - getLegalMapOffset(Offset));		return MFR.const_data() + (Offset - getLegalMapOffset(Offset));
}		}

public:		public:
MemoryBufferMMapFile(bool RequiresNullTerminator, int FD, uint64_t Len,		MemoryBufferMMapFile(bool RequiresNullTerminator, sys::fs::file_t FD, uint64_t Len,
uint64_t Offset, std::error_code &EC)		uint64_t Offset, std::error_code &EC)
: MFR(FD, MB::Mapmode, getLegalMapSize(Len, Offset),		: MFR(FD, MB::Mapmode, getLegalMapSize(Len, Offset),
getLegalMapOffset(Offset), EC) {		getLegalMapOffset(Offset), EC) {
if (!EC) {		if (!EC) {
const char *Start = getStart(Len, Offset);		const char *Start = getStart(Len, Offset);
MemoryBuffer::init(Start, Start + Len, RequiresNullTerminator);		MemoryBuffer::init(Start, Start + Len, RequiresNullTerminator);
}		}
}		}
Show All 9 Lines	public:

MemoryBuffer::BufferKind getBufferKind() const override {		MemoryBuffer::BufferKind getBufferKind() const override {
return MemoryBuffer::MemoryBuffer_MMap;		return MemoryBuffer::MemoryBuffer_MMap;
}		}
};		};
}		}

static ErrorOr<std::unique_ptr<WritableMemoryBuffer>>		static ErrorOr<std::unique_ptr<WritableMemoryBuffer>>
getMemoryBufferForStream(int FD, const Twine &BufferName) {		getMemoryBufferForStream(sys::fs::file_t FD, const Twine &BufferName) {
const ssize_t ChunkSize = 4096*4;		const ssize_t ChunkSize = 4096*4;
SmallString<ChunkSize> Buffer;		SmallString<ChunkSize> Buffer;
ssize_t ReadBytes;		size_t ReadBytes;
// Read into Buffer until we hit EOF.		// Read into Buffer until we hit EOF.
do {		do {
Buffer.reserve(Buffer.size() + ChunkSize);		Buffer.reserve(Buffer.size() + ChunkSize);
ReadBytes = sys::RetryAfterSignal(-1, ::read, FD, Buffer.end(), ChunkSize);		if (auto EC = sys::fs::readNativeFile(
if (ReadBytes == -1)		FD, makeMutableArrayRef(Buffer.end(), ChunkSize), &ReadBytes))
return std::error_code(errno, std::generic_category());		return EC;
Buffer.set_size(Buffer.size() + ReadBytes);		Buffer.set_size(Buffer.size() + ReadBytes);
} while (ReadBytes != 0);		} while (ReadBytes != 0);

return getMemBufferCopyImpl(Buffer, BufferName);		return getMemBufferCopyImpl(Buffer, BufferName);
}		}


ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getFile(const Twine &Filename, int64_t FileSize,		MemoryBuffer::getFile(const Twine &Filename, int64_t FileSize,
bool RequiresNullTerminator, bool IsVolatile) {		bool RequiresNullTerminator, bool IsVolatile) {
return getFileAux<MemoryBuffer>(Filename, FileSize, FileSize, 0,		return getFileAux<MemoryBuffer>(Filename, FileSize, FileSize, 0,
RequiresNullTerminator, IsVolatile);		RequiresNullTerminator, IsVolatile);
}		}

template <typename MB>		template <typename MB>
static ErrorOr<std::unique_ptr<MB>>		static ErrorOr<std::unique_ptr<MB>>
getOpenFileImpl(int FD, const Twine &Filename, uint64_t FileSize,		getOpenFileImpl(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,
uint64_t MapSize, int64_t Offset, bool RequiresNullTerminator,		uint64_t MapSize, int64_t Offset, bool RequiresNullTerminator,
bool IsVolatile);		bool IsVolatile);

template <typename MB>		template <typename MB>
static ErrorOr<std::unique_ptr<MB>>		static ErrorOr<std::unique_ptr<MB>>
getFileAux(const Twine &Filename, int64_t FileSize, uint64_t MapSize,		getFileAux(const Twine &Filename, int64_t FileSize, uint64_t MapSize,
uint64_t Offset, bool RequiresNullTerminator, bool IsVolatile) {		uint64_t Offset, bool RequiresNullTerminator, bool IsVolatile) {
int FD;		Expected<sys::fs::file_t> FDOrErr =
std::error_code EC = sys::fs::openFileForRead(Filename, FD, sys::fs::OF_None);		sys::fs::openNativeFileForRead(Filename, sys::fs::OF_None);
		if (!FDOrErr)
if (EC)		return errorToErrorCode(FDOrErr.takeError());
return EC;		sys::fs::file_t FD = *FDOrErr;

auto Ret = getOpenFileImpl<MB>(FD, Filename, FileSize, MapSize, Offset,		auto Ret = getOpenFileImpl<MB>(FD, Filename, FileSize, MapSize, Offset,
RequiresNullTerminator, IsVolatile);		RequiresNullTerminator, IsVolatile);
close(FD);		sys::fs::closeFile(FD);
return Ret;		return Ret;
}		}

ErrorOr<std::unique_ptr<WritableMemoryBuffer>>		ErrorOr<std::unique_ptr<WritableMemoryBuffer>>
WritableMemoryBuffer::getFile(const Twine &Filename, int64_t FileSize,		WritableMemoryBuffer::getFile(const Twine &Filename, int64_t FileSize,
bool IsVolatile) {		bool IsVolatile) {
return getFileAux<WritableMemoryBuffer>(Filename, FileSize, FileSize, 0,		return getFileAux<WritableMemoryBuffer>(Filename, FileSize, FileSize, 0,
/RequiresNullTerminator/ false,		/RequiresNullTerminator/ false,
Show All 37 Lines
WritableMemoryBuffer::getNewMemBuffer(size_t Size, const Twine &BufferName) {		WritableMemoryBuffer::getNewMemBuffer(size_t Size, const Twine &BufferName) {
auto SB = WritableMemoryBuffer::getNewUninitMemBuffer(Size, BufferName);		auto SB = WritableMemoryBuffer::getNewUninitMemBuffer(Size, BufferName);
if (!SB)		if (!SB)
return nullptr;		return nullptr;
memset(SB->getBufferStart(), 0, Size);		memset(SB->getBufferStart(), 0, Size);
return SB;		return SB;
}		}

static bool shouldUseMmap(int FD,		static bool shouldUseMmap(sys::fs::file_t FD,
size_t FileSize,		size_t FileSize,
size_t MapSize,		size_t MapSize,
off_t Offset,		off_t Offset,
bool RequiresNullTerminator,		bool RequiresNullTerminator,
int PageSize,		int PageSize,
bool IsVolatile) {		bool IsVolatile) {
// mmap may leave the buffer without null terminator if the file size changed		// mmap may leave the buffer without null terminator if the file size changed
// by the time the last page is mapped in, so avoid it if the file size is		// by the time the last page is mapped in, so avoid it if the file size is
▲ Show 20 Lines • Show All 41 Lines • ▼ Show 20 Lines
#endif		#endif

return true;		return true;
}		}

static ErrorOr<std::unique_ptr<WriteThroughMemoryBuffer>>		static ErrorOr<std::unique_ptr<WriteThroughMemoryBuffer>>
getReadWriteFile(const Twine &Filename, uint64_t FileSize, uint64_t MapSize,		getReadWriteFile(const Twine &Filename, uint64_t FileSize, uint64_t MapSize,
uint64_t Offset) {		uint64_t Offset) {
int FD;		Expected<sys::fs::file_t> FDOrErr = sys::fs::openNativeFileForReadWrite(
std::error_code EC = sys::fs::openFileForReadWrite(		Filename, sys::fs::CD_OpenExisting, sys::fs::OF_None);
Filename, FD, sys::fs::CD_OpenExisting, sys::fs::OF_None);		if (!FDOrErr)
		return errorToErrorCode(FDOrErr.takeError());
if (EC)		sys::fs::file_t FD = *FDOrErr;
return EC;

// Default is to map the full file.		// Default is to map the full file.
if (MapSize == uint64_t(-1)) {		if (MapSize == uint64_t(-1)) {
// If we don't know the file size, use fstat to find out. fstat on an open		// If we don't know the file size, use fstat to find out. fstat on an open
// file descriptor is cheaper than stat on a random path.		// file descriptor is cheaper than stat on a random path.
if (FileSize == uint64_t(-1)) {		if (FileSize == uint64_t(-1)) {
sys::fs::file_status Status;		sys::fs::file_status Status;
std::error_code EC = sys::fs::status(FD, Status);		std::error_code EC = sys::fs::status(FD, Status);
if (EC)		if (EC)
return EC;		return EC;

// If this not a file or a block device (e.g. it's a named pipe		// If this not a file or a block device (e.g. it's a named pipe
// or character device), we can't mmap it, so error out.		// or character device), we can't mmap it, so error out.
sys::fs::file_type Type = Status.type();		sys::fs::file_type Type = Status.type();
if (Type != sys::fs::file_type::regular_file &&		if (Type != sys::fs::file_type::regular_file &&
Type != sys::fs::file_type::block_file)		Type != sys::fs::file_type::block_file)
return make_error_code(errc::invalid_argument);		return make_error_code(errc::invalid_argument);

FileSize = Status.getSize();		FileSize = Status.getSize();
}		}
MapSize = FileSize;		MapSize = FileSize;
}		}

		std::error_code EC;
std::unique_ptr<WriteThroughMemoryBuffer> Result(		std::unique_ptr<WriteThroughMemoryBuffer> Result(
new (NamedBufferAlloc(Filename))		new (NamedBufferAlloc(Filename))
MemoryBufferMMapFile<WriteThroughMemoryBuffer>(false, FD, MapSize,		MemoryBufferMMapFile<WriteThroughMemoryBuffer>(false, FD, MapSize,
Offset, EC));		Offset, EC));
if (EC)		if (EC)
return EC;		return EC;
return std::move(Result);		return std::move(Result);
}		}

ErrorOr<std::unique_ptr<WriteThroughMemoryBuffer>>		ErrorOr<std::unique_ptr<WriteThroughMemoryBuffer>>
WriteThroughMemoryBuffer::getFile(const Twine &Filename, int64_t FileSize) {		WriteThroughMemoryBuffer::getFile(const Twine &Filename, int64_t FileSize) {
return getReadWriteFile(Filename, FileSize, FileSize, 0);		return getReadWriteFile(Filename, FileSize, FileSize, 0);
}		}

/// Map a subrange of the specified file as a WritableMemoryBuffer.		/// Map a subrange of the specified file as a WritableMemoryBuffer.
ErrorOr<std::unique_ptr<WriteThroughMemoryBuffer>>		ErrorOr<std::unique_ptr<WriteThroughMemoryBuffer>>
WriteThroughMemoryBuffer::getFileSlice(const Twine &Filename, uint64_t MapSize,		WriteThroughMemoryBuffer::getFileSlice(const Twine &Filename, uint64_t MapSize,
uint64_t Offset) {		uint64_t Offset) {
return getReadWriteFile(Filename, -1, MapSize, Offset);		return getReadWriteFile(Filename, -1, MapSize, Offset);
}		}

template <typename MB>		template <typename MB>
static ErrorOr<std::unique_ptr<MB>>		static ErrorOr<std::unique_ptr<MB>>
getOpenFileImpl(int FD, const Twine &Filename, uint64_t FileSize,		getOpenFileImpl(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,
uint64_t MapSize, int64_t Offset, bool RequiresNullTerminator,		uint64_t MapSize, int64_t Offset, bool RequiresNullTerminator,
bool IsVolatile) {		bool IsVolatile) {
static int PageSize = sys::Process::getPageSizeEstimate();		static int PageSize = sys::Process::getPageSizeEstimate();

// Default is to map the full file.		// Default is to map the full file.
if (MapSize == uint64_t(-1)) {		if (MapSize == uint64_t(-1)) {
// If we don't know the file size, use fstat to find out. fstat on an open		// If we don't know the file size, use fstat to find out. fstat on an open
// file descriptor is cheaper than stat on a random path.		// file descriptor is cheaper than stat on a random path.
Show All 28 Lines	getOpenFileImpl(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,

auto Buf = WritableMemoryBuffer::getNewUninitMemBuffer(MapSize, Filename);		auto Buf = WritableMemoryBuffer::getNewUninitMemBuffer(MapSize, Filename);
if (!Buf) {		if (!Buf) {
// Failed to create a buffer. The only way it can fail is if		// Failed to create a buffer. The only way it can fail is if
// new(std::nothrow) returns 0.		// new(std::nothrow) returns 0.
return make_error_code(errc::not_enough_memory);		return make_error_code(errc::not_enough_memory);
}		}

char *BufPtr = Buf.get()->getBufferStart();		sys::fs::readNativeFileSlice(FD, Buf->getBuffer(), Offset);

size_t BytesLeft = MapSize;
#ifndef HAVE_PREAD
if (lseek(FD, Offset, SEEK_SET) == -1)
return std::error_code(errno, std::generic_category());
#endif

while (BytesLeft) {
#ifdef HAVE_PREAD
ssize_t NumRead = sys::RetryAfterSignal(-1, ::pread, FD, BufPtr, BytesLeft,
MapSize - BytesLeft + Offset);
#else
ssize_t NumRead = sys::RetryAfterSignal(-1, ::read, FD, BufPtr, BytesLeft);
#endif
if (NumRead == -1) {
// Error while reading.
return std::error_code(errno, std::generic_category());
}
if (NumRead == 0) {
memset(BufPtr, 0, BytesLeft); // zero-initialize rest of the buffer.
break;
}
BytesLeft -= NumRead;
BufPtr += NumRead;
}

return std::move(Buf);		return std::move(Buf);
}		}

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getOpenFile(int FD, const Twine &Filename, uint64_t FileSize,		MemoryBuffer::getOpenFile(sys::fs::file_t FD, const Twine &Filename, uint64_t FileSize,
bool RequiresNullTerminator, bool IsVolatile) {		bool RequiresNullTerminator, bool IsVolatile) {
return getOpenFileImpl<MemoryBuffer>(FD, Filename, FileSize, FileSize, 0,		return getOpenFileImpl<MemoryBuffer>(FD, Filename, FileSize, FileSize, 0,
RequiresNullTerminator, IsVolatile);		RequiresNullTerminator, IsVolatile);
}		}

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getOpenFileSlice(int FD, const Twine &Filename, uint64_t MapSize,		MemoryBuffer::getOpenFileSlice(sys::fs::file_t FD, const Twine &Filename, uint64_t MapSize,
int64_t Offset, bool IsVolatile) {		int64_t Offset, bool IsVolatile) {
assert(MapSize != uint64_t(-1));		assert(MapSize != uint64_t(-1));
return getOpenFileImpl<MemoryBuffer>(FD, Filename, -1, MapSize, Offset, false,		return getOpenFileImpl<MemoryBuffer>(FD, Filename, -1, MapSize, Offset, false,
IsVolatile);		IsVolatile);
}		}

ErrorOr<std::unique_ptr<MemoryBuffer>> MemoryBuffer::getSTDIN() {		ErrorOr<std::unique_ptr<MemoryBuffer>> MemoryBuffer::getSTDIN() {
// Read in all of the data from stdin, we cannot mmap stdin.		// Read in all of the data from stdin, we cannot mmap stdin.
//		//
// FIXME: That isn't necessarily true, we should try to mmap stdin and		// FIXME: That isn't necessarily true, we should try to mmap stdin and
// fallback if it fails.		// fallback if it fails.
sys::ChangeStdinToBinary();		sys::ChangeStdinToBinary();

return getMemoryBufferForStream(0, "<stdin>");		return getMemoryBufferForStream(sys::fs::getStdinHandle(), "<stdin>");
}		}

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
MemoryBuffer::getFileAsStream(const Twine &Filename) {		MemoryBuffer::getFileAsStream(const Twine &Filename) {
int FD;		Expected<sys::fs::file_t> FDOrErr =
std::error_code EC = sys::fs::openFileForRead(Filename, FD, sys::fs::OF_None);		sys::fs::openNativeFileForRead(Filename, sys::fs::OF_None);
if (EC)		if (!FDOrErr)
return EC;		return errorToErrorCode(FDOrErr.takeError());
		sys::fs::file_t FD = *FDOrErr;
ErrorOr<std::unique_ptr<MemoryBuffer>> Ret =		ErrorOr<std::unique_ptr<MemoryBuffer>> Ret =
getMemoryBufferForStream(FD, Filename);		getMemoryBufferForStream(FD, Filename);
close(FD);		sys::fs::closeFile(FD);
return Ret;		return Ret;
}		}

MemoryBufferRef MemoryBuffer::getMemBufferRef() const {		MemoryBufferRef MemoryBuffer::getMemBufferRef() const {
StringRef Data = getBuffer();		StringRef Data = getBuffer();
StringRef Identifier = getBufferIdentifier();		StringRef Identifier = getBufferIdentifier();
return MemoryBufferRef(Data, Identifier);		return MemoryBufferRef(Data, Identifier);
}		}

SmallVectorMemoryBuffer::~SmallVectorMemoryBuffer() {}		SmallVectorMemoryBuffer::~SmallVectorMemoryBuffer() {}

llvm/lib/Support/Unix/Path.inc

Show First 20 Lines • Show All 969 Lines • ▼ Show 20 Lines	Expected<file_t> openNativeFileForRead(const Twine &Name, OpenFlags Flags,
SmallVectorImpl<char> *RealPath) {		SmallVectorImpl<char> *RealPath) {
file_t ResultFD;		file_t ResultFD;
std::error_code EC = openFileForRead(Name, ResultFD, Flags, RealPath);		std::error_code EC = openFileForRead(Name, ResultFD, Flags, RealPath);
if (EC)		if (EC)
return errorCodeToError(EC);		return errorCodeToError(EC);
return ResultFD;		return ResultFD;
}		}

void closeFile(file_t &F) {		file_t getStdinHandle() { return 0; }
::close(F);		file_t getStdoutHandle() { return 1; }
		file_t getStderrHandle() { return 2; }

		std::error_code readNativeFile(file_t FD, MutableArrayRef<char> Buf,
		size_t *BytesRead) {
		*BytesRead = sys::RetryAfterSignal(-1, ::read, FD, Buf.data(), Buf.size());
		if (ssize_t(*BytesRead) == -1)
		return std::error_code(errno, std::generic_category());
		return std::error_code();
		}

		std::error_code readNativeFileSlice(file_t FD, MutableArrayRef<char> Buf,
		size_t Offset) {
		char *BufPtr = Buf.data();
		size_t BytesLeft = Buf.size();

		#ifndef HAVE_PREAD
		// If we don't have pread, seek to Offset.
		if (lseek(FD, Offset, SEEK_SET) == -1)
		return std::error_code(errno, std::generic_category());
		#endif

		while (BytesLeft) {
		#ifdef HAVE_PREAD
		ssize_t NumRead = sys::RetryAfterSignal(-1, ::pread, FD, BufPtr, BytesLeft,
		Buf.size() - BytesLeft + Offset);
		#else
		ssize_t NumRead = sys::RetryAfterSignal(-1, ::read, FD, BufPtr, BytesLeft);
		#endif
		if (NumRead == -1) {
		// Error while reading.
		return std::error_code(errno, std::generic_category());
		}
		if (NumRead == 0) {
		memset(BufPtr, 0, BytesLeft); // zero-initialize rest of the buffer.
		break;
		}
		BytesLeft -= NumRead;
		BufPtr += NumRead;
		}
		return std::error_code();
		}

		std::error_code closeFile(file_t &F) {
		file_t TmpF = F;
F = kInvalidFile;		F = kInvalidFile;
		return Process::SafelyCloseFileDescriptor(TmpF);
}		}

template <typename T>		template <typename T>
static std::error_code remove_directories_impl(const T &Entry,		static std::error_code remove_directories_impl(const T &Entry,
bool IgnoreErrors) {		bool IgnoreErrors) {
std::error_code EC;		std::error_code EC;
directory_iterator Begin(Entry, EC, false);		directory_iterator Begin(Entry, EC, false);
directory_iterator End;		directory_iterator End;
▲ Show 20 Lines • Show All 176 Lines • Show Last 20 Lines

llvm/lib/Support/VirtualFileSystem.cpp

Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines
#include <string>		#include <string>
#include <system_error>		#include <system_error>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;
using namespace llvm::vfs;		using namespace llvm::vfs;

		using llvm::sys::fs::file_t;
using llvm::sys::fs::file_status;		using llvm::sys::fs::file_status;
using llvm::sys::fs::file_type;		using llvm::sys::fs::file_type;
		using llvm::sys::fs::kInvalidFile;
using llvm::sys::fs::perms;		using llvm::sys::fs::perms;
using llvm::sys::fs::UniqueID;		using llvm::sys::fs::UniqueID;

Status::Status(const file_status &Status)		Status::Status(const file_status &Status)
: UID(Status.getUniqueID()), MTime(Status.getLastModificationTime()),		: UID(Status.getUniqueID()), MTime(Status.getLastModificationTime()),
User(Status.getUser()), Group(Status.getGroup()), Size(Status.getSize()),		User(Status.getUser()), Group(Status.getGroup()), Size(Status.getSize()),
Type(Status.type()), Perms(Status.permissions()) {}		Type(Status.type()), Perms(Status.permissions()) {}

▲ Show 20 Lines • Show All 96 Lines • ▼ Show 20 Lines
//===-----------------------------------------------------------------------===/		//===-----------------------------------------------------------------------===/

namespace {		namespace {

/// Wrapper around a raw file descriptor.		/// Wrapper around a raw file descriptor.
class RealFile : public File {		class RealFile : public File {
friend class RealFileSystem;		friend class RealFileSystem;

int FD;		file_t FD;
Status S;		Status S;
std::string RealName;		std::string RealName;

RealFile(int FD, StringRef NewName, StringRef NewRealPathName)		RealFile(file_t FD, StringRef NewName, StringRef NewRealPathName)
: FD(FD), S(NewName, {}, {}, {}, {}, {},		: FD(FD), S(NewName, {}, {}, {}, {}, {},
llvm::sys::fs::file_type::status_error, {}),		llvm::sys::fs::file_type::status_error, {}),
RealName(NewRealPathName.str()) {		RealName(NewRealPathName.str()) {
assert(FD >= 0 && "Invalid or inactive file descriptor");		assert(FD != kInvalidFile && "Invalid or inactive file descriptor");
}		}

public:		public:
~RealFile() override;		~RealFile() override;

ErrorOr<Status> status() override;		ErrorOr<Status> status() override;
ErrorOr<std::string> getName() override;		ErrorOr<std::string> getName() override;
ErrorOr<std::unique_ptr<MemoryBuffer>> getBuffer(const Twine &Name,		ErrorOr<std::unique_ptr<MemoryBuffer>> getBuffer(const Twine &Name,
int64_t FileSize,		int64_t FileSize,
bool RequiresNullTerminator,		bool RequiresNullTerminator,
bool IsVolatile) override;		bool IsVolatile) override;
std::error_code close() override;		std::error_code close() override;
};		};

} // namespace		} // namespace

RealFile::~RealFile() { close(); }		RealFile::~RealFile() { close(); }

ErrorOr<Status> RealFile::status() {		ErrorOr<Status> RealFile::status() {
assert(FD != -1 && "cannot stat closed file");		assert(FD != kInvalidFile && "cannot stat closed file");
if (!S.isStatusKnown()) {		if (!S.isStatusKnown()) {
file_status RealStatus;		file_status RealStatus;
if (std::error_code EC = sys::fs::status(FD, RealStatus))		if (std::error_code EC = sys::fs::status(FD, RealStatus))
return EC;		return EC;
S = Status::copyWithNewName(RealStatus, S.getName());		S = Status::copyWithNewName(RealStatus, S.getName());
}		}
return S;		return S;
}		}

ErrorOr<std::string> RealFile::getName() {		ErrorOr<std::string> RealFile::getName() {
return RealName.empty() ? S.getName().str() : RealName;		return RealName.empty() ? S.getName().str() : RealName;
}		}

ErrorOr<std::unique_ptr<MemoryBuffer>>		ErrorOr<std::unique_ptr<MemoryBuffer>>
RealFile::getBuffer(const Twine &Name, int64_t FileSize,		RealFile::getBuffer(const Twine &Name, int64_t FileSize,
bool RequiresNullTerminator, bool IsVolatile) {		bool RequiresNullTerminator, bool IsVolatile) {
assert(FD != -1 && "cannot get buffer for closed file");		assert(FD != kInvalidFile && "cannot get buffer for closed file");
return MemoryBuffer::getOpenFile(FD, Name, FileSize, RequiresNullTerminator,		return MemoryBuffer::getOpenFile(FD, Name, FileSize, RequiresNullTerminator,
IsVolatile);		IsVolatile);
}		}

std::error_code RealFile::close() {		std::error_code RealFile::close() {
std::error_code EC = sys::Process::SafelyCloseFileDescriptor(FD);		std::error_code EC = sys::fs::closeFile(FD);
FD = -1;		FD = kInvalidFile;
return EC;		return EC;
}		}

namespace {		namespace {

/// A file system according to your operating system.		/// A file system according to your operating system.
/// This may be linked to the process's working directory, or maintain its own.		/// This may be linked to the process's working directory, or maintain its own.
///		///
▲ Show 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	ErrorOr<Status> RealFileSystem::status(const Twine &Path) {
if (std::error_code EC =		if (std::error_code EC =
sys::fs::status(adjustPath(Path, Storage), RealStatus))		sys::fs::status(adjustPath(Path, Storage), RealStatus))
return EC;		return EC;
return Status::copyWithNewName(RealStatus, Path);		return Status::copyWithNewName(RealStatus, Path);
}		}

ErrorOr<std::unique_ptr<File>>		ErrorOr<std::unique_ptr<File>>
RealFileSystem::openFileForRead(const Twine &Name) {		RealFileSystem::openFileForRead(const Twine &Name) {
int FD;
SmallString<256> RealName, Storage;		SmallString<256> RealName, Storage;
if (std::error_code EC = sys::fs::openFileForRead(		Expected<file_t> FDOrErr = sys::fs::openNativeFileForRead(
adjustPath(Name, Storage), FD, sys::fs::OF_None, &RealName))		adjustPath(Name, Storage), sys::fs::OF_None, &RealName);
return EC;		if (!FDOrErr)
return std::unique_ptr<File>(new RealFile(FD, Name.str(), RealName.str()));		return errorToErrorCode(FDOrErr.takeError());
		return std::unique_ptr<File>(
		new RealFile(*FDOrErr, Name.str(), RealName.str()));
}		}

llvm::ErrorOr<std::string> RealFileSystem::getCurrentWorkingDirectory() const {		llvm::ErrorOr<std::string> RealFileSystem::getCurrentWorkingDirectory() const {
if (WD)		if (WD)
return WD->Specified.str();		return WD->Specified.str();

SmallString<128> Dir;		SmallString<128> Dir;
if (std::error_code EC = llvm::sys::fs::current_path(Dir))		if (std::error_code EC = llvm::sys::fs::current_path(Dir))
▲ Show 20 Lines • Show All 1,803 Lines • Show Last 20 Lines

llvm/lib/Support/Windows/Path.inc

Show First 20 Lines • Show All 728 Lines • ▼ Show 20 Lines	std::error_code status(const Twine &path, file_status &result, bool Follow) {
return getStatus(h, result);		return getStatus(h, result);
}		}

std::error_code status(int FD, file_status &Result) {		std::error_code status(int FD, file_status &Result) {
HANDLE FileHandle = reinterpret_cast<HANDLE>(_get_osfhandle(FD));		HANDLE FileHandle = reinterpret_cast<HANDLE>(_get_osfhandle(FD));
return getStatus(FileHandle, Result);		return getStatus(FileHandle, Result);
}		}

		std::error_code status(file_t FileHandle, file_status &Result) {
		return getStatus(FileHandle, Result);
		aganeaUnsubmitted Not Done Reply Inline Actions A bit unrelated, but worth mentioning: I've noticed that `status()` (and thus `getStatus()`) came up in profiles as slower-than-they-should-be. Each call to `status()` issues 5 kernel calls and opens the file by the same occasion. In comparaison, MSVC uses the file metadata through the `FindFirstFile/FindNextFile` API, which is a lot faster. When using precompiled headers, this makes a difference because Clang calls `status()` on every file referenced by the PCH, whereas MSVC simply iterates through folders (for a given CPP using precompiled headers, MSVC was taking ~250 ms to compile, and Clang about ~1200 ms). Clang has some level of caching, but even at that it goes through the `status()` call. One of the ideas was maybe to hide a directory caching mechanism behind this `status()` API instead of letting it go directly to the OS layer. aganea: A bit unrelated, but worth mentioning: I've noticed that `status()` (and thus `getStatus()`)…
		rnkAuthorUnsubmitted Done Reply Inline Actions I think @silvas was telling me more or less the same thing based on his experience at Sony. He had a similar suggestion. At this point, I think it would be best to add the cache to the VirtualFileSystem layer, since then the cache won't be global. rnk: I think @silvas was telling me more or less the same thing based on his experience at Sony. He…
		aganeaUnsubmitted Not Done Reply Inline Actions `FileSystemStatCache` seems to do that (?) But then a new `sys::fs` API would be needed to go through the NTFS directory metadata instead of the current call to `sys::fs::status()`. aganea: `FileSystemStatCache` seems to do that (?) But then a new `sys::fs` API would be needed to go…
		}

std::error_code setPermissions(const Twine &Path, perms Permissions) {		std::error_code setPermissions(const Twine &Path, perms Permissions) {
SmallVector<wchar_t, 128> PathUTF16;		SmallVector<wchar_t, 128> PathUTF16;
if (std::error_code EC = widenPath(Path, PathUTF16))		if (std::error_code EC = widenPath(Path, PathUTF16))
return EC;		return EC;

DWORD Attributes = ::GetFileAttributesW(PathUTF16.begin());		DWORD Attributes = ::GetFileAttributesW(PathUTF16.begin());
if (Attributes == INVALID_FILE_ATTRIBUTES)		if (Attributes == INVALID_FILE_ATTRIBUTES)
return mapWindowsError(GetLastError());		return mapWindowsError(GetLastError());
Show All 25 Lines	std::error_code setLastAccessAndModificationTime(int FD, TimePoint<> AccessTime,
FILETIME AccessFT = toFILETIME(AccessTime);		FILETIME AccessFT = toFILETIME(AccessTime);
FILETIME ModifyFT = toFILETIME(ModificationTime);		FILETIME ModifyFT = toFILETIME(ModificationTime);
HANDLE FileHandle = reinterpret_cast<HANDLE>(_get_osfhandle(FD));		HANDLE FileHandle = reinterpret_cast<HANDLE>(_get_osfhandle(FD));
if (!SetFileTime(FileHandle, NULL, &AccessFT, &ModifyFT))		if (!SetFileTime(FileHandle, NULL, &AccessFT, &ModifyFT))
return mapWindowsError(::GetLastError());		return mapWindowsError(::GetLastError());
return std::error_code();		return std::error_code();
}		}

std::error_code mapped_file_region::init(int FD, uint64_t Offset,		std::error_code mapped_file_region::init(sys::fs::file_t OrigFileHandle,
mapmode Mode) {		uint64_t Offset, mapmode Mode) {
this->Mode = Mode;		this->Mode = Mode;
HANDLE OrigFileHandle = reinterpret_cast<HANDLE>(_get_osfhandle(FD));
if (OrigFileHandle == INVALID_HANDLE_VALUE)		if (OrigFileHandle == INVALID_HANDLE_VALUE)
return make_error_code(errc::bad_file_descriptor);		return make_error_code(errc::bad_file_descriptor);

DWORD flprotect;		DWORD flprotect;
switch (Mode) {		switch (Mode) {
case readonly: flprotect = PAGE_READONLY; break;		case readonly: flprotect = PAGE_READONLY; break;
case readwrite: flprotect = PAGE_READWRITE; break;		case readwrite: flprotect = PAGE_READWRITE; break;
case priv: flprotect = PAGE_WRITECOPY; break;		case priv: flprotect = PAGE_WRITECOPY; break;
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	if (!::DuplicateHandle(::GetCurrentProcess(), OrigFileHandle,
std::error_code ec = mapWindowsError(GetLastError());		std::error_code ec = mapWindowsError(GetLastError());
::UnmapViewOfFile(Mapping);		::UnmapViewOfFile(Mapping);
return ec;		return ec;
}		}

return std::error_code();		return std::error_code();
}		}

mapped_file_region::mapped_file_region(int fd, mapmode mode, size_t length,		mapped_file_region::mapped_file_region(sys::fs::file_t fd, mapmode mode,
uint64_t offset, std::error_code &ec)		size_t length, uint64_t offset,
		std::error_code &ec)
: Size(length), Mapping() {		: Size(length), Mapping() {
ec = init(fd, offset, mode);		ec = init(fd, offset, mode);
if (ec)		if (ec)
Mapping = 0;		Mapping = 0;
}		}

static bool hasFlushBufferKernelBug() {		static bool hasFlushBufferKernelBug() {
static bool Ret{GetWindowsOSVersion() < llvm::VersionTuple(10, 0, 0, 17763)};		static bool Ret{GetWindowsOSVersion() < llvm::VersionTuple(10, 0, 0, 17763)};
▲ Show 20 Lines • Show All 333 Lines • ▼ Show 20 Lines	Expected<file_t> openNativeFileForRead(const Twine &Name, OpenFlags Flags,

// Fetch the real name of the file, if the user asked		// Fetch the real name of the file, if the user asked
if (Result && RealPath)		if (Result && RealPath)
realPathFromHandle(Result, RealPath);		realPathFromHandle(Result, RealPath);

return Result;		return Result;
}		}

void closeFile(file_t &F) {		file_t convertFDToNativeFile(int FD) {
::CloseHandle(F);		return reinterpret_cast<HANDLE>(::_get_osfhandle(FD));
		}

		file_t getStdinHandle() { return ::GetStdHandle(STD_INPUT_HANDLE); }
		file_t getStdoutHandle() { return ::GetStdHandle(STD_OUTPUT_HANDLE); }
		file_t getStderrHandle() { return ::GetStdHandle(STD_ERROR_HANDLE); }

		std::error_code readNativeFileImpl(file_t FileHandle, char *BufPtr, size_t BytesToRead,
		aganeaUnsubmitted Not Done Reply Inline Actions Just keep the API below that "works" all the time? (`readNativeFileSlice`) aganea: Just keep the API below that "works" all the time? (`readNativeFileSlice`)
		rnkAuthorUnsubmitted Done Reply Inline Actions Are you suggesting simplifying the code by forwarding the call with offset 0, or reducing the API to one entry point? Stream devices like pipes don't support seeking or pread, so to change the public API, I would need to make the offset optional, which I think would be less clear. Perhaps I can simplify this Windows specific code to delegate to the slice version, but that was my concern. rnk: Are you suggesting simplifying the code by forwarding the call with offset 0, or reducing the…
		aganeaUnsubmitted Not Done Reply Inline Actions I meant forward to a common implementation for both `readNativeFile` and `readNativeFileSlice` aganea: I meant forward to a common implementation for both `readNativeFile` and `readNativeFileSlice`
		rnkAuthorUnsubmitted Done Reply Inline Actions Sure, done. rnk: Sure, done.
		aganeaUnsubmitted Not Done Reply Inline Actions Sorry for not being clearer. I was thinking more along the lines of: static std::error_code readNativeFileImpl(file_t FileHandle, MutableArrayRef<char> Buf, size_t BytesRead, size_t Offset) { char BufPtr = Buf.data(); size_t BytesLeft = Buf.size(); if (BytesRead) BytesRead = 0; while (BytesLeft) { uint64_t CurOff = Buf.size() - BytesLeft + Offset; OVERLAPPED Overlapped{}; Overlapped.Offset = uint32_t(CurOff); Overlapped.OffsetHigh = uint32_t(uint64_t(CurOff) >> 32); // ReadFile can only read 2GB at a time. DWORD BytesToRead32 = std::min(1 << 31, BytesToRead); DWORD BytesRead32 = 0; bool Success = ::ReadFile(FileHandle, BufPtr, BytesToRead32, &BytesRead32, Overlap); if (!Success) { DWORD Err = ::GetLastError(); // Pipe EOF is not an error. if (Err != ERROR_BROKEN_PIPE) return mapWindowsError(Err); } if (BytesRead) BytesRead += BytesRead32; // Once we reach EOF, zero the remaining bytes in the buffer. if (BytesRead32 == 0) { memset(BufPtr, 0, BytesLeft); break; } BytesLeft -= BytesRead32; BufPtr += BytesRead32; } return std::error_code(); } std::error_code readNativeFile(file_t FileHandle, MutableArrayRef<char> Buf, size_t BytesRead) { return readNativeFileImpl(FileHandle, Buf, BytesRead, /Offset/ 0); } std::error_code readNativeFileSlice(file_t FileHandle, MutableArrayRef<char> Buf, size_t Offset) { return readNativeFileImpl(FileHandle, Buf, /BytesRead/ nullptr, Offset); } The doc says: For an hFile that supports byte offsets, if you use this parameter you must specify a byte offset at which to start reading from the file or device. This offset is specified by setting the Offset and OffsetHigh members of the OVERLAPPED structure. For an hFile that does not support byte offsets, Offset and OffsetHigh are ignored. So I think we can use OVERLAPPED with pipes as long as we don't write anything in there. It'd be interesting to also test that 2 GB read limit :-) My whole point was, if that limit still exists, it should not leak outside of this implementation. Meaning that users should not have to loop to read files larger than 2 GB. aganea:* Sorry for not being clearer. I was thinking more along the lines of: ``` static std::error_code…
		rnkAuthorUnsubmitted Done Reply Inline Actions The Unix implementation doesn't loop, so it can theoretically return with a short read. I think if I make this one loop, I should make the Unix one loop, and then promise the caller to block until the requested number of bytes are read or EOF is hit. However, because this API doesn't prescribe any particular kind of buffer, the caller typically has to loop anyway to allocate more memory for the next read. I did it that way to make it as compatible with `read` as possible for easy migration. So, given that short reads theoretically exist on Unix, do you think the code sharing (beyond what we have already) for Windows is worth adding the inconsistency in behavior between Windows and Unix? rnk: The Unix implementation doesn't loop, so it can theoretically return with a short read. I think…
		aganeaUnsubmitted Not Done Reply Inline Actions The API behavior should not diverge between Windows and Unix, I think we both agree on this. However I find a bit awkward to serve short reads, when we could serve complete reads, as the user requested in the first place (in the worst case, the caller expects a full read anyway for `readNativeFile` isn't it?). But perhaps this is OT? Let's leave it the way you did it, and discuss this later if you wish? aganea: The API behavior should not diverge between Windows and Unix, I think we both agree on this.
		rnkAuthorUnsubmitted Done Reply Inline Actions I guess it's OK if the Windows API guarantees that there are no short reads. If it ever becomes a problem, we can teach the Unix one to loop until EOF. rnk: I guess it's OK if the Windows API guarantees that there are no short reads. If it ever becomes…
		rnkAuthorUnsubmitted Done Reply Inline Actions I tried the loop you wrote, but it seems to interfere with the outer loop for reading from pipes until EOF. I think we should stick with this for now. rnk: I tried the loop you wrote, but it seems to interfere with the outer loop for reading from…
		size_t BytesRead, OVERLAPPED Overlap) {
		// ReadFile can only read 2GB at a time. The caller should check the number of
		// bytes and read in a loop until termination.
		DWORD BytesToRead32 =
		std::min(size_t(std::numeric_limits<DWORD>::max()), BytesToRead);
		DWORD BytesRead32 = 0;
		bool Success =
		::ReadFile(FileHandle, BufPtr, BytesToRead32, &BytesRead32, Overlap);
		*BytesRead = BytesRead32;
		if (!Success) {
		DWORD Err = ::GetLastError();
		// Pipe EOF is not an error.
		if (Err == ERROR_BROKEN_PIPE)
		return std::error_code();
		return mapWindowsError(Err);
		}
		return std::error_code();
		}

		std::error_code readNativeFile(file_t FileHandle, MutableArrayRef<char> Buf,
		size_t *BytesRead) {
		return readNativeFileImpl(FileHandle, Buf.data(), Buf.size(), BytesRead,
		/Overlap=/nullptr);
		}

		std::error_code readNativeFileSlice(file_t FileHandle,
		MutableArrayRef<char> Buf, size_t Offset) {
		char *BufPtr = Buf.data();
		size_t BytesLeft = Buf.size();

		while (BytesLeft) {
		uint64_t CurOff = Buf.size() - BytesLeft + Offset;
		OVERLAPPED Overlapped = {};
		Overlapped.Offset = uint32_t(CurOff);
		Overlapped.OffsetHigh = uint32_t(uint64_t(CurOff) >> 32);

		size_t BytesRead = 0;
		if (auto EC = readNativeFileImpl(FileHandle, BufPtr, BytesLeft, &BytesRead,
		&Overlapped))
		return EC;

		// Once we reach EOF, zero the remaining bytes in the buffer.
		if (BytesRead == 0) {
		memset(BufPtr, 0, BytesLeft);
		break;
		}
		BytesLeft -= BytesRead;
		BufPtr += BytesRead;
		}
		return std::error_code();
		}

		std::error_code closeFile(file_t &F) {
		file_t TmpF = F;
F = kInvalidFile;		F = kInvalidFile;
		if (!::CloseHandle(TmpF))
		return mapWindowsError(::GetLastError());
		return std::error_code();
}		}

std::error_code remove_directories(const Twine &path, bool IgnoreErrors) {		std::error_code remove_directories(const Twine &path, bool IgnoreErrors) {
// Convert to utf-16.		// Convert to utf-16.
SmallVector<wchar_t, 128> Path16;		SmallVector<wchar_t, 128> Path16;
std::error_code EC = widenPath(path, Path16);		std::error_code EC = widenPath(path, Path16);
if (EC && !IgnoreErrors)		if (EC && !IgnoreErrors)
return EC;		return EC;
▲ Show 20 Lines • Show All 229 Lines • Show Last 20 Lines

llvm/lib/XRay/InstrumentationMap.cpp

	Show First 20 Lines • Show All 172 Lines • ▼ Show 20 Lines

	static Error			static Error
	loadYAML(int Fd, size_t FileSize, StringRef Filename,			loadYAML(int Fd, size_t FileSize, StringRef Filename,
	InstrumentationMap::SledContainer &Sleds,			InstrumentationMap::SledContainer &Sleds,
	InstrumentationMap::FunctionAddressMap &FunctionAddresses,			InstrumentationMap::FunctionAddressMap &FunctionAddresses,
	InstrumentationMap::FunctionAddressReverseMap &FunctionIds) {			InstrumentationMap::FunctionAddressReverseMap &FunctionIds) {
	std::error_code EC;			std::error_code EC;
	sys::fs::mapped_file_region MappedFile(			sys::fs::mapped_file_region MappedFile(
	Fd, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);			sys::fs::convertFDToNativeFile(Fd),
				sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);
	if (EC)			if (EC)
	return make_error<StringError>(			return make_error<StringError>(
	Twine("Failed memory-mapping file '") + Filename + "'.", EC);			Twine("Failed memory-mapping file '") + Filename + "'.", EC);

	std::vector<YAMLXRaySledEntry> YAMLSleds;			std::vector<YAMLXRaySledEntry> YAMLSleds;
	yaml::Input In(StringRef(MappedFile.data(), MappedFile.size()));			yaml::Input In(StringRef(MappedFile.data(), MappedFile.size()));
	In >> YAMLSleds;			In >> YAMLSleds;
	if (In.error())			if (In.error())
	▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

llvm/lib/XRay/Profile.cpp

Show First 20 Lines • Show All 266 Lines • ▼ Show 20 Lines	Expected<Profile> loadProfile(StringRef Filename) {

uint64_t FileSize;		uint64_t FileSize;
if (auto EC = sys::fs::file_size(Filename, FileSize))		if (auto EC = sys::fs::file_size(Filename, FileSize))
return make_error<StringError>(		return make_error<StringError>(
Twine("Cannot get filesize of '") + Filename + "'", EC);		Twine("Cannot get filesize of '") + Filename + "'", EC);

std::error_code EC;		std::error_code EC;
sys::fs::mapped_file_region MappedFile(		sys::fs::mapped_file_region MappedFile(
Fd, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);		sys::fs::convertFDToNativeFile(Fd),
		sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);
if (EC)		if (EC)
return make_error<StringError>(		return make_error<StringError>(
Twine("Cannot mmap profile '") + Filename + "'", EC);		Twine("Cannot mmap profile '") + Filename + "'", EC);
StringRef Data(MappedFile.data(), MappedFile.size());		StringRef Data(MappedFile.data(), MappedFile.size());

Profile P;		Profile P;
uint32_t Offset = 0;		uint32_t Offset = 0;
DataExtractor Extractor(Data, true, 8);		DataExtractor Extractor(Data, true, 8);
▲ Show 20 Lines • Show All 119 Lines • Show Last 20 Lines

llvm/lib/XRay/Trace.cpp

Show First 20 Lines • Show All 385 Lines • ▼ Show 20 Lines	if (FileSize < 4) {
return make_error<StringError>(		return make_error<StringError>(
Twine("File '") + Filename + "' too small for XRay.",		Twine("File '") + Filename + "' too small for XRay.",
std::make_error_code(std::errc::executable_format_error));		std::make_error_code(std::errc::executable_format_error));
}		}

// Map the opened file into memory and use a StringRef to access it later.		// Map the opened file into memory and use a StringRef to access it later.
std::error_code EC;		std::error_code EC;
sys::fs::mapped_file_region MappedFile(		sys::fs::mapped_file_region MappedFile(
Fd, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);		sys::fs::convertFDToNativeFile(Fd),
		sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);
if (EC) {		if (EC) {
return make_error<StringError>(		return make_error<StringError>(
Twine("Cannot read log from '") + Filename + "'", EC);		Twine("Cannot read log from '") + Filename + "'", EC);
}		}
auto Data = StringRef(MappedFile.data(), MappedFile.size());		auto Data = StringRef(MappedFile.data(), MappedFile.size());

// TODO: Lift the endianness and implementation selection here.		// TODO: Lift the endianness and implementation selection here.
DataExtractor LittleEndianDE(Data, true, 8);		DataExtractor LittleEndianDE(Data, true, 8);
▲ Show 20 Lines • Show All 67 Lines • Show Last 20 Lines

llvm/tools/llvm-xray/xray-fdr-dump.cpp

Show All 29 Lines	static cl::opt<std::string> DumpInput(cl::Positional,
cl::desc("<xray fdr mode log>"),		cl::desc("<xray fdr mode log>"),
cl::Required, cl::sub(Dump));		cl::Required, cl::sub(Dump));
static cl::opt<bool> DumpVerify("verify",		static cl::opt<bool> DumpVerify("verify",
cl::desc("verify structure of the log"),		cl::desc("verify structure of the log"),
cl::init(false), cl::sub(Dump));		cl::init(false), cl::sub(Dump));

static CommandRegistration Unused(&Dump, []() -> Error {		static CommandRegistration Unused(&Dump, []() -> Error {
// Open the file provided.		// Open the file provided.
int Fd;		auto FDOrErr = sys::fs::openNativeFileForRead(DumpInput);
if (auto EC = sys::fs::openFileForRead(DumpInput, Fd))		if (!FDOrErr)
return createStringError(EC, "Cannot open file '%s' for read.",		return FDOrErr.takeError();
DumpInput.c_str());

uint64_t FileSize;		uint64_t FileSize;
if (auto EC = sys::fs::file_size(DumpInput, FileSize))		if (auto EC = sys::fs::file_size(DumpInput, FileSize))
return createStringError(EC, "Failed to get file size for '%s'.",		return createStringError(EC, "Failed to get file size for '%s'.",
DumpInput.c_str());		DumpInput.c_str());

std::error_code EC;		std::error_code EC;
sys::fs::mapped_file_region MappedFile(		sys::fs::mapped_file_region MappedFile(
Fd, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0, EC);		*FDOrErr, sys::fs::mapped_file_region::mapmode::readonly, FileSize, 0,
		EC);
		sys::fs::closeFile(*FDOrErr);

DataExtractor DE(StringRef(MappedFile.data(), MappedFile.size()), true, 8);		DataExtractor DE(StringRef(MappedFile.data(), MappedFile.size()), true, 8);
uint32_t OffsetPtr = 0;		uint32_t OffsetPtr = 0;

auto FileHeaderOrError = readBinaryFormatHeader(DE, OffsetPtr);		auto FileHeaderOrError = readBinaryFormatHeader(DE, OffsetPtr);
if (!FileHeaderOrError)		if (!FileHeaderOrError)
return FileHeaderOrError.takeError();		return FileHeaderOrError.takeError();
auto &H = FileHeaderOrError.get();		auto &H = FileHeaderOrError.get();
▲ Show 20 Lines • Show All 60 Lines • Show Last 20 Lines

llvm/unittests/Support/MemoryBufferTest.cpp

Show First 20 Lines • Show All 143 Lines • ▼ Show 20 Lines	for (int i = 0; i < 60000; ++i) {
OF << "0123456789";		OF << "0123456789";
}		}

if (Reopen) {		if (Reopen) {
OF.close();		OF.close();
EXPECT_FALSE(sys::fs::openFileForRead(TestPath.c_str(), TestFD));		EXPECT_FALSE(sys::fs::openFileForRead(TestPath.c_str(), TestFD));
}		}

ErrorOr<OwningBuffer> Buf =		ErrorOr<OwningBuffer> Buf = MemoryBuffer::getOpenFileSlice(
MemoryBuffer::getOpenFileSlice(TestFD, TestPath.c_str(),		sys::fs::convertFDToNativeFile(TestFD), TestPath.c_str(),
40000, // Size		40000, // Size
80000 // Offset		80000 // Offset
);		);

std::error_code EC = Buf.getError();		std::error_code EC = Buf.getError();
EXPECT_FALSE(EC);		EXPECT_FALSE(EC);

StringRef BufData = Buf.get()->getBuffer();		StringRef BufData = Buf.get()->getBuffer();
EXPECT_EQ(BufData.size(), 40000U);		EXPECT_EQ(BufData.size(), 40000U);
EXPECT_EQ(BufData[0], '0');		EXPECT_EQ(BufData[0], '0');
EXPECT_EQ(BufData[9], '9');		EXPECT_EQ(BufData[9], '9');
▲ Show 20 Lines • Show All 127 Lines • Show Last 20 Lines

llvm/unittests/Support/Path.cpp

Show First 20 Lines • Show All 1,078 Lines • ▼ Show 20 Lines	ASSERT_NO_ERROR(
fs::createTemporaryFile("prefix", "temp", FileDescriptor, TempPath));		fs::createTemporaryFile("prefix", "temp", FileDescriptor, TempPath));
unsigned Size = 4096;		unsigned Size = 4096;
ASSERT_NO_ERROR(fs::resize_file(FileDescriptor, Size));		ASSERT_NO_ERROR(fs::resize_file(FileDescriptor, Size));

// Map in temp file and add some content		// Map in temp file and add some content
std::error_code EC;		std::error_code EC;
StringRef Val("hello there");		StringRef Val("hello there");
{		{
fs::mapped_file_region mfr(FileDescriptor,		fs::mapped_file_region mfr(fs::convertFDToNativeFile(FileDescriptor),
fs::mapped_file_region::readwrite, Size, 0, EC);		fs::mapped_file_region::readwrite, Size, 0, EC);
ASSERT_NO_ERROR(EC);		ASSERT_NO_ERROR(EC);
std::copy(Val.begin(), Val.end(), mfr.data());		std::copy(Val.begin(), Val.end(), mfr.data());
// Explicitly add a 0.		// Explicitly add a 0.
mfr.data()[Val.size()] = 0;		mfr.data()[Val.size()] = 0;
// Unmap temp file		// Unmap temp file
}		}
ASSERT_EQ(close(FileDescriptor), 0);		ASSERT_EQ(close(FileDescriptor), 0);

// Map it back in read-only		// Map it back in read-only
{		{
int FD;		int FD;
EC = fs::openFileForRead(Twine(TempPath), FD);		EC = fs::openFileForRead(Twine(TempPath), FD);
ASSERT_NO_ERROR(EC);		ASSERT_NO_ERROR(EC);
fs::mapped_file_region mfr(FD, fs::mapped_file_region::readonly, Size, 0, EC);		fs::mapped_file_region mfr(fs::convertFDToNativeFile(FD),
		fs::mapped_file_region::readonly, Size, 0, EC);
ASSERT_NO_ERROR(EC);		ASSERT_NO_ERROR(EC);

// Verify content		// Verify content
EXPECT_EQ(StringRef(mfr.const_data()), Val);		EXPECT_EQ(StringRef(mfr.const_data()), Val);

// Unmap temp file		// Unmap temp file
fs::mapped_file_region m(FD, fs::mapped_file_region::readonly, Size, 0, EC);		fs::mapped_file_region m(fs::convertFDToNativeFile(FD),
		fs::mapped_file_region::readonly, Size, 0, EC);
ASSERT_NO_ERROR(EC);		ASSERT_NO_ERROR(EC);
ASSERT_EQ(close(FD), 0);		ASSERT_EQ(close(FD), 0);
}		}
ASSERT_NO_ERROR(fs::remove(TempPath));		ASSERT_NO_ERROR(fs::remove(TempPath));
}		}

TEST(Support, NormalizePath) {		TEST(Support, NormalizePath) {
using TestTuple = std::tuple<const char , const char , const char *>;		using TestTuple = std::tuple<const char , const char , const char *>;
▲ Show 20 Lines • Show All 614 Lines • Show Last 20 Lines

llvm/unittests/Support/ReplaceFileTest.cpp

Show First 20 Lines • Show All 46 Lines • ▼ Show 20 Lines	class ScopedFD {
ScopedFD &operator=(const ScopedFD &) = delete;		ScopedFD &operator=(const ScopedFD &) = delete;

public:		public:
explicit ScopedFD(int Descriptor) : FD(Descriptor) {}		explicit ScopedFD(int Descriptor) : FD(Descriptor) {}
~ScopedFD() { Process::SafelyCloseFileDescriptor(FD); }		~ScopedFD() { Process::SafelyCloseFileDescriptor(FD); }
};		};

bool FDHasContent(int FD, StringRef Content) {		bool FDHasContent(int FD, StringRef Content) {
auto Buffer = MemoryBuffer::getOpenFile(FD, "", -1);		auto Buffer =
		MemoryBuffer::getOpenFile(sys::fs::convertFDToNativeFile(FD), "", -1);
assert(Buffer);		assert(Buffer);
return Buffer.get()->getBuffer() == Content;		return Buffer.get()->getBuffer() == Content;
}		}

bool FileHasContent(StringRef File, StringRef Content) {		bool FileHasContent(StringRef File, StringRef Content) {
int FD = 0;		int FD = 0;
auto EC = fs::openFileForRead(File, FD);		auto EC = fs::openFileForRead(File, FD);
(void)EC;		(void)EC;
▲ Show 20 Lines • Show All 77 Lines • ▼ Show 20 Lines	ASSERT_NO_ERROR(CreateFileWithContent(TargetTmp0FileName, "!!target.tmp0!!"));
// Use mapped_file_region to make sure that the destination file is mmap'ed.		// Use mapped_file_region to make sure that the destination file is mmap'ed.
// This will cause SetInformationByHandle to fail when renaming to the		// This will cause SetInformationByHandle to fail when renaming to the
// destination, and we will follow the code path that tries to give target		// destination, and we will follow the code path that tries to give target
// a temporary name.		// a temporary name.
int TargetFD;		int TargetFD;
std::error_code EC;		std::error_code EC;
ASSERT_NO_ERROR(fs::openFileForRead(TargetFileName, TargetFD));		ASSERT_NO_ERROR(fs::openFileForRead(TargetFileName, TargetFD));
ScopedFD X(TargetFD);		ScopedFD X(TargetFD);
sys::fs::mapped_file_region MFR(		sys::fs::mapped_file_region MFR(sys::fs::convertFDToNativeFile(TargetFD),
TargetFD, sys::fs::mapped_file_region::readonly, 10, 0, EC);		sys::fs::mapped_file_region::readonly, 10,
		0, EC);
ASSERT_FALSE(EC);		ASSERT_FALSE(EC);

ASSERT_NO_ERROR(fs::rename(SourceFileName, TargetFileName));		ASSERT_NO_ERROR(fs::rename(SourceFileName, TargetFileName));

#ifdef _WIN32		#ifdef _WIN32
// Make sure that target was temporarily renamed to target.tmp1 on Windows.		// Make sure that target was temporarily renamed to target.tmp1 on Windows.
// This is signified by a permission denied error as opposed to no such file		// This is signified by a permission denied error as opposed to no such file
// or directory when trying to open it.		// or directory when trying to open it.
Show All 14 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[Support] Move llvm::MemoryBuffer to sys::fs::file_tClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 205406

llvm/include/llvm/Support/FileSystem.h

llvm/include/llvm/Support/MemoryBuffer.h

llvm/lib/LTO/Caching.cpp

llvm/lib/LTO/LTOModule.cpp

llvm/lib/LTO/ThinLTOCodeGenerator.cpp

llvm/lib/Object/ArchiveWriter.cpp

llvm/lib/Support/FileOutputBuffer.cpp

llvm/lib/Support/MemoryBuffer.cpp

llvm/lib/Support/Unix/Path.inc

llvm/lib/Support/VirtualFileSystem.cpp

llvm/lib/Support/Windows/Path.inc

llvm/lib/XRay/InstrumentationMap.cpp

llvm/lib/XRay/Profile.cpp

llvm/lib/XRay/Trace.cpp

llvm/tools/llvm-xray/xray-fdr-dump.cpp

llvm/unittests/Support/MemoryBufferTest.cpp

llvm/unittests/Support/Path.cpp

llvm/unittests/Support/ReplaceFileTest.cpp

[Support] Move llvm::MemoryBuffer to sys::fs::file_t
ClosedPublic