This is an archive of the discontinued LLVM Phabricator instance.

[Support] Add reserve() method to the raw_ostream.
ClosedPublic

Authored by avl on Nov 18 2020, 2:41 AM.

Details

Summary

If the resulting size of the output stream is already known,
then in some cases the space for the stream data can be
allocated up front. For example, raw_string_ostream could
preallocate the space for the target string, which avoids
reallocations while writing into the stream.
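
A minimal usage sketch of what the summary describes, assuming the method proposed here (reserve(), later renamed reserveExtraSpace() during this review) is available on raw_string_ostream; this is illustrative only, not the committed diff:

#include "llvm/Support/raw_ostream.h"
#include <string>
using namespace llvm;

void writeKnownSizeOutput(size_t KnownSize) {
  std::string Storage;
  raw_string_ostream OS(Storage);
  OS.reserve(KnownSize); // preallocate Storage once
  for (size_t I = 0; I < KnownSize; ++I)
    OS << 'x';           // many small writes, no reallocation of Storage
}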

Diff Detail

Event Timeline

avl created this revision.Nov 18 2020, 2:41 AM
avl requested review of this revision.Nov 18 2020, 2:41 AM

For raw_fd_ostream, I wonder if we should actually just have a separate class e.g. raw_mmap_ostream, rather than trying to make raw_fd_ostream represent two possible different things. For other streams, I'm not sure the reserving behaviour is useful - strings and vectors could be reserved from outside the wrapping stream, for example.

llvm/include/llvm/Support/raw_ostream.h
138
139–140

What's the reason for setting the stream to be unbuffered on reserving space? It seems like a potentially unexpected side effect that isn't strictly needed.

144
575–578

Same comments as above. I'm not convinced duplicating the comment is worthwhile, but I suppose these are separate classes, so it's probably acceptable.

582

No need for virtual when the base method is virtual and the subclass method has override.

656–660

Ditto.

698–702

Ditto.

llvm/lib/Support/raw_ostream.cpp
742–743

It's definitely a programmer error if this case is hit, but I think it might be worth also bailing out of the function in that case, to avoid writing past the end of the file, which could cause all sorts of horridness. What do you think?

819

Same as above. Perhaps seek to EOF here in that case?

920–923

This should report some sort of error up the tree if this happens, right? Maybe reserve should return Error.

I second the separate class idea. It seems like it could be much cleaner. The normal treatment of reserve-like methods is that of a hint -- one that the implementation could ignore or adjust in some circumstances -- I wouldn't expect that calling this method would completely change the way in which a file is accessed, nor that writing "beyond" the reserved storage would result in an assertion...

avl added a comment.Nov 18 2020, 5:06 AM

For raw_fd_ostream, I wonder if we should actually just have a separate class e.g. raw_mmap_ostream, rather than trying to make raw_fd_ostream represent two possible different things. For other streams, I'm not sure the reserving behaviour is useful - strings and vectors could be reserved from outside the wrapping stream, for example.

The idea behind using a reserve() method is that the size which should be reserved may be unknown at creation time.
Thus we do not know how much to reserve for strings and vectors at their creation point. The same applies to a memory-mapped file: it may be unknown which class to create, raw_fd_ostream or raw_mmap_ostream:

void WriteToStreamLibrayFunc(raw_ostream &Out) {
   if (HasVariableData()) {
      while (HasData()) {
         Out.Write();
      }
   } else {
      size_t Size = calculateSize();

      Out.reserve(Size);
      Out.Write();
   }
}

Usage 1:

std::string string;
string.reserve(?????);
raw_string_ostream Out(string);
WriteToStreamLibrayFunc (Out);

Usage 2:

raw_fd_ostream?raw_mmap_ostream? Out();
WriteToStreamLibrayFunc (Out);

So, using the reserve() method allows us to set up efficient storage when the resulting size can be calculated under some conditions.

llvm/include/llvm/Support/raw_ostream.h
139–140

The reason is that when the space is allocated by the reserve() method, the other buffers become useless in most cases. We do not need to copy data through internal buffers - we can write it directly into the memory-mapped file.

llvm/lib/Support/raw_ostream.cpp
742–743

It seems to me that we do not need to bail out. That is a programmer error, as you said. But we need to stay correct, i.e. I think I need to add a size check (to avoid writing past the end of the file), like in raw_fd_stream::read().

819

Yes, I think seeking to EOF is a good idea. But I think it is better to have an assertion here rather than bailing out with an error.

920–923

The idea of that method is that when it cannot use a memory-mapped file, we continue with the usual processing. For example, if the file is opened without read access, we will get a "permission denied" error. But we would not like to report this error; we would like to continue without the memory-mapped file. So the idea is that if the reserve() method is not able to reserve space, we continue the same as if reserve() had not been called.
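
A rough sketch of that fallback behaviour for the mmap-based variant under discussion at this point; tryMapOutputFile, MappedRegion and the Mapped member are hypothetical names used only for illustration, not real LLVM APIs:

// Hypothetical mmap-backed reserve() on raw_fd_ostream:
void raw_fd_ostream::reserve(uint64_t Size) {
  // Try to memory-map the output file. On any error (e.g. "permission
  // denied" because the file lacks read access), swallow the error and
  // keep using the normal buffered write path.
  if (Expected<MappedRegion> MapOrErr = tryMapOutputFile(FD, Size))
    Mapped = std::move(*MapOrErr);
  else
    consumeError(MapOrErr.takeError());
}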

avl updated this revision to Diff 306431.Nov 19 2020, 9:04 AM

Addressed minor comments.

The question of creating a separate raw_mmap_ostream class
is still under discussion.

avl added a comment.Nov 19 2020, 9:25 AM

I second the separate class idea. It seems like it could be much cleaner. The normal treatment of reserve-like methods is that of a hint -- one that the implementation could ignore or adjust in some circumstances -- I wouldn't expect that calling this method would completely change the way in which a file is accessed, nor that writing "beyond" the reserved storage would result in an assertion...

What do you think of the WriteToStreamLibrayFunc use case? In that context reserve() is not just a hint; it limits the size of the resulting stream. Probably the method name is not good. Would resize() be better?

In D91693#2405908, @avl wrote:

I second the separate class idea. It seems like it could be much cleaner. The normal treatment of reserve-like methods is that of a hint -- one that the implementation could ignore or adjust in some circumstances -- I wouldn't expect that calling this method would completely change the way in which a file is accessed, nor that writing "beyond" the reserved storage would result in an assertion...

What do you think of the WriteToStreamLibrayFunc use case? In that context reserve() is not just a hint; it limits the size of the resulting stream. Probably the method name is not good. Would resize() be better?

Not really. :( For resize, I would expect that it has a physical effect on the underlying object (changing file size), which is good, I guess. But I would still expect that it is possible to write more data than that size, and have it be appended, as that's what our other streams do. I would also expect this operation to also have some effect on raw_string_ostream (changing the underlying string size) and such. And I am still worried about using mmap(2)/write(2) interchangeably. The two apis have different characteristics, and I don't think that can be conveyed by a single method call. E.g., the mmap approach will not respect O_APPEND, it will cause SIGBUS if the file is concurrently truncated (that's why our mmap APIs like MemoryBuffer have IsVolatile arguments), etc.

TBH, I am not sure that raw_ostream is the best class for this use case. How much of its functionality do you need? If you don't need the formatted output capabilities, then maybe a different interface might be better suited. (I realize you're trying to remove one.)

In D91693#2402455, @avl wrote:

Usage 2:

raw_fd_ostream?raw_mmap_ostream? Out();
WriteToStreamLibrayFunc (Out);

One way to get around that would be to make raw_mmap_ostream usable even without an explicit reserve call. So, if the user does not call reserve (or if he goes past the buffer he has already reserved), the stream would automatically mmap successive chunks of the file, and write to them.

That would make the mmap approach harder to implement, but otoh, it would at least behave roughly as a normal stream. And the usage of a distinct class would make it clear that something funky is happening. Also, this approach might give you a performance boost (reduced copying) even for the cases where the size is not known ahead of time. We may not even need to add a new reserve method to the class, as we could have the class mmap buffer-sized chunks, and the code could adjust the buffer size when the size is known.....

avl added a comment.Nov 20 2020, 2:09 AM

Not really. :( For resize, I would expect that it has a physical effect on the underlying object (changing file size), which is good, I guess. But I would still expect that it is possible to write more data than that size, and have it be appended, as that's what our other streams do. I would also expect this operation to also have some effect on raw_string_ostream (changing the underlying string size) and such. And I am still worried about using mmap(2)/write(2) interchangeably. The two apis have different characteristics, and I don't think that can be conveyed by a single method call. E.g., the mmap approach will not respect O_APPEND, it will cause SIGBUS if the file is concurrently truncated (that's why our mmap APIs like MemoryBuffer have IsVolatile arguments), etc.

I see. So the better solution would be a resizable raw_mmap_ostream, which could be used in that scenario (when the size is not known at creation time):

raw_mmap_ostream Out();
WriteToStreamLibrayFunc (Out);

TBH, I am not sure that raw_ostream is the best class for this use case. How much of its functionality do you need? If you don't need the formatted output capabilities, then maybe a different interface might be better suited. (I realize you're trying to remove one.)

I think I do not need the formatted output capabilities. They could be nice to have in some cases. But the main functionality which is useful is the possibility to write data as a sequence of pieces, i.e. not as one huge chunk, but as a series of smaller chunks.

Speaking of the reserve() method: do you think it would be useful to implement it in the sense of a hint? The only practical implementation at the moment would be reserving the underlying string in raw_string_ostream. It might be useful for the future raw_mmap_ostream, but that would depend on the implementation.

// Declared override in the raw_string_ostream class definition:
void raw_string_ostream::reserve(uint64_t Size) {
  OS.reserve(Size);
}
In D91693#2407604, @avl wrote:

Not really. :( For resize, I would expect that it has a physical effect on the underlying object (changing file size), which is good, I guess. But I would still expect that it is possible to write more data than that size, and have it be appended, as that's what our other streams do. I would also expect this operation to also have some effect on raw_string_ostream (changing the underlying string size) and such. And I am still worried about using mmap(2)/write(2) interchangeably. The two apis have different characteristics, and I don't think that can be conveyed by a single method call. E.g., the mmap approach will not respect O_APPEND, it will cause SIGBUS if the file is concurrently truncated (that's why our mmap APIs like MemoryBuffer have IsVolatile arguments), etc.

I see. So the better solution would be a resizable raw_mmap_ostream, which could be used in that scenario (when the size is not known at creation time):

Yes, *I* would think so. You may want to gather additional opinions before spending too much time on that, though...

TBH, I am not sure that raw_ostream is the best class for this use case. How much of its functionality do you need? If you don't need the formatted output capabilities, then maybe a different interface might be better suited. (I realize you're trying to remove one.)

I think I do not need the formatted output capabilities. They could be nice to have in some cases. But the main functionality which is useful is the possibility to write data as a sequence of pieces, i.e. not as one huge chunk, but as a series of smaller chunks.

Yeah, I suppose that's reasonable. Though, if the scope of this is small (e.g.: it only needs to write to files _or_ memory buffers, and it's not going to have a lot of callers/large surface area), I would not completely dismiss some custom solution either...

Speaking of the reserve() method: do you think it would be useful to implement it in the sense of a hint? The only practical implementation at the moment would be reserving the underlying string in raw_string_ostream. It might be useful for the future raw_mmap_ostream, but that would depend on the implementation.

I don't think that would be *un*reasonable, but I'd wait until a use case for it shows up.

avl added a comment.Nov 20 2020, 5:37 AM

Yeah, I suppose that's reasonable. Though, if the scope of this is small (e.g.: it only needs to write to files _or_ memory buffers, and it's not going to have a lot of callers/large surface area), I would not completely dismiss some custom solution either...

This work is being done exactly because the scope for a custom solution has become wider. There is review D88827, which tries to move the core implementation of llvm-objcopy into the Object library. So there is a request to avoid using a custom solution in favor of a more standard one. That is why I am trying to replace the custom llvm-objcopy solution (D91028).

I don't think that would be *un*reasonable, but I'd wait until a use case for it shows up.

There is such a use case in D91028

SmallVector<char, 0> Buffer;
raw_svector_ostream MemStream(Buffer);

if (Error E = executeObjcopyOnBinary(Config, **ObjOrErr, MemStream))
  return E;
In D91693#2407967, @avl wrote:

Yeah, I suppose that's reasonable. Though, if the scope of this is small (e.g.: it only needs to write to files _or_ memory buffers, and it's not going to have a lot of callers/large surface area), I would not completely dismiss some custom solution either...

This work is being done exactly because the scope for a custom solution has become wider. There is review D88827, which tries to move the core implementation of llvm-objcopy into the Object library. So there is a request to avoid using a custom solution in favor of a more standard one. That is why I am trying to replace the custom llvm-objcopy solution (D91028).

Ok, fair enough.

I don't think that would be *un*reasonable, but I'd wait until a use case for it shows up.

There is such a use case in D91028

SmallVector<char, 0> Buffer;
raw_svector_ostream MemStream(Buffer);

if (Error E = executeObjcopyOnBinary(Config, **ObjOrErr, MemStream))
  return E;

Seems reasonable, then. I would like to get a second opinion though..

In D91693#2407967, @avl wrote:

Yeah, I suppose that's reasonable. Though, if the scope of this is small (e.g.: it only needs to write to files _or_ memory buffers, and it's not going to have a lot of callers/large surface area), I would not completely dismiss some custom solution either...

This work is being done exactly because the scope for a custom solution has become wider. There is review D88827, which tries to move the core implementation of llvm-objcopy into the Object library. So there is a request to avoid using a custom solution in favor of a more standard one. That is why I am trying to replace the custom llvm-objcopy solution (D91028).

Ok, fair enough.

I don't think that would be *un*reasonable, but I'd wait until a use case for it shows up.

There is such a use case in D91028

SmallVector<char, 0> Buffer;
raw_svector_ostream MemStream(Buffer);

if (Error E = executeObjcopyOnBinary(Config, **ObjOrErr, MemStream))
  return E;

Seems reasonable, then. I would like to get a second opinion though..

@jhenderson mind chiming in here? If you and @labath find agreement with the direction, that seems good to me.

I slightly lost track of what people are suggesting, so instead, I'm deliberately taking a step back from my knowledge of the existing code here, to outline an idealised theoretical design, so this might not work quite as planned. To me, it would make more sense for executeObjcopyOnBinary to take a filename and do the writing internally, rather than receiving some kind of input buffer that it writes to. This then means you should know the size up-front before writing to the buffer, if I'm not mistaken, allowing you to set the size of your buffer up front. That in turn removes the need for any resize/reserve method - you just specify the size needed at the construction of the buffer. You might therefore have two different stream classes being used, one for in-memory writes and the other for file writing. The two of them share a common base class, so only a small part of the code requires knowing what buffer kind to create. The in-memory version would just take a size (and possibly some backing storage), whilst the other takes a size and file name and creates a file, possibly via memory mapping, that ultimately is that size. It would be an assertion in these cases to write past the end of the file.

Ultimately, I think the only new code needed would be a new constructor overload for raw_svector_ostream taking the eventual output size, and a new raw_ostream subclass which does the same with a file behind it.
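
A hypothetical in-class sketch of the raw_svector_ostream constructor overload described above; no such overload exists in LLVM, this only illustrates the idea:

// Hypothetical overload: take the expected output size and reserve the
// backing SmallVector up front, delegating to the existing constructor.
explicit raw_svector_ostream(SmallVectorImpl<char> &O, size_t ExpectedSize)
    : raw_svector_ostream(O) {
  OS.reserve(ExpectedSize);
}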

avl added a comment.Nov 25 2020, 6:53 AM

@jhenderson Summary of proposed approach:

  1. The current solution, which uses Buffer/MemoryBuffer/FileBuffer, assumes that the buffer for the whole file is pre-allocated. This means the whole file has to be loaded into memory. For a library, that might be an inconvenient requirement; some tools might want the possibility to reduce memory usage.

    Using streams allows us to reduce memory usage. There is no need to load all data into memory - the data can be streamed through a smaller buffer.
  2. Passing a file name to executeObjcopyOnBinary might be inconvenient if we would like to be able to replace the destination:

executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In, StringRef OutName);

i.e. when we would like to write the data not to a file but to memory, or just discard the output,
or calculate a hash of the output, we might want to replace the destination. That would be hard
if only a file name can be specified. In contrast, the following design:

executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In, raw_ostream &Out);

allows various kinds of destinations:

SmallVector<char, 0> Buffer;
raw_svector_ostream MemStream(Buffer);
executeObjcopyOnBinary(Config, In, MemStream);

raw_fd_ostream Out(OutputFilename);
executeObjcopyOnBinary(Config, In, Out);

raw_null_ostream Out;
executeObjcopyOnBinary(Config, In, Out);

raw_sha1_ostream Out;
executeObjcopyOnBinary(Config, In, Out);
  3. The current objcopy implementation assumes that the data is already pre-allocated. Thus, to get the advantage of using streams, it should be rewritten so that streams are used directly (without creating intermediate buffers).
  4. If llvm-objcopy needs to use memory-mapped files, then we might implement a resizable raw_mmap_ostream. Though I do not know at the moment whether an efficient implementation of a resizable memory-mapped file is possible.
  5. The fact that llvm-objcopy knows the size of the resulting data could be used to make the implementation a bit more efficient. We could call reserve() on the streams, and more efficient buffers could be used:
SmallVector<char, 0> Buffer;
raw_svector_ostream MemStream(Buffer);
executeObjcopyOnBinary(Config, In, MemStream);

executeObjcopyOnBinary(Config, In, Out) {
  Out.reserve(XX);   /// <<< calls Buffer.reserve(XX);
}

Finally, we can support all of the current functionality and have the advantages of using streams.
So my current suggestion is:

  1. use raw_ostream for the output parameter of the executeObjcopyOnBinary interface;
  2. add a reserve() method to raw_ostream, so that streams can optimize their internal buffers when possible.

What do you think?

In D91693#2416252, @avl wrote:

@jhenderson Summary of proposed approach:

  1. The current solution, which uses Buffer/MemoryBuffer/FileBuffer, assumes that the buffer for the whole file is pre-allocated. This means the whole file has to be loaded into memory. For a library, that might be an inconvenient requirement; some tools might want the possibility to reduce memory usage.

    Using streams allows us to reduce memory usage. There is no need to load all data into memory - the data can be streamed through a smaller buffer.

It's not immediately clear to me which data you're concerned about loading into memory, but it's worth remembering that objcopy will need to have all the data (or at least its size) in its internal representation before it can write most bits, at least for ELF objcopy, because the section header offset field is part of the ELF header, but the section header table itself is at the end.

  2. Passing a file name to executeObjcopyOnBinary might be inconvenient if we would like to be able to replace the destination:

This can be solved by overloading:

// In all examples, the types of the first two parameters are arbitrarily chosen; the third parameter, when present, is the important one.

// Execute objcopy then write to file Out.
Error executeObjcopyOnBinary(const CopyConfig &Config, const Binary &In, StringRef Out) {
  // Do some objcopy stuff to finalise the objcopy layout.
  Object Obj = executeObjcopy(Config, In);
  size_t Size = Obj.getOutputSize();
  raw_mmap_ostream OS(Out, Size);
  writeOutput(OS, Obj);
}

// Execute objcopy then write to raw_ostream Out. raw_mmap_ostream won't be usable here, since the output size isn't known up front.
Error executeObjcopyOnBinary(const CopyConfig &Config, const Binary &In, raw_ostream &Out) {
  Object Obj = executeObjcopy(Config, In);
  writeOutput(Out, Obj);
}

// If a user wanted to write the output to an existing raw_mmap_ostream, they'd need to call the lower-level API directly:
void aFunction(...) {
  ...
  Object Obj = executeObjcopy(Config, In);
  size_t Size = Obj.getOutputSize();
  ...
  raw_mmap_ostream Out(FileName, Size + ...);
  ...
  writeOutput(Out, Obj);
}

It's worth noting that we cannot have Expected<raw_ostream &> executeObjcopyOnBinary(const CopyConfig &Config, const Binary &In); where the function itself creates the stream and returns it, because something needs to own the backing storage.

  3. The current objcopy implementation assumes that the data is already pre-allocated. Thus, to get the advantage of using streams, it should be rewritten so that streams are used directly (without creating intermediate buffers).

I don't think it's possible to start writing to a stream before the size of the data is known (see above). Does that mean streams are still justified?

  4. If llvm-objcopy needs to use memory-mapped files, then we might implement a resizable raw_mmap_ostream. Though I do not know at the moment whether an efficient implementation of a resizable memory-mapped file is possible.

I don't think resizeable memory-mapped files are useful in this context, because the size needs to be known before writing anyway. The exception is if we think calling the objcopy guts is too complicated for users that want to write to an existing memory mapped file, but I think it is possible to keep it simple, like in my example above.

  5. The fact that llvm-objcopy knows the size of the resulting data could be used to make the implementation a bit more efficient. We could call reserve() on the streams, and more efficient buffers could be used:

This is merely a nice-to-have, and not itself directly related to the issue at hand. It can be implemented later. People who want the efficiency bonus of reserving could use the same approach as I did in the above example with the adding to an existing memory mapped file, by calling the lower-level API directly. They will usually have access to their underlying backing storage anyway (e.g. the std::vector) and so can just reserve it directly.

avl added a comment.Nov 27 2020, 5:47 AM

It's not immediately clear to me which data you're concerned about loading into memory, but it's worth remembering that objcopy will need to have all the data (or at least its size) in its internal representation before it can write most bits, at least for ELF objcopy, because the section header offset field is part of the ELF header, but the section header table itself is at the end.

I mean the bits of the resulting object, e.g. the output ELF file data.

My understanding is that this (having all the data in the internal representation) might not be
necessary for other formats, e.g. MachO/Wasm. The following is the existing code for MachO:

Expected<std::unique_ptr<MemoryBuffer>>
object::writeUniversalBinaryToBuffer(ArrayRef<Slice> Slices) {
  SmallVector<char, 0> Buffer;
  raw_svector_ostream Out(Buffer);

  if (Error E = writeUniversalBinaryToStream(Slices, Out))  <<< data is written to the memory stream
    return std::move(E);
    
  return std::make_unique<SmallVectorMemoryBuffer>(std::move(Buffer));
}    
......    
  Expected<std::unique_ptr<MemoryBuffer>> B =
      writeUniversalBinaryToBuffer(Slices);
  if (!B)
    return B.takeError();
  if (Error E = Out.allocate((*B)->getBufferSize()))
    return E;
  memcpy(Out.getBufferStart(), (*B)->getBufferStart(), (*B)->getBufferSize());  
  ^^^^^^^^^
  data is copied into the output buffer

Using a stream as the destination would allow us to avoid the intermediate memory object,
i.e. if it is not necessary to update the file header after the end of the file is written, or if that size
can be precalculated at the start, then we do not need to load all the file data into
internal buffers.

For the ELF case, there would be three alternatives in the streams solution (*):

  1. If the section header offset could somehow be precalculated, then we are lucky and do not need to update it after the file is written.
  2. We could store the file data in a memory buffer, update the field, and then copy this buffer into the destination stream.
  3. Use raw_pwrite_stream as the destination. That would allow us to update the section header offset after the file is written.

So we would still be able to handle the ELF case successfully, and we would not require loading the whole file into memory for the other cases.

This can be solved by overloading:

// In all examples, the types of the first two parameters are arbitrarily chosen; the third parameter, when present, is the important one.

// Execute objcopy then write to file Out.
Error executeObjcopyOnBinary(const CopyConfig &Config, const Binary &In, StringRef Out) {
  // Do some objcopy stuff to finalise the objcopy layout.
  Object Obj = executeObjcopy(Config, In);
  size_t Size = Obj.getOutputSize();
  raw_mmap_ostream OS(Out, Size);
  writeOutput(OS, Obj);
}

// Execute objcopy then write to raw_ostream Out. raw_mmap_ostream won't be usable here, since the output size isn't known up front.
Error executeObjcopyOnBinary(const CopyConfig &Config, const Binary &In, raw_ostream &Out) {
  Object Obj = executeObjcopy(Config, In);
  writeOutput(Out, Obj);
}

// If a user wanted to write the output to an existing raw_mmap_ostream, they'd need to call the lower-level API directly:
void aFunction(...) {
  ...
  Object Obj = executeObjcopy(Config, In);
  size_t Size = Obj.getOutputSize();
  ...
  raw_mmap_ostream Out(FileName, Size + ...);
  ...
  writeOutput(Out, Obj);
}

It's worth noting that we cannot have Expected<raw_ostream &> executeObjcopyOnBinary(const CopyConfig &Config, const Binary &In); where the function itself creates the stream and returns it, because something needs to own the backing storage.

In this design we have a new entity "Object" in the interface:

Object Obj = executeObjcopy(Config, In);

This new "Object" would force additional copying, e.g. the following usage would become pointless:

void aFunction(...) {
  ...
  Object Obj = executeObjcopy(Config, In);
  size_t Size = Obj.getOutputSize();
  ...
  raw_mmap_ostream Out(FileName, Size + ...);
  ...
  writeOutput(Out, Obj); <<< data should be copied from Object Obj into the raw_mmap_ostream Out;
}

The advantage of using a memory-mapped file is that we could write directly to the memory owned by the memory-mapped file,
and that memory would be written back by the system. Using an intermediate Object would require us to do extra copying from that Object into the memory-mapped file. In the streams scenario we do not need that extra copy:

executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In, raw_ostream &Out) {
  Writer->finalize(Out); <<< calls Out.reserve(XXX);

  Writer->write(Out);  <<< writes directly to the memory mapped file data
}

void aFunction(...) {
  ...
  raw_mmap_ostream Out(FileName);
  executeObjcopy(Config, In, Out);
}

I don't think it's possible to start writing to a stream before the size of the data is known (see above). Does that mean streams are still justified?

My understanding is yes, streams are still justified; see (*) above.

I don't think resizeable memory-mapped files are useful in this context, because the size needs to be known before writing anyway. The exception is if we think calling the objcopy guts is too complicated for users that want to write to an existing memory mapped file, but I think it is possible to keep it simple, like in my example above.

But we already know the size inside objcopy before writing.
Currently we call Buf.allocate(totalSize()) before starting to write.
With the streams solution, we would call reserve(totalSize()) in that place.

This is merely a nice-to-have, and not itself directly related to the issue at hand. It can be implemented later. People who want the efficiency bonus of reserving could use the same approach as I did in the above example with the adding to an existing memory mapped file, by calling the lower-level API directly. They will usually have access to their underlying backing storage anyway (e.g. the std::vector) and so can just reserve it directly.

Yes, that could be implemented later.
But it would be required for the streams solution to match the performance of the current implementation.

avl added a comment.Dec 1 2020, 4:52 AM

@jhenderson James, what do you think of using streams as suggested by D91028 and D91693? Would it be useful? It seems it would reduce the amount of copied data.

In D91693#2425369, @avl wrote:

@jhenderson James, what do you think of using streams as suggested by D91028 and D91693? Would it be useful? It seems it would reduce the amount of copied data.

Hi @avl,

Honestly, I don't have the time to look at this process in detail, and refactoring things to support an objcopy library is not high on my priority list. I'm not convinced that your suggestions are actually going to be workable/useful in practice, if I'm honest, and have tried to outline my concerns already. My biggest concern is how do you stream an ELF header without already knowing where your section header table will live. If you know where your section header table will live, you have all the information you need for presizing your output buffer, so being able to reserve post stream creation becomes pointless. There's no need to read the entire input object file into memory either - llvm-objcopy doesn't do this already (note that generic sections that require no manipulation just use an ArrayRef to refer to the section contents).

I will give it more thought, but I cannot guarantee when I'll be able to come back to this.

avl added a comment.Dec 1 2020, 2:36 PM

Honestly, I don't have the time to look at this process in detail, and refactoring things to support an objcopy library is not high on my priority list. I'm not convinced that your suggestions are actually going to be workable/useful in practice, if I'm honest, and have tried to outline my concerns already.

Yes, thank you for your comments. I tried to explain why I think streams could still be used. Please check whether the argument is correct.

My biggest concern is how do you stream an ELF header without already knowing where your section header table will live.

Please check these three ways of doing it:

  1. Use a preliminary memory buffer (the ELF header can be updated after the sections are written):
executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In, raw_ostream &Out) {
  Buf = WritableMemoryBuffer::getNewMemBuffer(totalSize()); <<< allocate preliminary memory buffer
  
  // Following code stores result in the buffer Buf(that is how the current implementation works).
  // Segment data must be written first, so that the ELF header and program
  // header tables can overwrite it, if covered by a segment.
  writeSegmentData();
  writeEhdr();
  writePhdrs();
  if (Error E = writeSectionData())
    return E;
  if (WriteSectionHeaders)
    writeShdrs();

  Out.write(Buf->getBufferStart(), Buf->getBufferSize()); <<<<< copy data from internal buffer into output stream
  Out.flush();
  return Error::success();  
}

That scheme looks equivalent (in the sense of copied output data) to the solution you proposed earlier:

void aFunction(...) {
  ...
  Object Obj = executeObjcopy(Config, In);  <<<< allocate memory buffer and return result in it
  size_t Size = Obj.getOutputSize();
  ...
  raw_fd_ostream Out(FileName);
  ...
  writeOutput(Out, Obj);  <<< copy data from the memory buffer into output file
}
  2. Write directly into the stream:

Before writing starts, the overall output file size is already known.
It is calculated by ELFWriter<ELFT>::finalize(). Perhaps the place where the section header table will live could also be determined inside ELFWriter<ELFT>::finalize()? In that case the ELF header
could be written immediately in its correct form.

  3. If the above two variants are not good, then we could consider using raw_pwrite_stream for the destination:
executeObjcopyOnBinary(CopyConfig &Config, object::Binary &In, raw_pwrite_stream &Out) {

  // Following code stores result in the raw_pwrite_stream &Out.
  // Segment data must be written first, so that the ELF header and program
  // header tables can overwrite it, if covered by a segment.
  writeSegmentData();
  writeEhdr();
  writePhdrs();
  if (Error E = writeSectionData())
    return E;
  if (WriteSectionHeaders)
    writeShdrs();

  // seek to the ELF header and update it
  Out.seek(to ELF Header);
  Out.write(updated ELF header);

  return Error::success();  
}

if you know where your section header table will live, you have all the information you need for presizing your output buffer, so being able to reserve post stream creation becomes pointless.

Why does it become pointless?

There's no need to read the entire input object file into memory either - llvm-objcopy doesn't do this already (note that generic sections that require no manipulation just use an ArrayRef to refer to the section contents).

Right. I am not proposing to optimize the memory used for the input object file.
This proposal is about using streams, both to use standard classes and to optimize the memory used for the output file.

The following is an example where the entire output file is loaded into memory:

Expected<std::unique_ptr<MemoryBuffer>>
object::writeUniversalBinaryToBuffer(ArrayRef<Slice> Slices) {
  SmallVector<char, 0> Buffer;
  raw_svector_ostream Out(Buffer);

  if (Error E = writeUniversalBinaryToStream(Slices, Out))  <<< entire output file is loaded into the memory
    return std::move(E);
    
  return std::make_unique<SmallVectorMemoryBuffer>(std::move(Buffer));
}    
......    
  Expected<std::unique_ptr<MemoryBuffer>> B =
      writeUniversalBinaryToBuffer(Slices);
  if (!B)
    return B.takeError();
  if (Error E = Out.allocate((*B)->getBufferSize()))
    return E;
  memcpy(Out.getBufferStart(), (*B)->getBufferStart(), (*B)->getBufferSize());  
  ^^^^^^^^^
  data is copied from memory into the output buffer
In D91693#2425369, @avl wrote:

@jhenderson James, what do you think of using streams as suggested by D91028 and D91693? Would it be useful? It seems it would reduce the amount of copied data.

Hi @avl,

Honestly, I don't have the time to look at this process in detail, and refactoring things to support an objcopy library is not high on my priority list. I'm not convinced that your suggestions are actually going to be workable/useful in practice, if I'm honest, and have tried to outline my concerns already. My biggest concern is how do you stream an ELF header without already knowing where your section header table will live. If you know where your section header table will live, you have all the information you need for presizing your output buffer, so being able to reserve post stream creation becomes pointless.

I guess that depends on the API - if the API is "write this object to this stream" then that API implementation has a "Oh, I know how big my output is, I can pass that hint to the stream and, if it has use for that (such as presizing a memory mapped output file to back storage) it can do that" - if the API is defined more in terms of "write this object to the file specified by this file name", then, yeah, you can change the logic about how the file is opened in the first place - presize, open memory mapped, and use a MemoryBuffer instead of a stream, potentially.

But a stream API is more general - allows for streaming out to stdout or other places that can't be identified by a file name, for instance.

@avl - it might be this and the related patch/usage in llvm-objcopy deserve an llvm-dev design thread about the overall issues you're trying to solve and the design directions you've explored and the ones you are proposing.

avl added a comment.Dec 23 2020, 9:58 AM

@avl - it might be this and the related patch/usage in llvm-objcopy deserve an llvm-dev design thread about the overall issues you're trying to solve and the design directions you've explored and the ones you are proposing.

Thanks! Will start that thread right after holidays...

avl added a comment.Jan 19 2021, 10:31 AM

@avl - it might be this and the related patch/usage in llvm-objcopy deserve an llvm-dev design thread about the overall issues you're trying to solve and the design directions you've explored and the ones you are proposing.

https://lists.llvm.org/pipermail/llvm-dev/2021-January/147892.html

avl updated this revision to Diff 322122.Feb 8 2021, 8:30 AM

Removed usages of the memory-mapped file. Made the reserve() method
pre-allocate internal buffers only.

avl edited the summary of this revision. (Show Details)Feb 8 2021, 9:11 AM
dblaikie added inline comments.Feb 18 2021, 3:27 PM
llvm/include/llvm/Support/raw_ostream.h
656

Would it make sense for the reserve operation to be relative to where the stream has reached so far?

ie: for it to be implemented as:

void reserve(size_t ExtraCapacity) override { OS.reserve(OS.size() + ExtraCapacity); }

(similarly for other implementations)
Because the average caller probably shouldn't be thinking about how many bytes have already been written to the stream when deciding how much capacity they want for future write operations?
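
A small usage sketch of the relative form suggested here; appendRecord, the 8-byte header, and the relative reserve() itself are hypothetical, for illustration only:

void appendRecord(raw_ostream &OS, StringRef Payload) {
  // The caller only knows how much more it is about to write; it does not
  // need to ask how much has already been written to the stream.
  OS.reserve(Payload.size() + 8); // relative hint, per the suggestion above
  OS << "HEADER00";               // hypothetical 8-byte record header
  OS << Payload;
}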

avl added inline comments.Feb 19 2021, 7:00 AM
llvm/include/llvm/Support/raw_ostream.h
656

I think yes, it makes sense. In that case, it would probably be better to name the method reserveExtraSpace() (so that it looks different from std::vector::reserve()).

Also, I think it should not be calculated against OS.size(). It looks better to calculate it against current_pos(), because the callee can receive a stream whose current position is in the middle of the stream.

I will update the review accordingly (if there are no objections).

dblaikie added inline comments.Feb 19 2021, 12:22 PM
llvm/include/llvm/Support/raw_ostream.h
656

Happy to use current_pos, though it looks like it's == size anyway, right?

uint64_t raw_svector_ostream::current_pos() const { return OS.size(); }

& raw_string_ostream's member:

uint64_t current_pos() const override { return OS.size(); }
avl added inline comments.Feb 19 2021, 1:20 PM
llvm/include/llvm/Support/raw_ostream.h
656

Happy to use current_pos, though it looks like it's == size anyway, right?

Right. I was thinking of the general case where we might seek from the current position (like in raw_fd_ostream), so using current_pos() might be more general. But since we have specialized implementations in raw_svector_ostream/raw_string_ostream and there are no seek methods there, it is fine to use OS.size().

avl updated this revision to Diff 325488.Feb 22 2021, 10:34 AM
avl edited the summary of this revision. (Show Details)

Changed reserve() to reserveExtraSpace(). That allows pre-allocating
the internal stream buffers without knowing their current size.
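
The rough shape of the API after this update, paraphrased from the discussion above (not copied verbatim from the committed diff):

// In raw_ostream (base class): a hint that ExtraSize more bytes are about
// to be written; the default implementation does nothing.
virtual void reserveExtraSpace(uint64_t ExtraSize) {}

// In raw_string_ostream (and similarly raw_svector_ostream): forward the
// hint to the underlying container, relative to the current write position.
void reserveExtraSpace(uint64_t ExtraSize) override {
  OS.reserve(tell() + ExtraSize);
}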

dblaikie accepted this revision.Feb 22 2021, 1:19 PM
dblaikie added inline comments.
llvm/unittests/Support/raw_ostream_test.cpp
462 ↗(On Diff #325488)

maybe make this a bit more interesting by having a non-zero value for tell() here, otherwise there's no difference between reserve using an absolute size, and reserve using a tell-relative value.
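
A sketch of the kind of test being suggested, assuming the final reserveExtraSpace() name; this is illustrative, not the committed test:

TEST(raw_ostreamTest, ReserveExtraSpace) {
  std::string Str;
  raw_string_ostream OS(Str);
  OS << "abcd";             // make tell() non-zero first
  OS.reserveExtraSpace(10); // the hint is relative to the current position
  OS << "0123456789";
  EXPECT_EQ("abcd0123456789", OS.str());
}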

This revision is now accepted and ready to land.Feb 22 2021, 1:19 PM
This revision was automatically updated to reflect the committed changes.
avl added a comment.Feb 23 2021, 3:11 AM

Thanks! Updated the test accordingly.