This is an archive of the discontinued LLVM Phabricator instance.

Refactor bitcode reader to simplify control.
Abandoned, Public

Authored by kschimpf on Apr 1 2015, 2:32 PM.


Details

Reviewers

dschuff
filcab
• rafael
jvoung

Summary

Modifies the bitcode reader such that the same logic is used for
both memory buffers and data streams. The incremental parsing
was factored into startParse, continueParse, and finishParse.
All parses (incremental or non-incremental) begin with startParse.
Then zero (or more) calls to continueParse incrementally read more
input, picking up from where the last call left off. finishParse
materializes any additional parts, based on the flags passed to startParse.
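The three-phase control flow described in the summary can be pictured with a toy model. This is only a sketch under stated assumptions: `ToyReader`, its string "records", and the `bool` success convention are hypothetical stand-ins and not the patch's code (the real methods return `std::error_code`). Only the names startParse, continueParse, finishParse and the state names come from the review.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Toy stand-in for the reader. Only the three entry-point names and the
// state names echo the review; everything else is hypothetical.
class ToyReader {
public:
  enum State { AtStart, InsideModule, NoMoreInput, FinishedParse, ParseError };

  explicit ToyReader(std::vector<std::string> Input)
      : Records(std::move(Input)) {}

  // Records the flag and leaves AtStart. Returns true on success.
  bool startParse(bool MaterializeAll) {
    if (ParseState != AtStart)
      return fail();
    ShouldMaterializeAll = MaterializeAll;
    ParseState = InsideModule;
    return true;
  }

  // Reads at most one more record, resuming where the last call stopped.
  bool continueParse() {
    if (ParseState != InsideModule)
      return fail();
    if (NextUnread == Records.size()) {
      ParseState = NoMoreInput;
      return true;
    }
    Parsed.push_back(Records[NextUnread++]);
    return true;
  }

  // Drains the remaining input, then materializes according to the flag.
  bool finishParse() {
    while (ParseState == InsideModule)
      if (!continueParse())
        return false;
    if (ParseState != NoMoreInput)
      return fail();
    assert(NextUnread == Records.size());
    if (ShouldMaterializeAll)
      Materialized = Parsed;
    ParseState = FinishedParse;
    return true;
  }

  State state() const { return ParseState; }
  std::size_t parsedCount() const { return Parsed.size(); }
  std::size_t materializedCount() const { return Materialized.size(); }

private:
  // Any misuse moves the reader into the terminal error state.
  bool fail() {
    ParseState = ParseError;
    return false;
  }

  std::vector<std::string> Records;
  std::vector<std::string> Parsed, Materialized;
  std::size_t NextUnread = 0;
  bool ShouldMaterializeAll = false;
  State ParseState = AtStart;
};
```

A caller that wants everything at once calls startParse and then finishParse; a streaming caller can interleave any number of continueParse calls in between and still end with finishParse.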

Diff Detail

Event Timeline

kschimpf updated this revision to Diff 23088. Apr 1 2015, 2:32 PM

kschimpf retitled this revision to "Refactor bitcode reader to simplify control."

kschimpf updated this object.

kschimpf edited the test plan for this revision. (Show Details)

kschimpf added reviewers: dschuff, jvoung, • rafael.

kschimpf added a subscriber: Unknown Object (MLST).

A couple nits for now, but still trying to read through and understand what's happening in the streaming and lazy cases...

lib/Bitcode/Reader/BitcodeReader.cpp
275	http://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments says "\brief" instead of @brief
283	lowercase first letter of function name -- should probably do it for these new functions since you're touching it (but leave existing functions alone?)
288	\returns
397	lowercase first letter for new function (and I guess update the commit message if you do)
401	same
407	extra space in between updateParseState and (
409	In the review, I've tried to look for where ParseState gets set, and there are various ways to grep for that... one is ParseState = X... another is updateParseState(Y, ...), or updateParseState(Z); Is there any way you can make the number of variations smaller?
424	This is a bit weird to me... you have a bunch of functions called "updateParseState", but some variants modify ParseState and some variants don't.
2752	NextUnreadBit is no longer set -- wanted to check if that is okay now (and why)?
2823	NOte -> Note
3241	I don't quite understand how "ShouldMaterializeAll = false" is supposed to work for the streaming case, if this isn't checked until after: while (ParseState < NoMoreInput) { if (std::error_code EC = ContinueParse()) { return EC; } } How do you delay reading until materialize(GV) for streaming?
3245	This used to be early, in "getLazyBitcodeModuleImpl". This looks like it is now happening late in FinishParse after the loop until NoMoreInput. Why is this okay now? Was the early call to "materializeForwardReferencedFunctions" actually extraneous because of the call in materialize(GV), or what? Make sure to check the lazy case with blockaddresses for computed gotos, if there isn't already a unittest for that.
4473	no need for extra space

jvoung added inline comments. Apr 1 2015, 4:49 PM

lib/Bitcode/Reader/BitcodeReader.cpp
235	Be consistent about capitalization in comments: one line starts with "parsed input" and another line starts with "Parsed input" =)
257	Don't need explicit anymore, though that transition was a while back so not really related to this CL.
284	Variable name is different from comment "MaterializeAll" vs "ShouldMaterializeAll" -- make them the same? I see that in the actual definition you're trying to avoid conflicting with the field name...

kschimpf updated this object. Apr 6 2015, 10:04 AM

kschimpf edited edge metadata.

Merge branch 'master' of http://llvm.org/git/llvm into readfac1
Working version. Save state.
Cleanup startParse.
Cleaned up code.
Fix nit.
Merge branch 'master' of http://llvm.org/git/llvm into readfac1

Did all changes except adding a test that forward block address references get resolved on lazy loads.

lib/Bitcode/Reader/BitcodeReader.cpp
235	Done.
257	Done.
275	Sorry, my fault. I followed the syntax of ConstantPlaceHolder below. Changing to follow coding standards.
283	Good point. Fixing.
284	Fixing the names to be consistent. Fixing name conflict by prefixing assignments of field names with "this->".
288	Done.
397	Done.
401	Done.
409	I guess the first part of the problem is that there are 2 notions of parse state: The field ParseState that names the state of the parser, and NextUnreadBit which defines where to continue the parse on return (in case a function body gets parsed between calls). I also overloaded the return value with this update. Refactoring to do less and be more clear.
424	The refactoring is a bit better now. Hopefully good enough.
2823	Done.
3241	After talking to Derek, I realized what I was missing. I'll summarize what I understand: When streaming, we want to "return" as soon as possible, without having to force all bitcode to be scanned. This reduces the cost of (potential) blocking calls to the data streamer. Control can return to the caller without having completed the parse. However, the parsed portions must be consistent (i.e. forward block address references have been resolved). Based on this, I've modified the code to lift the call to materializeForwardReferencedFunctions into startParse.
3245	I agree that there should be some type of forward reference unit test to verify we can lazily evaluate these forward-referenced block addresses. I also agree that for the use by llvm-dis, my original code worked because it eventually calls materializeAllPermanently. I think that is why I was confused about the full expectations of "streamed" (or lazy) parsing.
4473	Done.

• rafael added inline comments. Apr 8 2015, 5:27 PM

include/llvm/Support/StreamingMemoryObject.h
75	Why do you need this? The streamer will return how many bytes were read and can handle a larger request. Also, why does it need to be part of this patch? It looks like this patch has many independent changes in it.
92	Why the extra logic? If objectsize is known it is the same as BytesRead, no?

kschimpf added inline comments. Apr 9 2015, 9:41 AM

include/llvm/Support/StreamingMemoryObject.h
75	This was added to handle the case of parsing a wrapped bitcode file. In such cases, you do not need to do another read (which may block until it succeeds). That was the intent of this change. However, a simpler approach would be to allow the extra read, and then not set ObjectSize (below) if already set. I will remove this change, and add the conditional assignment to ObjectSize below. I changed it in this CL because it didn't cause a problem until I fixed the bug that materializing a module (when streaming) didn't actually read all of the bitcode file. When that change was added, tests failed and this issue was exposed. I will remove the changes to StreamingMemoryObject.{h,cpp} and put them in a separate CL.
92	No, they aren't necessarily the same. The problem happens when you have a wrapped bitcode file, and was not exposed until I fixed the case that we weren't reading the entire bitcode when materializing lazily. Then, a bunch of test cases failed. When I looked into it, this is what I discovered: The wrapped bitcode was smaller than kChunkSize. Hence, the initial read set BytesRead to the size of the wrapped file. The wrapper was then read, and set ObjectSize, which was 4 bytes smaller than BytesRead. This is the reason I changed this file as I did.
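The BytesRead versus ObjectSize mismatch described above can be reproduced with a small model. This is a sketch under assumptions, not the real StreamingMemoryObject: `ToyStreamObject` and its two `readableBy...` helpers are hypothetical, the streamer is reduced to a byte count, and BytesSkipped is left out. The sizes 100 and 96 are made up to mirror the "4 bytes smaller" case.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>

// Same chunk size as in the diff; kept at namespace scope for simplicity.
constexpr std::size_t kChunkSize = 4096 * 4;

// Hypothetical model: the "streamer" is just a total byte count.
struct ToyStreamObject {
  std::size_t SourceSize;     // bytes the streamer can deliver in total
  std::size_t BytesRead = 0;  // bytes fetched so far
  std::size_t ObjectSize = 0; // 0 if unknown

  explicit ToyStreamObject(std::size_t Size) : SourceSize(Size) {}

  // Fetches up to one chunk; a short source is consumed in a single read.
  void fetchChunk() {
    assert(BytesRead <= SourceSize);
    BytesRead += std::min(kChunkSize, SourceSize - BytesRead);
  }

  // Called once the wrapper header reveals the size of the payload.
  void setKnownObjectSize(std::size_t Size) { ObjectSize = Size; }

  // Position check that trusts only the number of bytes fetched.
  bool readableByBytesRead(std::size_t Pos) const { return Pos < BytesRead; }

  // Position check that prefers the known object size when there is one.
  bool readableByObjectSize(std::size_t Pos) const {
    return Pos < (ObjectSize ? ObjectSize : BytesRead);
  }
};
```

With a 100-byte source and a known object size of 96, position 97 passes the first check and fails the second, which is the disagreement the reply describes.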

jvoung added inline comments. Apr 9 2015, 2:33 PM

lib/Bitcode/Reader/BitcodeReader.cpp
243	nit: This is usually 0 or 1, but it seems unexpected for this field to be named "NumModulesParsed", and yet have the type be "bool". Rename or change type?
424	Thanks -- this is better. For a while I was also wondering how many places need to be aware of setting the state to ParseError, but I think it's just continueParse() because most/all searching for bit position, etc. goes through that.
3132	Does this need to be cleanupOnError(EC) also?
3168	This covers two states? InsideModule and AtTopLevel? It might be more clear if you list them out so it's clear what the "break" corresponds to (AtTopLevel). Previously, the Stream.JumpToBit(NextUnreadBit); was only needed when InsideModule... is it now needed for AtTopLevel too?
3269	"NoMorInput" -> "NoMoreInput" It could also be that more states >= NoMoreInput are added as code evolves, but not handled here. Can the compiler accept/handle a "static_assert(ParseState < NoMoreInput, "...") to catch what happens if more states are added after NoMoreInput but not handled by this switch?
4530	Is this necessary at this point? Should that already be covered by the " // Iterate over the module, deserializing any functions that are still on disk" loop?
4534	The "promise" comment from "above" is removed now, so you could update this comment.
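On the question of catching states added later: `static_assert` needs a compile-time constant, so it cannot test the runtime value of ParseState, but it can pin down the enumerator order that a `<` comparison relies on. A switch with no `default` additionally lets Clang and GCC warn (via `-Wswitch`, part of `-Wall`) when an enumerator is left unhandled. The sketch below reuses the state names from the diff; `canParseMore` and `describe` are hypothetical helpers, not functions from the patch.

```cpp
#include <cassert>
#include <string>

// State names as listed in the diff; the helpers below are hypothetical.
enum BitcodeReaderState {
  AtStart,
  AtTopLevel,
  InsideModule,
  NoMoreInput,
  ReachedEof,
  FinishedParse,
  ParseError
};

// Compile-time check of the enumerator order that "<" comparisons rely on.
static_assert(InsideModule < NoMoreInput && NoMoreInput < ParseError,
              "parsing states must precede NoMoreInput");

// True while the state still allows more input to be parsed.
bool canParseMore(BitcodeReaderState S) { return S < NoMoreInput; }

std::string describe(BitcodeReaderState S) {
  // No default case: -Wswitch reports any enumerator that is left out.
  switch (S) {
  case AtStart:
    return "AtStart";
  case AtTopLevel:
    return "AtTopLevel";
  case InsideModule:
    return "InsideModule";
  case NoMoreInput:
    return "NoMoreInput";
  case ReachedEof:
    return "ReachedEof";
  case FinishedParse:
    return "FinishedParse";
  case ParseError:
    return "ParseError";
  }
  assert(false && "invalid BitcodeReaderState");
  return "invalid";
}
```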

Fix issues raised by jvoung, and remove changes now in D8907.

kschimpf added a parent revision: D8931: Add test showing error in StreamingMemoryObject.setKnownObjectSize(). Apr 9 2015, 3:37 PM

kschimpf added inline comments.

lib/Bitcode/Reader/BitcodeReader.cpp
243	Good catch. I did mean to use size_t. Fixing.
424	That is correct and was the intent. State updates (and bit positioning) are now intentionally localized to continueParse. The only exception is in ParseModule, which updates the state to record whether it returned without completing.
3132	Yes. Good catch. Fixing.
3168	The jumpToBit is needed because various "materialize" methods may be called between calls to continueParse. By forcing a jumpToBit to happen at all calls to continueParse, we no longer need to know where the materialize methods leave the bitcursor. While I did not see an example of an error caused by interleaved calls to materialize, I was very suspicious that they could occur, and wanted to make sure that this would not happen. Hence, I made sure that continueParse always resets the position to where it left off. I will fix to not use default, so that a corresponding warning will be generated if a new value is added to the enumeration.
3269	Fixed string. Also removed "default" case and made all states explicit. This will force a warning if a new state is added.
4508	Removing the comment about being after a function body. This is no longer true. A call to materializeMetadata would put us some place else in the bitcode file.
4530	In correct bitcode files, you are right. However, if the function doesn't define any function blocks, but (incorrectly) references function block addresses, this code will cause the error to be generated. However, looking at the following instruction, this is checked anyway. Removing.
4534	Done.
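The reasoning given for line 3168, that continueParse should always jump back to the saved position because materialize calls may move the cursor in between, can be illustrated with a toy cursor. Everything here is a hypothetical sketch: `ToyCursor`, `ToyParser`, `materializeAt`, and the fixed 32-bit record size are assumptions for illustration; only NextUnreadBit, jumpToBit, and continueParse echo names from the review.

```cpp
#include <cassert>
#include <cstdint>

// Minimal cursor: just a bit position that anyone can move.
struct ToyCursor {
  std::uint64_t Pos = 0;
  void jumpToBit(std::uint64_t Bit) { Pos = Bit; }
  void advance(std::uint64_t Bits) { Pos += Bits; }
};

class ToyParser {
  ToyCursor Stream;
  std::uint64_t NextUnreadBit = 0; // where incremental parsing left off

public:
  // Consumes one 32-bit "record" and returns the bit it started at.
  // The unconditional jump makes the result independent of whatever
  // moved the cursor since the previous call.
  std::uint64_t continueParse() {
    Stream.jumpToBit(NextUnreadBit);
    assert(Stream.Pos == NextUnreadBit);
    std::uint64_t Start = Stream.Pos;
    Stream.advance(32);
    NextUnreadBit = Stream.Pos;
    return Start;
  }

  // Stand-in for a materialize call that leaves the cursor somewhere else.
  void materializeAt(std::uint64_t Bit) {
    Stream.jumpToBit(Bit);
    Stream.advance(64);
  }
};
```

Interleaving `materializeAt` between calls does not change the sequence of start positions that `continueParse` returns (0, 32, 64, and so on), which is the property the reply is after.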

Fix issues in diff 23431.

Merge branch 'master' into readfac1
Fix issues raised by merge.
Merge branch 'master' of http://llvm.org/git/llvm into readfac1
Fix tests to use old-style parser.

Fix nits.

Now that the issues with the streaming memory object have been fixed, I have updated this CL for review.

Note that I added a CL flag "-old-lazy-bitcode-parser". This was done to deal with a bug fixed by this CL. That is, in the old code, when you materialized a module, it didn't check if there was any additional data in the bitcode file. The new code fixes this by calling "finishParse". However, there are a couple of (bitcode binary) tests that were generated with this violation. Hence, the flag was added to fix this problem.

I'm willing to remove this flag in either (1) a later review, or (2) in a later revision. However, for this review I made the issues explicit so that the problem can be seen.

In D8786#179791, @kschimpf wrote:

However, there are a couple of (bitcode binary) tests that were generated with this violation. Hence, the flag was added to fix this problem.

I'm willing to remove this flag in either (1) a later review, or (2) in a later revision. However, for this review I made the issues explicit so that the problem can be seen.

Let me know which ones have bugs. They might be easy-ish to reconstruct (especially since I have some additional practice in fiddling with bc files now). Can't promise to deal with them very quickly, though.

lib/Bitcode/Reader/BitcodeReader.cpp
293–294	Why the empty line?
306	http://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments ^^ This changed recently. Omit \brief if the brief description is just a sentence. (You might want to add the '.' at the end, though)
771	Omit \brief.
785	Omit \brief.
3190	We're already at top level, no? (line 3234) I might be missing something, but it looks like we're at top level, and saw an EndBlock. Shouldn't this be an error?

kschimpf added a reviewer: filcab. May 28 2015, 1:54 PM

Merge branch 'master' of http://llvm.org/git/llvm into readfac1
Fix issues raised by filcab.

Fixes based on feedback by Filipe.

lib/Bitcode/Reader/BitcodeReader.cpp
293–294	Removed.
771	Done.
785	Done.
3190	I agree that we should be at top level. I also agree that it appears weird that we allow extra (unmatched) EndBlocks. This has been allowed by the bitcode reader/writer for years. I just wasn't willing to make the leap that I should remove this. However, I tried removing it (and making it an error), and no tests failed. Hence, converting this to an error.
4521	Moved the iterating of functions to before the call to finishParse. This deals with the problems I was having with tests in test/Bitcode/invalid.test (i.e. Inputs/invalid-fwdref-type-mismatch-2.bc and Inputs/invalid-load-ptr-type.bc). These two files had multiple errors (the one they intended which was inside a function body, and the one probably not intended - extraneous stuff at the end of the bitcode file). This removes the need for the command line flag UseOldLazyBitcodeParse, and I have deleted it.

Hi Karl,

The files came straight from the fuzzer, so they are likely to have more than one error. If, in order to support them (where support is: keep the test working and diagnosing what we want), you have to change the code in a convoluted way, I would prefer to change the test.

If the change is minimal and not a problem (doesn't impact legibility or architecture), then keeping the tests as they are is not a problem either. I just want to avoid having worse code just so we don't have to re-do some tests.

Of course, if we started crashing on the tests, that's a problem :-)

Thanks,

Filipe

(Phab butchered the comment in my email. Editing it to get a complete history in Phabricator)

Fix invalid bitcode tests with more than one error.
Merge branch 'master' into readfac1

In addition to moving the code back in materializeModule, I fixed three tests. Two of them were fuzz tests where I "truncated" the file to the end of the module block. The other test did not have anything after a bad abbreviation definition, and the code file was incomplete. So I generated a replacement test that was well structured otherwise (i.e. only had that one error in it).

include/llvm/Bitcode/BitstreamReader.h
328 ↗	(On Diff #26923)	This code fixes the state of the bit streamer when no more input is found. As a result, method AtEndOfStream now works correctly.
lib/Bitcode/Reader/BitcodeReader.cpp
3189–3190	Discovered that the Bitstream::EndBlock was "hiding" a bug in the bitstream reader when processing a data stream. That is, when using a data stream, the size is not set until after the eof is reached. Hence, when Stream.AtEndOfStream() was called above, it would return false even when at the eof. The actual problem was in FillCurWord, which did not set the bit position correctly when there was no more input. The old code worked because the read (at eof) would return zero, which is understood as an end block. By returning success for this value, it would hide this problem. I also improved the error message so that one can see where the reader thought the eof should be, if there is miscellaneous stuff at the end of the bitcode file. This makes it easier to know where to cut a test file in such cases.
4522	Now that the eof checking is fixed, I moved this back where it was in an earlier version of this CL.
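The eof issue described for lines 3189–3190 comes down to an ambiguity: a read that returns zero at the end of a stream looks the same as a real zero word unless the cursor also records that the input ran out. The sketch below models only that idea. `ToyWordCursor` is hypothetical, works on whole 32-bit words, and is not how FillCurWord or AtEndOfStream are actually implemented.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical cursor over a stream whose length is not known up front.
class ToyWordCursor {
public:
  explicit ToyWordCursor(std::vector<std::uint32_t> Input)
      : Words(std::move(Input)) {}

  // Returns the next word, or 0 when the stream has no more input. The zero
  // is ambiguous on its own, so running out of input is recorded in SawEnd.
  std::uint32_t read() {
    assert(Next <= Words.size());
    if (Next == Words.size()) {
      SawEnd = true;
      return 0;
    }
    return Words[Next++];
  }

  // Only becomes true once a read has actually run off the end, mirroring
  // a stream whose size is learned when eof is reached.
  bool atEndOfStream() const { return SawEnd; }

private:
  std::vector<std::uint32_t> Words;
  std::size_t Next = 0;
  bool SawEnd = false;
};
```

For the input {7, 0}, the second and third reads both return 0, but only the third flips `atEndOfStream()`, so a caller can tell a genuine zero word from the end of input.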

Hi Karl,

Really sorry for the delay.
LGTM on my part, as long as you add the test for the error message and do the fix.

Thank you,

Filipe

lib/Bitcode/Reader/BitcodeReader.cpp
772	Nit: Put more words on the first line.
776	Nit: If it's for docs, it's probably best to start with an uppercase letter.
3193	Thank you!
3197	errs()? Or StrBuf? Please also add a test for this error message.

Merge branch 'master' of http://llvm.org/git/llvm into readfac1
Fixes associated with review by Filipe.

Fixed issues raised by Filipe.

lib/Bitcode/Reader/BitcodeReader.cpp
772	Done.
776	Done.
3197	Good catch. I meant StrBuf, so that we can use the same API for all errors.

Applying the patch locally to take a better look.

lib/Bitcode/Reader/BitcodeReader.cpp
290–292	Please commit the pure cleanup bits first: using \ instead of @ starting functions with lowercase names.
3198	This looks a bit much to be honest. Corrupted files are not that common and it is trivial to set a breakpoint to find the state.
3199–3200	This is always 0 or 1. Use a boolean instead.

I got the following test failures locally:

LLVM :: Bitcode/invalid.test
LLVM :: tools/gold/invalid.ll

This CL has gotten a bit long and hard to read (too many versions). Moved to a new CL: http://reviews.llvm.org/D10518

lib/Bitcode/Reader/BitcodeReader.cpp
290–292	I assume this has already been done. The new CL doesn't have such cases anymore.
3198	Simplified in new CL to do same as before.
3199–3200	This was already fixed in master.

Revision Contents

Path	Size
include/llvm/Support/StreamingMemoryObject.h	38 lines
lib/Bitcode/Reader/BitcodeReader.cpp	309 lines
lib/Bitcode/Reader/BitstreamReader.cpp	1 line
lib/Support/StreamingMemoryObject.cpp	12 lines

Diff 23088

include/llvm/Support/StreamingMemoryObject.h

[49 lines hidden]	public:
void setKnownObjectSize(size_t size);		void setKnownObjectSize(size_t size);

private:		private:
const static uint32_t kChunkSize = 4096 * 4;		const static uint32_t kChunkSize = 4096 * 4;
mutable std::vector<unsigned char> Bytes;		mutable std::vector<unsigned char> Bytes;
std::unique_ptr<DataStreamer> Streamer;		std::unique_ptr<DataStreamer> Streamer;
mutable size_t BytesRead; // Bytes read from stream		mutable size_t BytesRead; // Bytes read from stream
size_t BytesSkipped;// Bytes skipped at start of stream (e.g. wrapper/header)		size_t BytesSkipped;// Bytes skipped at start of stream (e.g. wrapper/header)
mutable size_t ObjectSize; // 0 if unknown, set if wrapper seen or EOF reached		mutable size_t ObjectSize; // 0 if unknown, set if wrapper seen or end of
mutable bool EOFReached;		// object reached.
		mutable bool EOOReached; // end of object reached.
// Fetch enough bytes such that Pos can be read or EOF is reached
// (i.e. BytesRead > Pos). Return true if Pos can be read.		// Fetch enough bytes such that Pos can be read or end of object is
// Unlike most of the functions in BitcodeReader, returns true on success.		// reached (i.e. BytesRead > Pos). Note: EOF sets end of object if
// Most of the requests will be small, but we fetch at kChunkSize bytes		// not already defined. Returns true if Pos can be read. Unlike
// at a time to avoid making too many potentially expensive GetBytes calls		// most of the functions in BitcodeReader, returns true on success.
		// Most of the requests will be small, but we fetch at kChunkSize
		// bytes at a time to avoid making too many potentially expensive
		// GetBytes calls
bool fetchToPos(size_t Pos) const {		bool fetchToPos(size_t Pos) const {
if (EOFReached)		if (EOOReached)
return Pos < ObjectSize;		return Pos < ObjectSize;

while (Pos >= BytesRead) {		while (Pos >= BytesRead) {
Bytes.resize(BytesRead + BytesSkipped + kChunkSize);		size_t NextChunkSize = kChunkSize;
		if (ObjectSize && ObjectSize < BytesRead + kChunkSize) {
		if (BytesRead >= ObjectSize) {
		EOOReached = true;
		return false;
		}
		NextChunkSize = ObjectSize - BytesRead;
		}
		Bytes.resize(BytesRead + BytesSkipped + NextChunkSize);
size_t bytes = Streamer->GetBytes(&Bytes[BytesRead + BytesSkipped],		size_t bytes = Streamer->GetBytes(&Bytes[BytesRead + BytesSkipped],
kChunkSize);		NextChunkSize);
BytesRead += bytes;		BytesRead += bytes;
if (bytes != kChunkSize) { // reached EOF/ran out of bytes		if (bytes == 0) { // reached EOF/ran out of bytes
ObjectSize = BytesRead;		ObjectSize = BytesRead;
EOFReached = true;		EOOReached = true;
break;		break;
}		}
}		}
return Pos < BytesRead;		return (Pos < BytesRead) || (ObjectSize && Pos < ObjectSize);
}		}

StreamingMemoryObject(const StreamingMemoryObject&) = delete;		StreamingMemoryObject(const StreamingMemoryObject&) = delete;
void operator=(const StreamingMemoryObject&) = delete;		void operator=(const StreamingMemoryObject&) = delete;
};		};

MemoryObject *getNonStreamedMemoryObject(		MemoryObject *getNonStreamedMemoryObject(
const unsigned char *Start, const unsigned char *End);		const unsigned char *Start, const unsigned char *End);

}		}
#endif // STREAMINGMEMORYOBJECT_H_		#endif // STREAMINGMEMORYOBJECT_H_

lib/Bitcode/Reader/BitcodeReader.cpp

[129 lines hidden]	public:
void AssignValue(Metadata *MD, unsigned Idx);		void AssignValue(Metadata *MD, unsigned Idx);
void tryToResolveCycles();		void tryToResolveCycles();
};		};

class BitcodeReader : public GVMaterializer {		class BitcodeReader : public GVMaterializer {
LLVMContext &Context;		LLVMContext &Context;
DiagnosticHandlerFunction DiagnosticHandler;		DiagnosticHandlerFunction DiagnosticHandler;
Module *TheModule;		Module *TheModule;
		// The following two fields define the type of memory to parse.
std::unique_ptr<MemoryBuffer> Buffer;		std::unique_ptr<MemoryBuffer> Buffer;
		DataStreamer *Streamer;
std::unique_ptr<BitstreamReader> StreamFile;		std::unique_ptr<BitstreamReader> StreamFile;
BitstreamCursor Stream;		BitstreamCursor Stream;
DataStreamer *LazyStreamer;
uint64_t NextUnreadBit;
bool SeenValueSymbolTable;		bool SeenValueSymbolTable;

std::vector<Type*> TypeList;		std::vector<Type*> TypeList;
BitcodeReaderValueList ValueList;		BitcodeReaderValueList ValueList;
BitcodeReaderMDValueList MDValueList;		BitcodeReaderMDValueList MDValueList;
std::vector<Comdat *> ComdatList;		std::vector<Comdat *> ComdatList;
SmallVector<Instruction *, 64> InstructionList;		SmallVector<Instruction *, 64> InstructionList;

[55 lines hidden]	class BitcodeReader : public GVMaterializer {
/// for a more compact encoding. Some instruction operands are not		/// for a more compact encoding. Some instruction operands are not
/// relative to the instruction ID: basic block numbers, and types.		/// relative to the instruction ID: basic block numbers, and types.
/// Once the old style function blocks have been phased out, we would		/// Once the old style function blocks have been phased out, we would
/// not need this flag.		/// not need this flag.
bool UseRelativeIDs;		bool UseRelativeIDs;

/// True if all functions will be materialized, negating the need to process		/// True if all functions will be materialized, negating the need to process
/// (e.g.) blockaddress forward references.		/// (e.g.) blockaddress forward references.
bool WillMaterializeAllForwardRefs;		bool WillMaterializeAllForwardRefs = false;

/// Functions that have block addresses taken. This is usually empty.		/// Functions that have block addresses taken. This is usually empty.
SmallPtrSet<const Function *, 4> BlockAddressesTaken;		SmallPtrSet<const Function *, 4> BlockAddressesTaken;

/// True if any Metadata block has been materialized.		/// True if any Metadata block has been materialized.
bool IsMetadataMaterialized;		bool IsMetadataMaterialized = false;

		/// True if the module is materialized.
		bool IsModuleMaterialized = false;

		/// True if meta data should initially be skipped.
		bool ShouldLazyLoadMetadata = false;

		/// True if everything should materialize all before finishing parsing.
		bool ShouldMaterializeAll = false;

		/// The state of the parse.
		enum BitcodeReaderState {
		AtStart,
		AtTopLevel, // Processing top-level records.
		InsideModule, // processing records inside a module block.
		// All states below here represent cases where input shouldn't be parsed.
		NoMoreInput, // Generic marker for having parsed input.
		ReachedEof, // parsed input, but not necessary materializations.
		FinishedParse, // Parsed input and materialized necessary parts.
		ParseError, // An error has occurred, stop parsing.
		} ParseState = AtStart;

		/// The position (within the bitcode) where parsing left off when
		/// incrementally parsing.
		uint64_t NextUnreadBit = 0;

		/// The number of modules read at the top level.
		bool NumModulesParsed = 0;

bool StripDebugInfo = false;		bool StripDebugInfo = false;

public:		public:
std::error_code Error(BitcodeError E, const Twine &Message);		std::error_code Error(BitcodeError E, const Twine &Message);
std::error_code Error(BitcodeError E);		std::error_code Error(BitcodeError E);
std::error_code Error(const Twine &Message);		std::error_code Error(const Twine &Message);

explicit BitcodeReader(MemoryBuffer *buffer, LLVMContext &C,		explicit BitcodeReader(MemoryBuffer *Buffer, LLVMContext &C,
DiagnosticHandlerFunction DiagnosticHandler);		DiagnosticHandlerFunction DiagnosticHandler);
explicit BitcodeReader(DataStreamer *streamer, LLVMContext &C,		explicit BitcodeReader(DataStreamer *Streamer, LLVMContext &C,
DiagnosticHandlerFunction DiagnosticHandler);		DiagnosticHandlerFunction DiagnosticHandler);
~BitcodeReader() { FreeState(); }		~BitcodeReader() { FreeState(); }

std::error_code materializeForwardReferencedFunctions();		std::error_code materializeForwardReferencedFunctions();

void FreeState();		void FreeState();

void releaseBuffer();		void releaseBuffer();

bool isDematerializable(const GlobalValue *GV) const override;		bool isDematerializable(const GlobalValue *GV) const override;
std::error_code materialize(GlobalValue *GV) override;		std::error_code materialize(GlobalValue *GV) override;
std::error_code MaterializeModule(Module *M) override;		std::error_code MaterializeModule(Module *M) override;
std::vector<StructType *> getIdentifiedStructTypes() const override;		std::vector<StructType *> getIdentifiedStructTypes() const override;
void Dematerialize(GlobalValue *GV) override;		void Dematerialize(GlobalValue *GV) override;

/// @brief Main interface to parsing a bitcode buffer.		/// @brief Starts an incremental parse for module M. Reads enough to
		/// define global values. The flags define what should happen before
		/// finishing the parse. That is:
		/// ShouldMaterializeAll: When true, the module should be materialized
		/// completely. Otherwise, function bodies are only loaded on demand.
		/// ShouldLazyLoadMetadata: When true, the metadata blocks should be
		/// parsed.
/// @returns true if an error occurred.		/// @returns true if an error occurred.
std::error_code ParseBitcodeInto(Module *M,		std::error_code StartParse(Module *M,
		bool MaterializeAll,
bool ShouldLazyLoadMetadata = false);		bool ShouldLazyLoadMetadata = false);

		/// @brief Parses bitcode. Materializes based on flags.
		/// @returns true if an error occurred.
		jvoung: \returns
		kschimpf: Done.
		std::error_code ParseBitcodeInto(Module *M,
		bool ShouleMaterializeAll,
		bool ShouldLazyLoadMetadata);

		rafael: Please commit the pure cleanup bits first: using \ instead of @, and starting functions with lowercase names.
		kschimpf: I assume this has already been done. The new CL doesn't have such cases anymore.

/// @brief Cheap mechanism to just extract module triple		/// @brief Cheap mechanism to just extract module triple
		filcab: Why the empty line?
		kschimpf: Removed.
/// @returns true if an error occurred.		/// @returns true if an error occurred.
ErrorOr<std::string> parseTriple();		ErrorOr<std::string> parseTriple();

static uint64_t decodeSignRotatedValue(uint64_t V);		static uint64_t decodeSignRotatedValue(uint64_t V);

/// Materialize any deferred Metadata block.		/// Materialize any deferred Metadata block.
std::error_code materializeMetadata() override;		std::error_code materializeMetadata() override;

void setStripDebugInfo() override;		void setStripDebugInfo() override;

private:		private:
std::vector<StructType *> IdentifiedStructTypes;		std::vector<StructType *> IdentifiedStructTypes;
		filcab: http://llvm.org/docs/CodingStandards.html#doxygen-use-in-documentation-comments ^^ This changed recently. Omit \brief if the brief description is just a sentence. (You might want to add the '.' at the end, though)
StructType *createIdentifiedStructType(LLVMContext &Context, StringRef Name);		StructType *createIdentifiedStructType(LLVMContext &Context, StringRef Name);
StructType *createIdentifiedStructType(LLVMContext &Context);		StructType *createIdentifiedStructType(LLVMContext &Context);

Type *getTypeByID(unsigned ID);		Type *getTypeByID(unsigned ID);
Value getFnValueByID(unsigned ID, Type Ty) {		Value getFnValueByID(unsigned ID, Type Ty) {
if (Ty && Ty->isMetadataTy())		if (Ty && Ty->isMetadataTy())
return MetadataAsValue::get(Ty->getContext(), getFnMetadataByID(ID));		return MetadataAsValue::get(Ty->getContext(), getFnMetadataByID(ID));
return ValueList.getValueFwdRef(ID, Ty);		return ValueList.getValueFwdRef(ID, Ty);
[72 lines not shown]	Value *getValueSigned(SmallVectorImpl<uint64_t> &Record, unsigned Slot,
if (Slot == Record.size()) return nullptr;		if (Slot == Record.size()) return nullptr;
unsigned ValNo = (unsigned)decodeSignRotatedValue(Record[Slot]);		unsigned ValNo = (unsigned)decodeSignRotatedValue(Record[Slot]);
// Adjust the ValNo, if it was encoded relative to the InstNum.		// Adjust the ValNo, if it was encoded relative to the InstNum.
if (UseRelativeIDs)		if (UseRelativeIDs)
ValNo = InstNum - ValNo;		ValNo = InstNum - ValNo;
return getFnValueByID(ValNo, Ty);		return getFnValueByID(ValNo, Ty);
}		}

		// Continue incremental parse to next skipped block, or eof, whichever
		// comes first.
		std::error_code ContinueParse();
		jvoung: lowercase first letter for new function (and I guess update the commit message if you do)
		kschimpf: Done.

		// Finish the parse and then materialize based on flags passed to
		// StartParse().
		std::error_code FinishParse();
		jvoung: same
		kschimpf: Done.

		// The updateParseState methods update the parse state with the
		// given information, so that the next call to ContinueParse can
		// continue. Returns an error code to be returned by
		// ContinueParse().
		std::error_code &updateParseState (BitcodeReaderState NewValue,
		jvoung: extra space in between updateParseState and (
		std::error_code &EC) {
		ParseState = EC ? ParseError : NewValue;
		jvoung: In the review, I've tried to look for where ParseState gets set, and there are various ways to grep for that... one is ParseState = X... another is updateParseState(Y, ...), or updateParseState(Z); Is there any way you can make the number of variations smaller?
		kschimpf: I guess the first part of the problem is that there are 2 notions of parse state: 1) the field ParseState, which names the state of the parser, and 2) NextUnreadBit, which defines where to continue the parse on return (in case a function body gets parsed between calls). I also overloaded the return value with this update. Refactoring to do less and be more clear.
		NextUnreadBit = Stream.GetCurrentBitNo();
		return EC;
		}

		std::error_code updateParseState(std::error_code EC) {
		return updateParseState(ParseState, EC);
		}

		std::error_code updateParseState(BitcodeReaderState NewValue) {
		ParseState = NewValue;
		NextUnreadBit = Stream.GetCurrentBitNo();
		return std::error_code();
		}

		std::error_code updateParseState() {
		jvoung: This is a bit weird to me... you have a bunch of functions called "updateParseState" but some variants modify ParseState and some variants don't.
		kschimpf: The refactoring is a bit better now. Hopefully good enough.
		jvoung: Thanks -- this is better. For a while I was also wondering how many places need to be aware of setting the state to ParseError, but I think it's just continueParse() because most/all searching for bit position, etc. goes through that.
		kschimpf: That is correct and was the intent. State updates (and bit positioning) are intentionally now localized to continueParse. The only exception is in ParseModule, which updates the state to indicate whether it returned without completing.
		NextUnreadBit = Stream.GetCurrentBitNo();
		return std::error_code();
		}
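
Note on the thread above: one way to address the "some variants modify ParseState and some don't" concern is to give the two notions of parse state (the state field and the resume position) separately named helpers. The following is only a minimal sketch under that assumption; `ParseProgress`, `saveResumePoint`, and `setState` are hypothetical names and are not part of this patch or of LLVM.

```cpp
#include <cassert>
#include <cstdint>
#include <system_error>

// Hypothetical stand-in for the reader's state enumeration.
enum class ReaderState { AtStart, AtTopLevel, InsideModule, ReachedEof, ParseError };

// Hypothetical holder for the two notions of parse state discussed above.
struct ParseProgress {
  ReaderState State = ReaderState::AtStart;
  uint64_t NextUnreadBit = 0;

  // Only records where the next call should resume; never touches State.
  void saveResumePoint(uint64_t CurrentBitNo) { NextUnreadBit = CurrentBitNo; }

  // Only changes State. A non-zero error code always wins over NewState.
  std::error_code setState(ReaderState NewState, std::error_code EC = {}) {
    State = EC ? ReaderState::ParseError : NewState;
    return EC;
  }
};
```

With this split, a grep for `setState` would find every place the state changes, and a grep for `saveResumePoint` would find every place the resume position moves.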

/// Converts alignment exponent (i.e. power of two (or zero)) to the		/// Converts alignment exponent (i.e. power of two (or zero)) to the
/// corresponding alignment to use. If alignment is too large, returns		/// corresponding alignment to use. If alignment is too large, returns
/// a corresponding error code.		/// a corresponding error code.
std::error_code parseAlignmentValue(uint64_t Exponent, unsigned &Alignment);		std::error_code parseAlignmentValue(uint64_t Exponent, unsigned &Alignment);
std::error_code ParseAttrKind(uint64_t Code, Attribute::AttrKind *Kind);		std::error_code ParseAttrKind(uint64_t Code, Attribute::AttrKind *Kind);
std::error_code ParseModule(bool Resume, bool ShouldLazyLoadMetadata = false);		std::error_code ParseModule();
std::error_code ParseAttributeBlock();		std::error_code ParseAttributeBlock();
std::error_code ParseAttributeGroupBlock();		std::error_code ParseAttributeGroupBlock();
std::error_code ParseTypeTable();		std::error_code ParseTypeTable();
std::error_code ParseTypeTableBody();		std::error_code ParseTypeTableBody();

std::error_code ParseValueSymbolTable();		std::error_code ParseValueSymbolTable();
std::error_code ParseConstants();		std::error_code ParseConstants();
std::error_code RememberAndSkipFunctionBody();		std::error_code RememberAndSkipFunctionBody();
[49 lines not shown]

static DiagnosticHandlerFunction getDiagHandler(DiagnosticHandlerFunction F,		static DiagnosticHandlerFunction getDiagHandler(DiagnosticHandlerFunction F,
LLVMContext &C) {		LLVMContext &C) {
if (F)		if (F)
return F;		return F;
return [&C](const DiagnosticInfo &DI) { C.diagnose(DI); };		return [&C](const DiagnosticInfo &DI) { C.diagnose(DI); };
}		}

BitcodeReader::BitcodeReader(MemoryBuffer *buffer, LLVMContext &C,		BitcodeReader::BitcodeReader(MemoryBuffer *Buffer, LLVMContext &C,
DiagnosticHandlerFunction DiagnosticHandler)		DiagnosticHandlerFunction DiagnosticHandler)
: Context(C), DiagnosticHandler(getDiagHandler(DiagnosticHandler, C)),		: Context(C), DiagnosticHandler(getDiagHandler(DiagnosticHandler, C)),
TheModule(nullptr), Buffer(buffer), LazyStreamer(nullptr),		TheModule(nullptr), Buffer(Buffer), Streamer(nullptr),
NextUnreadBit(0), SeenValueSymbolTable(false), ValueList(C),		SeenValueSymbolTable(false), ValueList(C),
MDValueList(C), SeenFirstFunctionBody(false), UseRelativeIDs(false),		MDValueList(C), SeenFirstFunctionBody(false), UseRelativeIDs(false) {}
WillMaterializeAllForwardRefs(false), IsMetadataMaterialized(false) {}

BitcodeReader::BitcodeReader(DataStreamer *streamer, LLVMContext &C,		BitcodeReader::BitcodeReader(DataStreamer *Streamer, LLVMContext &C,
DiagnosticHandlerFunction DiagnosticHandler)		DiagnosticHandlerFunction DiagnosticHandler)
: Context(C), DiagnosticHandler(getDiagHandler(DiagnosticHandler, C)),		: Context(C), DiagnosticHandler(getDiagHandler(DiagnosticHandler, C)),
TheModule(nullptr), Buffer(nullptr), LazyStreamer(streamer),		TheModule(nullptr), Buffer(nullptr), Streamer(Streamer),
NextUnreadBit(0), SeenValueSymbolTable(false), ValueList(C),		SeenValueSymbolTable(false), ValueList(C),
MDValueList(C), SeenFirstFunctionBody(false), UseRelativeIDs(false),		MDValueList(C), SeenFirstFunctionBody(false), UseRelativeIDs(false) {}
WillMaterializeAllForwardRefs(false), IsMetadataMaterialized(false) {}

std::error_code BitcodeReader::materializeForwardReferencedFunctions() {		std::error_code BitcodeReader::materializeForwardReferencedFunctions() {
if (WillMaterializeAllForwardRefs)		if (WillMaterializeAllForwardRefs)
return std::error_code();		return std::error_code();

// Prevent recursion.		// Prevent recursion.
WillMaterializeAllForwardRefs = true;		WillMaterializeAllForwardRefs = true;

[242 lines not shown]	static void UpgradeDLLImportExportLinkage(llvm::GlobalValue *GV, unsigned Val) {
switch (Val) {		switch (Val) {
case 5: GV->setDLLStorageClass(GlobalValue::DLLImportStorageClass); break;		case 5: GV->setDLLStorageClass(GlobalValue::DLLImportStorageClass); break;
case 6: GV->setDLLStorageClass(GlobalValue::DLLExportStorageClass); break;		case 6: GV->setDLLStorageClass(GlobalValue::DLLExportStorageClass); break;
}		}
}		}

namespace llvm {		namespace llvm {
namespace {		namespace {
/// @brief A class for maintaining the slot number definition		/// @brief A class for maintaining the slot number definition
		filcab: Omit \brief.
		kschimpf: Done.
/// as a placeholder for the actual definition for forward constants defs.		/// as a placeholder for the actual definition for forward constants defs.
		filcab: Nit: Put more words on the first line.
		kschimpf: Done.
class ConstantPlaceHolder : public ConstantExpr {		class ConstantPlaceHolder : public ConstantExpr {
void operator=(const ConstantPlaceHolder &) = delete;		void operator=(const ConstantPlaceHolder &) = delete;
public:		public:
// allocate space for exactly one operand		// allocate space for exactly one operand
		filcab: Nit: If it's for docs, it's probably best to start with an uppercase letter.
		kschimpf: Done.
void *operator new(size_t s) {		void *operator new(size_t s) {
return User::operator new(s, 1);		return User::operator new(s, 1);
}		}
explicit ConstantPlaceHolder(Type *Ty, LLVMContext& Context)		explicit ConstantPlaceHolder(Type *Ty, LLVMContext& Context)
: ConstantExpr(Ty, Instruction::UserOp1, &Op<0>(), 1) {		: ConstantExpr(Ty, Instruction::UserOp1, &Op<0>(), 1) {
Op<0>() = UndefValue::get(Type::getInt32Ty(Context));		Op<0>() = UndefValue::get(Type::getInt32Ty(Context));
}		}

/// @brief Methods to support type inquiry through isa, cast, and dyn_cast.		/// @brief Methods to support type inquiry through isa, cast, and dyn_cast.
		filcab: Omit \brief.
		kschimpf: Done.
static bool classof(const Value *V) {		static bool classof(const Value *V) {
return isa<ConstantExpr>(V) &&		return isa<ConstantExpr>(V) &&
cast<ConstantExpr>(V)->getOpcode() == Instruction::UserOp1;		cast<ConstantExpr>(V)->getOpcode() == Instruction::UserOp1;
}		}


/// Provide fast operand accessors		/// Provide fast operand accessors
DECLARE_TRANSPARENT_OPERAND_ACCESSORS(Value);		DECLARE_TRANSPARENT_OPERAND_ACCESSORS(Value);
[1,941 lines not shown]	std::error_code BitcodeReader::GlobalCleanup() {

// Force deallocation of memory for these vectors to favor the client that		// Force deallocation of memory for these vectors to favor the client that
// want lazy deserialization.		// want lazy deserialization.
std::vector<std::pair<GlobalVariable*, unsigned> >().swap(GlobalInits);		std::vector<std::pair<GlobalVariable*, unsigned> >().swap(GlobalInits);
std::vector<std::pair<GlobalAlias*, unsigned> >().swap(AliasInits);		std::vector<std::pair<GlobalAlias*, unsigned> >().swap(AliasInits);
return std::error_code();		return std::error_code();
}		}

std::error_code BitcodeReader::ParseModule(bool Resume,		std::error_code BitcodeReader::ParseModule() {
bool ShouldLazyLoadMetadata) {		if (ParseState == AtTopLevel) {
if (Resume)		if (Stream.EnterSubBlock(bitc::MODULE_BLOCK_ID))
Stream.JumpToBit(NextUnreadBit);
else if (Stream.EnterSubBlock(bitc::MODULE_BLOCK_ID))
return Error("Invalid record");		return Error("Invalid record");
		ParseState = InsideModule;
		} else {
		assert(ParseState == InsideModule);
		}

SmallVector<uint64_t, 64> Record;		SmallVector<uint64_t, 64> Record;
std::vector<std::string> SectionTable;		std::vector<std::string> SectionTable;
std::vector<std::string> GCTable;		std::vector<std::string> GCTable;

// Read all the records for this module.		// Read all the records for this module.
while (1) {		while (1) {
BitstreamEntry Entry = Stream.advance();		BitstreamEntry Entry = Stream.advance();

switch (Entry.Kind) {		switch (Entry.Kind) {
case BitstreamEntry::Error:		case BitstreamEntry::Error:
return Error("Malformed block");		return Error("Malformed block");
case BitstreamEntry::EndBlock:		case BitstreamEntry::EndBlock:
		ParseState = AtTopLevel;
return GlobalCleanup();		return GlobalCleanup();

case BitstreamEntry::SubBlock:		case BitstreamEntry::SubBlock:
switch (Entry.ID) {		switch (Entry.ID) {
default: // Skip unknown content.		default: // Skip unknown content.
if (Stream.SkipBlock())		if (Stream.SkipBlock())
return Error("Invalid record");		return Error("Invalid record");
break;		break;
[41 lines not shown]	case BitstreamEntry::SubBlock:
std::reverse(FunctionsWithBodies.begin(), FunctionsWithBodies.end());		std::reverse(FunctionsWithBodies.begin(), FunctionsWithBodies.end());
if (std::error_code EC = GlobalCleanup())		if (std::error_code EC = GlobalCleanup())
return EC;		return EC;
SeenFirstFunctionBody = true;		SeenFirstFunctionBody = true;
}		}

if (std::error_code EC = RememberAndSkipFunctionBody())		if (std::error_code EC = RememberAndSkipFunctionBody())
return EC;		return EC;
// For streaming bitcode, suspend parsing when we reach the function		// Suspend parsing when we reach a function body, assuming we
// bodies. Subsequent materialization calls will resume it when		// have already associated names with global values. NOte: If
		jvoung: NOte -> Note
		kschimpf: Done.
// necessary. For streaming, the function bodies must be at the end of		// the bitcode file is old, the symbol table will be at the
// the bitcode. If the bitcode file is old, the symbol table will be		// end instead and will not have been seen yet.
// at the end instead and will not have been seen yet. In this case,		if (SeenValueSymbolTable)
// just finish the parse now.
if (LazyStreamer && SeenValueSymbolTable) {
NextUnreadBit = Stream.GetCurrentBitNo();
jvoung: NextUnreadBit is no longer set -- wanted to check if that is okay now (and why)?
return std::error_code();		return std::error_code();
}
break;		break;
case bitc::USELIST_BLOCK_ID:		case bitc::USELIST_BLOCK_ID:
if (std::error_code EC = ParseUseLists())		if (std::error_code EC = ParseUseLists())
return EC;		return EC;
break;		break;
}		}
continue;		continue;

[224 lines not shown]	case bitc::MODULE_CODE_FUNCTION: {

ValueList.push_back(Func);		ValueList.push_back(Func);

// If this is a function with a body, remember the prototype we are		// If this is a function with a body, remember the prototype we are
// creating now, so that we can match up the body with them later.		// creating now, so that we can match up the body with them later.
if (!isProto) {		if (!isProto) {
Func->setIsMaterializable(true);		Func->setIsMaterializable(true);
FunctionsWithBodies.push_back(Func);		FunctionsWithBodies.push_back(Func);
if (LazyStreamer)
DeferredFunctionInfo[Func] = 0;		DeferredFunctionInfo[Func] = 0;
}		}
break;		break;
}		}
// ALIAS: [alias type, aliasee val#, linkage]		// ALIAS: [alias type, aliasee val#, linkage]
// ALIAS: [alias type, aliasee val#, linkage, visibility, dllstorageclass]		// ALIAS: [alias type, aliasee val#, linkage, visibility, dllstorageclass]
case bitc::MODULE_CODE_ALIAS: {		case bitc::MODULE_CODE_ALIAS: {
if (Record.size() < 3)		if (Record.size() < 3)
return Error("Invalid record");		return Error("Invalid record");
[32 lines not shown]	case bitc::MODULE_CODE_PURGEVALS:
ValueList.shrinkTo(Record[0]);		ValueList.shrinkTo(Record[0]);
break;		break;
}		}
Record.clear();		Record.clear();
}		}
}		}

std::error_code BitcodeReader::ParseBitcodeInto(Module *M,		std::error_code BitcodeReader::ParseBitcodeInto(Module *M,
		bool ShouldMaterializeAll,
bool ShouldLazyLoadMetadata) {		bool ShouldLazyLoadMetadata) {
TheModule = nullptr;		auto cleanupOnError = [&](std::error_code EC) {
		releaseBuffer(); // Never take ownership on error.
		return EC;
		};

		if (std::error_code EC =
		StartParse(M, ShouldMaterializeAll, ShouldLazyLoadMetadata))
		return cleanupOnError(EC);

		if (std::error_code EC = FinishParse())
		return cleanupOnError(EC);

		return std::error_code();
		}
		jvoung: Does this need to be cleanupOnError(EC) also?
		kschimpf: Yes. Good catch. Fixing.
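
Note on the thread above: the risk jvoung points at is that an individual `return EC;` can bypass the cleanup lambda. One way to make that harder to forget is to route every step through a single helper that owns the cleanup call. This is a minimal sketch of that idea only; `runWithCleanup` is a hypothetical helper, not something in this patch or in LLVM.

```cpp
#include <cassert>
#include <functional>
#include <system_error>
#include <vector>

// Hypothetical helper: runs each step in order. On the first failing step it
// calls Cleanup exactly once and returns that step's error, so no individual
// return site can skip the cleanup. Later steps are not run.
inline std::error_code
runWithCleanup(const std::vector<std::function<std::error_code()>> &Steps,
               const std::function<void()> &Cleanup) {
  for (const auto &Step : Steps) {
    if (std::error_code EC = Step()) {
      Cleanup();
      return EC;
    }
  }
  return std::error_code();
}
```

In the shape used by this diff, the steps would correspond to the start and finish calls and the cleanup to releasing the buffer; whether the extra indirection is worth it compared with the lambda is a judgment call.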

		std::error_code BitcodeReader::StartParse(Module *M,
		bool MaterializeAll,
		bool LazyLoadMetadata) {
		assert(ParseState == AtStart);
		TheModule = M;
		ShouldLazyLoadMetadata = LazyLoadMetadata;
		ShouldMaterializeAll = MaterializeAll;

if (std::error_code EC = InitStream())		if (std::error_code EC = InitStream())
return EC;		return EC;

// Sniff for the signature.		// Sniff for the signature.
if (Stream.Read(8) != 'B' \|\|		if (Stream.Read(8) != 'B' \|\|
Stream.Read(8) != 'C' \|\|		Stream.Read(8) != 'C' \|\|
Stream.Read(4) != 0x0 \|\|		Stream.Read(4) != 0x0 \|\|
Stream.Read(4) != 0xC \|\|		Stream.Read(4) != 0xC \|\|
Stream.Read(4) != 0xE \|\|		Stream.Read(4) != 0xE \|\|
Stream.Read(4) != 0xD)		Stream.Read(4) != 0xD)
return Error("Invalid bitcode signature");		return Error("Invalid bitcode signature");
		return ContinueParse();
		}

		std::error_code BitcodeReader::ContinueParse() {
		switch (ParseState) {
		case AtStart:
		ParseState = AtTopLevel;
		break;
		case ReachedEof:
		case FinishedParse:
		return updateParseState();
		case ParseError:
		return updateParseState(
		Error("Can't continue, bitcode error already found"));
		default:
		Stream.JumpToBit(NextUnreadBit);
		jvoung: This covers two states? InsideModule and AtTopLevel? It might be more clear if you list them out so it's clear what the "break" corresponds to (AtTopLevel). Previously, the Stream.JumpToBit(NextUnreadBit); was only needed when InsideModule... is it now needed for AtTopLevel too?
		kschimpf: The jumpToBit is needed because various "materialize" methods may be called between calls to continueParse. By forcing a jumpToBit to happen at all calls to continueParse, we no longer need to know where the materialize methods leave the bit cursor. While I did not see an example of an error caused by interleaved calls to materialize, I was very suspicious that they could occur, and wanted to make sure that this would not happen. Hence, I made sure that continueParse always resets the position to where it left off. I will fix to not use default, so that a corresponding warning will be generated if a new value is added to the enumeration.
		if (ParseState == InsideModule) {
		std::error_code EC = ParseModule();
		return updateParseState(EC);
		}
		break;
		}

// We expect a number of well-defined blocks, though we don't necessarily		// We expect a number of well-defined blocks, though we don't necessarily
// need to understand them all.		// need to understand them all.
while (1) {		while (1) {
if (Stream.AtEndOfStream())		if (Stream.AtEndOfStream())
return std::error_code();		return updateParseState(ReachedEof);

		assert(ParseState == AtTopLevel);
BitstreamEntry Entry =		BitstreamEntry Entry =
Stream.advance(BitstreamCursor::AF_DontAutoprocessAbbrevs);		Stream.advance(BitstreamCursor::AF_DontAutoprocessAbbrevs);

switch (Entry.Kind) {		switch (Entry.Kind) {
case BitstreamEntry::Error:		case BitstreamEntry::Error:
return Error("Malformed block");		return updateParseState(Error("Malformed block"));
case BitstreamEntry::EndBlock:		case BitstreamEntry::EndBlock:
return std::error_code();		return updateParseState(AtTopLevel);
		filcab: We're already at top level, no? (line 3234) I might be missing something, but it looks like we're at top level, and saw an EndBlock. Shouldn't this be an error?
		kschimpf: I agree that we should be at top level. I also agree that it appears weird that we allow extra (unmatched) EndBlocks. This has been allowed by the bitcode reader/writer for years. I just wasn't willing to make the leap that I should remove this. However, I tried removing it (and making it an error), and no tests failed. Hence, converting this to an error.
		kschimpf: Discovered that the BitstreamEntry::EndBlock case was "hiding" a bug in the bitstream reader when processing a data stream. That is, when using a data stream, the size is not set until after the eof is reached. Hence, when Stream.AtEndOfStream() was called above, it would return false even when at the eof. The actual problem was in FillCurWord, which did not set the bit position correctly when there was no more input. The old code worked because the read (at eof) would return zero, which is understood as an end block. By returning success for this value, it would hide this problem. I also improved the error message so that one can see where the reader thought the eof should be, if there is miscellaneous stuff at the end of the bitcode file. This makes it easier to know where to cut a test file in such cases.

case BitstreamEntry::SubBlock:		case BitstreamEntry::SubBlock:
switch (Entry.ID) {		switch (Entry.ID) {
		filcab: Thank you!
case bitc::BLOCKINFO_BLOCK_ID:		case bitc::BLOCKINFO_BLOCK_ID:
if (Stream.ReadBlockInfoBlock())		if (Stream.ReadBlockInfoBlock())
return Error("Malformed block");		return updateParseState(Error("Malformed block"));
break;		break;
		filcab: errs()? Or StrBuf? Please also add a test for this error message.
		kschimpf: Good catch. I meant StrBuf, so that we can use the same API for all errors.
case bitc::MODULE_BLOCK_ID:		case bitc::MODULE_BLOCK_ID:
		rafael: This looks a bit much to be honest. Corrupted files are not that common and it is trivial to set a breakpoint to find the state.
		kschimpf: Simplified in new CL to do the same as before.
// Reject multiple MODULE_BLOCK's in a single bitstream.		// Reject multiple MODULE_BLOCK's in a single bitstream.
if (TheModule)		if (NumModulesParsed++)
		rafael: This is always 0 or 1. Use a boolean instead.
		kschimpf: This was already fixed in master.
return Error("Invalid multiple blocks");		return updateParseState(Error("Invalid multiple blocks"));
TheModule = M;		if (std::error_code EC = ParseModule())
if (std::error_code EC = ParseModule(false, ShouldLazyLoadMetadata))		return updateParseState(EC);
return EC;		return updateParseState();
if (LazyStreamer)
return std::error_code();
break;		break;
default:		default:
if (Stream.SkipBlock())		if (Stream.SkipBlock())
return Error("Invalid record");		return updateParseState(Error("Invalid record"));
break;		break;
}		}
continue;		continue;
case BitstreamEntry::Record:		case BitstreamEntry::Record:
// There should be no records in the top-level of blocks.		// There should be no records in the top-level of blocks.

// The ranlib in Xcode 4 will align archive members by appending newlines		// The ranlib in Xcode 4 will align archive members by appending newlines
// to the end of them. If this file size is a multiple of 4 but not 8, we		// to the end of them. If this file size is a multiple of 4 but not 8, we
// have to read and ignore these final 4 bytes :-(		// have to read and ignore these final 4 bytes :-(
if (Stream.getAbbrevIDWidth() == 2 && Entry.ID == 2 &&		if (Stream.getAbbrevIDWidth() == 2 && Entry.ID == 2 &&
Stream.Read(6) == 2 && Stream.Read(24) == 0xa0a0a &&		Stream.Read(6) == 2 && Stream.Read(24) == 0xa0a0a &&
Stream.AtEndOfStream())		Stream.AtEndOfStream())
		return updateParseState();

		return updateParseState(Error("Invalid record"));
		}
		}
		}
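
Note on the ContinueParse threads above: two points from the discussion are (a) always re-seeking to the saved bit position on entry, so that interleaved materialize calls cannot leave the cursor somewhere unexpected, and (b) listing every state explicitly instead of using `default`, so that compilers with switch-coverage warnings flag a newly added state. The following is a minimal sketch of both points under simplified assumptions; `FakeCursor`, `ResumableReader`, and `prepareToContinue` are hypothetical and only model a bit position, not the real BitstreamCursor.

```cpp
#include <cassert>
#include <cstdint>
#include <system_error>

enum ParseStateKind { AtStart, AtTopLevel, InsideModule, ReachedEof, FinishedParse, ParseError };

// Hypothetical cursor that only tracks a bit position.
struct FakeCursor {
  uint64_t BitNo = 0;
  void jumpToBit(uint64_t B) { BitNo = B; }
};

// Hypothetical reader skeleton showing only the entry logic of a resumable parse.
struct ResumableReader {
  ParseStateKind State = AtStart;
  uint64_t NextUnreadBit = 0;
  FakeCursor Cursor;

  std::error_code prepareToContinue() {
    switch (State) { // Deliberately no default: every state is listed.
    case AtStart:
      State = AtTopLevel;
      return {};
    case AtTopLevel:
    case InsideModule:
      // Always restore the saved position, whatever moved the cursor since.
      Cursor.jumpToBit(NextUnreadBit);
      return {};
    case ReachedEof:
    case FinishedParse:
      return {}; // Nothing left to read.
    case ParseError:
      return std::make_error_code(std::errc::invalid_argument);
    }
    return {}; // Not reached for valid enumerators.
  }
};
```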

		std::error_code BitcodeReader::FinishParse() {
		if (ParseState == FinishedParse)
return std::error_code();		return std::error_code();

return Error("Invalid record");		while (ParseState < NoMoreInput) {
		if (std::error_code EC = ContinueParse()) {
		return EC;
}		}
}		}

		assert(TheModule);
		ParseState = FinishedParse;

		if (ShouldMaterializeAll) {
		jvoung: I don't quite understand how "ShouldMaterializeAll = false" is supposed to work for the streaming case, if this isn't checked until after: while (ParseState < NoMoreInput) { if (std::error_code EC = ContinueParse()) { return EC; } } How do you delay reading until materialize(GV) for streaming?
		kschimpf: After talking to Derek, I realized what the issue I was missing was. I'll summarize what I understand: When streaming, we want to "return" as soon as possible, without having to force all bitcode to be scanned. This reduces the cost of (potential) blocking calls to the data streamer. Control can return to the caller without having completed the parse. However, the parsed portions must be consistent (i.e. forward block address references have been resolved). Based on this, I've modified the code to lift the materializeForwardReferencedFunctions into startParse.
		if (std::error_code EC = MaterializeModule(TheModule))
		return EC;
		} else {
		if (std::error_code EC = materializeForwardReferencedFunctions())
		jvoung: This used to be early, in "getLazyBitcodeModuleImpl". This looks like it is now happening late in FinishParse after the loop until NoMoreInput. Why is this okay now? Was the early call to "materializeForwardReferencedFunctions" actually extraneous because of the call in materialize(GV), or what? Make sure to check the lazy case with blockaddresses for computed gotos, if there isn't already a unittest for that.
		kschimpf: I agree that there should be some type of forward reference unit test to verify we can lazy evaluate these forward-referenced block addresses. I also agree that for the use by llvm-dis, my original code worked because it eventually calls materializeAllPermanently. I think I may have been confused about the full expectation of "streamed" (or lazy) because of this.
		return EC;
		}

		return std::error_code();
}		}
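
Note on the FinishParse threads above: the intended division of labour, as described in the summary and in kschimpf's reply, is that each continue call does a bounded amount of work and returns control to the caller, while the finish call drains whatever input remains. The following is a minimal sketch of that calling pattern only; `ChunkedParser` is a hypothetical toy that counts blocks and does not model bitcode, materialization, or the data streamer.

```cpp
#include <cassert>
#include <cstddef>
#include <system_error>

// Hypothetical toy parser: "parsing" a block just decrements a counter.
class ChunkedParser {
public:
  explicit ChunkedParser(std::size_t NumBlocks) : Remaining(NumBlocks) {}

  // Consumes at most one block, then hands control back to the caller.
  std::error_code continueParse() {
    if (Remaining > 0) {
      --Remaining;
      ++BlocksParsed;
    }
    return std::error_code();
  }

  // Drains all remaining blocks by repeatedly calling continueParse().
  std::error_code finishParse() {
    while (Remaining > 0)
      if (std::error_code EC = continueParse())
        return EC;
    Finished = true;
    return std::error_code();
  }

  std::size_t blocksParsed() const { return BlocksParsed; }
  bool finished() const { return Finished; }

private:
  std::size_t Remaining;
  std::size_t BlocksParsed = 0;
  bool Finished = false;
};
```

A streaming client would call `continueParse()` only as far as it needs and defer `finishParse()`; a non-incremental client would call `finishParse()` immediately.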

ErrorOr<std::string> BitcodeReader::parseModuleTriple() {		ErrorOr<std::string> BitcodeReader::parseModuleTriple() {
if (Stream.EnterSubBlock(bitc::MODULE_BLOCK_ID))		if (Stream.EnterSubBlock(bitc::MODULE_BLOCK_ID))
return Error("Invalid record");		return Error("Invalid record");

SmallVector<uint64_t, 64> Record;		SmallVector<uint64_t, 64> Record;

std::string Triple;		std::string Triple;
// Read all the records for this module.		// Read all the records for this module.
while (1) {		while (1) {
BitstreamEntry Entry = Stream.advanceSkippingSubblocks();		BitstreamEntry Entry = Stream.advanceSkippingSubblocks();

switch (Entry.Kind) {		switch (Entry.Kind) {
case BitstreamEntry::SubBlock: // Handled for us already.		case BitstreamEntry::SubBlock: // Handled for us already.
case BitstreamEntry::Error:		case BitstreamEntry::Error:
return Error("Malformed block");		return Error("Malformed block");
case BitstreamEntry::EndBlock:		case BitstreamEntry::EndBlock:
return Triple;		return Triple;
case BitstreamEntry::Record:		case BitstreamEntry::Record:
		jvoung: "NoMorInput" -> "NoMoreInput". It could also be that more states >= NoMoreInput are added as code evolves, but not handled here. Can the compiler accept/handle a static_assert(ParseState < NoMoreInput, "...") to catch what happens if more states are added after NoMoreInput but not handled by this switch?
		kschimpf (Author): Fixed string. Also removed "default" case and made all states explicit. This will force a warning if a new state is added.
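The "no default case" approach from the reply above can be illustrated with a minimal standalone sketch. Only NoMoreInput is a state name taken from this review; the other enumerators and the describeState helper are hypothetical and are not part of the patch. With no default label, GCC's -Wswitch (enabled by -Wall) and Clang's -Wswitch report any enumerator the switch does not handle.

```cpp
#include <cassert>
#include <string>

// Hypothetical parse states; only NoMoreInput is named in the review.
enum class ParseState { NotStarted, InsideModule, NoMoreInput };

// Every enumerator has an explicit case and there is no `default:` label,
// so adding a new enumerator without a case triggers a -Wswitch warning.
std::string describeState(ParseState S) {
  switch (S) {
  case ParseState::NotStarted:
    return "NotStarted";
  case ParseState::InsideModule:
    return "InsideModule";
  case ParseState::NoMoreInput:
    return "NoMoreInput";
  }
  // Only reached if S holds a value outside the enumerators.
  return "Unknown";
}
```

The trailing return keeps the function well-defined for out-of-range values without reintroducing a default label that would silence the warning.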
// The interesting case.		// The interesting case.
break;		break;
}		}

// Read a record.		// Read a record.
switch (Stream.readRecord(Entry.ID, Record)) {		switch (Stream.readRecord(Entry.ID, Record)) {
default: break; // Default behavior, ignore unknown content.		default: break; // Default behavior, ignore unknown content.
case bitc::MODULE_CODE_TRIPLE: { // TRIPLE: [strchr x N]		case bitc::MODULE_CODE_TRIPLE: { // TRIPLE: [strchr x N]
[1,128 lines not shown]	OutOfRecordLoop:
return std::error_code();		return std::error_code();
}		}

/// Find the function body in the bitcode stream		/// Find the function body in the bitcode stream
std::error_code BitcodeReader::FindFunctionInStream(		std::error_code BitcodeReader::FindFunctionInStream(
Function *F,		Function *F,
DenseMap<Function *, uint64_t>::iterator DeferredFunctionInfoIterator) {		DenseMap<Function *, uint64_t>::iterator DeferredFunctionInfoIterator) {
while (DeferredFunctionInfoIterator->second == 0) {		while (DeferredFunctionInfoIterator->second == 0) {
if (Stream.AtEndOfStream())		if (ParseState < NoMoreInput) {
return Error("Could not find function in stream");		// Continue will parse the next body in the stream and set its
// ParseModule will parse the next body in the stream and set its
// position in the DeferredFunctionInfo map.		// position in the DeferredFunctionInfo map.
if (std::error_code EC = ParseModule(true))		if (std::error_code EC = ContinueParse())
return EC;		return EC;
		break;
		}
		return Error("Could not find function in stream");
}		}
return std::error_code();		return std::error_code();
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// GVMaterializer implementation		// GVMaterializer implementation
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

void BitcodeReader::releaseBuffer() { Buffer.release(); }		void BitcodeReader::releaseBuffer() { Buffer.release(); }

std::error_code BitcodeReader::materialize(GlobalValue *GV) {		std::error_code BitcodeReader::materialize(GlobalValue *GV) {
if (std::error_code EC = materializeMetadata())		if (std::error_code EC = materializeMetadata())
return EC;		return EC;

Function *F = dyn_cast<Function>(GV);		Function *F = dyn_cast<Function>(GV);
// If it's not a function or is already material, ignore the request.		// If it's not a function or is already material, ignore the request.
if (!F || !F->isMaterializable())		if (!F || !F->isMaterializable())
return std::error_code();		return std::error_code();

DenseMap<Function*, uint64_t>::iterator DFII = DeferredFunctionInfo.find(F);		DenseMap<Function*, uint64_t>::iterator DFII = DeferredFunctionInfo.find(F);
assert(DFII != DeferredFunctionInfo.end() && "Deferred function not found!");		assert(DFII != DeferredFunctionInfo.end() && "Deferred function not found!");
// If its position is recorded as 0, its body is somewhere in the stream		// If its position is recorded as 0, its body is somewhere in the stream
// but we haven't seen it yet.		// but we haven't seen it yet.
if (DFII->second == 0 && LazyStreamer)		if (DFII->second == 0)
if (std::error_code EC = FindFunctionInStream(F, DFII))		if (std::error_code EC = FindFunctionInStream(F, DFII))
return EC;		return EC;

// Move the bit stream to the saved position of the deferred function body.		// Move the bit stream to the saved position of the deferred function body.
Stream.JumpToBit(DFII->second);		Stream.JumpToBit(DFII->second);

if (std::error_code EC = ParseFunctionBody(F))		if (std::error_code EC = ParseFunctionBody(F))
return EC;		return EC;
[11 lines not shown]	if (I->first != I->second) {
if (CallInst* CI = dyn_cast<CallInst>(*UI++))		if (CallInst* CI = dyn_cast<CallInst>(*UI++))
UpgradeIntrinsicCall(CI, I->second);		UpgradeIntrinsicCall(CI, I->second);
}		}
}		}
}		}

// Bring in any functions that this function forward-referenced via		// Bring in any functions that this function forward-referenced via
// blockaddresses.		// blockaddresses.
return materializeForwardReferencedFunctions();		return materializeForwardReferencedFunctions();
		jvoung: No need for extra space.
		kschimpf (Author): Done.
}		}

bool BitcodeReader::isDematerializable(const GlobalValue *GV) const {		bool BitcodeReader::isDematerializable(const GlobalValue *GV) const {
const Function *F = dyn_cast<Function>(GV);		const Function *F = dyn_cast<Function>(GV);
if (!F || F->isDeclaration())		if (!F || F->isDeclaration())
return false;		return false;

// Dematerializing F would leave dangling references that wouldn't be		// Dematerializing F would leave dangling references that wouldn't be
[16 lines not shown]	void BitcodeReader::Dematerialize(GlobalValue *GV) {
F->dropAllReferences();		F->dropAllReferences();
F->setIsMaterializable(true);		F->setIsMaterializable(true);
}		}

std::error_code BitcodeReader::MaterializeModule(Module *M) {		std::error_code BitcodeReader::MaterializeModule(Module *M) {
assert(M == TheModule &&		assert(M == TheModule &&
"Can only Materialize the Module this BitcodeReader is attached to.");		"Can only Materialize the Module this BitcodeReader is attached to.");

if (std::error_code EC = materializeMetadata())		if (IsModuleMaterialized)
		return std::error_code();

		kschimpf (Author): Removing the comment about being after a function body. This is no longer true. A call to materializeMetadata would put us somewhere else in the bitcode file.
		// Set flag now so that FinishParse will not recursively apply this
		// function.
		IsModuleMaterialized = true;

		// At this point, if there are any function bodies, the current bit is
		// pointing to the END_BLOCK record after them. Now make sure the rest
		// of the bits in the module have been read.
		if (std::error_code EC = FinishParse())
return EC;		return EC;

// Promise to materialize all forward references.		if (std::error_code EC = materializeMetadata())
WillMaterializeAllForwardRefs = true;		return EC;

		kschimpf (Author): Moved the iteration over functions to before the call to finishParse. This deals with the problems I was having with tests in test/Bitcode/invalid.test (i.e. Inputs/invalid-fwdref-type-mismatch-2.bc and Inputs/invalid-load-ptr-type.bc). These two files had multiple errors (the intended one, which was inside a function body, and a probably unintended one: extraneous stuff at the end of the bitcode file). This removes the need for the command line flag UseOldLazyBitcodeParse, and I have deleted it.
// Iterate over the module, deserializing any functions that are still on		// Iterate over the module, deserializing any functions that are still on
		kschimpf (Author): Now that the eof checking is fixed, I moved this back where it was in an earlier version of this CL.
// disk.		// disk.
for (Module::iterator F = TheModule->begin(), E = TheModule->end();		for (Module::iterator F = TheModule->begin(), E = TheModule->end();
F != E; ++F) {		F != E; ++F) {
if (std::error_code EC = materialize(F))		if (std::error_code EC = materialize(F))
return EC;		return EC;
}		}
// At this point, if there are any function bodies, the current bit is
// pointing to the END_BLOCK record after them. Now make sure the rest		if (std::error_code EC = materializeForwardReferencedFunctions())
		jvoung: Is this necessary at this point? Should that already be covered by the "// Iterate over the module, deserializing any functions that are still on disk" loop?
		kschimpf (Author): In correct bitcode files, you are right. However, if the function doesn't define any function blocks, but (incorrectly) references function block addresses, this code will cause the error to be generated. However, looking at the following instruction, this is checked anyway. Removing.
// of the bits in the module have been read.		return EC;
if (NextUnreadBit)
ParseModule(true);

// Check that all block address forward references got resolved (as we		// Check that all block address forward references got resolved (as we
// promised above).		// promised above).
		jvoung: The "promise" comment from "above" is removed now, so you could update this comment.
		kschimpf (Author): Done.
if (!BasicBlockFwdRefs.empty())		if (!BasicBlockFwdRefs.empty())
return Error("Never resolved function from blockaddress");		return Error("Never resolved function from blockaddress");

// Upgrade any intrinsic calls that slipped through (should not happen!) and		// Upgrade any intrinsic calls that slipped through (should not happen!) and
// delete the old functions to clean up. We can't do this unless the entire		// delete the old functions to clean up. We can't do this unless the entire
// module is materialized because there could always be another function body		// module is materialized because there could always be another function body
// with calls to the old function.		// with calls to the old function.
for (std::vector<std::pair<Function*, Function*> >::iterator I =		for (std::vector<std::pair<Function*, Function*> >::iterator I =
[18 lines not shown]	std::error_code BitcodeReader::MaterializeModule(Module *M) {
return std::error_code();		return std::error_code();
}		}

std::vector<StructType *> BitcodeReader::getIdentifiedStructTypes() const {		std::vector<StructType *> BitcodeReader::getIdentifiedStructTypes() const {
return IdentifiedStructTypes;		return IdentifiedStructTypes;
}		}

std::error_code BitcodeReader::InitStream() {		std::error_code BitcodeReader::InitStream() {
if (LazyStreamer)		if (Streamer)
return InitLazyStream();		return InitLazyStream();
return InitStreamFromBuffer();		return InitStreamFromBuffer();
}		}

std::error_code BitcodeReader::InitStreamFromBuffer() {		std::error_code BitcodeReader::InitStreamFromBuffer() {
const unsigned char *BufPtr = (const unsigned char*)Buffer->getBufferStart();		const unsigned char *BufPtr = (const unsigned char*)Buffer->getBufferStart();
const unsigned char *BufEnd = BufPtr+Buffer->getBufferSize();		const unsigned char *BufEnd = BufPtr+Buffer->getBufferSize();

[10 lines not shown]	std::error_code BitcodeReader::InitStreamFromBuffer() {
Stream.init(&*StreamFile);		Stream.init(&*StreamFile);

return std::error_code();		return std::error_code();
}		}

std::error_code BitcodeReader::InitLazyStream() {		std::error_code BitcodeReader::InitLazyStream() {
// Check and strip off the bitcode wrapper; BitstreamReader expects never to		// Check and strip off the bitcode wrapper; BitstreamReader expects never to
// see it.		// see it.
auto OwnedBytes = llvm::make_unique<StreamingMemoryObject>(LazyStreamer);		auto OwnedBytes = llvm::make_unique<StreamingMemoryObject>(Streamer);
StreamingMemoryObject &Bytes = *OwnedBytes;		StreamingMemoryObject &Bytes = *OwnedBytes;
StreamFile = llvm::make_unique<BitstreamReader>(std::move(OwnedBytes));		StreamFile = llvm::make_unique<BitstreamReader>(std::move(OwnedBytes));
Stream.init(&*StreamFile);		Stream.init(&*StreamFile);

unsigned char buf[16];		unsigned char buf[16];
if (Bytes.readBytes(buf, 16, 0) != 16)		if (Bytes.readBytes(buf, 16, 0) != 16)
return Error("Invalid bitcode signature");		return Error("Invalid bitcode signature");

[51 lines not shown]	getLazyBitcodeModuleImpl(std::unique_ptr<MemoryBuffer> &&Buffer,
LLVMContext &Context, bool WillMaterializeAll,		LLVMContext &Context, bool WillMaterializeAll,
DiagnosticHandlerFunction DiagnosticHandler,		DiagnosticHandlerFunction DiagnosticHandler,
bool ShouldLazyLoadMetadata = false) {		bool ShouldLazyLoadMetadata = false) {
Module *M = new Module(Buffer->getBufferIdentifier(), Context);		Module *M = new Module(Buffer->getBufferIdentifier(), Context);
BitcodeReader *R =		BitcodeReader *R =
new BitcodeReader(Buffer.get(), Context, DiagnosticHandler);		new BitcodeReader(Buffer.get(), Context, DiagnosticHandler);
M->setMaterializer(R);		M->setMaterializer(R);

auto cleanupOnError = [&](std::error_code EC) {		// Delay parsing Metadata if ShouldLazyLoadMetadata is true.
		if (std::error_code EC =
		R->ParseBitcodeInto(M, WillMaterializeAll, ShouldLazyLoadMetadata)) {
R->releaseBuffer(); // Never take ownership on error.		R->releaseBuffer(); // Never take ownership on error.
delete M; // Also deletes R.		delete M; // Also deletes R.
return EC;		return EC;
};		}

// Delay parsing Metadata if ShouldLazyLoadMetadata is true.
if (std::error_code EC = R->ParseBitcodeInto(M, ShouldLazyLoadMetadata))
return cleanupOnError(EC);

if (!WillMaterializeAll)
// Resolve forward references from blockaddresses.
if (std::error_code EC = R->materializeForwardReferencedFunctions())
return cleanupOnError(EC);

Buffer.release(); // The BitcodeReader owns it now.		Buffer.release(); // The BitcodeReader owns it now.
return M;		return M;
}		}

ErrorOr<Module *>		ErrorOr<Module *>
llvm::getLazyBitcodeModule(std::unique_ptr<MemoryBuffer> &&Buffer,		llvm::getLazyBitcodeModule(std::unique_ptr<MemoryBuffer> &&Buffer,
LLVMContext &Context,		LLVMContext &Context,
DiagnosticHandlerFunction DiagnosticHandler,		DiagnosticHandlerFunction DiagnosticHandler,
bool ShouldLazyLoadMetadata) {		bool ShouldLazyLoadMetadata) {
return getLazyBitcodeModuleImpl(std::move(Buffer), Context, false,		return getLazyBitcodeModuleImpl(std::move(Buffer), Context, false,
DiagnosticHandler, ShouldLazyLoadMetadata);		DiagnosticHandler, ShouldLazyLoadMetadata);
}		}

ErrorOr<std::unique_ptr<Module>>		ErrorOr<std::unique_ptr<Module>>
llvm::getStreamedBitcodeModule(StringRef Name, DataStreamer *Streamer,		llvm::getStreamedBitcodeModule(StringRef Name, DataStreamer *Streamer,
LLVMContext &Context,		LLVMContext &Context,
DiagnosticHandlerFunction DiagnosticHandler) {		DiagnosticHandlerFunction DiagnosticHandler) {
std::unique_ptr<Module> M = make_unique<Module>(Name, Context);		std::unique_ptr<Module> M = make_unique<Module>(Name, Context);
BitcodeReader *R = new BitcodeReader(Streamer, Context, DiagnosticHandler);		BitcodeReader *R = new BitcodeReader(Streamer, Context, DiagnosticHandler);
M->setMaterializer(R);		M->setMaterializer(R);
if (std::error_code EC = R->ParseBitcodeInto(M.get()))		if (std::error_code EC = R->ParseBitcodeInto(M.get(), false, false))
return EC;		return EC;
return std::move(M);		return std::move(M);
}		}

ErrorOr<Module *>		ErrorOr<Module *>
llvm::parseBitcodeFile(MemoryBufferRef Buffer, LLVMContext &Context,		llvm::parseBitcodeFile(MemoryBufferRef Buffer, LLVMContext &Context,
DiagnosticHandlerFunction DiagnosticHandler) {		DiagnosticHandlerFunction DiagnosticHandler) {
std::unique_ptr<MemoryBuffer> Buf = MemoryBuffer::getMemBuffer(Buffer, false);		std::unique_ptr<MemoryBuffer> Buf = MemoryBuffer::getMemBuffer(Buffer, false);
ErrorOr<Module *> ModuleOrErr = getLazyBitcodeModuleImpl(		ErrorOr<Module *> ModuleOrErr = getLazyBitcodeModuleImpl(
std::move(Buf), Context, true, DiagnosticHandler);		std::move(Buf), Context, true, DiagnosticHandler);
if (!ModuleOrErr)		if (!ModuleOrErr)
return ModuleOrErr;		return ModuleOrErr;
Module *M = ModuleOrErr.get();		Module *M = ModuleOrErr.get();
// Read in the entire module, and destroy the BitcodeReader.
if (std::error_code EC = M->materializeAllPermanently()) {
delete M;
return EC;
}

// TODO: Restore the use-lists to the in-memory state when the bitcode was		// TODO: Restore the use-lists to the in-memory state when the bitcode was
// written. We must defer until the Module has been fully materialized.		// written. We must defer until the Module has been fully materialized.

return M;		return M;
}		}

std::string		std::string
[10 lines not shown]

lib/Bitcode/Reader/BitstreamReader.cpp

[332 lines not shown]	switch (readRecord(Entry.ID, Record)) {
Name += (char)Record[i];		Name += (char)Record[i];
CurBlockInfo->RecordNames.push_back(std::make_pair((unsigned)Record[0],		CurBlockInfo->RecordNames.push_back(std::make_pair((unsigned)Record[0],
Name));		Name));
break;		break;
}		}
}		}
}		}
}		}

lib/Support/StreamingMemoryObject.cpp

[81 lines not shown]	uint64_t StreamingMemoryObject::getExtent() const {
// keep fetching until we run out of bytes		// keep fetching until we run out of bytes
while (fetchToPos(pos)) pos += kChunkSize;		while (fetchToPos(pos)) pos += kChunkSize;
return ObjectSize;		return ObjectSize;
}		}

uint64_t StreamingMemoryObject::readBytes(uint8_t *Buf, uint64_t Size,		uint64_t StreamingMemoryObject::readBytes(uint8_t *Buf, uint64_t Size,
uint64_t Address) const {		uint64_t Address) const {
fetchToPos(Address + Size - 1);		fetchToPos(Address + Size - 1);
if (Address >= BytesRead)		if (Address >= BytesRead || (ObjectSize && Address >= ObjectSize))
return 0;		return 0;

uint64_t End = Address + Size;		uint64_t End = Address + Size;
if (End > BytesRead)		if (ObjectSize) {
		if (End > ObjectSize) {
		End = ObjectSize;
		}
		} else if (End > BytesRead)
End = BytesRead;		End = BytesRead;
assert(static_cast<int64_t>(End - Address) >= 0);		assert(End >= Address);
Size = End - Address;		Size = End - Address;
memcpy(Buf, &Bytes[Address + BytesSkipped], Size);		memcpy(Buf, &Bytes[Address + BytesSkipped], Size);
return Size;		return Size;
}		}
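
The bounds logic on the right-hand side of readBytes above can be read in isolation as a pure function. The following is a sketch under assumptions, not code from the patch: clampedReadSize is a hypothetical helper, and BytesRead and ObjectSize are passed as parameters instead of being read from StreamingMemoryObject members. An ObjectSize of 0 is taken to mean "size not yet known", as the `ObjectSize &&` test in the diff suggests.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper mirroring the revised clamping in readBytes.
// Returns how many bytes a read of Size bytes at Address would copy.
std::uint64_t clampedReadSize(std::uint64_t Address, std::uint64_t Size,
                              std::uint64_t BytesRead,
                              std::uint64_t ObjectSize) {
  // Nothing to copy if the read starts past the fetched bytes, or past
  // the object's end when its size is known.
  if (Address >= BytesRead || (ObjectSize && Address >= ObjectSize))
    return 0;

  std::uint64_t End = Address + Size;
  if (ObjectSize) {
    // Known size: never read past the end of the object.
    if (End > ObjectSize)
      End = ObjectSize;
  } else if (End > BytesRead) {
    // Unknown size: never read past what has been fetched so far.
    End = BytesRead;
  }
  assert(End >= Address);
  return End - Address;
}
```

Note that, as written in the diff, a known ObjectSize takes precedence over BytesRead when clamping End; the sketch reproduces that behaviour without judging it.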

bool StreamingMemoryObject::dropLeadingBytes(size_t s) {		bool StreamingMemoryObject::dropLeadingBytes(size_t s) {
if (BytesRead < s) return true;		if (BytesRead < s) return true;
BytesSkipped = s;		BytesSkipped = s;
BytesRead -= s;		BytesRead -= s;
return false;		return false;
}		}

void StreamingMemoryObject::setKnownObjectSize(size_t size) {		void StreamingMemoryObject::setKnownObjectSize(size_t size) {
ObjectSize = size;		ObjectSize = size;
Bytes.reserve(size);		Bytes.reserve(size);
}		}

MemoryObject *getNonStreamedMemoryObject(const unsigned char *Start,		MemoryObject *getNonStreamedMemoryObject(const unsigned char *Start,
const unsigned char *End) {		const unsigned char *End) {
return new RawMemoryObject(Start, End);		return new RawMemoryObject(Start, End);
}		}

StreamingMemoryObject::StreamingMemoryObject(DataStreamer *streamer) :		StreamingMemoryObject::StreamingMemoryObject(DataStreamer *streamer) :
Bytes(kChunkSize), Streamer(streamer), BytesRead(0), BytesSkipped(0),		Bytes(kChunkSize), Streamer(streamer), BytesRead(0), BytesSkipped(0),
ObjectSize(0), EOFReached(false) {		ObjectSize(0), EOOReached(false) {
BytesRead = streamer->GetBytes(&Bytes[0], kChunkSize);		BytesRead = streamer->GetBytes(&Bytes[0], kChunkSize);
}		}
}		}