This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
cfe/trunk/lib/
-
trunk/
-
lib/
-
CodeGen/
-
ObjectFilePCHContainerOperations.cpp
-
Frontend/
-
PCHContainerOperations.cpp
-
SerializedDiagnosticReader.cpp
-
Serialization/
-
ASTReader.cpp
-
GlobalModuleIndex.cpp
-
llvm/trunk/
-
trunk/
-
include/llvm/Bitcode/
-
llvm/
-
Bitcode/
2
BitstreamReader.h
-
ReaderWriter.h
-
lib/Bitcode/Reader/
-
Bitcode/
-
Reader/
-
BitcodeReader.cpp
-
test/Bitcode/Inputs/
-
Bitcode/
-
Inputs/
-
invalid-array-operand-encoding.bc
-
invalid-code-len-width.bc
-
invalid-extractval-array-idx.bc
-
invalid-function-comdat-id.bc
-
invalid-fwdref-type-mismatch-2.bc
-
invalid-metadata-not-followed-named-node.bc
-
invalid-name-with-0-byte.bc
-
invalid-unexpected-eof.bc
-
tools/
-
llvm-bcanalyzer/
-
llvm-bcanalyzer.cpp
-
llvm-dis/
-
llvm-dis.cpp
-
unittests/Bitcode/
-
Bitcode/
-
BitReaderTest.cpp
-
BitstreamReaderTest.cpp

Differential D26219

Bitcode: Change reader interface to take memory buffers.
ClosedPublic

Authored by pcc on Nov 1 2016, 3:53 PM.

Download Raw Diff

Details

Reviewers

• rafael
mehdi_amini
dexonsmith

Commits

rG028eb5a3f823: Bitcode: Change reader interface to take memory buffers.
rC285773: Bitcode: Change reader interface to take memory buffers.
rL285773: Bitcode: Change reader interface to take memory buffers.

Summary

As proposed on llvm-dev:
http://lists.llvm.org/pipermail/llvm-dev/2016-October/106595.html

This change also fixes an API oddity where BitstreamCursor::Read() would
return zero for the first read past the end of the bitstream, but would
report_fatal_error for subsequent reads. Now we always report_fatal_error
for all reads past the end. Updated clients to check for the end of the
bitstream before reading from it.

I also needed to add padding to the invalid bitcode tests in
test/Bitcode/. This is because the streaming interface was not checking that
the file size is a multiple of 4.

Diff Detail

Repository: rL LLVM

Event Timeline

pcc updated this revision to Diff 76645.Nov 1 2016, 3:53 PM

pcc retitled this revision from to Bitcode, Support: Remove MemoryObject and DataStreamer interfaces; bitcode reader now takes memory buffers..

pcc updated this object.

pcc added reviewers: • rafael, mehdi_amini, dexonsmith.

pcc added subscribers: llvm-commits, jordan_rose.

Herald added subscribers: modocache, mgorny. · View Herald TranscriptNov 1 2016, 3:53 PM

Sounds great to me, but it would be kind of nice to do it in two commits: one to add the new API and one to remove the old one.

Split out API removal to D26222

pcc retitled this revision from Bitcode, Support: Remove MemoryObject and DataStreamer interfaces; bitcode reader now takes memory buffers. to Bitcode: Change reader interface to take memory buffers..Nov 1 2016, 4:23 PM

llvm/include/llvm/Bitcode/BitstreamReader.h
63 ↗	(On Diff #76645)	`= default;` ?
69 ↗	(On Diff #76645)	You can forward to the next ctor: BitstreamReader(MemoryBufferRef BitcodeBytes) : BitstreamReader(BitcodeBytes.getBuffer()) {}
llvm/lib/Bitcode/Reader/BitcodeReader.cpp
235 ↗	(On Diff #76645)	Is this related to this commit?
402 ↗	(On Diff #76645)	Is this related to this change or can you remove this as a pre-commit?

mehdi_amini added inline comments.Nov 1 2016, 4:25 PM

llvm/include/llvm/Bitcode/BitstreamReader.h
63 ↗	(On Diff #76651)	The `= default()` was supposed to be here (commented while you updated the diff)

Use = default and delegating ctor

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
235 ↗	(On Diff #76645)	Yes, this ctor is currently needed elsewhere (marked below).
402 ↗	(On Diff #76645)	Same here.
6691 ↗	(On Diff #76651)	This is where we currently use the ctor.

Alright, LGTM!

This revision is now accepted and ready to land.Nov 1 2016, 5:05 PM

Closed by commit rL285773: Bitcode: Change reader interface to take memory buffers. (authored by pcc). · Explain WhyNov 1 2016, 5:18 PM

This revision was automatically updated to reflect the committed changes.

Alright, LGTM!

llvm/trunk/include/llvm/Bitcode/BitstreamReader.h
435	There is a semantic change here. Previously, when at the end of the stream after calling `Stream.advance(BitstreamCursor::AF_DontPopBlockAtEnd);` we would call `ReadCode()` which would return `0` which is also `bitc::END_BLOCK`, and we would enter the condition below and `return BitstreamEntry::getEndBlock();`. Now we return an error instead. This is breaking the loop pattern in `BitcodeReader::parseIdentificationBlock`, which is conceptually the following: // We expect a number of well-defined blocks, though we don't necessarily // need to understand them all. while (1) { BitstreamEntry Entry = Stream.advance(BitstreamCursor::AF_DontPopBlockAtEnd); // Ignore other sub-blocks. if (Stream.SkipBlock()) return error("Malformed block"); }

pcc added inline comments.Nov 9 2016, 2:26 PM

llvm/trunk/include/llvm/Bitcode/BitstreamReader.h
435	As discussed on IRC, let's add a `BitstreamEntry::EOF` enum and change the clients to handle it.

Revision Contents

Path

Size

cfe/

trunk/

lib/

CodeGen/

ObjectFilePCHContainerOperations.cpp

7 lines

Frontend/

PCHContainerOperations.cpp

3 lines

SerializedDiagnosticReader.cpp

5 lines

Serialization/

ASTReader.cpp

3 lines

GlobalModuleIndex.cpp

3 lines

llvm/

trunk/

include/

llvm/

Bitcode/

BitstreamReader.h

109 lines

ReaderWriter.h

8 lines

lib/

Bitcode/

Reader/

BitcodeReader.cpp

104 lines

test/

Bitcode/

Inputs/

invalid-array-operand-encoding.bc

invalid-code-len-width.bc

invalid-extractval-array-idx.bc

invalid-function-comdat-id.bc

invalid-fwdref-type-mismatch-2.bc

invalid-metadata-not-followed-named-node.bc

invalid-name-with-0-byte.bc

invalid-unexpected-eof.bc

2 lines

tools/

llvm-bcanalyzer/

llvm-bcanalyzer.cpp

8 lines

llvm-dis/

llvm-dis.cpp

39 lines

unittests/

Bitcode/

BitReaderTest.cpp

81 lines

BitstreamReaderTest.cpp

100 lines

Diff 76660

cfe/trunk/lib/CodeGen/ObjectFilePCHContainerOperations.cpp

Show First 20 Lines • Show All 319 Lines • ▼ Show 20 Lines	if (OFOrErr) {
bool IsCOFF = isa<llvm::object::COFFObjectFile>(*OF);		bool IsCOFF = isa<llvm::object::COFFObjectFile>(*OF);
// Find the clang AST section in the container.		// Find the clang AST section in the container.
for (auto &Section : OF->sections()) {		for (auto &Section : OF->sections()) {
StringRef Name;		StringRef Name;
Section.getName(Name);		Section.getName(Name);
if ((!IsCOFF && Name == "__clangast") \|\| (IsCOFF && Name == "clangast")) {		if ((!IsCOFF && Name == "__clangast") \|\| (IsCOFF && Name == "clangast")) {
StringRef Buf;		StringRef Buf;
Section.getContents(Buf);		Section.getContents(Buf);
return StreamFile.init((const unsigned char *)Buf.begin(),		StreamFile = llvm::BitstreamReader(Buf);
(const unsigned char *)Buf.end());		return;
}		}
}		}
}		}
handleAllErrors(OFOrErr.takeError(), [&](const llvm::ErrorInfoBase &EIB) {		handleAllErrors(OFOrErr.takeError(), [&](const llvm::ErrorInfoBase &EIB) {
if (EIB.convertToErrorCode() ==		if (EIB.convertToErrorCode() ==
llvm::object::object_error::invalid_file_type)		llvm::object::object_error::invalid_file_type)
// As a fallback, treat the buffer as a raw AST.		// As a fallback, treat the buffer as a raw AST.
StreamFile.init((const unsigned char *)Buffer.getBufferStart(),		StreamFile = llvm::BitstreamReader(Buffer);
(const unsigned char *)Buffer.getBufferEnd());
else		else
EIB.log(llvm::errs());		EIB.log(llvm::errs());
});		});
}		}

cfe/trunk/lib/Frontend/PCHContainerOperations.cpp

Show First 20 Lines • Show All 54 Lines • ▼ Show 20 Lines	std::unique_ptr<ASTConsumer> RawPCHContainerWriter::CreatePCHContainerGenerator(
CompilerInstance &CI, const std::string &MainFileName,		CompilerInstance &CI, const std::string &MainFileName,
const std::string &OutputFileName, std::unique_ptr<llvm::raw_pwrite_stream> OS,		const std::string &OutputFileName, std::unique_ptr<llvm::raw_pwrite_stream> OS,
std::shared_ptr<PCHBuffer> Buffer) const {		std::shared_ptr<PCHBuffer> Buffer) const {
return llvm::make_unique<RawPCHContainerGenerator>(std::move(OS), Buffer);		return llvm::make_unique<RawPCHContainerGenerator>(std::move(OS), Buffer);
}		}

void RawPCHContainerReader::ExtractPCH(		void RawPCHContainerReader::ExtractPCH(
llvm::MemoryBufferRef Buffer, llvm::BitstreamReader &StreamFile) const {		llvm::MemoryBufferRef Buffer, llvm::BitstreamReader &StreamFile) const {
StreamFile.init((const unsigned char *)Buffer.getBufferStart(),		StreamFile = llvm::BitstreamReader(Buffer);
(const unsigned char *)Buffer.getBufferEnd());
}		}

PCHContainerOperations::PCHContainerOperations() {		PCHContainerOperations::PCHContainerOperations() {
registerWriter(llvm::make_unique<RawPCHContainerWriter>());		registerWriter(llvm::make_unique<RawPCHContainerWriter>());
registerReader(llvm::make_unique<RawPCHContainerReader>());		registerReader(llvm::make_unique<RawPCHContainerReader>());
}		}

cfe/trunk/lib/Frontend/SerializedDiagnosticReader.cpp

Show All 18 Lines	std::error_code SerializedDiagnosticReader::readDiagnostics(StringRef File) {
// Open the diagnostics file.		// Open the diagnostics file.
FileSystemOptions FO;		FileSystemOptions FO;
FileManager FileMgr(FO);		FileManager FileMgr(FO);

auto Buffer = FileMgr.getBufferForFile(File);		auto Buffer = FileMgr.getBufferForFile(File);
if (!Buffer)		if (!Buffer)
return SDError::CouldNotLoad;		return SDError::CouldNotLoad;

llvm::BitstreamReader StreamFile;		llvm::BitstreamReader StreamFile(**Buffer);
StreamFile.init((const unsigned char )(Buffer)->getBufferStart(),
(const unsigned char )(Buffer)->getBufferEnd());

llvm::BitstreamCursor Stream(StreamFile);		llvm::BitstreamCursor Stream(StreamFile);

// Sniff for the signature.		// Sniff for the signature.
if (Stream.Read(8) != 'D' \|\|		if (Stream.Read(8) != 'D' \|\|
Stream.Read(8) != 'I' \|\|		Stream.Read(8) != 'I' \|\|
Stream.Read(8) != 'A' \|\|		Stream.Read(8) != 'A' \|\|
Stream.Read(8) != 'G')		Stream.Read(8) != 'G')
return SDError::InvalidSignature;		return SDError::InvalidSignature;
▲ Show 20 Lines • Show All 256 Lines • Show Last 20 Lines

cfe/trunk/lib/Serialization/ASTReader.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 3,796 Lines • ▼ Show 20 Lines	ASTReader::ASTReadResult ASTReader::ReadAST(StringRef FileName,

return Success;		return Success;
}		}

static ASTFileSignature readASTFileSignature(llvm::BitstreamReader &StreamFile);		static ASTFileSignature readASTFileSignature(llvm::BitstreamReader &StreamFile);

/// \brief Whether \p Stream starts with the AST/PCH file magic number 'CPCH'.		/// \brief Whether \p Stream starts with the AST/PCH file magic number 'CPCH'.
static bool startsWithASTFileMagic(BitstreamCursor &Stream) {		static bool startsWithASTFileMagic(BitstreamCursor &Stream) {
return Stream.Read(8) == 'C' &&		return Stream.canSkipToPos(4) &&
		Stream.Read(8) == 'C' &&
Stream.Read(8) == 'P' &&		Stream.Read(8) == 'P' &&
Stream.Read(8) == 'C' &&		Stream.Read(8) == 'C' &&
Stream.Read(8) == 'H';		Stream.Read(8) == 'H';
}		}

static unsigned moduleKindForDiagnostic(ModuleKind Kind) {		static unsigned moduleKindForDiagnostic(ModuleKind Kind) {
switch (Kind) {		switch (Kind) {
case MK_PCH:		case MK_PCH:
▲ Show 20 Lines • Show All 5,128 Lines • Show Last 20 Lines

cfe/trunk/lib/Serialization/GlobalModuleIndex.cpp

Show First 20 Lines • Show All 240 Lines • ▼ Show 20 Lines	GlobalModuleIndex::readIndex(StringRef Path) {

llvm::ErrorOr<std::unique_ptr<llvm::MemoryBuffer>> BufferOrErr =		llvm::ErrorOr<std::unique_ptr<llvm::MemoryBuffer>> BufferOrErr =
llvm::MemoryBuffer::getFile(IndexPath.c_str());		llvm::MemoryBuffer::getFile(IndexPath.c_str());
if (!BufferOrErr)		if (!BufferOrErr)
return std::make_pair(nullptr, EC_NotFound);		return std::make_pair(nullptr, EC_NotFound);
std::unique_ptr<llvm::MemoryBuffer> Buffer = std::move(BufferOrErr.get());		std::unique_ptr<llvm::MemoryBuffer> Buffer = std::move(BufferOrErr.get());

/// \brief The bitstream reader from which we'll read the AST file.		/// \brief The bitstream reader from which we'll read the AST file.
llvm::BitstreamReader Reader((const unsigned char *)Buffer->getBufferStart(),		llvm::BitstreamReader Reader(*Buffer);
(const unsigned char *)Buffer->getBufferEnd());

/// \brief The main bitstream cursor for the main block.		/// \brief The main bitstream cursor for the main block.
llvm::BitstreamCursor Cursor(Reader);		llvm::BitstreamCursor Cursor(Reader);

// Sniff for the signature.		// Sniff for the signature.
if (Cursor.Read(8) != 'B' \|\|		if (Cursor.Read(8) != 'B' \|\|
Cursor.Read(8) != 'C' \|\|		Cursor.Read(8) != 'C' \|\|
Cursor.Read(8) != 'G' \|\|		Cursor.Read(8) != 'G' \|\|
▲ Show 20 Lines • Show All 631 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/Bitcode/BitstreamReader.h

Show All 9 Lines
// This header defines the BitstreamReader class. This class can be used to		// This header defines the BitstreamReader class. This class can be used to
// read an arbitrary bitstream, regardless of its contents.		// read an arbitrary bitstream, regardless of its contents.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_BITCODE_BITSTREAMREADER_H		#ifndef LLVM_BITCODE_BITSTREAMREADER_H
#define LLVM_BITCODE_BITSTREAMREADER_H		#define LLVM_BITCODE_BITSTREAMREADER_H

		#include "llvm/ADT/ArrayRef.h"
#include "llvm/ADT/IntrusiveRefCntPtr.h"		#include "llvm/ADT/IntrusiveRefCntPtr.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/Bitcode/BitCodes.h"		#include "llvm/Bitcode/BitCodes.h"
#include "llvm/Support/Endian.h"		#include "llvm/Support/Endian.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/MathExtras.h"		#include "llvm/Support/MathExtras.h"
#include "llvm/Support/StreamingMemoryObject.h"		#include "llvm/Support/MemoryBuffer.h"
#include <algorithm>		#include <algorithm>
#include <cassert>		#include <cassert>
#include <climits>		#include <climits>
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
#include <memory>		#include <memory>
#include <string>		#include <string>
#include <utility>		#include <utility>
Show All 12 Lines	public:
struct BlockInfo {		struct BlockInfo {
unsigned BlockID;		unsigned BlockID;
std::vector<IntrusiveRefCntPtr<BitCodeAbbrev>> Abbrevs;		std::vector<IntrusiveRefCntPtr<BitCodeAbbrev>> Abbrevs;
std::string Name;		std::string Name;
std::vector<std::pair<unsigned, std::string> > RecordNames;		std::vector<std::pair<unsigned, std::string> > RecordNames;
};		};

private:		private:
std::unique_ptr<MemoryObject> BitcodeBytes;		ArrayRef<uint8_t> BitcodeBytes;

std::vector<BlockInfo> BlockInfoRecords;		std::vector<BlockInfo> BlockInfoRecords;

/// This is set to true if we don't care about the block/record name		/// This is set to true if we don't care about the block/record name
/// information in the BlockInfo block. Only llvm-bcanalyzer uses this.		/// information in the BlockInfo block. Only llvm-bcanalyzer uses this.
bool IgnoreBlockInfoNames;		bool IgnoreBlockInfoNames = true;

public:		public:
BitstreamReader() : IgnoreBlockInfoNames(true) {		BitstreamReader() = default;
}		BitstreamReader(ArrayRef<uint8_t> BitcodeBytes)
		: BitcodeBytes(BitcodeBytes) {}
BitstreamReader(const unsigned char Start, const unsigned char End)		BitstreamReader(StringRef BitcodeBytes)
: IgnoreBlockInfoNames(true) {		: BitcodeBytes(reinterpret_cast<const uint8_t *>(BitcodeBytes.data()),
init(Start, End);		BitcodeBytes.size()) {}
}		BitstreamReader(MemoryBufferRef BitcodeBytes)
		: BitstreamReader(BitcodeBytes.getBuffer()) {}

BitstreamReader(std::unique_ptr<MemoryObject> BitcodeBytes)		ArrayRef<uint8_t> getBitcodeBytes() { return BitcodeBytes; }
: BitcodeBytes(std::move(BitcodeBytes)), IgnoreBlockInfoNames(true) {}

void init(const unsigned char Start, const unsigned char End) {
assert(((End-Start) & 3) == 0 &&"Bitcode stream not a multiple of 4 bytes");
BitcodeBytes.reset(getNonStreamedMemoryObject(Start, End));
}

MemoryObject &getBitcodeBytes() { return *BitcodeBytes; }

/// This is called by clients that want block/record name information.		/// This is called by clients that want block/record name information.
void CollectBlockInfoNames() { IgnoreBlockInfoNames = false; }		void CollectBlockInfoNames() { IgnoreBlockInfoNames = false; }
bool isIgnoringBlockInfoNames() { return IgnoreBlockInfoNames; }		bool isIgnoringBlockInfoNames() { return IgnoreBlockInfoNames; }

//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
// Block Manipulation		// Block Manipulation
//===--------------------------------------------------------------------===//		//===--------------------------------------------------------------------===//
Show All 39 Lines

/// This represents a position within a bitstream. There may be multiple		/// This represents a position within a bitstream. There may be multiple
/// independent cursors reading within one bitstream, each maintaining their		/// independent cursors reading within one bitstream, each maintaining their
/// own local state.		/// own local state.
class SimpleBitstreamCursor {		class SimpleBitstreamCursor {
BitstreamReader *R = nullptr;		BitstreamReader *R = nullptr;
size_t NextChar = 0;		size_t NextChar = 0;

// The size of the bicode. 0 if we don't know it yet.
size_t Size = 0;

public:		public:
/// This is the current data we have pulled from the stream but have not		/// This is the current data we have pulled from the stream but have not
/// returned to the client. This is specifically and intentionally defined to		/// returned to the client. This is specifically and intentionally defined to
/// follow the word size of the host machine for efficiency. We use word_t in		/// follow the word size of the host machine for efficiency. We use word_t in
/// places that are aware of this to make it perfectly explicit what is going		/// places that are aware of this to make it perfectly explicit what is going
/// on.		/// on.
typedef size_t word_t;		typedef size_t word_t;

Show All 9 Lines	public:

SimpleBitstreamCursor() = default;		SimpleBitstreamCursor() = default;

explicit SimpleBitstreamCursor(BitstreamReader &R) : R(&R) {}		explicit SimpleBitstreamCursor(BitstreamReader &R) : R(&R) {}
explicit SimpleBitstreamCursor(BitstreamReader *R) : R(R) {}		explicit SimpleBitstreamCursor(BitstreamReader *R) : R(R) {}

bool canSkipToPos(size_t pos) const {		bool canSkipToPos(size_t pos) const {
// pos can be skipped to if it is a valid address or one byte past the end.		// pos can be skipped to if it is a valid address or one byte past the end.
return pos == 0 \|\|		return pos <= R->getBitcodeBytes().size();
R->getBitcodeBytes().isValidAddress(static_cast<uint64_t>(pos - 1));
}		}

bool AtEndOfStream() {		bool AtEndOfStream() {
if (BitsInCurWord != 0)		return BitsInCurWord == 0 && R->getBitcodeBytes().size() <= NextChar;
return false;
if (Size != 0)
return Size <= NextChar;
fillCurWord();
return BitsInCurWord == 0;
}		}

/// Return the bit # of the bit we are reading.		/// Return the bit # of the bit we are reading.
uint64_t GetCurrentBitNo() const {		uint64_t GetCurrentBitNo() const {
return NextChar*CHAR_BIT - BitsInCurWord;		return NextChar*CHAR_BIT - BitsInCurWord;
}		}

// Return the byte # of the current bit.		// Return the byte # of the current bit.
Show All 32 Lines	assert((intptr_t)getPointerToByte(getCurrentByteNo(), 1) ==
"Expected to reach pointer");		"Expected to reach pointer");
}		}
void jumpToPointer(const char *Pointer) {		void jumpToPointer(const char *Pointer) {
jumpToPointer((const uint8_t *)Pointer);		jumpToPointer((const uint8_t *)Pointer);
}		}

/// Get a pointer into the bitstream at the specified byte offset.		/// Get a pointer into the bitstream at the specified byte offset.
const uint8_t *getPointerToByte(uint64_t ByteNo, uint64_t NumBytes) {		const uint8_t *getPointerToByte(uint64_t ByteNo, uint64_t NumBytes) {
return R->getBitcodeBytes().getPointer(ByteNo, NumBytes);		return R->getBitcodeBytes().data() + ByteNo;
}		}

/// Get a pointer into the bitstream at the specified bit offset.		/// Get a pointer into the bitstream at the specified bit offset.
///		///
/// The bit offset must be on a byte boundary.		/// The bit offset must be on a byte boundary.
const uint8_t *getPointerToBit(uint64_t BitNo, uint64_t NumBytes) {		const uint8_t *getPointerToBit(uint64_t BitNo, uint64_t NumBytes) {
assert(!(BitNo % 8) && "Expected bit on byte boundary");		assert(!(BitNo % 8) && "Expected bit on byte boundary");
return getPointerToByte(BitNo / 8, NumBytes);		return getPointerToByte(BitNo / 8, NumBytes);
}		}

void fillCurWord() {		void fillCurWord() {
if (Size != 0 && NextChar >= Size)		ArrayRef<uint8_t> Buf = R->getBitcodeBytes();
		if (NextChar >= Buf.size())
report_fatal_error("Unexpected end of file");		report_fatal_error("Unexpected end of file");

// Read the next word from the stream.		// Read the next word from the stream.
uint8_t Array[sizeof(word_t)] = {0};		const uint8_t *NextCharPtr = Buf.data() + NextChar;
		unsigned BytesRead;
uint64_t BytesRead =		if (Buf.size() >= NextChar + sizeof(word_t)) {
R->getBitcodeBytes().readBytes(Array, sizeof(Array), NextChar);		BytesRead = sizeof(word_t);

// If we run out of data, stop at the end of the stream.
if (BytesRead == 0) {
CurWord = 0;
BitsInCurWord = 0;
Size = NextChar;
return;
}

CurWord =		CurWord =
support::endian::read<word_t, support::little, support::unaligned>(		support::endian::read<word_t, support::little, support::unaligned>(
Array);		NextCharPtr);
		} else {
		// Short read.
		BytesRead = Buf.size() - NextChar;
		CurWord = 0;
		for (unsigned B = 0; B != BytesRead; ++B)
		CurWord \|= NextCharPtr[B] << (B * 8);
		}
NextChar += BytesRead;		NextChar += BytesRead;
BitsInCurWord = BytesRead * 8;		BitsInCurWord = BytesRead * 8;
}		}

word_t Read(unsigned NumBits) {		word_t Read(unsigned NumBits) {
static const unsigned BitsInWord = MaxChunkSize;		static const unsigned BitsInWord = MaxChunkSize;

assert(NumBits && NumBits <= BitsInWord &&		assert(NumBits && NumBits <= BitsInWord &&
Show All 12 Lines	if (BitsInCurWord >= NumBits) {
return R;		return R;
}		}

word_t R = BitsInCurWord ? CurWord : 0;		word_t R = BitsInCurWord ? CurWord : 0;
unsigned BitsLeft = NumBits - BitsInCurWord;		unsigned BitsLeft = NumBits - BitsInCurWord;

fillCurWord();		fillCurWord();

// If we run out of data, stop at the end of the stream.		// If we run out of data, abort.
if (BitsLeft > BitsInCurWord)		if (BitsLeft > BitsInCurWord)
return 0;		report_fatal_error("Unexpected end of file");

word_t R2 = CurWord & (~word_t(0) >> (BitsInWord - BitsLeft));		word_t R2 = CurWord & (~word_t(0) >> (BitsInWord - BitsLeft));

// Use a mask to avoid undefined behavior.		// Use a mask to avoid undefined behavior.
CurWord >>= (BitsLeft & Mask);		CurWord >>= (BitsLeft & Mask);

BitsInCurWord -= BitsLeft;		BitsInCurWord -= BitsLeft;

▲ Show 20 Lines • Show All 49 Lines • ▼ Show 20 Lines	if (sizeof(word_t) > 4 &&
BitsInCurWord = 32;		BitsInCurWord = 32;
return;		return;
}		}

BitsInCurWord = 0;		BitsInCurWord = 0;
}		}

/// Skip to the end of the file.		/// Skip to the end of the file.
void skipToEnd() { NextChar = R->getBitcodeBytes().getExtent(); }		void skipToEnd() { NextChar = R->getBitcodeBytes().size(); }

/// Prevent the cursor from reading past a byte boundary.
///
/// Prevent the cursor from requesting byte reads past \c Limit. This is
/// useful when working with a cursor on a StreamingMemoryObject, when it's
/// desirable to avoid invalidating the result of getPointerToByte().
///
/// If \c Limit is on a word boundary, AtEndOfStream() will return true if
/// the cursor position reaches or exceeds \c Limit, regardless of the true
/// number of available bytes. Otherwise, AtEndOfStream() returns true when
/// it reaches or exceeds the next word boundary.
void setArtificialByteLimit(uint64_t Limit) {
assert(getCurrentByteNo() < Limit && "Move cursor before lowering limit");

// Round to word boundary.
Limit = alignTo(Limit, sizeof(word_t));

// Only change size if the new one is lower.
if (!Size \|\| Size > Limit)
Size = Limit;
}

/// Return the Size, if known.
uint64_t getSizeIfKnown() const { return Size; }
};		};

/// When advancing through a bitstream cursor, each advance can discover a few		/// When advancing through a bitstream cursor, each advance can discover a few
/// different kinds of entries:		/// different kinds of entries:
struct BitstreamEntry {		struct BitstreamEntry {
enum {		enum {
Error, // Malformed bitcode was found.		Error, // Malformed bitcode was found.
EndBlock, // We've reached the end of the current block, (or the end of the		EndBlock, // We've reached the end of the current block, (or the end of the
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	enum {
/// If this flag is used, abbrev entries are returned just like normal		/// If this flag is used, abbrev entries are returned just like normal
/// records.		/// records.
AF_DontAutoprocessAbbrevs = 2		AF_DontAutoprocessAbbrevs = 2
};		};

/// Advance the current bitstream, returning the next entry in the stream.		/// Advance the current bitstream, returning the next entry in the stream.
BitstreamEntry advance(unsigned Flags = 0) {		BitstreamEntry advance(unsigned Flags = 0) {
while (true) {		while (true) {
		if (AtEndOfStream())
		return BitstreamEntry::getError();

		mehdi_aminiUnsubmitted Not Done Reply Inline Actions There is a semantic change here. Previously, when at the end of the stream after calling `Stream.advance(BitstreamCursor::AF_DontPopBlockAtEnd);` we would call `ReadCode()` which would return `0` which is also `bitc::END_BLOCK`, and we would enter the condition below and `return BitstreamEntry::getEndBlock();`. Now we return an error instead. This is breaking the loop pattern in `BitcodeReader::parseIdentificationBlock`, which is conceptually the following: // We expect a number of well-defined blocks, though we don't necessarily // need to understand them all. while (1) { BitstreamEntry Entry = Stream.advance(BitstreamCursor::AF_DontPopBlockAtEnd); // Ignore other sub-blocks. if (Stream.SkipBlock()) return error("Malformed block"); } mehdi_amini: There is a semantic change here. Previously, when at the end of the stream after calling…
		pccAuthorUnsubmitted Not Done Reply Inline Actions As discussed on IRC, let's add a `BitstreamEntry::EOF` enum and change the clients to handle it. pcc: As discussed on IRC, let's add a `BitstreamEntry::EOF` enum and change the clients to handle it.
unsigned Code = ReadCode();		unsigned Code = ReadCode();
if (Code == bitc::END_BLOCK) {		if (Code == bitc::END_BLOCK) {
// Pop the end of the block unless Flags tells us not to.		// Pop the end of the block unless Flags tells us not to.
if (!(Flags & AF_DontPopBlockAtEnd) && ReadBlockEnd())		if (!(Flags & AF_DontPopBlockAtEnd) && ReadBlockEnd())
return BitstreamEntry::getError();		return BitstreamEntry::getError();
return BitstreamEntry::getEndBlock();		return BitstreamEntry::getEndBlock();
}		}

▲ Show 20 Lines • Show All 114 Lines • Show Last 20 Lines

llvm/trunk/include/llvm/Bitcode/ReaderWriter.h

	Show All 18 Lines
	#include "llvm/Support/Endian.h"			#include "llvm/Support/Endian.h"
	#include "llvm/Support/ErrorOr.h"			#include "llvm/Support/ErrorOr.h"
	#include "llvm/Support/MemoryBuffer.h"			#include "llvm/Support/MemoryBuffer.h"
	#include <memory>			#include <memory>
	#include <string>			#include <string>

	namespace llvm {			namespace llvm {
	class BitstreamWriter;			class BitstreamWriter;
	class DataStreamer;
	class LLVMContext;			class LLVMContext;
	class Module;			class Module;
	class ModulePass;			class ModulePass;
	class raw_ostream;			class raw_ostream;

	/// Offsets of the 32-bit fields of bitcode wrapper header.			/// Offsets of the 32-bit fields of bitcode wrapper header.
	static const unsigned BWH_MagicField = 0*4;			static const unsigned BWH_MagicField = 0*4;
	static const unsigned BWH_VersionField = 1*4;			static const unsigned BWH_VersionField = 1*4;
	static const unsigned BWH_OffsetField = 2*4;			static const unsigned BWH_OffsetField = 2*4;
	static const unsigned BWH_SizeField = 3*4;			static const unsigned BWH_SizeField = 3*4;
	static const unsigned BWH_CPUTypeField = 4*4;			static const unsigned BWH_CPUTypeField = 4*4;
	static const unsigned BWH_HeaderSize = 5*4;			static const unsigned BWH_HeaderSize = 5*4;

	/// Read the header of the specified bitcode buffer and prepare for lazy			/// Read the header of the specified bitcode buffer and prepare for lazy
	/// deserialization of function bodies. If ShouldLazyLoadMetadata is true,			/// deserialization of function bodies. If ShouldLazyLoadMetadata is true,
	/// lazily load metadata as well. If successful, this moves Buffer. On			/// lazily load metadata as well. If successful, this moves Buffer. On
	/// error, this does not move Buffer.			/// error, this does not move Buffer.
	ErrorOr<std::unique_ptr<Module>>			ErrorOr<std::unique_ptr<Module>>
	getLazyBitcodeModule(std::unique_ptr<MemoryBuffer> &&Buffer,			getLazyBitcodeModule(std::unique_ptr<MemoryBuffer> &&Buffer,
	LLVMContext &Context,			LLVMContext &Context,
	bool ShouldLazyLoadMetadata = false);			bool ShouldLazyLoadMetadata = false);

	/// Read the header of the specified stream and prepare for lazy
	/// deserialization and streaming of function bodies.
	ErrorOr<std::unique_ptr<Module>>
	getStreamedBitcodeModule(StringRef Name,
	std::unique_ptr<DataStreamer> Streamer,
	LLVMContext &Context);

	/// Read the header of the specified bitcode buffer and extract just the			/// Read the header of the specified bitcode buffer and extract just the
	/// triple information. If successful, this returns a string. On error, this			/// triple information. If successful, this returns a string. On error, this
	/// returns "".			/// returns "".
	std::string getBitcodeTargetTriple(MemoryBufferRef Buffer,			std::string getBitcodeTargetTriple(MemoryBufferRef Buffer,
	LLVMContext &Context);			LLVMContext &Context);

	/// Return true if \p Buffer contains a bitcode file with ObjC code (category			/// Return true if \p Buffer contains a bitcode file with ObjC code (category
	/// or class) in it.			/// or class) in it.
	▲ Show 20 Lines • Show All 151 Lines • Show Last 20 Lines

llvm/trunk/lib/Bitcode/Reader/BitcodeReader.cpp

Show First 20 Lines • Show All 55 Lines • ▼ Show 20 Lines
#include "llvm/IR/Operator.h"		#include "llvm/IR/Operator.h"
#include "llvm/IR/TrackingMDRef.h"		#include "llvm/IR/TrackingMDRef.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/IR/ValueHandle.h"		#include "llvm/IR/ValueHandle.h"
#include "llvm/Support/AtomicOrdering.h"		#include "llvm/Support/AtomicOrdering.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Compiler.h"		#include "llvm/Support/Compiler.h"
#include "llvm/Support/DataStream.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/ErrorHandling.h"		#include "llvm/Support/ErrorHandling.h"
#include "llvm/Support/ErrorOr.h"		#include "llvm/Support/ErrorOr.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include "llvm/Support/StreamingMemoryObject.h"
#include <algorithm>		#include <algorithm>
#include <cassert>		#include <cassert>
#include <cstddef>		#include <cstddef>
#include <cstdint>		#include <cstdint>
#include <deque>		#include <deque>
#include <limits>		#include <limits>
#include <map>		#include <map>
#include <memory>		#include <memory>
▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines	public:
Metadata upgradeTypeRefArray(Metadata MaybeTuple);		Metadata upgradeTypeRefArray(Metadata MaybeTuple);

private:		private:
Metadata resolveTypeRefArray(Metadata MaybeTuple);		Metadata resolveTypeRefArray(Metadata MaybeTuple);
};		};

class BitcodeReaderBase {		class BitcodeReaderBase {
protected:		protected:
BitcodeReaderBase() = default;
BitcodeReaderBase(MemoryBuffer *Buffer) : Buffer(Buffer) {}		BitcodeReaderBase(MemoryBuffer *Buffer) : Buffer(Buffer) {}

std::unique_ptr<MemoryBuffer> Buffer;		std::unique_ptr<MemoryBuffer> Buffer;
std::unique_ptr<BitstreamReader> StreamFile;		std::unique_ptr<BitstreamReader> StreamFile;
BitstreamCursor Stream;		BitstreamCursor Stream;

std::error_code initStream(std::unique_ptr<DataStreamer> Streamer);		std::error_code initStream();
std::error_code initStreamFromBuffer();
std::error_code initLazyStream(std::unique_ptr<DataStreamer> Streamer);

virtual std::error_code error(const Twine &Message) = 0;		virtual std::error_code error(const Twine &Message) = 0;
virtual ~BitcodeReaderBase() = default;		virtual ~BitcodeReaderBase() = default;
};		};

std::error_code		std::error_code BitcodeReaderBase::initStream() {
BitcodeReaderBase::initStream(std::unique_ptr<DataStreamer> Streamer) {
if (Streamer)
return initLazyStream(std::move(Streamer));
return initStreamFromBuffer();
}

std::error_code BitcodeReaderBase::initStreamFromBuffer() {
const unsigned char BufPtr = (const unsigned char)Buffer->getBufferStart();		const unsigned char BufPtr = (const unsigned char)Buffer->getBufferStart();
const unsigned char *BufEnd = BufPtr+Buffer->getBufferSize();		const unsigned char *BufEnd = BufPtr+Buffer->getBufferSize();

if (Buffer->getBufferSize() & 3)		if (Buffer->getBufferSize() & 3)
return error("Invalid bitcode signature");		return error("Invalid bitcode signature");

// If we have a wrapper header, parse it and ignore the non-bc file contents.		// If we have a wrapper header, parse it and ignore the non-bc file contents.
// The magic number is 0x0B17C0DE stored in little endian.		// The magic number is 0x0B17C0DE stored in little endian.
if (isBitcodeWrapper(BufPtr, BufEnd))		if (isBitcodeWrapper(BufPtr, BufEnd))
if (SkipBitcodeWrapperHeader(BufPtr, BufEnd, true))		if (SkipBitcodeWrapperHeader(BufPtr, BufEnd, true))
return error("Invalid bitcode wrapper header");		return error("Invalid bitcode wrapper header");

StreamFile.reset(new BitstreamReader(BufPtr, BufEnd));		StreamFile.reset(new BitstreamReader(ArrayRef<uint8_t>(BufPtr, BufEnd)));
Stream.init(&*StreamFile);		Stream.init(&*StreamFile);

return std::error_code();		return std::error_code();
}		}

std::error_code
BitcodeReaderBase::initLazyStream(std::unique_ptr<DataStreamer> Streamer) {
// Check and strip off the bitcode wrapper; BitstreamReader expects never to
// see it.
auto OwnedBytes =
llvm::make_unique<StreamingMemoryObject>(std::move(Streamer));
StreamingMemoryObject &Bytes = *OwnedBytes;
StreamFile = llvm::make_unique<BitstreamReader>(std::move(OwnedBytes));
Stream.init(&*StreamFile);

unsigned char buf[16];
if (Bytes.readBytes(buf, 16, 0) != 16)
return error("Invalid bitcode signature");

if (!isBitcode(buf, buf + 16))
return error("Invalid bitcode signature");

if (isBitcodeWrapper(buf, buf + 4)) {
const unsigned char *bitcodeStart = buf;
const unsigned char *bitcodeEnd = buf + 16;
SkipBitcodeWrapperHeader(bitcodeStart, bitcodeEnd, false);
Bytes.dropLeadingBytes(bitcodeStart - buf);
Bytes.setKnownObjectSize(bitcodeEnd - bitcodeStart);
}
return std::error_code();
}

class BitcodeReader : public BitcodeReaderBase, public GVMaterializer {		class BitcodeReader : public BitcodeReaderBase, public GVMaterializer {
LLVMContext &Context;		LLVMContext &Context;
Module *TheModule = nullptr;		Module *TheModule = nullptr;
// Next offset to start scanning for lazy parsing of function bodies.		// Next offset to start scanning for lazy parsing of function bodies.
uint64_t NextUnreadBit = 0;		uint64_t NextUnreadBit = 0;
// Last function offset found in the VST.		// Last function offset found in the VST.
uint64_t LastFunctionBlockBit = 0;		uint64_t LastFunctionBlockBit = 0;
bool SeenValueSymbolTable = false;		bool SeenValueSymbolTable = false;
▲ Show 20 Lines • Show All 83 Lines • ▼ Show 20 Lines	class BitcodeReader : public BitcodeReaderBase, public GVMaterializer {

std::vector<std::string> BundleTags;		std::vector<std::string> BundleTags;

public:		public:
std::error_code error(BitcodeError E, const Twine &Message);		std::error_code error(BitcodeError E, const Twine &Message);
std::error_code error(const Twine &Message) override;		std::error_code error(const Twine &Message) override;

BitcodeReader(MemoryBuffer *Buffer, LLVMContext &Context);		BitcodeReader(MemoryBuffer *Buffer, LLVMContext &Context);
BitcodeReader(LLVMContext &Context);
~BitcodeReader() override { freeState(); }		~BitcodeReader() override { freeState(); }

std::error_code materializeForwardReferencedFunctions();		std::error_code materializeForwardReferencedFunctions();

void freeState();		void freeState();

void releaseBuffer();		void releaseBuffer();

std::error_code materialize(GlobalValue *GV) override;		std::error_code materialize(GlobalValue *GV) override;
std::error_code materializeModule() override;		std::error_code materializeModule() override;
std::vector<StructType *> getIdentifiedStructTypes() const override;		std::vector<StructType *> getIdentifiedStructTypes() const override;

/// \brief Main interface to parsing a bitcode buffer.		/// \brief Main interface to parsing a bitcode buffer.
/// \returns true if an error occurred.		/// \returns true if an error occurred.
std::error_code parseBitcodeInto(std::unique_ptr<DataStreamer> Streamer,		std::error_code parseBitcodeInto(Module *M,
Module *M,
bool ShouldLazyLoadMetadata = false);		bool ShouldLazyLoadMetadata = false);

/// \brief Cheap mechanism to just extract module triple		/// \brief Cheap mechanism to just extract module triple
/// \returns true if an error occurred.		/// \returns true if an error occurred.
ErrorOr<std::string> parseTriple();		ErrorOr<std::string> parseTriple();

/// Cheap mechanism to just extract the identification block out of bitcode.		/// Cheap mechanism to just extract the identification block out of bitcode.
ErrorOr<std::string> parseIdentificationBlock();		ErrorOr<std::string> parseIdentificationBlock();
▲ Show 20 Lines • Show All 206 Lines • ▼ Show 20 Lines	public:

void releaseBuffer();		void releaseBuffer();

/// Check if the parser has encountered a summary section.		/// Check if the parser has encountered a summary section.
bool foundGlobalValSummary() { return SeenGlobalValSummary; }		bool foundGlobalValSummary() { return SeenGlobalValSummary; }

/// \brief Main interface to parsing a bitcode buffer.		/// \brief Main interface to parsing a bitcode buffer.
/// \returns true if an error occurred.		/// \returns true if an error occurred.
std::error_code parseSummaryIndexInto(std::unique_ptr<DataStreamer> Streamer,		std::error_code parseSummaryIndexInto(ModuleSummaryIndex *I);
ModuleSummaryIndex *I);

private:		private:
std::error_code parseModule();		std::error_code parseModule();
std::error_code parseValueSymbolTable(		std::error_code parseValueSymbolTable(
uint64_t Offset,		uint64_t Offset,
DenseMap<unsigned, GlobalValue::LinkageTypes> &ValueIdToLinkageMap);		DenseMap<unsigned, GlobalValue::LinkageTypes> &ValueIdToLinkageMap);
std::error_code parseEntireSummary();		std::error_code parseEntireSummary();
std::error_code parseModuleStringTable();		std::error_code parseModuleStringTable();
▲ Show 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	std::error_code BitcodeReader::error(const Twine &Message) {
return ::error(Context, make_error_code(BitcodeError::CorruptedBitcode),		return ::error(Context, make_error_code(BitcodeError::CorruptedBitcode),
Message);		Message);
}		}

BitcodeReader::BitcodeReader(MemoryBuffer *Buffer, LLVMContext &Context)		BitcodeReader::BitcodeReader(MemoryBuffer *Buffer, LLVMContext &Context)
: BitcodeReaderBase(Buffer), Context(Context), ValueList(Context),		: BitcodeReaderBase(Buffer), Context(Context), ValueList(Context),
MetadataList(Context) {}		MetadataList(Context) {}

BitcodeReader::BitcodeReader(LLVMContext &Context)
: Context(Context), ValueList(Context), MetadataList(Context) {}

std::error_code BitcodeReader::materializeForwardReferencedFunctions() {		std::error_code BitcodeReader::materializeForwardReferencedFunctions() {
if (WillMaterializeAllForwardRefs)		if (WillMaterializeAllForwardRefs)
return std::error_code();		return std::error_code();

// Prevent recursion.		// Prevent recursion.
WillMaterializeAllForwardRefs = true;		WillMaterializeAllForwardRefs = true;

while (!BasicBlockFwdRefQueue.empty()) {		while (!BasicBlockFwdRefQueue.empty()) {
▲ Show 20 Lines • Show All 1,441 Lines • ▼ Show 20 Lines	if (!NumStrings)
return error("Invalid record: metadata strings with no strings");		return error("Invalid record: metadata strings with no strings");
if (StringsOffset > Blob.size())		if (StringsOffset > Blob.size())
return error("Invalid record: metadata strings corrupt offset");		return error("Invalid record: metadata strings corrupt offset");

StringRef Lengths = Blob.slice(0, StringsOffset);		StringRef Lengths = Blob.slice(0, StringsOffset);
SimpleBitstreamCursor R(*StreamFile);		SimpleBitstreamCursor R(*StreamFile);
R.jumpToPointer(Lengths.begin());		R.jumpToPointer(Lengths.begin());

// Ensure that Blob doesn't get invalidated, even if this is reading from
// a StreamingMemoryObject with corrupt data.
R.setArtificialByteLimit(R.getCurrentByteNo() + StringsOffset);

StringRef Strings = Blob.drop_front(StringsOffset);		StringRef Strings = Blob.drop_front(StringsOffset);
do {		do {
if (R.AtEndOfStream())		if (R.AtEndOfStream())
return error("Invalid record: metadata strings bad length");		return error("Invalid record: metadata strings bad length");

unsigned Size = R.ReadVBR(6);		unsigned Size = R.ReadVBR(6);
if (Strings.size() < Size)		if (Strings.size() < Size)
return error("Invalid record: metadata strings truncated chars");		return error("Invalid record: metadata strings truncated chars");
▲ Show 20 Lines • Show All 2,017 Lines • ▼ Show 20 Lines	if (Stream.Read(8) != 'B' \|\|
Stream.Read(4) != 0x0 \|\|		Stream.Read(4) != 0x0 \|\|
Stream.Read(4) != 0xC \|\|		Stream.Read(4) != 0xC \|\|
Stream.Read(4) != 0xE \|\|		Stream.Read(4) != 0xE \|\|
Stream.Read(4) != 0xD)		Stream.Read(4) != 0xD)
return false;		return false;
return true;		return true;
}		}

std::error_code		std::error_code BitcodeReader::parseBitcodeInto(Module *M,
BitcodeReader::parseBitcodeInto(std::unique_ptr<DataStreamer> Streamer,		bool ShouldLazyLoadMetadata) {
Module *M, bool ShouldLazyLoadMetadata) {
TheModule = M;		TheModule = M;

if (std::error_code EC = initStream(std::move(Streamer)))		if (std::error_code EC = initStream())
return EC;		return EC;

// Sniff for the signature.		// Sniff for the signature.
if (!hasValidBitcodeHeader(Stream))		if (!hasValidBitcodeHeader(Stream))
return error("Invalid bitcode signature");		return error("Invalid bitcode signature");

// We expect a number of well-defined blocks, though we don't necessarily		// We expect a number of well-defined blocks, though we don't necessarily
// need to understand them all.		// need to understand them all.
▲ Show 20 Lines • Show All 57 Lines • ▼ Show 20 Lines	while (true) {
}		}
}		}
Record.clear();		Record.clear();
}		}
llvm_unreachable("Exit infinite loop");		llvm_unreachable("Exit infinite loop");
}		}

ErrorOr<std::string> BitcodeReader::parseTriple() {		ErrorOr<std::string> BitcodeReader::parseTriple() {
if (std::error_code EC = initStream(nullptr))		if (std::error_code EC = initStream())
return EC;		return EC;

// Sniff for the signature.		// Sniff for the signature.
if (!hasValidBitcodeHeader(Stream))		if (!hasValidBitcodeHeader(Stream))
return error("Invalid bitcode signature");		return error("Invalid bitcode signature");

// We expect a number of well-defined blocks, though we don't necessarily		// We expect a number of well-defined blocks, though we don't necessarily
// need to understand them all.		// need to understand them all.
Show All 18 Lines	while (true) {
case BitstreamEntry::Record:		case BitstreamEntry::Record:
Stream.skipRecord(Entry.ID);		Stream.skipRecord(Entry.ID);
continue;		continue;
}		}
}		}
}		}

ErrorOr<std::string> BitcodeReader::parseIdentificationBlock() {		ErrorOr<std::string> BitcodeReader::parseIdentificationBlock() {
if (std::error_code EC = initStream(nullptr))		if (std::error_code EC = initStream())
return EC;		return EC;

// Sniff for the signature.		// Sniff for the signature.
if (!hasValidBitcodeHeader(Stream))		if (!hasValidBitcodeHeader(Stream))
return error("Invalid bitcode signature");		return error("Invalid bitcode signature");

// We expect a number of well-defined blocks, though we don't necessarily		// We expect a number of well-defined blocks, though we don't necessarily
// need to understand them all.		// need to understand them all.
Show All 33 Lines	for (unsigned I = 0, E = Record.size(); I != E; I += 2) {
if (!MD)		if (!MD)
return error("Invalid metadata attachment");		return error("Invalid metadata attachment");
GO.addMetadata(K->second, *MD);		GO.addMetadata(K->second, *MD);
}		}
return std::error_code();		return std::error_code();
}		}

ErrorOr<bool> BitcodeReader::hasObjCCategory() {		ErrorOr<bool> BitcodeReader::hasObjCCategory() {
if (std::error_code EC = initStream(nullptr))		if (std::error_code EC = initStream())
return EC;		return EC;

// Sniff for the signature.		// Sniff for the signature.
if (!hasValidBitcodeHeader(Stream))		if (!hasValidBitcodeHeader(Stream))
return error("Invalid bitcode signature");		return error("Invalid bitcode signature");

// We expect a number of well-defined blocks, though we don't necessarily		// We expect a number of well-defined blocks, though we don't necessarily
// need to understand them all.		// need to understand them all.
▲ Show 20 Lines • Show All 1,571 Lines • ▼ Show 20 Lines
std::error_code ModuleSummaryIndexBitcodeReader::error(const Twine &Message) {		std::error_code ModuleSummaryIndexBitcodeReader::error(const Twine &Message) {
return ::error(DiagnosticHandler,		return ::error(DiagnosticHandler,
make_error_code(BitcodeError::CorruptedBitcode), Message);		make_error_code(BitcodeError::CorruptedBitcode), Message);
}		}

ModuleSummaryIndexBitcodeReader::ModuleSummaryIndexBitcodeReader(		ModuleSummaryIndexBitcodeReader::ModuleSummaryIndexBitcodeReader(
MemoryBuffer *Buffer, DiagnosticHandlerFunction DiagnosticHandler,		MemoryBuffer *Buffer, DiagnosticHandlerFunction DiagnosticHandler,
bool CheckGlobalValSummaryPresenceOnly)		bool CheckGlobalValSummaryPresenceOnly)
: BitcodeReaderBase(Buffer), DiagnosticHandler(std::move(DiagnosticHandler)),		: BitcodeReaderBase(Buffer),
		DiagnosticHandler(std::move(DiagnosticHandler)),
CheckGlobalValSummaryPresenceOnly(CheckGlobalValSummaryPresenceOnly) {}		CheckGlobalValSummaryPresenceOnly(CheckGlobalValSummaryPresenceOnly) {}

void ModuleSummaryIndexBitcodeReader::freeState() { Buffer = nullptr; }		void ModuleSummaryIndexBitcodeReader::freeState() { Buffer = nullptr; }

void ModuleSummaryIndexBitcodeReader::releaseBuffer() { Buffer.release(); }		void ModuleSummaryIndexBitcodeReader::releaseBuffer() { Buffer.release(); }

std::pair<GlobalValue::GUID, GlobalValue::GUID>		std::pair<GlobalValue::GUID, GlobalValue::GUID>
ModuleSummaryIndexBitcodeReader::getGUIDFromValueId(unsigned ValueId) {		ModuleSummaryIndexBitcodeReader::getGUIDFromValueId(unsigned ValueId) {
▲ Show 20 Lines • Show All 583 Lines • ▼ Show 20 Lines	case bitc::MST_CODE_HASH: {
break;		break;
}		}
}		}
}		}
llvm_unreachable("Exit infinite loop");		llvm_unreachable("Exit infinite loop");
}		}

// Parse the function info index from the bitcode streamer into the given index.		// Parse the function info index from the bitcode streamer into the given index.
std::error_code ModuleSummaryIndexBitcodeReader::parseSummaryIndexInto(		std::error_code
std::unique_ptr<DataStreamer> Streamer, ModuleSummaryIndex *I) {		ModuleSummaryIndexBitcodeReader::parseSummaryIndexInto(ModuleSummaryIndex *I) {
TheIndex = I;		TheIndex = I;

if (std::error_code EC = initStream(std::move(Streamer)))		if (std::error_code EC = initStream())
return EC;		return EC;

// Sniff for the signature.		// Sniff for the signature.
if (!hasValidBitcodeHeader(Stream))		if (!hasValidBitcodeHeader(Stream))
return error("Invalid bitcode signature");		return error("Invalid bitcode signature");

// We expect a number of well-defined blocks, though we don't necessarily		// We expect a number of well-defined blocks, though we don't necessarily
// need to understand them all.		// need to understand them all.
▲ Show 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	const std::error_category &llvm::BitcodeErrorCategory() {
return *ErrorCategory;		return *ErrorCategory;
}		}

//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
// External interface		// External interface
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

static ErrorOr<std::unique_ptr<Module>>		static ErrorOr<std::unique_ptr<Module>>
getBitcodeModuleImpl(std::unique_ptr<DataStreamer> Streamer, StringRef Name,		getBitcodeModuleImpl(StringRef Name, BitcodeReader *R, LLVMContext &Context,
BitcodeReader *R, LLVMContext &Context,
bool MaterializeAll, bool ShouldLazyLoadMetadata) {		bool MaterializeAll, bool ShouldLazyLoadMetadata) {
std::unique_ptr<Module> M = llvm::make_unique<Module>(Name, Context);		std::unique_ptr<Module> M = llvm::make_unique<Module>(Name, Context);
M->setMaterializer(R);		M->setMaterializer(R);

auto cleanupOnError = [&](std::error_code EC) {		auto cleanupOnError = [&](std::error_code EC) {
R->releaseBuffer(); // Never take ownership on error.		R->releaseBuffer(); // Never take ownership on error.
return EC;		return EC;
};		};

// Delay parsing Metadata if ShouldLazyLoadMetadata is true.		// Delay parsing Metadata if ShouldLazyLoadMetadata is true.
if (std::error_code EC = R->parseBitcodeInto(std::move(Streamer), M.get(),		if (std::error_code EC = R->parseBitcodeInto(M.get(), ShouldLazyLoadMetadata))
ShouldLazyLoadMetadata))
return cleanupOnError(EC);		return cleanupOnError(EC);

if (MaterializeAll) {		if (MaterializeAll) {
// Read in the entire module, and destroy the BitcodeReader.		// Read in the entire module, and destroy the BitcodeReader.
if (std::error_code EC = M->materializeAll())		if (std::error_code EC = M->materializeAll())
return cleanupOnError(EC);		return cleanupOnError(EC);
} else {		} else {
// Resolve forward references from blockaddresses.		// Resolve forward references from blockaddresses.
Show All 13 Lines
/// everything.		/// everything.
static ErrorOr<std::unique_ptr<Module>>		static ErrorOr<std::unique_ptr<Module>>
getLazyBitcodeModuleImpl(std::unique_ptr<MemoryBuffer> &&Buffer,		getLazyBitcodeModuleImpl(std::unique_ptr<MemoryBuffer> &&Buffer,
LLVMContext &Context, bool MaterializeAll,		LLVMContext &Context, bool MaterializeAll,
bool ShouldLazyLoadMetadata = false) {		bool ShouldLazyLoadMetadata = false) {
BitcodeReader *R = new BitcodeReader(Buffer.get(), Context);		BitcodeReader *R = new BitcodeReader(Buffer.get(), Context);

ErrorOr<std::unique_ptr<Module>> Ret =		ErrorOr<std::unique_ptr<Module>> Ret =
getBitcodeModuleImpl(nullptr, Buffer->getBufferIdentifier(), R, Context,		getBitcodeModuleImpl(Buffer->getBufferIdentifier(), R, Context,
MaterializeAll, ShouldLazyLoadMetadata);		MaterializeAll, ShouldLazyLoadMetadata);
if (!Ret)		if (!Ret)
return Ret;		return Ret;

Buffer.release(); // The BitcodeReader owns it now.		Buffer.release(); // The BitcodeReader owns it now.
return Ret;		return Ret;
}		}

ErrorOr<std::unique_ptr<Module>>		ErrorOr<std::unique_ptr<Module>>
llvm::getLazyBitcodeModule(std::unique_ptr<MemoryBuffer> &&Buffer,		llvm::getLazyBitcodeModule(std::unique_ptr<MemoryBuffer> &&Buffer,
LLVMContext &Context, bool ShouldLazyLoadMetadata) {		LLVMContext &Context, bool ShouldLazyLoadMetadata) {
return getLazyBitcodeModuleImpl(std::move(Buffer), Context, false,		return getLazyBitcodeModuleImpl(std::move(Buffer), Context, false,
ShouldLazyLoadMetadata);		ShouldLazyLoadMetadata);
}		}

ErrorOr<std::unique_ptr<Module>>
llvm::getStreamedBitcodeModule(StringRef Name,
std::unique_ptr<DataStreamer> Streamer,
LLVMContext &Context) {
std::unique_ptr<Module> M = llvm::make_unique<Module>(Name, Context);
BitcodeReader *R = new BitcodeReader(Context);

return getBitcodeModuleImpl(std::move(Streamer), Name, R, Context, false,
false);
}

ErrorOr<std::unique_ptr<Module>> llvm::parseBitcodeFile(MemoryBufferRef Buffer,		ErrorOr<std::unique_ptr<Module>> llvm::parseBitcodeFile(MemoryBufferRef Buffer,
LLVMContext &Context) {		LLVMContext &Context) {
std::unique_ptr<MemoryBuffer> Buf = MemoryBuffer::getMemBuffer(Buffer, false);		std::unique_ptr<MemoryBuffer> Buf = MemoryBuffer::getMemBuffer(Buffer, false);
return getLazyBitcodeModuleImpl(std::move(Buf), Context, true);		return getLazyBitcodeModuleImpl(std::move(Buf), Context, true);
// TODO: Restore the use-lists to the in-memory state when the bitcode was		// TODO: Restore the use-lists to the in-memory state when the bitcode was
// written. We must defer until the Module has been fully materialized.		// written. We must defer until the Module has been fully materialized.
}		}

Show All 36 Lines	ErrorOr<std::unique_ptr<ModuleSummaryIndex>> llvm::getModuleSummaryIndex(

auto Index = llvm::make_unique<ModuleSummaryIndex>();		auto Index = llvm::make_unique<ModuleSummaryIndex>();

auto cleanupOnError = [&](std::error_code EC) {		auto cleanupOnError = [&](std::error_code EC) {
R.releaseBuffer(); // Never take ownership on error.		R.releaseBuffer(); // Never take ownership on error.
return EC;		return EC;
};		};

if (std::error_code EC = R.parseSummaryIndexInto(nullptr, Index.get()))		if (std::error_code EC = R.parseSummaryIndexInto(Index.get()))
return cleanupOnError(EC);		return cleanupOnError(EC);

Buf.release(); // The ModuleSummaryIndexBitcodeReader owns it now.		Buf.release(); // The ModuleSummaryIndexBitcodeReader owns it now.
return std::move(Index);		return std::move(Index);
}		}

// Check if the given bitcode buffer contains a global value summary block.		// Check if the given bitcode buffer contains a global value summary block.
bool llvm::hasGlobalValueSummary(		bool llvm::hasGlobalValueSummary(
MemoryBufferRef Buffer,		MemoryBufferRef Buffer,
const DiagnosticHandlerFunction &DiagnosticHandler) {		const DiagnosticHandlerFunction &DiagnosticHandler) {
std::unique_ptr<MemoryBuffer> Buf = MemoryBuffer::getMemBuffer(Buffer, false);		std::unique_ptr<MemoryBuffer> Buf = MemoryBuffer::getMemBuffer(Buffer, false);
ModuleSummaryIndexBitcodeReader R(Buf.get(), DiagnosticHandler, true);		ModuleSummaryIndexBitcodeReader R(Buf.get(), DiagnosticHandler, true);

auto cleanupOnError = [&](std::error_code EC) {		auto cleanupOnError = [&](std::error_code EC) {
R.releaseBuffer(); // Never take ownership on error.		R.releaseBuffer(); // Never take ownership on error.
return false;		return false;
};		};

if (std::error_code EC = R.parseSummaryIndexInto(nullptr, nullptr))		if (std::error_code EC = R.parseSummaryIndexInto(nullptr))
return cleanupOnError(EC);		return cleanupOnError(EC);

Buf.release(); // The ModuleSummaryIndexBitcodeReader owns it now.		Buf.release(); // The ModuleSummaryIndexBitcodeReader owns it now.
return R.foundGlobalValSummary();		return R.foundGlobalValSummary();
}		}

llvm/trunk/test/Bitcode/Inputs/invalid-array-operand-encoding.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-code-len-width.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-extractval-array-idx.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-function-comdat-id.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-fwdref-type-mismatch-2.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-metadata-not-followed-named-node.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-name-with-0-byte.bc

This is a binary file.

llvm/trunk/test/Bitcode/Inputs/invalid-unexpected-eof.bc

This file uses an unknown character encoding.

	BC折!␌000000␋00000000000000			BC折!␌000000␋000000000000000
	No newline at end of file			No newline at end of file

llvm/trunk/tools/llvm-bcanalyzer/llvm-bcanalyzer.cpp

Show First 20 Lines • Show All 430 Lines • ▼ Show 20 Lines	static bool decodeMetadataStringsBlob(BitstreamReader &Reader, StringRef Indent,
unsigned NumStrings = Record[0];		unsigned NumStrings = Record[0];
unsigned StringsOffset = Record[1];		unsigned StringsOffset = Record[1];
outs() << " num-strings = " << NumStrings << " {\n";		outs() << " num-strings = " << NumStrings << " {\n";

StringRef Lengths = Blob.slice(0, StringsOffset);		StringRef Lengths = Blob.slice(0, StringsOffset);
SimpleBitstreamCursor R(Reader);		SimpleBitstreamCursor R(Reader);
R.jumpToPointer(Lengths.begin());		R.jumpToPointer(Lengths.begin());

// Ensure that Blob doesn't get invalidated, even if this is reading from a
// StreamingMemoryObject with corrupt data.
R.setArtificialByteLimit(R.getCurrentByteNo() + StringsOffset);

StringRef Strings = Blob.drop_front(StringsOffset);		StringRef Strings = Blob.drop_front(StringsOffset);
do {		do {
if (R.AtEndOfStream())		if (R.AtEndOfStream())
return ReportError("bad length");		return ReportError("bad length");

unsigned Size = R.ReadVBR(6);		unsigned Size = R.ReadVBR(6);
if (Strings.size() < Size)		if (Strings.size() < Size)
return ReportError("truncated chars");		return ReportError("truncated chars");
▲ Show 20 Lines • Show All 279 Lines • ▼ Show 20 Lines	if (Dump) {
<< " Size=" << format_hex(Size, 10)		<< " Size=" << format_hex(Size, 10)
<< " CPUType=" << format_hex(CPUType, 10) << "/>\n";		<< " CPUType=" << format_hex(CPUType, 10) << "/>\n";
}		}

if (SkipBitcodeWrapperHeader(BufPtr, EndBufPtr, true))		if (SkipBitcodeWrapperHeader(BufPtr, EndBufPtr, true))
return ReportError("Invalid bitcode wrapper header");		return ReportError("Invalid bitcode wrapper header");
}		}

StreamFile = BitstreamReader(BufPtr, EndBufPtr);		StreamFile = BitstreamReader(ArrayRef<uint8_t>(BufPtr, EndBufPtr));
Stream = BitstreamCursor(StreamFile);		Stream = BitstreamCursor(StreamFile);
StreamFile.CollectBlockInfoNames();		StreamFile.CollectBlockInfoNames();

// Read the stream signature.		// Read the stream signature.
char Signature[6];		char Signature[6];
Signature[0] = Stream.Read(8);		Signature[0] = Stream.Read(8);
Signature[1] = Stream.Read(8);		Signature[1] = Stream.Read(8);
Signature[2] = Stream.Read(4);		Signature[2] = Stream.Read(4);
▲ Show 20 Lines • Show All 62 Lines • ▼ Show 20 Lines	while (!Stream.AtEndOfStream()) {

if (ParseBlock(Stream, BlockID, 0, CurStreamType))		if (ParseBlock(Stream, BlockID, 0, CurStreamType))
return true;		return true;
++NumTopBlocks;		++NumTopBlocks;
}		}

if (Dump) outs() << "\n\n";		if (Dump) outs() << "\n\n";

uint64_t BufferSizeBits = StreamFile.getBitcodeBytes().getExtent() * CHAR_BIT;		uint64_t BufferSizeBits = StreamFile.getBitcodeBytes().size() * CHAR_BIT;
// Print a summary of the read file.		// Print a summary of the read file.
outs() << "Summary of " << InputFilename << ":\n";		outs() << "Summary of " << InputFilename << ":\n";
outs() << " Total size: ";		outs() << " Total size: ";
PrintSize(BufferSizeBits);		PrintSize(BufferSizeBits);
outs() << "\n";		outs() << "\n";
outs() << " Stream type: ";		outs() << " Stream type: ";
switch (CurStreamType) {		switch (CurStreamType) {
case UnknownBitstream: outs() << "unknown\n"; break;		case UnknownBitstream: outs() << "unknown\n"; break;
▲ Show 20 Lines • Show All 99 Lines • Show Last 20 Lines

llvm/trunk/tools/llvm-dis/llvm-dis.cpp

Show All 20 Lines
#include "llvm/IR/AssemblyAnnotationWriter.h"		#include "llvm/IR/AssemblyAnnotationWriter.h"
#include "llvm/IR/DebugInfo.h"		#include "llvm/IR/DebugInfo.h"
#include "llvm/IR/DiagnosticInfo.h"		#include "llvm/IR/DiagnosticInfo.h"
#include "llvm/IR/DiagnosticPrinter.h"		#include "llvm/IR/DiagnosticPrinter.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Type.h"		#include "llvm/IR/Type.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/DataStream.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include "llvm/Support/FileSystem.h"		#include "llvm/Support/FileSystem.h"
#include "llvm/Support/FormattedStream.h"		#include "llvm/Support/FormattedStream.h"
#include "llvm/Support/ManagedStatic.h"		#include "llvm/Support/ManagedStatic.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/PrettyStackTrace.h"		#include "llvm/Support/PrettyStackTrace.h"
#include "llvm/Support/Signals.h"		#include "llvm/Support/Signals.h"
#include "llvm/Support/ToolOutputFile.h"		#include "llvm/Support/ToolOutputFile.h"
▲ Show 20 Lines • Show All 96 Lines • ▼ Show 20 Lines	static void diagnosticHandler(const DiagnosticInfo &DI, void *Context) {
DI.print(DP);		DI.print(DP);
OS << '\n';		OS << '\n';

if (DI.getSeverity() == DS_Error)		if (DI.getSeverity() == DS_Error)
exit(1);		exit(1);
}		}

static Expected<std::unique_ptr<Module>> openInputFile(LLVMContext &Context) {		static Expected<std::unique_ptr<Module>> openInputFile(LLVMContext &Context) {
if (MaterializeMetadata) {
ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =		ErrorOr<std::unique_ptr<MemoryBuffer>> MBOrErr =
MemoryBuffer::getFileOrSTDIN(InputFilename);		MemoryBuffer::getFileOrSTDIN(InputFilename);
if (!MBOrErr)		if (!MBOrErr)
return errorCodeToError(MBOrErr.getError());		return errorCodeToError(MBOrErr.getError());
ErrorOr<std::unique_ptr<Module>> MOrErr =		ErrorOr<std::unique_ptr<Module>> MOrErr =
getLazyBitcodeModule(std::move(*MBOrErr), Context,		getLazyBitcodeModule(std::move(*MBOrErr), Context,
/ShouldLazyLoadMetadata=/true);		/ShouldLazyLoadMetadata=/true);
if (!MOrErr)		if (!MOrErr)
return errorCodeToError(MOrErr.getError());		return errorCodeToError(MOrErr.getError());
		if (MaterializeMetadata)
(*MOrErr)->materializeMetadata();		(*MOrErr)->materializeMetadata();
return std::move(*MOrErr);
} else {
std::string ErrorMessage;
std::unique_ptr<DataStreamer> Streamer =
getDataFileStreamer(InputFilename, &ErrorMessage);
if (!Streamer)
return make_error<StringError>(ErrorMessage, inconvertibleErrorCode());
std::string DisplayFilename;
if (InputFilename == "-")
DisplayFilename = "<stdin>";
else		else
DisplayFilename = InputFilename;
ErrorOr<std::unique_ptr<Module>> MOrErr =
getStreamedBitcodeModule(DisplayFilename, std::move(Streamer), Context);
(*MOrErr)->materializeAll();		(*MOrErr)->materializeAll();
return std::move(*MOrErr);		return std::move(*MOrErr);
}		}
}

int main(int argc, char **argv) {		int main(int argc, char **argv) {
// Print a stack trace if we signal out.		// Print a stack trace if we signal out.
sys::PrintStackTraceOnErrorSignal(argv[0]);		sys::PrintStackTraceOnErrorSignal(argv[0]);
PrettyStackTraceProgram X(argc, argv);		PrettyStackTraceProgram X(argc, argv);

LLVMContext Context;		LLVMContext Context;
llvm_shutdown_obj Y; // Call llvm_shutdown() on exit.		llvm_shutdown_obj Y; // Call llvm_shutdown() on exit.
▲ Show 20 Lines • Show All 51 Lines • Show Last 20 Lines

llvm/trunk/unittests/Bitcode/BitReaderTest.cpp

Show All 12 Lines
#include "llvm/Bitcode/BitstreamReader.h"		#include "llvm/Bitcode/BitstreamReader.h"
#include "llvm/Bitcode/BitstreamWriter.h"		#include "llvm/Bitcode/BitstreamWriter.h"
#include "llvm/Bitcode/ReaderWriter.h"		#include "llvm/Bitcode/ReaderWriter.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/Module.h"		#include "llvm/IR/Module.h"
#include "llvm/IR/Verifier.h"		#include "llvm/IR/Verifier.h"
#include "llvm/Support/DataStream.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/MemoryBuffer.h"		#include "llvm/Support/MemoryBuffer.h"
#include "llvm/Support/SourceMgr.h"		#include "llvm/Support/SourceMgr.h"
#include "llvm/Support/StreamingMemoryObject.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

std::unique_ptr<Module> parseAssembly(LLVMContext &Context,		std::unique_ptr<Module> parseAssembly(LLVMContext &Context,
const char *Assembly) {		const char *Assembly) {
Show All 23 Lines	static std::unique_ptr<Module> getLazyModuleFromAssembly(LLVMContext &Context,
writeModuleToBuffer(parseAssembly(Context, Assembly), Mem);		writeModuleToBuffer(parseAssembly(Context, Assembly), Mem);
std::unique_ptr<MemoryBuffer> Buffer =		std::unique_ptr<MemoryBuffer> Buffer =
MemoryBuffer::getMemBuffer(Mem.str(), "test", false);		MemoryBuffer::getMemBuffer(Mem.str(), "test", false);
ErrorOr<std::unique_ptr<Module>> ModuleOrErr =		ErrorOr<std::unique_ptr<Module>> ModuleOrErr =
getLazyBitcodeModule(std::move(Buffer), Context);		getLazyBitcodeModule(std::move(Buffer), Context);
return std::move(ModuleOrErr.get());		return std::move(ModuleOrErr.get());
}		}

class BufferDataStreamer : public DataStreamer {
std::unique_ptr<MemoryBuffer> Buffer;
unsigned Pos = 0;
size_t GetBytes(unsigned char *Out, size_t Len) override {
StringRef Buf = Buffer->getBuffer();
size_t Left = Buf.size() - Pos;
Len = std::min(Left, Len);
memcpy(Out, Buffer->getBuffer().substr(Pos).data(), Len);
Pos += Len;
return Len;
}

public:
BufferDataStreamer(std::unique_ptr<MemoryBuffer> Buffer)
: Buffer(std::move(Buffer)) {}
};

static std::unique_ptr<Module>
getStreamedModuleFromAssembly(LLVMContext &Context, SmallString<1024> &Mem,
const char *Assembly) {
writeModuleToBuffer(parseAssembly(Context, Assembly), Mem);
std::unique_ptr<MemoryBuffer> Buffer =
MemoryBuffer::getMemBuffer(Mem.str(), "test", false);
auto Streamer = llvm::make_unique<BufferDataStreamer>(std::move(Buffer));
ErrorOr<std::unique_ptr<Module>> ModuleOrErr =
getStreamedBitcodeModule("test", std::move(Streamer), Context);
return std::move(ModuleOrErr.get());
}

// Checks if we correctly detect eof if we try to read N bits when there are not
// enough bits left on the input stream to read N bits, and we are using a data
// streamer. In particular, it checks if we properly set the object size when
// the eof is reached under such conditions.
TEST(BitReaderTest, TestForEofAfterReadFailureOnDataStreamer) {
// Note: Because StreamingMemoryObject does a call to method GetBytes in it's
// constructor, using internal constant kChunkSize, we must fill the input
// with more characters than that amount.
static size_t InputSize = StreamingMemoryObject::kChunkSize + 5;
char *Text = new char[InputSize];
std::memset(Text, 'a', InputSize);
Text[InputSize - 1] = '\0';
StringRef Input(Text);

// Build bitsteam reader using data streamer.
auto MemoryBuf = MemoryBuffer::getMemBuffer(Input);
std::unique_ptr<DataStreamer> Streamer(
new BufferDataStreamer(std::move(MemoryBuf)));
auto OwnedBytes =
llvm::make_unique<StreamingMemoryObject>(std::move(Streamer));
auto Reader = llvm::make_unique<BitstreamReader>(std::move(OwnedBytes));
BitstreamCursor Cursor;
Cursor.init(Reader.get());

// Jump to two bytes before end of stream.
Cursor.JumpToBit((InputSize - 4) * CHAR_BIT);
// Try to read 4 bytes when only 2 are present, resulting in error value 0.
const size_t ReadErrorValue = 0;
EXPECT_EQ(ReadErrorValue, Cursor.Read(32));
// Should be at eof now.
EXPECT_TRUE(Cursor.AtEndOfStream());

delete[] Text;
}

TEST(BitReaderTest, MateralizeForwardRefWithStream) {
SmallString<1024> Mem;

LLVMContext Context;
std::unique_ptr<Module> M = getStreamedModuleFromAssembly(
Context, Mem, "@table = constant i8* blockaddress(@func, %bb)\n"
"define void @func() {\n"
" unreachable\n"
"bb:\n"
" unreachable\n"
"}\n");
EXPECT_FALSE(M->getFunction("func")->empty());
}

// Tests that lazy evaluation can parse functions out of order.		// Tests that lazy evaluation can parse functions out of order.
TEST(BitReaderTest, MaterializeFunctionsOutOfOrder) {		TEST(BitReaderTest, MaterializeFunctionsOutOfOrder) {
SmallString<1024> Mem;		SmallString<1024> Mem;
LLVMContext Context;		LLVMContext Context;
std::unique_ptr<Module> M = getLazyModuleFromAssembly(		std::unique_ptr<Module> M = getLazyModuleFromAssembly(
Context, Mem, "define void @f() {\n"		Context, Mem, "define void @f() {\n"
" unreachable\n"		" unreachable\n"
"}\n"		"}\n"
▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	TEST(BitReaderTest, MaterializeFunctionsForBlockAddr) { // PR11677
std::unique_ptr<Module> M = getLazyModuleFromAssembly(		std::unique_ptr<Module> M = getLazyModuleFromAssembly(
Context, Mem, "@table = constant i8* blockaddress(@func, %bb)\n"		Context, Mem, "@table = constant i8* blockaddress(@func, %bb)\n"
"define void @func() {\n"		"define void @func() {\n"
" unreachable\n"		" unreachable\n"
"bb:\n"		"bb:\n"
" unreachable\n"		" unreachable\n"
"}\n");		"}\n");
EXPECT_FALSE(verifyModule(*M, &dbgs()));		EXPECT_FALSE(verifyModule(*M, &dbgs()));
		EXPECT_FALSE(M->getFunction("func")->empty());
}		}

TEST(BitReaderTest, MaterializeFunctionsForBlockAddrInFunctionBefore) {		TEST(BitReaderTest, MaterializeFunctionsForBlockAddrInFunctionBefore) {
SmallString<1024> Mem;		SmallString<1024> Mem;

LLVMContext Context;		LLVMContext Context;
std::unique_ptr<Module> M = getLazyModuleFromAssembly(		std::unique_ptr<Module> M = getLazyModuleFromAssembly(
Context, Mem, "define i8* @before() {\n"		Context, Mem, "define i8* @before() {\n"
▲ Show 20 Lines • Show All 49 Lines • Show Last 20 Lines

llvm/trunk/unittests/Bitcode/BitstreamReaderTest.cpp

//===- BitstreamReaderTest.cpp - Tests for BitstreamReader ----------------===//		//===- BitstreamReaderTest.cpp - Tests for BitstreamReader ----------------===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/Bitcode/BitstreamReader.h"		#include "llvm/Bitcode/BitstreamReader.h"
#include "llvm/Bitcode/BitstreamWriter.h"		#include "llvm/Bitcode/BitstreamWriter.h"
#include "llvm/Support/StreamingMemoryObject.h"
#include "gtest/gtest.h"		#include "gtest/gtest.h"

using namespace llvm;		using namespace llvm;

namespace {		namespace {

class BufferStreamer : public DataStreamer {
StringRef Buffer;

public:
BufferStreamer(StringRef Buffer) : Buffer(Buffer) {}
size_t GetBytes(unsigned char *OutBuffer, size_t Length) override {
if (Length >= Buffer.size())
Length = Buffer.size();

std::copy(Buffer.begin(), Buffer.begin() + Length, OutBuffer);
Buffer = Buffer.drop_front(Length);
return Length;
}
};

TEST(BitstreamReaderTest, AtEndOfStream) {		TEST(BitstreamReaderTest, AtEndOfStream) {
uint8_t Bytes[4] = {		uint8_t Bytes[4] = {
0x00, 0x01, 0x02, 0x03		0x00, 0x01, 0x02, 0x03
};		};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));		BitstreamReader Reader(Bytes);
BitstreamCursor Cursor(Reader);		BitstreamCursor Cursor(Reader);

EXPECT_FALSE(Cursor.AtEndOfStream());		EXPECT_FALSE(Cursor.AtEndOfStream());
(void)Cursor.Read(8);		(void)Cursor.Read(8);
EXPECT_FALSE(Cursor.AtEndOfStream());		EXPECT_FALSE(Cursor.AtEndOfStream());
(void)Cursor.Read(24);		(void)Cursor.Read(24);
EXPECT_TRUE(Cursor.AtEndOfStream());		EXPECT_TRUE(Cursor.AtEndOfStream());

Cursor.JumpToBit(0);		Cursor.JumpToBit(0);
EXPECT_FALSE(Cursor.AtEndOfStream());		EXPECT_FALSE(Cursor.AtEndOfStream());

Cursor.JumpToBit(32);		Cursor.JumpToBit(32);
EXPECT_TRUE(Cursor.AtEndOfStream());		EXPECT_TRUE(Cursor.AtEndOfStream());
}		}

TEST(BitstreamReaderTest, AtEndOfStreamJump) {		TEST(BitstreamReaderTest, AtEndOfStreamJump) {
uint8_t Bytes[4] = {		uint8_t Bytes[4] = {
0x00, 0x01, 0x02, 0x03		0x00, 0x01, 0x02, 0x03
};		};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));		BitstreamReader Reader(Bytes);
BitstreamCursor Cursor(Reader);		BitstreamCursor Cursor(Reader);

Cursor.JumpToBit(32);		Cursor.JumpToBit(32);
EXPECT_TRUE(Cursor.AtEndOfStream());		EXPECT_TRUE(Cursor.AtEndOfStream());
}		}

TEST(BitstreamReaderTest, AtEndOfStreamEmpty) {		TEST(BitstreamReaderTest, AtEndOfStreamEmpty) {
uint8_t Dummy = 0xFF;		BitstreamReader Reader(ArrayRef<uint8_t>{});
BitstreamReader Reader(&Dummy, &Dummy);
BitstreamCursor Cursor(Reader);		BitstreamCursor Cursor(Reader);

EXPECT_TRUE(Cursor.AtEndOfStream());		EXPECT_TRUE(Cursor.AtEndOfStream());
}		}

TEST(BitstreamReaderTest, getCurrentByteNo) {		TEST(BitstreamReaderTest, getCurrentByteNo) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03};		uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));		BitstreamReader Reader(Bytes);
SimpleBitstreamCursor Cursor(Reader);		SimpleBitstreamCursor Cursor(Reader);

for (unsigned I = 0, E = 33; I != E; ++I) {		for (unsigned I = 0, E = 32; I != E; ++I) {
EXPECT_EQ(I / 8, Cursor.getCurrentByteNo());		EXPECT_EQ(I / 8, Cursor.getCurrentByteNo());
(void)Cursor.Read(1);		(void)Cursor.Read(1);
}		}
EXPECT_EQ(4u, Cursor.getCurrentByteNo());		EXPECT_EQ(4u, Cursor.getCurrentByteNo());
}		}

TEST(BitstreamReaderTest, getPointerToByte) {		TEST(BitstreamReaderTest, getPointerToByte) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07};		uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));		BitstreamReader Reader(Bytes);
SimpleBitstreamCursor Cursor(Reader);		SimpleBitstreamCursor Cursor(Reader);

for (unsigned I = 0, E = 8; I != E; ++I) {		for (unsigned I = 0, E = 8; I != E; ++I) {
EXPECT_EQ(Bytes + I, Cursor.getPointerToByte(I, 1));		EXPECT_EQ(Bytes + I, Cursor.getPointerToByte(I, 1));
}		}
}		}

TEST(BitstreamReaderTest, getPointerToBit) {		TEST(BitstreamReaderTest, getPointerToBit) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07};		uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));		BitstreamReader Reader(Bytes);
SimpleBitstreamCursor Cursor(Reader);		SimpleBitstreamCursor Cursor(Reader);

for (unsigned I = 0, E = 8; I != E; ++I) {		for (unsigned I = 0, E = 8; I != E; ++I) {
EXPECT_EQ(Bytes + I, Cursor.getPointerToBit(I * 8, 1));		EXPECT_EQ(Bytes + I, Cursor.getPointerToBit(I * 8, 1));
}		}
}		}

TEST(BitstreamReaderTest, jumpToPointer) {		TEST(BitstreamReaderTest, jumpToPointer) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07};		uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));		BitstreamReader Reader(Bytes);
SimpleBitstreamCursor Cursor(Reader);		SimpleBitstreamCursor Cursor(Reader);

for (unsigned I : {0, 6, 2, 7}) {		for (unsigned I : {0, 6, 2, 7}) {
Cursor.jumpToPointer(Bytes + I);		Cursor.jumpToPointer(Bytes + I);
EXPECT_EQ(I, Cursor.getCurrentByteNo());		EXPECT_EQ(I, Cursor.getCurrentByteNo());
}		}
}		}

TEST(BitstreamReaderTest, setArtificialByteLimit) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07,
0x08, 0x09, 0x0a, 0x0b, 0x0c, 0x0d, 0x0e, 0x0f};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));
SimpleBitstreamCursor Cursor(Reader);

Cursor.setArtificialByteLimit(8);
EXPECT_EQ(8u, Cursor.getSizeIfKnown());
while (!Cursor.AtEndOfStream())
(void)Cursor.Read(1);

EXPECT_EQ(8u, Cursor.getCurrentByteNo());
}

TEST(BitstreamReaderTest, setArtificialByteLimitNotWordBoundary) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07,
0x08, 0x09, 0x0a, 0x0b, 0x0c, 0x0d, 0x0e, 0x0f};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));
SimpleBitstreamCursor Cursor(Reader);

Cursor.setArtificialByteLimit(5);
EXPECT_EQ(8u, Cursor.getSizeIfKnown());
while (!Cursor.AtEndOfStream())
(void)Cursor.Read(1);

EXPECT_EQ(8u, Cursor.getCurrentByteNo());
}

TEST(BitstreamReaderTest, setArtificialByteLimitPastTheEnd) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07,
0x08, 0x09, 0x0a, 0x0b};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));
SimpleBitstreamCursor Cursor(Reader);

// The size of the memory object isn't known yet. Set it too high and
// confirm that we don't read too far.
Cursor.setArtificialByteLimit(24);
EXPECT_EQ(24u, Cursor.getSizeIfKnown());
while (!Cursor.AtEndOfStream())
(void)Cursor.Read(1);

EXPECT_EQ(12u, Cursor.getCurrentByteNo());
EXPECT_EQ(12u, Cursor.getSizeIfKnown());
}

TEST(BitstreamReaderTest, setArtificialByteLimitPastTheEndKnown) {
uint8_t Bytes[] = {0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07,
0x08, 0x09, 0x0a, 0x0b};
BitstreamReader Reader(std::begin(Bytes), std::end(Bytes));
SimpleBitstreamCursor Cursor(Reader);

// Save the size of the memory object in the cursor.
while (!Cursor.AtEndOfStream())
(void)Cursor.Read(1);
EXPECT_EQ(12u, Cursor.getCurrentByteNo());
EXPECT_EQ(12u, Cursor.getSizeIfKnown());

Cursor.setArtificialByteLimit(20);
EXPECT_TRUE(Cursor.AtEndOfStream());
EXPECT_EQ(12u, Cursor.getSizeIfKnown());
}

TEST(BitstreamReaderTest, readRecordWithBlobWhileStreaming) {		TEST(BitstreamReaderTest, readRecordWithBlobWhileStreaming) {
SmallVector<uint8_t, 1> BlobData;		SmallVector<uint8_t, 1> BlobData;
for (unsigned I = 0, E = 1024; I != E; ++I)		for (unsigned I = 0, E = 1024; I != E; ++I)
BlobData.push_back(I);		BlobData.push_back(I);

// Try a bunch of different sizes.		// Try a bunch of different sizes.
const unsigned Magic = 0x12345678;		const unsigned Magic = 0x12345678;
const unsigned BlockID = bitc::FIRST_APPLICATION_BLOCKID;		const unsigned BlockID = bitc::FIRST_APPLICATION_BLOCKID;
Show All 16 Lines	unsigned AbbrevID;
AbbrevID = Stream.EmitAbbrev(Abbrev);		AbbrevID = Stream.EmitAbbrev(Abbrev);
unsigned Record[] = {RecordID};		unsigned Record[] = {RecordID};
Stream.EmitRecordWithBlob(AbbrevID, makeArrayRef(Record), BlobIn);		Stream.EmitRecordWithBlob(AbbrevID, makeArrayRef(Record), BlobIn);

Stream.ExitBlock();		Stream.ExitBlock();
}		}

// Stream the buffer into the reader.		// Stream the buffer into the reader.
BitstreamReader R(llvm::make_unique<StreamingMemoryObject>(		BitstreamReader R(
llvm::make_unique<BufferStreamer>(		ArrayRef<uint8_t>((const uint8_t *)Buffer.begin(), Buffer.size()));
StringRef(Buffer.begin(), Buffer.size()))));
BitstreamCursor Stream(R);		BitstreamCursor Stream(R);

// Header. Included in test so that we can run llvm-bcanalyzer to debug		// Header. Included in test so that we can run llvm-bcanalyzer to debug
// when there are problems.		// when there are problems.
ASSERT_EQ(Magic, Stream.Read(32));		ASSERT_EQ(Magic, Stream.Read(32));

// Block.		// Block.
BitstreamEntry Entry =		BitstreamEntry Entry =
Show All 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

Bitcode: Change reader interface to take memory buffers.ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 76660

cfe/trunk/lib/CodeGen/ObjectFilePCHContainerOperations.cpp

cfe/trunk/lib/Frontend/PCHContainerOperations.cpp

cfe/trunk/lib/Frontend/SerializedDiagnosticReader.cpp

cfe/trunk/lib/Serialization/ASTReader.cpp

cfe/trunk/lib/Serialization/GlobalModuleIndex.cpp

llvm/trunk/include/llvm/Bitcode/BitstreamReader.h

llvm/trunk/include/llvm/Bitcode/ReaderWriter.h

llvm/trunk/lib/Bitcode/Reader/BitcodeReader.cpp

llvm/trunk/test/Bitcode/Inputs/invalid-array-operand-encoding.bc

llvm/trunk/test/Bitcode/Inputs/invalid-code-len-width.bc

llvm/trunk/test/Bitcode/Inputs/invalid-extractval-array-idx.bc

llvm/trunk/test/Bitcode/Inputs/invalid-function-comdat-id.bc

llvm/trunk/test/Bitcode/Inputs/invalid-fwdref-type-mismatch-2.bc

llvm/trunk/test/Bitcode/Inputs/invalid-metadata-not-followed-named-node.bc

llvm/trunk/test/Bitcode/Inputs/invalid-name-with-0-byte.bc

llvm/trunk/test/Bitcode/Inputs/invalid-unexpected-eof.bc

llvm/trunk/tools/llvm-bcanalyzer/llvm-bcanalyzer.cpp

llvm/trunk/tools/llvm-dis/llvm-dis.cpp

llvm/trunk/unittests/Bitcode/BitReaderTest.cpp

llvm/trunk/unittests/Bitcode/BitstreamReaderTest.cpp

Bitcode: Change reader interface to take memory buffers.
ClosedPublic