This is an archive of the discontinued LLVM Phabricator instance.

Differential D118694

[Bitcode] Add partial support for opaque pointer auto-upgrade
ClosedPublic

Authored by nikic on Feb 1 2022, 4:09 AM.


Details

Reviewers

aeubanks

Group Reviewers

Restricted Project

Commits

rGc28b0b9d18ed: [Bitcode] Add partial support for opaque pointer auto-upgrade

Summary

Auto-upgrades that rely on the pointer element type do not work in opaque pointer mode. The idea behind this patch is that we can instead work with type IDs, for which we can retain the pointer element type. For typed pointer bitcode, we will have a distinct type ID for pointers with distinct element type, even if there will only be a single corresponding opaque pointer type.

The disclaimer here is that this is only the first step of the change, and there are still more getPointerElementType() calls to remove. I expect that two more patches will be needed:

1. Track all "contained" type IDs, which will allow us to handle function params (which are contained in the function type) and GEPs (which may use vectors of pointers).
2. Track type IDs for values, which is e.g. necessary to handle loads.
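The type-ID idea in the summary can be sketched outside of LLVM. The following is a minimal standalone model, not the patch's code: `ToyTypeTable`, `addType`, and `addTypedPointer` are hypothetical names, and types are represented as plain strings. It only shows that two typed pointers keep distinct type IDs and distinct recorded element types even though both map to the same opaque pointer type.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy stand-in for the reader's type table; these are not LLVM's classes.
struct ToyTypeTable {
  // What the in-memory IR sees. In opaque pointer mode every pointer is "ptr".
  std::vector<std::string> TypeList;
  // Side table indexed by the same type ID: the pointee recorded in the
  // bitcode, or an empty string for non-pointer types.
  std::vector<std::string> ElementTypeList;

  // Record a non-pointer type and return its type ID.
  std::size_t addType(const std::string &Name) {
    TypeList.push_back(Name);
    ElementTypeList.push_back("");
    return TypeList.size() - 1;
  }

  // Record a typed pointer from old bitcode and return its type ID.
  std::size_t addTypedPointer(std::size_t PointeeID) {
    TypeList.push_back("ptr");
    ElementTypeList.push_back(TypeList[PointeeID]);
    return TypeList.size() - 1;
  }
};
```

In this model, an old-style record that refers to a pointer type ID can recover the element type from `ElementTypeList` without asking the pointer type itself.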

Diff Detail

Event Timeline

nikic created this revision. Feb 1 2022, 4:09 AM

Herald added subscribers: dexonsmith, hiraditya. Feb 1 2022, 4:09 AM

nikic requested review of this revision. Feb 1 2022, 4:09 AM

Herald added a project: Restricted Project. Feb 1 2022, 4:09 AM

Herald added a subscriber: llvm-commits.

Harbormaster completed remote builds in B146862: Diff 404880. Feb 1 2022, 5:48 AM

dblaikie added a subscriber: dblaikie. Feb 1 2022, 9:19 AM

dblaikie added inline comments.

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
1185–1190	Would it be reasonable/possible to always use the `ElementTypeList` even when typed pointers are supported? (frontload changes somewhat) - maybe with typed pointer support this could be an assert that the type in `ElementTypeList` matches the non-opaque pointer type? (oh, I guess since this support is currently incomplete that's not possible, but would be possible when the support is complete but before opaque pointers are enabled by default? I guess it could still be an assert today in the case where `ElementTypeList` is available, maybe?)
5036–5037	I wonder if these sort of checks could be asserts? I realize they are dynamically reachable, but they're also not intended to be reached by end users (only by LLVM developers during this migration)... I guess it's weird either way. Either we have asserts that are dynamically reachable, or we have error paths that are untested... I don't feel great about either of those, but either seem like acceptable tradeoffs during the transition.

nikic added inline comments. Feb 1 2022, 10:23 AM

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
1185–1190	Yes, we could always use ElementTypeList, would just have to populate it unconditionally. Currently I'm avoiding it if we're not doing a typed -> opaque transition, but maybe that's unnecessary micro-optimization.
5036–5037	I believe that bitcode reading is supposed to be resistant against invalid input and shouldn't assert in that case. At the same time we don't actually test these cases, because constructing the necessary invalid bitcode files would be really hard. I'm not sure if we have a fuzzer for bitcode fuzzing deployed anywhere.
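The trade-off discussed here, returning an error instead of asserting on malformed input, might look roughly like the sketch below. It is a standalone illustration under stated assumptions: `ParseResult` and `readOldStyleAlloca` are hypothetical names, element types are plain strings, and an empty string stands for "no element type recorded". Only the error message text is taken from the diff.

```cpp
#include <cassert>
#include <cstddef>
#include <optional>
#include <string>
#include <vector>

// Hypothetical result type: either an element type or an error message.
struct ParseResult {
  std::optional<std::string> ElemTy; // set on success
  std::string Error;                 // set on failure
};

// Look up the element type for an old-style record. Malformed input (an
// out-of-range ID or a non-pointer type ID) yields an error result instead
// of tripping an assertion.
ParseResult readOldStyleAlloca(const std::vector<std::string> &ElementTypes,
                               std::size_t TypeID) {
  if (TypeID >= ElementTypes.size() || ElementTypes[TypeID].empty())
    return {std::nullopt, "Missing element type for old-style alloca"};
  return {ElementTypes[TypeID], ""};
}
```

The caller can then propagate the error to the user, which keeps the reader usable on untrusted files even in builds with assertions enabled.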

dblaikie added inline comments. Feb 1 2022, 11:17 AM

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
1185–1190	Might be worth enabling it always (if not now, eventually/before/during the opaque pointer default switch) to validate that the new tracking solution (`ElementTypeList`) matches the old results?
5036–5037	(oh, I misunderstood - I'd thought these errors were only temporary while the support is incomplete? But I guess maybe these errors are permanent/could be reached even after support is fully implemented) But in general bitcode reading errors can be/are tested for instance, here: llvm/test/Bitcode/invalid.test - though, yes, writing tests is hard. Not sure if we have any particularly good techniques - I suspect it's hex editing and such at the moment, unfortunately.

Always use element type list.

nikic added inline comments. Feb 2 2022, 6:35 AM

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
1185–1190	I've changed it to always use ElementTypeList now.
5036–5037	> (oh, I misunderstood - I'd thought these errors were only temporary while the support is incomplete? But I guess maybe these errors are permanent/could be reached even after support is fully implemented)
	Yes, these are permanent. An obvious way to hit this is via a non-pointer type -- which is the case this was checking for previously already.
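The cross-check suggested earlier in this thread, validating that the element type list agrees with what typed pointers report, could be sketched as follows. This is a toy model, not LLVM code: `ToyType` and `sideTableMatches` are hypothetical, and a null `Pointee` on a pointer stands for an opaque pointer.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy type: Pointee is null for non-pointers and for opaque pointers.
struct ToyType {
  bool IsPointer = false;
  const ToyType *Pointee = nullptr;
};

// Returns true when every side-table entry agrees with what a typed pointer
// reports about itself. Opaque pointers carry no pointee, so they are skipped.
// Non-pointer types must have no recorded element type.
bool sideTableMatches(const std::vector<const ToyType *> &TypeList,
                      const std::vector<const ToyType *> &ElementTypeList) {
  if (TypeList.size() != ElementTypeList.size())
    return false;
  for (std::size_t ID = 0; ID != TypeList.size(); ++ID) {
    const ToyType *Ty = TypeList[ID];
    if (Ty->IsPointer && Ty->Pointee && Ty->Pointee != ElementTypeList[ID])
      return false;
    if (!Ty->IsPointer && ElementTypeList[ID])
      return false;
  }
  return true;
}
```

A check of this shape only makes sense while typed pointers still exist, since afterwards the side table is the sole source of element types.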

Harbormaster completed remote builds in B147117: Diff 405245. Feb 2 2022, 7:06 AM

dblaikie added a subscriber: aprantl. Feb 2 2022, 10:26 AM

dblaikie added inline comments.

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
5036–5037	@dexonsmith, @aprantl - what're your thoughts/ideas on invalid bitcode testing?

nikic mentioned this in D119000: [Bitcode] Add fuzzer for bitcode reading. Feb 4 2022, 7:46 AM

For a one-off test checking in a manually crafted .bc file seems fine to me. If you are planning to add more tests, we should invest in some kind of yaml2bc utility (like yaml2obj). Not sure if this answers your question?

In D118694#3298975, @aprantl wrote:

For a one-off test checking in a manually crafted .bc file seems fine to me. If you are planning to add more tests, we should invest in some kind of yaml2bc utility (like yaml2obj). Not sure if this answers your question?

Yeah, was mostly asking how firm we are about constructing those sort of test cases. Seems not every existing error is covered - so I don't want to request undue work/unreasonable standard for new work like this - but testing error paths does seem important to me.

@nikic could you add hand-crafted bitcode test coverage for these error messages/paths?

In D118694#3299435, @dblaikie wrote:

In D118694#3298975, @aprantl wrote:

For a one-off test checking in a manually crafted .bc file seems fine to me. If you are planning to add more tests, we should invest in some kind of yaml2bc utility (like yaml2obj). Not sure if this answers your question?

Yeah, was mostly asking how firm we are about constructing those sort of test cases. Seems not every existing error is covered - so I don't want to request undue work/unreasonable standard for new work like this - but testing error paths does seem important to me.

@nikic could you add hand-crafted bitcode test coverage for these error messages/paths?

Sorry, I don't think that would be a good use of my time. Of course, if you'd like to construct invalid bitcode files yourself, I will not object to their inclusion.

Would it be possible/could you use the fuzzer to generate test cases that cover these error cases minimally, and check those in as coverage for them?

nikic mentioned this in rG82ef888fbf3a: [Bitcode] Add fuzzer for bitcode reading. Feb 7 2022, 1:24 AM

In D118694#3299687, @dblaikie wrote:

Would it be possible/could you use the fuzzer to generate test cases that cover these error cases minimally, and check those in as coverage for them?

I did a bit of work on the fuzzer side today and fixed the first half dozen issues I encountered. Unfortunately, it looks like this isn't a case where we're missing a small handful of checks -- there's assertion failures all over the place. It would take a significant amount of work to get us to a clean baseline, I can't even estimate how much.
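For readers unfamiliar with the setup, a libFuzzer harness has roughly the following shape. This is a generic sketch and not the fuzzer added in D119000: `toyRead` is a hypothetical stand-in for a bitcode reader, and only the `LLVMFuzzerTestOneInput` entry point is libFuzzer's actual interface. The fuzzer treats crashes and assertion failures as findings, while a cleanly rejected input is not one.

```cpp
#include <cassert>
#include <stddef.h>
#include <stdint.h>

// Hypothetical toy reader: each byte is a type ID into a table of four
// entries. It reports failure for out-of-range IDs rather than asserting.
bool toyRead(const uint8_t *Data, size_t Size) {
  const size_t NumTypes = 4;
  for (size_t I = 0; I != Size; ++I)
    if (static_cast<size_t>(Data[I]) >= NumTypes)
      return false; // rejected input, not a crash
  return true;
}

// libFuzzer entry point; build with clang's -fsanitize=fuzzer to drive it.
extern "C" int LLVMFuzzerTestOneInput(const uint8_t *Data, size_t Size) {
  toyRead(Data, Size); // Either outcome is fine; only crashes count as bugs.
  return 0;
}
```

A reader that asserts on invalid input would abort under such a harness, which is consistent with the many assertion failures reported above.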

nikic added a child revision: D119339: [Bitcode] Improve support for opaque-pointer bitcode upgrade. Feb 9 2022, 7:07 AM

sorry for the slow review, I think this is fine
I agree constructing invalid bitcode is probably not worth the time

llvm/lib/Bitcode/Reader/BitcodeReader.cpp
487	perhaps a comment here would be good

This revision is now accepted and ready to land. Feb 10 2022, 10:25 AM

This revision was landed with ongoing or failed builds. Feb 11 2022, 12:35 AM

Closed by commit rGc28b0b9d18ed: [Bitcode] Add partial support for opaque pointer auto-upgrade (authored by nikic).

This revision was automatically updated to reflect the committed changes.

nikic added a commit: rGc28b0b9d18ed: [Bitcode] Add partial support for opaque pointer auto-upgrade.

Revision Contents

Path	Size
llvm/lib/Bitcode/Reader/BitcodeReader.cpp	78 lines
llvm/test/Bitcode/aggregateInstructions.3.2.ll	3 lines

Diff 404880

llvm/lib/Bitcode/Reader/BitcodeReader.cpp

(477 unchanged lines hidden)	class BitcodeReader : public BitcodeReaderBase, public GVMaterializer {
// Last function offset found in the VST.		// Last function offset found in the VST.
uint64_t LastFunctionBlockBit = 0;		uint64_t LastFunctionBlockBit = 0;
bool SeenValueSymbolTable = false;		bool SeenValueSymbolTable = false;
uint64_t VSTOffset = 0;		uint64_t VSTOffset = 0;

std::vector<std::string> SectionTable;		std::vector<std::string> SectionTable;
std::vector<std::string> GCTable;		std::vector<std::string> GCTable;

std::vector<Type*> TypeList;		std::vector<Type *> TypeList;
		std::vector<Type *> ElementTypeList;
DenseMap<Function *, FunctionType *> FunctionTypes;		DenseMap<Function *, FunctionType *> FunctionTypes;
BitcodeReaderValueList ValueList;		BitcodeReaderValueList ValueList;
Optional<MetadataLoader> MDLoader;		Optional<MetadataLoader> MDLoader;
std::vector<Comdat *> ComdatList;		std::vector<Comdat *> ComdatList;
DenseSet<GlobalObject *> ImplicitComdatObjects;		DenseSet<GlobalObject *> ImplicitComdatObjects;
SmallVector<Instruction *, 64> InstructionList;		SmallVector<Instruction *, 64> InstructionList;

std::vector<std::pair<GlobalVariable *, unsigned>> GlobalInits;		std::vector<std::pair<GlobalVariable *, unsigned>> GlobalInits;
(91 unchanged lines hidden)	public:
void setStripDebugInfo() override;		void setStripDebugInfo() override;

private:		private:
std::vector<StructType *> IdentifiedStructTypes;		std::vector<StructType *> IdentifiedStructTypes;
StructType *createIdentifiedStructType(LLVMContext &Context, StringRef Name);		StructType *createIdentifiedStructType(LLVMContext &Context, StringRef Name);
StructType *createIdentifiedStructType(LLVMContext &Context);		StructType *createIdentifiedStructType(LLVMContext &Context);

Type *getTypeByID(unsigned ID);		Type *getTypeByID(unsigned ID);
		Type *getElementTypeByID(unsigned ID);

Value *getFnValueByID(unsigned ID, Type *Ty) {		Value *getFnValueByID(unsigned ID, Type *Ty) {
if (Ty && Ty->isMetadataTy())		if (Ty && Ty->isMetadataTy())
return MetadataAsValue::get(Ty->getContext(), getFnMetadataByID(ID));		return MetadataAsValue::get(Ty->getContext(), getFnMetadataByID(ID));
return ValueList.getValueFwdRef(ID, Ty);		return ValueList.getValueFwdRef(ID, Ty);
}		}

Metadata *getFnMetadataByID(unsigned ID) {		Metadata *getFnMetadataByID(unsigned ID) {
(569 unchanged lines hidden)	Type *BitcodeReader::getTypeByID(unsigned ID) {
if (Type *Ty = TypeList[ID])		if (Type *Ty = TypeList[ID])
return Ty;		return Ty;

// If we have a forward reference, the only possible case is when it is to a		// If we have a forward reference, the only possible case is when it is to a
// named struct. Just create a placeholder for now.		// named struct. Just create a placeholder for now.
return TypeList[ID] = createIdentifiedStructType(Context);		return TypeList[ID] = createIdentifiedStructType(Context);
}		}

		Type *BitcodeReader::getElementTypeByID(unsigned ID) {
		if (ID >= TypeList.size())
		return nullptr;

		if (Context.supportsTypedPointers()) {
		Type *Ty = TypeList[ID];
		if (Ty->isPointerTy())
		return Ty->getNonOpaquePointerElementType();
		return nullptr;
		}

		return ID < ElementTypeList.size() ? ElementTypeList[ID] : nullptr;
		}

StructType *BitcodeReader::createIdentifiedStructType(LLVMContext &Context,		StructType *BitcodeReader::createIdentifiedStructType(LLVMContext &Context,
StringRef Name) {		StringRef Name) {
auto *Ret = StructType::create(Context, Name);		auto *Ret = StructType::create(Context, Name);
IdentifiedStructTypes.push_back(Ret);		IdentifiedStructTypes.push_back(Ret);
return Ret;		return Ret;
}		}

StructType *BitcodeReader::createIdentifiedStructType(LLVMContext &Context) {		StructType *BitcodeReader::createIdentifiedStructType(LLVMContext &Context) {
(516 unchanged lines hidden)	while (true) {
case BitstreamEntry::Record:		case BitstreamEntry::Record:
// The interesting case.		// The interesting case.
break;		break;
}		}

// Read a record.		// Read a record.
Record.clear();		Record.clear();
Type *ResultTy = nullptr;		Type *ResultTy = nullptr;
		Type *ElemTy = nullptr;
Expected<unsigned> MaybeRecord = Stream.readRecord(Entry.ID, Record);		Expected<unsigned> MaybeRecord = Stream.readRecord(Entry.ID, Record);
if (!MaybeRecord)		if (!MaybeRecord)
return MaybeRecord.takeError();		return MaybeRecord.takeError();
switch (MaybeRecord.get()) {		switch (MaybeRecord.get()) {
default:		default:
return error("Invalid value");		return error("Invalid value");
case bitc::TYPE_CODE_NUMENTRY: // TYPE_CODE_NUMENTRY: [numentries]		case bitc::TYPE_CODE_NUMENTRY: // TYPE_CODE_NUMENTRY: [numentries]
// TYPE_CODE_NUMENTRY contains a count of the number of types in the		// TYPE_CODE_NUMENTRY contains a count of the number of types in the
(58 unchanged lines hidden)	case bitc::TYPE_CODE_POINTER: { // POINTER: [pointee type] or
return error("Invalid record");		return error("Invalid record");
unsigned AddressSpace = 0;		unsigned AddressSpace = 0;
if (Record.size() == 2)		if (Record.size() == 2)
AddressSpace = Record[1];		AddressSpace = Record[1];
ResultTy = getTypeByID(Record[0]);		ResultTy = getTypeByID(Record[0]);
if (!ResultTy ||		if (!ResultTy ||
!PointerType::isValidElementType(ResultTy))		!PointerType::isValidElementType(ResultTy))
return error("Invalid type");		return error("Invalid type");
		ElemTy = ResultTy;
ResultTy = PointerType::get(ResultTy, AddressSpace);		ResultTy = PointerType::get(ResultTy, AddressSpace);
break;		break;
}		}
case bitc::TYPE_CODE_OPAQUE_POINTER: { // OPAQUE_POINTER: [addrspace]		case bitc::TYPE_CODE_OPAQUE_POINTER: { // OPAQUE_POINTER: [addrspace]
if (Record.size() != 1)		if (Record.size() != 1)
return error("Invalid record");		return error("Invalid record");
if (Context.supportsTypedPointers())		if (Context.supportsTypedPointers())
return error(		return error(
(134 unchanged lines hidden)	while (true) {
}		}

if (NumRecords >= TypeList.size())		if (NumRecords >= TypeList.size())
return error("Invalid TYPE table");		return error("Invalid TYPE table");
if (TypeList[NumRecords])		if (TypeList[NumRecords])
return error(		return error(
"Invalid TYPE table: Only named structs can be forward referenced");		"Invalid TYPE table: Only named structs can be forward referenced");
assert(ResultTy && "Didn't read a type?");		assert(ResultTy && "Didn't read a type?");
TypeList[NumRecords++] = ResultTy;		TypeList[NumRecords] = ResultTy;
		if (ElemTy && !Context.supportsTypedPointers()) {
		// We encountered a typed pointer in opaque pointers mode, and may need
		// to auto-upgrade related IR. Remember the element type.
		if (ElementTypeList.empty())
		ElementTypeList.resize(TypeList.size());
		ElementTypeList[NumRecords] = ElemTy;
		}
		++NumRecords;
}		}
}		}

Error BitcodeReader::parseOperandBundleTags() {		Error BitcodeReader::parseOperandBundleTags() {
if (Error Err = Stream.EnterSubBlock(bitc::OPERAND_BUNDLE_TAGS_BLOCK_ID))		if (Error Err = Stream.EnterSubBlock(bitc::OPERAND_BUNDLE_TAGS_BLOCK_ID))
return Err;		return Err;

if (!BundleTags.empty())		if (!BundleTags.empty())
(394 unchanged lines hidden)
Error BitcodeReader::parseConstants() {		Error BitcodeReader::parseConstants() {
if (Error Err = Stream.EnterSubBlock(bitc::CONSTANTS_BLOCK_ID))		if (Error Err = Stream.EnterSubBlock(bitc::CONSTANTS_BLOCK_ID))
return Err;		return Err;

SmallVector<uint64_t, 64> Record;		SmallVector<uint64_t, 64> Record;

// Read all the records for this value table.		// Read all the records for this value table.
Type *CurTy = Type::getInt32Ty(Context);		Type *CurTy = Type::getInt32Ty(Context);
		Type *CurElemTy = nullptr;
unsigned NextCstNo = ValueList.size();		unsigned NextCstNo = ValueList.size();

struct DelayedShufTy {		struct DelayedShufTy {
VectorType *OpTy;		VectorType *OpTy;
VectorType *RTy;		VectorType *RTy;
uint64_t Op0Idx;		uint64_t Op0Idx;
uint64_t Op1Idx;		uint64_t Op1Idx;
uint64_t Op2Idx;		uint64_t Op2Idx;
(95 unchanged lines hidden)	while (true) {
case bitc::CST_CODE_SETTYPE: // SETTYPE: [typeid]		case bitc::CST_CODE_SETTYPE: // SETTYPE: [typeid]
if (Record.empty())		if (Record.empty())
return error("Invalid record");		return error("Invalid record");
if (Record[0] >= TypeList.size() || !TypeList[Record[0]])		if (Record[0] >= TypeList.size() || !TypeList[Record[0]])
return error("Invalid record");		return error("Invalid record");
if (TypeList[Record[0]] == VoidType)		if (TypeList[Record[0]] == VoidType)
return error("Invalid constant type");		return error("Invalid constant type");
CurTy = TypeList[Record[0]];		CurTy = TypeList[Record[0]];
		CurElemTy = getElementTypeByID(Record[0]);
continue; // Skip the ValueList manipulation.		continue; // Skip the ValueList manipulation.
case bitc::CST_CODE_NULL: // NULL		case bitc::CST_CODE_NULL: // NULL
if (CurTy->isVoidTy() || CurTy->isFunctionTy() || CurTy->isLabelTy())		if (CurTy->isVoidTy() || CurTy->isFunctionTy() || CurTy->isLabelTy())
return error("Invalid type for a constant null value");		return error("Invalid type for a constant null value");
V = Constant::getNullValue(CurTy);		V = Constant::getNullValue(CurTy);
break;		break;
case bitc::CST_CODE_INTEGER: // INTEGER: [intval]		case bitc::CST_CODE_INTEGER: // INTEGER: [intval]
if (!CurTy->isIntegerTy() || Record.empty())		if (!CurTy->isIntegerTy() || Record.empty())
(354 unchanged lines hidden)	case bitc::CST_CODE_INLINEASM_OLD: {
if (3+AsmStrSize+ConstStrSize > Record.size())		if (3+AsmStrSize+ConstStrSize > Record.size())
return error("Invalid record");		return error("Invalid record");

for (unsigned i = 0; i != AsmStrSize; ++i)		for (unsigned i = 0; i != AsmStrSize; ++i)
AsmStr += (char)Record[2+i];		AsmStr += (char)Record[2+i];
for (unsigned i = 0; i != ConstStrSize; ++i)		for (unsigned i = 0; i != ConstStrSize; ++i)
ConstrStr += (char)Record[3+AsmStrSize+i];		ConstrStr += (char)Record[3+AsmStrSize+i];
UpgradeInlineAsmString(&AsmStr);		UpgradeInlineAsmString(&AsmStr);
// FIXME: support upgrading in opaque pointers mode.		if (!CurElemTy)
V = InlineAsm::get(cast<FunctionType>(CurTy->getPointerElementType()),		return error("Missing element type for old-style inlineasm");
AsmStr, ConstrStr, HasSideEffects, IsAlignStack);		V = InlineAsm::get(cast<FunctionType>(CurElemTy), AsmStr, ConstrStr,
		HasSideEffects, IsAlignStack);
break;		break;
}		}
// This version adds support for the asm dialect keywords (e.g.,		// This version adds support for the asm dialect keywords (e.g.,
// inteldialect).		// inteldialect).
case bitc::CST_CODE_INLINEASM_OLD2: {		case bitc::CST_CODE_INLINEASM_OLD2: {
if (Record.size() < 2)		if (Record.size() < 2)
return error("Invalid record");		return error("Invalid record");
std::string AsmStr, ConstrStr;		std::string AsmStr, ConstrStr;
bool HasSideEffects = Record[0] & 1;		bool HasSideEffects = Record[0] & 1;
bool IsAlignStack = (Record[0] >> 1) & 1;		bool IsAlignStack = (Record[0] >> 1) & 1;
unsigned AsmDialect = Record[0] >> 2;		unsigned AsmDialect = Record[0] >> 2;
unsigned AsmStrSize = Record[1];		unsigned AsmStrSize = Record[1];
if (2+AsmStrSize >= Record.size())		if (2+AsmStrSize >= Record.size())
return error("Invalid record");		return error("Invalid record");
unsigned ConstStrSize = Record[2+AsmStrSize];		unsigned ConstStrSize = Record[2+AsmStrSize];
if (3+AsmStrSize+ConstStrSize > Record.size())		if (3+AsmStrSize+ConstStrSize > Record.size())
return error("Invalid record");		return error("Invalid record");

for (unsigned i = 0; i != AsmStrSize; ++i)		for (unsigned i = 0; i != AsmStrSize; ++i)
AsmStr += (char)Record[2+i];		AsmStr += (char)Record[2+i];
for (unsigned i = 0; i != ConstStrSize; ++i)		for (unsigned i = 0; i != ConstStrSize; ++i)
ConstrStr += (char)Record[3+AsmStrSize+i];		ConstrStr += (char)Record[3+AsmStrSize+i];
UpgradeInlineAsmString(&AsmStr);		UpgradeInlineAsmString(&AsmStr);
// FIXME: support upgrading in opaque pointers mode.		if (!CurElemTy)
V = InlineAsm::get(cast<FunctionType>(CurTy->getPointerElementType()),		return error("Missing element type for old-style inlineasm");
AsmStr, ConstrStr, HasSideEffects, IsAlignStack,		V = InlineAsm::get(cast<FunctionType>(CurElemTy), AsmStr, ConstrStr,
		HasSideEffects, IsAlignStack,
InlineAsm::AsmDialect(AsmDialect));		InlineAsm::AsmDialect(AsmDialect));
break;		break;
}		}
// This version adds support for the unwind keyword.		// This version adds support for the unwind keyword.
case bitc::CST_CODE_INLINEASM_OLD3: {		case bitc::CST_CODE_INLINEASM_OLD3: {
if (Record.size() < 2)		if (Record.size() < 2)
return error("Invalid record");		return error("Invalid record");
unsigned OpNum = 0;		unsigned OpNum = 0;
(12 unchanged lines hidden)	case bitc::CST_CODE_INLINEASM_OLD3: {
return error("Invalid record");		return error("Invalid record");

for (unsigned i = 0; i != AsmStrSize; ++i)		for (unsigned i = 0; i != AsmStrSize; ++i)
AsmStr += (char)Record[OpNum + i];		AsmStr += (char)Record[OpNum + i];
++OpNum;		++OpNum;
for (unsigned i = 0; i != ConstStrSize; ++i)		for (unsigned i = 0; i != ConstStrSize; ++i)
ConstrStr += (char)Record[OpNum + AsmStrSize + i];		ConstrStr += (char)Record[OpNum + AsmStrSize + i];
UpgradeInlineAsmString(&AsmStr);		UpgradeInlineAsmString(&AsmStr);
// FIXME: support upgrading in opaque pointers mode.		if (!CurElemTy)
V = InlineAsm::get(cast<FunctionType>(CurTy->getPointerElementType()),		return error("Missing element type for old-style inlineasm");
AsmStr, ConstrStr, HasSideEffects, IsAlignStack,		V = InlineAsm::get(cast<FunctionType>(CurElemTy), AsmStr, ConstrStr,
		HasSideEffects, IsAlignStack,
InlineAsm::AsmDialect(AsmDialect), CanThrow);		InlineAsm::AsmDialect(AsmDialect), CanThrow);
break;		break;
}		}
// This version adds explicit function type.		// This version adds explicit function type.
case bitc::CST_CODE_INLINEASM: {		case bitc::CST_CODE_INLINEASM: {
if (Record.size() < 3)		if (Record.size() < 3)
return error("Invalid record");		return error("Invalid record");
unsigned OpNum = 0;		unsigned OpNum = 0;
(379 unchanged lines hidden)	Error BitcodeReader::parseGlobalVarRecord(ArrayRef<uint64_t> Record) {
bool explicitType = Record[1] & 2;		bool explicitType = Record[1] & 2;
unsigned AddressSpace;		unsigned AddressSpace;
if (explicitType) {		if (explicitType) {
AddressSpace = Record[1] >> 2;		AddressSpace = Record[1] >> 2;
} else {		} else {
if (!Ty->isPointerTy())		if (!Ty->isPointerTy())
return error("Invalid type for value");		return error("Invalid type for value");
AddressSpace = cast<PointerType>(Ty)->getAddressSpace();		AddressSpace = cast<PointerType>(Ty)->getAddressSpace();
Ty = Ty->getPointerElementType();		Ty = getElementTypeByID(Record[0]);
		if (!Ty)
		return error("Missing element type for old-style global");
}		}

uint64_t RawLinkage = Record[3];		uint64_t RawLinkage = Record[3];
GlobalValue::LinkageTypes Linkage = getDecodedLinkage(RawLinkage);		GlobalValue::LinkageTypes Linkage = getDecodedLinkage(RawLinkage);
MaybeAlign Alignment;		MaybeAlign Alignment;
if (Error Err = parseAlignmentValue(Record[4], Alignment))		if (Error Err = parseAlignmentValue(Record[4], Alignment))
return Err;		return Err;
std::string Section;		std::string Section;
(75 unchanged lines hidden)	Error BitcodeReader::parseFunctionRecord(ArrayRef<uint64_t> Record) {
StringRef Name;		StringRef Name;
std::tie(Name, Record) = readNameFromStrtab(Record);		std::tie(Name, Record) = readNameFromStrtab(Record);

if (Record.size() < 8)		if (Record.size() < 8)
return error("Invalid record");		return error("Invalid record");
Type *FTy = getTypeByID(Record[0]);		Type *FTy = getTypeByID(Record[0]);
if (!FTy)		if (!FTy)
return error("Invalid record");		return error("Invalid record");
if (auto *PTy = dyn_cast<PointerType>(FTy))		if (isa<PointerType>(FTy)) {
FTy = PTy->getPointerElementType();		FTy = getElementTypeByID(Record[0]);
		if (!FTy)
		return error("Missing element type for old-style function");
		}

if (!isa<FunctionType>(FTy))		if (!isa<FunctionType>(FTy))
return error("Invalid type for value");		return error("Invalid type for value");
auto CC = static_cast<CallingConv::ID>(Record[1]);		auto CC = static_cast<CallingConv::ID>(Record[1]);
if (CC & ~CallingConv::MaxID)		if (CC & ~CallingConv::MaxID)
return error("Invalid calling convention ID");		return error("Invalid calling convention ID");

unsigned AddrSpace = TheModule->getDataLayout().getProgramAddressSpace();		unsigned AddrSpace = TheModule->getDataLayout().getProgramAddressSpace();
(138 unchanged lines hidden)	Error BitcodeReader::parseGlobalIndirectSymbolRecord(
// v2: [strtab_offset, strtab_size, v1]		// v2: [strtab_offset, strtab_size, v1]
StringRef Name;		StringRef Name;
std::tie(Name, Record) = readNameFromStrtab(Record);		std::tie(Name, Record) = readNameFromStrtab(Record);

bool NewRecord = BitCode != bitc::MODULE_CODE_ALIAS_OLD;		bool NewRecord = BitCode != bitc::MODULE_CODE_ALIAS_OLD;
if (Record.size() < (3 + (unsigned)NewRecord))		if (Record.size() < (3 + (unsigned)NewRecord))
return error("Invalid record");		return error("Invalid record");
unsigned OpNum = 0;		unsigned OpNum = 0;
Type *Ty = getTypeByID(Record[OpNum++]);		unsigned TypeID = Record[OpNum++];
		Type *Ty = getTypeByID(TypeID);
if (!Ty)		if (!Ty)
return error("Invalid record");		return error("Invalid record");

unsigned AddrSpace;		unsigned AddrSpace;
if (!NewRecord) {		if (!NewRecord) {
auto *PTy = dyn_cast<PointerType>(Ty);		auto *PTy = dyn_cast<PointerType>(Ty);
if (!PTy)		if (!PTy)
return error("Invalid type for value");		return error("Invalid type for value");
Ty = PTy->getPointerElementType();
AddrSpace = PTy->getAddressSpace();		AddrSpace = PTy->getAddressSpace();
		Ty = getElementTypeByID(TypeID);
		if (!Ty)
		return error("Missing element type for old-style indirect symbol");
} else {		} else {
AddrSpace = Record[OpNum++];		AddrSpace = Record[OpNum++];
}		}

auto Val = Record[OpNum++];		auto Val = Record[OpNum++];
auto Linkage = Record[OpNum++];		auto Linkage = Record[OpNum++];
GlobalValue *NewGA;		GlobalValue *NewGA;
if (BitCode == bitc::MODULE_CODE_ALIAS ||		if (BitCode == bitc::MODULE_CODE_ALIAS ||
(1,439 unchanged lines hidden)	case bitc::FUNC_CODE_INST_ALLOCA: { // ALLOCA: [instty, opty, op, align]
if (Record.size() != 4)		if (Record.size() != 4)
return error("Invalid record");		return error("Invalid record");
using APV = AllocaPackedValues;		using APV = AllocaPackedValues;
const uint64_t Rec = Record[3];		const uint64_t Rec = Record[3];
const bool InAlloca = Bitfield::get<APV::UsedWithInAlloca>(Rec);		const bool InAlloca = Bitfield::get<APV::UsedWithInAlloca>(Rec);
const bool SwiftError = Bitfield::get<APV::SwiftError>(Rec);		const bool SwiftError = Bitfield::get<APV::SwiftError>(Rec);
Type *Ty = getTypeByID(Record[0]);		Type *Ty = getTypeByID(Record[0]);
if (!Bitfield::get<APV::ExplicitType>(Rec)) {		if (!Bitfield::get<APV::ExplicitType>(Rec)) {
auto *PTy = dyn_cast_or_null<PointerType>(Ty);		Ty = getElementTypeByID(Record[0]);
if (!PTy)		if (!Ty)
return error("Old-style alloca with a non-pointer type");		return error("Missing element type for old-style alloca");
Ty = PTy->getPointerElementType();
}		}
Type *OpTy = getTypeByID(Record[1]);		Type *OpTy = getTypeByID(Record[1]);
Value *Size = getFnValueByID(Record[2], OpTy);		Value *Size = getFnValueByID(Record[2], OpTy);
MaybeAlign Align;		MaybeAlign Align;
uint64_t AlignExp =		uint64_t AlignExp =
Bitfield::get<APV::AlignLower>(Rec) |		Bitfield::get<APV::AlignLower>(Rec) |
(Bitfield::get<APV::AlignUpper>(Rec) << APV::AlignLower::Bits);		(Bitfield::get<APV::AlignUpper>(Rec) << APV::AlignLower::Bits);
if (Error Err = parseAlignmentValue(AlignExp, Align)) {		if (Error Err = parseAlignmentValue(AlignExp, Align)) {
(2,137 unchanged lines hidden)

llvm/test/Bitcode/aggregateInstructions.3.2.ll

	; RUN: llvm-dis < %s.bc| FileCheck %s			; RUN: llvm-dis -opaque-pointers=0 < %s.bc| FileCheck %s
				; RUN: llvm-dis -opaque-pointers=1 < %s.bc| FileCheck %s
	; RUN: verify-uselistorder < %s.bc			; RUN: verify-uselistorder < %s.bc

	; aggregateOperations.3.2.ll.bc was generated by passing this file to llvm-as-3.2.			; aggregateOperations.3.2.ll.bc was generated by passing this file to llvm-as-3.2.
	; The test checks that LLVM does not misread instructions with aggregate operands			; The test checks that LLVM does not misread instructions with aggregate operands
	; in older bitcode files.			; in older bitcode files.

	define void @extractvalue([4 x i8] %x1, [4 x [4 x i8]] %x2, {{i32, float}} %x3){			define void @extractvalue([4 x i8] %x1, [4 x [4 x i8]] %x2, {{i32, float}} %x3){
	entry:			entry:
	(25 unchanged lines hidden)