This is an archive of the discontinued LLVM Phabricator instance.

[clang][dataflow] Exclude protobuf types from modeling in the environment.
Abandoned · Public

Authored by ymandel on Apr 4 2022, 7:22 AM.

Details

Reviewers

xazax.hun
sgatev
NoQ

Summary

Google's protobufs are often quite large and their internal state is never worth
modeling in the environment. Exclude them during value construction.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

ymandel created this revision. Apr 4 2022, 7:22 AM

Herald added a project: Restricted Project. Apr 4 2022, 7:22 AM

Herald added subscribers: tschuett, steakhal, rnkovacs.

ymandel requested review of this revision. Apr 4 2022, 7:22 AM

Herald added a project: Restricted Project. Apr 4 2022, 7:22 AM

Harbormaster completed remote builds in B157728: Diff 420174. Apr 4 2022, 7:59 AM

xazax.hun added inline comments. Apr 4 2022, 3:02 PM

clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp
635	Not sure how often this is invoked, but we could reduce the number of string comparisons by caching the identifier pointer and doing a pointer comparison.

ymandel marked an inline comment as done. Apr 5 2022, 5:57 AM

ymandel added inline comments.

clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp
635	Good question. It means an extra comparison for each type until the pointer is cached (to check if the cache is set) and then, afterwards, 2 comparisons vs ~10 for the common case where the class name doesn't match. In the matching case, though, it clearly saves much more. For proto-heavy code it seems a win, and a loss otherwise. But the question is where to put the cache. It seems best to me to move this to be a method on DataflowAnalysisContext (since it is a global property, not a local environment property) and make the cached pointer a private member of DataflowAnalysisContext. Thoughts?
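A minimal sketch of the trade-off described in this comment, using toy types rather than Clang's: `ToyRecord`, `ToyContext`, and `slowPathChecks` are hypothetical names, and the two-string check is a simplification of the namespace walk in the patch. Until the base type is cached, every candidate with a base pays for the string comparisons; afterwards a single pointer comparison decides the result.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a record type; not a Clang class.
struct ToyRecord {
  std::string Namespace; // e.g. "google::protobuf" (flattened for brevity)
  std::string Name;      // e.g. "Message"
  const ToyRecord *Base; // the single direct base, or nullptr
};

// Hypothetical analysis-wide context that owns the cache.
class ToyContext {
public:
  // Returns true if `R` derives directly from a protobuf `Message` base.
  bool isExcluded(const ToyRecord &R) {
    const ToyRecord *Base = R.Base;
    if (Base == nullptr)
      return false;

    // Fast path: once a base is cached, one pointer comparison suffices.
    if (CachedMessageBase != nullptr)
      return Base == CachedMessageBase;

    // Slow path: string comparisons, paid on every call until the first match.
    ++SlowPathChecks;
    if (Base->Name != "Message")
      return false;
    if (Base->Namespace != "proto2" && Base->Namespace != "google::protobuf")
      return false;

    assert(CachedMessageBase == nullptr && "slow path implies an empty cache");
    CachedMessageBase = Base;
    return true;
  }

  unsigned slowPathChecks() const { return SlowPathChecks; }

private:
  const ToyRecord *CachedMessageBase = nullptr;
  unsigned SlowPathChecks = 0;
};
```

In this sketch only the first matching base is cached, so a second, distinct `Message` base would not be recognized once the cache is set.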

xazax.hun added inline comments. Apr 5 2022, 9:01 AM

clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp
635	I think it might be nice to have a context that is scoped to the translation unit rather than to a function. We might have other things that we want to cache there. An example of how the static analyzer deals with this: https://github.com/llvm/llvm-project/blob/main/clang/lib/StaticAnalyzer/Checkers/StdLibraryFunctionsChecker.cpp#L960 If we do not want to be lazy and are willing to populate the cache eagerly at the beginning of the analysis, we would not pay for the extra checks to see whether the cache is populated.
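One way to read the eager variant suggested here, again as a toy sketch: `ToyIdentifierTable`, `ToyDecl`, and `ToyTUContext` are hypothetical stand-ins (the interning table only loosely imitates an identifier table), and the namespace check is flattened to a single level. The identifiers of interest are interned once when the translation-unit-scoped context is built, so each later query is a few pointer comparisons with no "is the cache populated?" branch.

```cpp
#include <cassert>
#include <string>
#include <unordered_set>

// Hypothetical interning table: equal strings map to one stable address.
// Pointers to std::unordered_set elements stay valid across rehashing.
class ToyIdentifierTable {
public:
  const std::string *get(const std::string &Name) {
    return &*Pool.insert(Name).first;
  }

private:
  std::unordered_set<std::string> Pool;
};

// Hypothetical declaration: interned name plus interned enclosing namespace.
struct ToyDecl {
  const std::string *Name;
  const std::string *Namespace;
};

// Hypothetical translation-unit-scoped context, populated eagerly.
class ToyTUContext {
public:
  explicit ToyTUContext(ToyIdentifierTable &Table)
      : MessageII(Table.get("Message")), ProtobufII(Table.get("protobuf")),
        Proto2II(Table.get("proto2")) {
    assert(MessageII && ProtobufII && Proto2II);
  }

  // Pointer comparisons only; no string comparison and no cache-state check.
  bool isProtobufMessageBase(const ToyDecl &D) const {
    return D.Name == MessageII &&
           (D.Namespace == ProtobufII || D.Namespace == Proto2II);
  }

private:
  const std::string *MessageII;
  const std::string *ProtobufII;
  const std::string *Proto2II;
};
```

The cost moves to construction time: the lookups happen even for translation units that never mention a protobuf type.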

xazax.hun added inline comments. Apr 5 2022, 9:03 AM

clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp
635	I'm also fine with not doing caching for now and having a note to consider this in the future if it becomes measurable in the profiles. It is probably fine for now when we only have a couple of short strings to compare. But I guess the number of special cases we handle this way will only increase in the future.

added FIXME

Harbormaster completed remote builds in B162228: Diff 426388. May 2 2022, 7:06 AM

ymandel marked 2 inline comments as done. May 6 2022, 6:32 AM

ymandel added inline comments.

clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp
635	> I'm also fine with not doing caching for now and having a note to consider this in the future if it becomes measurable in the profiles. It is probably fine for now when we only have a couple of short strings to compare. But I guess the number of special cases we handle this way will only increase in the future.

(Sorry for the very long delay in responding.) I think holding off on the cache may be best for now, pending performance analysis. I'm hesitant to put too much effort into optimizing this, because ultimately we want to move to a lazy-initialization model, which would obviate the need for this kind of optimization: we would only model the fields that are used, making the size of the underlying struct irrelevant. I've added a FIXME to this effect.
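The lazy-initialization idea can be illustrated with a small sketch. Everything here is hypothetical (`ToyStruct`, `createStructValue`, integer value IDs) and is not the dataflow framework's API: values are created only for fields that the analyzed function references, so the number of values created depends on usage rather than on the size of the struct.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical description of a struct type: its name and field names.
struct ToyStruct {
  std::string Name;
  std::vector<std::string> Fields;
};

// Creates a "value" (here just a fresh integer ID) for each field of `S` that
// appears in `ReferencedFields`; unreferenced fields are left unmodeled.
std::map<std::string, int>
createStructValue(const ToyStruct &S,
                  const std::set<std::string> &ReferencedFields,
                  int &NextValueId) {
  std::map<std::string, int> FieldValues;
  for (const std::string &Field : S.Fields) {
    if (ReferencedFields.count(Field) == 0)
      continue; // Never accessed in the analyzed function: skip it.
    FieldValues[Field] = NextValueId++;
  }
  // At most one value per referenced field, however large the struct is.
  assert(FieldValues.size() <= ReferencedFields.size());
  return FieldValues;
}
```

Under this model a large generated type costs nothing unless the function touches its fields, which is why a type-specific exclusion list would no longer be needed.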

Ping... Gabor, Stanislav: what are your thoughts on this patch? I think we're in a place where I could do performance comparisons before/after if you think that's justified.

Alternatively, I have a relatively simple proposal for lazy initialization that I expect I will implement in the next couple of months. If you're sufficiently concerned with this patch, waiting for that patch is an option as well.

Since there are many moving pieces and the whole framework is experimental without many users, I'm fine with landing this as is, as long as we are tracking the improvement opportunities (tickets or FIXMEs).

This revision is now accepted and ready to land. Jun 13 2022, 8:35 AM

sgatev accepted this revision. Jun 13 2022, 9:37 AM

Reviving this patch, addressing some of the earlier concerns. However, I may have a better solution which makes this patch irrelevant. So, still WIP.

Herald added a reviewer: NoQ. Dec 22 2022, 7:10 AM

Herald added a subscriber: martong.

Harbormaster completed remote builds in B204576: Diff 484830. Dec 22 2022, 7:59 AM

ymandel mentioned this in D140694: [clang][dataflow] Only model struct fields that are used in the function being analyzed. Dec 27 2022, 7:40 AM

ymandel mentioned this in rG5e8f597c2fed: [clang][dataflow] Only model struct fields that are used in the function being…. Jan 5 2023, 1:47 PM

Abandoning in favor of https://reviews.llvm.org/D140694.

ymandel mentioned this in rG01ccf7b3cee5: Revert "Revert "[clang][dataflow] Only model struct fields that are used in the…. Jan 9 2023, 11:32 AM

Revision Contents

Path	Size
clang/include/clang/Analysis/FlowSensitive/DataflowAnalysisContext.h	16 lines
clang/lib/Analysis/FlowSensitive/DataflowAnalysisContext.cpp	60 lines
clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp	3 lines
clang/unittests/Analysis/FlowSensitive/DataflowEnvironmentTest.cpp	116 lines

Diff 484830

clang/include/clang/Analysis/FlowSensitive/DataflowAnalysisContext.h

(48 lines not shown)

/// Returns the set of all fields in the type.		/// Returns the set of all fields in the type.
llvm::DenseSet<const FieldDecl *> getObjectFields(QualType Type);		llvm::DenseSet<const FieldDecl *> getObjectFields(QualType Type);

/// Owns objects that encompass the state of a program and stores context that		/// Owns objects that encompass the state of a program and stores context that
/// is used during dataflow analysis.		/// is used during dataflow analysis.
class DataflowAnalysisContext {		class DataflowAnalysisContext {
public:		public:
		struct Options {
		// FIXME: add comments and change to std::vector<string>
		bool ExcludeGoogleProtobufs;
		};

/// Constructs a dataflow analysis context.		/// Constructs a dataflow analysis context.
///		///
/// Requirements:		/// Requirements:
///		///
/// `S` must not be null.		/// `S` must not be null.
DataflowAnalysisContext(std::unique_ptr<Solver> S)		DataflowAnalysisContext(std::unique_ptr<Solver> S,
		Options Opts = {/*ExcludeGoogleProtobufs=*/false})
: S(std::move(S)), TrueVal(createAtomicBoolValue()),		: S(std::move(S)), TrueVal(createAtomicBoolValue()),
FalseVal(createAtomicBoolValue()) {		FalseVal(createAtomicBoolValue()), Options(Opts) {
assert(this->S != nullptr);		assert(this->S != nullptr);
}		}

/// Takes ownership of `Loc` and returns a reference to it.		/// Takes ownership of `Loc` and returns a reference to it.
///		///
/// Requirements:		/// Requirements:
///		///
/// `Loc` must not be null.		/// `Loc` must not be null.
(175 lines not shown)	public:
bool equivalentBoolValues(BoolValue &Val1, BoolValue &Val2);		bool equivalentBoolValues(BoolValue &Val1, BoolValue &Val2);

LLVM_DUMP_METHOD void dumpFlowCondition(AtomicBoolValue &Token);		LLVM_DUMP_METHOD void dumpFlowCondition(AtomicBoolValue &Token);

/// Returns the `ControlFlowContext` registered for `F`, if any. Otherwise,		/// Returns the `ControlFlowContext` registered for `F`, if any. Otherwise,
/// returns null.		/// returns null.
const ControlFlowContext *getControlFlowContext(const FunctionDecl *F);		const ControlFlowContext *getControlFlowContext(const FunctionDecl *F);

		bool isExcludedRecordType(const RecordType &Ty);

private:		private:
struct NullableQualTypeDenseMapInfo : private llvm::DenseMapInfo<QualType> {		struct NullableQualTypeDenseMapInfo : private llvm::DenseMapInfo<QualType> {
static QualType getEmptyKey() {		static QualType getEmptyKey() {
// Allow a NULL `QualType` by using a different value as the empty key.		// Allow a NULL `QualType` by using a different value as the empty key.
return QualType::getFromOpaquePtr(reinterpret_cast<Type *>(1));		return QualType::getFromOpaquePtr(reinterpret_cast<Type *>(1));
}		}

using DenseMapInfo::getHashValue;		using DenseMapInfo::getHashValue;
(61 lines not shown)	private:
// creating a type-independent `NullPointerValue` without a `PointeeLoc`		// creating a type-independent `NullPointerValue` without a `PointeeLoc`
// field.		// field.
llvm::DenseMap<QualType, PointerValue *, NullableQualTypeDenseMapInfo>		llvm::DenseMap<QualType, PointerValue *, NullableQualTypeDenseMapInfo>
NullPointerVals;		NullPointerVals;

AtomicBoolValue &TrueVal;		AtomicBoolValue &TrueVal;
AtomicBoolValue &FalseVal;		AtomicBoolValue &FalseVal;

		Options Options;

// Indices that are used to avoid recreating the same composite boolean		// Indices that are used to avoid recreating the same composite boolean
// values.		// values.
llvm::DenseMap<std::pair<BoolValue *, BoolValue *>, ConjunctionValue *>		llvm::DenseMap<std::pair<BoolValue *, BoolValue *>, ConjunctionValue *>
ConjunctionVals;		ConjunctionVals;
llvm::DenseMap<std::pair<BoolValue *, BoolValue *>, DisjunctionValue *>		llvm::DenseMap<std::pair<BoolValue *, BoolValue *>, DisjunctionValue *>
DisjunctionVals;		DisjunctionVals;
llvm::DenseMap<BoolValue *, NegationValue *> NegationVals;		llvm::DenseMap<BoolValue *, NegationValue *> NegationVals;
llvm::DenseMap<std::pair<BoolValue *, BoolValue *>, ImplicationValue *>		llvm::DenseMap<std::pair<BoolValue *, BoolValue *>, ImplicationValue *>
(13 lines not shown)	private:
// Flow conditions depend on other flow conditions if they are created using		// Flow conditions depend on other flow conditions if they are created using
// `forkFlowCondition` or `joinFlowConditions`. The graph of flow condition		// `forkFlowCondition` or `joinFlowConditions`. The graph of flow condition
// dependencies is stored in the `FlowConditionDeps` map.		// dependencies is stored in the `FlowConditionDeps` map.
llvm::DenseMap<AtomicBoolValue *, llvm::DenseSet<AtomicBoolValue *>>		llvm::DenseMap<AtomicBoolValue *, llvm::DenseSet<AtomicBoolValue *>>
FlowConditionDeps;		FlowConditionDeps;
llvm::DenseMap<AtomicBoolValue *, BoolValue *> FlowConditionConstraints;		llvm::DenseMap<AtomicBoolValue *, BoolValue *> FlowConditionConstraints;

llvm::DenseMap<const FunctionDecl *, ControlFlowContext> FunctionContexts;		llvm::DenseMap<const FunctionDecl *, ControlFlowContext> FunctionContexts;

		const RecordType *ProtobufBaseTy = nullptr;
};		};

} // namespace dataflow		} // namespace dataflow
} // namespace clang		} // namespace clang

#endif // LLVM_CLANG_ANALYSIS_FLOWSENSITIVE_DATAFLOWANALYSISCONTEXT_H		#endif // LLVM_CLANG_ANALYSIS_FLOWSENSITIVE_DATAFLOWANALYSISCONTEXT_H

clang/lib/Analysis/FlowSensitive/DataflowAnalysisContext.cpp

(349 lines not shown)	if (Stmt *Body = F->getBody()) {
assert(CFCtx);		assert(CFCtx);
auto Result = FunctionContexts.insert({F, std::move(*CFCtx)});		auto Result = FunctionContexts.insert({F, std::move(*CFCtx)});
return &Result.first->second;		return &Result.first->second;
}		}

return nullptr;		return nullptr;
}		}

		// Hard code certain types to exclude from modeling. Currently, we limit to
		// Google protobufs, since they can be very large, and have no value in
		// modeling.
		//
		// FIXME: Remove this specialized exclusion once a more general mechanism is
		// implemented for only modeling accessed fields. Otherwise, consider memoizing
		// this function or caching some of the information on which it relies (like the
		// protobuf `Message` base type) at the TU level.
		bool DataflowAnalysisContext::isExcludedRecordType(const RecordType &Ty) {
		if (!Options.ExcludeGoogleProtobufs)
		return false;

		const auto *CD = dyn_cast<CXXRecordDecl>(Ty.getDecl());
		if (CD == nullptr || !CD->hasDefinition() || CD->getNumBases() != 1)
		return false;

		QualType BQ = CD->bases_begin()->getType();
		if (BQ.isNull())
		return false;

		const RecordType *BaseTy = BQ->getAs<RecordType>();
		if (BaseTy == nullptr)
		return false;

		if (ProtobufBaseTy != nullptr)
		return BaseTy == ProtobufBaseTy;

		const RecordDecl *RD = BaseTy->getDecl();
		assert(RD != nullptr);
		IdentifierInfo *II = RD->getIdentifier();
		if (II == nullptr || !II->isStr("Message"))
		return false;

		const auto *ND = dyn_cast<NamespaceDecl>(RD->getDeclContext());
		if (ND == nullptr)
		return false;
		IdentifierInfo *NamespaceII = ND->getIdentifier();
		if (NamespaceII == nullptr)
		return false;
		if (NamespaceII->isStr("proto2")) {
		ProtobufBaseTy = BaseTy;
		return true;
		}

		// Check for `::google::protobuf`:
		if (!NamespaceII->isStr("protobuf"))
		return false;

		ND = dyn_cast<NamespaceDecl>(ND->getDeclContext());
		if (ND == nullptr || !ND->getParent()->isTranslationUnit())
		return false;
		NamespaceII = ND->getIdentifier();
		if (NamespaceII != nullptr && NamespaceII->isStr("google")) {
		ProtobufBaseTy = BaseTy;
		return true;
		}

		return false;
		}

} // namespace dataflow		} // namespace dataflow
} // namespace clang		} // namespace clang

using namespace clang;		using namespace clang;

const Expr &clang::dataflow::ignoreCFGOmittedNodes(const Expr &E) {		const Expr &clang::dataflow::ignoreCFGOmittedNodes(const Expr &E) {
const Expr *Current = &E;		const Expr *Current = &E;
if (auto *EWC = dyn_cast<ExprWithCleanups>(Current)) {		if (auto *EWC = dyn_cast<ExprWithCleanups>(Current)) {
(37 lines not shown)

clang/lib/Analysis/FlowSensitive/DataflowEnvironment.cpp

(626 lines not shown)	Value *Environment::createValueUnlessSelfReferential(

if (Type->isBooleanType()) {		if (Type->isBooleanType()) {
CreatedValuesCount++;		CreatedValuesCount++;
return &makeAtomicBoolValue();		return &makeAtomicBoolValue();
}		}

if (Type->isIntegerType()) {		if (Type->isIntegerType()) {
// FIXME: consider instead `return nullptr`, given that we do nothing useful		// FIXME: consider instead `return nullptr`, given that we do nothing useful
// with integers, and so distinguishing them serves no purpose, but could		// with integers, and so distinguishing them serves no purpose, but could
// prevent convergence.		// prevent convergence.
CreatedValuesCount++;		CreatedValuesCount++;
return &takeOwnership(std::make_unique<IntegerValue>());		return &takeOwnership(std::make_unique<IntegerValue>());
}		}

if (Type->isReferenceType()) {		if (Type->isReferenceType()) {
CreatedValuesCount++;		CreatedValuesCount++;
QualType PointeeType = Type->castAs<ReferenceType>()->getPointeeType();		QualType PointeeType = Type->castAs<ReferenceType>()->getPointeeType();
(23 lines not shown)	if (Visited.insert(PointeeType.getCanonicalType()).second) {

if (PointeeVal != nullptr)		if (PointeeVal != nullptr)
setValue(PointeeLoc, *PointeeVal);		setValue(PointeeLoc, *PointeeVal);
}		}

return &takeOwnership(std::make_unique<PointerValue>(PointeeLoc));		return &takeOwnership(std::make_unique<PointerValue>(PointeeLoc));
}		}

if (Type->isStructureOrClassType()) {		if (Type->isStructureOrClassType() &&
		!DACtx->isExcludedRecordType(*Type->getAs<RecordType>())) {
CreatedValuesCount++;		CreatedValuesCount++;
// FIXME: Initialize only fields that are accessed in the context that is		// FIXME: Initialize only fields that are accessed in the context that is
// being analyzed.		// being analyzed.
llvm::DenseMap<const ValueDecl *, Value *> FieldValues;		llvm::DenseMap<const ValueDecl *, Value *> FieldValues;
for (const FieldDecl *Field : getObjectFields(Type)) {		for (const FieldDecl *Field : getObjectFields(Type)) {
assert(Field != nullptr);		assert(Field != nullptr);

QualType FieldType = Field->getType();		QualType FieldType = Field->getType();
(55 lines not shown)

clang/unittests/Analysis/FlowSensitive/DataflowEnvironmentTest.cpp

(17 lines not shown)
#include "gtest/gtest.h"		#include "gtest/gtest.h"
#include <memory>		#include <memory>

namespace {		namespace {

using namespace clang;		using namespace clang;
using namespace dataflow;		using namespace dataflow;
using ::testing::ElementsAre;		using ::testing::ElementsAre;
		using ::testing::IsNull;
using ::testing::NotNull;		using ::testing::NotNull;
using ::testing::Pair;		using ::testing::Pair;

class EnvironmentTest : public ::testing::Test {		class EnvironmentTest : public ::testing::Test {
protected:		protected:
EnvironmentTest() : DAContext(std::make_unique<WatchedLiteralsSolver>()) {}		EnvironmentTest() : DAContext(std::make_unique<WatchedLiteralsSolver>()) {}

DataflowAnalysisContext DAContext;		DataflowAnalysisContext DAContext;
(63 lines not shown)	std::string Code = R"cc(
int Target () { return Global; }		int Target () { return Global; }
)cc";		)cc";

auto Unit =		auto Unit =
tooling::buildASTFromCodeWithArgs(Code, {"-fsyntax-only", "-std=c++11"});		tooling::buildASTFromCodeWithArgs(Code, {"-fsyntax-only", "-std=c++11"});
auto &Context = Unit->getASTContext();		auto &Context = Unit->getASTContext();

ASSERT_EQ(Context.getDiagnostics().getClient()->getNumErrors(), 0U);		ASSERT_EQ(Context.getDiagnostics().getClient()->getNumErrors(), 0U);

auto Results =		auto Results =
match(decl(anyOf(varDecl(hasName("Global")).bind("global"),		match(decl(anyOf(varDecl(hasName("Global")).bind("global"),
functionDecl(hasName("Target")).bind("target"))),		functionDecl(hasName("Target")).bind("target"))),
Context);		Context);
const auto *Fun = selectFirst<FunctionDecl>("target", Results);		const auto *Fun = selectFirst<FunctionDecl>("target", Results);
const auto *Var = selectFirst<VarDecl>("global", Results);		const auto *Var = selectFirst<VarDecl>("global", Results);
ASSERT_TRUE(Fun != nullptr);		ASSERT_TRUE(Fun != nullptr);
ASSERT_THAT(Var, NotNull());		ASSERT_THAT(Var, NotNull());
(14 lines not shown)	std::string Code = R"cc(
};		};
)cc";		)cc";

auto Unit =		auto Unit =
tooling::buildASTFromCodeWithArgs(Code, {"-fsyntax-only", "-std=c++11"});		tooling::buildASTFromCodeWithArgs(Code, {"-fsyntax-only", "-std=c++11"});
auto &Context = Unit->getASTContext();		auto &Context = Unit->getASTContext();

ASSERT_EQ(Context.getDiagnostics().getClient()->getNumErrors(), 0U);		ASSERT_EQ(Context.getDiagnostics().getClient()->getNumErrors(), 0U);

auto Results =		auto Results =
match(decl(anyOf(		match(decl(anyOf(
varDecl(hasName("Global")).bind("global"),		varDecl(hasName("Global")).bind("global"),
cxxConstructorDecl(ofClass(hasName("Target"))).bind("target"))),		cxxConstructorDecl(ofClass(hasName("Target"))).bind("target"))),
Context);		Context);
const auto *Ctor = selectFirst<CXXConstructorDecl>("target", Results);		const auto *Ctor = selectFirst<CXXConstructorDecl>("target", Results);
const auto *Var = selectFirst<VarDecl>("global", Results);		const auto *Var = selectFirst<VarDecl>("global", Results);
ASSERT_TRUE(Ctor != nullptr);		ASSERT_TRUE(Ctor != nullptr);
ASSERT_THAT(Var, NotNull());		ASSERT_THAT(Var, NotNull());

// Verify the global variable is populated when we analyze `Target`.		// Verify the global variable is populated when we analyze `Target`.
Environment Env(DAContext, *Ctor);		Environment Env(DAContext, *Ctor);
EXPECT_THAT(Env.getValue(*Var, SkipPast::None), NotNull());		EXPECT_THAT(Env.getValue(*Var, SkipPast::None), NotNull());
}		}

		TEST(ExcludeProtobufTypesTest, ExcludeEnabled) {
		using namespace ast_matchers;

		DataflowAnalysisContext DAContext(std::make_unique<WatchedLiteralsSolver>(),
		{/*ExcludeGoogleProtobufs=*/true});
		Environment Env(DAContext);

		std::string Code = R"cc(
		namespace google {
		namespace protobuf {
		struct Message {};
		}
		}

		namespace other {
		namespace google {
		namespace protobuf {
		struct Message {};
		}
		}
		}

		struct Bar : public google::protobuf::Message {
		bool Field;
		};

		// Not a protobuf, but looks like it. Verify that it is not excluded.
		struct Zab : public other::google::protobuf::Message {
		bool Field;
		};

		void target() {
		Bar B;
		Zab Z;
		(void)0;
		/*[[check]]*/
		}
		)cc";

		auto Unit =
		tooling::buildASTFromCodeWithArgs(Code, {"-fsyntax-only", "-std=c++11"});
		auto &Context = Unit->getASTContext();

		ASSERT_EQ(Context.getDiagnostics().getClient()->getNumErrors(), 0U);
		auto Results = match(varDecl(eachOf(varDecl(hasName("B")).bind("varB"),
		varDecl(hasName("Z")).bind("varZ"))),
		Context);
		const auto *B = selectFirst<VarDecl>("varB", Results);
		ASSERT_TRUE(B != nullptr);
		const auto *Z = selectFirst<VarDecl>("varZ", Results);
		ASSERT_TRUE(Z != nullptr);

		EXPECT_THAT(Env.createValue(B->getType()), IsNull());
		EXPECT_THAT(Env.createValue(Z->getType()), NotNull());
		}

		TEST(ExcludeProtobufTypesTest, ExcludeDisabled) {
		using namespace ast_matchers;

		DataflowAnalysisContext DAContext(std::make_unique<WatchedLiteralsSolver>(),
		{/*ExcludeGoogleProtobufs=*/false});
		Environment Env(DAContext);

		std::string Code = R"cc(
		namespace google {
		namespace protobuf {
		struct Message {};
		}
		}

		namespace other {
		namespace google {
		namespace protobuf {
		struct Message {};
		}
		}
		}

		struct Bar : public google::protobuf::Message {
		bool Field;
		};

		// Not a protobuf, but looks like it. Verify that it is not excluded.
		struct Zab : public other::google::protobuf::Message {
		bool Field;
		};

		void target() {
		Bar B;
		Zab Z;
		(void)0;
		/*[[check]]*/
		}
		)cc";

		auto Unit =
		tooling::buildASTFromCodeWithArgs(Code, {"-fsyntax-only", "-std=c++11"});
		auto &Context = Unit->getASTContext();

		ASSERT_EQ(Context.getDiagnostics().getClient()->getNumErrors(), 0U);

		auto Results = match(varDecl(eachOf(varDecl(hasName("B")).bind("varB"),
		varDecl(hasName("Z")).bind("varZ"))),
		Context);
		const auto *B = selectFirst<VarDecl>("varB", Results);
		ASSERT_TRUE(B != nullptr);
		const auto *Z = selectFirst<VarDecl>("varZ", Results);
		ASSERT_TRUE(Z != nullptr);

		EXPECT_THAT(Env.createValue(B->getType()), NotNull());
		EXPECT_THAT(Env.createValue(Z->getType()), NotNull());
		}

} // namespace		} // namespace
