Diff 102700

include/clang/Analysis/CloneDetection.h

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	/// to the same CloneGroup.			/// to the same CloneGroup.
	static void splitCloneGroups(			static void splitCloneGroups(
	std::vector<CloneDetector::CloneGroup> &CloneGroups,			std::vector<CloneDetector::CloneGroup> &CloneGroups,
	std::function<bool(const StmtSequence &, const StmtSequence &)> Compare);			std::function<bool(const StmtSequence &, const StmtSequence &)> Compare);
	};			};

	/// Searches all children of the given clones for type II clones (i.e. they are			/// Searches all children of the given clones for type II clones (i.e. they are
	/// identical in every aspect beside the used variable names).			/// identical in every aspect beside the used variable names).
				///
				/// This constraint is also available to be executed in two phases, see
				/// RecursiveCloneTypeIIHashConstraint and RecursiveCloneTypeIIVerifyConstraint
				/// for more.
				NoQUnsubmitted Not Done Reply Inline Actions Since this constraint is a trivial composition of the other two now, is it useful at all as a separate class? Maybe just remove it? NoQ: Since this constraint is a trivial composition of the other two now, is it useful at all as a…
	class RecursiveCloneTypeIIConstraint {			class RecursiveCloneTypeIIConstraint {
				public:
				void constrain(std::vector<CloneDetector::CloneGroup> &Sequences);
				v.g.vassilevUnsubmitted Not Done Reply Inline Actions Could we typedef `std::vector<CloneDetector::CloneGroup>` into `CloneDetector::CloneGroups`? v.g.vassilev: Could we typedef `std::vector<CloneDetector::CloneGroup>` into `CloneDetector::CloneGroups`?
				teemperorAuthorUnsubmitted Done Reply Inline Actions Yes, I'll do this in another patch for the whole CloneDetector code base. teemperor: Yes, I'll do this in another patch for the whole CloneDetector code base.
				};

	/// Generates and saves a hash code for the given Stmt.			/// This constraint performs only the hashing part of the
	/// \param S The given Stmt.			/// RecursiveCloneTypeIIConstraint.
	/// \param D The Decl containing S.			///
	/// \param StmtsByHash Output parameter that will contain the hash codes for			/// It is supposed to be fast and can be used at the front of the constraint
	/// each StmtSequence in the given Stmt.			/// chain. However, it has a tiny chance to generate false-positives where the
	/// \return The hash code of the given Stmt.			/// clones in a clone group are not actually type II clones of each other.
	///			/// This happens only due to hash collisions and they can be removed by the
	/// If the given Stmt is a CompoundStmt, this method will also generate			/// RecursiveCloneTypeIIVerifyConstraint.
				NoQUnsubmitted Not Done Reply Inline Actions As a personal preference, i'd probably like to see a more straightforward answer to "what exactly does this do?", rather than "what is this part of?" and "how fast this is?". Eg., "RecursiveCloneTypeIIHashConstraint computes a hash of each statement sequence; sequences with different hash values are moved into separate clone groups. Collisions are possible, and this constraint does nothing to address this them. Add the slower RecursiveCloneTypeIIVerifyConstraint later in the constraint chain, not necessarily immediately, to eliminate hash collisions through a more detailed analysis." NoQ: As a personal preference, i'd probably like to see a more straightforward answer to "what…
	/// hashes for all possible StmtSequences in the children of this Stmt.			class RecursiveCloneTypeIIHashConstraint {
	size_t saveHash(const Stmt S, const Decl D,			public:
	std::vector<std::pair<size_t, StmtSequence>> &StmtsByHash);			void constrain(std::vector<CloneDetector::CloneGroup> &Sequences);
				};

				/// This constraint performs only the verification part of the
				/// RecursiveCloneTypeIIConstraint.
				///
				/// It is supposed to be used behind the RecursiveCloneTypeIIHashConstraint
				/// and verifies that all clones in a group are actually type II clones of
				/// each other. However, this constraint is quite slow, so if you have faster
				/// constraints that can handle false-positives generated by hash collisions,
				/// then prepend those constraints to this one for optimal performance.
				class RecursiveCloneTypeIIVerifyConstraint {
	public:			public:
	void constrain(std::vector<CloneDetector::CloneGroup> &Sequences);			void constrain(std::vector<CloneDetector::CloneGroup> &Sequences);
	};			};

	/// Ensures that every clone has at least the given complexity.			/// Ensures that every clone has at least the given complexity.
	///			///
	/// Complexity is here defined as the total amount of children of a statement.			/// Complexity is here defined as the total amount of children of a statement.
	/// This constraint assumes the first statement in the group is representative			/// This constraint assumes the first statement in the group is representative
	▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lib/Analysis/CloneDetection.cpp

	Show First 20 Lines • Show All 92 Lines • ▼ Show 20 Lines
	// Copy as much as possible of the generated hash code to the Stmt's hash			// Copy as much as possible of the generated hash code to the Stmt's hash
	// code.			// code.
	std::memcpy(&HashCode, &HashResult,			std::memcpy(&HashCode, &HashResult,
	std::min(sizeof(HashCode), sizeof(HashResult)));			std::min(sizeof(HashCode), sizeof(HashResult)));

	return HashCode;			return HashCode;
	}			}

	size_t RecursiveCloneTypeIIConstraint::saveHash(			/// Generates and saves a hash code for the given Stmt.
	const Stmt S, const Decl D,			/// \param S The given Stmt.
	std::vector<std::pair<size_t, StmtSequence>> &StmtsByHash) {			/// \param D The Decl containing S.
				/// \param StmtsByHash Output parameter that will contain the hash codes for
				/// each StmtSequence in the given Stmt.
				/// \return The hash code of the given Stmt.
				///
				/// If the given Stmt is a CompoundStmt, this method will also generate
				/// hashes for all possible StmtSequences in the children of this Stmt.
				static size_t
				saveHash(const Stmt S, const Decl D,
				std::vector<std::pair<size_t, StmtSequence>> &StmtsByHash) {
	llvm::MD5 Hash;			llvm::MD5 Hash;
	ASTContext &Context = D->getASTContext();			ASTContext &Context = D->getASTContext();

	StmtDataCollector<llvm::MD5>(S, Context, Hash);			StmtDataCollector<llvm::MD5>(S, Context, Hash);

	auto CS = dyn_cast<CompoundStmt>(S);			auto CS = dyn_cast<CompoundStmt>(S);
	SmallVector<size_t, 8> ChildHashes;			SmallVector<size_t, 8> ChildHashes;

	▲ Show 20 Lines • Show All 74 Lines • ▼ Show 20 Lines
	CollectStmtSequenceData(LHS, LHSWrapper);			CollectStmtSequenceData(LHS, LHSWrapper);
	CollectStmtSequenceData(RHS, RHSWrapper);			CollectStmtSequenceData(RHS, RHSWrapper);

	return DataLHS == DataRHS;			return DataLHS == DataRHS;
	}			}

	void RecursiveCloneTypeIIConstraint::constrain(			void RecursiveCloneTypeIIConstraint::constrain(
	std::vector<CloneDetector::CloneGroup> &Sequences) {			std::vector<CloneDetector::CloneGroup> &Sequences) {
				RecursiveCloneTypeIIHashConstraint Hash;
				Hash.constrain(Sequences);
				RecursiveCloneTypeIIVerifyConstraint Verify;
				Verify.constrain(Sequences);
				}

				void RecursiveCloneTypeIIHashConstraint::constrain(
				std::vector<CloneDetector::CloneGroup> &Sequences) {
	// FIXME: Maybe we can do this in-place and don't need this additional vector.			// FIXME: Maybe we can do this in-place and don't need this additional vector.
	std::vector<CloneDetector::CloneGroup> Result;			std::vector<CloneDetector::CloneGroup> Result;

	for (CloneDetector::CloneGroup &Group : Sequences) {			for (CloneDetector::CloneGroup &Group : Sequences) {
	// We assume in the following code that the Group is non-empty, so we			// We assume in the following code that the Group is non-empty, so we
	// skip all empty groups.			// skip all empty groups.
	if (Group.empty())			if (Group.empty())
	continue;			continue;
	Show All 23 Lines
	// represent a CloneGroup, so we create a new group and start checking and			// represent a CloneGroup, so we create a new group and start checking and
	// adding the StmtSequences in this sequence.			// adding the StmtSequences in this sequence.
	CloneDetector::CloneGroup NewGroup;			CloneDetector::CloneGroup NewGroup;

	size_t PrototypeHash = Current.first;			size_t PrototypeHash = Current.first;

	for (; i < StmtsByHash.size(); ++i) {			for (; i < StmtsByHash.size(); ++i) {
	// A different hash value means we have reached the end of the sequence.			// A different hash value means we have reached the end of the sequence.
	if (PrototypeHash != StmtsByHash[i].first \|\|			if (PrototypeHash != StmtsByHash[i].first) {
	!areSequencesClones(StmtsByHash[i].second, Current.second)) {
	// The current sequence could be the start of a new CloneGroup. So we			// The current sequence could be the start of a new CloneGroup. So we
	// decrement i so that we visit it again in the outer loop.			// decrement i so that we visit it again in the outer loop.
	// Note: i can never be 0 at this point because we are just comparing			// Note: i can never be 0 at this point because we are just comparing
	// the hash of the Current StmtSequence with itself in the 'if' above.			// the hash of the Current StmtSequence with itself in the 'if' above.
	assert(i != 0);			assert(i != 0);
	--i;			--i;
	break;			break;
	}			}
	// Same hash value means we should add the StmtSequence to the current			// Same hash value means we should add the StmtSequence to the current
	// group.			// group.
	NewGroup.push_back(StmtsByHash[i].second);			NewGroup.push_back(StmtsByHash[i].second);
	}			}

	// We created a new clone group with matching hash codes and move it to			// We created a new clone group with matching hash codes and move it to
	// the result vector.			// the result vector.
	Result.push_back(NewGroup);			Result.push_back(NewGroup);
	}			}
	}			}
	// Sequences is the output parameter, so we copy our result into it.			// Sequences is the output parameter, so we copy our result into it.
	Sequences = Result;			Sequences = Result;
	}			}

				void RecursiveCloneTypeIIVerifyConstraint::constrain(
				std::vector<CloneDetector::CloneGroup> &Sequences) {
				CloneConstraint::splitCloneGroups(
				Sequences, [](const StmtSequence &A, const StmtSequence &B) {
				return areSequencesClones(A, B);
				});
				}

	size_t MinComplexityConstraint::calculateStmtComplexity(			size_t MinComplexityConstraint::calculateStmtComplexity(
	const StmtSequence &Seq, const std::string &ParentMacroStack) {			const StmtSequence &Seq, const std::string &ParentMacroStack) {
	if (Seq.empty())			if (Seq.empty())
	return 0;			return 0;

	size_t Complexity = 1;			size_t Complexity = 1;

	ASTContext &Context = Seq.getASTContext();			ASTContext &Context = Seq.getASTContext();
	▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

lib/StaticAnalyzer/Checkers/CloneChecker.cpp

Show First 20 Lines • Show All 72 Lines • ▼ Show 20 Lines	void CloneChecker::checkEndOfTranslationUnit(const TranslationUnitDecl *TU,
bool ReportNormalClones = Mgr.getAnalyzerOptions().getBooleanOption(		bool ReportNormalClones = Mgr.getAnalyzerOptions().getBooleanOption(
"ReportNormalClones", true, this);		"ReportNormalClones", true, this);

// Let the CloneDetector create a list of clones from all the analyzed		// Let the CloneDetector create a list of clones from all the analyzed
// statements. We don't filter for matching variable patterns at this point		// statements. We don't filter for matching variable patterns at this point
// because reportSuspiciousClones() wants to search them for errors.		// because reportSuspiciousClones() wants to search them for errors.
std::vector<CloneDetector::CloneGroup> AllCloneGroups;		std::vector<CloneDetector::CloneGroup> AllCloneGroups;

Detector.findClones(AllCloneGroups, RecursiveCloneTypeIIConstraint(),		Detector.findClones(
MinComplexityConstraint(MinComplexity),		AllCloneGroups, RecursiveCloneTypeIIHashConstraint(),
MinGroupSizeConstraint(2), OnlyLargestCloneConstraint());		MinGroupSizeConstraint(2), MinComplexityConstraint(MinComplexity),
		RecursiveCloneTypeIIVerifyConstraint(), OnlyLargestCloneConstraint());

if (ReportSuspiciousClones)		if (ReportSuspiciousClones)
reportSuspiciousClones(BR, Mgr, AllCloneGroups);		reportSuspiciousClones(BR, Mgr, AllCloneGroups);

// We are done for this translation unit unless we also need to report normal		// We are done for this translation unit unless we also need to report normal
// clones.		// clones.
if (!ReportNormalClones)		if (!ReportNormalClones)
return;		return;
▲ Show 20 Lines • Show All 92 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[analyzer] Performance optimizations for the CloneChecker
ClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 102700

include/clang/Analysis/CloneDetection.h

lib/Analysis/CloneDetection.cpp

lib/StaticAnalyzer/Checkers/CloneChecker.cpp

This is an archive of the discontinued LLVM Phabricator instance.

[analyzer] Performance optimizations for the CloneCheckerClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 102700

include/clang/Analysis/CloneDetection.h

lib/Analysis/CloneDetection.cpp

lib/StaticAnalyzer/Checkers/CloneChecker.cpp

[analyzer] Performance optimizations for the CloneChecker
ClosedPublic