This is an archive of the discontinued LLVM Phabricator instance.

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
25	Before i forget: Ideally @martong should have subscribed to [[ https://clang.llvm.org/doxygen/classclang_1_1ento_1_1CheckerDocumentation.html#a7fdb3b5ff726f4c5e782cef0d59c01ad \| `checkNewAllocator` ]] because it fires before the construct-expression, whereas this callback fires after the construct-expression, which is too late: the UB we're trying to catch has occurred much earlier.
165–166	You're saying that if `A` is a struct, `a` is of type `A`, and `&a` is sufficiently aligned, then for every field `f` in the struct `&a.f` is sufficiently aligned. I'm not sure that's actually the case.
196–197	I don't think you'll ever see this case in a real-world program. Even if you would, i doubt we'll behave as expected, because we have certain hacks in place that mess up arithmetic on concrete pointers. I appreciate your thinking but i suggest removing this section for now as it'll probably cause more false positives than true positives.
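
For illustration, the concern about lines 165–166 can be checked outside the analyzer. The snippet below is only a sketch, not code from the patch: the struct `Y` and the helper `offsetKeepsAlignment` are hypothetical names, and it assumes a typical ABI where `int` is 4-byte aligned and consecutive `char` fields are packed.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical struct for illustration only (not taken from the checker).
struct Y {
  char a;
  int b;
  char c;
  char d;
};

// Returns true if a byte offset from a suitably aligned base address keeps
// the required alignment. Both arguments are in bytes.
constexpr bool offsetKeepsAlignment(std::size_t Offset, std::size_t Align) {
  return Offset % Align == 0;
}

// The first field shares the struct's address, so it inherits the struct's
// alignment. A later field such as 'd' (typically at offset 9) does not.
inline bool firstFieldIsIntAligned() {
  return offsetKeepsAlignment(offsetof(Y, a), alignof(int));
}
inline bool lastFieldIsIntAligned() {
  return offsetKeepsAlignment(offsetof(Y, d), alignof(int));
}
```

So a sufficiently aligned `&y` says nothing by itself about `&y.d`; the field offset has to be taken into account as well.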

Wohoow! I am impressed, this is really nice work, I like it! :) Could not find any glitch, looks good from my side once you address NoQ's concerns.

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
125	Maybe the wording below would be easier to follow? `{0} bytes is possibly not enough for array allocation which ...`

Removed code and tests for ConcreteInt cases
Fixed FieldRegion check
Added handling for ElementRegion cases such as

void f7() {
  short b[10];

  // ok. 2(short align) + 3*2(index '1' offset)
  ::new (&b[3]) long;
}

Fixed align error message
Maybe fixed lint warnings

Harbormaster failed remote builds in B50983: Diff 253628! Mar 30 2020, 10:50 AM

test fix

Harbormaster failed remote builds in B50989: Diff 253642! Mar 30 2020, 11:57 AM

martong added inline comments. Mar 31 2020, 12:44 AM

clang/test/Analysis/placement-new.cpp
265	Maybe it is just me, but the contents of the parens here and above seem a bit muddled: `(index '2' offset)`. This should be `(index '1' offset)`, shouldn't it? What is the exact meaning of the number in the quotes (`'2'` in this case)? Could you please elaborate?

Fixed comments in tests

Harbormaster failed remote builds in B51101: Diff 253803! Mar 31 2020, 2:10 AM

LGTM! Thanks! But I am not that confident with the element regions and field regions, so @NoQ could you please take another look?

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp

This will break build-bots that run with -Werror.

../../git/llvm-project/clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp:82:15: warning: suggest parentheses around assignment used as truth value [-Wparentheses]
   if (IsArray = NE->isArray()) {
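
One common way to avoid this warning is to make the assignment its own statement and test the variable afterwards (wrapping the assignment in an extra pair of parentheses also silences it in GCC and Clang). The sketch below uses a hypothetical `FakeNewExpr` stand-in rather than the real `CXXNewExpr`, so it is an illustration of the pattern and not the fix that was actually applied.

```cpp
#include <cassert>

// Hypothetical stand-in for an expression node; not the real Clang API.
struct FakeNewExpr {
  bool Array;
  bool isArray() const { return Array; }
};

// Records whether the expression allocates an array in the out-parameter,
// then branches on the stored value. Returns 1 for arrays, 0 otherwise.
inline int classify(const FakeNewExpr &NE, bool &IsArray) {
  IsArray = NE.isArray(); // assignment is a separate statement
  if (IsArray)            // condition is a plain boolean test
    return 1;
  return 0;
}
```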

clang/test/Analysis/placement-new.cpp

256

At first I wondered whether we correctly handle structs with nested arrays whose element type is itself a struct with nested arrays (... and so on).

So I tried the test below, and it seems okay. I think it would be worth adding something similar.

void f9_1() {
  struct Y {
    char a;
    alignas(alignof(short)) char b[20];
  };
  struct X {
    char e;
    Y f[20];
  } Xi; // expected-note {{'Xi' initialized here}}

  // ok 2(custom align) + 6*1(index '6' offset)
  ::new (&Xi.f[6].b[6]) long;

  // bad 2(custom align) + 1*1(index '1' offset)
  ::new (&Xi.f[1].b[1]) long; // expected-warning{{Storage type is aligned to 3 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
}

This revision is now accepted and ready to land. Mar 31 2020, 2:37 AM

Maybe build fix
Added tests for nested arrays of structures
Fixed bugs in implementation for ElementRegion cases

NoQ added inline comments. Apr 6 2020, 12:58 AM

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
187	The sequence of `FieldRegion`s and `ElementRegion`s on top of a base region may be arbitrary: `var.a[0].b[1][2].c.d[3]` etc. I'd rather unwrap those regions one-by-one in a loop and look at the alignment of each layer.

NoQ added inline comments. Apr 6 2020, 4:01 AM

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
187	Alternatively, just decompose the whole region into base region and offset and see if base region has the necessary alignment and the offset is divisible by the necessary alignment.

Ping? :)

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
25	Oops, I forgot about it. Will be fixed soon.
25	When I use checkNewAllocator instead of check::PreStmt<CXXNewExpr>, an error occurs in the method PathDiagnosticBuilder::generate, in the code ErrorNode->getLocation().getTag(), because the ProgramPoint contains an empty ProgramPointTag. I'm not very familiar with the analyzer, so it's hard to understand what's going on, but I'm still trying.
125	Yeah. Sure. I have some trouble with English :)
165–166	Yeah. Maybe we should take into account the struct type's alignment plus the field offset? Currently, I rely only on the fact that the struct type itself is aligned properly. For example:

struct X {
  char a;
  int b;
} x;

Type X is aligned to 'int', and thus field 'x.a' is also aligned to 'int' because it comes first.

struct Y {
  char a;
  int b;
  char c;
  char d;
} y;

But here field 'y.d' is aligned only to 'char'. I will learn more about RegionOffset and will be back :)
187	> The sequence of FieldRegions and ElementRegions on top of a base region may be arbitrary: var.a[0].b[1][2].c.d[3] etc.

But I think (hope) I already do this and even have tests for these cases. For example:

void test22() {
  struct alignas(alignof(short)) Z {
    char p;
    char c[10];
  };
  struct Y {
    char p;
    Z b[10];
  };
  struct X {
    Y a[10];
  } Xi; // expected-note {{'Xi' initialized here}}

  // ok. 2(X align) + 1 (offset Y.p) + 1(align Z.p to 'short') + 1(offset Z.p) + 3(index)
  ::new (&Xi.a[0].b[0].c[3]) long;
}

Cases with multidimensional arrays will also be handled correctly, because the method 'TheElementRegion->getAsArrayOffset()' calculates the offset for multidimensional arrays:

void testXX() {
  struct Y {
    char p;
    char b[10][10];
  };
  struct X {
    Y a[10];
  } Xi;

  ::new (&Xi.a[0].b[0][0]) long;
}

I can explain the code below for ElementRegion if needed.
187	?
196–197	Okay I will remove it!
clang/test/Analysis/placement-new.cpp
256	Thanks! Added tests for these cases.
265	Yeah, sorry, it is a copy-paste error. Of course it should be '1'. // bad 2(custom align) + 1(index '1' offset)

martong added inline comments. Apr 20 2020, 1:49 AM

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
187	Yeah, the tests are convincing and I think that you are handling the regions well. On the other hand, this code is getting really complex; we should make it easier to read and understand. E.g. `FieldOffsetValue` should be explained more: is it the offset from the start address of the multidimensional array, or just the offset from one element's start address? Also, you have two variables named `Offset`. From which starting address are they offsets? Perhaps we should have a running example in the comments, maybe for `&Xi.a[0].b[0][0]`? I mean, does `FieldOffsetValue` stand for `b` or for `a`?

NoQ added inline comments. Apr 20 2020, 2:41 AM

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
125	Why do we say "possibly"? Where does the uncertainty come from?
187	> Alternatively, just decompose the whole region into base region and offset and see if base region has the necessary alignment and the offset is divisible by the necessary alignment.

I expect this to be, like, 5 lines of code. I don't understand why the current code is so complicated, it looks like you're considering multiple cases but ultimately doing the same thing.
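
The "base region plus offset" idea can be sketched in plain C++, independent of the analyzer's region classes. Everything named here (`DecomposedPlace`, `isProvablyAligned`) is hypothetical and only models the arithmetic; it is not the code that landed and does not use any Clang API.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified model of a decomposed place: the alignment the
// base variable is known to have and the byte offset from its start.
struct DecomposedPlace {
  std::uint64_t BaseAlign;   // known alignment of the base variable, in bytes
  std::uint64_t OffsetBytes; // offset from the start of the base variable
};

// The place is provably aligned for the allocated type only if the base is
// aligned at least as strictly as required and the offset is a multiple of
// the required alignment. Alignments are assumed to be powers of two.
inline bool isProvablyAligned(const DecomposedPlace &P,
                              std::uint64_t RequiredAlign) {
  return P.BaseAlign >= RequiredAlign && P.OffsetBytes % RequiredAlign == 0;
}
```

With this model, a `short` array (base alignment 2) at byte offset 6 is not provably aligned for an 8-byte-aligned type, while an 8-byte-aligned base at offset 16 is.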

Rewrote ElementRegion processing and fixed the tests for these cases.
Simplified the code a bit.

Harbormaster failed remote builds in B54369: Diff 259499! Apr 23 2020, 3:13 AM

martong added inline comments. Apr 23 2020, 9:03 AM

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
271	Perhaps you could call instead `checkVarRegionAlign()`?
271	Also, I think `BaseRegion` can be an `ElementRegion` here as well. So, in that case we should call into `checkElementRegionAlign()`, shouldn't we? This draws a pattern that we should recursively descend down to the top most base region. I.e. the different `check*RegionAlign` methods should call into each other until we reach the top level base region. The observation here is that the alignment of a region can be correct only if we can prove that its base region is aligned properly (and other requirements, e.g. the offset is divisible). But the base region may have another base region and we have to prove the alignment correctness to that as well. I hope this makes sense, please correct me if I am wrong.
clang/test/Analysis/placement-new.cpp
245	So these tests failed after you rewrote ElementRegion processing, right? Actually, I wonder why we thought that if x is divisible by 2 then (x+6) will be divisible by 8 unconditionally. It's good that you have fixed this.

Refactoring
Build fix

Harbormaster failed remote builds in B54441: Diff 259636! Apr 23 2020, 11:54 AM

Build fix

... This draws a pattern that we should recursively descend down to the top most base region. I.e. the different check*RegionAlign methods should call into each other until we reach the top level base region.

The observation here is that the alignment of a region can be correct only if we can prove that its base region is aligned properly (and other requirements, e.g. the offset is divisible). But the base region may have another base region and we have to prove the alignment correctness to that as well.

This could be an issue not just with alignment but maybe with the size as well, I am not sure if we handle the offset properly in compound cases like this: Xi.b[0].a[1][6].

Even though the above issue is still not investigated/handled, I think this patch is now acceptable because seems like most of the practical cases are handled. We could further investigate the concern and improve in a follow-up patch.
I'd like to see this landed and thanks for your work!

In D76996#2017572, @martong wrote:

... This draws a pattern that we should recursively descend down to the top most base region. I.e. the different check*RegionAlign methods should call into each other until we reach the top level base region.

The observation here is that the alignment of a region can be correct only if we can prove that its base region is aligned properly (and other requirements, e.g. the offset is divisible). But the base region may have another base region and we have to prove the alignment correctness to that as well.

This could be an issue not just with alignment but maybe with the size as well, I am not sure if we handle the offset properly in compound cases like this: Xi.b[0].a[1][6].

Even though the above issue is still not investigated/handled, I think this patch is now acceptable because seems like most of the practical cases are handled. We could further investigate the concern and improve in a follow-up patch.
I'd like to see this landed and thanks for your work!

Thanks for feedback!

I still have no rights to push in the repo so if you think that it is acceptable could you commit it please?

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp
125	I don’t know what specific size Clang uses for its internal needs in array cases. If you can tell this size, I will use it here.
187	Thank you for the feedback! Rewrote the code for ElementRegion cases. Now it is much easier to understand, and there was also a bug in the logic.
271	> Perhaps you could call instead checkVarRegionAlign()?

Yeah, you are right. Thank you!

> Also, I think BaseRegion can be an ElementRegion here as well. So, in that case we should call into checkElementRegionAlign(), shouldn't we? This draws a pattern that we should recursively descend down to the top most base region. I.e. the different check*RegionAlign methods should call into each other until we reach the top level base region.

I'm not sure that BaseRegion can be an ElementRegion. Anyway, there always must be some variable at the end? Or maybe I am wrong?

> The observation here is that the alignment of a region can be correct only if we can prove that its base region is aligned properly (and other requirements, e.g. the offset is divisible). But the base region may have another base region and we have to prove the alignment correctness to that as well. I hope this makes sense, please correct me if I am wrong.

I split the check into two stages. First, check the BaseRegion alignment; but if the Var has its own alignment specifier, we ignore the BaseRegion alignment. Second, check that the total offset is divisible by the necessary alignment. When I say total offset, I mean that it is calculated from the BaseRegion through all Field and Element regions (e.g. for Xi.a[10].b[20].c[30], the total offset of 'c[30]' is the offset from &Xi). So with this solution we do not need to recursively check the alignment of every region.
clang/test/Analysis/placement-new.cpp
245	Yes, it is my fault. For example, a variable can be allocated at address 0x149E730, which is well aligned to 2, 4 and 8. Then the assertion that '0+6' is well aligned to '8' will be wrong.
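
The arithmetic behind this can be checked directly. The helper `isAligned` below is a hypothetical name used only for this sketch; the addresses are the one quoted above and two made-up neighbours that are 2-byte aligned.

```cpp
#include <cassert>
#include <cstdint>

// Returns true if the numeric address is a multiple of Align (in bytes).
inline bool isAligned(std::uintptr_t Address, std::uintptr_t Align) {
  return Address % Align == 0;
}

// Address of an element located 6 bytes past the start of its base variable.
inline std::uintptr_t sixBytesPast(std::uintptr_t Base) { return Base + 6; }
```

A base at 0x149E730 is 8-byte aligned, yet 0x149E730 + 6 is not. For bases that are only known to be 2-byte aligned, the result depends on the actual address (0x149E732 + 6 happens to be 8-byte aligned, 0x149E734 + 6 is not), so "base alignment 2 plus offset 6" cannot prove 8-byte alignment.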

Thanks, just committed.

Closed by commit rG7c3768495e8c: [analyzer] Improve PlacementNewChecker (authored by martong). May 14 2020, 6:59 AM

This revision was automatically updated to reflect the committed changes.

https://bugs.llvm.org/show_bug.cgi?id=46266!

Revision Contents

Path	Size
clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp	210 lines
clang/test/Analysis/placement-new.cpp	144 lines

Diff 253642

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp

	#include "llvm/Support/FormatVariadic.h"			#include "llvm/Support/FormatVariadic.h"

	using namespace clang;			using namespace clang;
	using namespace ento;			using namespace ento;

	namespace {			namespace {
	class PlacementNewChecker : public Checker<check::PreStmt<CXXNewExpr>> {			class PlacementNewChecker : public Checker<check::PreStmt<CXXNewExpr>> {
	public:			public:
	void checkPreStmt(const CXXNewExpr *NE, CheckerContext &C) const;			void checkPreStmt(const CXXNewExpr *NE, CheckerContext &C) const;

	private:			private:
				bool checkPlaceCapacityIsSufficient(const CXXNewExpr *NE,
				CheckerContext &C) const;

				bool checkPlaceIsAlignedProperly(const CXXNewExpr *NE,
				CheckerContext &C) const;

	// Returns the size of the target in a placement new expression.			// Returns the size of the target in a placement new expression.
	// E.g. in "new (&s) long" it returns the size of `long`.			// E.g. in "new (&s) long" it returns the size of `long`.
	SVal getExtentSizeOfNewTarget(const CXXNewExpr *NE, ProgramStateRef State,			SVal getExtentSizeOfNewTarget(const CXXNewExpr *NE, CheckerContext &C,
	CheckerContext &C) const;			bool &IsArray) const;
	// Returns the size of the place in a placement new expression.			// Returns the size of the place in a placement new expression.
	// E.g. in "new (&s) long" it returns the size of `s`.			// E.g. in "new (&s) long" it returns the size of `s`.
	SVal getExtentSizeOfPlace(const Expr *NE, ProgramStateRef State,			SVal getExtentSizeOfPlace(const CXXNewExpr *NE, CheckerContext &C) const;
	CheckerContext &C) const;			BugType SBT{this, "Insufficient storage for placement new",
	BugType BT{this, "Insufficient storage for placement new",			categories::MemoryError};
				BugType ABT{this, "Bad align storage for placement new",
	categories::MemoryError};			categories::MemoryError};
	};			};
	} // namespace			} // namespace

	SVal PlacementNewChecker::getExtentSizeOfPlace(const Expr *Place,			SVal PlacementNewChecker::getExtentSizeOfPlace(const CXXNewExpr *NE,
	ProgramStateRef State,
	CheckerContext &C) const {			CheckerContext &C) const {
				ProgramStateRef State = C.getState();
				const Expr *Place = NE->getPlacementArg(0);

	const MemRegion *MRegion = C.getSVal(Place).getAsRegion();			const MemRegion *MRegion = C.getSVal(Place).getAsRegion();
	if (!MRegion)			if (!MRegion)
	return UnknownVal();			return UnknownVal();
	RegionOffset Offset = MRegion->getAsOffset();			RegionOffset Offset = MRegion->getAsOffset();
	if (Offset.hasSymbolicOffset())			if (Offset.hasSymbolicOffset())
	return UnknownVal();			return UnknownVal();
	const MemRegion *BaseRegion = MRegion->getBaseRegion();			const MemRegion *BaseRegion = MRegion->getBaseRegion();
	if (!BaseRegion)			if (!BaseRegion)
	return UnknownVal();			return UnknownVal();

	SValBuilder &SvalBuilder = C.getSValBuilder();			SValBuilder &SvalBuilder = C.getSValBuilder();
	NonLoc OffsetInBytes = SvalBuilder.makeArrayIndex(			NonLoc OffsetInBytes = SvalBuilder.makeArrayIndex(
	Offset.getOffset() / C.getASTContext().getCharWidth());			Offset.getOffset() / C.getASTContext().getCharWidth());
	DefinedOrUnknownSVal ExtentInBytes =			DefinedOrUnknownSVal ExtentInBytes =
	getDynamicSize(State, BaseRegion, SvalBuilder);			getDynamicSize(State, BaseRegion, SvalBuilder);

	return SvalBuilder.evalBinOp(State, BinaryOperator::Opcode::BO_Sub,			return SvalBuilder.evalBinOp(State, BinaryOperator::Opcode::BO_Sub,
	ExtentInBytes, OffsetInBytes,			ExtentInBytes, OffsetInBytes,
	SvalBuilder.getArrayIndexType());			SvalBuilder.getArrayIndexType());
	}			}

	SVal PlacementNewChecker::getExtentSizeOfNewTarget(const CXXNewExpr *NE,			SVal PlacementNewChecker::getExtentSizeOfNewTarget(const CXXNewExpr *NE,
	ProgramStateRef State,			CheckerContext &C,
	CheckerContext &C) const {			bool &IsArray) const {
				ProgramStateRef State = C.getState();
	SValBuilder &SvalBuilder = C.getSValBuilder();			SValBuilder &SvalBuilder = C.getSValBuilder();
	QualType ElementType = NE->getAllocatedType();			QualType ElementType = NE->getAllocatedType();
	ASTContext &AstContext = C.getASTContext();			ASTContext &AstContext = C.getASTContext();
	CharUnits TypeSize = AstContext.getTypeSizeInChars(ElementType);			CharUnits TypeSize = AstContext.getTypeSizeInChars(ElementType);
	if (NE->isArray()) {			if (IsArray = NE->isArray()) {
	const Expr *SizeExpr = *NE->getArraySize();			const Expr *SizeExpr = *NE->getArraySize();
	SVal ElementCount = C.getSVal(SizeExpr);			SVal ElementCount = C.getSVal(SizeExpr);
	if (auto ElementCountNL = ElementCount.getAs<NonLoc>()) {			if (auto ElementCountNL = ElementCount.getAs<NonLoc>()) {
	// size in Bytes = ElementCountNL * TypeSize			// size in Bytes = ElementCountNL * TypeSize
	return SvalBuilder.evalBinOp(			return SvalBuilder.evalBinOp(
	State, BO_Mul, *ElementCountNL,			State, BO_Mul, *ElementCountNL,
	SvalBuilder.makeArrayIndex(TypeSize.getQuantity()),			SvalBuilder.makeArrayIndex(TypeSize.getQuantity()),
	SvalBuilder.getArrayIndexType());			SvalBuilder.getArrayIndexType());
	}			}
	} else {			} else {
	// Create a concrete int whose size in bits and signedness is equal to			// Create a concrete int whose size in bits and signedness is equal to
	// ArrayIndexType.			// ArrayIndexType.
	llvm::APInt I(AstContext.getTypeSizeInChars(SvalBuilder.getArrayIndexType())			llvm::APInt I(AstContext.getTypeSizeInChars(SvalBuilder.getArrayIndexType())
	.getQuantity() *			.getQuantity() *
	C.getASTContext().getCharWidth(),			C.getASTContext().getCharWidth(),
	TypeSize.getQuantity());			TypeSize.getQuantity());
	return SvalBuilder.makeArrayIndex(I.getZExtValue());			return SvalBuilder.makeArrayIndex(I.getZExtValue());
	}			}
	return UnknownVal();			return UnknownVal();
	}			}

	void PlacementNewChecker::checkPreStmt(const CXXNewExpr *NE,			bool PlacementNewChecker::checkPlaceCapacityIsSufficient(
	CheckerContext &C) const {			const CXXNewExpr *NE, CheckerContext &C) const {
	// Check only the default placement new.			bool IsArrayTypeAllocated;
	if (!NE->getOperatorNew()->isReservedGlobalPlacementOperator())			SVal SizeOfTarget = getExtentSizeOfNewTarget(NE, C, IsArrayTypeAllocated);
	return;			SVal SizeOfPlace = getExtentSizeOfPlace(NE, C);
	if (NE->getNumPlacementArgs() == 0)
	return;

	ProgramStateRef State = C.getState();
	SVal SizeOfTarget = getExtentSizeOfNewTarget(NE, State, C);
	const Expr *Place = NE->getPlacementArg(0);
	SVal SizeOfPlace = getExtentSizeOfPlace(Place, State, C);
	const auto SizeOfTargetCI = SizeOfTarget.getAs<nonloc::ConcreteInt>();			const auto SizeOfTargetCI = SizeOfTarget.getAs<nonloc::ConcreteInt>();
	if (!SizeOfTargetCI)			if (!SizeOfTargetCI)
	return;			return true;
	const auto SizeOfPlaceCI = SizeOfPlace.getAs<nonloc::ConcreteInt>();			const auto SizeOfPlaceCI = SizeOfPlace.getAs<nonloc::ConcreteInt>();
	if (!SizeOfPlaceCI)			if (!SizeOfPlaceCI)
	return;			return true;

	if (SizeOfPlaceCI->getValue() < SizeOfTargetCI->getValue()) {			if ((SizeOfPlaceCI->getValue() < SizeOfTargetCI->getValue()) \|\|
	if (ExplodedNode *N = C.generateErrorNode(State)) {			(IsArrayTypeAllocated &&
	std::string Msg = std::string(			SizeOfPlaceCI->getValue() >= SizeOfTargetCI->getValue())) {
	llvm::formatv("Storage provided to placement new is only {0} bytes, "			if (ExplodedNode *N = C.generateErrorNode(C.getState())) {
				std::string Msg;
				// TODO: use clang constant
				if (IsArrayTypeAllocated &&
				SizeOfPlaceCI->getValue() > SizeOfTargetCI->getValue())
				Msg = std::string(llvm::formatv(
				"{0} bytes is possibly not enough for array allocation which "
				"requires {1} bytes. Current overhead requires the size of {2} "
				"bytes",
				SizeOfPlaceCI->getValue(), SizeOfTargetCI->getValue(),
				SizeOfPlaceCI->getValue() - SizeOfTargetCI->getValue()));
				else if (IsArrayTypeAllocated &&
				SizeOfPlaceCI->getValue() == SizeOfTargetCI->getValue())
				Msg = std::string(llvm::formatv(
				"Storage provided to placement new is only {0} bytes, "
				"whereas the allocated array type requires more space for "
				"internal needs",
				SizeOfPlaceCI->getValue(), SizeOfTargetCI->getValue()));
				else
				Msg = std::string(llvm::formatv(
				"Storage provided to placement new is only {0} bytes, "
	"whereas the allocated type requires {1} bytes",			"whereas the allocated type requires {1} bytes",
	SizeOfPlaceCI->getValue(), SizeOfTargetCI->getValue()));			SizeOfPlaceCI->getValue(), SizeOfTargetCI->getValue()));

	auto R = std::make_unique<PathSensitiveBugReport>(BT, Msg, N);			auto R = std::make_unique<PathSensitiveBugReport>(SBT, Msg, N);
				bugreporter::trackExpressionValue(N, NE->getPlacementArg(0), *R);
				C.emitReport(std::move(R));

				return false;
				}
				}

				return true;
				}

				bool PlacementNewChecker::checkPlaceIsAlignedProperly(const CXXNewExpr *NE,
				CheckerContext &C) const {
				const Expr *Place = NE->getPlacementArg(0);

				QualType AllocatedT = NE->getAllocatedType();
				unsigned AllocatedTAlign = C.getASTContext().getTypeAlign(AllocatedT) /
				C.getASTContext().getCharWidth();

				auto EmitBadAlignReport = [Place, &C, AllocatedTAlign,
				this](unsigned StorageTAlign) -> void {
				ProgramStateRef State = C.getState();
				if (ExplodedNode *N = C.generateErrorNode(State)) {
				std::string Msg(llvm::formatv("Storage type is aligned to {0} bytes but "
				"allocated type is aligned to {1} bytes",
				StorageTAlign, AllocatedTAlign));

				auto R = std::make_unique<PathSensitiveBugReport>(ABT, Msg, N);
	bugreporter::trackExpressionValue(N, Place, *R);			bugreporter::trackExpressionValue(N, Place, *R);
	C.emitReport(std::move(R));			C.emitReport(std::move(R));
	return;			}
				};

				auto GetStorageAlign = [&C](const ValueDecl *TheValueDecl) -> unsigned {
				unsigned StorageTAlign =
				C.getASTContext().getTypeAlign(TheValueDecl->getType());
				if (unsigned SpecifiedAlignment = TheValueDecl->getMaxAlignment())
				StorageTAlign = SpecifiedAlignment;

				return StorageTAlign / C.getASTContext().getCharWidth();
				};

				SVal PlaceVal = C.getSVal(Place);
				if (const MemRegion *MRegion = PlaceVal.getAsRegion()) {
				if (const ElementRegion *TheElementRegion =
				MRegion->getAs<ElementRegion>()) {
				RegionRawOffset Offset = TheElementRegion->getAsArrayOffset();
				if (const MemRegion *OffsetRegion = Offset.getRegion()) {
				if (const FieldRegion *TheFieldRegion =
				OffsetRegion->getAs<FieldRegion>())
				MRegion = TheFieldRegion->getBaseRegion();
				else
				MRegion = OffsetRegion;

				if (const DeclRegion *TheDeclRegion = MRegion->getAs<DeclRegion>()) {
				unsigned StorageTAlign = GetStorageAlign(TheDeclRegion->getDecl());
				CharUnits::QuantityType OffsetValue =
				Offset.getOffset().getQuantity();
				auto FinalStorageTAlign = StorageTAlign + OffsetValue;
				unsigned AddressAlign = FinalStorageTAlign % AllocatedTAlign;
				if (AddressAlign != 0) {
				EmitBadAlignReport(AddressAlign);

				return false;
				}
				}
				}
				} else if (const FieldRegion *TheFieldRegion =
				MRegion->getAs<FieldRegion>()) {
				MRegion = TheFieldRegion->getBaseRegion();

				if (!MRegion)
				return false;

				if (const VarRegion *TheVarRegion = MRegion->getAs<VarRegion>()) {
				const VarDecl *TheVarDecl = TheVarRegion->getDecl();

				unsigned StorageTAlign = GetStorageAlign(TheVarDecl);
				if (AllocatedTAlign > StorageTAlign) {
				EmitBadAlignReport(StorageTAlign);

				return false;
				}

				// We've checked type align but, unless FieldRegion offset is zero, we
				// also need to check its own align
				RegionOffset Offset = TheFieldRegion->getAsOffset();
				if (Offset.hasSymbolicOffset())
				return true;

				int64_t OffsetValue =
				Offset.getOffset() / C.getASTContext().getCharWidth();
				if (OffsetValue > 0) {
				unsigned AddressAlign = OffsetValue % AllocatedTAlign;
				if (AddressAlign != 0) {
				EmitBadAlignReport(AddressAlign);

				return false;
				}
				}
				}

				} else if (const VarRegion *TheVarRegion = MRegion->getAs<VarRegion>()) {
				const VarDecl *TheVarDecl = TheVarRegion->getDecl();
				unsigned StorageTAlign = GetStorageAlign(TheVarDecl);
				if (AllocatedTAlign > StorageTAlign) {
				EmitBadAlignReport(StorageTAlign);

				return false;
				}
				}
				}

				return true;
				}
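NoQ's suggestion above (decompose the region into a base region and an offset, then check both) can be sketched outside the analyzer as plain arithmetic. This is a minimal sketch, not the checker's code: `isProvablyAligned` and its parameters are hypothetical names, and it assumes the base alignment and the byte offset have already been computed and that both alignments are powers of two.

```cpp
#include <cstdint>

// Hypothetical helper, not part of the Clang Static Analyzer API.
// BaseAlign:     alignment guaranteed for the base storage, in bytes.
// OffsetBytes:   byte offset of the placement address from the base.
// RequiredAlign: alignment required by the allocated type, in bytes.
// Returns true only when alignment follows from these facts alone.
// Assumes BaseAlign and RequiredAlign are powers of two.
constexpr bool isProvablyAligned(std::uint64_t BaseAlign,
                                 std::uint64_t OffsetBytes,
                                 std::uint64_t RequiredAlign) {
  return BaseAlign >= RequiredAlign && OffsetBytes % RequiredAlign == 0;
}
```

Under these assumptions, `short b[10]` gives a base alignment of 2, so `::new (&b[3]) long` cannot be proven aligned for an 8-byte-aligned `long`, whatever the offset is.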

				void PlacementNewChecker::checkPreStmt(const CXXNewExpr *NE,
				CheckerContext &C) const {
				// Check only the default placement new.
				if (!NE->getOperatorNew()->isReservedGlobalPlacementOperator())
				return;

				if (NE->getNumPlacementArgs() == 0)
				return;

				if (!checkPlaceCapacityIsSufficient(NE, C))
				return;

				checkPlaceIsAlignedProperly(NE, C);
				martong (Not Done): Perhaps you could call instead `checkVarRegionAlign()`?
				martong (Not Done): Also, I think `BaseRegion` can be an `ElementRegion` here as well. So, in that case we should call into `checkElementRegionAlign()`, shouldn't we? This draws a pattern that we should recursively descend down to the top most base region. I.e. the different `checkRegionAlign` methods should call into each other until we reach the top level base region. The observation here is that the alignment of a region can be correct only if we can prove that its base region is aligned properly (and other requirements, e.g. the offset is divisible). But the base region may have another base region and we have to prove the alignment correctness to that as well. I hope this makes sense, please correct me if I am wrong.
				f00kat (author, Done):
				> Perhaps you could call instead checkVarRegionAlign()?
				Yeah, you are right. Thank you!
				> Also, I think BaseRegion can be an ElementRegion here as well. So, in that case we should call into checkElementRegionAlign(), shouldn't we? This draws a pattern that we should recursively descend down to the top most base region. I.e. the different checkRegionAlign methods should call into each other until we reach the top level base region.
				I'm not sure that BaseRegion can be an ElementRegion. There must always be some variable at the end, mustn't there? Or maybe I am wrong?
				> The observation here is that the alignment of a region can be correct only if we can prove that its base region is aligned properly (and other requirements, e.g. the offset is divisible). But the base region may have another base region and we have to prove the alignment correctness to that as well. I hope this makes sense, please correct me if I am wrong.
				I split the check into two stages:
				1. Check the BaseRegion alignment. But if the Var has its own alignment specifier, we ignore the BaseRegion alignment.
				2. Check that the total offset is divisible by the necessary alignment. By total offset I mean the offset calculated from the BaseRegion through all Field and Element regions, e.g. for Xi.a[10].b[20].c[30] the total offset of 'c[30]' is its offset from &Xi.
				So in this solution we do not need to recursively check the alignment of every region.
				}
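The "total offset" in f00kat's reply above can be illustrated with ordinary `offsetof` arithmetic. The sketch below is only an illustration under stated assumptions: the types `Inner`, `Mid`, `Outer` and the functions `totalOffset` and `twoStageCheck` are hypothetical, chosen to mirror the `Xi.a[10].b[20].c[30]` example, and the base alignment is simply taken to be `alignof(Outer)`. It does not use any analyzer API.

```cpp
#include <cstddef>

// Hypothetical standard-layout types mirroring Xi.a[I].b[J].c[K].
struct Inner { char tag; char c[40]; };
struct Mid   { char tag; Inner b[30]; };
struct Outer { long head; Mid a[20]; };

// Total byte offset of Xi.a[I].b[J].c[K] measured from &Xi, accumulated
// through every field and array element on the way down.
constexpr std::size_t totalOffset(std::size_t I, std::size_t J, std::size_t K) {
  return offsetof(Outer, a) + I * sizeof(Mid) +
         offsetof(Mid, b) + J * sizeof(Inner) +
         offsetof(Inner, c) + K * sizeof(char);
}

// Stage 1: the base object is aligned strictly enough for T.
// Stage 2: the total offset is divisible by T's alignment.
template <typename T>
constexpr bool twoStageCheck(std::size_t I, std::size_t J, std::size_t K) {
  return alignof(Outer) >= alignof(T) &&
         totalOffset(I, J, K) % alignof(T) == 0;
}
```

Because the offset is taken from the outermost object, one divisibility test covers every intermediate field and element, which is why no recursive per-region check is needed in this formulation.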

				void ento::registerPlacementNewChecker(CheckerManager &mgr) {
				mgr.registerChecker<PlacementNewChecker>();
				}

				bool ento::shouldRegisterPlacementNewChecker(const CheckerManager &mgr) {
				return true;
				}

clang/test/Analysis/placement-new.cpp

		struct Derived : Base {
		int y;
		};
		void f() {
		Base b; // expected-note {{'b' initialized here}}
		Derived *dp = ::new (&b) Derived; // expected-warning{{Storage provided to placement new is only 2 bytes, whereas the allocated type requires 8 bytes}} expected-note 1 {{}}
		(void)dp;
		}
		} // namespace testHierarchy

		namespace testArrayTypesAllocation {
		void f1() {
		struct S {
		short a;
		};

		// bad (not enough space).
		const unsigned N = 32;
		alignas(S) unsigned char buffer1[sizeof(S) * N]; // expected-note {{'buffer1' initialized here}}
		::new (buffer1) S[N]; // expected-warning{{Storage provided to placement new is only 64 bytes, whereas the allocated array type requires more space for internal needs}} expected-note 1 {{}}
		}

		void f2() {
		struct S {
		short a;
		};

		// maybe ok but we need to warn.
		const unsigned N = 32;
		alignas(S) unsigned char buffer2[sizeof(S) * N + sizeof(int)]; // expected-note {{'buffer2' initialized here}}
		::new (buffer2) S[N]; // expected-warning{{68 bytes is possibly not enough for array allocation which requires 64 bytes. Current overhead requires the size of 4 bytes}} expected-note 1 {{}}
		}
		} // namespace testArrayTypesAllocation

		namespace testStructAlign {
		void f1() {
		struct X {
		char a[9];
		} Xi; // expected-note {{'Xi' initialized here}}

		// bad (struct X is aligned to char).
		::new (&Xi.a) long; // expected-warning{{Storage type is aligned to 1 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f2() {
		struct X {
		char a;
		char b;
		long c;
		} Xi;

		// ok (struct X is aligned to long).
		::new (&Xi.a) long;
		}

		void f3() {
		struct X {
		char a;
		char b;
		long c;
		} Xi; // expected-note {{'Xi' initialized here}}

		// bad (struct X is aligned to long but field 'b' is aligned to 1 because of its offset)
		::new (&Xi.b) long; // expected-warning{{Storage type is aligned to 1 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f4() {
		struct X {
		char a;
		struct alignas(alignof(short)) Y {
		char b;
		char c;
		} y;
		long d;
		} Xi; // expected-note {{'Xi' initialized here}}

		// bad. 'b' is aligned to short
		::new (&Xi.y.b) long; // expected-warning{{Storage type is aligned to 2 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f5() {
		short b[10]; // expected-note {{'b' initialized here}}

		::new (&b) long; // expected-warning{{Storage type is aligned to 2 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f6() {
		short b[10]; // expected-note {{'b' initialized here}}

		// bad (same as previous but checks ElementRegion case)
		::new (&b[0]) long; // expected-warning{{Storage type is aligned to 2 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f7() {
		short b[10];

		// ok. 2(short align) + 3*2(index '1' offset)
		martong (Not Done): So these tests failed after you rewrote ElementRegion processing, right? Actually, I wonder why we thought that if x is divisible by 2 then (x+6) will be divisible by 8 unconditionally. It's good you have this fixed.
		f00kat (author, Done): Yes, it is my fault. For example, a variable can be allocated at address 0x149E730, which is well aligned to 2, 4, and 8. In that case the assertion that '0+6' is well aligned to '8' would be wrong.
		::new (&b[3]) long;
		}

		void f8() {
		short b[10]; // expected-note {{'b' initialized here}}

		// bad. 2(short align) + 2*2(index '2' offset)
		::new (&b[2]) long; // expected-warning{{Storage type is aligned to 6 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f9() {
		martong (Not Done): At first I was wondering whether we correctly handle structs with nested arrays whose element type is itself a struct with nested arrays (and so on). So I tried the test below, and it seems okay. I think it might be worth adding something similar.
			void f9_1() {
			  struct Y {
			    char a;
			    alignas(alignof(short)) char b[20];
			  };
			  struct X {
			    char e;
			    Y f[20];
			  } Xi; // expected-note {{'Xi' initialized here}}

			  // ok 2(custom align) + 61(index '6' offset)
			  ::new (&Xi.f[6].b[6]) long;

			  // bad 2(custom align) + 11(index '1' offset)
			  ::new (&Xi.f[1].b[1]) long; // expected-warning{{Storage type is aligned to 3 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
			}
		f00kat (author, Done): Thanks! Added tests for these cases.
		struct X {
		char a;
		alignas(alignof(short)) char b[20];
		} Xi; // expected-note {{'Xi' initialized here}}

		// ok 2(custom align) + 6(index '2' offset)
		::new (&Xi.b[6]) long;

		// bad 2(custom align) + 1(index '2' offset)
		martong (Not Done): Maybe it is just me, but the contents of the parens here and above seems a bit muddled `(index '2' offset)`. This should be `(index '1' offset)`, shouldn't it? What is the exact meaning of the number in the hyphens (`'2'` in this case), could you please elaborate?
		f00kat (author, Done): Yeah, sorry it is copy-paste error. Of course there should be '1'. `// bad 2(custom align) + 1(index '1' offset)`
		::new (&Xi.b[1]) long; // expected-warning{{Storage type is aligned to 3 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f10() {
		struct X {
		char a[8];
		alignas(2) char b;
		} Xi; // expected-note {{'Xi' initialized here}}

		// bad (struct X is aligned to 2).
		::new (&Xi.a) long; // expected-warning{{Storage type is aligned to 2 bytes but allocated type is aligned to 8 bytes}} expected-note 1 {{}}
		}

		void f11() {
		struct X {
		char a;
		char b;
		struct Y {
		long c;
		} d;
		} Xi;

		// ok (struct X is aligned to long).
		::new (&Xi.a) long;
		}

		void f12() {
		struct alignas(alignof(long)) X {
		char a;
		char b;
		} Xi;

		// ok (struct X is aligned to long).
		::new (&Xi.a) long;
		}
		} // namespace testStructAlign
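The exchange inside `f7` above turns on the difference between adding the storage alignment to the offset and reasoning about an actual address. The sketch below contrasts the two with the numbers quoted in that thread. It is an illustration only: `sumCheck` and `addressIsAligned` are hypothetical names, and `sumCheck` merely restates the `StorageTAlign + OffsetValue` arithmetic visible in this diff version rather than any final checker code.

```cpp
#include <cstdint>

// Restates the arithmetic from this diff version: the storage alignment is
// added to the offset and the sum is tested against the required alignment.
constexpr bool sumCheck(std::uint64_t StorageAlign, std::uint64_t Offset,
                        std::uint64_t RequiredAlign) {
  return (StorageAlign + Offset) % RequiredAlign == 0;
}

// Ground truth for one concrete address.
constexpr bool addressIsAligned(std::uint64_t Address,
                                std::uint64_t RequiredAlign) {
  return Address % RequiredAlign == 0;
}

// short b[10] placed at 0x149E730: the base happens to be 8-byte aligned,
// yet &b[3] sits 6 bytes later and is not.
static_assert(sumCheck(2, 6, 8), "the sum formula accepts &b[3]");
static_assert(addressIsAligned(0x149E730, 8), "base address is 8-aligned");
static_assert(!addressIsAligned(0x149E730 + 6, 8), "&b[3] is not 8-aligned");
```

The sum formula accepts `&b[3]` because 2 + 6 happens to equal 8, while the concrete address shows the placement is misaligned, which matches martong's observation that x being divisible by 2 says nothing about x + 6 being divisible by 8.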

[analyzer] Improve PlacementNewChecker
Closed, Public

Diff 253642

clang/lib/StaticAnalyzer/Checkers/CheckPlacementNew.cpp

clang/test/Analysis/placement-new.cpp
