This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
include/llvm/Support/
-
llvm/
-
Support/
-
JSON.h
-
lib/Support/
-
Support/
-
JSON.cpp
-
tools/llvm-dwarfdump/
-
llvm-dwarfdump/
5/12
Statistics.cpp

Differential D109217

[llvm-dwarfdump] Fix unsigned overflow when calculating stats
ClosedPublic

Authored by djtodoro on Sep 3 2021, 2:44 AM.

Download Raw Diff

Details

Reviewers

dblaikie
akhuang
jhenderson
aprantl

Commits

rGc450e47a8c2d: [llvm-dwarfdump] Fix unsigned overflow when calculating stats

Summary

This fixes https://bugs.llvm.org/show_bug.cgi?id=51652.

The idea is to bump all the stat fields to 64-bit wide unsigned integers. I've confirmed this resolves the use case for chromium.

Diff Detail

Unit TestsFailed

	Time	Test
	0 ms	x64 windows > Clang.CodeGen/aarch64-sve-intrinsics::acle_sve_st1b.c
	0 ms	x64 windows > Clang.CodeGen/aarch64-sve-intrinsics::acle_sve_st1h.c
	0 ms	x64 windows > Clang.CodeGen/aarch64-sve-intrinsics::acle_sve_st1w.c

Event Timeline

djtodoro created this revision.Sep 3 2021, 2:44 AM

Herald added a reviewer: jhenderson. · View Herald TranscriptSep 3 2021, 2:44 AM

Herald added subscribers: dexonsmith, cmtice, hiraditya. · View Herald Transcript

djtodoro requested review of this revision.Sep 3 2021, 2:44 AM

Herald added subscribers: llvm-commits, MaskRay. · View Herald TranscriptSep 3 2021, 2:44 AM

Harbormaster completed remote builds in B122469: Diff 370519.Sep 3 2021, 3:17 AM

Orlando added a subscriber: Orlando.Sep 3 2021, 4:28 AM

Orlando added inline comments.

llvm/tools/llvm-dwarfdump/Statistics.cpp
380–381	I think this assert should come before the assignment and be something like this to catch a value that would "overflow" by wrapping: `assert(GlobalStats.ScopeBytesCovered + ScopeBytesCovered >= GlobalStats.ScopeBytesCovered && "ScopeBytesCovered - overflow");` Otherwise I don't think this assertion will ever catch anything, since all uint64_t values are <= UINT64_MAX.

Seems like a generally reasonabel direction forward.

llvm/tools/llvm-dwarfdump/Statistics.cpp
380–381	Yep! I think the more general assert probably looks like this: assert(x <= max - y) x += y;
386–387	When would this assert fire? If `ScopeEntryValueBytesCovered` is a uint64_t, it can't ever be > than the max uint64_t value. Checking for overflow would usually be done, I tihnk, with a check before the overflow: assert(x <= max - y) x += y;

no other comments, thanks for the fix!

• hafixo added a commit: rCRT373035: hwasan: Compatibility fixes for short granules..Sep 6 2021, 12:44 AM

• hafixo added a commit: rGc336557f0238: hwasan: Compatibility fixes for short granules..Sep 6 2021, 12:47 AM

djtodoro added inline comments.Sep 6 2021, 2:01 AM

llvm/tools/llvm-dwarfdump/Statistics.cpp
380–381	Oh yes, thanks

Could something ala:
https://gcc.gnu.org/onlinedocs/gcc/Integer-Overflow-Builtins.html
help?

In D109217#2984816, @tschuett wrote:

Could something ala:
https://gcc.gnu.org/onlinedocs/gcc/Integer-Overflow-Builtins.html
help?

Probably enough to write the portable code (so it works on MSVC too, etc) and let the compiler optimize it. At least for this saturation code, Clang produces the same with or without the intrinsic (& GCC produces something else entirely - to itself and to clang): https://godbolt.org/z/jb3nnaKvT

In D109217#2985644, @dblaikie wrote:

In D109217#2984816, @tschuett wrote:

Could something ala:
https://gcc.gnu.org/onlinedocs/gcc/Integer-Overflow-Builtins.html
help?

Probably enough to write the portable code (so it works on MSVC too, etc) and let the compiler optimize it. At least for this saturation code, Clang produces the same with or without the intrinsic (& GCC produces something else entirely - to itself and to clang): https://godbolt.org/z/jb3nnaKvT

Maybe LLVM should learn saturating integers that assert in debug mode?

In D109217#2985647, @tschuett wrote:

In D109217#2985644, @dblaikie wrote:

In D109217#2984816, @tschuett wrote:

Could something ala:
https://gcc.gnu.org/onlinedocs/gcc/Integer-Overflow-Builtins.html
help?

Probably enough to write the portable code (so it works on MSVC too, etc) and let the compiler optimize it. At least for this saturation code, Clang produces the same with or without the intrinsic (& GCC produces something else entirely - to itself and to clang): https://godbolt.org/z/jb3nnaKvT

Maybe LLVM should learn saturating integers that assert in debug mode?

Perhaps. (though anything that asserts in debug mode should basically be UB in non-debug mode, in my opinion - if you aren't testing/using the functionality, it shouldn't be defined)

For ints we've got UBSan, but unsigned ints are defined on wrap. There's no unsigned type that's UB on overflow - certainly might be nice to have them to clarify the difference between a think you want to do weird bitfiddling with and expect all the overflow, etc, and a thing that's meant to do maths and where sanitizers could diagnose overflow, etc.

But I think a couple of manual overflow checks here is probably OK - might be worth putting it in a generic function and applying it to all the statistics to make things more robust/generic.

For ints we've got UBSan, but unsigned ints are defined on wrap. There's no unsigned type that's UB on overflow - certainly might be nice to have them to clarify the difference between a think you want to do weird bitfiddling with and expect all the overflow, etc, and a thing that's meant to do maths and where sanitizers could diagnose overflow, etc.

-fsanitize=unsigned-integer-overflow ?

In D109217#2985680, @xbolva00 wrote:

For ints we've got UBSan, but unsigned ints are defined on wrap. There's no unsigned type that's UB on overflow - certainly might be nice to have them to clarify the difference between a think you want to do weird bitfiddling with and expect all the overflow, etc, and a thing that's meant to do maths and where sanitizers could diagnose overflow, etc.

-fsanitize=unsigned-integer-overflow ?

Oh, right, we do have that :) (but no doubt LLVM isn't remotely clean of failures for it)

djtodoro mentioned this in D109347: [JSON] Handle uint64_t type.Sep 7 2021, 1:38 AM

djtodoro added a parent revision: D109347: [JSON] Handle uint64_t type.

add a test
bump the stats version
address the comments (I think that these few asserts are enough for this)
split JSON part into a separate patch

djtodoro retitled this revision from [NOT FOR COMMIT] [llvm-dwarfdump] Fix unsigned overflow when calculating stats to [llvm-dwarfdump] Fix unsigned overflow when calculating stats.Sep 7 2021, 1:49 AM

Harbormaster completed remote builds in B122833: Diff 371007.Sep 7 2021, 2:32 AM

Just one more inline question from me, but I will defer to the other reviewers for the rest & approval. Thanks for fixing this.

llvm/tools/llvm-dwarfdump/Statistics.cpp
760–774	I'm not sure how much this matters but looking at the comment above I don't think a version bump is necessary for this patch? Any input that didn't trip the assertions previously will still have the same output with the patch applied.

thopre removed a commit: rGc336557f0238: hwasan: Compatibility fixes for short granules..Sep 7 2021, 2:47 AM

thopre removed a commit: rCRT373035: hwasan: Compatibility fixes for short granules..Sep 7 2021, 2:51 AM

djtodoro added inline comments.Sep 7 2021, 3:28 AM

llvm/tools/llvm-dwarfdump/Statistics.cpp
760–774	I think since this is a bug fix we should bump it -- e.g., a stat number could have been 0 (as a consequence of the bug), and now it will be 2^32 for example, right? I think that this is the purpose behind the stats version.

Let me revisit the saturating integer without asserts. If you print 5 for an uint32_t, you will never know whether it overflowed never or 10 times (in release mode).

A saturating integer will print 5 or max int (saturated).

Even an SaturatingUint<uint64_t> shouldn't yield too much overhead.

In D109217#2987177, @tschuett wrote:

Let me revisit the saturating integer without asserts. If you print 5 for an uint32_t, you will never know whether it overflowed never or 10 times (in release mode).

A saturating integer will print 5 or max int (saturated).

Even an SaturatingUint<uint64_t> shouldn't yield too much overhead.

Hmm, is there an implementation of the SaturatingUint, or do we need to implement such type?

Not at the moment. I just wanted to pitch the idea of having a saturating integer in LLVM.

I think we can implement it here, and it will be useful/safe.
The question is if that should be implemented as a general thing in LLVM.

It seems like the down side of using asserts to detect the "overflow" (the patch's current approach) is that release-config users may still get misleading stats due to wrapping. Implementing a saturating int in and of itself doesn't seem like a full solution, since a user may not notice that the stat is the saturated value, or even know that it's special, especially if the stats are consumed by another tool.

IMO when the stat cannot be computed properly - however the detection is implemented, either with saturating ints, checks like the ones in the asserts, or something else - a good solution for users would be to print a message, and either skip printing the "bad" stats or all of them. That would be consistent for all build configurations and avoid hiding the issue. What do you think? Sorry if I'm just stating the obvious!

llvm/tools/llvm-dwarfdump/Statistics.cpp
760–774	Sounds reasonable (I wasn't thinking about release mode when I made that comment).

You will always know whether it is max int or max int because of saturating behaviour.

class SatUint32 {
  uint32_t value;
  bool overflowed;
}

The saturating integer class would use the builtins I mentioned above to perform arithmetic operations on value and detect overflow and set overflow to true.

I'd just saturate to max int, and use the max int value to indicate overflow. Shaving one value off to represent the overflow state seems fine to me.

That is fine be me. I guess the point is a save way to collect statistics and give guidance to users when the results could be bad, in release and debug mode. I would argue that saturating integers are different and maybe more precise solution than going from uint32_t to uint64_t ...

In D109217#2992699, @tschuett wrote:

That is fine be me. I guess the point is a save way to collect statistics and give guidance to users when the results could be bad, in release and debug mode. I would argue that saturating integers are different and maybe more precise solution than going from uint32_t to uint64_t ...

Well, both, probably - support use cases that weren't supported before (binaries that were too large to fit in the existing stats) and, separately/additionally, some way of reporting overflow rather than reporting bogus values.

Introduce the SaturatingUINT64

In D109217#2992694, @dblaikie wrote:

I'd just saturate to max int, and use the max int value to indicate overflow. Shaving one value off to represent the overflow state seems fine to me.

+1, do not need Overflow flag.

Harbormaster completed remote builds in B123657: Diff 372234.Sep 13 2021, 7:24 AM

-remove the isOverflow field

Harbormaster completed remote builds in B124372: Diff 373193.Sep 17 2021, 5:37 AM

dblaikie added inline comments.Sep 17 2021, 9:03 AM

llvm/tools/llvm-dwarfdump/Statistics.cpp
71	Personally, I think once we've defined the behavior on overflow, we shouldn't assert that overflow doesn't happen - this undermines the concept of having defined behavior on overflow (& makes it somewhat harder to test - since that behavior can now only be tested in a non-asserts build (well, I guess since assert isn't UB-if-false, and the assert is after the warning, that's not the case, but it's a bit subtle)). Also, might it make more sense to do the warning/etc on the final use/printing out of the statistic instead? (I guess that's difficult because some statistics are derived from others? - so catching it the moment it overflows means it'll always be diagnosed only once, rather than multiple times due to multiple uses?)

djtodoro added inline comments.Sep 20 2021, 8:15 AM

llvm/tools/llvm-dwarfdump/Statistics.cpp
71	Thanks for the suggestions. I totally agree. I guess that it makes sense to report the warning when printing the overflowed value, since we can point to the specific field.

addressing comments
- now the warning looks as follows:

"#call site DIEs": N (llvm-dwarfdump: warning: this field overflows),

Harbormaster completed remote builds in B124671: Diff 373595.Sep 20 2021, 8:25 AM

In D109217#3009505, @djtodoro wrote:
addressing comments

now the warning looks as follows:
"#call site DIEs": N (llvm-dwarfdump: warning: this field overflows),

Maybe we could render it symbolically and just say:

"#call site DIEs": >= 9223372036854775807

But yeah, maybe the warning is more suitable, not sure - I'll leave it up to you folks to decide what's best.

@cmtice @rdhindsa - might be handy if you folks are aware of this in terms of quirks when encoding the stats from our internal analysis pipelines, in case the format chosen here needs to be taken into account for how to render out of range data.

In D109217#3011103, @dblaikie wrote:
In D109217#3009505, @djtodoro wrote:
addressing comments

now the warning looks as follows:
"#call site DIEs": N (llvm-dwarfdump: warning: this field overflows),
Maybe we could render it symbolically and just say:
"#call site DIEs": >= 9223372036854775807
But yeah, maybe the warning is more suitable, not sure - I'll leave it up to you folks to decide what's best.

I'd prefer the warning since it will be easier when parsing the JSON data from utilities such as llvm-locstats.

djtodoro added a child revision: D110621: [llvm-locstats] Report a warning if overflow was detected by llvm-dwarfdump.Sep 28 2021, 5:10 AM

ping :)

dblaikie added inline comments.Oct 5 2021, 12:15 PM

llvm/test/tools/llvm-dwarfdump/X86/locstats-bytes-overflow.yaml
24 ↗	(On Diff #373595)	I think it'd be worth CHECKing the specific/full syntax, rather than just "this warning text appears somewhere in the output" - since we're specifically putting it in the output in a particular place. Hmm, that raises a question: is this warning going to stderr, but the actual stats output is to stdout? If so then I think that's a different problem. Maybe that answers one of my other questions though when I suggested printing the output as "> max int" - https://reviews.llvm.org/D109217#3012175 - is that what you meant in this comment? That the value in the JSON data doesn't have any mention of the warning/overflow, and only has the saturated integer value? I worry that's error-prone though, since the value is incorrect and a tool might not be aware of that. So I think it may be valuable to ensure we don't encode a valid value/something that could be mistaken for a valid value in the field when it's overflowed? JSON supports the value being a string, I assume - so perhaps a string representation of ">= max int" or "overflowed" or something would be suitable?
llvm/tools/llvm-dwarfdump/Statistics.cpp
51–56	Maybe just implement this in terms of the other:

djtodoro added inline comments.Oct 7 2021, 1:18 AM

llvm/test/tools/llvm-dwarfdump/X86/locstats-bytes-overflow.yaml
24 ↗	(On Diff #373595)	Yep, that was my concern. It was going to stderr, but the JSON data goes to stdout. JSON supports the value being a string, I assume - so perhaps a string representation of ">= max int" or "overflowed" or something would be suitable? Hmm, I agree... using some special value will make this output ready for the tools from outside. I'll update that.
llvm/tools/llvm-dwarfdump/Statistics.cpp
51–56	yes, thanks

introduce "overflow" special stats value for the fields that overflow

Harbormaster completed remote builds in B127455: Diff 377759.Oct 7 2021, 3:52 AM

This looks alright to me - though I'll leave it to toher folks with more statistics interest to provide final approval.

(@cmtice @rdhindsa - something to be aware of that might crop-up in google uses of the statistics infrastructure, I'd expect)

In D109217#3048505, @dblaikie wrote:

This looks alright to me - though I'll leave it to toher folks with more statistics interest to provide final approval.

(@cmtice @rdhindsa - something to be aware of that might crop-up in google uses of the statistics infrastructure, I'd expect)

Sure, thanks for your comments!

djtodoro added a reviewer: aprantl.Oct 7 2021, 1:39 PM

I think this is reasonable, out of curiosity, would there be a benefit to using APInt? Probably not because 64 bits is already huge...

In D109217#3049369, @aprantl wrote:

I think this is reasonable, out of curiosity, would there be a benefit to using APInt? Probably not because 64 bits is already huge...

I guess we can use APInt as well, but the 64 bits seem enough for now. A challenge would be to teach all the front-ends how to parse the big numbers (>64bits), but it is achievable.

Ok. Works for me.

llvm/tools/llvm-dwarfdump/Statistics.cpp
53	https://en.cppreference.com/w/cpp/types/numeric_limits/max ?

This revision is now accepted and ready to land.Oct 11 2021, 10:01 AM

Thanks!

-use the std::numeric_limits<uint64_t>::max()

Harbormaster completed remote builds in B128288: Diff 378908.Oct 12 2021, 1:43 AM

djtodoro mentioned this in rG8c3adce81dc3: [JSON] Handle uint64_t type.Oct 15 2021, 2:19 AM

Closed by commit rGc450e47a8c2d: [llvm-dwarfdump] Fix unsigned overflow when calculating stats (authored by djtodoro). · Explain WhyOct 15 2021, 3:16 AM

This revision was automatically updated to reflect the committed changes.

djtodoro added a commit: rGc450e47a8c2d: [llvm-dwarfdump] Fix unsigned overflow when calculating stats.

Revision Contents

Path

Size

llvm/

include/

llvm/

Support/

JSON.h

10 lines

lib/

Support/

JSON.cpp

3 lines

tools/

llvm-dwarfdump/

Statistics.cpp

136 lines

Diff 370519

llvm/include/llvm/Support/JSON.h

Show First 20 Lines • Show All 342 Lines • ▼ Show 20 Lines	Value(T B) : Type(T_Boolean) {
create<bool>(B);		create<bool>(B);
}		}
// Integers (except boolean). Must be non-narrowing convertible to int64_t.		// Integers (except boolean). Must be non-narrowing convertible to int64_t.
template <typename T, typename = std::enable_if_t<std::is_integral<T>::value>,		template <typename T, typename = std::enable_if_t<std::is_integral<T>::value>,
typename = std::enable_if_t<!std::is_same<T, bool>::value>>		typename = std::enable_if_t<!std::is_same<T, bool>::value>>
Value(T I) : Type(T_Integer) {		Value(T I) : Type(T_Integer) {
create<int64_t>(int64_t{I});		create<int64_t>(int64_t{I});
}		}
		// Unsigned 64-bit long integers.
		Value(uint64_t V) : Type(T_UINT64) {
		Lint: Pre-merge checks Inline Actions clang-format: please reformat the code - Value(uint64_t V) : Type(T_UINT64) { - create<uint64_t>(uint64_t{V}); - } + Value(uint64_t V) : Type(T_UINT64) { create<uint64_t>(uint64_t{V}); } Lint: Pre-merge checks: clang-format: please reformat the code ``` - Value(uint64_t V) : Type(T_UINT64) {…
		create<uint64_t>(uint64_t{V});
		}
// Floating point. Must be non-narrowing convertible to double.		// Floating point. Must be non-narrowing convertible to double.
template <typename T,		template <typename T,
typename = std::enable_if_t<std::is_floating_point<T>::value>,		typename = std::enable_if_t<std::is_floating_point<T>::value>,
double * = nullptr>		double * = nullptr>
Value(T D) : Type(T_Double) {		Value(T D) : Type(T_Double) {
create<double>(double{D});		create<double>(double{D});
}		}
// Serializable types: with a toJSON(const T&)->Value function, found by ADL.		// Serializable types: with a toJSON(const T&)->Value function, found by ADL.
Show All 18 Lines	public:
Kind kind() const {		Kind kind() const {
switch (Type) {		switch (Type) {
case T_Null:		case T_Null:
return Null;		return Null;
case T_Boolean:		case T_Boolean:
return Boolean;		return Boolean;
case T_Double:		case T_Double:
case T_Integer:		case T_Integer:
		case T_UINT64:
return Number;		return Number;
case T_String:		case T_String:
case T_StringRef:		case T_StringRef:
return String;		return String;
case T_Object:		case T_Object:
return Object;		return Object;
case T_Array:		case T_Array:
return Array;		return Array;
Show All 12 Lines	if (LLVM_LIKELY(Type == T_Boolean))
return as<bool>();		return as<bool>();
return llvm::None;		return llvm::None;
}		}
llvm::Optional<double> getAsNumber() const {		llvm::Optional<double> getAsNumber() const {
if (LLVM_LIKELY(Type == T_Double))		if (LLVM_LIKELY(Type == T_Double))
return as<double>();		return as<double>();
if (LLVM_LIKELY(Type == T_Integer))		if (LLVM_LIKELY(Type == T_Integer))
return as<int64_t>();		return as<int64_t>();
		if (LLVM_LIKELY(Type == T_UINT64))
		return as<uint64_t>();
return llvm::None;		return llvm::None;
}		}
// Succeeds if the Value is a Number, and exactly representable as int64_t.		// Succeeds if the Value is a Number, and exactly representable as int64_t.
llvm::Optional<int64_t> getAsInteger() const {		llvm::Optional<int64_t> getAsInteger() const {
if (LLVM_LIKELY(Type == T_Integer))		if (LLVM_LIKELY(Type == T_Integer))
return as<int64_t>();		return as<int64_t>();
		if (LLVM_LIKELY(Type == T_UINT64))
		return as<uint64_t>();
if (LLVM_LIKELY(Type == T_Double)) {		if (LLVM_LIKELY(Type == T_Double)) {
double D = as<double>();		double D = as<double>();
if (LLVM_LIKELY(std::modf(D, &D) == 0.0 &&		if (LLVM_LIKELY(std::modf(D, &D) == 0.0 &&
D >= double(std::numeric_limits<int64_t>::min()) &&		D >= double(std::numeric_limits<int64_t>::min()) &&
D <= double(std::numeric_limits<int64_t>::max())))		D <= double(std::numeric_limits<int64_t>::max())))
return D;		return D;
}		}
return llvm::None;		return llvm::None;
Show All 40 Lines	private:

friend class OStream;		friend class OStream;

enum ValueType : char {		enum ValueType : char {
T_Null,		T_Null,
T_Boolean,		T_Boolean,
T_Double,		T_Double,
T_Integer,		T_Integer,
		T_UINT64,
T_StringRef,		T_StringRef,
T_String,		T_String,
T_Object,		T_Object,
T_Array,		T_Array,
};		};
// All members mutable, see moveFrom().		// All members mutable, see moveFrom().
mutable ValueType Type;		mutable ValueType Type;
mutable llvm::AlignedCharArrayUnion<bool, double, int64_t, llvm::StringRef,		mutable llvm::AlignedCharArrayUnion<bool, double, int64_t, llvm::StringRef,
▲ Show 20 Lines • Show All 528 Lines • Show Last 20 Lines

llvm/lib/Support/JSON.cpp

	Show First 20 Lines • Show All 103 Lines • ▼ Show 20 Lines

	void Value::copyFrom(const Value &M) {			void Value::copyFrom(const Value &M) {
	Type = M.Type;			Type = M.Type;
	switch (Type) {			switch (Type) {
	case T_Null:			case T_Null:
	case T_Boolean:			case T_Boolean:
	case T_Double:			case T_Double:
	case T_Integer:			case T_Integer:
				case T_UINT64:
	memcpy(&Union, &M.Union, sizeof(Union));			memcpy(&Union, &M.Union, sizeof(Union));
	break;			break;
	case T_StringRef:			case T_StringRef:
	create<StringRef>(M.as<StringRef>());			create<StringRef>(M.as<StringRef>());
	break;			break;
	case T_String:			case T_String:
	create<std::string>(M.as<std::string>());			create<std::string>(M.as<std::string>());
	break;			break;
	case T_Object:			case T_Object:
	create<json::Object>(M.as<json::Object>());			create<json::Object>(M.as<json::Object>());
	break;			break;
	case T_Array:			case T_Array:
	create<json::Array>(M.as<json::Array>());			create<json::Array>(M.as<json::Array>());
	break;			break;
	}			}
	}			}

	void Value::moveFrom(const Value &&M) {			void Value::moveFrom(const Value &&M) {
	Type = M.Type;			Type = M.Type;
	switch (Type) {			switch (Type) {
	case T_Null:			case T_Null:
	case T_Boolean:			case T_Boolean:
	case T_Double:			case T_Double:
	case T_Integer:			case T_Integer:
				case T_UINT64:
	memcpy(&Union, &M.Union, sizeof(Union));			memcpy(&Union, &M.Union, sizeof(Union));
	break;			break;
	case T_StringRef:			case T_StringRef:
	create<StringRef>(M.as<StringRef>());			create<StringRef>(M.as<StringRef>());
	break;			break;
	case T_String:			case T_String:
	create<std::string>(std::move(M.as<std::string>()));			create<std::string>(std::move(M.as<std::string>()));
	M.Type = T_Null;			M.Type = T_Null;
	Show All 10 Lines
	}			}

	void Value::destroy() {			void Value::destroy() {
	switch (Type) {			switch (Type) {
	case T_Null:			case T_Null:
	case T_Boolean:			case T_Boolean:
	case T_Double:			case T_Double:
	case T_Integer:			case T_Integer:
				case T_UINT64:
	break;			break;
	case T_StringRef:			case T_StringRef:
	as<StringRef>().~StringRef();			as<StringRef>().~StringRef();
	break;			break;
	case T_String:			case T_String:
	as<std::string>().~basic_string();			as<std::string>().~basic_string();
	break;			break;
	case T_Object:			case T_Object:
	▲ Show 20 Lines • Show All 742 Lines • Show Last 20 Lines

llvm/tools/llvm-dwarfdump/Statistics.cpp

Show All 34 Lines

using AbstractOriginVarsTyMap = llvm::DenseMap<uint64_t, AbstractOriginVarsTy>; using AbstractOriginVarsTyMap = llvm::DenseMap<uint64_t, AbstractOriginVarsTy>;

/// This represents function DIE offsets containing an abstract_origin. /// This represents function DIE offsets containing an abstract_origin.

using FunctionsWithAbstractOriginTy = llvm::SmallVector<uint64_t>; using FunctionsWithAbstractOriginTy = llvm::SmallVector<uint64_t>;

/// Holds statistics for one function (or other entity that has a PC range and /// Holds statistics for one function (or other entity that has a PC range and

/// contains variables, such as a compile unit). /// contains variables, such as a compile unit).

struct PerFunctionStats { struct PerFunctionStats {

/// Number of inlined instances of this function. /// Number of inlined instances of this function.

unsigned NumFnInlined = 0; uint64_t NumFnInlined = 0;

/// Number of out-of-line instances of this function. /// Number of out-of-line instances of this function.

unsigned NumFnOutOfLine = 0; uint64_t NumFnOutOfLine = 0;

/// Number of inlined instances that have abstract origins. /// Number of inlined instances that have abstract origins.

unsigned NumAbstractOrigins = 0; uint64_t NumAbstractOrigins = 0;

/// Number of variables and parameters with location across all inlined /// Number of variables and parameters with location across all inlined

/// instances. /// instances.

unsigned TotalVarWithLoc = 0; uint64_t TotalVarWithLoc = 0;

/// Number of constants with location across all inlined instances. /// Number of constants with location across all inlined instances.

unsigned ConstantMembers = 0; uint64_t ConstantMembers = 0;

/// Number of arificial variables, parameters or members across all instances. /// Number of arificial variables, parameters or members across all instances.

aprantlUnsubmitted

Done

https://en.cppreference.com/w/cpp/types/numeric_limits/max ?

aprantl: https://en.cppreference.com/w/cpp/types/numeric_limits/max ?

unsigned NumArtificial = 0; uint64_t NumArtificial = 0;

/// List of all Variables and parameters in this function. /// List of all Variables and parameters in this function.

StringSet<> VarsInFunction; StringSet<> VarsInFunction;

dblaikieUnsubmitted

Not Done

void operator++(int) {

- if (Value != UINT64_MAX) {

- if (Value < UINT64_MAX - 1)

- ++Value;

- else

- Value = UINT64_MAX;

- }

+ *this += 1;

}

void operator+=(uint64_t Value_) {

Maybe just implement this in terms of the other:

dblaikie: Maybe just implement this in terms of the other:

djtodoroAuthorUnsubmitted

Done

yes, thanks

djtodoro: yes, thanks

/// Compile units also cover a PC range, but have this flag set to false. /// Compile units also cover a PC range, but have this flag set to false.

bool IsFunction = false; bool IsFunction = false;

/// Function has source location information. /// Function has source location information.

bool HasSourceLocation = false; bool HasSourceLocation = false;

/// Number of function parameters. /// Number of function parameters.

unsigned NumParams = 0; uint64_t NumParams = 0;

/// Number of function parameters with source location. /// Number of function parameters with source location.

unsigned NumParamSourceLocations = 0; uint64_t NumParamSourceLocations = 0;

/// Number of function parameters with type. /// Number of function parameters with type.

unsigned NumParamTypes = 0; uint64_t NumParamTypes = 0;

/// Number of function parameters with a DW_AT_location. /// Number of function parameters with a DW_AT_location.

unsigned NumParamLocations = 0; uint64_t NumParamLocations = 0;

/// Number of local variables. /// Number of local variables.

unsigned NumLocalVars = 0; uint64_t NumLocalVars = 0;

/// Number of local variables with source location. /// Number of local variables with source location.

dblaikieUnsubmitted

Not Done

Personally, I think once we've defined the behavior on overflow, we shouldn't assert that overflow doesn't happen - this undermines the concept of having defined behavior on overflow (& makes it somewhat harder to test - since that behavior can now only be tested in a non-asserts build (well, I guess since assert isn't UB-if-false, and the assert is after the warning, that's not the case, but it's a bit subtle)).

Also, might it make more sense to do the warning/etc on the final use/printing out of the statistic instead? (I guess that's difficult because some statistics are derived from others? - so catching it the moment it overflows means it'll always be diagnosed only once, rather than multiple times due to multiple uses?)

dblaikie: Personally, I think once we've defined the behavior on overflow, we shouldn't assert that…

djtodoroAuthorUnsubmitted

Done

Thanks for the suggestions. I totally agree.

I guess that it makes sense to report the warning when printing the overflowed value, since we can point to the specific field.

djtodoro: Thanks for the suggestions. I totally agree. I guess that it makes sense to report the warning…

unsigned NumLocalVarSourceLocations = 0; uint64_t NumLocalVarSourceLocations = 0;

/// Number of local variables with type. /// Number of local variables with type.

unsigned NumLocalVarTypes = 0; uint64_t NumLocalVarTypes = 0;

/// Number of local variables with DW_AT_location. /// Number of local variables with DW_AT_location.

unsigned NumLocalVarLocations = 0; uint64_t NumLocalVarLocations = 0;

}; };

/// Holds accumulated global statistics about DIEs. /// Holds accumulated global statistics about DIEs.

struct GlobalStats { struct GlobalStats {

/// Total number of PC range bytes covered by DW_AT_locations. /// Total number of PC range bytes covered by DW_AT_locations.

unsigned TotalBytesCovered = 0; uint64_t TotalBytesCovered = 0;

/// Total number of parent DIE PC range bytes covered by DW_AT_Locations. /// Total number of parent DIE PC range bytes covered by DW_AT_Locations.

unsigned ScopeBytesCovered = 0; uint64_t ScopeBytesCovered = 0;

/// Total number of PC range bytes in each variable's enclosing scope. /// Total number of PC range bytes in each variable's enclosing scope.

unsigned ScopeBytes = 0; uint64_t ScopeBytes = 0;

/// Total number of PC range bytes covered by DW_AT_locations with /// Total number of PC range bytes covered by DW_AT_locations with

/// the debug entry values (DW_OP_entry_value). /// the debug entry values (DW_OP_entry_value).

unsigned ScopeEntryValueBytesCovered = 0; uint64_t ScopeEntryValueBytesCovered = 0;

/// Total number of PC range bytes covered by DW_AT_locations of /// Total number of PC range bytes covered by DW_AT_locations of

/// formal parameters. /// formal parameters.

unsigned ParamScopeBytesCovered = 0; uint64_t ParamScopeBytesCovered = 0;

/// Total number of PC range bytes in each parameter's enclosing scope. /// Total number of PC range bytes in each parameter's enclosing scope.

unsigned ParamScopeBytes = 0; uint64_t ParamScopeBytes = 0;

/// Total number of PC range bytes covered by DW_AT_locations with /// Total number of PC range bytes covered by DW_AT_locations with

/// the debug entry values (DW_OP_entry_value) (only for parameters). /// the debug entry values (DW_OP_entry_value) (only for parameters).

unsigned ParamScopeEntryValueBytesCovered = 0; uint64_t ParamScopeEntryValueBytesCovered = 0;

/// Total number of PC range bytes covered by DW_AT_locations (only for local /// Total number of PC range bytes covered by DW_AT_locations (only for local

/// variables). /// variables).

unsigned LocalVarScopeBytesCovered = 0; uint64_t LocalVarScopeBytesCovered = 0;

/// Total number of PC range bytes in each local variable's enclosing scope. /// Total number of PC range bytes in each local variable's enclosing scope.

unsigned LocalVarScopeBytes = 0; uint64_t LocalVarScopeBytes = 0;

/// Total number of PC range bytes covered by DW_AT_locations with /// Total number of PC range bytes covered by DW_AT_locations with

/// the debug entry values (DW_OP_entry_value) (only for local variables). /// the debug entry values (DW_OP_entry_value) (only for local variables).

unsigned LocalVarScopeEntryValueBytesCovered = 0; uint64_t LocalVarScopeEntryValueBytesCovered = 0;

/// Total number of call site entries (DW_AT_call_file & DW_AT_call_line). /// Total number of call site entries (DW_AT_call_file & DW_AT_call_line).

unsigned CallSiteEntries = 0; uint64_t CallSiteEntries = 0;

/// Total number of call site DIEs (DW_TAG_call_site). /// Total number of call site DIEs (DW_TAG_call_site).

unsigned CallSiteDIEs = 0; uint64_t CallSiteDIEs = 0;

/// Total number of call site parameter DIEs (DW_TAG_call_site_parameter). /// Total number of call site parameter DIEs (DW_TAG_call_site_parameter).

unsigned CallSiteParamDIEs = 0; uint64_t CallSiteParamDIEs = 0;

/// Total byte size of concrete functions. This byte size includes /// Total byte size of concrete functions. This byte size includes

/// inline functions contained in the concrete functions. /// inline functions contained in the concrete functions.

unsigned FunctionSize = 0; uint64_t FunctionSize = 0;

/// Total byte size of inlined functions. This is the total number of bytes /// Total byte size of inlined functions. This is the total number of bytes

/// for the top inline functions within concrete functions. This can help /// for the top inline functions within concrete functions. This can help

/// tune the inline settings when compiling to match user expectations. /// tune the inline settings when compiling to match user expectations.

unsigned InlineFunctionSize = 0; uint64_t InlineFunctionSize = 0;

}; };

/// Holds accumulated debug location statistics about local variables and /// Holds accumulated debug location statistics about local variables and

/// formal parameters. /// formal parameters.

struct LocationStats { struct LocationStats {

/// Map the scope coverage decile to the number of variables in the decile. /// Map the scope coverage decile to the number of variables in the decile.

/// The first element of the array (at the index zero) represents the number /// The first element of the array (at the index zero) represents the number

/// of variables with the no debug location at all, but the last element /// of variables with the no debug location at all, but the last element

/// in the vector represents the number of fully covered variables within /// in the vector represents the number of fully covered variables within

/// its scope. /// its scope.

std::vector<unsigned> VarParamLocStats{ std::vector<uint64_t> VarParamLocStats{