This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
clang/
-
lib/CodeGen/
-
CodeGen/
7
CGDebugInfo.cpp
-
test/CodeGenObjC/
-
CodeGenObjC/
-
debug-info-ivars.m

Differential D51990

[DebugInfo] Fix emitting of bit offset for ObjC
AbandonedPublic

Authored by JDevlieghere on Sep 12 2018, 8:11 AM.

Download Raw Diff

Details

Reviewers

aprantl
rjmccall

Summary

We generate incorrect values for the DW_AT_data_bit_offset for interfaces in Objective-C. I can only speculate as to what we were trying to achieve by taking the modulo of the bit size with the byte size, but given that the size and offset is expressed in bits, this seems wrong.

Diff Detail

Repository: rC Clang

Event Timeline

JDevlieghere created this revision.Sep 12 2018, 8:11 AM

aprantl added a reviewer: rjmccall.Sep 12 2018, 8:46 AM

aprantl requested changes to this revision.Sep 12 2018, 8:52 AM

aprantl added inline comments.

clang/lib/CodeGen/CGDebugInfo.cpp
2369	It might help to attempt some git blame archeology. Judging from the comment, it sounds like the debugger is supposed to query the runtime for the byte offset and then add the bit offset from DWARF? Could that make sense?

This revision now requires changes to proceed.Sep 12 2018, 8:52 AM

aprantl added inline comments.Sep 12 2018, 8:53 AM

clang/lib/CodeGen/CGDebugInfo.cpp
2369	If that is the case, we'd need to relax llvm-dwarfdump --verify to accept this and make sure LLDB does the right thing instead.

aprantl removed a reviewer: dexonsmith.Sep 12 2018, 8:53 AM

aprantl added a project: debug-info.

JDevlieghere added inline comments.Sep 12 2018, 9:04 AM

clang/lib/CodeGen/CGDebugInfo.cpp
2369	Ah I see, yeah that sounds reasonable and explains the comment which I interpreted differently. Thanks!

JDevlieghere added inline comments.Sep 12 2018, 9:08 AM

clang/lib/CodeGen/CGDebugInfo.cpp
2369	btw it was added in rL167503.

aprantl added inline comments.Sep 12 2018, 9:44 AM

clang/lib/CodeGen/CGDebugInfo.cpp
2369	We should check whether emitting the offsets like this violates the DWARF spec. If yes, it may be better to emit the static offsets like you are doing here and then still have LLDB ignore everything but the bit-offsets from the dynamic byte offset.

JDevlieghere added inline comments.Sep 13 2018, 2:33 AM

clang/lib/CodeGen/CGDebugInfo.cpp
2369	The standard says The member entry corresponding to a data member that is defined in a structure, union or class may have either a DW_AT_data_member_location attribute or a DW_AT_data_bit_offset attribute. which to me sounds like they should be mutually exclusive. I ran the lldb test suite with my change and there were no new failures, which leads me to believe that the comment from r167503 still holds and lldb ignores this attribute, at least for Objective-C.

rjmccall added inline comments.Sep 16 2018, 11:49 PM

clang/lib/CodeGen/CGDebugInfo.cpp
2369	In Clang's current code-generation, the ivar offset variable stores the offset of the byte containing the first bit of the bit-field, which is not necessarily the same as the offset of the byte at which the bit-field's storage begins. This is why the debug info `%`s by the number of bits in a `char`: it's the correct bit-offset for computing where the bit-field begins relative to the value in the ivar offset variable. See the behavior of `CGObjCRuntime::EmitValueForIvarAtOffset`, which does the same `%` when computing the layout. It seems right to me that the `%` is done consistently within the compiler, rather than expecting LLDB to do it. That's the more future-proof design; for example, it would allow the compiler to emit a single ObjC ivar record for the combined bit-field storage and then emit debug info for the bit-fields at their appropriate relative bit-offsets, which could then be `>= 8`. This would arguably be a better code-generation scheme anyway, for a number of reasons — at the cost of breaking reflective access to the individual ivars, which I don't believe the runtime really supports anyway. Anyway, the upshot is that I don't think this patch is correct.

JDevlieghere abandoned this revision.Mar 7 2019, 12:52 PM

Herald added a project: Restricted Project. · View Herald TranscriptMar 7 2019, 12:52 PM

Herald added a subscriber: jdoerfert. · View Herald Transcript

Revision Contents

Path

Size

clang/

lib/

CodeGen/

CGDebugInfo.cpp

11 lines

test/

CodeGenObjC/

debug-info-ivars.m

51 lines

Diff 165093

clang/lib/CodeGen/CGDebugInfo.cpp

Show First 20 Lines • Show All 2,357 Lines • ▼ Show 20 Lines	if (!FType->isIncompleteArrayType()) {
FieldAlign = getTypeAlignIfRequired(FType, CGM.getContext());		FieldAlign = getTypeAlignIfRequired(FType, CGM.getContext());
}		}

uint64_t FieldOffset;		uint64_t FieldOffset;
if (CGM.getLangOpts().ObjCRuntime.isNonFragile()) {		if (CGM.getLangOpts().ObjCRuntime.isNonFragile()) {
// We don't know the runtime offset of an ivar if we're using the		// We don't know the runtime offset of an ivar if we're using the
// non-fragile ABI. For bitfields, use the bit offset into the first		// non-fragile ABI. For bitfields, use the bit offset into the first
// byte of storage of the bitfield. For other fields, use zero.		// byte of storage of the bitfield. For other fields, use zero.
if (Field->isBitField()) {
FieldOffset =		FieldOffset =
CGM.getObjCRuntime().ComputeBitfieldBitOffset(CGM, ID, Field);		Field->isBitField()
FieldOffset %= CGM.getContext().getCharWidth();		? CGM.getObjCRuntime().ComputeBitfieldBitOffset(CGM, ID, Field)
} else {		: 0;
		aprantlUnsubmitted Not Done Reply Inline Actions It might help to attempt some git blame archeology. Judging from the comment, it sounds like the debugger is supposed to query the runtime for the byte offset and then add the bit offset from DWARF? Could that make sense? aprantl: It might help to attempt some git blame archeology. Judging from the comment, it sounds like…
		aprantlUnsubmitted Not Done Reply Inline Actions If that is the case, we'd need to relax llvm-dwarfdump --verify to accept this and make sure LLDB does the right thing instead. aprantl: If that is the case, we'd need to relax llvm-dwarfdump --verify to accept this and make sure…
		JDevlieghereAuthorUnsubmitted Not Done Reply Inline Actions Ah I see, yeah that sounds reasonable and explains the comment which I interpreted differently. Thanks! JDevlieghere: Ah I see, yeah that sounds reasonable and explains the comment which I interpreted differently.
		JDevlieghereAuthorUnsubmitted Not Done Reply Inline Actions btw it was added in rL167503. JDevlieghere: btw it was added in rL167503.
		aprantlUnsubmitted Not Done Reply Inline Actions We should check whether emitting the offsets like this violates the DWARF spec. If yes, it may be better to emit the static offsets like you are doing here and then still have LLDB ignore everything but the bit-offsets from the dynamic byte offset. aprantl: We should check whether emitting the offsets like this violates the DWARF spec. If yes, it may…
		JDevlieghereAuthorUnsubmitted Not Done Reply Inline Actions The standard says The member entry corresponding to a data member that is defined in a structure, union or class may have either a DW_AT_data_member_location attribute or a DW_AT_data_bit_offset attribute. which to me sounds like they should be mutually exclusive. I ran the lldb test suite with my change and there were no new failures, which leads me to believe that the comment from r167503 still holds and lldb ignores this attribute, at least for Objective-C. JDevlieghere: The standard says > The member entry corresponding to a data member that is defined in a…
		rjmccallUnsubmitted Not Done Reply Inline Actions In Clang's current code-generation, the ivar offset variable stores the offset of the byte containing the first bit of the bit-field, which is not necessarily the same as the offset of the byte at which the bit-field's storage begins. This is why the debug info `%`s by the number of bits in a `char`: it's the correct bit-offset for computing where the bit-field begins relative to the value in the ivar offset variable. See the behavior of `CGObjCRuntime::EmitValueForIvarAtOffset`, which does the same `%` when computing the layout. It seems right to me that the `%` is done consistently within the compiler, rather than expecting LLDB to do it. That's the more future-proof design; for example, it would allow the compiler to emit a single ObjC ivar record for the combined bit-field storage and then emit debug info for the bit-fields at their appropriate relative bit-offsets, which could then be `>= 8`. This would arguably be a better code-generation scheme anyway, for a number of reasons — at the cost of breaking reflective access to the individual ivars, which I don't believe the runtime really supports anyway. Anyway, the upshot is that I don't think this patch is correct. rjmccall: In Clang's current code-generation, the ivar offset variable stores the offset of the byte…
FieldOffset = 0;
}
} else {		} else {
FieldOffset = RL.getFieldOffset(FieldNo);		FieldOffset = RL.getFieldOffset(FieldNo);
}		}

llvm::DINode::DIFlags Flags = llvm::DINode::FlagZero;		llvm::DINode::DIFlags Flags = llvm::DINode::FlagZero;
if (Field->getAccessControl() == ObjCIvarDecl::Protected)		if (Field->getAccessControl() == ObjCIvarDecl::Protected)
Flags = llvm::DINode::FlagProtected;		Flags = llvm::DINode::FlagProtected;
else if (Field->getAccessControl() == ObjCIvarDecl::Private)		else if (Field->getAccessControl() == ObjCIvarDecl::Private)
▲ Show 20 Lines • Show All 1,998 Lines • Show Last 20 Lines

clang/test/CodeGenObjC/debug-info-ivars.m

Show All 9 Lines	@interface BaseClass : NSObject
int i;		int i;
unsigned flag_1 : 9;		unsigned flag_1 : 9;
unsigned flag_2 : 9;		unsigned flag_2 : 9;
unsigned : 1;		unsigned : 1;
unsigned flag_3 : 9;		unsigned flag_3 : 9;
}		}
@end		@end

@implementation BaseClass		@implementation BaseClass
@end		@end

// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "i"		// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "i"
// CHECK-SAME: line: 10		// CHECK-SAME: line: 10
// CHECK-SAME: baseType: ![[INT:[0-9]+]]		// CHECK-SAME: baseType: ![[INT:[0-9]+]]
// CHECK-SAME: size: 32,		// CHECK-SAME: size: 32,
// CHECK-NOT: offset:		// CHECK-NOT: offset:
// CHECK-SAME: flags: DIFlagProtected		// CHECK-SAME: flags: DIFlagProtected
// CHECK: ![[INT]] = !DIBasicType(name: "int"		// CHECK: ![[INT]] = !DIBasicType(name: "int"
// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "flag_1"		// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "flag_1"
// CHECK-SAME: line: 11		// CHECK-SAME: line: 11
// CHECK-SAME: baseType: ![[UNSIGNED:[0-9]+]]		// CHECK-SAME: baseType: ![[UNSIGNED:[0-9]+]]
// CHECK-SAME: size: 9,		// CHECK-SAME: size: 9, offset: 96
// CHECK-NOT: offset:
// CHECK-SAME: flags: DIFlagProtected		// CHECK-SAME: flags: DIFlagProtected
// CHECK: ![[UNSIGNED]] = !DIBasicType(name: "unsigned int"		// CHECK: ![[UNSIGNED]] = !DIBasicType(name: "unsigned int"
// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "flag_2"		// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "flag_2"
// CHECK-SAME: line: 12		// CHECK-SAME: line: 12
// CHECK-SAME: baseType: ![[UNSIGNED]]		// CHECK-SAME: baseType: ![[UNSIGNED]]
// CHECK-SAME: size: 9, offset: 1,		// CHECK-SAME: size: 9, offset: 105
// CHECK-SAME: flags: DIFlagProtected		// CHECK-SAME: flags: DIFlagProtected
// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "flag_3"		// CHECK: !DIDerivedType(tag: DW_TAG_member, name: "flag_3"
// CHECK-SAME: line: 14		// CHECK-SAME: line: 14
// CHECK-SAME: baseType: ![[UNSIGNED]]		// CHECK-SAME: baseType: ![[UNSIGNED]]
// CHECK-SAME: size: 9, offset: 3,		// CHECK-SAME: size: 9, offset: 115
// CHECK-SAME: flags: DIFlagProtected		// CHECK-SAME: flags: DIFlagProtected