This is an archive of the discontinued LLVM Phabricator instance.

Differential D12079

[MIPS] microMIPS breakpoints, disassembly and compressed addresses
ClosedPublic

Authored by jaydeep on Aug 17 2015, 4:13 AM.

Download Raw Diff

Details

Reviewers

Summary

This patch enables setting of breakpoints and disassembly for microMIPS applications running on bare-iron targets like IASim.

MIPS uses bit #0 (ISA bit) of an address for ISA mode (1 for microMIPS/MIPS16 and 0 for MIPS). The resulting address is called as compressed address when ISA bit is set. This allows processor to switch between microMIPS and MIPS without any need for special mode-control register. This bit is then cleared by the processor while fetching the instruction from memory. However, apart from .debug_line, none of the ELF/DWARF sections set the ISA bit.

In this patch:

The symbol table is recorded in the form of compressed address for microMIPS symbols, so that corresponding debug_line can be decoded properly.
Memory read/write of compressed address has been handled

Diff Detail

Repository: rL LLVM

Event Timeline

jaydeep retitled this revision from to [MIPS] microMIPS breakpoints, disassembly and compressed addresses.Aug 17 2015, 4:13 AM

jaydeep updated this revision to Diff 32289.Aug 17 2015, 4:13 AM

jaydeep updated this object.

jaydeep added a reviewer: clayborg.

jaydeep set the repository for this revision to rL LLVM.

jaydeep added subscribers: lldb-commits, bhushan, slthakur and 2 others.

Many changes. See inlined comments.

source/Core/Disassembler.cpp
1169–1187 ↗	(On Diff #32289)	This kind of address snipping is going to be needed in many different places and this should be done in: lldb::addr_t Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const; You will note there is already similar functionality for ARM: lldb::addr_t Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const { addr_t opcode_addr = load_addr; switch (m_arch.GetMachine()) { case llvm::Triple::arm: case llvm::Triple::thumb: switch (addr_class) { case eAddressClassData: case eAddressClassDebug: return LLDB_INVALID_ADDRESS; case eAddressClassInvalid: case eAddressClassUnknown: case eAddressClassCode: case eAddressClassCodeAlternateISA: case eAddressClassRuntime: opcode_addr &= ~(1ull); break; } break; default: break; } return opcode_addr; } Then you would typically access this via "Address::GetCallableLoadAddress (Target target, bool is_indirect) const". We should probably add a new method to Address: Address Address::GetCallableAddress(Target target, bool is_indirect) const { SectionSP section_sp (GetSection()); if (section_sp) { ModuleSP module_sp = section_sp->GetModule(); if (module_sp) { lldb::addr_t callable_file_addr = target->GetCallableLoadAddress (GetFileAddress(), GetAddressClass()); Address callable_addr; if (module_sp->ResolveFileAddress (callable_file_addr, callable_addr)) return callable_addr; } } return *this; } Then you should use this here: const size_t bytes_read = target->ReadMemory (range.GetBaseAddress().GetCallableAddress(target, false),
1189 ↗	(On Diff #32289)	This is incorrect. You can't pass a file address to target->ReadMemory(...) as this will do the wrong thing if you are running. The story goes like this: lldb_private::Address is a section offset based address that says an address is ".text + 0x1000". When target->ReadMemory() tries to read memory from this address, it can see if "prefer_file_cache" is set and if so, it will grab the section from the the address that is passed as the first parameter and then be able to get the module from that section and read data from the cached .text section contents from the object file in the module. If you call Target::ReadMemory() with "compressed_addr.GetFileAddress()", it will get the file address (the unslid address) of 0x1000 and convert that to a Address object. So just pass your fixed up address, in this case it will be "compressed_addr".
source/Plugins/ObjectFile/ELF/ObjectFileELF.cpp
2123–2137	I wouldn't muck with the symbol value directly, just make sure that ObjectFileELF::GetAddressClass(...) works: AddressClass ObjectFileELF::GetAddressClass (addr_t file_addr); This should classify any address as either eAddressClassCode (ARM for ARM architectures) or eAddressClassCodeAlternateISA (Thumb for ARM architectures). Then any code that relies on ISA should be checking the AddressClass. for eAddressClassCode or eAddressClassCodeAlternateISA.
source/Plugins/Process/gdb-remote/ProcessGDBRemote.cpp
3172–3173 ↗	(On Diff #32289)	This should be removed. The address in the breakpoint site should already have been sanitized by Process::CreateBreakpointSite() which will call Address::GetOpcodeLoadAddress(Target*) to get the correct address.
3187–3203 ↗	(On Diff #32289)	This should be removed. The address in the breakpoint site should already have been sanitized by Process::CreateBreakpointSite() which will call Address::GetOpcodeLoadAddress(Target*) to get the correct address.
3297–3314 ↗	(On Diff #32289)	This should be removed. The address in the breakpoint site should already have been sanitized by Process::CreateBreakpointSite() which will call Address::GetOpcodeLoadAddress(Target*) to get the correct address.
source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
1276–1320	This should be removed and rely on ObjectFile::GetAddressClass() to do the right thing.

This revision now requires changes to proceed.Aug 17 2015, 10:44 AM

The main thing is, we don't want to be like other debuggers that have all this code in many many places that check address bits by checking the Architecture and litter the code with bit strips and adding bits where needed. We want to support addresses correctly by knowing that a Address has a special address class. So we use:

addr_t
Address::GetCallableLoadAddress (Target *target, bool is_indirect) const

and

lldb::addr_t
Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const

Target also has a counterpart that does the actual check since the target has the ArchSpec that tells us the architecture. Also if you ever need make a special address that needs to have bit zero set, there is:

lldb::addr_t
Target::GetCallableLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const

Addressed review comments.
Address conversions are handled in Address class. This is a reduced version of the original patch where address conversions are handled for breakpoints only.

Added GetCallableFileAddress for MIPS

DWARF parser should be stripping bit #0 for all addresses from mips targets: line tables, all address ranges for functions and blocks and variables should have this bit #0 stripped. The AddressClass from ObjectFileELF.cpp should help you figure out which ISA things are. This stops all sorts of extra code being added all over the debugger that needs to worry about this bit #0.

include/lldb/Core/Address.h
311–312 ↗	(On Diff #33793)	Rename to GetCallableAddress since it returns a lldb_private::Address object and that isn't a file address. File address would be if you returned a lldb::addr_t that was a file address for a specific module, but that isn't the case here.
include/lldb/Symbol/Function.h
579–580 ↗	(On Diff #33793)	This shouldn't be needed, see comment for Function::GetPrologueByteSize().
include/lldb/Target/Target.h
908–909 ↗	(On Diff #33793)	This should return a lldb_private::Address object and "load_addr" should be a "lldb_private::Address()". There is no good way for a target to talk about file addresses since the returned "lldb::addr_t" would only make sense to a module since all modules for shared libraries share the file address zero. Load addresses are different because when a process is running and has things loaded, a load address describes a unique place in the program. A file address of zero would match all shared libraries since most shared libraries start their .text segment (or one of their segments with a file address of zero). Moving to a lldb_private::Address gives us the section + offset where the section describes which module contains the address. The second parameter "addr_class" isn't needed if "lldb::addr_t load_addr" becomes "const lldb_private::Address &addr". So this function should be: lldb_private::Address GetCallableAddress (const lldb_private::Address &addr);
source/Core/Address.cpp
399–402 ↗	(On Diff #33793)	This should become: return target->GetCallableAddress(*this);
source/Core/FormatEntity.cpp
426–429	This change should probably be removed if we parse the line tables correctly right? That bit #0 for mips should be stripped when parsing the line table and the address class should be relied upon for anyone needing to know the origins of an address.
source/Symbol/Function.cpp
558 ↗	(On Diff #33793)	Remove this param, see comment below.
569–575 ↗	(On Diff #33793)	This bit #0 should be sanitized before it is placed into the line table when the line table is being parsed. No one should have to worry about this, so thie "Target *target" parameter should be removed and the line table should strip bit #0 and we should rely on getting the address class correctly like you already fixed in ObjectFileELF.cpp if anyone needs to know about the address class.
627 ↗	(On Diff #33793)	This should be removed. Bit #0 should never be left set in any public facing address and the address class should be relied upon by anyone needing to know about the ISA of the code address. So the DWARF parser needs to be fixed to strip bit #0 for all MIPS stuff.
source/Target/RegisterContext.cpp
106–124	Bit #0 should be stripped from the PC before it is figured out and the frame might need to track the address class, so this change shouldn't be needed. We don't want extra bits floating around in our code that we have to strip everywhere. This should be done as the stack frames are being created. The frame will need to keep track of the address class in case the address doesn't map back to a shared library (JITed code might not have a module describing the code). So this code should be removed and the backtracer will need to sanitize the addresses as the PC values are unwound.
source/Target/Target.cpp
2062–2063	This should be: lldb_private::Address GetCallableAddress (const lldb_private::Address &addr);
source/Target/ThreadPlanStepInRange.cpp
286 ↗	(On Diff #33793)	Remove. Line tables shouldn't contain bit #0 set in any addresses in the line tables. AddressClass of an address should be relied upon for ISA.

This revision now requires changes to proceed.Sep 2 2015, 10:04 AM

jaydeep added inline comments.Sep 4 2015, 5:07 AM

source/Target/RegisterContext.cpp
106–124	The breakpoint is set on OpcodeAddress (bit #0 clear), but target returns CallableAddress (bit #0 set) when breakpoint is hit. Set using: StoppointLocation (loc_id, addr.GetOpcodeLoadAddress(&owner.GetTarget()), hardware) Find using: addr_t pc = thread_sp->GetRegisterContext()->GetPC() + m_breakpoint_pc_offset; lldb::BreakpointSiteSP bp_site_sp = thread_sp->GetProcess()->GetBreakpointSiteList().FindByAddress(pc); We either need to clear bit #0 from the PC we get or need to set the breakpoint on CallableAddress (which would need LineTable in CallableAddress form).

In this patch:

The bit #0 has been cleared from addresses in the line tables. However we are relying upon ArchSpec instead of Target while clearing this bit in ParseDWARFLineTableCallback because SymbolContext may not have a valid target to call Address::GetOpcodeLoadAddress().

Bare-iron targets (like YAMON, IASim, Qemu) return compressed address (bit #0 set) when process is stopped in microMIPS address space. For example: bit #0 of PC is set when a breakpoint is hit. This bit has been cleared while reading the PC in RegisterContext::GetPC(). This would help us find breakpoints set using GetOpcodeLoadAddress (bit #0 clear),

DisassemblerLLVMC::DisassemblerLLVMC has been modified to create m_alternate_disasm_ap for microMIPS. This would display disassembly in either compressed (bit #0 set) or uncompressed (bit #0 clear) address space based on ISA mode.

Open issues:

In FormatEntity.cpp we probably don't need any changes to DumpAddress()
Remove MIPS comment from generic code and let the "Target::GetOpcodeLoadAddress (...) const" document what is happening.
Call Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const to fixup the PC.

source/Core/FormatEntity.cpp
421–429	I repeat this concern: do we still need to do this? There should be no changes needed for this function if the bit #0 has been stripped.
source/Target/RegisterContext.cpp
110–116	Probably no need for this MIPS specific comment in here, it should be documented once in the Target functions that strip the bit zero.
117–120	We don't need to make a section + offset address here, we can just use: lldb::addr_t Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const; So this code should be: TargetSP target_sp = m_thread.CalculateTarget(); if (target_sp) pc = target->GetOpcodeLoadAddress (pc, eAddressClassCode);

This revision now requires changes to proceed.Sep 8 2015, 9:24 AM

In this patch:

Removed MIPS comment from generic code
Used Target::GetOpcodeLoadAddress to fixup the PC

Regarding change in FormatEntity.cpp:

We still need to do this. The bit #0 of ‘addr’ has already been striped and thus it does not represent its true address space (microMIPS or MIPS). We need to call GetCallableLoadAddress here because we want to set the bit #0 of this address if it belongs to eAddressClassCodeAlternateISA.

This change displays the microMIPS disassembly (and other addresses) in compact address space:

0x8020067d <+0>:  addiusp -16
0x8020067f <+2>:  sw     $fp, 12($sp)
0x80200681 <+4>:  move   $fp, $sp

thread #1: tid = 0x0001, 0x802006c5 micro.elf`foo(a=0, b=0) + 16 at micro.c:19, stop reason = breakpoint 2.1 frame #0: 0x802006c5 micro.elf`foo(a=0, b=0) + 16 at micro.c:19

Without this change the microMIPS disassembly would be displayed in uncompact (MIPS) address space:

0x8020067c <+0>:  addiusp -16
0x8020067e <+2>:  sw     $fp, 12($sp)
0x80200680 <+4>:  move   $fp, $sp

thread #1: tid = 0x0001, 0x802006c4 micro.elf`foo(a=0, b=0) + 16 at micro.c:19, stop reason = breakpoint 2.1 frame #0: 0x802006c4 micro.elf`foo(a=0, b=0) + 16 at micro.c:19

So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:

case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC

So only the LineEntry ones should actually do what you did.

This revision now requires changes to proceed.Sep 9 2015, 2:51 PM

Actually not a new format type, but an extra arg will need to be passed to DumpAddress like "bool addr_is_callable".

Can you explain something to me? In the following example:

0x8020067d <+0>:  addiusp -16
0x8020067f <+2>:  sw     $fp, 12($sp)
0x80200681 <+4>:  move   $fp, $sp

Is the addiusp actually at 0x8020067c in memory? Then we just display 0x8020067d to let people know this is MicroMIPS?

In D12079#242751, @clayborg wrote:
Actually not a new format type, but an extra arg will need to be passed to DumpAddress like "bool addr_is_callable".

Can you explain something to me? In the following example:
0x8020067d <+0>:  addiusp -16
0x8020067f <+2>:  sw     $fp, 12($sp)
0x80200681 <+4>:  move   $fp, $sp
Is the addiusp actually at 0x8020067c in memory? Then we just display 0x8020067d to let people know this is MicroMIPS?

Yes, addiusp is actually at 0x8020067c, but processor (when running in microMIPS mode) strips bit #0 while fetching it from memory. We should display it at 0x8020067d to let user know that this is microMIPS.

In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.

We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);

In D12079#242998, @jaydeep wrote:
In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.
We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);

We already have this in Target:

lldb::addr_t
GetCallableLoadAddress (lldb::addr_t load_addr, lldb::AddressClass addr_class = lldb::eAddressClassInvalid) const;

So the solution here will be to modify Address::Dump() such that it detects when an address is eAddressClassCodeAlternateISA and when that happens it checks if the ExecutionContext parameter is non NULL, and if so, extract the target, and check the target's architecture is MIPS, then add the extra bit when displaying this address. As it seems that we would always want to describe a section offset address (lldb_private::Address object) in this way to show the MicroMIPS address space bit, right?

In D12079#243390, @clayborg wrote:
In D12079#242998, @jaydeep wrote:
In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.
We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);
We already have this in Target:
lldb::addr_t
GetCallableLoadAddress (lldb::addr_t load_addr, lldb::AddressClass addr_class = lldb::eAddressClassInvalid) const;
So the solution here will be to modify Address::Dump() such that it detects when an address is eAddressClassCodeAlternateISA and when that happens it checks if the ExecutionContext parameter is non NULL, and if so, extract the target, and check the target's architecture is MIPS, then add the extra bit when displaying this address. As it seems that we would always want to describe a section offset address (lldb_private::Address object) in this way to show the MicroMIPS address space bit, right?

Yes.

In D12079#244059, @jaydeep wrote:
In D12079#243390, @clayborg wrote:
In D12079#242998, @jaydeep wrote:
In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.
We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);
We already have this in Target:
lldb::addr_t
GetCallableLoadAddress (lldb::addr_t load_addr, lldb::AddressClass addr_class = lldb::eAddressClassInvalid) const;
So the solution here will be to modify Address::Dump() such that it detects when an address is eAddressClassCodeAlternateISA and when that happens it checks if the ExecutionContext parameter is non NULL, and if so, extract the target, and check the target's architecture is MIPS, then add the extra bit when displaying this address. As it seems that we would always want to describe a section offset address (lldb_private::Address object) in this way to show the MicroMIPS address space bit, right?
In D12079#243390, @clayborg wrote:
In D12079#242998, @jaydeep wrote:
In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.
We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);
We already have this in Target:
lldb::addr_t
GetCallableLoadAddress (lldb::addr_t load_addr, lldb::AddressClass addr_class = lldb::eAddressClassInvalid) const;
So the solution here will be to modify Address::Dump() such that it detects when an address is eAddressClassCodeAlternateISA and when that happens it checks if the ExecutionContext parameter is non NULL, and if so, extract the target, and check the target's architecture is MIPS, then add the extra bit when displaying this address. As it seems that we would always want to describe a section offset address (lldb_private::Address object) in this way to show the MicroMIPS address space bit, right?
Yes.

Instead of modifying Address::Dump() we should modify DumpAddress() so that

In D12079#244059, @jaydeep wrote:
In D12079#243390, @clayborg wrote:
In D12079#242998, @jaydeep wrote:
In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.
We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);
We already have this in Target:
lldb::addr_t
GetCallableLoadAddress (lldb::addr_t load_addr, lldb::AddressClass addr_class = lldb::eAddressClassInvalid) const;
So the solution here will be to modify Address::Dump() such that it detects when an address is eAddressClassCodeAlternateISA and when that happens it checks if the ExecutionContext parameter is non NULL, and if so, extract the target, and check the target's architecture is MIPS, then add the extra bit when displaying this address. As it seems that we would always want to describe a section offset address (lldb_private::Address object) in this way to show the MicroMIPS address space bit, right?
In D12079#243390, @clayborg wrote:
In D12079#242998, @jaydeep wrote:
In D12079#242742, @clayborg wrote:
So DumpAddress() in FormatEntity.cpp is a generic "dump any address by describing it". You can't just change the code to suit your needs for MIPS. This address could be any address: code or data. If you want something that can take an address like 0x1000 and you ask for its AddressClass and it sees that its address class is eAddressClassCodeAlternateISA, and then you change it to be "0x1001", this will need to be a new format type.

DumpAddress in FormatEntity.cpp is called for the following entities:
case Entry::Type::LineEntryStartAddress:
case Entry::Type::LineEntryEndAddress:
case Entry::Type::AddressFile:
case Entry::Type::AddressLoad:
case Entry::Type::AddressLoadOrFile:
case Entry::Type::FrameRegisterPC
So only the LineEntry ones should actually do what you did.
We need to display all these entities in compressed address format. How about a new MIPS specific function in Address and Target class which would do this.

Address Address::GetCallableAddress(Target *target);
lldb::addr_t Target::GetCallableAddress (lldb::addr_t load_addr, AddressClass addr_class);
We already have this in Target:
lldb::addr_t
GetCallableLoadAddress (lldb::addr_t load_addr, lldb::AddressClass addr_class = lldb::eAddressClassInvalid) const;
So the solution here will be to modify Address::Dump() such that it detects when an address is eAddressClassCodeAlternateISA and when that happens it checks if the ExecutionContext parameter is non NULL, and if so, extract the target, and check the target's architecture is MIPS, then add the extra bit when displaying this address. As it seems that we would always want to describe a section offset address (lldb_private::Address object) in this way to show the MicroMIPS address space bit, right?
Yes.

Change in Address::Dump() would display microMIPS address only for Entry::Type::AddressLoadOrFile entity. (when "print_file_addr_or_load_addr" is true in DumpAddress()). However we would like to display microMIPS addresses for

Entry::Type::AddressFile
Entry::Type::AddressLoad
Entry::Type::FrameRegisterPC
Entry::Type::LineEntryStartAddress
Entry::Type::LineEntryEndAddress

entities as well (when "print_file_addr_or_load_addr" is false). The suggested change should be moved to DumpAddress().

In this patch:
Modified DumpAddress() to print compressed address for microMIPS.

Hi Greg,
Could you please find some time to review this?
Thanks

So everywhere that we want to display a code address for MicroMIPS needs to now add code that creates a callable address? I still say that Address::Dump() is what should be modified, not DumpAddress if FormatEntity.cpp. I don't want any other code anywhere in the debugger to have to make this check. It should be just Address::Dump() that knows about this.

This revision now requires changes to proceed.Sep 14 2015, 11:31 AM

Addressed review comments

One last change to make line table parsing more efficient by not having to check the arch for every line table entry.

source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp
1397	Maybe this should be a "lldb:addr_t addr_mask;" instead of the architecture. Then you determine the mask one time before you parse a line table and fill it in.
1426–1436	Move this code to where the ParseDWARFLineTableCallbackInfo is filled in and fill in "addr_mask" as described above. Otherwise each time we append a line entry to a sequence we will be checking the arch over and over and over and over....
1439	change this line to: file_addr & info->addr_mask,
1479	Fill in "info.addr_mask" here: /* * MIPS: * The SymbolContext may not have a valid target, thus we may not be able * to call Address::GetOpcodeLoadAddress() which would clear the bit #0 * for MIPS. Use ArchSpec to clear the bit #0. */ ArchSpec arch; GetObjectFile()->GetArchitecture(arch); switch (arch.GetMachine()) { case llvm::Triple::mips: case llvm::Triple::mipsel: case llvm::Triple::mips64: case llvm::Triple::mips64el: info.addr_mask = ~((lldb::addr_t)1); break; default: info.addr_mask = ~((lldb::addr_t)0); break; }

This revision now requires changes to proceed.Sep 15 2015, 2:01 PM

Addressed review comments

A few more little things with respect to not calling accessors multiple times in if statements and this will be good to go.

source/Core/Address.cpp
474 ↗	(On Diff #34873)	Change this to be: const llvm::Triple::ArchType llvm_arch = target->GetArchitecture().GetMachine(); and use llvm_arch in if statement below instead of calling accessor 4 times.
source/Plugins/Disassembler/llvm/DisassemblerLLVMC.cpp
698–727	We should store the llvm::Triple::ArchType into a local const variable up on line 750 and use it on line 750 and in this "else if". Also note that we have "triple" which is already the "arch.GetTriple()".
source/Plugins/ObjectFile/ELF/ObjectFileELF.cpp
2054–2055	const llvm::Triple::ArchType llvm_arch = target->GetArchitecture().GetMachine(); Then use llvm_arch in the if statement.

This revision now requires changes to proceed.Sep 16 2015, 10:28 AM

Addressed review comments.

Looks good.

This revision is now accepted and ready to land.Sep 18 2015, 9:30 AM

Closed by commit http://reviews.llvm.org/rL248248

Revision Contents

Path

Size

source/

Core/

FormatEntity.cpp

11 lines

Plugins/

Disassembler/

llvm/

DisassemblerLLVMC.cpp

23 lines

ObjectFile/

ELF/

ObjectFileELF.cpp

29 lines

SymbolFile/

DWARF/

SymbolFileDWARF.cpp

16 lines

Target/

RegisterContext.cpp

19 lines

Target.cpp

25 lines

Diff 34199

source/Core/FormatEntity.cpp

	Show First 20 Lines • Show All 412 Lines • ▼ Show 20 Lines
	DumpAddress (Stream &s,			DumpAddress (Stream &s,
	const SymbolContext *sc,			const SymbolContext *sc,
	const ExecutionContext *exe_ctx,			const ExecutionContext *exe_ctx,
	const Address &addr,			const Address &addr,
	bool print_file_addr_or_load_addr)			bool print_file_addr_or_load_addr)
	{			{
	Target *target = Target::GetTargetFromContexts (exe_ctx, sc);			Target *target = Target::GetTargetFromContexts (exe_ctx, sc);
	addr_t vaddr = LLDB_INVALID_ADDRESS;			addr_t vaddr = LLDB_INVALID_ADDRESS;

				// If the address belongs to eAddressClassCodeAlternateISA then
				// dump its callable form.
				Address callable_addr (addr.GetCallableLoadAddress(target));

	if (exe_ctx && !target->GetSectionLoadList().IsEmpty())			if (exe_ctx && !target->GetSectionLoadList().IsEmpty())
	vaddr = addr.GetLoadAddress (target);			vaddr = callable_addr.GetLoadAddress (target);
	if (vaddr == LLDB_INVALID_ADDRESS)			if (vaddr == LLDB_INVALID_ADDRESS)
	vaddr = addr.GetFileAddress ();			vaddr = callable_addr.GetFileAddress ();
				clayborgUnsubmitted Not Done Reply Inline Actions This change should probably be removed if we parse the line tables correctly right? That bit #0 for mips should be stripped when parsing the line table and the address class should be relied upon for anyone needing to know the origins of an address. clayborg: This change should probably be removed if we parse the line tables correctly right? That bit #0…
				clayborgUnsubmitted Not Done Reply Inline Actions I repeat this concern: do we still need to do this? There should be no changes needed for this function if the bit #0 has been stripped. clayborg: I repeat this concern: do we still need to do this? There should be no changes needed for this…

	if (vaddr != LLDB_INVALID_ADDRESS)			if (vaddr != LLDB_INVALID_ADDRESS)
	{			{
	int addr_width = 0;			int addr_width = 0;
	if (exe_ctx && target)			if (exe_ctx && target)
	{			{
	addr_width = target->GetArchitecture().GetAddressByteSize() * 2;			addr_width = target->GetArchitecture().GetAddressByteSize() * 2;
	}			}
	if (addr_width == 0)			if (addr_width == 0)
	addr_width = 16;			addr_width = 16;
	if (print_file_addr_or_load_addr)			if (print_file_addr_or_load_addr)
	{			{
	ExecutionContextScope *exe_scope = NULL;			ExecutionContextScope *exe_scope = NULL;
	if (exe_ctx)			if (exe_ctx)
	exe_scope = exe_ctx->GetBestExecutionContextScope();			exe_scope = exe_ctx->GetBestExecutionContextScope();
	addr.Dump (&s, exe_scope, Address::DumpStyleLoadAddress, Address::DumpStyleModuleWithFileAddress, 0);			callable_addr.Dump (&s, exe_scope, Address::DumpStyleLoadAddress, Address::DumpStyleModuleWithFileAddress, 0);
	}			}
	else			else
	{			{
	s.Printf("0x%." PRIx64, addr_width, addr_width, vaddr);			s.Printf("0x%." PRIx64, addr_width, addr_width, vaddr);
	}			}
	return true;			return true;
	}			}
	return false;			return false;
	▲ Show 20 Lines • Show All 2,115 Lines • Show Last 20 Lines

source/Plugins/Disassembler/llvm/DisassemblerLLVMC.cpp

Show First 20 Lines • Show All 678 Lines • ▼ Show 20 Lines	DisassemblerLLVMC::DisassemblerLLVMC (const ArchSpec &arch, const char *flavor_string) :
{		{
uint32_t arch_flags = arch.GetFlags ();		uint32_t arch_flags = arch.GetFlags ();
if (arch_flags & ArchSpec::eMIPSAse_msa)		if (arch_flags & ArchSpec::eMIPSAse_msa)
features_str += "+msa,";		features_str += "+msa,";
if (arch_flags & ArchSpec::eMIPSAse_dsp)		if (arch_flags & ArchSpec::eMIPSAse_dsp)
features_str += "+dsp,";		features_str += "+dsp,";
if (arch_flags & ArchSpec::eMIPSAse_dspr2)		if (arch_flags & ArchSpec::eMIPSAse_dspr2)
features_str += "+dspr2,";		features_str += "+dspr2,";
if (arch_flags & ArchSpec::eMIPSAse_mips16)
features_str += "+mips16,";
if (arch_flags & ArchSpec::eMIPSAse_micromips)
features_str += "+micromips,";
}		}

m_disasm_ap.reset (new LLVMCDisassembler(triple, cpu, features_str.c_str(), flavor, *this));		m_disasm_ap.reset (new LLVMCDisassembler(triple, cpu, features_str.c_str(), flavor, *this));
if (!m_disasm_ap->IsValid())		if (!m_disasm_ap->IsValid())
{		{
// We use m_disasm_ap.get() to tell whether we are valid or not, so if this isn't good for some reason,		// We use m_disasm_ap.get() to tell whether we are valid or not, so if this isn't good for some reason,
// we reset it, and then we won't be valid and FindPlugin will fail and we won't get used.		// we reset it, and then we won't be valid and FindPlugin will fail and we won't get used.
m_disasm_ap.reset();		m_disasm_ap.reset();
}		}

// For arm CPUs that can execute arm or thumb instructions, also create a thumb instruction disassembler.		// For arm CPUs that can execute arm or thumb instructions, also create a thumb instruction disassembler.
if (arch.GetTriple().getArch() == llvm::Triple::arm)		if (arch.GetTriple().getArch() == llvm::Triple::arm)
{		{
std::string thumb_triple(thumb_arch.GetTriple().getTriple());		std::string thumb_triple(thumb_arch.GetTriple().getTriple());
m_alternate_disasm_ap.reset(new LLVMCDisassembler(thumb_triple.c_str(), "", "", flavor, *this));		m_alternate_disasm_ap.reset(new LLVMCDisassembler(thumb_triple.c_str(), "", "", flavor, *this));
if (!m_alternate_disasm_ap->IsValid())		if (!m_alternate_disasm_ap->IsValid())
{		{
m_disasm_ap.reset();		m_disasm_ap.reset();
m_alternate_disasm_ap.reset();		m_alternate_disasm_ap.reset();
}		}
}		}
		else if (arch.GetTriple().getArch() == llvm::Triple::mips
		\|\| arch.GetTriple().getArch() == llvm::Triple::mipsel
		\|\| arch.GetTriple().getArch() == llvm::Triple::mips64
		\|\| arch.GetTriple().getArch() == llvm::Triple::mips64el)
		{
		/* Create alternate disassembler for MIPS16 and microMIPS */
		uint32_t arch_flags = arch.GetFlags ();
		if (arch_flags & ArchSpec::eMIPSAse_mips16)
		features_str += "+mips16,";
		else if (arch_flags & ArchSpec::eMIPSAse_micromips)
		features_str += "+micromips,";

		m_alternate_disasm_ap.reset(new LLVMCDisassembler (triple, cpu, features_str.c_str(), flavor, *this));
		if (!m_alternate_disasm_ap->IsValid())
		{
		m_disasm_ap.reset();
		m_alternate_disasm_ap.reset();
		}
		}
}		}
		clayborgUnsubmitted Not Done Reply Inline Actions We should store the llvm::Triple::ArchType into a local const variable up on line 750 and use it on line 750 and in this "else if". Also note that we have "triple" which is already the "arch.GetTriple()". clayborg: We should store the llvm::Triple::ArchType into a local const variable up on line 750 and use…

DisassemblerLLVMC::~DisassemblerLLVMC()		DisassemblerLLVMC::~DisassemblerLLVMC()
{		{
}		}

size_t		size_t
DisassemblerLLVMC::DecodeInstructions (const Address &base_addr,		DisassemblerLLVMC::DecodeInstructions (const Address &base_addr,
const DataExtractor& data,		const DataExtractor& data,
▲ Show 20 Lines • Show All 221 Lines • Show Last 20 Lines

source/Plugins/ObjectFile/ELF/ObjectFileELF.cpp

Show First 20 Lines • Show All 1,808 Lines • ▼ Show 20 Lines	if (m_sections_ap.get())
}		}
else		else
{		{
unified_section_list = *m_sections_ap;		unified_section_list = *m_sections_ap;
}		}
}		}
}		}

		#define STO_MIPS_ISA (3 << 6)
		#define STO_MICROMIPS (2 << 6)
		#define IS_MICROMIPS(ST_OTHER) (((ST_OTHER) & STO_MIPS_ISA) == STO_MICROMIPS)

// private		// private
unsigned		unsigned
ObjectFileELF::ParseSymbols (Symtab *symtab,		ObjectFileELF::ParseSymbols (Symtab *symtab,
user_id_t start_id,		user_id_t start_id,
SectionList *section_list,		SectionList *section_list,
const size_t num_symbols,		const size_t num_symbols,
const DataExtractor &symtab_data,		const DataExtractor &symtab_data,
const DataExtractor &strtab_data)		const DataExtractor &strtab_data)
▲ Show 20 Lines • Show All 207 Lines • ▼ Show 20 Lines	for (i = 0; i < num_symbols; ++i)
}		}
else		else
{		{
// This address is ARM		// This address is ARM
m_address_class_map[symbol.st_value] = eAddressClassCode;		m_address_class_map[symbol.st_value] = eAddressClassCode;
}		}
}		}
}		}

		/*
		* MIPS:
		* The bit #0 of an address is used for ISA mode (1 for microMIPS, 0 for MIPS).
		* This allows processer to switch between microMIPS and MIPS without any need
		* for special mode-control register. However, apart from .debug_line, none of
		* the ELF/DWARF sections set the ISA bit (for symbol or section). Use st_other
		* flag to check whether the symbol is microMIPS and then set the address class
		* accordingly.
		*/
		if ((arch.GetMachine() == llvm::Triple::mips \|\| arch.GetMachine() == llvm::Triple::mipsel
		\|\| arch.GetMachine() == llvm::Triple::mips64 \|\| arch.GetMachine() == llvm::Triple::mips64el))
		clayborgUnsubmitted Not Done Reply Inline Actions const llvm::Triple::ArchType llvm_arch = target->GetArchitecture().GetMachine(); Then use llvm_arch in the if statement. clayborg: ``` const llvm::Triple::ArchType llvm_arch = target->GetArchitecture().GetMachine(); ``` Then…
		{
		if (IS_MICROMIPS(symbol.st_other))
		m_address_class_map[symbol.st_value] = eAddressClassCodeAlternateISA;
		else
		{
		if (symbol_type == eSymbolTypeCode)
		m_address_class_map[symbol.st_value] = eAddressClassCode;
		else if (symbol_type == eSymbolTypeData)
		m_address_class_map[symbol.st_value] = eAddressClassData;
		else
		m_address_class_map[symbol.st_value] = eAddressClassUnknown;
		}
		}
}		}

// symbol_value_offset may contain 0 for ARM symbols or -1 for		// symbol_value_offset may contain 0 for ARM symbols or -1 for
// THUMB symbols. See above for more details.		// THUMB symbols. See above for more details.
uint64_t symbol_value = symbol.st_value + symbol_value_offset;		uint64_t symbol_value = symbol.st_value + symbol_value_offset;
if (symbol_section_sp && CalculateType() != ObjectFile::Type::eTypeObjectFile)		if (symbol_section_sp && CalculateType() != ObjectFile::Type::eTypeObjectFile)
symbol_value -= symbol_section_sp->GetFileAddress();		symbol_value -= symbol_section_sp->GetFileAddress();

Show All 38 Lines	for (i = 0; i < num_symbols; ++i)
mangled.SetMangledName( ConstString((mangled_name + suffix).str()) );		mangled.SetMangledName( ConstString((mangled_name + suffix).str()) );

ConstString demangled = mangled.GetDemangledName(lldb::eLanguageTypeUnknown);		ConstString demangled = mangled.GetDemangledName(lldb::eLanguageTypeUnknown);
llvm::StringRef demangled_name = demangled.GetStringRef();		llvm::StringRef demangled_name = demangled.GetStringRef();
if (!demangled_name.empty())		if (!demangled_name.empty())
mangled.SetDemangledName( ConstString((demangled_name + suffix).str()) );		mangled.SetDemangledName( ConstString((demangled_name + suffix).str()) );
}		}

Symbol dc_symbol(		Symbol dc_symbol(
i + start_id, // ID is the original symbol table index.		i + start_id, // ID is the original symbol table index.
mangled,		mangled,
symbol_type, // Type of this symbol		symbol_type, // Type of this symbol
is_global, // Is this globally visible?		is_global, // Is this globally visible?
false, // Is this symbol debug info?		false, // Is this symbol debug info?
false, // Is this symbol a trampoline?		false, // Is this symbol a trampoline?
false, // Is this symbol artificial?		false, // Is this symbol artificial?
AddressRange(		AddressRange(
symbol_section_sp, // Section in which this symbol is defined or null.		symbol_section_sp, // Section in which this symbol is defined or null.
symbol_value, // Offset in section or symbol value.		symbol_value, // Offset in section or symbol value.
symbol.st_size), // Size in bytes of this symbol.		symbol.st_size), // Size in bytes of this symbol.
symbol.st_size != 0, // Size is valid if it is not 0		symbol.st_size != 0, // Size is valid if it is not 0
has_suffix, // Contains linker annotations?		has_suffix, // Contains linker annotations?
flags); // Symbol flags.		flags); // Symbol flags.
		clayborgUnsubmitted Not Done Reply Inline Actions I wouldn't muck with the symbol value directly, just make sure that ObjectFileELF::GetAddressClass(...) works: AddressClass ObjectFileELF::GetAddressClass (addr_t file_addr); This should classify any address as either eAddressClassCode (ARM for ARM architectures) or eAddressClassCodeAlternateISA (Thumb for ARM architectures). Then any code that relies on ISA should be checking the AddressClass. for eAddressClassCode or eAddressClassCodeAlternateISA. clayborg: I wouldn't muck with the symbol value directly, just make sure that ObjectFileELF…
symtab->AddSymbol(dc_symbol);		symtab->AddSymbol(dc_symbol);
}		}
return i;		return i;
}		}

unsigned		unsigned
ObjectFileELF::ParseSymbolTable(Symtab symbol_table, user_id_t start_id, lldb_private::Section symtab)		ObjectFileELF::ParseSymbolTable(Symtab symbol_table, user_id_t start_id, lldb_private::Section symtab)
{		{
▲ Show 20 Lines • Show All 978 Lines • Show Last 20 Lines

source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp

This file is larger than 256 KB, so syntax highlighting is disabled by default.

Show First 20 Lines • Show All 1,267 Lines • ▼ Show 20 Lines	if (die->GetDIENamesAndRanges (this,
decl_line,		decl_line,
decl_column));		decl_column));

// Supply the type _only_ if it has already been parsed		// Supply the type _only_ if it has already been parsed
Type *func_type = m_die_to_type.lookup (die);		Type *func_type = m_die_to_type.lookup (die);

assert(func_type == NULL \|\| func_type != DIE_IS_BEING_PARSED);		assert(func_type == NULL \|\| func_type != DIE_IS_BEING_PARSED);

if (FixupAddress (func_range.GetBaseAddress()))		if (FixupAddress (func_range.GetBaseAddress()))
{		{
const user_id_t func_user_id = MakeUserID(die->GetOffset());		const user_id_t func_user_id = MakeUserID(die->GetOffset());
func_sp.reset(new Function (sc.comp_unit,		func_sp.reset(new Function (sc.comp_unit,
MakeUserID(func_user_id), // UserID is the DIE offset		MakeUserID(func_user_id), // UserID is the DIE offset
MakeUserID(func_user_id),		MakeUserID(func_user_id),
func_name,		func_name,
func_type,		func_type,
func_range)); // first address range		func_range)); // first address range

if (func_sp.get() != NULL)		if (func_sp.get() != NULL)
{		{
if (frame_base.IsValid())		if (frame_base.IsValid())
func_sp->GetFrameBaseExpression() = frame_base;		func_sp->GetFrameBaseExpression() = frame_base;
sc.comp_unit->AddFunction(func_sp);		sc.comp_unit->AddFunction(func_sp);
return func_sp.get();		return func_sp.get();
}		}
}		}
}		}
}		}
return NULL;		return NULL;
}		}

bool		bool
SymbolFileDWARF::FixupAddress (Address &addr)		SymbolFileDWARF::FixupAddress (Address &addr)
{		{
SymbolFileDWARFDebugMap * debug_map_symfile = GetDebugMapSymfile ();		SymbolFileDWARFDebugMap * debug_map_symfile = GetDebugMapSymfile ();
if (debug_map_symfile)		if (debug_map_symfile)
{		{
return debug_map_symfile->LinkOSOAddress(addr);		return debug_map_symfile->LinkOSOAddress(addr);
}		}
// This is a normal DWARF file, no address fixups need to happen		// This is a normal DWARF file, no address fixups need to happen
return true;		return true;
}		}
lldb::LanguageType		lldb::LanguageType
SymbolFileDWARF::ParseCompileUnitLanguage (const SymbolContext& sc)		SymbolFileDWARF::ParseCompileUnitLanguage (const SymbolContext& sc)
{		{
assert (sc.comp_unit);		assert (sc.comp_unit);
DWARFCompileUnit* dwarf_cu = GetDWARFCompileUnit(sc.comp_unit);		DWARFCompileUnit* dwarf_cu = GetDWARFCompileUnit(sc.comp_unit);
if (dwarf_cu)		if (dwarf_cu)
{		{
const DWARFDebugInfoEntry *die = dwarf_cu->GetCompileUnitDIEOnly();		const DWARFDebugInfoEntry *die = dwarf_cu->GetCompileUnitDIEOnly();
if (die)		if (die)
return DWARFCompileUnit::LanguageTypeFromDWARF(die->GetAttributeValueAsUnsigned(this, dwarf_cu, DW_AT_language, 0));		return DWARFCompileUnit::LanguageTypeFromDWARF(die->GetAttributeValueAsUnsigned(this, dwarf_cu, DW_AT_language, 0));
}		}
		clayborgUnsubmitted Not Done Reply Inline Actions This should be removed and rely on ObjectFile::GetAddressClass() to do the right thing. clayborg: This should be removed and rely on ObjectFile::GetAddressClass() to do the right thing.
return eLanguageTypeUnknown;		return eLanguageTypeUnknown;
}		}

size_t		size_t
SymbolFileDWARF::ParseCompileUnitFunctions(const SymbolContext &sc)		SymbolFileDWARF::ParseCompileUnitFunctions(const SymbolContext &sc)
{		{
assert (sc.comp_unit);		assert (sc.comp_unit);
size_t functions_added = 0;		size_t functions_added = 0;
▲ Show 20 Lines • Show All 60 Lines • ▼ Show 20 Lines	SymbolFileDWARF::ParseImportedModules (const lldb_private::SymbolContext &sc, std::vector<lldb_private::ConstString> &imported_modules)
}		}
return false;		return false;
}		}

struct ParseDWARFLineTableCallbackInfo		struct ParseDWARFLineTableCallbackInfo
{		{
LineTable* line_table;		LineTable* line_table;
std::unique_ptr<LineSequence> sequence_ap;		std::unique_ptr<LineSequence> sequence_ap;
		ArchSpec arch;
		clayborgUnsubmitted Not Done Reply Inline Actions Maybe this should be a "lldb:addr_t addr_mask;" instead of the architecture. Then you determine the mask one time before you parse a line table and fill it in. clayborg: Maybe this should be a "lldb:addr_t addr_mask;" instead of the architecture. Then you determine…
};		};

//----------------------------------------------------------------------		//----------------------------------------------------------------------
// ParseStatementTableCallback		// ParseStatementTableCallback
//----------------------------------------------------------------------		//----------------------------------------------------------------------
static void		static void
ParseDWARFLineTableCallback(dw_offset_t offset, const DWARFDebugLine::State& state, void* userData)		ParseDWARFLineTableCallback(dw_offset_t offset, const DWARFDebugLine::State& state, void* userData)
{		{
Show All 12 Lines	else

// If this is our first time here, we need to create a		// If this is our first time here, we need to create a
// sequence container.		// sequence container.
if (!info->sequence_ap.get())		if (!info->sequence_ap.get())
{		{
info->sequence_ap.reset(line_table->CreateLineSequenceContainer());		info->sequence_ap.reset(line_table->CreateLineSequenceContainer());
assert(info->sequence_ap.get());		assert(info->sequence_ap.get());
}		}

		/*
		* MIPS:
		* The SymbolContext may not have a valid target, thus we may not be able
		* to call Address::GetOpcodeLoadAddress() which would clear the bit #0
		* for MIPS. Use ArchSpec to clear the bit #0.
		*/
		lldb::addr_t file_addr = state.address;
		if (info->arch.GetMachine() == llvm::Triple::mips \|\| info->arch.GetMachine() == llvm::Triple::mipsel
		\|\| info->arch.GetMachine() == llvm::Triple::mips64 \|\| info->arch.GetMachine() == llvm::Triple::mips64el)
		file_addr = state.address & (~1ull);
		clayborgUnsubmitted Not Done Reply Inline Actions Move this code to where the ParseDWARFLineTableCallbackInfo is filled in and fill in "addr_mask" as described above. Otherwise each time we append a line entry to a sequence we will be checking the arch over and over and over and over.... clayborg: Move this code to where the ParseDWARFLineTableCallbackInfo is filled in and fill in…

line_table->AppendLineEntryToSequence (info->sequence_ap.get(),		line_table->AppendLineEntryToSequence (info->sequence_ap.get(),
state.address,		file_addr,
		clayborgUnsubmitted Not Done Reply Inline Actions change this line to: file_addr & info->addr_mask, clayborg: change this line to: ``` file_addr & info->addr_mask, ```
state.line,		state.line,
state.column,		state.column,
state.file,		state.file,
state.is_stmt,		state.is_stmt,
state.basic_block,		state.basic_block,
state.prologue_end,		state.prologue_end,
state.epilogue_begin,		state.epilogue_begin,
state.end_sequence);		state.end_sequence);
Show All 23 Lines	if (dwarf_cu)
const dw_offset_t cu_line_offset = dwarf_cu_die->GetAttributeValueAsUnsigned(this, dwarf_cu, DW_AT_stmt_list, DW_INVALID_OFFSET);		const dw_offset_t cu_line_offset = dwarf_cu_die->GetAttributeValueAsUnsigned(this, dwarf_cu, DW_AT_stmt_list, DW_INVALID_OFFSET);
if (cu_line_offset != DW_INVALID_OFFSET)		if (cu_line_offset != DW_INVALID_OFFSET)
{		{
std::unique_ptr<LineTable> line_table_ap(new LineTable(sc.comp_unit));		std::unique_ptr<LineTable> line_table_ap(new LineTable(sc.comp_unit));
if (line_table_ap.get())		if (line_table_ap.get())
{		{
ParseDWARFLineTableCallbackInfo info;		ParseDWARFLineTableCallbackInfo info;
info.line_table = line_table_ap.get();		info.line_table = line_table_ap.get();
		GetObjectFile()->GetArchitecture(info.arch);
		clayborgUnsubmitted Not Done Reply Inline Actions Fill in "info.addr_mask" here: /* * MIPS: * The SymbolContext may not have a valid target, thus we may not be able * to call Address::GetOpcodeLoadAddress() which would clear the bit #0 * for MIPS. Use ArchSpec to clear the bit #0. / ArchSpec arch; GetObjectFile()->GetArchitecture(arch); switch (arch.GetMachine()) { case llvm::Triple::mips: case llvm::Triple::mipsel: case llvm::Triple::mips64: case llvm::Triple::mips64el: info.addr_mask = ~((lldb::addr_t)1); break; default: info.addr_mask = ~((lldb::addr_t)0); break; } clayborg:* Fill in "info.addr_mask" here: ``` /* * MIPS: * The SymbolContext may…
lldb::offset_t offset = cu_line_offset;		lldb::offset_t offset = cu_line_offset;
DWARFDebugLine::ParseStatementTable(get_debug_line_data(), &offset, ParseDWARFLineTableCallback, &info);		DWARFDebugLine::ParseStatementTable(get_debug_line_data(), &offset, ParseDWARFLineTableCallback, &info);
if (m_debug_map_symfile)		if (m_debug_map_symfile)
{		{
// We have an object file that has a line table with addresses		// We have an object file that has a line table with addresses
// that are not linked. We need to link the line table and convert		// that are not linked. We need to link the line table and convert
// the addresses that are relative to the .o file into addresses		// the addresses that are relative to the .o file into addresses
// for the main executable.		// for the main executable.
▲ Show 20 Lines • Show All 6,806 Lines • Show Last 20 Lines

source/Target/RegisterContext.cpp

Show First 20 Lines • Show All 97 Lines • ▼ Show 20 Lines	if (reg_info)
return reg_info->name;		return reg_info->name;
return NULL;		return NULL;
}		}

uint64_t		uint64_t
RegisterContext::GetPC(uint64_t fail_value)		RegisterContext::GetPC(uint64_t fail_value)
{		{
uint32_t reg = ConvertRegisterKindToRegisterNumber (eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC);		uint32_t reg = ConvertRegisterKindToRegisterNumber (eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC);
return ReadRegisterAsUnsigned (reg, fail_value);		uint64_t pc = ReadRegisterAsUnsigned (reg, fail_value);

		if (pc != fail_value)
		{
		/*
		* MIPS:
		* When a breakpoint is hit in microMIPS address space, bit #0 of the PC
		* is set by the target (CallableLoadAddress). However there is no trace
		* of bit #0 elsewhere in the debugger. Clear bit #0 so that we can find
		* breakpoints etc. set using OpcodeLoadAddress.
		*/
		clayborgUnsubmitted Not Done Reply Inline Actions Probably no need for this MIPS specific comment in here, it should be documented once in the Target functions that strip the bit zero. clayborg: Probably no need for this MIPS specific comment in here, it should be documented once in the…
		TargetSP target_sp = m_thread.CalculateTarget();
		Target *target = target_sp.get();
		Address addr (pc);
		pc = addr.GetOpcodeLoadAddress (target);
		clayborgUnsubmitted Not Done Reply Inline Actions We don't need to make a section + offset address here, we can just use: lldb::addr_t Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const; So this code should be: TargetSP target_sp = m_thread.CalculateTarget(); if (target_sp) pc = target->GetOpcodeLoadAddress (pc, eAddressClassCode); clayborg: We don't need to make a section + offset address here, we can just use: ``` lldb::addr_t Target…
		}

		return pc;
}		}
		clayborgUnsubmitted Not Done Reply Inline Actions Bit #0 should be stripped from the PC before it is figured out and the frame might need to track the address class, so this change shouldn't be needed. We don't want extra bits floating around in our code that we have to strip everywhere. This should be done as the stack frames are being created. The frame will need to keep track of the address class in case the address doesn't map back to a shared library (JITed code might not have a module describing the code). So this code should be removed and the backtracer will need to sanitize the addresses as the PC values are unwound. clayborg: Bit #0 should be stripped from the PC before it is figured out and the frame might need to…
		jaydeepAuthorUnsubmitted Not Done Reply Inline Actions The breakpoint is set on OpcodeAddress (bit #0 clear), but target returns CallableAddress (bit #0 set) when breakpoint is hit. Set using: StoppointLocation (loc_id, addr.GetOpcodeLoadAddress(&owner.GetTarget()), hardware) Find using: addr_t pc = thread_sp->GetRegisterContext()->GetPC() + m_breakpoint_pc_offset; lldb::BreakpointSiteSP bp_site_sp = thread_sp->GetProcess()->GetBreakpointSiteList().FindByAddress(pc); We either need to clear bit #0 from the PC we get or need to set the breakpoint on CallableAddress (which would need LineTable in CallableAddress form). jaydeep: The breakpoint is set on OpcodeAddress (bit #0 clear), but target returns CallableAddress (bit…

bool		bool
RegisterContext::SetPC(uint64_t pc)		RegisterContext::SetPC(uint64_t pc)
{		{
uint32_t reg = ConvertRegisterKindToRegisterNumber (eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC);		uint32_t reg = ConvertRegisterKindToRegisterNumber (eRegisterKindGeneric, LLDB_REGNUM_GENERIC_PC);
bool success = WriteRegisterFromUnsigned (reg, pc);		bool success = WriteRegisterFromUnsigned (reg, pc);
if (success)		if (success)
{		{
▲ Show 20 Lines • Show All 519 Lines • Show Last 20 Lines

source/Target/Target.cpp

	Show First 20 Lines • Show All 2,053 Lines • ▼ Show 20 Lines
	}			}

	ClangPersistentVariables &			ClangPersistentVariables &
	Target::GetPersistentVariables()			Target::GetPersistentVariables()
	{			{
	return *m_persistent_variables;			return *m_persistent_variables;
	}			}

	lldb::addr_t			lldb::addr_t
	Target::GetCallableLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const			Target::GetCallableLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const
				clayborgUnsubmitted Not Done Reply Inline Actions This should be: lldb_private::Address GetCallableAddress (const lldb_private::Address &addr); clayborg: This should be: ``` lldb_private::Address GetCallableAddress (const lldb_private::Address…
	{			{
	addr_t code_addr = load_addr;			addr_t code_addr = load_addr;
	switch (m_arch.GetMachine())			switch (m_arch.GetMachine())
	{			{
				case llvm::Triple::mips:
				case llvm::Triple::mipsel:
				case llvm::Triple::mips64:
				case llvm::Triple::mips64el:
				switch (addr_class)
				{
				case eAddressClassData:
				case eAddressClassDebug:
				return LLDB_INVALID_ADDRESS;

				case eAddressClassUnknown:
				case eAddressClassInvalid:
				case eAddressClassCode:
				case eAddressClassCodeAlternateISA:
				case eAddressClassRuntime:
				if ((code_addr & 2ull) \|\| (addr_class == eAddressClassCodeAlternateISA))
				code_addr \|= 1ull;
				break;
				}
				break;

	case llvm::Triple::arm:			case llvm::Triple::arm:
	case llvm::Triple::thumb:			case llvm::Triple::thumb:
	switch (addr_class)			switch (addr_class)
	{			{
	case eAddressClassData:			case eAddressClassData:
	case eAddressClassDebug:			case eAddressClassDebug:
	return LLDB_INVALID_ADDRESS;			return LLDB_INVALID_ADDRESS;

	Show All 29 Lines
	}			}

	lldb::addr_t			lldb::addr_t
	Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const			Target::GetOpcodeLoadAddress (lldb::addr_t load_addr, AddressClass addr_class) const
	{			{
	addr_t opcode_addr = load_addr;			addr_t opcode_addr = load_addr;
	switch (m_arch.GetMachine())			switch (m_arch.GetMachine())
	{			{
				case llvm::Triple::mips:
				case llvm::Triple::mipsel:
				case llvm::Triple::mips64:
				case llvm::Triple::mips64el:
	case llvm::Triple::arm:			case llvm::Triple::arm:
	case llvm::Triple::thumb:			case llvm::Triple::thumb:
	switch (addr_class)			switch (addr_class)
	{			{
	case eAddressClassData:			case eAddressClassData:
	case eAddressClassDebug:			case eAddressClassDebug:
	return LLDB_INVALID_ADDRESS;			return LLDB_INVALID_ADDRESS;

	▲ Show 20 Lines • Show All 1,644 Lines • Show Last 20 Lines

This is an archive of the discontinued LLVM Phabricator instance.

[MIPS] microMIPS breakpoints, disassembly and compressed addressesClosedPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 34199

source/Core/FormatEntity.cpp

source/Plugins/Disassembler/llvm/DisassemblerLLVMC.cpp

source/Plugins/ObjectFile/ELF/ObjectFileELF.cpp

source/Plugins/SymbolFile/DWARF/SymbolFileDWARF.cpp

source/Target/RegisterContext.cpp

source/Target/Target.cpp

[MIPS] microMIPS breakpoints, disassembly and compressed addresses
ClosedPublic