Previously we errored out when disassembling illegal instructions and there would be no profile generated. In fact illegal instructions are not uncommon and we'd better skip them and print "unknown" instead of erroring out. This matches the behavior of llvm-objdump (see disassembleObject in llvm-objdump.cpp).
Details
Diff Detail
- Repository
- rG LLVM Github Monorepo
Event Timeline
I'm wondering if we need a warning when running into such cases. It's possible with data in code for assembly code, but for normal c++ compilation, this could be a red flag that something might be off?
The consequence of such cases for llvm-objdump is less important and also visible in the output, so we don't need warning there. But llvm-profgen could be different - clues like warning may help us detect subtle profile issues.
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
250 | Does llvm-objdump also prints one <unknown> for each byte of illegal instruction, or one <unknown> for entire range of illegal instructions? |
Sure, a warning makes sense.
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
250 | llvm-objdump prints <unknown> for each bad instruction: https://github.com/llvm/llvm-project/blob/eecbb1c77655d38c06e47cf32e2dcc72da45c517/llvm/tools/llvm-objdump/llvm-objdump.cpp#L1158 objdump prints <bad> instead. |
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
250 | what's the length of a bad instruction? one byte? not a big deal, but thought nice to be consistent. |
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
250 | The logic in llvm-objdump is, if the dissembler doesn't tell the length of the bad instruction, moves forward byte-by-byte until a good instruction is seen. This is typically with X86 or variant sized ISA. For Arm, I think it'll skip 2 bytes or 4 bytes which is the length of fixed sized instructions. |
Thanks for working on this! LGTM
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
237 | Nit: how about also use Disassembled as the condition and merge this to the Line274, like: if (Disassembled) Offset += Size; else Offset += 1; |
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
237 | Size can still be a valid value even the disassembling fails on some targets. For example, on arm64, instruction size is always four byte. When the current disassembling fails, it might skip 4 bytes to the next instruction. |
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
237 | I see, thanks for the clarification. |
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
279 | It would be good to have one warning for a consecutive range of invalid instructions, instead of one warning for each byte. Otherwise it can be noisy. nit: Bad disassembling -> Invalid instructions at <start> - <end> |
llvm/tools/llvm-profgen/ProfiledBinary.cpp | ||
---|---|---|
279 | Good point, fixed. |
nit: InvalidInstLength