This is an archive of the discontinued LLVM Phabricator instance.

Differential D95505

[yaml2obj] Initial support for 32-bit XCOFF in yaml2obj.
ClosedPublic

Authored by Esme on Jan 26 2021, 9:31 PM.

Details

Reviewers

jasonliu
hubert.reinterpretcast
jsji
shchenz
sfertile
DiggerLin
qiucf
grimar
jhenderson
MaskRay
Higuoxing

Group Reviewers

Restricted Project

Commits

rG50bb1b930dbc: [yaml2obj] Initial the support of yaml2obj for 32-bit XCOFF.

Summary

This patch implements the mapping of the YAML information to an XCOFF object to enable the operation of yaml2obj.
Currently only 32-bit mode is supported.

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

There are a very large number of changes, so older changes are hidden.

Make as many fields as possible optional.
Test the behaviours when the key is omitted.
Use the larger type for 32-bit and 64-bit.

Harbormaster completed remote builds in B90330: Diff 325668. Feb 22 2021, 9:43 PM

Thank you so much for reviewing this patch @jhenderson @MaskRay @hubert.reinterpretcast and sorry for the late reply. I was on vacation the last few weeks.
Here is the doc about the structure of XCOFF files: https://www.ibm.com/support/knowledgecenter/ssw_aix_72/filesreference/XCOFF.html
I am working on adding 64-bit support but the other tools this patch may depend on do not support 64-bit for now. All of this has to be done step by step.

jhenderson added inline comments. Feb 24 2021, 1:13 AM

llvm/include/llvm/ObjectYAML/XCOFFYAML.h
59	Nit: missing trailing full stop for this comment. But really, I'm not sure this comment adds anything, as the name is pretty clear to me.
llvm/lib/ObjectYAML/XCOFFEmitter.cpp
55	Two questions: Why is this a signed type? What is the maximum number of sections an XCOFF file can have? In ELF, it is effectively UINT32_MAX, for example. (In practice, it's unlikely that we'll see that many sections, but we shouldn't prevent it just due to the wrong type). This type should be big enough for whatever the max can be.
56	In general, we tend to avoid using `auto` these days in new code, unless the type is obvious. In this case, the type of the member of `Obj.Sections` is not obvious.
66	General point that applies here and in all other new error messages: the coding guidelines say error messages start with lower case and don't end in a full stop. They don't explicitly exclude ending in an exclamation mark, but I'd avoid it as well. Specific point: it would be helpful if this error said what the maximum number of sections the code supports is.
73	Here and in similar cases below for sections etc, I don't think what you've done is good. I think you shouldn't emit an error if a user explicitly specifies the number of relocations (sections etc). Just use the specified value in the header. Take a look at what we do for ELF yaml2obj already. For many fields there is a default value that is derived from the contents of the YAML, and an option to override that value. For example, the ELF header has an e_shnum field which states the number of sections in that object. This value is automatically populated with the number of sections provided in the YAML. However, if a different value is specified in the YAML for the e_shnum field, that value is written to e_shnum instead, whilst the real set of sections is still written out. This enables a user to create a deliberate inconsistency between the two properties, which allows for things like error handling testing of malformed objects.
llvm/lib/ObjectYAML/XCOFFYAML.cpp
111–112	In general, I think everything should be optional, unless there is no sensible default. I haven't looked at the spec yet, but presumably the magic is fixed, or used to distinguish between 32/64 bit. It seems reasonable that a user doesn't need to specify it and then yaml2obj just picks something like 32 bit automatically. I think it's important that you don't require more than you absolutely have to, because otherwise it bloats the YAML and makes it hard to identify what's actually important to the test case in question.
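The "derived default with an explicit override" behaviour described in these comments can be sketched as follows. This is only an illustration: `YamlHeader`, `YamlObject`, `EmittedHeader` and `resolveHeader` are hypothetical names, and `std::optional` stands in for whatever optional type the patch actually uses. The value `0x01DF` is the 32-bit magic mentioned in the thread.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical YAML-side description: every header field is optional.
struct YamlHeader {
  std::optional<uint16_t> Magic;            // unset: fall back to 32-bit magic
  std::optional<uint16_t> NumberOfSections; // unset: derive from the contents
};

struct YamlObject {
  YamlHeader Header;
  std::vector<int> Sections; // stand-in for real section descriptions
};

// The values that would actually be written to the file header.
struct EmittedHeader {
  uint16_t Magic;
  uint16_t NumberOfSections;
};

// An explicit YAML value always wins, even when it is inconsistent with the
// contents; otherwise the value is derived from what the YAML describes.
EmittedHeader resolveHeader(const YamlObject &Obj) {
  EmittedHeader H;
  H.Magic = Obj.Header.Magic.value_or(0x01DF);
  H.NumberOfSections = Obj.Header.NumberOfSections.value_or(
      static_cast<uint16_t>(Obj.Sections.size()));
  return H;
}
```

With three sections and no explicit count, the emitted count is 3. Setting `NumberOfSections: 0` explicitly writes 0 while the three sections are still emitted, which is the deliberate inconsistency that malformed-object tests rely on.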

Addressed James's comments.

Harbormaster completed remote builds in B90791: Diff 326345. Feb 25 2021, 3:44 AM

Esme edited the summary of this revision. Feb 25 2021, 3:46 AM

Thank you so much for your comments! @jhenderson They are very useful.

I've updated the summary with a description of the common XCOFF file format.

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
55	There are 3 reserved signed numbers (-2, -1, 0) for sections. Thanks for your question, which reminded me to add them to IndexMap. The MaxSectionIndex that jasonliu defined in XCOFFObjectWriter.cpp is also INT16_MAX, so I stuck with that number. I am not sure whether XCOFF supports a larger number of sections. I will look into this.
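A minimal sketch of the section-count check discussed here, assuming the limit is `INT16_MAX` as stated above. `checkSectionCount` and the exact message wording are hypothetical; the message follows the review guidance (lower-case start, no trailing full stop, limit spelled out).

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <string>

// Assumption from the thread: section indices are signed 16-bit values, so the
// largest usable index is INT16_MAX.
constexpr int16_t MaxSectionIndex = std::numeric_limits<int16_t>::max();

// Returns an empty string on success, otherwise an error message that names
// the supported maximum.
std::string checkSectionCount(size_t NumSections) {
  if (NumSections > static_cast<size_t>(MaxSectionIndex))
    return "too many sections: " + std::to_string(NumSections) +
           " exceeds the maximum of " + std::to_string(MaxSectionIndex);
  return "";
}
```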

Esme added a child revision: D97656: [llvm-objcopy] Initial XCOFF32 support. Feb 28 2021, 7:07 PM

Esme added a child revision: D98003: [obj2yaml][XCOFF] Dump sections. Mar 4 2021, 8:01 PM

Update the parsing of address and flag of Section.

Harbormaster completed remote builds in B92610: Diff 328943. Mar 8 2021, 12:10 AM

Sorry for the delay in looking at this further. I haven't had a chance to look at most of this again, but here are a few smaller comments.

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
66–69	It would probably be good to use some enum values or named constants for these, e.g. something like: `namespace xcoff { enum { N_DEBUG = -2, N_ABS = -1, N_UNDEF = 0 } }`. This is similar to how special section indexes are handled for ELF.
75
79	LLVM style guide is to precalculate the number of sections, if the number cannot change within the loop, as per the suggestion inline. Also, prefer preincrement to postincrement.
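The two style points above (named constants for the reserved section numbers, and a precalculated loop bound with preincrement) might look like this in practice. `buildSectionIndexMap` is a hypothetical helper, and keying the reserved numbers by name is an assumption based on how the thread describes the index map at this stage.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Reserved section numbers, named as in the suggestion above.
enum ReservedSectionNum : int16_t { N_DEBUG = -2, N_ABS = -1, N_UNDEF = 0 };

// Hypothetical helper: map section names to 1-based section indices, with the
// reserved numbers registered in the initialiser list.
std::map<std::string, int16_t>
buildSectionIndexMap(const std::vector<std::string> &Names) {
  std::map<std::string, int16_t> IndexMap = {
      {"N_DEBUG", N_DEBUG}, {"N_ABS", N_ABS}, {"N_UNDEF", N_UNDEF}};
  // Precalculate the bound (E) once and use preincrement, per LLVM style.
  for (size_t I = 0, E = Names.size(); I != E; ++I)
    IndexMap[Names[I]] = static_cast<int16_t>(I + 1);
  return IndexMap;
}
```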

Addressed @jhenderson 's comments.

Harbormaster completed remote builds in B93205: Diff 329824. Mar 10 2021, 7:21 PM

shchenz mentioned this in D97186: [XCOFF][llvm-dwarfdump] support llvm-dwarfdump for XCOFF DWARF. Mar 16 2021, 6:17 AM

shchenz added a reviewer: Higuoxing.

Gently ping.

jhenderson added inline comments. Mar 18 2021, 2:27 AM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
58	It might be a good idea to break this function up into smaller pieces, perhaps one function per loop, so you'd have a function that assigns the section offsets and addresses, another function for the relocations, another for the symbols etc.
68–70	Can you do this in the `SectionIndexMap` initialisation list, or at least in the constructor if that's not possible?
82	Put a blank line before this line, to help break up this function. Same goes below: if you put a comment, followed immediately by an if statement or loop relating to that comment, add a blank line. Also, I think it's more common grammatically in code comments to write "Assign" rather than "Assigns" (i.e. use the imperative form). Same for "Calculates" (use "Calculate") below.
94	Is "FileOffsetToData" the actual XCOFF field name, or is it something else? If something else, I recommend using the real field name.
96–97	And check whether the comment can be reflowed to fit within 80 characters width.
103	Can there be more than one each of .text/.data/.bss? If so, are they all named the same? Are the names actually ".text", ".data" etc?
110	If I'm reading this code correctly, if a user has put their sections in an order like .text, .somethingelse, .data, where all sections are size 4 and the address of .text is 16, then the address of .data is going to be 24, not 20. Is that supposed to be the case?
116	Reading this code, it looks like you can't have in XCOFF relocations that aren't attached to a section. Is that correct?
148	You probably need to make the `Header` fields optional themselves, so that this code can distinguish between the case where the user has explicitly specified the value as `0` and the case where it is unspecified. The same likely goes for other structs like the Section. Perhaps also worth pulling `0x01DF` into a named constant somewhere.
149	Where the field name is self-explanatory, you don't need to have a comment. Same goes below.
175–177	Calculate the requested address value upfront, then use that value in both places, rather than repeat the logic to calculate it.
208	Same in similar messages below.
222	Probably best to make all these PaddingSize fields `int64_t` to prepare for 64-bit support.
225
272	At this point, it may be better to just use `write_zeros` to fill the whole auxiliary entry. Alternatively, emit an error if `NumberOfAuxEntries` is non-zero.
290	This sounds like it prevents a user from writing a 32-bit XCOFF file, but with a messed up magic field. Perhaps you should put an optional explicit format field in the YAML (defaulting to 32-bit XCOFF), and use that to determine the file format to use.
295	It seems like if you failed to assign addresses or indices, it may be dangerous to continue? This should probably bail out. Same probably goes for the other cases where an error can occur too.
llvm/lib/ObjectYAML/XCOFFYAML.cpp
122	In ELF, the `r_offset` field of a relocation specifies where the relocation will patch, and is relative to either the section start or file start, depending on the context. Often in our test cases, it doesn't really matter where the relocation is patching, only that there is a relocation, hence the offset field is optional there. It seems likely that this will be the case for XCOFF too?
123	Can XCOFF relocations have no symbol? If so, I think a 0 index value would be a reasonable default.
125	Is there a reasonable default for the relocation type? In ELF, there isn't really, so I think it's a required field.
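Two of the suggestions above (one function per layout step, and bailing out as soon as a step fails) can be sketched as below. This is not the patch's code: the struct, the function names, and the value of `MaxRawDataSize` are assumptions for illustration. The 10-byte relocation entry size matches `RelocationSerializationSize32` in XCOFF.h.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

struct Section {
  uint32_t Size = 0;
  uint32_t NumRelocations = 0;
  uint32_t FileOffsetToData = 0;
  uint32_t FileOffsetToRelocations = 0;
};

constexpr uint64_t MaxRawDataSize = UINT32_MAX; // assumed limit
constexpr uint64_t RelocationEntrySize = 10;    // 32-bit relocation entry

// Step 1: assign data offsets. Returns false if the object grows too large.
bool initSectionData(std::vector<Section> &Sections, uint64_t &CurrentOffset) {
  for (Section &S : Sections) {
    if (S.Size == 0)
      continue;
    S.FileOffsetToData = static_cast<uint32_t>(CurrentOffset);
    CurrentOffset += S.Size;
    if (CurrentOffset > MaxRawDataSize)
      return false;
  }
  return true;
}

// Step 2: assign relocation offsets after all section data.
bool initRelocations(std::vector<Section> &Sections, uint64_t &CurrentOffset) {
  for (Section &S : Sections) {
    if (S.NumRelocations == 0)
      continue;
    S.FileOffsetToRelocations = static_cast<uint32_t>(CurrentOffset);
    CurrentOffset += S.NumRelocations * RelocationEntrySize;
    if (CurrentOffset > MaxRawDataSize)
      return false;
  }
  return true;
}

// Driver: stop at the first failing step instead of continuing with a
// half-initialised layout.
bool assignOffsets(std::vector<Section> &Sections, uint64_t StartOffset) {
  uint64_t CurrentOffset = StartOffset;
  if (!initSectionData(Sections, CurrentOffset))
    return false;
  return initRelocations(Sections, CurrentOffset);
}
```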

Addressed jhenderson's comments.

Esme added inline comments. Mar 23 2021, 2:27 AM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
94	Yes, it's also defined in XCOFF.h. The name "FileOffsetToData" has been used in all XCOFF-related implementations; it corresponds to "s_scnptr" in the documentation.
103	Yes, multiple .text/.data/.bss sections are allowed. Applications use the Flags field instead of the Name field to determine a section type, so two sections of the same type may have different names, as introduced in Table 4.
110	Moving this calculation ahead of the calculation of FileOffsetToData gives the correct result.
116	Yes, the relocations are always attached to sections in XCOFF.
llvm/lib/ObjectYAML/XCOFFYAML.cpp
122	I agree that it's unfriendly to require users to provide the virtual address of the relocation. However, this item in XCOFF specifies the virtual address of the value that requires modification by the binder, and the offset to the data in the section is calculated as follows: offset_in_section = Relocation_Address - Section_Address. We can calculate the Section_Address from the YAML contents, but it is hard to determine the Relocation_Address. So this can hardly be optional?
129	It's reasonable to make the name optional and the flags required, since the flags are the unique identifier of the section type.
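The relation offset_in_section = Relocation_Address - Section_Address can be worked through with a small helper. `offsetInSection` is hypothetical, and the range check is an addition for reasoning about test inputs only; it is not something the thread says yaml2obj enforces.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical helper: compute the offset of a relocation target within its
// section. Returns std::nullopt when the address lies outside the section.
std::optional<uint32_t> offsetInSection(uint32_t RelocationAddress,
                                        uint32_t SectionAddress,
                                        uint32_t SectionSize) {
  if (RelocationAddress < SectionAddress ||
      RelocationAddress - SectionAddress >= SectionSize)
    return std::nullopt;
  return RelocationAddress - SectionAddress;
}
```

For an illustrative section at address 0x8 with size 0x8, a relocation address of 0xC gives an offset of 4, while 0x3A falls outside the section.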

Harbormaster completed remote builds in B95205: Diff 332575. Mar 23 2021, 2:32 AM

Only the .text, .data, .tdata, and STYP_DWARF sections have relocations.

Harbormaster completed remote builds in B95407: Diff 332866. Mar 23 2021, 10:21 PM

jhenderson added inline comments. Mar 24 2021, 2:18 AM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
56–57	Why not just use `Obj.Header` and `Obj.Sections` directly in the referencing code? Why do you need these member variables separately to `Obj`?
63–64	Is this a fundamental restriction of the XCOFF file format, or is it just what should happen? Is it possible to create (potentially by hand) a section with relocations if it isn't one of these types? In general, to allow for testing of bad input paths etc, you want to allow as much flexibility in yaml2obj as possible. As such, it isn't yaml2obj's place to restrict what can be done, as long as it can physically represent what was requested.
77	`MaxRawDataSize` is an internal variable, right, not an XCOFF spec defined value? If `MaxRawDataSize` doesn't directly appear in the spec, don't mention it to the user in an error message. Instead, use general terms like "the maximum size permitted for XXX" (where XXX is the thing that is restricted, e.g. the object size).
103	It looks like from the spec that the names are merely conventions, so technically the names could be other things. It's probably fine to infer that if a section is text, it is called .text, unless otherwise specified by the YAML (and vice versa), but this bit applies to the section type, not the name, so we should refer to them by type rather than name (i.e. just "text", "data", "bss", not ".text" etc).
120	Is it definitely the address here that should be being aligned, or the offset? The two are different concepts, and the align function appears to align the offset, not the address.
136
191–192	Here, it's probably worth a comment saying which is the virtual address and which the physical address, like the inline edit, for example (which may be the wrong way around).
llvm/lib/ObjectYAML/XCOFFYAML.cpp
122	Could 0 be a reasonable default? If not, it's fine. I'm just wondering if it actually matters in all cases what the address is.
129	Sounds good to me.
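The address/offset distinction raised for line 120 can be made concrete with a short sketch. `alignUp` re-implements the rounding that `llvm::alignTo` provides so the example is self-contained. `SectionLayout`, `placeSection` and the alignment value used in the usage note are hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Round Value up to the next multiple of Align (Align must be non-zero).
constexpr uint64_t alignUp(uint64_t Value, uint64_t Align) {
  return (Value + Align - 1) / Align * Align;
}

struct SectionLayout {
  uint64_t Address;          // virtual address: where the section is loaded
  uint64_t FileOffsetToData; // file offset: where its bytes live in the file
};

// Only the file offset is aligned; the address is carried through unchanged.
SectionLayout placeSection(uint64_t Address, uint64_t CurrentFileOffset,
                           uint64_t DataAlign) {
  return {Address, alignUp(CurrentFileOffset, DataAlign)};
}
```

For example, `placeSection(0x10, 0x66, 4)` keeps the address at 0x10 and moves the data offset from 0x66 up to 0x68.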

Addressed James's comments.

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
56–57	`Obj` can't be changed in assignAddressesAndIndices(), because it will be used during writing. The value specified in the YAML has a higher priority to be written than the calculated value derived from the contents. Only Header and Sections have these number/offset/address fields which need to be calculated, so I use InitSections and InitFileHdr to keep these calculated values.
120	The offset of data for each section is what I want to align. The comments may be confusing here.
llvm/lib/ObjectYAML/XCOFFYAML.cpp
122	As I see, 0 is not a reasonable default.

Harbormaster completed remote builds in B95630: Diff 333207. Mar 24 2021, 10:12 PM

Esme added inline comments. Mar 24 2021, 10:16 PM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
63–64	It is just what should happen. Thanks for the advice, it makes sense to me to remove the restriction to allow more flexibility.

Some more minor comments. I'll take a look at the tests next time.

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
68	Sorry, I might have confused you with my previous comment. `MaxRawDataSize` is being compared to `CurrentOffset`, so it's not the size of the relocations/sections that is being constrained, if I read it correctly. Perhaps you could change the message to something like "maximum object size of XXX exceeded when writing relocation data". What do you think?
91	How about this?
137–138	Same as relocation comment above.

FYI, I will be off for just over two weeks from Thursday. Please don't feel the need to wait on further comments from me, if there are other yaml2obj developers that are happy with this change. Same goes for your other changes for llvm-objcopy and obj2yaml.

Higuoxing added inline comments. Mar 29 2021, 8:04 PM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
14	Do we need this header file? I guess what we need is `DenseMap` ?
128
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
25–30	It looks like `Value`, `Type`, `StorageClass` and `NumberOfAuxEntries` are optional too. Could you please add some test cases for them as well?

Thank you for your review! @jhenderson and @Higuoxing.
Looking forward to more comments from @hubert.reinterpretcast and @MaskRay.

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
68	Very good suggestion, thanks!

Harbormaster completed remote builds in B96249: Diff 334053. Mar 29 2021, 11:24 PM

jhenderson added inline comments. Mar 30 2021, 12:23 AM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
128	One better: `const XCOFFYAML::Symbol &YamlSym`

Added const

Harbormaster completed remote builds in B96258: Diff 334069. Mar 30 2021, 1:33 AM

Gentle ping.

Esme added a child revision: D100375: [yaml2obj] Enable support for parsing 64-bit XCOFF. Apr 13 2021, 3:24 AM

Added support for writing symbol name to string table.

Harbormaster completed remote builds in B98635: Diff 337365. Apr 14 2021, 1:23 AM

jhenderson added inline comments. Apr 21 2021, 3:13 AM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
14	Nit: LLVM usually has a new line between the licence header and the `#include` list (based on a quick look at 3 or 4 files).
68	You can drop the parentheses. Do you need to allow space for a null terminator, or does `NameSize` take that into account?
127–129	How about simply `return initRelocations(CurrentOffset);`?
294	Nit: the suggested inline edit sounds a bit better to me.
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
2	Nit: newer tests I'm involved with at least tend to use '##' for comments, to make them stand out from lit and FileCheck lines.
12	Is there an enum you could use to represent this set of flags? It would be preferable to be able to write either of the following (flag values are placeholders): `Flags: Exec` or probably `Flags: [Exec, Alloc]`, although `Flags: 0x20` (or possibly `Flags: [0x20]`) should probably still be permitted.
13	It seems to me you could get away with much less data in this section (probably a half-dozen bytes)? I don't think this needs to be a real object for this test case? Same below.

Addressed James's comments.

Herald added a subscriber: nemanjai. Apr 25 2021, 5:54 PM

Esme added inline comments. Apr 25 2021, 5:54 PM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
68	Well, it seems that NameSize should have taken this into account.
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
12	Yes, we have the enum SectionTypeFlags. How about marking this as a TODO for follow-up work? Because this will have an impact on other tools, like obj2yaml. enum SectionTypeFlags : int32_t { STYP_PAD = 0x0008, STYP_DWARF = 0x0010, STYP_TEXT = 0x0020, STYP_DATA = 0x0040, STYP_BSS = 0x0080, STYP_EXCEPT = 0x0100, STYP_INFO = 0x0200, ... };

Harbormaster completed remote builds in B100845: Diff 340407. Apr 25 2021, 5:55 PM

Used the enum SectionTypeFlags to represent the set of flags.

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
12	After double checking, this does not seem to affect other tools. Thanks for your input.

Harbormaster completed remote builds in B100855: Diff 340421. Apr 25 2021, 8:20 PM

I've not had time to review test coverage yet, and I don't know XCOFF at all, so can't comment on the correctness either. You'll need others' input for the latter.

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
12	As these are flags, you'll need the ability to specify multiple flags, just like for ELF sections. Looking at this, I'm guessing you don't currently have that option?
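Supporting multiple flags amounts to treating the field as a bit set. The sketch below uses the `SectionTypeFlags` values quoted earlier in the thread (the original enum has further entries that were elided there). `combineFlags` and `flagNames` are hypothetical helpers, not the YAML mapping code from the patch.

```cpp
#include <cassert>
#include <cstdint>
#include <string>
#include <vector>

// Values as quoted in the thread; the elided entries are left out here.
enum SectionTypeFlags : int32_t {
  STYP_PAD = 0x0008,
  STYP_DWARF = 0x0010,
  STYP_TEXT = 0x0020,
  STYP_DATA = 0x0040,
  STYP_BSS = 0x0080,
  STYP_EXCEPT = 0x0100,
  STYP_INFO = 0x0200,
};

// YAML list such as [STYP_TEXT, STYP_DATA] to the raw field value.
int32_t combineFlags(const std::vector<SectionTypeFlags> &Flags) {
  int32_t Result = 0;
  for (SectionTypeFlags F : Flags)
    Result |= F;
  return Result;
}

// Raw field value back to the list of flag names that are set.
std::vector<std::string> flagNames(int32_t Value) {
  static const struct {
    SectionTypeFlags Flag;
    const char *Name;
  } Table[] = {
      {STYP_PAD, "STYP_PAD"},       {STYP_DWARF, "STYP_DWARF"},
      {STYP_TEXT, "STYP_TEXT"},     {STYP_DATA, "STYP_DATA"},
      {STYP_BSS, "STYP_BSS"},       {STYP_EXCEPT, "STYP_EXCEPT"},
      {STYP_INFO, "STYP_INFO"},
  };
  std::vector<std::string> Names;
  for (const auto &Entry : Table)
    if (Value & Entry.Flag)
      Names.push_back(Entry.Name);
  return Names;
}
```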

Added the ability to specify multiple flags.

Harbormaster completed remote builds in B101341: Diff 341089. Apr 28 2021, 1:00 AM

I've taken another look and think you need a fair bit more testing.

You need to show that all the fields can be specified explicitly. Currently, you only do it for a subset of the fields.
You probably should show that all the flags are supported.
You need to test your error paths, to show that the errors are reported properly.
If you don't need the string table support in the initial patch (and by my understanding you don't), you can move all the related logic into another independent patch, and then ensure it is properly tested there.
I think you need testing for undef and abs symbols too.
I think you need testing for non data/text/bss sections (with their zero addresses).

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
2
4–5	You can do this all at once.

Addressed comments.

Harbormaster completed remote builds in B105144: Diff 346328. May 18 2021, 8:17 PM

In D95505#2767644, @Esme wrote:

Addressed comments.

Hi @Esme. Have you attempted to address all my comments? It doesn't look like you have. If you update a patch and haven't finished addressing all comments, please make it clear from the comment what you still plan to do.

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
24	Be consistent: above you've quoted the section data, and here you haven't. Pick one style (I'd recommend quoting).
25–26	Any particular reason these aren't all one section?
llvm/test/tools/yaml2obj/XCOFF/full-contents.yaml
68	Why are there blank lines here and below? Does this test actually pass? I was under the impression that `CHECK-NEXT:` without any contents was an error.

Esme updated this revision to Diff 347565. May 24 2021, 8:46 PM

Harbormaster completed remote builds in B106030: Diff 347565. May 24 2021, 8:46 PM

In D95505#2750262, @jhenderson wrote:

I've taken another look and think you need a fair bit more testing.

You need to show that all the fields can be specified explicitly. Currently, you only do it for a subset of the fields.

You probably should show that all the flags are supported.

You need to test your error paths, to show that the errors are reported properly.

If you don't need the string table support in the initial patch (and by my understanding you don't), you can move all the related logic into another independent patch, and then ensure it is properly tested there.

I think you need testing for undef and abs symbols too.

I think you need testing for non data/text/bss sections (with their zero addresses).

Thanks! @jhenderson

The test file llvm/test/tools/yaml2obj/XCOFF/full-contents.yaml is added.
Addressed.
There are two types of errors: one is exceeding the maximum size/index, and the other is writing with offset overlap/redundant data. For the first type of error, it is difficult to write such a large file to show the error message; for the second type, in order to allow users to write explicitly specified values (allowing for invalid values), our offsets are derived from the contents, so these errors should not occur unless yaml2obj itself has bugs.
I think we should support the string table in the patch, and I have added the long symbol name to test it.
Addressed.
Addressed.

Esme added a child revision: D102603: [llvm-objdump] Print the DEBUG type under `--section-headers`. May 24 2021, 9:22 PM

In D95505#2778704, @Esme wrote:

There are two types of errors: one is exceeding the maximum size/index, and the other is writing with offset overlap/redundant data. For the first type of error, it is difficult to write such a large file to show the error message; for the second type, in order to allow users to write explicitly specified values (allowing for invalid values), our offsets are derived from the contents, so these errors should not occur unless yaml2obj itself has bugs.

Fair enough. Could you at least test manually the max section limit case. It shouldn't be too hard to do (write a small script to generate such a case), please?

I think we should support the string table in the patch, and I have added the long symbol name to test it.

Why do you think string table support should be in this patch? Why not split the patch up into smaller pieces to make it easier for reviewers to review it? There's no need for the behaviour to be fully functional in the first case, especially as by my understanding not all XCOFF objects have string tables.

Leave the StringTable as follow-up work.

Could you at least test manually the max section limit case.

I tested the case with a python script generating the yaml doc, and the error message was output as expected.
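The thread mentions a Python script for this manual test, which is not shown. The same idea, sketched in C++ to match the rest of the code here, is to emit a YAML document with more sections than the limit allows and feed it to yaml2obj. The YAML keys used below (`FileHeader`, `MagicNumber`, `Sections`, `Name`) and the generated section names are assumptions about the schema, not copied from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <sstream>
#include <string>

// Hypothetical generator: build a YAML document describing NumSections
// sections. The key names are assumed; adjust them to the real schema.
std::string makeManySectionsYaml(size_t NumSections) {
  std::ostringstream OS;
  OS << "--- !XCOFF\n"
     << "FileHeader:\n"
     << "  MagicNumber: 0x01DF\n"
     << "Sections:\n";
  // Names such as ".s32767" stay within the 8-byte section name field.
  for (size_t I = 0; I != NumSections; ++I)
    OS << "  - Name: .s" << I << "\n";
  return OS.str();
}
```

Writing the result of `makeManySectionsYaml(32768)` to a file and running yaml2obj on it should exercise the section-limit error path, assuming the limit is INT16_MAX as discussed earlier in the review.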

Harbormaster completed remote builds in B106218: Diff 347855. May 26 2021, 12:06 AM

LGTM from a yaml2obj standpoint, but it would be a good idea to rope in someone with XCOFF experience to confirm they are happy with the file format writing before committing this.

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
39	Does this symbol test anything unique any more, or can we drop it?

This revision is now accepted and ready to land. May 26 2021, 12:52 AM

Esme added a child revision: D103146: [NFC][XCOFF] Use yaml2obj in llvm-objdump/XCOFF/section-headers.test instead of binary files. May 26 2021, 2:54 AM

Hi @jasonliu @DiggerLin @sfertile @shchenz @hubert.reinterpretcast, I would appreciate it if you could review this patch and check if the file format is written properly in your opinion. Thanks in advance!

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
39	It was used to test StringTable before, but now it's useless. I will drop this in the next update.

Drop the useless Symbol test.
Add normalization/denormalization for the Section Flags set.

Harbormaster completed remote builds in B106861: Diff 348719. May 30 2021, 7:26 PM

shchenz added inline comments. May 30 2021, 8:18 PM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
242	We don't have to call `write_zeros` if `PaddingSize` is 0?
265	What about a symbol that does not belong to any section? For example, undefined external symbols?
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
17	Is it ok that the relocation address is not in the range of `.data` section?
20	use another section name that is not DWARF specific?
llvm/test/tools/yaml2obj/XCOFF/full-contents.yaml
37	Same as above

Addressed Zheng's comments.
Set more fields optional and modified the corresponding tests.

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
265	The spec describes this field as follows: it specifies a section number associated with one of the following symbols: -2 specifies N_DEBUG, a special symbolic debugging symbol; -1 specifies N_ABS, an absolute symbol (the symbol has a value but is not relocatable); 0 specifies N_UNDEF, an undefined external symbol; any other value specifies the section number where the symbol was defined. As per the spec, there are 3 reserved section numbers for symbols that are not defined in sections. The SectionName in the symbol entry is currently required; however, we could also make the field optional and use N_UNDEF (maybe...) as the default section. Hmm... I am not sure which strategy is more reasonable. What do you think?
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
17	In general, yaml2obj will not report an error even when the user explicitly specifies invalid values, which allows for things like error handling testing. Also, after getting more familiar with yaml2obj, I think it's reasonable to make more fields optional; omitted values are filled in with a default of zero or with values derived from the contents. So I have made the relocation address optional, with 0 as its default value.

Harbormaster completed remote builds in B106875: Diff 348738. May 31 2021, 12:59 AM

shchenz added inline comments. May 31 2021, 1:38 AM

llvm/lib/ObjectYAML/XCOFFEmitter.cpp
265	I think `SectionName` for the reserved section number `N_UNDEF` is not defined because we don't need it; we only need a section number. So making the `SectionName` attribute optional for a symbol makes more sense to me. You cannot say that a section with the section name `N_UNDEF` is the `N_UNDEF` (section number == 0) section. We can define a text section named `N_UNDEF` by using a section attribute like: `__attribute__((section ("N_UNDEF"))) int foo2(void) { }`
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
17	Do you mean we are setting the address to `0x3A` on purpose? The address of the `.data` section is 0x8 and its size is 0x8, so if the relocation is valid, the address the relocation entry wants to resolve should be between 0x8 and 0x10.
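One possible reading of the optional `SectionName` proposal for symbols is sketched below: an omitted name resolves to `N_UNDEF`, and a present name is looked up among the real sections only. `resolveSectionNumber` is a hypothetical helper and the behaviour for unknown names is an assumption. The thread does not show how the final patch resolved this.

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <optional>
#include <string>

enum ReservedSectionNum : int16_t { N_DEBUG = -2, N_ABS = -1, N_UNDEF = 0 };

// Hypothetical resolver. An omitted SectionName yields N_UNDEF; a name that
// matches no section yields std::nullopt so the caller can report an error.
// A real section that happens to be named "N_UNDEF" keeps its own index.
std::optional<int16_t>
resolveSectionNumber(const std::optional<std::string> &SectionName,
                     const std::map<std::string, int16_t> &SectionIndexMap) {
  if (!SectionName)
    return static_cast<int16_t>(N_UNDEF);
  auto It = SectionIndexMap.find(*SectionName);
  if (It == SectionIndexMap.end())
    return std::nullopt;
  return It->second;
}
```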

Address comment.

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml
17	I just set a random value, as this value isn't relevant for testing purposes here. Use a valid address now.

Harbormaster completed remote builds in B106883: Diff 348754. May 31 2021, 2:38 AM

LGTM too. Please wait for several days in case other reviewers have comments. Thanks for enabling this tool for AIX.

Esme mentioned this in D103455: [yaml2obj] Add support for writing the long symbol name. Jun 1 2021, 6:00 AM

This revision was landed with ongoing or failed builds. Jun 6 2021, 9:15 PM

Closed by commit rG50bb1b930dbc: [yaml2obj] Initial the support of yaml2obj for 32-bit XCOFF. (authored by Esme).

This revision was automatically updated to reflect the committed changes.

Esme added a commit: rG50bb1b930dbc: [yaml2obj] Initial the support of yaml2obj for 32-bit XCOFF.

thakis added a subscriber: thakis. Jun 7 2021, 5:21 AM

thakis added inline comments.

llvm/utils/gn/secondary/llvm/lib/ObjectYAML/BUILD.gn
29	(FYI, you don't need to update the files in llvm/utils/gn when you add files. Those are unsupported build files, and also for simple changes like this they are updated automatically based on the CMakeLists.txt files by a bot. If you _do_ update them, please add a trailing `,` -- but it's better to not update them since it's less work for you and the bot will update them correctly :) )

This patch introduces https://lab.llvm.org/buildbot/#/builders/85/builds/4886

llvm-project/llvm/lib/ObjectYAML/XCOFFEmitter.cpp:67:16: runtime error: null pointer passed as argument 2, which is declared to never be null
/usr/lib/gcc/x86_64-linux-gnu/8/../../../../x86_64-linux-gnu/include/string.h:43:28: note: nonnull attribute specified here
    #0 0x45c3df in (anonymous namespace)::XCOFFWriter::writeXCOFF() /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/ObjectYAML/XCOFFEmitter.cpp
    #1 0x45b0b1 in llvm::yaml::yaml2xcoff(llvm::XCOFFYAML::Object&, llvm::raw_ostream&, llvm::function_ref<void (llvm::Twine const&)>) /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/ObjectYAML/XCOFFEmitter.cpp:311:17
    #2 0x30a181 in llvm::yaml::convertYAML(llvm::yaml::Input&, llvm::raw_ostream&, llvm::function_ref<void (llvm::Twine const&)>, unsigned int, unsigned long) /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/llvm/lib/ObjectYAML/yaml2obj.cpp:48:14
    #3 0x306322 in main /b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm-project/llvm/tools/yaml2obj/yaml2obj.cpp:136:8
    #4 0x7f38e52b709a in __libc_start_main (/lib/x86_64-linux-gnu/libc.so.6+0x2409a)
    #5 0x2eafd9 in _start (/b/sanitizer-x86_64-linux-bootstrap-ubsan/build/llvm_build_ubsan/bin/yaml2obj+0x2eafd9)

CC @eugenis
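The report above says a null pointer reached argument 2 of a function declared in string.h. A common way to hit this is a memcpy-style copy from an empty buffer whose data pointer is null. The sketch below assumes that is the case here and shows one way to guard such a copy into a fixed-width name field. `writeName` is hypothetical, and the actual fix is not shown in this thread.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>

constexpr size_t NameSize = 8; // width of the fixed name field, as in XCOFF.h

// Copy at most NameSize bytes of Src into a zero-filled fixed-width field.
// Src may be null when Len is 0 (for example, an empty string view).
void writeName(char (&Dest)[NameSize], const char *Src, size_t Len) {
  std::memset(Dest, 0, NameSize);
  if (Src == nullptr || Len == 0)
    return; // nothing to copy; never hand a null pointer to memcpy
  std::memcpy(Dest, Src, Len < NameSize ? Len : NameSize);
}
```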

In D95505#2803450, @vitalybuka wrote:

This patch introduces https://lab.llvm.org/buildbot/#/builders/85/builds/4886

The issue has been solved by rG310d2b4957c8, thanks.

llvm/utils/gn/secondary/llvm/lib/ObjectYAML/BUILD.gn
29	Thanks for the information. I will not update such files in future patches.

Esme mentioned this in D100375: [yaml2obj] Enable support for parsing 64-bit XCOFF. Jun 9 2021, 1:48 AM

Esme mentioned this in rG657aa3a7631b: [yaml2obj] Add support for writing the long symbol name. Jun 20 2021, 10:11 PM

Esme removed a child revision: D98003: [obj2yaml][XCOFF] Dump sections. Sep 9 2021, 10:24 PM

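Revision Contents

Path                                                    Size
llvm/include/llvm/BinaryFormat/XCOFF.h                  2 lines
llvm/include/llvm/ObjectYAML/ObjectYAML.h               2 lines
llvm/include/llvm/ObjectYAML/XCOFFYAML.h                53 lines
llvm/include/llvm/ObjectYAML/yaml2obj.h                 5 lines
llvm/lib/ObjectYAML/CMakeLists.txt                      1 line
llvm/lib/ObjectYAML/ObjectYAML.cpp                      3 lines
llvm/lib/ObjectYAML/XCOFFEmitter.cpp                    315 lines
llvm/lib/ObjectYAML/XCOFFYAML.cpp                       79 lines
llvm/lib/ObjectYAML/yaml2obj.cpp                        2 lines
llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml           164 lines
llvm/test/tools/yaml2obj/XCOFF/full-contents.yaml       122 lines
llvm/utils/gn/secondary/llvm/lib/ObjectYAML/BUILD.gn    1 line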

Diff 350174

llvm/include/llvm/BinaryFormat/XCOFF.h

... 27 lines hidden ...
 constexpr size_t NameSize = 8;
 constexpr size_t SymbolTableEntrySize = 18;
 constexpr size_t RelocationSerializationSize32 = 10;
 constexpr uint16_t RelocOverflow = 65535;
 constexpr uint8_t AllocRegNo = 31;

 enum ReservedSectionNum : int16_t { N_DEBUG = -2, N_ABS = -1, N_UNDEF = 0 };

+enum MagicNumber : uint16_t { XCOFF32 = 0x01DF, XCOFF64 = 0x01F7 };
+
 // x_smclas field of x_csect from system header: /usr/include/syms.h
 /// Storage Mapping Class definitions.
 enum StorageMappingClass : uint8_t {
   // READ ONLY CLASSES
   XMC_PR = 0, ///< Program Code
   XMC_RO = 1, ///< Read Only Constant
   XMC_DB = 2, ///< Debug Dictionary Table
   XMC_GL = 6, ///< Global Linkage (Interfile Interface Code)
... 376 lines hidden ...

llvm/include/llvm/ObjectYAML/ObjectYAML.h

... 9 lines hidden ...
 #define LLVM_OBJECTYAML_OBJECTYAML_H

 #include "llvm/ObjectYAML/ArchiveYAML.h"
 #include "llvm/ObjectYAML/COFFYAML.h"
 #include "llvm/ObjectYAML/ELFYAML.h"
 #include "llvm/ObjectYAML/MachOYAML.h"
 #include "llvm/ObjectYAML/MinidumpYAML.h"
 #include "llvm/ObjectYAML/WasmYAML.h"
+#include "llvm/ObjectYAML/XCOFFYAML.h"
 #include "llvm/Support/YAMLTraits.h"
 #include <memory>

 namespace llvm {
 namespace yaml {

 class IO;

 struct YamlObjectFile {
   std::unique_ptr<ArchYAML::Archive> Arch;
   std::unique_ptr<ELFYAML::Object> Elf;
   std::unique_ptr<COFFYAML::Object> Coff;
   std::unique_ptr<MachOYAML::Object> MachO;
   std::unique_ptr<MachOYAML::UniversalBinary> FatMachO;
   std::unique_ptr<MinidumpYAML::Object> Minidump;
   std::unique_ptr<WasmYAML::Object> Wasm;
+  std::unique_ptr<XCOFFYAML::Object> Xcoff;
 };

 template <> struct MappingTraits<YamlObjectFile> {
   static void mapping(IO &IO, YamlObjectFile &ObjectFile);
 };

 } // end namespace yaml
 } // end namespace llvm

 #endif // LLVM_OBJECTYAML_OBJECTYAML_H

llvm/include/llvm/ObjectYAML/XCOFFYAML.h

... 17 lines hidden ...

 namespace llvm {
 namespace XCOFFYAML {

 struct FileHeader {
   llvm::yaml::Hex16 Magic;
   uint16_t NumberOfSections;
   int32_t TimeStamp;
-  llvm::yaml::Hex32 SymbolTableOffset; // File offset to symbol table.
-  int32_t NumberOfSymTableEntries;
+  llvm::yaml::Hex64 SymbolTableOffset;
+  uint32_t NumberOfSymTableEntries;
   uint16_t AuxHeaderSize;
   llvm::yaml::Hex16 Flags;
 };

+struct Relocation {
+  llvm::yaml::Hex64 VirtualAddress;
+  llvm::yaml::Hex64 SymbolIndex;
+  llvm::yaml::Hex8 Info;

MaskRay (Not Done): Does this name lose the "information" semantics of `r_rsize`?

+  llvm::yaml::Hex8 Type;
+};
+
+struct Section {
+  StringRef SectionName;
+  llvm::yaml::Hex64 Address;

MaskRay (Not Done): uint64_t to make it 64-bit ready? In ELFYAML.h, we just use the larger type for 32-bit and 64-bit so that no separate definitions are needed.

+  llvm::yaml::Hex64 Size;
+  llvm::yaml::Hex64 FileOffsetToData;
+  llvm::yaml::Hex64 FileOffsetToRelocations;
+  llvm::yaml::Hex64 FileOffsetToLineNumbers; // Line number pointer. Not supported yet.
+  llvm::yaml::Hex16 NumberOfRelocations;
+  llvm::yaml::Hex16 NumberOfLineNumbers; // Line number counts. Not supported yet.
+  uint32_t Flags;
+  yaml::BinaryRef SectionData;
+  std::vector<Relocation> Relocations;
+};
+
 struct Symbol {
   StringRef SymbolName;
-  llvm::yaml::Hex32 Value; // Symbol value; storage class-dependent.
+  llvm::yaml::Hex64 Value; // Symbol value; storage class-dependent.
   StringRef SectionName;
   llvm::yaml::Hex16 Type;
   XCOFF::StorageClass StorageClass;
-  uint8_t NumberOfAuxEntries; // Number of auxiliary entries
+  uint8_t NumberOfAuxEntries;

jhenderson (Not Done): Nit: missing trailing full stop for this comment. But really, I'm not sure this comment adds anything, as the name is pretty clear to me.

 };

 struct Object {
   FileHeader Header;
+  std::vector<Section> Sections;
   std::vector<Symbol> Symbols;
   Object();
 };
 } // namespace XCOFFYAML
 } // namespace llvm

 LLVM_YAML_IS_SEQUENCE_VECTOR(XCOFFYAML::Symbol)
+LLVM_YAML_IS_SEQUENCE_VECTOR(XCOFFYAML::Relocation)
+LLVM_YAML_IS_SEQUENCE_VECTOR(XCOFFYAML::Section)

 namespace llvm {
 namespace yaml {

+template <> struct ScalarBitSetTraits<XCOFF::SectionTypeFlags> {
+  static void bitset(IO &IO, XCOFF::SectionTypeFlags &Value);
+};
+
 template <> struct ScalarEnumerationTraits<XCOFF::StorageClass> {
   static void enumeration(IO &IO, XCOFF::StorageClass &Value);
 };

 template <> struct MappingTraits<XCOFFYAML::FileHeader> {
   static void mapping(IO &IO, XCOFFYAML::FileHeader &H);
 };

-template <> struct MappingTraits<XCOFFYAML::Object> {
-  static void mapping(IO &IO, XCOFFYAML::Object &Obj);
-};
-
 template <> struct MappingTraits<XCOFFYAML::Symbol> {
   static void mapping(IO &IO, XCOFFYAML::Symbol &S);
 };

+template <> struct MappingTraits<XCOFFYAML::Relocation> {
+  static void mapping(IO &IO, XCOFFYAML::Relocation &R);
+};
+
+template <> struct MappingTraits<XCOFFYAML::Section> {
+  static void mapping(IO &IO, XCOFFYAML::Section &Sec);
+};
+
+template <> struct MappingTraits<XCOFFYAML::Object> {
+  static void mapping(IO &IO, XCOFFYAML::Object &Obj);
+};
+
 } // namespace yaml
 } // namespace llvm

 #endif // LLVM_OBJECTYAML_XCOFFYAML_H

llvm/include/llvm/ObjectYAML/yaml2obj.h

... 34 lines hidden ...
 namespace MinidumpYAML {
 struct Object;
 }

 namespace WasmYAML {
 struct Object;
 }

+namespace XCOFFYAML {
+struct Object;
+}
+
 namespace ArchYAML {
 struct Archive;
 }

 namespace yaml {
 class Input;
 struct YamlObjectFile;

 using ErrorHandler = llvm::function_ref<void(const Twine &Msg)>;

 bool yaml2archive(ArchYAML::Archive &Doc, raw_ostream &Out, ErrorHandler EH);
 bool yaml2coff(COFFYAML::Object &Doc, raw_ostream &Out, ErrorHandler EH);
 bool yaml2elf(ELFYAML::Object &Doc, raw_ostream &Out, ErrorHandler EH,
               uint64_t MaxSize);
 bool yaml2macho(YamlObjectFile &Doc, raw_ostream &Out, ErrorHandler EH);
 bool yaml2minidump(MinidumpYAML::Object &Doc, raw_ostream &Out,
                    ErrorHandler EH);
 bool yaml2wasm(WasmYAML::Object &Doc, raw_ostream &Out, ErrorHandler EH);
+bool yaml2xcoff(XCOFFYAML::Object &Doc, raw_ostream &Out, ErrorHandler EH);

 bool convertYAML(Input &YIn, raw_ostream &Out, ErrorHandler ErrHandler,
                  unsigned DocNum = 1, uint64_t MaxSize = UINT64_MAX);

 /// Convenience function for tests.
 std::unique_ptr<object::ObjectFile>
 yaml2ObjectFile(SmallVectorImpl<char> &Storage, StringRef Yaml,
                 ErrorHandler ErrHandler);

 } // namespace yaml
 } // namespace llvm

 #endif

llvm/lib/ObjectYAML/CMakeLists.txt

... 12 lines hidden, within: add_llvm_component_library(LLVMObjectYAML
   ELFYAML.cpp
   MachOEmitter.cpp
   MachOYAML.cpp
   ObjectYAML.cpp
   MinidumpEmitter.cpp
   MinidumpYAML.cpp
   WasmEmitter.cpp
   WasmYAML.cpp
+  XCOFFEmitter.cpp
   XCOFFYAML.cpp
   YAML.cpp
   yaml2obj.cpp

   ADDITIONAL_HEADER_DIRS
   ${LLVM_MAIN_INCLUDE_DIR}/llvm/ObjectYAML

   LINK_COMPONENTS
   BinaryFormat
   Object
   Support
   DebugInfoCodeView
   MC
   )

llvm/lib/ObjectYAML/ObjectYAML.cpp

... 53 lines hidden, within: if (IO.mapTag("!Arch")) {
       MappingTraits<MachOYAML::UniversalBinary>::mapping(IO,
                                                          *ObjectFile.FatMachO);
     } else if (IO.mapTag("!minidump")) {
       ObjectFile.Minidump.reset(new MinidumpYAML::Object());
       MappingTraits<MinidumpYAML::Object>::mapping(IO, *ObjectFile.Minidump);
     } else if (IO.mapTag("!WASM")) {
       ObjectFile.Wasm.reset(new WasmYAML::Object());
       MappingTraits<WasmYAML::Object>::mapping(IO, *ObjectFile.Wasm);
+    } else if (IO.mapTag("!XCOFF")) {
+      ObjectFile.Xcoff.reset(new XCOFFYAML::Object());
+      MappingTraits<XCOFFYAML::Object>::mapping(IO, *ObjectFile.Xcoff);
     } else if (const Node *N = In.getCurrentNode()) {
       if (N->getRawTag().empty())
         IO.setError("YAML Object File missing document type tag!");
       else
         IO.setError("YAML Object File unsupported document type tag '" +
                     N->getRawTag() + "'!");
     }
   }
 }

llvm/lib/ObjectYAML/XCOFFEmitter.cpp

This file was added.

//===- yaml2xcoff - Convert YAML to a xcoff object file -------------------===//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//===----------------------------------------------------------------------===//
///
/// \file
/// The xcoff component of yaml2obj.
///
//===----------------------------------------------------------------------===//

#include "llvm/ADT/DenseMap.h"

HiguoxingUnsubmitted

Not Done

Do we need this header file? I guess what we need is `DenseMap`?

jhendersonUnsubmitted

Not Done

Nit: LLVM usually has a new line between the licence header and the #include list (based on a quick look at 3 or 4 files).

#include "llvm/BinaryFormat/XCOFF.h"
#include "llvm/Object/XCOFFObjectFile.h"
#include "llvm/ObjectYAML/ObjectYAML.h"
#include "llvm/ObjectYAML/yaml2obj.h"
#include "llvm/Support/EndianStream.h"
#include "llvm/Support/raw_ostream.h"
#include "llvm/Support/LEB128.h"

using namespace llvm;

namespace {

constexpr unsigned DefaultSectionAlign = 4;
constexpr int16_t MaxSectionIndex = INT16_MAX;
constexpr uint32_t MaxRawDataSize = UINT32_MAX;

class XCOFFWriter {
public:
  XCOFFWriter(XCOFFYAML::Object &Obj, raw_ostream &OS, yaml::ErrorHandler EH)
      : Obj(Obj), W(OS, support::big), ErrHandler(EH) {
    Is64Bit = Obj.Header.Magic == XCOFF::XCOFF64;
  }
  bool writeXCOFF();

private:
  bool initFileHeader(uint64_t CurrentOffset);
  bool initSectionHeader(uint64_t &CurrentOffset);
  bool initRelocations(uint64_t &CurrentOffset);
  bool assignAddressesAndIndices();
  void writeFileHeader();
  void writeSectionHeader();
  bool writeSectionData();
  bool writeRelocations();
  bool writeSymbols();

  XCOFFYAML::Object &Obj;
  bool Is64Bit = false;
  support::endian::Writer W;
  yaml::ErrorHandler ErrHandler;
  uint64_t StartOffset;
  // Map the section name to its corresponding section index.

jhendersonUnsubmitted

Not Done

Two questions:

1) Why is this a signed type?
2) What is the maximum number of sections an XCOFF file can have? In ELF, it is effectively UINT32_MAX, for example. (In practice, it's unlikely that we'll see that many sections, but we shouldn't prevent it just due to the wrong type). This type should be big enough for whatever the max can be.

EsmeAuthorUnsubmitted

Done

There are 3 reserved signed numbers (-2, -1, 0) for sections. Thanks for your question, which reminded me to add them to IndexMap.

The MaxSectionIndex, jasonliu defined in XCOFFObjectWriter.cpp, is also INT16_MAX, so I stuck with the number. I am not sure whether XCOFF supports a larger number of sections. I will have a look into this.

DenseMap<StringRef, int16_t> SectionIndexMap = {

jhendersonUnsubmitted

Done

In general, we tend to avoid using auto these days in new code, unless the type is obvious. In this case, the type of the member of Obj.Sections is not obvious.

{StringRef("N_DEBUG"), XCOFF::N_DEBUG},

jhendersonUnsubmitted

Not Done

Why not just use Obj.Header and Obj.Sections directly in the referencing code? Why do you need these member variables separately to Obj?

EsmeAuthorUnsubmitted

Done

`Obj` can't be changed in assignAddressesAndIndices(), because it will be used during writing. A value specified in the YAML takes priority over the calculated value derived from the contents. Only Header and Sections have number/offset/address fields that need to be calculated, so I use InitSections and InitFileHdr to hold these calculated values.

{StringRef("N_ABS"), XCOFF::N_ABS},

jhendersonUnsubmitted

Not Done

It might be a good idea to break this function up into smaller pieces, perhaps one function per loop, so you'd have a function that assigns the section offsets and addresses, another function for the relocations, another for the symbols etc.

{StringRef("N_UNDEF"), XCOFF::N_UNDEF}};

XCOFFYAML::FileHeader InitFileHdr = Obj.Header;

std::vector<XCOFFYAML::Section> InitSections = Obj.Sections;

};
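
The `SectionIndexMap` member above seeds the three reserved section numbers, and real sections are later numbered from 1, which is why a signed 16-bit type is used. A minimal standalone sketch of that numbering scheme follows. It uses `std::map` and `std::string` rather than `DenseMap` and `StringRef`, and `buildSectionIndexMap` is a hypothetical helper, not code from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <map>
#include <optional>
#include <string>
#include <vector>

// Reserved section numbers, matching XCOFF::ReservedSectionNum in XCOFF.h.
enum ReservedSectionNum : int16_t { N_DEBUG = -2, N_ABS = -1, N_UNDEF = 0 };

using SectionIndexMap = std::map<std::string, int16_t>;

// Build a name -> index map. Reserved names are pre-seeded; real sections
// are numbered from 1 in declaration order. Returns std::nullopt when an
// index would not fit in int16_t.
std::optional<SectionIndexMap>
buildSectionIndexMap(const std::vector<std::string> &Names) {
  SectionIndexMap Map = {
      {"N_DEBUG", N_DEBUG}, {"N_ABS", N_ABS}, {"N_UNDEF", N_UNDEF}};
  for (std::size_t I = 0, E = Names.size(); I < E; ++I) {
    if (I + 1 > static_cast<std::size_t>(INT16_MAX))
      return std::nullopt;
    // Unnamed sections get no entry; emplace keeps the first index if a
    // name is repeated.
    if (!Names[I].empty())
      Map.emplace(Names[I], static_cast<int16_t>(I + 1));
  }
  return Map;
}
```

A symbol whose `SectionName` is `N_ABS` then resolves to -1 through the same lookup that resolves `.text` to a positive index.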

static void writeName(StringRef StrName, support::endian::Writer W) {

jhendersonUnsubmitted

Not Done

Is this a fundamental restriction of the XCOFF file format, or is it just what should happen? Is it possible to create (potentially by hand) a section with relocations if it isn't one of these types?

In general, to allow for testing of bad input paths etc, you want to allow as much flexibility in yaml2obj as possible. As such, it isn't yaml2obj's place to restrict what can be done, as long as it can physically represent what was requested.

EsmeAuthorUnsubmitted

Done

It is just what should happen.

Thanks for the advice; it makes sense to me to remove the restriction to allow more flexibility.

char Name[XCOFF::NameSize];

memset(Name, 0, XCOFF::NameSize);

jhendersonUnsubmitted

Done

General point that applies here and in all other new error messages: the coding guidelines say error messages start with lower case and don't end in a full stop. They don't explicitly exclude ending in an exclamation mark, but I'd avoid it as well.

Specific point: it would be helpful if this error said what the maximum number of sections the code supports is.

memcpy(Name, StrName.data(), StrName.size());

ArrayRef<char> NameRef(Name, XCOFF::NameSize);

jhendersonUnsubmitted

Not Done

Sorry, I might have confused you with my previous comment. MaxRawDataSize is being compared to CurrentOffset, so it's not the size of the relocations/sections that is being constrained, if I read it correctly. Perhaps you could change the message to something like "maximum object size of XXX exceeded when writing relocation data". What do you think?

EsmeAuthorUnsubmitted

Done

Very good suggestion, thanks!
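
The wording suggested above can be sketched as a small helper. String concatenation inserts no separators, so the literal pieces need their own trailing and leading spaces around the number. `sizeExceededMessage` is a hypothetical helper that uses `std::string` rather than `llvm::Twine`.

```cpp
#include <cassert>
#include <cstdint>
#include <string>

// Build "maximum object size of <Max> exceeded when writing <What>".
// The message starts with a lower-case letter and has no trailing full
// stop, following the error-message style discussed in this review.
std::string sizeExceededMessage(uint64_t Max, const std::string &What) {
  assert(!What.empty() && "caller should name what was being written");
  return "maximum object size of " + std::to_string(Max) +
         " exceeded when writing " + What;
}
```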

jhendersonUnsubmitted

Not Done

You can drop the parentheses. Do you need to allow space for a null terminator, or does NameSize take that into account?

EsmeAuthorUnsubmitted

Done

Well, it seems that NameSize should have taken this into account.

W.write(NameRef);

jhendersonUnsubmitted

Done

It would probably be good to use some enum values or named constants for these, e.g. something like:

namespace xcoff {
enum {
  N_DEBUG = -2,
  N_ABS = -1,
  N_UNDEF = 0
};
} // namespace xcoff

This is similar to how special section indexes are handled for ELF.

}

jhendersonUnsubmitted

Done

Can you do this in the SectionIndexMap initialisation list, or at least in the constructor if that's not possible?

bool XCOFFWriter::initRelocations(uint64_t &CurrentOffset) {

for (uint16_t I = 0, E = InitSections.size(); I < E; ++I) {

jhendersonUnsubmitted

Done

Here and in similar cases below for sections etc, I don't think what you've done is good. I think you shouldn't emit an error if a user explicitly specifies the number of relocations (sections etc). Just use the specified value in the header.

Take a look at what we do for ELF yaml2obj already. For many fields there is a default value that is derived from the contents of the YAML, and an option to override that value. For example, the ELF header has an e_shnum field which states the number of sections in that object. This value is automatically populated with the number of sections provided in the YAML. However, if a different value is specified in the YAML for the e_shnum field, that value is written to e_shnum instead, whilst the real set of sections is still written out. This enables a user to create a deliberate inconsistency between the two properties, which allows for things like error handling testing of malformed objects.
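
The default-versus-override behaviour described above can be sketched with `std::optional`. This is a standalone illustration, not the ELF or XCOFF yaml2obj code; `HeaderYAML` and `sectionCountToWrite` are hypothetical names.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>

// Hypothetical YAML-side header: an unset field means "derive it for me".
struct HeaderYAML {
  std::optional<uint16_t> NumberOfSections;
};

// Pick the value to write into the header. An explicit value always wins,
// including an explicit 0, which a `Value ? Value : Default` test could not
// tell apart from "not specified".
uint16_t sectionCountToWrite(const HeaderYAML &Hdr, std::size_t ActualCount) {
  assert(ActualCount <= UINT16_MAX && "count must fit the 16-bit field");
  return Hdr.NumberOfSections.value_or(static_cast<uint16_t>(ActualCount));
}
```

With three sections and an explicit count of 7 in the YAML, the header would claim seven sections while three are emitted, which is the kind of deliberate inconsistency the comment describes.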

if (!InitSections[I].Relocations.empty()) {

InitSections[I].NumberOfRelocations = InitSections[I].Relocations.size();

jhendersonUnsubmitted

Done

uint64_t CurrentOffset =

- sizeof(XCOFF::FileHeader32) /* TODO + auxiliaryHeaderSize()*/ +

+ sizeof(XCOFF::FileHeader32) /* TODO: + auxiliaryHeaderSize() */ +

InitSections.size() * sizeof(XCOFF::SectionHeader32);
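
The starting offset in the suggestion above is the size of the file header plus one section header per section. As a worked sketch, assuming the 32-bit sizes given in the XCOFF documentation (a 20-byte file header and a 40-byte section header; these constants are not shown in this diff) and no auxiliary header:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Assumed 32-bit XCOFF sizes; the patch takes them from
// sizeof(XCOFF::FileHeader32) and sizeof(XCOFF::SectionHeader32).
constexpr uint64_t FileHeaderSize32 = 20;
constexpr uint64_t SectionHeaderSize32 = 40;

// File offset of the first byte after all headers, i.e. where raw section
// data may start. The auxiliary header is ignored, as in the TODO above.
constexpr uint64_t firstDataOffset(std::size_t NumSections) {
  return FileHeaderSize32 + NumSections * SectionHeaderSize32;
}

static_assert(firstDataOffset(0) == 20, "only the file header");
static_assert(firstDataOffset(2) == 100, "20 + 2 * 40");
```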

InitSections[I].FileOffsetToRelocations = CurrentOffset;

CurrentOffset += InitSections[I].NumberOfRelocations *

jhendersonUnsubmitted

Not Done

MaxRawDataSize is an internal variable, right, not an XCOFF spec defined value? If MaxRawDataSize doesn't directly appear in the spec, don't mention it to the user in an error message. Instead, use general terms like "the maximum size permitted for XXX" (where XXX is the thing that is restricted, e.g. the object size).

XCOFF::RelocationSerializationSize32;

if (CurrentOffset > MaxRawDataSize) {

jhendersonUnsubmitted

Done

uint64_t CurrentSecAddr = 0;

- for (uint16_t I = 0; I < InitSections.size(); I++) {

+ for (uint16_t I = 0, E = InitSections.size(); I < E; ++I) {

if (CurrentOffset > MaxRawDataSize) {

LLVM style guide is to precalculate the number of sections, if the number cannot change within the loop, as per the suggestion inline. Also, prefer preincrement to postincrement.

ErrHandler("maximum object size of " + Twine(MaxRawDataSize) +

" exceeded when writing relocation data");

return false;

jhendersonUnsubmitted

Done

Put a blank line before this line, to help break up this function.

Same goes below: if you put a comment, followed immediately by an if statement or loop relating to that comment, add a blank line.

Also, I think it's more common grammatically in code comments to write "Assign" rather than "Assigns" (i.e. use the imperative form). Same for "Calculates" (use "Calculate") below.

}

return true;

}

bool XCOFFWriter::initSectionHeader(uint64_t &CurrentOffset) {

uint64_t CurrentSecAddr = 0;

for (uint16_t I = 0, E = InitSections.size(); I < E; ++I) {

jhendersonUnsubmitted

Not Done

if ((I + 1) > MaxSectionIndex) {

- ErrHandler("the maximum index permitted for section is " +

+ ErrHandler("exceeded the maximum permitted section index of " +

Twine(MaxSectionIndex));

How about this?

if (CurrentOffset > MaxRawDataSize) {

ErrHandler("maximum object size of " + Twine(MaxRawDataSize) +

" exceeded when writing section data");

jhendersonUnsubmitted

Done

Is "FileOffsetToData" the actual XCOFF field name, or is it something else? If something else, I recommend using the real field name.

EsmeAuthorUnsubmitted

Done

Yes, it's also defined in XCOFF.h.
The name "FileOffsetToData" has been used in all XCOFF-related implementations, and it corresponds to "s_scnptr" in the documentation.

return false;

}

jhendersonUnsubmitted

Done

CurrentOffset += InitSections[I].SectionData.binary_size();

- // Make sure the address of the next section aligned to

+ // Ensure the address of the next section is aligned to

// DefaultSectionAlign.

CurrentOffset = alignTo(CurrentOffset, DefaultSectionAlign);

And check whether the comment can be reflowed to fit within 80 characters width.

// Assign indices for sections.

if (InitSections[I].SectionName.size() &&

!SectionIndexMap[InitSections[I].SectionName]) {

// The section index starts from 1.

SectionIndexMap[InitSections[I].SectionName] = I + 1;

if ((I + 1) > MaxSectionIndex) {

jhendersonUnsubmitted

Done

// Calculate the physical/virtual address.

- // This field should contain 0 for all sections except the .text , .data ,

+ // This field should contain 0 for all sections except the .text, .data,

// and .bss sections.

1. Can there be more than one each of .text/.data/.bss?
2. If so, are they all named the same?
3. Are the names actually ".text", ".data" etc?

EsmeAuthorUnsubmitted

Done

Yes, multiple .text/.data/.bss sections are allowed.
Applications use the Flags field instead of the Name field to determine a section type. Two sections of the same type may have different names.
As introduced in Table 4.
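
To illustrate classifying sections by type rather than by name, here is a minimal standalone sketch. The flag values are assumed to match the `STYP_TEXT`, `STYP_DATA` and `STYP_BSS` constants of the XCOFF format (this diff does not show them), and `hasLoadAddress` is a hypothetical helper mirroring the exact-match comparison used in the patch.

```cpp
#include <cassert>
#include <cstdint>

// Section type flags (assumed values of XCOFF::STYP_TEXT/DATA/BSS).
enum SectionTypeFlags : uint32_t {
  STYP_TEXT = 0x0020,
  STYP_DATA = 0x0040,
  STYP_BSS = 0x0080,
};

// True for text, data and bss sections, whatever they are named. Other
// section types get address 0 in the emitter.
constexpr bool hasLoadAddress(uint32_t Flags) {
  return Flags == STYP_TEXT || Flags == STYP_DATA || Flags == STYP_BSS;
}

static_assert(hasLoadAddress(STYP_DATA), "data sections carry an address");
```

A section named `.mycode` with `Flags` equal to `STYP_TEXT` is treated as text, while a section named `.text` with some other flag value is not.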

jhendersonUnsubmitted

Not Done

It looks like from the spec that the names are merely conventions, so technically the names could be other things. It's probably fine to infer that if a section is text, it is called .text, unless otherwise specified by the YAML (and vice versa), but this bit applies to the section type, not the name, so we should refer to them by type rather than name (i.e. just "text", "data", "bss", not ".text" etc).

ErrHandler("exceeded the maximum permitted section index of " +

Twine(MaxSectionIndex));

return false;

}

// Calculate the physical/virtual address. This field should contain 0 for

jhendersonUnsubmitted

Done

If I'm reading this code correctly, if a user has put their sections in an order like:

.text
.somethingelse
.data

all sections are size 4, and the address of .text is 16, the address of .data is going to be 24, not 20. Is that supposed to be the case?

EsmeAuthorUnsubmitted

Done

Moving this calculation ahead of the calculation of FileOffsetToData gives the correct result.

// all sections except the text, data and bss sections.

if (InitSections[I].Flags != XCOFF::STYP_TEXT &&

InitSections[I].Flags != XCOFF::STYP_DATA &&

InitSections[I].Flags != XCOFF::STYP_BSS)

InitSections[I].Address = 0;

else

MaskRayUnsubmitted

Done

Target machine -> Magic

jhendersonUnsubmitted

Done

Reading this code, it looks like in XCOFF you can't have relocations that aren't attached to a section. Is that correct?

EsmeAuthorUnsubmitted

Done

Yes, the relocations are always attached to sections in XCOFF.

InitSections[I].Address = CurrentSecAddr;

// Calculate the FileOffsetToData and data size for sections.

if (InitSections[I].SectionData.binary_size()) {

jhendersonUnsubmitted

Not Done

Is it definitely the address here that should be being aligned, or the offset? The two are different concepts, and the align function appears to align the offset, not the address.

EsmeAuthorUnsubmitted

Done

The offset of data for each section is what I want to align. The comments may be confusing here.
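
The offset alignment being discussed can be sketched as follows. `alignUp` is a hypothetical stand-in for `llvm::alignTo`, and `paddedSize` shows how a section's recorded size grows to the next multiple of the alignment.

```cpp
#include <cassert>
#include <cstdint>

// Round Value up to the next multiple of Align (Align must be non-zero).
constexpr uint64_t alignUp(uint64_t Value, uint64_t Align) {
  return (Value + Align - 1) / Align * Align;
}

// Given the offset where a section's data starts and its raw byte count,
// return the size after padding the end offset to Align, mirroring
// `Size = alignTo(Offset + RawSize, Align) - Offset` in the emitter.
constexpr uint64_t paddedSize(uint64_t Offset, uint64_t RawSize,
                              uint64_t Align) {
  return alignUp(Offset + RawSize, Align) - Offset;
}

static_assert(alignUp(100, 4) == 100, "already aligned");
static_assert(alignUp(101, 4) == 104, "rounded up");
```

For example, five bytes of data starting at offset 100 end at 105, which is padded to 108, so the recorded size is 8.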

InitSections[I].FileOffsetToData = CurrentOffset;

CurrentOffset += InitSections[I].SectionData.binary_size();

// Ensure the offset is aligned to DefaultSectionAlign.

CurrentOffset = alignTo(CurrentOffset, DefaultSectionAlign);

InitSections[I].Size = CurrentOffset - InitSections[I].FileOffsetToData;

CurrentSecAddr += InitSections[I].Size;

}

HiguoxingUnsubmitted

Not Done

InitFileHdr.NumberOfSymTableEntries = Obj.Symbols.size();

- for (XCOFFYAML::Symbol YamlSym : Obj.Symbols)

+ for (XCOFFYAML::Symbol &YamlSym : Obj.Symbols)

InitFileHdr.NumberOfSymTableEntries += YamlSym.NumberOfAuxEntries;

jhendersonUnsubmitted

Not Done

One better: const XCOFFYAML::Symbol &YamlSym

return initRelocations(CurrentOffset);

jhendersonUnsubmitted

Not Done

How about simply `return initRelocations(CurrentOffset);`?

}

bool XCOFFWriter::initFileHeader(uint64_t CurrentOffset) {

// The default format of the object file is XCOFF32.

InitFileHdr.Magic = XCOFF::XCOFF32;

InitFileHdr.NumberOfSections = Obj.Sections.size();

InitFileHdr.NumberOfSymTableEntries = Obj.Symbols.size();

jhendersonUnsubmitted

Not Done

InitFileHdr.NumberOfSections = Obj.Sections.size();

- // Count the number of auxiliary symbols into the total number.

+ // Add the number of auxiliary symbols to the total number.

InitFileHdr.NumberOfSymTableEntries = Obj.Symbols.size();

for (const XCOFFYAML::Symbol &YamlSym : Obj.Symbols) {

jhendersonUnsubmitted

Not Done

Same as relocation comment above.

// Add the number of auxiliary symbols to the total number.

InitFileHdr.NumberOfSymTableEntries += YamlSym.NumberOfAuxEntries;

}

// Calculate SymbolTableOffset for the file header.

if (InitFileHdr.NumberOfSymTableEntries) {

InitFileHdr.SymbolTableOffset = CurrentOffset;

CurrentOffset +=

InitFileHdr.NumberOfSymTableEntries * XCOFF::SymbolTableEntrySize;

if (CurrentOffset > MaxRawDataSize) {

jhendersonUnsubmitted

Not Done

You probably need to make the Header fields optional themselves, so that this code can distinguish between the case where the user has explicitly specified the value as 0 and the case where it is unspecified.

The same likely goes for other structs like the Section.

Perhaps also worth pulling 0x01DF into a named constant somewhere.
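
The suggestion can be sketched as follows. This is only an illustration: it uses C++17 `std::optional` rather than whatever optional type the patch would use, and `YamlFileHeader`, `XCOFF32Magic` and `valueOrDerived` are hypothetical names. With a plain integer, `Field ? Field : Derived` cannot tell an explicit 0 from an omitted key; with an optional field it can:

```cpp
#include <cassert>
#include <cstdint>
#include <optional>

// Named constant for the XCOFF32 magic instead of a bare 0x01DF literal.
constexpr uint16_t XCOFF32Magic = 0x01DF;

// Hypothetical header: each field stays unset until the YAML provides it.
struct YamlFileHeader {
  std::optional<uint16_t> Magic;
  std::optional<uint16_t> NumberOfSections;
};

// Prefer the user's value, even when it is 0; otherwise use the derived one.
template <typename T>
T valueOrDerived(const std::optional<T> &FromYaml, T Derived) {
  return FromYaml.value_or(Derived);
}
```

An explicit `NumberOfSections: 0` then survives into the output, which is useful when a test deliberately wants an inconsistent header.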

ErrHandler("maximum object size of" + Twine(MaxRawDataSize) +

jhendersonUnsubmitted

Done

Where the field name is self-explanatory, you don't need to have a comment. Same goes below.

"exceeded when writing symbols");

return false;

}

// TODO: Calculate FileOffsetToLineNumbers when line number supported.

return true;

}

bool XCOFFWriter::assignAddressesAndIndices() {

uint64_t CurrentOffset =

sizeof(XCOFF::FileHeader32) /* TODO: + auxiliaryHeaderSize() */ +

InitSections.size() * sizeof(XCOFF::SectionHeader32);

// Calculate section header info.

if (!initSectionHeader(CurrentOffset))

return false;

// Calculate file header info.

return initFileHeader(CurrentOffset);

}
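
As a cross-check of the offset arithmetic, the following sketch recomputes a few offsets that the tests in this patch expect. The entry sizes are assumptions of this sketch, obtained by adding up the widths written by the write functions in this file (with an 8-byte name field); the constant and function names are hypothetical:

```cpp
#include <cassert>
#include <cstdint>

// Assumed on-disk sizes for 32-bit XCOFF, in bytes.
constexpr uint64_t FileHeaderSize32 = 20;      // 2+2+4+4+4+2+2
constexpr uint64_t SectionHeaderSize32 = 40;   // 8-byte name + 32 bytes of fields
constexpr uint64_t RelocationEntrySize32 = 10; // 4+4+1+1
constexpr uint64_t SymbolTableEntrySize = 18;

// File offset where the first section's raw data can start when there is no
// auxiliary header.
constexpr uint64_t firstSectionDataOffset(uint64_t NumSections) {
  return FileHeaderSize32 + NumSections * SectionHeaderSize32;
}

// File offset just past NumRelocs relocation entries starting at RelocStart.
constexpr uint64_t offsetAfterRelocations(uint64_t RelocStart,
                                          uint64_t NumRelocs) {
  return RelocStart + NumRelocs * RelocationEntrySize32;
}

// Total bytes occupied by NumEntries symbol table entries.
constexpr uint64_t symbolTableBytes(uint64_t NumEntries) {
  return NumEntries * SymbolTableEntrySize;
}
```

With five sections the first data offset is 0xDC, and two relocations starting at 0xF0 end at 0x104, matching the RawDataOffset and SymbolTableOffset values checked in basic-doc.yaml.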

void XCOFFWriter::writeFileHeader() {

W.write<uint16_t>(Obj.Header.Magic ? Obj.Header.Magic : InitFileHdr.Magic);

W.write<uint16_t>(Obj.Header.NumberOfSections ? Obj.Header.NumberOfSections

: InitFileHdr.NumberOfSections);

W.write<int32_t>(Obj.Header.TimeStamp);

W.write<uint32_t>(Obj.Header.SymbolTableOffset

? Obj.Header.SymbolTableOffset

: InitFileHdr.SymbolTableOffset);

jhendersonUnsubmitted

Done

Calculate the requested address value upfront, then use that value in both places, rather than repeat the logic to calculate it.

W.write<int32_t>(Obj.Header.NumberOfSymTableEntries

? Obj.Header.NumberOfSymTableEntries

: InitFileHdr.NumberOfSymTableEntries);

W.write<uint16_t>(Obj.Header.AuxHeaderSize);

W.write<uint16_t>(Obj.Header.Flags);

}

void XCOFFWriter::writeSectionHeader() {

for (uint16_t I = 0, E = Obj.Sections.size(); I < E; ++I) {

XCOFFYAML::Section YamlSec = Obj.Sections[I];

XCOFFYAML::Section DerivedSec = InitSections[I];

writeName(YamlSec.SectionName, W);

// Virtual address is the same as physical address.

uint32_t SectionAddress =

YamlSec.Address ? YamlSec.Address : DerivedSec.Address;

jhendersonUnsubmitted

Not Done

YamlSec.Address ? YamlSec.Address : DerivedSec.Address;

- W.write<uint32_t>(SectionAddress);

+ W.write<uint32_t>(SectionAddress); // Physical address

+ W.write<uint32_t>(SectionAddress); // Virtual address

W.write<uint32_t>(YamlSec.Size ? YamlSec.Size : DerivedSec.Size);

Here, it's probably worth a comment saying which is the virtual address and which the physical address, like the inline edit, for example (which may be the wrong way around).

W.write<uint32_t>(SectionAddress); // Physical address

W.write<uint32_t>(SectionAddress); // Virtual address

W.write<uint32_t>(YamlSec.Size ? YamlSec.Size : DerivedSec.Size);

W.write<uint32_t>(YamlSec.FileOffsetToData ? YamlSec.FileOffsetToData

: DerivedSec.FileOffsetToData);

W.write<uint32_t>(YamlSec.FileOffsetToRelocations

? YamlSec.FileOffsetToRelocations

: DerivedSec.FileOffsetToRelocations);

W.write<uint32_t>(YamlSec.FileOffsetToLineNumbers);

W.write<uint16_t>(YamlSec.NumberOfRelocations

? YamlSec.NumberOfRelocations

: DerivedSec.NumberOfRelocations);

W.write<uint16_t>(YamlSec.NumberOfLineNumbers);

W.write<int32_t>(YamlSec.Flags);

}

jhendersonUnsubmitted

Done

if (PaddingSize < 0) {

- ErrHandler("redundant data was wrote before section data");

+ ErrHandler("redundant data was written before section data");

return false;

Same in similar messages below.

bool XCOFFWriter::writeSectionData() {

for (uint16_t I = 0, E = Obj.Sections.size(); I < E; ++I) {

XCOFFYAML::Section YamlSec = Obj.Sections[I];

if (YamlSec.SectionData.binary_size()) {

// Fill the padding size with zeros.

int64_t PaddingSize =

InitSections[I].FileOffsetToData - (W.OS.tell() - StartOffset);

if (PaddingSize < 0) {

ErrHandler("redundant data was written before section data");

return false;

}

if (PaddingSize > 0)

W.OS.write_zeros(PaddingSize);

jhendersonUnsubmitted

Done

Probably best to make all these PaddingSize fields int64_t to prepare for 64-bit support.
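
A sketch of the padding computation with 64-bit arithmetic, in the bool-returning style of the writer. `computePadding` is a hypothetical helper, not part of the patch; it only shows that a signed 64-bit difference keeps working once offsets exceed 32 bits:

```cpp
#include <cassert>
#include <cstdint>

// Computes how many zero bytes must be written to reach TargetOffset.
// Returns false when the stream is already past TargetOffset, which the
// reviewed code reports as "redundant data was written".
inline bool computePadding(uint64_t TargetOffset, uint64_t BytesWritten,
                           uint64_t &Padding) {
  // Signed 64-bit difference: a negative value means data overran the target.
  int64_t PaddingSize = static_cast<int64_t>(TargetOffset) -
                        static_cast<int64_t>(BytesWritten);
  if (PaddingSize < 0)
    return false;
  Padding = static_cast<uint64_t>(PaddingSize);
  return true;
}
```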

YamlSec.SectionData.writeAsBinary(W.OS);

}

jhendersonUnsubmitted

Done

if (PaddingSize < 0) {

- ErrHandler("redundant data was wrote before relocations");

+ ErrHandler("redundant data was written before relocations");

return false;

return true;

}

bool XCOFFWriter::writeRelocations() {

for (uint16_t I = 0, E = Obj.Sections.size(); I < E; ++I) {

XCOFFYAML::Section YamlSec = Obj.Sections[I];

if (!YamlSec.Relocations.empty()) {

int64_t PaddingSize =

InitSections[I].FileOffsetToRelocations - (W.OS.tell() - StartOffset);

if (PaddingSize < 0) {

ErrHandler("redundant data was written before relocations");

return false;

}

if (PaddingSize > 0)

W.OS.write_zeros(PaddingSize);

for (const XCOFFYAML::Relocation &YamlRel : YamlSec.Relocations) {

W.write<uint32_t>(YamlRel.VirtualAddress);

shchenzUnsubmitted

Not Done

We don't have to call write_zeros if PaddingSize is 0?

W.write<uint32_t>(YamlRel.SymbolIndex);

W.write<uint8_t>(YamlRel.Info);

W.write<uint8_t>(YamlRel.Type);

}

return true;

}

bool XCOFFWriter::writeSymbols() {

int64_t PaddingSize =

(uint64_t)InitFileHdr.SymbolTableOffset - (W.OS.tell() - StartOffset);

if (PaddingSize < 0) {

ErrHandler("redundant data was written before symbols");

return false;

}

if (PaddingSize > 0)

W.OS.write_zeros(PaddingSize);

for (const XCOFFYAML::Symbol &YamlSym : Obj.Symbols) {

writeName(YamlSym.SymbolName, W);

W.write<uint32_t>(YamlSym.Value);

W.write<int16_t>(

YamlSym.SectionName.size() ? SectionIndexMap[YamlSym.SectionName] : 0);

shchenzUnsubmitted

Not Done

What about a symbol that does not belong to any section, for example an undefined external symbol?

EsmeAuthorUnsubmitted

Done

Specifies a section number associated with one of the following symbols:
-2
Specifies N_DEBUG, a special symbolic debugging symbol.
-1
Specifies N_ABS, an absolute symbol. The symbol has a value but is not relocatable.
0
Specifies N_UNDEF, an undefined external symbol.
Any other value
Specifies the section number where the symbol was defined.

According to the spec, there are three reserved section numbers for symbols that are not defined in a section. The SectionName in the symbol entry is currently required; however, we could also make the field optional and use N_UNDEF (maybe...) as the default section. Hmm... I am not sure which strategy is more reasonable. What do you think?

shchenzUnsubmitted

Not Done

I think no SectionName is defined for the reserved section number N_UNDEF because one is not needed; only the section number is. So making the SectionName attribute optional for a symbol makes more sense to me. You cannot say that a section named N_UNDEF is the N_UNDEF (section number == 0) section, because a text section named N_UNDEF can be defined with the section attribute, like:

__attribute__((section ("N_UNDEF")))  int foo2(void)
{
}
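
One way to picture the outcome of this thread is a name-to-number lookup that treats an omitted section name as N_UNDEF. The sketch below is hypothetical: the constant names, `makeSectionIndexMap` and `sectionNumberFor` are not from the patch, and pre-seeding the map with the three reserved names is an assumption based on the test file using `Section: N_DEBUG` and `Section: N_ABS`. Real sections are numbered from 1:

```cpp
#include <cassert>
#include <cstdint>
#include <map>
#include <string>

// Reserved section numbers from the spec text quoted above.
constexpr int16_t SectionNumDebug = -2; // N_DEBUG
constexpr int16_t SectionNumAbs = -1;   // N_ABS
constexpr int16_t SectionNumUndef = 0;  // N_UNDEF

// Map pre-seeded with the reserved names; callers add real sections.
inline std::map<std::string, int16_t> makeSectionIndexMap() {
  return {{"N_DEBUG", SectionNumDebug},
          {"N_ABS", SectionNumAbs},
          {"N_UNDEF", SectionNumUndef}};
}

// An empty or unknown name resolves to N_UNDEF (a choice of this sketch).
inline int16_t sectionNumberFor(const std::string &Name,
                                const std::map<std::string, int16_t> &Map) {
  if (Name.empty())
    return SectionNumUndef;
  auto It = Map.find(Name);
  return It == Map.end() ? SectionNumUndef : It->second;
}
```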

W.write<uint16_t>(YamlSym.Type);

W.write<uint8_t>(YamlSym.StorageClass);

W.write<uint8_t>(YamlSym.NumberOfAuxEntries);

// Now output the auxiliary entry.

for (uint8_t I = 0, E = YamlSym.NumberOfAuxEntries; I < E; ++I) {

// TODO: Auxiliary entry is not supported yet.

jhendersonUnsubmitted

Done

At this point, it may be better to just use write_zeros to fill the whole auxiliary entry. Alternatively, emit an error if NumberOfAuxEntries is non-zero.
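
A small sketch of the first option, using a hypothetical reduced `SymbolStub` type and returning the zero bytes in a vector instead of writing them to a stream. It relies on the 18-byte entry size stated in the code comment of this function, and also shows how auxiliary entries count toward the symbol table total:

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// An auxiliary entry is as long as a symbol table entry (18 bytes).
constexpr std::size_t SymbolEntrySize = 18;

// Hypothetical reduced symbol: only the field this sketch needs.
struct SymbolStub {
  uint8_t NumberOfAuxEntries;
};

// Symbols plus their auxiliary entries give the header's entry count.
inline uint32_t totalSymbolTableEntries(const std::vector<SymbolStub> &Symbols) {
  uint32_t Total = static_cast<uint32_t>(Symbols.size());
  for (const SymbolStub &S : Symbols)
    Total += S.NumberOfAuxEntries;
  return Total;
}

// Placeholder bytes for all auxiliary entries of one symbol.
inline std::vector<uint8_t> zeroFilledAuxEntries(uint8_t NumberOfAuxEntries) {
  return std::vector<uint8_t>(NumberOfAuxEntries * SymbolEntrySize, 0);
}
```

For the four symbols in basic-doc.yaml, one of which has a single auxiliary entry, this gives the 5 entries that the test checks.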

// The auxiliary entries for a symbol follow its symbol table entry. The

// length of each auxiliary entry is the same as a symbol table entry (18

// bytes). The format and quantity of auxiliary entries depend on the

// storage class (n_sclass) and type (n_type) of the symbol table entry.

W.OS.write_zeros(18);

}

return true;

}

bool XCOFFWriter::writeXCOFF() {

if (Is64Bit) {

ErrHandler("only XCOFF32 is currently supported");

return false;

}

if (!assignAddressesAndIndices())

return false;

StartOffset = W.OS.tell();

jhendersonUnsubmitted

Not Done

This sounds like it prevents a user from writing a 32-bit XCOFF file, but with a messed up magic field. Perhaps you should put an optional explicit format field in the YAML (defaulting to 32-bit XCOFF), and use that to determine the file format to use.

writeFileHeader();

if (!Obj.Sections.empty()) {

writeSectionHeader();

if (!writeSectionData())

jhendersonUnsubmitted

Not Done

if (Is64Bit) {

- ErrHandler("only XCOFF32 is supported now");

+ ErrHandler("only XCOFF32 is currently supported");

return false;

Nit: the suggested inline edit sounds a bit better to me.

return false;

jhendersonUnsubmitted

Done

It seems like if you failed to assign addresses or indices, it may be dangerous to continue? This should probably bail out. Same probably goes for the other cases where an error can occur too.

if (!writeRelocations())

return false;

}

if (!Obj.Symbols.empty())

return writeSymbols();

return true;

}

} // end anonymous namespace

namespace llvm {

namespace yaml {

bool yaml2xcoff(XCOFFYAML::Object &Doc, raw_ostream &Out, ErrorHandler EH) {

XCOFFWriter Writer(Doc, Out, EH);

return Writer.writeXCOFF();

}

} // namespace yaml

} // namespace llvm

llvm/lib/ObjectYAML/XCOFFYAML.cpp

Show All 17 Lines
namespace XCOFFYAML {		namespace XCOFFYAML {

Object::Object() { memset(&Header, 0, sizeof(Header)); }		Object::Object() { memset(&Header, 0, sizeof(Header)); }

} // namespace XCOFFYAML		} // namespace XCOFFYAML

namespace yaml {		namespace yaml {

		void ScalarBitSetTraits<XCOFF::SectionTypeFlags>::bitset(
		IO &IO, XCOFF::SectionTypeFlags &Value) {
		#define ECase(X) IO.bitSetCase(Value, #X, XCOFF::X)
		ECase(STYP_PAD);
		ECase(STYP_DWARF);
		ECase(STYP_TEXT);
		ECase(STYP_DATA);
		ECase(STYP_BSS);
		ECase(STYP_EXCEPT);
		ECase(STYP_INFO);
		ECase(STYP_TDATA);
		ECase(STYP_TBSS);
		ECase(STYP_LOADER);
		ECase(STYP_DEBUG);
		ECase(STYP_TYPCHK);
		ECase(STYP_OVRFLO);
		#undef ECase
		}

void ScalarEnumerationTraits<XCOFF::StorageClass>::enumeration(		void ScalarEnumerationTraits<XCOFF::StorageClass>::enumeration(
IO &IO, XCOFF::StorageClass &Value) {		IO &IO, XCOFF::StorageClass &Value) {
#define ECase(X) IO.enumCase(Value, #X, XCOFF::X)		#define ECase(X) IO.enumCase(Value, #X, XCOFF::X)
ECase(C_NULL);		ECase(C_NULL);
ECase(C_AUTO);		ECase(C_AUTO);
ECase(C_EXT);		ECase(C_EXT);
ECase(C_STAT);		ECase(C_STAT);
ECase(C_REG);		ECase(C_REG);
Show All 40 Lines	#define ECase(X) IO.enumCase(Value, #X, XCOFF::X)
ECase(C_BSTAT);		ECase(C_BSTAT);
ECase(C_ESTAT);		ECase(C_ESTAT);
ECase(C_GTLS);		ECase(C_GTLS);
ECase(C_STTLS);		ECase(C_STTLS);
ECase(C_EFCN);		ECase(C_EFCN);
#undef ECase		#undef ECase
}		}

		struct NSectionFlags {
		NSectionFlags(IO &) : Flags(XCOFF::SectionTypeFlags(0)) {}
		NSectionFlags(IO &, uint32_t C) : Flags(XCOFF::SectionTypeFlags(C)) {}

		uint32_t denormalize(IO &) { return Flags; }

		XCOFF::SectionTypeFlags Flags;
		};

void MappingTraits<XCOFFYAML::FileHeader>::mapping(		void MappingTraits<XCOFFYAML::FileHeader>::mapping(
IO &IO, XCOFFYAML::FileHeader &FileHdr) {		IO &IO, XCOFFYAML::FileHeader &FileHdr) {
IO.mapRequired("MagicNumber", FileHdr.Magic);		IO.mapOptional("MagicNumber", FileHdr.Magic);
		MaskRayUnsubmitted Done This probably should be mapOptional. f_timdat can be omitted. XCOFFObjectWriter sets it to 0.
		jhendersonUnsubmitted Done In general, I think everything should be optional, unless there is no sensible default. I haven't looked at the spec yet, but presumably the magic is fixed, or used to distinguish between 32/64 bit. It seems reasonable that a user doesn't need to specify it and then yaml2obj just picks something like 32 bit automatically. I think it's important that you don't require more than you absolutely have to, because otherwise it bloats the YAML and makes it hard to identify what's actually important to the test case in question.
IO.mapRequired("NumberOfSections", FileHdr.NumberOfSections);		IO.mapOptional("NumberOfSections", FileHdr.NumberOfSections);
IO.mapRequired("CreationTime", FileHdr.TimeStamp);		IO.mapOptional("CreationTime", FileHdr.TimeStamp);
IO.mapRequired("OffsetToSymbolTable", FileHdr.SymbolTableOffset);		IO.mapOptional("OffsetToSymbolTable", FileHdr.SymbolTableOffset);
IO.mapRequired("EntriesInSymbolTable", FileHdr.NumberOfSymTableEntries);		IO.mapOptional("EntriesInSymbolTable", FileHdr.NumberOfSymTableEntries);
IO.mapRequired("AuxiliaryHeaderSize", FileHdr.AuxHeaderSize);		IO.mapOptional("AuxiliaryHeaderSize", FileHdr.AuxHeaderSize);
IO.mapRequired("Flags", FileHdr.Flags);		IO.mapOptional("Flags", FileHdr.Flags);
		}

		void MappingTraits<XCOFFYAML::Relocation>::mapping(IO &IO,
		XCOFFYAML::Relocation &R) {
		jhendersonUnsubmitted Done In ELF, the `r_offset` field of a relocation specifies where the relocation will patch, and is relative to either the section start or file start, depending on the context. Often in our test cases, it doesn't really matter where the relocation is patching, only that there is a relocation, hence the offset field is optional there. It seems likely that this will be the case for XCOFF too?
		EsmeAuthorUnsubmitted Done I agree that it's unfriendly to require the virtual address of the relocation from users. However, this item in XCOFF specifies the virtual address of the value that requires modification by the binder, and the offset to the data in the section is calculated as follows: offset_in_section = Relocation_Address - Section_Address. We can calculate the Section_Address from the YAML contents, but it is hard to determine the Relocation_Address. So this can hardly be optional?
		jhendersonUnsubmitted Not Done Could 0 be a reasonable default? If not, it's fine. I'm just wondering if it actually matters in all cases what the address is.
		EsmeAuthorUnsubmitted Done As I see it, 0 is not a reasonable default.
		IO.mapOptional("Address", R.VirtualAddress);
		jhendersonUnsubmitted Done Can XCOFF relocations have no symbol? If so, I think a 0 index value would be a reasonable default.
		IO.mapOptional("Symbol", R.SymbolIndex);
		IO.mapOptional("Info", R.Info);
		jhendersonUnsubmitted Done Is there a reasonable default for the relocation type? In ELF, there isn't really, so I think it's a required field.
		IO.mapOptional("Type", R.Type);
		}

		void MappingTraits<XCOFFYAML::Section>::mapping(IO &IO,
		EsmeAuthorUnsubmitted Done It's reasonable to make the name optional and the flags required, since the flags are the unique identifier of the section type.
		jhendersonUnsubmitted Not Done Sounds good to me.
		XCOFFYAML::Section &Sec) {
		MappingNormalization<NSectionFlags, uint32_t> NC(IO, Sec.Flags);
		IO.mapOptional("Name", Sec.SectionName);
		IO.mapOptional("Address", Sec.Address);
		IO.mapOptional("Size", Sec.Size);
		IO.mapOptional("FileOffsetToData", Sec.FileOffsetToData);
		IO.mapOptional("FileOffsetToRelocations", Sec.FileOffsetToRelocations);
		IO.mapOptional("FileOffsetToLineNumbers", Sec.FileOffsetToLineNumbers);
		IO.mapOptional("NumberOfRelocations", Sec.NumberOfRelocations);
		IO.mapOptional("NumberOfLineNumbers", Sec.NumberOfLineNumbers);
		IO.mapOptional("Flags", NC->Flags);
		IO.mapOptional("SectionData", Sec.SectionData);
		IO.mapOptional("Relocations", Sec.Relocations);
}		}

void MappingTraits<XCOFFYAML::Symbol>::mapping(IO &IO, XCOFFYAML::Symbol &S) {		void MappingTraits<XCOFFYAML::Symbol>::mapping(IO &IO, XCOFFYAML::Symbol &S) {
IO.mapRequired("Name", S.SymbolName);		IO.mapRequired("Name", S.SymbolName);
		MaskRayUnsubmitted Not Done It seems that `r_vaddr` is an offset even though it is called "vaddr". If it can often be 0, make it optional.
		hubert.reinterpretcastUnsubmitted Not Done This specifies the address (as understood by the current object file) of the instruction or data which is to be modified by the relocation. That information is hardly optional. It is not the case that this represents an offset that modifies the "symbol" field. Such an offset is instead encoded within the instruction or data image in the current object file.
IO.mapRequired("Value", S.Value);		IO.mapOptional("Value", S.Value);
IO.mapRequired("Section", S.SectionName);		IO.mapOptional("Section", S.SectionName);
IO.mapRequired("Type", S.Type);		IO.mapOptional("Type", S.Type);
IO.mapRequired("StorageClass", S.StorageClass);		IO.mapOptional("StorageClass", S.StorageClass);
IO.mapRequired("NumberOfAuxEntries", S.NumberOfAuxEntries);		IO.mapOptional("NumberOfAuxEntries", S.NumberOfAuxEntries);
}		}

void MappingTraits<XCOFFYAML::Object>::mapping(IO &IO, XCOFFYAML::Object &Obj) {		void MappingTraits<XCOFFYAML::Object>::mapping(IO &IO, XCOFFYAML::Object &Obj) {
IO.mapTag("!XCOFF", true);		IO.mapTag("!XCOFF", true);
		MaskRayUnsubmitted Not Done In ELF, many fields are printed as hexadecimal so we use `Hex64` a lot. Also, we use `makeOptional` a lot to allow omitting unneeded details. For Size/SectionData, either one can be specified so it looks like both should be optional.
IO.mapRequired("FileHeader", Obj.Header);		IO.mapRequired("FileHeader", Obj.Header);
IO.mapRequired("Symbols", Obj.Symbols);		IO.mapOptional("Sections", Obj.Sections);
		IO.mapOptional("Symbols", Obj.Symbols);
}		}

} // namespace yaml		} // namespace yaml
} // namespace llvm		} // namespace llvm

llvm/lib/ObjectYAML/yaml2obj.cpp

Show All 38 Lines	do {
if (Doc.Coff)		if (Doc.Coff)
return yaml2coff(*Doc.Coff, Out, ErrHandler);		return yaml2coff(*Doc.Coff, Out, ErrHandler);
if (Doc.MachO || Doc.FatMachO)		if (Doc.MachO || Doc.FatMachO)
return yaml2macho(Doc, Out, ErrHandler);		return yaml2macho(Doc, Out, ErrHandler);
if (Doc.Minidump)		if (Doc.Minidump)
return yaml2minidump(*Doc.Minidump, Out, ErrHandler);		return yaml2minidump(*Doc.Minidump, Out, ErrHandler);
if (Doc.Wasm)		if (Doc.Wasm)
return yaml2wasm(*Doc.Wasm, Out, ErrHandler);		return yaml2wasm(*Doc.Wasm, Out, ErrHandler);
		if (Doc.Xcoff)
		return yaml2xcoff(*Doc.Xcoff, Out, ErrHandler);

ErrHandler("unknown document type");		ErrHandler("unknown document type");
return false;		return false;

} while (YIn.nextDocument());		} while (YIn.nextDocument());

ErrHandler("cannot find the " + Twine(DocNum) +		ErrHandler("cannot find the " + Twine(DocNum) +
getOrdinalSuffix(DocNum).data() + " document");		getOrdinalSuffix(DocNum).data() + " document");
Show All 25 Lines

llvm/test/tools/yaml2obj/XCOFF/basic-doc.yaml

This file was added.

## Check that yaml2obj automatically assigns omitted fields with values.

# RUN: yaml2obj %s -o %t

jhendersonUnsubmitted

Not Done

Nit: newer tests I'm involved with at least tend to use '##' for comments, to make them stand out from lit and FileCheck lines.

jhendersonUnsubmitted

Not Done

- ## Check that yaml2obj automatically assigns ommited fields with values.

+ ## Check that yaml2obj automatically assigns omitted fields with values.

# RUN: yaml2obj %s -o %t

# RUN: llvm-readobj --headers --symbols %t | FileCheck %s

--- !XCOFF

jhendersonUnsubmitted

Not Done

# RUN: yaml2obj %s -o %t

- # RUN: llvm-readobj --headers %t | FileCheck %s --check-prefix=CHECK-HEADERS

- # RUN: llvm-readobj --symbols %t | FileCheck %s --check-prefix=CHECK-SYMBOLS

+ # RUN: llvm-readobj --headers --symbols %t | FileCheck %s

--- !XCOFF

You can do this all at once.

FileHeader:

MagicNumber: 0x1DF

Sections:

- Name: .text

Flags: [ STYP_TEXT ]

SectionData: "9061FFF880820000"

- Name: .data

jhendersonUnsubmitted

Not Done

Is there an enum you could use to represent this set of flags? It would be preferable to be able to write either of the following (flag values are placeholders):

Flags: Exec

or probably

Flags: [Exec, Alloc]

although Flags: 0x20 (or possibly Flags: [0x20]) should probably still be permitted.

EsmeAuthorUnsubmitted

Done

Yes, we have the enum SectionTypeFlags. How about marking this as a TODO for follow-up work? Because this will have an impact on other tools, like obj2yaml.

enum SectionTypeFlags : int32_t {
  STYP_PAD = 0x0008,
  STYP_DWARF = 0x0010,
  STYP_TEXT = 0x0020,
  STYP_DATA = 0x0040,
  STYP_BSS = 0x0080,
  STYP_EXCEPT = 0x0100,
  STYP_INFO = 0x0200,
...
};

EsmeAuthorUnsubmitted

Done

After double checking, this does not seem to affect other tools. Thanks for your input.

jhendersonUnsubmitted

Not Done

As these are flags, you'll need the ability to specify multiple flags, just like for ELF sections. Looking at this, I'm guessing you don't currently have that option?
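
Since the section type is a bit set, a list such as `[ STYP_DEBUG, STYP_DATA ]` is simply the bitwise OR of its members. The sketch below shows this with a hypothetical `combineFlags` helper. The enum values up to STYP_INFO are the ones quoted earlier in this thread; the remaining values are filled in as an assumption and agree with the `Type:` values expected by the test in this patch:

```cpp
#include <cassert>
#include <cstdint>
#include <initializer_list>

enum SectionTypeFlags : int32_t {
  STYP_PAD = 0x0008,
  STYP_DWARF = 0x0010,
  STYP_TEXT = 0x0020,
  STYP_DATA = 0x0040,
  STYP_BSS = 0x0080,
  STYP_EXCEPT = 0x0100,
  STYP_INFO = 0x0200,
  // Values below are not shown in the thread; assumed for this sketch.
  STYP_TDATA = 0x0400,
  STYP_TBSS = 0x0800,
  STYP_LOADER = 0x1000,
  STYP_DEBUG = 0x2000,
  STYP_TYPCHK = 0x4000,
  STYP_OVRFLO = 0x8000
};

// OR together every flag in the list; an empty list yields 0.
inline int32_t combineFlags(std::initializer_list<SectionTypeFlags> Flags) {
  int32_t Result = 0;
  for (SectionTypeFlags F : Flags)
    Result |= F;
  return Result;
}
```

For example, `combineFlags({STYP_DEBUG, STYP_DATA})` gives 0x2040, the value checked for the fourth section in basic-doc.yaml.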

Flags: [ STYP_DATA ]

jhendersonUnsubmitted

Not Done

It seems to me you could get away with much less data in this section (probably a half-dozen bytes)? I don't think this needs to be a real object for this test case? Same below.

SectionData: "0000000000000FC0"

Relocations:

- Address: 0x08

- Name: .data

shchenzUnsubmitted

Not Done

Is it ok that the relocation address is not in the range of .data section?

EsmeAuthorUnsubmitted

Done

In general, yaml2obj will not report an error even if the user explicitly specifies invalid values, which allows for things like testing error handling.

Also after getting more familiar with yaml2obj, I think it's reasonable to set more fields to optional, and these omitted values will be filled in with the default zero or values derived from contents. So I set the relocation address optional now and 0 is the default value for it.

shchenzUnsubmitted

Not Done

Do you mean we are setting the address to 0x3A on purpose? The address of the .data section is 0x8 and its size is 0x8, so if the relocation is valid, the address the relocation entry wants to resolve should be between 0x8 and 0x10.

EsmeAuthorUnsubmitted

Done

I just set a random value, as this value isn't relevant for testing purposes here. Use a valid address now.
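
The validity condition raised in this thread can be sketched as a range check. `offsetInSection` is a hypothetical helper, and treating the range as half-open, so that the end address itself is outside the section, is an assumption of this sketch. yaml2obj itself deliberately does not enforce this:

```cpp
#include <cassert>
#include <cstdint>

// If RelocAddress lies in [SectionAddress, SectionAddress + SectionSize),
// stores its offset within the section in Offset and returns true.
// Otherwise returns false and leaves Offset unchanged.
inline bool offsetInSection(uint64_t RelocAddress, uint64_t SectionAddress,
                            uint64_t SectionSize, uint64_t &Offset) {
  if (RelocAddress < SectionAddress)
    return false;
  uint64_t Delta = RelocAddress - SectionAddress;
  if (Delta >= SectionSize)
    return false;
  Offset = Delta;
  return true;
}
```

For a .data section at address 0x8 with size 0x8, address 0x8 maps to offset 0, while 0x3A is rejected.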

Relocations:

- Type: 0x02

- Name: .debug

shchenzUnsubmitted

Not Done

Use another section name that is not DWARF-specific?

Address: 0x0

Size: 0x60

Flags: [ STYP_DEBUG, STYP_DATA ]

SectionData: 01110103

jhendersonUnsubmitted

Not Done

Be consistent: above you've quoted the section data, and here you haven't. Pick one style (I'd recommend quoting).

- Flags: [ STYP_BSS, STYP_DWARF, STYP_EXCEPT, STYP_INFO, STYP_TDATA, STYP_TBSS, STYP_LOADER, STYP_TYPCHK, STYP_OVRFLO ]

Symbols:

jhendersonUnsubmitted

Not Done

Any particular reason these aren't all one section?

- Name: .file

Section: N_DEBUG

- Name: .undef

- Name: .abs

HiguoxingUnsubmitted

Not Done

It looks like Value, Type, StorageClass and NumberOfAuxEntries are optional too. Could you please add some test cases for them as well?

Section: N_ABS

- Name: .text

Value: 0x0

Section: .text

Type: 0x0

StorageClass: C_HIDEXT

NumberOfAuxEntries: 1

# CHECK: AddressSize: 32bit

jhendersonUnsubmitted

Not Done

Does this symbol test anything unique any more, or can we drop it?

EsmeAuthorUnsubmitted

Done

It was used to test StringTable before, but now it's useless. I will drop it in the next update.

# CHECK-NEXT: FileHeader {

# CHECK-NEXT: Magic: 0x1DF

# CHECK-NEXT: NumberOfSections: 5

# CHECK-NEXT: TimeStamp: None (0x0)

# CHECK-NEXT: SymbolTableOffset: 0x104

# CHECK-NEXT: SymbolTableEntries: 5

# CHECK-NEXT: OptionalHeaderSize: 0x0

# CHECK-NEXT: Flags: 0x0

# CHECK-NEXT: }

# CHECK-NEXT: Sections [

# CHECK-NEXT: Section {

# CHECK-NEXT: Index: 1

# CHECK-NEXT: Name: .text

# CHECK-NEXT: PhysicalAddress: 0x0

# CHECK-NEXT: VirtualAddress: 0x0

# CHECK-NEXT: Size: 0x8

# CHECK-NEXT: RawDataOffset: 0xDC

# CHECK-NEXT: RelocationPointer: 0x0

# CHECK-NEXT: LineNumberPointer: 0x0

# CHECK-NEXT: NumberOfRelocations: 0

# CHECK-NEXT: NumberOfLineNumbers: 0

# CHECK-NEXT: Type: STYP_TEXT (0x20)

# CHECK-NEXT: }

# CHECK-NEXT: Section {

# CHECK-NEXT: Index: 2

# CHECK-NEXT: Name: .data

# CHECK-NEXT: PhysicalAddress: 0x8

# CHECK-NEXT: VirtualAddress: 0x8

# CHECK-NEXT: Size: 0x8

# CHECK-NEXT: RawDataOffset: 0xE4

# CHECK-NEXT: RelocationPointer: 0xF0

# CHECK-NEXT: LineNumberPointer: 0x0

# CHECK-NEXT: NumberOfRelocations: 1

# CHECK-NEXT: NumberOfLineNumbers: 0

# CHECK-NEXT: Type: STYP_DATA (0x40)

# CHECK-NEXT: }

# CHECK-NEXT: Section {

# CHECK-NEXT: Index: 3

# CHECK-NEXT: Name: .data

# CHECK-NEXT: PhysicalAddress: 0x0

# CHECK-NEXT: VirtualAddress: 0x0

# CHECK-NEXT: Size: 0x0

# CHECK-NEXT: RawDataOffset: 0x0

# CHECK-NEXT: RelocationPointer: 0xFA

# CHECK-NEXT: LineNumberPointer: 0x0

# CHECK-NEXT: NumberOfRelocations: 1

# CHECK-NEXT: NumberOfLineNumbers: 0

# CHECK-NEXT: Type: 0x0

# CHECK-NEXT: }

# CHECK-NEXT: Section {

# CHECK-NEXT: Index: 4

# CHECK-NEXT: Name: .debug

# CHECK-NEXT: PhysicalAddress: 0x0

# CHECK-NEXT: VirtualAddress: 0x0

# CHECK-NEXT: Size: 0x60

# CHECK-NEXT: RawDataOffset: 0xEC

# CHECK-NEXT: RelocationPointer: 0x0

# CHECK-NEXT: LineNumberPointer: 0x0

# CHECK-NEXT: NumberOfRelocations: 0

# CHECK-NEXT: NumberOfLineNumbers: 0

# CHECK-NEXT: Type: 0x2040

# CHECK-NEXT: }

# CHECK-NEXT: Section {

# CHECK-NEXT: Index: 5

# CHECK-NEXT: Name:

# CHECK-NEXT: PhysicalAddress: 0x0

# CHECK-NEXT: VirtualAddress: 0x0

# CHECK-NEXT: Size: 0x0

# CHECK-NEXT: RawDataOffset: 0x0

# CHECK-NEXT: RelocationPointer: 0x0

# CHECK-NEXT: LineNumberPointer: 0x0

# CHECK-NEXT: NumberOfRelocations: 0

# CHECK-NEXT: NumberOfLineNumbers: 0

# CHECK-NEXT: Type: 0xDF90

# CHECK-NEXT: }

# CHECK-NEXT: ]

# CHECK-NEXT: Symbols [

# CHECK-NEXT: Symbol {

# CHECK-NEXT: Index: 0

# CHECK-NEXT: Name: .file

# CHECK-NEXT: Value: 0x0

# CHECK-NEXT: Section: N_DEBUG

# CHECK-NEXT: Type: 0x0

# CHECK-NEXT: StorageClass: C_NULL (0x0)

# CHECK-NEXT: NumberOfAuxEntries: 0

# CHECK-NEXT: }

# CHECK-NEXT: Symbol {

# CHECK-NEXT: Index: 1

# CHECK-NEXT: Name: .undef

# CHECK-NEXT: Value: 0x0

# CHECK-NEXT: Section: N_UNDEF

# CHECK-NEXT: Type: 0x0

# CHECK-NEXT: StorageClass: C_NULL (0x0)

# CHECK-NEXT: NumberOfAuxEntries: 0

# CHECK-NEXT: }

# CHECK-NEXT: Symbol {

# CHECK-NEXT: Index: 2

# CHECK-NEXT: Name: .abs

# CHECK-NEXT: Value: 0x0

# CHECK-NEXT: Section: N_ABS

# CHECK-NEXT: Type: 0x0

# CHECK-NEXT: StorageClass: C_NULL (0x0)

# CHECK-NEXT: NumberOfAuxEntries: 0

# CHECK-NEXT: }

# CHECK-NEXT: Symbol {

# CHECK-NEXT: Index: 3

# CHECK-NEXT: Name: .text

# CHECK-NEXT: Value (RelocatableAddress): 0x0

# CHECK-NEXT: Section: .text

# CHECK-NEXT: Type: 0x0

# CHECK-NEXT: StorageClass: C_HIDEXT (0x6B)

# CHECK-NEXT: NumberOfAuxEntries: 1

# CHECK-NEXT: CSECT Auxiliary Entry {

# CHECK-NEXT: Index: 4

# CHECK-NEXT: SectionLen: 0

# CHECK-NEXT: ParameterHashIndex: 0x0

# CHECK-NEXT: TypeChkSectNum: 0x0

# CHECK-NEXT: SymbolAlignmentLog2: 0

# CHECK-NEXT: SymbolType: XTY_ER (0x0)

# CHECK-NEXT: StorageMappingClass: XMC_PR (0x0)

# CHECK-NEXT: StabInfoIndex: 0x0

# CHECK-NEXT: StabSectNum: 0x0

# CHECK-NEXT: }

# CHECK-NEXT: ]

llvm/test/tools/yaml2obj/XCOFF/full-contents.yaml

This file was added.

				## Test that we can explicitly specify all the fields.
				# RUN: yaml2obj %s -o %t
				# RUN: llvm-readobj --headers --symbols %t | FileCheck %s

				--- !XCOFF
				FileHeader:
				  MagicNumber: 0x1DF
				  NumberOfSections: 2
				  CreationTime: 0
				  OffsetToSymbolTable: 0x7A
				  EntriesInSymbolTable: 4
				  AuxiliaryHeaderSize: 0
				  Flags: 0x0
				Sections:
				  - Name: .text
				    Address: 0x0
				    Size: 0x8
				    FileOffsetToData: 0x64
				    FileOffsetToRelocations: 0x0
				    FileOffsetToLineNumbers: 0x0
				    NumberOfRelocations: 0x0
				    NumberOfLineNumbers: 0x0
				    Flags: [ STYP_TEXT ]
				    SectionData: "3860000048"
				  - Name: .data
				    Address: 0x8
				    Size: 0x4
				    FileOffsetToData: 0x6C
				    FileOffsetToRelocations: 0x70
				    FileOffsetToLineNumbers: 0x0
				    NumberOfRelocations: 0x1
				    NumberOfLineNumbers: 0x0
				    Flags: [ STYP_DATA ]
				    SectionData: "00000088"
				    Relocations:
				      - Address: 0x80
				        Symbol: 0x21
				shchenz: Same as above
				        Info: 0x1F
				        Type: 0x0
				Symbols:
				  - Name: .text
				    Value: 0x0
				    Section: .text
				    Type: 0x0
				    StorageClass: C_STAT
				    NumberOfAuxEntries: 1
				  - Name: .data
				    Value: 0x80
				    Section: .data
				    Type: 0x0
				    StorageClass: C_STAT
				    NumberOfAuxEntries: 1

				# CHECK: FileHeader {
				# CHECK-NEXT: Magic: 0x1DF
				# CHECK-NEXT: NumberOfSections: 2
				# CHECK-NEXT: TimeStamp: None (0x0)
				# CHECK-NEXT: SymbolTableOffset: 0x7A
				# CHECK-NEXT: SymbolTableEntries: 4
				# CHECK-NEXT: OptionalHeaderSize: 0x0
				# CHECK-NEXT: Flags: 0x0
				# CHECK-NEXT: }
				# CHECK-NEXT: Sections [
				# CHECK-NEXT: Section {
				# CHECK-NEXT: Index: 1
				# CHECK-NEXT: Name: .text
				# CHECK-NEXT: PhysicalAddress: 0x0
				# CHECK-NEXT: VirtualAddress: 0x0
				jhenderson: Why are there blank lines here and below? Does this test actually pass? I was under the impression that `CHECK-NEXT:` without any contents was an error.
				# CHECK-NEXT: Size: 0x8
				# CHECK-NEXT: RawDataOffset: 0x64
				# CHECK-NEXT: RelocationPointer: 0x0
				# CHECK-NEXT: LineNumberPointer: 0x0
				# CHECK-NEXT: NumberOfRelocations: 0
				# CHECK-NEXT: NumberOfLineNumbers: 0
				# CHECK-NEXT: Type: STYP_TEXT (0x20)
				# CHECK-NEXT: }
				# CHECK-NEXT: Section {
				# CHECK-NEXT: Index: 2
				# CHECK-NEXT: Name: .data
				# CHECK-NEXT: PhysicalAddress: 0x8
				# CHECK-NEXT: VirtualAddress: 0x8
				# CHECK-NEXT: Size: 0x4
				# CHECK-NEXT: RawDataOffset: 0x6C
				# CHECK-NEXT: RelocationPointer: 0x70
				# CHECK-NEXT: LineNumberPointer: 0x0
				# CHECK-NEXT: NumberOfRelocations: 1
				# CHECK-NEXT: NumberOfLineNumbers: 0
				# CHECK-NEXT: Type: STYP_DATA (0x40)
				# CHECK-NEXT: }
				# CHECK-NEXT: ]
				# CHECK-NEXT: Symbols [
				# CHECK-NEXT: Symbol {
				# CHECK-NEXT: Index: 0
				# CHECK-NEXT: Name: .text
				# CHECK-NEXT: Value (RelocatableAddress): 0x0
				# CHECK-NEXT: Section: .text
				# CHECK-NEXT: Type: 0x0
				# CHECK-NEXT: StorageClass: C_STAT (0x3)
				# CHECK-NEXT: NumberOfAuxEntries: 1
				# CHECK-NEXT: Sect Auxiliary Entry For Stat {
				# CHECK-NEXT: Index: 1
				# CHECK-NEXT: SectionLength: 0
				# CHECK-NEXT: NumberOfRelocEnt: 0
				# CHECK-NEXT: NumberOfLineNum: 0
				# CHECK-NEXT: }
				# CHECK-NEXT: }
				# CHECK-NEXT: Symbol {
				# CHECK-NEXT: Index: 2
				# CHECK-NEXT: Name: .data
				# CHECK-NEXT: Value (RelocatableAddress): 0x80
				# CHECK-NEXT: Section: .data
				# CHECK-NEXT: Type: 0x0
				# CHECK-NEXT: StorageClass: C_STAT (0x3)
				# CHECK-NEXT: NumberOfAuxEntries: 1
				# CHECK-NEXT: Sect Auxiliary Entry For Stat {
				# CHECK-NEXT: Index: 3
				# CHECK-NEXT: SectionLength: 0
				# CHECK-NEXT: NumberOfRelocEnt: 0
				# CHECK-NEXT: NumberOfLineNum: 0
				# CHECK-NEXT: }
				# CHECK-NEXT: }
				# CHECK-NEXT: ]
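
The file offsets written explicitly in this test (`FileOffsetToData: 0x64` and `0x6C`, `FileOffsetToRelocations: 0x70`, `OffsetToSymbolTable: 0x7A`) are consistent with the 32-bit XCOFF record sizes: a 20-byte file header, 40-byte section headers, and 10-byte relocation entries. The sketch below reproduces that arithmetic. The layout order (all section data, then all relocations, then the symbol table) is an assumption that happens to match this test; the function name `layout` is hypothetical and is not taken from XCOFFEmitter.cpp.

```python
# Record sizes for 32-bit XCOFF, in bytes.
FILE_HEADER_SIZE = 20
SECTION_HEADER_SIZE = 40
RELOCATION_ENTRY_SIZE = 10


def layout(section_sizes, reloc_counts):
    """Compute data offsets, relocation offsets, and the symbol table offset.

    Assumed order: file header, section headers, raw data of every section,
    relocations of every section, symbol table. A section without
    relocations gets a relocation offset of 0.
    """
    offset = FILE_HEADER_SIZE + SECTION_HEADER_SIZE * len(section_sizes)
    data_offsets = []
    for size in section_sizes:
        data_offsets.append(offset)
        offset += size
    reloc_offsets = []
    for count in reloc_counts:
        reloc_offsets.append(offset if count else 0)
        offset += count * RELOCATION_ENTRY_SIZE
    return data_offsets, reloc_offsets, offset


# .text: Size 0x8, no relocations; .data: Size 0x4, one relocation.
data, relocs, symtab = layout([0x8, 0x4], [0, 1])
print([hex(x) for x in data], [hex(x) for x in relocs], hex(symtab))
```

With the sizes from the test, the data offsets come out as 0x64 and 0x6C, the `.data` relocation offset as 0x70, and the symbol table offset as 0x7A, matching the values the YAML specifies by hand.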

llvm/utils/gn/secondary/llvm/lib/ObjectYAML/BUILD.gn

sources = [
"ELFYAML.cpp",		"ELFYAML.cpp",
"MachOEmitter.cpp",		"MachOEmitter.cpp",
"MachOYAML.cpp",		"MachOYAML.cpp",
"MinidumpEmitter.cpp",		"MinidumpEmitter.cpp",
"MinidumpYAML.cpp",		"MinidumpYAML.cpp",
"ObjectYAML.cpp",		"ObjectYAML.cpp",
"WasmEmitter.cpp",		"WasmEmitter.cpp",
"WasmYAML.cpp",		"WasmYAML.cpp",
		"XCOFFEmitter.cpp"
		thakis: (FYI, you don't need to update the files in llvm/utils/gn when you add files. Those are unsupported build files, and also for simple changes like this they are updated automatically based on the CMakeLists.txt files by a bot. If you _do_ update them, please add a trailing `,` -- but it's better to not update them since it's less work for you and the bot will update them correctly :) )
		Esme: Thanks for the information. I will not update such files in future patches.
"XCOFFYAML.cpp",		"XCOFFYAML.cpp",
"YAML.cpp",		"YAML.cpp",
"yaml2obj.cpp",		"yaml2obj.cpp",
]		]
}		}
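
The added `"XCOFFEmitter.cpp"` entry above lacks the trailing `,` that the reviewer asks for. A minimal sketch of a line-based check for that follows; `entries_missing_comma` is a hypothetical helper written for illustration, it assumes one quoted entry per line, and it is not a GN parser or a replacement for GN's own tooling.

```python
import re

# A line holding exactly one quoted string, optionally followed by a comma.
ENTRY = re.compile(r'^\s*"[^"]*"(,?)\s*$')


def entries_missing_comma(gn_text):
    """Return 1-based line numbers of quoted list entries without a comma."""
    missing = []
    for number, line in enumerate(gn_text.splitlines(), start=1):
        match = ENTRY.match(line)
        if match and not match.group(1):
            missing.append(number)
    return missing


sample = "\n".join([
    "sources = [",
    '  "WasmYAML.cpp",',
    '  "XCOFFEmitter.cpp"',   # no trailing comma
    '  "XCOFFYAML.cpp",',
    "]",
])
print(entries_missing_comma(sample))
```

As the reviewer notes, the simpler option is not to edit these files at all, since a bot regenerates them from the CMakeLists.txt files.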

Revision Contents

Diff 350174
