This is an archive of the discontinued LLVM Phabricator instance.

Paths

llvm/include/llvm/MC/MCExpr.h
llvm/lib/MC/MCExpr.cpp
llvm/lib/MC/MCParser/AsmParser.cpp
llvm/test/MC/ARM/directive_if_offset.s
llvm/test/MC/ARM/directive_if_offset_error.s

Differential D69411

[MC] Resolve the difference of symbols in consecutive MCDataFragements
ClosedPublic

Authored by jcai19 on Oct 24 2019, 3:03 PM.

Details

Reviewers

echristo
nickdesaulniers
MaskRay
jyknight
• espindola
psmith

Commits

rG415a4fbea7c1: [MC] Resolve the difference of symbols in consecutive MCDataFragements

Summary

Try to resolve the difference of two symbols in consecutive MCDataFragments.
This is important for an idiom like "foo:instr; .if . - foo; instr; .endif"
(https://bugs.llvm.org/show_bug.cgi?id=43795).

Diff Detail

Repository

rG LLVM Github Monorepo

Build Status

Buildable 40435
Build 40542: arc lint + arc unit

Event Timeline

nickdesaulniers added inline comments.Oct 28 2019, 12:21 PM

llvm/lib/MC/MCExpr.cpp
540	Move these closer to their use below. It is nice to have them all declared together, but it would be nicer to bail as early as possible without doing more work than necessary.
586	This patch adds checks for thumb, but the test case doesn't use a thumb target triple. Consider adding tests that exercise the newly added code.
591	MicroMips?
598	It looks like this block was copied from L529-L548. Is it possible to adjust the condition on L529 and addend calculation on L532 to support this case, rather than duplicating so much code? Looks like this repeats again below. Is it possible to refactor some of this, maybe into its own method, for better code reuse?
llvm/test/MC/AsmParser/directive_if_with_dot_symbol.s
1 ↗	(On Diff #226704)	Is it possible to add tests for other ISA's, too?
6 ↗	(On Diff #226704)	We're checking that an error occurs? Wouldn't a test for this new feature test that an error does not occur? Or am I misunderstanding the test?

nickdesaulniers added a reviewer: nickdesaulniers.Oct 28 2019, 12:35 PM

nickdesaulniers removed a subscriber: nickdesaulniers.

Update the patch based on comments.

Harbormaster completed remote builds in B40175: Diff 226838.Oct 28 2019, 11:17 PM

Update the test case.

Harbormaster completed remote builds in B40176: Diff 226839.Oct 28 2019, 11:18 PM

jcai19 added inline comments.Oct 28 2019, 11:18 PM

llvm/lib/MC/MCExpr.cpp
586	It's actually checking if SA is flagged with .thumb_func, although adding .thumb_func right before label 9997 does not make the check become true?
591	It seems micromips has been used throughout the file, so I will keep it for consistency.
llvm/test/MC/AsmParser/directive_if_with_dot_symbol.s
1 ↗	(On Diff #226704)	I have not figured out an example with another ISA. The reason is that MCFragments are transparent to the assembly, so I do not know an easy way to force two adjacent labels to be assigned to two MCFragments in the assembly file. For the same reason, the ISA extension right after this line is needed for the integrated assembler to place 9997 and the .if directive into two different fragments in the same section.
6 ↗	(On Diff #226704)	Yes, you are correct. I think I uploaded an incomplete version. Updated it.

I'd like to see some more test cases for the error cases. Off the top of my head I can think of these cases:

// Align fragment in between MCDataFragments
9997: nop ;
      .align 4
      nop
.if . - 9997b == 4 ;

// Fill (I think) fragment in between MCDataFragments
9997: nop ;
      .space 4
      nop
.if . - 9997b == 4 ;

// Relaxable (in Thumb) fragment in between MCDataFragments
9997: nop ;
      b external
      nop
.if . - 9997b == 4 ;

// Constant pool between MCDataFragments,
9997:
      ldr r0,=0x12345678 ;
      .ltorg
      nop  
.if . - 9997b == 4 ;

There is also something called an MCCompactEncodedInstFragment that would cause this to fail, but I think that only exists as part of NaCl bundle locking, which I think isn't too much of a concern.

llvm/lib/MC/MCExpr.cpp
578	I think it would be useful to add a comment explaining this line and what kind of expressions it is expected to support. It isn't intuitive given how AttemptToFoldSymbolOffsetDifference is called.
llvm/test/MC/AsmParser/directive_if_offset.s
1 ↗	(On Diff #226839)	Be careful with module dependencies and targets. An llvm test cannot depend on clang, only on tools that are in the llvm repository such as llvm-mc. All backends like ARM are optional, so tests need to be guarded either by directory or by REQUIRES to prevent failures when backends aren't available. Looking at the directory's lit.local.cfg:
      if not 'X86' in config.root.targets:
          config.unsupported = True
This is backed up by all of the tests in the directory using X86 backend triples. I think that this test, if it remains in this location, will need to be rewritten in x86. If the test needs Arm I think it ought to go in the MC/ARM directory.
1 ↗	(On Diff #226839)	The test fails when there is asm output from llvm-mc. Object output works:
      llvm-mc -filetype=obj -triple=armv7a-linux-gnueabihf directive_if_offset.s -o /dev/null
      echo $?
      0
However, when using the assembler output, this fails:
      llvm-mc --triple armv7a-linux-gnueabihf directive_if_offset.s -filetype=asm -o /dev/null
      directive_if_offset.s:7:5: error: expected absolute expression
      .if . - 9997b == 4 ;
It may be that there is no way to resolve this in the asm output as the data structures might not exist. It will be worth a check to see though. I think that differences between the asm and obj outputs of llvm-mc are frowned on, but I don't think that they are necessarily fatal if coming from an assembler file and not something generated by clang. This might be a problem if the .if exists in inline assembler, not that I'd recommend anyone do that.
5 ↗	(On Diff #226839)	Is this line significant for the test? If it isn't then it is worth taking it out.
8 ↗	(On Diff #226839)	Unless the location is particularly significant I'd recommend just "error: expected absolute expression". This will make it a bit less sensitive if the change is added to.
10 ↗	(On Diff #226839)	You could use llvm-objdump -d on the ELF output to test that the correct instruction had been generated. Normally we'd use llvm-mc -filetype=asm and use FileCheck on that but that isn't working in this case.

When I run check-llvm with this patch applied I get 7 failures:

LLVM :: DebugInfo/Mips/delay-slot.ll
LLVM :: DebugInfo/Mips/dsr-fixed-objects.ll
LLVM :: DebugInfo/Mips/dsr-non-fixed-objects.ll
LLVM :: DebugInfo/Mips/fn-call-line.ll
LLVM :: MC/AsmParser/directive_fill_2.s
LLVM :: MC/MachO/reloc-diff.s
LLVM :: MC/X86/expand-var.s

This is applied to master revision 12c9ffd108345f643df98dfa8653af1a4311ed86 and the tests don't fail with master. Is it possible your most recent changes have broken something? I'm testing a release build of LLVM with clang, assertions enabled.

There are also some test cases for other ISA's in https://bugs.llvm.org/show_bug.cgi?id=41825.

In D69411#1725298, @peter.smith wrote:
When I run check-llvm with this patch applied I get 7 failures:
LLVM :: DebugInfo/Mips/delay-slot.ll
LLVM :: DebugInfo/Mips/dsr-fixed-objects.ll
LLVM :: DebugInfo/Mips/dsr-non-fixed-objects.ll
LLVM :: DebugInfo/Mips/fn-call-line.ll
LLVM :: MC/AsmParser/directive_fill_2.s
LLVM :: MC/MachO/reloc-diff.s
LLVM :: MC/X86/expand-var.s
This is applied to master revision 12c9ffd108345f643df98dfa8653af1a4311ed86 and the tests don't fail with master. Is it possible your most recent changes have broken something? I'm testing a release build of LLVM with clang, assertions enabled.

Yes, I am aware of the failure on LLVM :: MC/MachO/reloc-diff.s and am working on it, but my build does not have assertions on so I haven't seen the other failures. Thanks for verifying, I will take a look at them.

Made sure the patch passes check-clang and check-llvm.

In D69411#1725261, @peter.smith wrote:
I'd like to see some more test cases for the error cases. Off the top of my head I can think of these cases:
// Align fragment in between MCDataFragments
9997: nop ;
      .align 4
      nop
.if . - 9997b == 4 ;
// Fill (I think) fragment in between MCDataFragments
9997: nop ;
      .space 4
      nop
.if . - 9997b == 4 ;
// Relaxable (in Thumb) fragment in between MCDataFragments
9997: nop ;
      b external
      nop
.if . - 9997b == 4 ;
// Constant pool between MCDataFragments,
9997:
      ldr r0,=0x12345678 ;
      .ltorg
      nop  
.if . - 9997b == 4 ;
There is also something called a MCCompactEncodingInstFragment that would cause this to fail, but I think that only exists as part of NaCl bundle locking, which I think isn't too much of a concern.

Thanks for providing these test cases! It seems the third and fourth case do not produce any errors as expected (even with arm gcc). Will dig more.

Harbormaster completed remote builds in B40428: Diff 227537.Nov 1 2019, 2:33 PM

In D69411#1725298, @peter.smith wrote:
When I run check-llvm with this patch applied I get 7 failures:
LLVM :: DebugInfo/Mips/delay-slot.ll
LLVM :: DebugInfo/Mips/dsr-fixed-objects.ll
LLVM :: DebugInfo/Mips/dsr-non-fixed-objects.ll
LLVM :: DebugInfo/Mips/fn-call-line.ll
LLVM :: MC/AsmParser/directive_fill_2.s
LLVM :: MC/MachO/reloc-diff.s
LLVM :: MC/X86/expand-var.s
This is applied to master revision 12c9ffd108345f643df98dfa8653af1a4311ed86 and the tests don't fail with master. Is it possible your most recent changes have broken something? I'm testing a release build of LLVM with clang, assertions enabled.

The latest patch should address these issues. It seems that, other than for .if conditions, llvm::MCExpr::evaluateAsAbsolute is allowed to not immediately resolve the difference of two symbols in adjacent fragments, such as ".long _external_def - _local_def" in llvm/test/MC/MachO/reloc-diff.s. They will be resolved later when finalizing the layout of the object file. The newer version of the code essentially passes an additional flag to tell the function whether the subtraction to evaluate is an .if condition.

In D69411#1725486, @nickdesaulniers wrote:

There are also some test cases for other ISA's in https://bugs.llvm.org/show_bug.cgi?id=41825.

The cause of 41825 is different from the one this patch tries to solve. But their solution looks similar enough so I have added code to handle those as well.

Fix format.

Harbormaster completed remote builds in B40430: Diff 227542.Nov 1 2019, 2:52 PM

MaskRay added a subscriber: MaskRay.Nov 1 2019, 4:11 PM

Update test cases and move their location.

Harbormaster completed remote builds in B40431: Diff 227549.Nov 1 2019, 4:26 PM

Update test cases.

Harbormaster completed remote builds in B40434: Diff 227552.Nov 1 2019, 4:44 PM

jcai19 added inline comments.Nov 1 2019, 4:45 PM

llvm/test/MC/AsmParser/directive_if_offset.s
1 ↗	(On Diff #226839)	I have moved the tests to MC/ARM directory.
10 ↗	(On Diff #226839)	I did verify myself with -filetype=obj and llvm-objdump -disassemble, although I can't seem to trigger if (Asm->isThumbFunc(&SA)) Addend |= 1; and therefore cannot test the else branch here. Any thoughts on this? Thanks.

Update tests.

Harbormaster completed remote builds in B40435: Diff 227553.Nov 1 2019, 4:53 PM

I think that this needs some wider review for a couple of reasons. It looks like this is heading towards a special case evaluation just for .if. I'm not comfortable going too much further in the approval process as this is exceeding my knowledge of MC, and I'd like to see some supporting opinions on whether this is the right thing to do. It will be worth adding some more reviewers, particularly someone familiar with Mach-O. You may need an RFC to the llvm-dev mailing list to draw some attention.

I'd also like to see if there is a way of implementing this without special casing .if. That is a pragmatic solution for a specific problem, but it does make the code harder to understand so it has a cost. Looking at the reloc-diff.s test case the commit log says:

MC/Mach-O: Use the SECTDIFF relocation type for (A - B + constant) where A is external.
 - I'm not sure why, but this is what 'as' does.

I've not been able to easily find a definition of what SECTDIFF is, but it appears to be a relocation that supports subtraction. I don't think that there is an equivalent in ELF. It may be that the case can be resolved by not doing the folding for MachO.

Some other specific comments:

Some of the examples I gave would only fail in Thumb as Arm instructions are all the same size, whereas some Thumb-2 instructions are assembled initially as the 2-byte narrow form, and are relaxed to the wider 4-byte form if the instruction is out of range. You'll need something like -triple=thumbv7a-linux-gnueabihf to see the errors. I'd not expect some of these to fail on GNU as (that is a 2-pass assembler).
If you can make an X86_64 test I recommend doing so as few contributors will have this optionally removed. Substantially more have ARM optionally removed, as do several buildbots.

llvm/include/llvm/MC/MCExpr.h
60	There is a comment in evaluateAsAbsolute:
      // Setting InSet causes us to absolutize differences across sections and that
      // is what the MachO writer uses Addrs for.
I think it would be useful to have something similar for IsCond.
llvm/lib/MC/MCExpr.cpp
495	When there is more than one boolean parameter it can get difficult to track which is which. Can you annotate the call sites that pass literal values with a comment? I've left a comment on the ones that I've noticed. /* InSet */ true, /* IsCond */ false.
521	/* IsCond */ false.
582	If I've understood the code correctly I think we could be a bit more specific here:
      // When there is no layout our ability to resolve differences between symbols is
      // limited. In specific cases where the symbols are both defined in consecutive
      // MCDataFragments the difference can be calculated. This is important for an
      // idiom like foo:instr; .if . - foo; instr; .endif
      // We cannot handle any kind of case where the difference may change due to
      // layout.
584	It is difficult to see how it would be possible to resolve .if conditions at layout time in a single pass assembler. In theory the assembler could evaluate all conditional blocks and select between them at layout time, if such a layout could be converged on.
702	/* InSet */ false, /* IsCond */ false.
708	/* InSet */ true, /* IsCond */ false.
749	/* IsCond */ false.
749	We have IsCond passed in as a parameter to evaluateAsRelocatableImpl, so even if it is passed in true, we set it to false here? Is it important for the value to be false here? If so then it implies that IsCond might not be specific enough a name. If it just doesn't matter then can we pass in IsCond here?
779	/* IsCond */ false.

Address some concerns.

llvm/lib/MC/MCExpr.cpp
578	This is inherited from the original code, except that it originally came before the check for Layout. I am not exactly sure why this check is needed in the first place. In my opinion it is redundant with the check for Layout: if the layout is provided, we can always calculate the difference of two symbols from the order of the sections they are in (MCAsmLayout::getSectionOrder()) and their offsets within those sections. I do need this check for our case, though. Any thoughts on why this check is necessary?
584	Thanks for the clarification, although I am not sure I follow. The code looks iterative to me: https://llvm.org/doxygen/MCAssembler_8cpp_source.html#l00785. I was thinking that as the loop iterates and calls layoutOnce, we could relax (if needed) and calculate the sizes of the fragments before an .if statement and resolve the condition there. But I am not completely convinced myself that it is doable. Also, cases like the one below will create more complexity:
foo:
      jump to bar
      ...
.if . - foo = ${constant integer}
      instr1
.else
      instr2
.endif
      ...
bar:
This creates a dependency loop: depending on the instruction selected in the if-else block, the size of the jump instruction may change due to the number of bits it needs to specify the offset, which in turn affects which instruction should be chosen.
749	Yes, thanks for catching this. IsCond is never used here so I simply replaced it with false. But passing in IsCond is a better choice.

Harbormaster completed remote builds in B40535: Diff 227937.Nov 5 2019, 11:34 AM

nickdesaulniers added a subscriber: scw.Nov 5 2019, 11:54 AM

Add test cases.

Harbormaster completed remote builds in B40552: Diff 227980.Nov 5 2019, 4:00 PM

In D69411#1732215, @peter.smith wrote:

I think that this needs some wider review for a couple of reasons. It looks like this is heading towards a special case evaluation just for .if. I'm not comfortable going too much further in the approval process as this is exceeding my knowledge of MC, and I'd like see some supporting opinions on whether this is the right thing to do. Will be worth adding some more reviewers, particularly someone familiar with Mach-O. You may need an RFC to the llvm-dev mailing list to draw some attention.

I have sent an RFC and am waiting for comments.

I'd also like to see if there is a way of implementing this without special casing .if. That is a pragmatic solution for a specific problem, but it does make the code harder to understand so it has a cost. Looking at the reloc-diff.s test case the commit log says:
MC/Mach-O: Use the SECTDIFF relocation type for (A - B + constant) where A is external.
 - I'm not sure why, but this is what 'as' does.
I've not been able to easily find a definition of what SECTDIFF is, but it appears to be a relocation that supports subtraction, I don't think that there is an equivalent in ELF. It maybe that the case can be resolved by not doing the folding for MachO.

some other specific comments.

Some of the examples I gave would only fail in Thumb as Arm instructions are all the same size, whereas some Thumb-2 instructions are assembled initially as the 2-byte narrow form, and are relaxed to the wider 4-byte form if the instruction is out of range. You'll need something like -triple=thumbv7a-linux-gnueabihf to see the errors. I'd not expect some of these to fail on GNU as (that is a 2-pass assembler).

Thanks for the clarification. I have included all the four test cases.

If you can make an X86_64 test I recommend doing so as few contributors will have this optionally removed. Substantially more have ARM optionally removed, as do several buildbots.

Yes, I agree. Will try to make test cases for x64.

Thanks for the update. I'll wait a few days to see what comments we get. If there are none then I guess there aren't any strong objections.

To clarify the remarks about resolving .if at layout time. I think that there are two major obstacles.

MC assembles instructions once, if the condition in the .if is not satisfied the contents of the block aren't even parsed. For example, the following will assemble if the .if condition fails, but will fail to parse if the .if condition passes. We'd need to change MC to either reparse (effectively making it a multipass assembler), or to parse and remember all parts. This is doable but it isn't a small change.

.text
.if 0
You aren't parsing me are you?
.else
nop
.endif
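The first obstacle can be illustrated with a small stand-alone sketch. It is not MC's parser: the function name `linesActuallyParsed` and the restriction to literal `.if 0` / `.if 1` conditions are assumptions made purely for illustration. It shows why the body of a failed condition never has to be valid assembly: a single-pass reader decides once, at the `.if`, and simply drops the inactive lines.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical sketch, not LLVM code. Returns the lines that would be handed
// to the instruction parser. Only ".if 0", ".if 1", ".else" and ".endif" are
// understood, and lines must match exactly (no whitespace trimming).
std::vector<std::string>
linesActuallyParsed(const std::vector<std::string> &Lines) {
  struct Frame {
    bool ParentActive; // Was the enclosing region being parsed?
    bool Active;       // Is the current branch being parsed?
  };
  std::vector<Frame> Stack;
  std::vector<std::string> Parsed;
  auto active = [&] { return Stack.empty() || Stack.back().Active; };

  for (const std::string &L : Lines) {
    if (L.rfind(".if ", 0) == 0) {
      // The condition is decided here, once, and never revisited.
      bool Parent = active();
      bool Cond = L.substr(4) != "0";
      Stack.push_back({Parent, Parent && Cond});
    } else if (L == ".else") {
      assert(!Stack.empty() && ".else without .if");
      Frame &F = Stack.back();
      F.Active = F.ParentActive && !F.Active;
    } else if (L == ".endif") {
      assert(!Stack.empty() && ".endif without .if");
      Stack.pop_back();
    } else if (active()) {
      Parsed.push_back(L);
    }
    // Inactive lines are dropped without ever being looked at.
  }
  return Parsed;
}
```

Feeding it the six lines of the example above yields only `.text` and `nop`. Deferring the decision to layout time would mean keeping (and parsing) both branches instead of dropping one.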

The second problem, and not one unique to llvm-mc, is that it is possible to write a program that doesn't converge, something like (not testable as it needs relaxation and late evaluation of .if):

label:
 beq after // In thumb 2 this 2-byte branch will be relaxed to a 4-byte branch if out of range. 
 .if . - label == 2 
 .space 1024 * 1024 // sufficient to make after out of range
 .endif
after:
 nop

In pass 1, beq is 2-bytes in size, the .if passes, beq is then relaxed to 4-bytes, which would make the .if fail, which then makes beq 2-bytes ...
The interaction with relaxation could be fixed by making all size increases permanent. However, I think it is possible to write multiple .if conditions that conflict. In Arm's old 2-pass assembler (pass 1 finds all sizes so the layout is known, pass 2 encodes instructions knowing the layout), we had an error message when the equivalent of .if evaluated differently in subsequent passes. In summary, I think it would be a major change to MC to support something like late evaluation of .if in a reliable way.
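The oscillation described above can be reproduced with a toy model. This is a sketch under stated assumptions, not anything MC does: `State`, `step`, `passesToConverge` and the `NarrowRange` parameter are all hypothetical names, and the branch range is passed in rather than taken from the Thumb-2 encoding.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical model of the example: one branch and one .if block.
struct State {
  uint32_t BranchSize; // 2 (narrow) or 4 (wide) bytes
  bool IfTaken;        // did ".if . - label == 2" hold?
};

// One imagined "pass": re-evaluate the .if from the current branch size, then
// re-size the branch from the distance that result produces.
State step(State S, uint32_t NarrowRange) {
  bool IfTaken = (S.BranchSize == 2);          // .if . - label == 2
  uint32_t Gap = IfTaken ? 1024u * 1024u : 0u; // .space 1024 * 1024
  uint32_t BranchSize = (Gap > NarrowRange) ? 4u : 2u;
  return {BranchSize, IfTaken};
}

// Returns the pass at which the state stops changing, or -1 if it has not
// settled within MaxPasses.
int passesToConverge(uint32_t NarrowRange, int MaxPasses) {
  assert(MaxPasses > 0);
  State S{2, false};
  for (int Pass = 1; Pass <= MaxPasses; ++Pass) {
    State Next = step(S, NarrowRange);
    if (Next.BranchSize == S.BranchSize && Next.IfTaken == S.IfTaken)
      return Pass;
    S = Next;
  }
  return -1;
}
```

With a narrow range smaller than the 1 MiB gap the state alternates between (4 bytes, taken) and (2 bytes, not taken) forever, so `passesToConverge` returns -1 however many passes are allowed. With a range larger than the gap it settles on the second pass.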

nickdesaulniers edited reviewers, added: peter.smith, MaskRay; removed: dschuff, sunfish.Nov 6 2019, 9:30 AM

nickdesaulniers added subscribers: sunfish, dschuff.

nickdesaulniers added inline comments.Nov 6 2019, 10:05 AM

llvm/lib/MC/MCExpr.cpp
584	See also b/132538429. In order to evaluate the .if statement truthfully, it would have to be done after relaxation (well, during, since the result could change the relaxation), and LLVM MC is just not set up for this currently. Even then, you could probably construct an .if statement that creates a paradox and never converges on a valid relaxation.

nickdesaulniers added inline comments.Nov 6 2019, 10:07 AM

llvm/lib/MC/MCExpr.cpp
486	You can move this declaration down closer to where it is used. No need to construct if the below conditional returns early.

In D69411#1735172, @peter.smith wrote:
Thanks for the update. I'll wait a few days to see what comments we get. If there are none then I guess there aren't any strong objections.

To clarify the remarks about resolving .if at layout time. I think that there are two major obstacles.

MC assembles instructions once, if the condition in the .if is not satisfied the contents of the block aren't even parsed. For example, the following will assemble if the .if condition fails, but will fail to parse if the .if condition passes. We'd need to change MC to either reparse (effectively making it a multipass assembler), or to parse and remember all parts. This is doable but it isn't a small change.
.text
.if 0
You aren't parsing me are you?
.else
nop
.endif
The second problem, and not one unique to llvm-mc, is that it is possible to write a program that doesn't converge, something like (not testable as it needs relaxation and late evaluation of .if):
label:
 beq after // In thumb 2 this 2-byte branch will be relaxed to a 4-byte branch if out of range. 
 .if . - label == 2 
 .space 1024 * 1024 // sufficient to make after out of range
 .endif
after:
 nop
In pass 1, beq is 2-bytes in size, the .if passes, beq is then relaxed to 4-bytes, which would make the .if fail, which then makes beq 2-bytes ...
The interaction with relaxation could be fixed by making all size increases permanent. However, I think it is possible to write multiple .if conditions that conflict. In Arm's old 2-pass assembler (pass 1 finds all sizes so the layout is known, pass 2 encodes instructions knowing the layout), we had an error message when the equivalent of .if evaluated differently in subsequent passes. In summary, I think it would be a major change to MC to support something like late evaluation of .if in a reliable way.

Thanks for all the clarification and examples. I was wondering whether there would be an easier way to do late evaluation of .if conditions so we could avoid the current implementation, which I totally agree is not straightforward and is somewhat ad hoc. But based on what you and Nick said, a major overhaul of MC seems to be the only alternative, which is probably overkill for the particular issue this patch tries to solve. However, this patch will not solve the second case you brought up. How likely do you think we are to encounter such cases? Should we consider the multiple-pass solution to future-proof against them if they happen frequently enough, or could we rewrite the assembly instead to avoid such complexity?

llvm/lib/MC/MCExpr.cpp
584	Yeah, thanks for the explanation. I wonder if GAS would be able to support all these paradoxes.

Handle cases at https://bugs.llvm.org/show_bug.cgi?id=41825.

jcai19 retitled this revision from [MC] Calculate difference of symbols in two fragments when possible to [MC] Enhance parsing of .if conditions.Nov 9 2019, 10:42 AM

jcai19 edited the summary of this revision. (Show Details)

Harbormaster completed remote builds in B40718: Diff 228586.Nov 9 2019, 10:43 AM

llvm/lib/MC/MCExpr.cpp
591	This might not be needed with D70062 as the !SA.isUnset() and !SB.isUnset() on line 568 will both evaluate true after it.

Remove the code redundant to D70062.

Harbormaster completed remote builds in B40760: Diff 228740.Nov 11 2019, 10:49 AM

jcai19 marked 2 inline comments as done.Nov 11 2019, 10:50 AM

jcai19 added inline comments.

llvm/lib/MC/MCExpr.cpp
591	Thanks for the clarification. I have adjusted my code accordingly.

jcai19 retitled this revision from [MC] Enhance parsing of .if conditions to [MC] Parse .if conditions with symbols in consecutive MCDataFragements.Nov 11 2019, 10:54 AM

jcai19 edited the summary of this revision. (Show Details)

jcai19 updated this revision to Diff 228746.Nov 11 2019, 11:55 AM

Fix test cases.

Harbormaster completed remote builds in B40763: Diff 228746.Nov 11 2019, 12:03 PM

Fix a typo.

Harbormaster completed remote builds in B40765: Diff 228751.Nov 11 2019, 12:12 PM

jcai19 marked an inline comment as done.Nov 12 2019, 12:08 AM

jcai19 added inline comments.

llvm/test/MC/AsmParser/directive_if_offset.s
5 ↗	(On Diff #226839)	This line is required to reproduce this issue, as it changes the subtarget and forces the creation of a new MCDataFragment (https://llvm.org/doxygen/MCObjectStreamer_8cpp_source.html#l00157). I found a similar directive for i386, .arch + cpu_type (https://sourceware.org/binutils/docs/as/i386_002dArch.html#i386_002dArch), although the parser does not support it. To have a similar test case on x86, it seems we would have to first support that directive. However, I could not find any uses of this directive in llvm or the Linux kernel, so I assume it is not commonly used. Will keep the test case for arm for now.

To summarise where I think we are right now.

D70062 fixes a problem affecting eager evaluation of .if expressions within a single MCDataFragment. Specifically, the problem arises when there is a label that would be inserted into the MCDataFragment, but at the time of encountering the label the fragment hadn't been created. MC will attempt to reuse an MCDataFragment for new instructions, see CanReuseDataFragment() in MCObjectStreamer.cpp. As noted earlier, when the Subtarget changes a new MCDataFragment is created. In the majority of cases this is done via a .arch or .arch_extension directive.
This patch extends the eager evaluation to cope with two adjacent MCDataFragments. My understanding is that this only occurs in the following circumstances:
- When bundle locking and --mc-relax-all are used; there is a complicated one-instruction-per-fragment scheme plus fragment merging that goes on here. This is only used in NaCl, whose status in Chrome I'm not sure of. I think it is at best deprecated.
- When there is a Subtarget change between MCDataFragments.
In all other cases, such as .align, .space and a relaxable instruction, a separate non-MCDataFragment is created, so we cannot fix these up.
The patch restricts the eager evaluation to .if as some asm backends do not want the expressions between fragments evaluated eagerly in some cases.
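The fragment rules summarised above can be sketched as a small model. Everything here is hypothetical and much simpler than MCObjectStreamer: `FragKind`, `Event`, `buildFragments` and `onlyDataFragments` are illustrative names, bundle locking is ignored, and a subtarget change is assumed to take effect at the next instruction.

```cpp
#include <cassert>
#include <vector>

// Hypothetical, simplified model; these are not LLVM's real classes.
enum class FragKind { Data, Align, Fill, Relaxable };
enum class Event { Inst, RelaxableInst, Align, Space, SubtargetChange };

// Maps a stream of assembler events to the fragment kinds they would produce
// under the rules summarised above.
std::vector<FragKind> buildFragments(const std::vector<Event> &Events) {
  std::vector<FragKind> Frags;
  bool SubtargetChanged = false;
  for (Event E : Events) {
    switch (E) {
    case Event::Inst:
      // Reuse the current data fragment unless there is none, the last
      // fragment is not a data fragment, or the subtarget has changed.
      if (Frags.empty() || Frags.back() != FragKind::Data || SubtargetChanged)
        Frags.push_back(FragKind::Data);
      SubtargetChanged = false;
      break;
    case Event::RelaxableInst:
      Frags.push_back(FragKind::Relaxable);
      break;
    case Event::Align:
      Frags.push_back(FragKind::Align);
      break;
    case Event::Space:
      Frags.push_back(FragKind::Fill);
      break;
    case Event::SubtargetChange:
      SubtargetChanged = true; // e.g. .arch or .arch_extension
      break;
    }
  }
  return Frags;
}

// In this model a difference can be folded eagerly only if every fragment
// involved is a data fragment, since only their sizes are already final.
bool onlyDataFragments(const std::vector<FragKind> &Frags) {
  for (FragKind K : Frags)
    if (K != FragKind::Data)
      return false;
  return true;
}
```

In this model `Inst, SubtargetChange, Inst` gives two adjacent data fragments, which is the case the patch targets, while `Inst, Align, Inst` puts an alignment fragment in between and cannot be folded before layout.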

From the perspective of the linux kernel, is D70062 sufficient? For example, if the Subtarget-changing directives are used in such a way that they don't create new MCDataFragments in a sensitive location then we may not need this. For example, the following will assemble with D70062:

        .arch_extension sec // Outside of .text
        
        .text
9997:
        nop
.if . - 9997b == 0
// CHECK-NOT: error: expected absolute expression
orr r1, r1, #1
.else
orr r1, r1, #2
.endif

As will:

        .text
        .arch_extension sec                
        nop
9997:
        nop
.if . - 9997b == 0
// CHECK-NOT: error: expected absolute expression
orr r1, r1, #1
.else
orr r1, r1, #2
.endif

Could the examples in the kernel be altered to not require this? It does seem like we are writing a lot of code for a small number of easily resolved cases.

I note that it is possible to write a contrived example that this patch can't handle (needs --triple=armv8a-linux-gnu as crypto and crc are V8 extensions) although I'm not suggesting implementing support for it.

        .text

        nop
9997:
        .arch_extension crypto
        nop
        .arch_extension crc
        nop
.if . - 9997b == 0
// CHECK-NOT: error: expected absolute expression
orr r1, r1, #1
.else
orr r1, r1, #2
.endif

if.s:11:5: error: expected absolute expression
.if . - 9997b == 0

llvm/lib/MC/MCExpr.cpp
585	With the new information coming from D70062 I think we need to make this comment more specific. For example: When the Subtarget is changed a new MCDataFragment is created. This handles the case of foo: instr; .arch_extension ext; instr .if . - foo
llvm/test/MC/ARM/directive_if_offset.s
6	Can we add a comment to explain the importance of .arch_extension, such as: // Create a new MCDataFragment due to Subtarget change

Thanks for the summary.

In D69411#1742030, @peter.smith wrote:

To summarise where I think we are right now.

D70062 fixes a problem affecting eager evaluation of .if expressions within a single MCDataFragment. Specifically, the problem arises when there is a label that would be inserted into the MCDataFragment, but at the time of encountering the label the fragment hadn't been created. MC will attempt to reuse an MCDataFragment for new instructions, see CanReuseDataFragment() in MCObjectStreamer.cpp. As noted earlier, when the Subtarget changes a new MCDataFragment is created. In the majority of cases this is done via a .arch or .arch_extension directive.

This patch extends the eager evaluation to cope with two adjacent MCDataFragments. My understanding is that this only occurs in the following circumstances:

When bundle locking and --mc-relax-all is used, there is a complicated 1 instruction per fragment + fragment merging that goes on here. This is only used in NaCl which I'm not sure what the status of in Chrome is. I think it is at best deprecated.

I'll follow up on this. There's likely a lot of code that can be dropped if that's the case.

When there is a Subtarget change between MCDataFragments.

In all other cases such as .align, .space and a relaxable instruction there is a separate non MCDataFragment created so we cannot fix these up.

The patch restricts the eager evaluation to .if as some asm backends do not want the expressions between fragments evaluated eagerly in some cases.

From the perspective of the linux kernel, is D70062 sufficient?

I can still reproduce https://github.com/ClangBuiltLinux/linux/issues/742 with https://reviews.llvm.org/D70062 applied.

For example, if the Subtarget-changing directives are used in such a way that they don't create new MCDataFragments in a sensitive location then we may not need this. For example, the following will assemble with D70062:
        .arch_extension sec // Outside of .text
        
        .text
9997:
        nop
.if . - 9997b == 0
// CHECK-NOT: error: expected absolute expression
orr r1, r1, #1
.else
orr r1, r1, #2
.endif
As will:
        .text
        .arch_extension sec                
        nop
9997:
        nop
.if . - 9997b == 0
// CHECK-NOT: error: expected absolute expression
orr r1, r1, #1
.else
orr r1, r1, #2
.endif
Could the examples in the kernel be altered to not require this? It does seem like we are writing a lot of code for a small number of easily resolved cases.

The case from the kernel is testing that a single instruction is 2B rather than 4B (narrow vs wide). See https://github.com/ClangBuiltLinux/linux/blob/de620fb99ef2bd52b2c5bc52656e89dcfc0e223a/arch/arm/include/asm/assembler.h#L255-L270.

I don't see any subarch directives there.

In D69411#1742030, @peter.smith wrote:

I note that it is possible to write a contrived example that this patch can't handle (needs --triple=armv8a-linux-gnu as crypto and crc are V8 extensions) although I'm not suggesting implementing support for it.
        .text

        nop
9997:
        .arch_extension crypto
        nop
        .arch_extension crc
        nop
.if . - 9997b == 0
// CHECK-NOT: error: expected absolute expression
orr r1, r1, #1
.else
orr r1, r1, #2
.endif
if.s:11:5: error: expected absolute expression
.if . - 9997b == 0

If we need to support cases like this, we can probably add a while loop and check if the fragments between two symbols are all MCDataFragments and sum their sizes up.
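The loop suggested above can be sketched in isolation. This is a toy model under stated assumptions, not LLVM's API: `Fragment`, `Kind`, `Label`, and `foldDifference` are hypothetical names, and a label is reduced to a fragment index plus an offset inside that fragment. The function only succeeds when every fragment from B's through A's is a data fragment, since only those have a size that is known before layout.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy stand-ins for MC fragments; these are not LLVM types.
enum class Kind { Data, Align, Relaxable };

struct Fragment {
  Kind K;
  uint64_t Size; // Only trusted for Kind::Data in this sketch.
};

// A label is the index of its fragment plus an offset inside that fragment.
struct Label {
  std::size_t Frag;
  uint64_t Offset;
};

// Computes A - B into Res and returns true when every fragment from B's
// through A's is a data fragment. Returns false otherwise, or when B's
// fragment comes after A's (not handled, to keep the sketch short).
bool foldDifference(const std::vector<Fragment> &Frags, Label A, Label B,
                    int64_t &Res) {
  assert(A.Frag < Frags.size() && B.Frag < Frags.size());
  if (B.Frag > A.Frag)
    return false;
  int64_t Diff =
      static_cast<int64_t>(A.Offset) - static_cast<int64_t>(B.Offset);
  for (std::size_t I = B.Frag; I <= A.Frag; ++I) {
    if (Frags[I].K != Kind::Data)
      return false;
    // Fragments before A's contribute their whole size to the distance.
    if (I < A.Frag)
      Diff += static_cast<int64_t>(Frags[I].Size);
  }
  Res = Diff;
  return true;
}
```

Modelling the contrived example as three 4-byte data fragments, with the label at the end of the first and `.` at the end of the third, the sketch yields a difference of 8 bytes, so `.if . - 9997b == 0` would evaluate to false rather than raise an error.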

In D69411#1742501, @nickdesaulniers wrote:

The case from the kernel is testing that a single instruction is 2B rather than 4B (narrow vs wide). See https://github.com/ClangBuiltLinux/linux/blob/de620fb99ef2bd52b2c5bc52656e89dcfc0e223a/arch/arm/include/asm/assembler.h#L255-L270.

I don't see any subarch directives there.

This issue happens when the ALT_UP macro is expanded in files like arch/arm/mm/proc-v7.S. This is how I reproduce the code of interest:
clang -I./arch/arm/include -I./arch/arm/include/generated -I./include -I./arch/arm/include/uapi -I./arch/arm/include/generated/uapi -I./include/uapi -I./include/generated/uapi -include ./include/linux/kconfig.h -D__ASSEMBLY__ --target=arm-linux-gnueabihf -D__LINUX_ARM_ARCH__=7 -march=armv7-a -include asm/unified.h -S arch/arm/mm/proc-v7.S &> clang-proc-v7.s

In D69411#1742501, @nickdesaulniers wrote:

Thanks for the summary.

In D69411#1742030, @peter.smith wrote:

To summarise where I think we are right now.

D76002 fixes a problem affecting eager evaluation of .if expressions within a single MCDataFragment. Specifically, when there is a label that would be inserted into the MCDataFragment, but at the time of encountering the label the fragment hadn't been created. MC will attempt to reuse an MCDataFragment for new instructions, see CanReuseDataFragment() in MCObjectStreamer.cpp. As noted earlier, when the Subtarget changes a new MCDataFragment is created. In the majority of cases this is done via a .arch or .arch_extension directive.
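The fragment-reuse behaviour described above can be illustrated with a small model. This is only a sketch: `ToyStreamer`, `DataFragment`, `setSubtarget`, and `emitInstruction` are hypothetical names, and the real decision in MCObjectStreamer considers more than the subtarget. It shows why two instructions separated by a subtarget-changing directive end up in different data fragments.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy model only; the real logic lives in MCObjectStreamer.
struct DataFragment {
  std::string Subtarget; // Feature set the contents were encoded for.
  std::size_t Size;      // Bytes emitted so far.
};

class ToyStreamer {
public:
  // Models a directive such as .arch_extension changing the subtarget.
  void setSubtarget(const std::string &S) { Current = S; }

  // Appends an instruction, reusing the last fragment only when it was
  // created for the current subtarget. Returns the fragment index used.
  std::size_t emitInstruction(std::size_t Bytes) {
    assert(Bytes > 0 && "instructions are never empty");
    if (Frags.empty() || Frags.back().Subtarget != Current)
      Frags.push_back(DataFragment{Current, 0});
    Frags.back().Size += Bytes;
    return Frags.size() - 1;
  }

  std::size_t fragmentCount() const { return Frags.size(); }

private:
  std::string Current = "base";
  std::vector<DataFragment> Frags;
};
```

In this model, two `emitInstruction` calls in a row share fragment 0, while a `setSubtarget` call between them pushes the second instruction into fragment 1, which is the situation this patch tries to handle.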

This patch extends the eager evaluation to cope with two adjacent MCDataFragments. My understanding is that this only occurs in the following circumstances:

When bundle locking and --mc-relax-all are used, there is a complicated one-instruction-per-fragment scheme plus fragment merging that goes on here. This is only used in NaCl, and I'm not sure what its status in Chrome is. I think it is at best deprecated.

I'll follow up on this. There's likely a lot of code that can be dropped if that's the case.

When there is a Subtarget change between MCDataFragments.

In all other cases such as .align, .space and a relaxable instruction there is a separate non MCDataFragment created so we cannot fix these up.

The patch restricts the eager evaluation to .if as some asm backends do not want the expressions between fragments evaluated eagerly in some cases.

From the perspective of the linux kernel, is D76002 sufficient?

I can still reproduce https://github.com/ClangBuiltLinux/linux/issues/742 with https://reviews.llvm.org/D70062 applied.

That's unfortunate. I've found out how to reproduce the original issue:

clang -I./arch/arm/include -I./arch/arm/include/generated -I./include -I./arch/arm/include/uapi -I./arch/arm/include/generated/uapi -I./include/uapi -I./include/generated/uapi -include ./include/linux/kconfig.h -D__ASSEMBLY__ --target=arm-linux-gnueabihf -D__LINUX_ARM_ARCH__=7 -march=armv7-a -include asm/unified.h -c -o /tmp/proc-v7.o arch/arm/mm/proc-v7.S

If I extract just the small part of the test case, it works even without D76002.

The case from the kernel is testing that a single instruction is 2B rather than 4B (narrow vs wide). See https://github.com/ClangBuiltLinux/linux/blob/de620fb99ef2bd52b2c5bc52656e89dcfc0e223a/arch/arm/include/asm/assembler.h#L255-L270.

I don't see any subarch directives there.

They are there, I preprocessed to get a rather large output file including:

.globl cpu_v7_dcache_clean_area ; .align 0 ; cpu_v7_dcache_clean_area:
 9998: nop @ MP extensions imply L1 PTW
 .equ up_b_offset, 1f - 9998b ; .pushsection ".alt.smp.init", "a" ; .long 9998b ; b . + up_b_offset ; .popsection
 ret lr
1: dcache_line_size r2, r3
2: mcr p15, 0, r0, c7, c10, 1 @ clean D entry
 add r0, r0, r2
 subs r1, r1, r2
 bhi 2b
 dsb ishst
 ret lr
.type cpu_v7_dcache_clean_area, %function; .size cpu_v7_dcache_clean_area, .-cpu_v7_dcache_clean_area


 .arch_extension sec
.globl cpu_v7_smc_switch_mm ; .align 0 ; cpu_v7_smc_switch_mm:
 stmfd sp!, {r0 - r3}
 movw r0, #:lower16:(((1) << 31) | ((0) << 30) | (((0) & 0x3F) << 24) | ((0x8000) & 0xFFFF))
 movt r0, #:upper16:(((1) << 31) | ((0) << 30) | (((0) & 0x3F) << 24) | ((0x8000) & 0xFFFF))
 smc #0
 ldmfd sp!, {r0 - r3}
 b cpu_v7_switch_mm
.type cpu_v7_smc_switch_mm, %function; .size cpu_v7_smc_switch_mm, .-cpu_v7_smc_switch_mm

...
...

 9998: orr r1, r1, #((0 << 0) | (1 << 6))|(1 << 1)|(1 << 5)|(1 << 3)
 .pushsection ".alt.smp.init", "a" ; .long 9998b ;9997: orr r1, r1, #((1 << 0) | (1 << 6))|(3 << 3) ; .if . - 9997b == 2 ; nop ; .endif ; .if . - 9997b != 4 ; .error "ALT_UP() content must assemble to exactly 4 bytes"; .endif ; .popsection

This confused me for a bit, I think something like the following is happening.
.pushsection ".alt.smp.init"
// some instructions with Subtarget X
.popsection

.arch_extension sec // or anything else that changes the subtarget
// more stuff

.pushsection ".alt.smp.init"
// we are returning to .alt.smp.init but our subtarget is now X +sec
// the next instruction will start a new fragment
.popsection

It is getting late over here so I need to go home. Would jcai19 or yourself be able to investigate to confirm, check further? We would need to start a new MCDataFragment within .alt.smp.init for the new SubTarget, but I'd expect it all to happen before the 9997: in the failing case.

Making this work only on ".if" is IMO a non-starter, at least without understanding why. So I looked into what broke with llvm/test/MC/MachO/reloc-diff.s.

Basically, with LLVM's current representation, assuming that consecutive fragments are never moved with respect to each-other is invalid. In both ELF and Mach-O. In ELF, we have the ability to use numbered subsections that can insert new fragments between existing fragments. In Mach-O, fragments can effectively be turned into subsections with the 'subsections_via_symbols' directive. Committing this patch as-is would both be ugly and wrong, since we'd allow computing nonsense offsets -- even if we only restrict it to ".if".

I think it's a mistake that it works like that, and I'm going to spend a little bit of time to see if it'll be easy to fix this representational mistake in LLVM (in a separate patch), so that we have fragment lists which _are_ guaranteed to be kept exactly in the order they appear.

For this review, I have two requests:

Undo all the ".if"-only hacks. (understanding that it will cause some tests to fail, for now).
Fix the code to support arbitrary numbers of fragments between the symbols, of kinds MCFillFragment and MCDataFragment (and maybe others if they are fixed size -- I haven't looked through all of the kinds). Probably best would be to introduce a helper function to calculate the delta between two arbitrary symbols, given an optional layout (and either return an answer or a failure indication). This should make AttemptToFoldSymbolOffsetDifference significantly more straightforward.

I believe these examples should work, and will after making the latter change:

1:
.arch armv7a
nop
.arch armv4
nop
.arch armv7a
nop
.if . - 1b != 0
.word 0x12345678
.endif

2:
nop
.zero 0x10000
nop
.if . - 2b == 4
.word 0x12345678
.endif

jyknight added a reviewer: jyknight.Nov 12 2019, 1:42 PM

In D69411#1742896, @jyknight wrote:
Making this work only on ".if" is IMO a non-starter, at least without understanding why. So I looked into what broke with llvm/test/MC/MachO/reloc-diff.s.

Basically, with LLVM's current representation, assuming that consecutive fragments are never moved with respect to each-other is invalid. In both ELF and Mach-O. In ELF, we have the ability to use numbered subsections that can insert new fragments between existing fragments. In Mach-O, fragments can effectively be turned into subsections with the 'subsections_via_symbols' directive. Committing this patch as-is would both be ugly and wrong, since we'd allow computing nonsense offsets -- even if we only restrict it to ".if".

I think it's a mistake that it works like that, and I'm going to spend a little bit of time to see if it'll be easy to fix this representational mistake in LLVM (in a separate patch), so that we have fragment lists which _are_ guaranteed to be kept exactly in the order they appear.

For this review, I have two requests:

Undo all the ".if"-only hacks. (understanding that it will cause some tests to fail, for now).

Fix the code to support arbitrary numbers of fragments between the symbols, of kinds MCFillFragment and MCDataFragment (and maybe others if they are fixed size -- I haven't looked through all of the kinds). Probably best would be to introduce a helper function to calculate the delta between two arbitrary symbols, given an optional layout (and either return an answer or a failure indication). This should make AttemptToFoldSymbolOffsetDifference significantly more straightforward.

I believe these examples should work, and will after making the latter change:
1:
.arch armv7a
nop
.arch armv4
nop
.arch armv7a
nop
.if . - 1b != 0
.word 0x12345678
.endif

2:
nop
.zero 0x10000
nop
.if . - 2b == 4
.word 0x12345678
.endif

Thanks for the clarification. It will be great if we can remove the restriction to ".if". I will make changes accordingly while the representation of subsections is being changed.

Support arbitrary numbers of fragments of kind MCDataFragment between the
symbols. Getting the size of MCFillFragments seems to be less
straightforward; see https://llvm.org/doxygen/MCAssembler_8cpp_source.html#l00287.

Harbormaster completed remote builds in B40935: Diff 229225.Nov 13 2019, 9:05 PM

@jyknight while working on a different issue, I happened to take a look at the implementation of subsections (MCSection::getSubsectionInsertionPoint). From what I understood, fragments in subsections are always placed after fragments not in any subsection, and fragments in the same subsection (or not in any subsection) are always inserted in order and kept separate from fragments in other subsections. Is it safe to assume that what lies between two fragments, either both not in any subsection or both in the same subsection, will remain unchanged once they are inserted into the fragment list of a section? If so, can we resolve differences of two labels in fragments with such a restriction?

Limit the scope to fragments in the same subsection (fragments not in any subsections are placed in subsection 0). Fragments are inserted into each subsection in order and the difference of their offsets should be fixed once they are inserted. This is experimental as it breaks many tests. Will address the test failure if this idea is proven correct.
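The ordering argument above can be checked against a toy model. The sketch below assumes, as a simplification of what MCSection::getSubsectionInsertionPoint is described as doing, that a new fragment is placed after the last fragment whose subsection number is less than or equal to its own. `Fragment`, `insertFragment`, and `distanceWithinSubsection` are hypothetical names used only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Toy model; field names are illustrative, not LLVM's.
struct Fragment {
  unsigned Subsection;
  uint64_t Size;
  int Id; // Emission order, used only to inspect the result.
};

// Inserts F after the last fragment whose subsection number is <= F's, so
// each subsection stays contiguous and keeps its emission order.
void insertFragment(std::vector<Fragment> &Section, Fragment F) {
  std::size_t Pos = 0;
  while (Pos < Section.size() && Section[Pos].Subsection <= F.Subsection)
    ++Pos;
  Section.insert(Section.begin() + Pos, F);
}

// Sums the sizes of fragments from index From up to (not including) To.
// Returns false when a fragment from another subsection lies in [From, To],
// because a later insertion could then change the distance.
bool distanceWithinSubsection(const std::vector<Fragment> &Section,
                              std::size_t From, std::size_t To,
                              uint64_t &Res) {
  assert(From <= To && To < Section.size());
  Res = 0;
  for (std::size_t I = From; I <= To; ++I) {
    if (Section[I].Subsection != Section[From].Subsection)
      return false;
    if (I < To)
      Res += Section[I].Size;
  }
  return true;
}
```

Emitting a subsection 0 fragment, then a subsection 1 fragment, then another subsection 0 fragment leaves the two subsection 0 fragments adjacent in this model. Their distance is therefore stable, while a distance that spans into subsection 1 is rejected.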

jcai19 retitled this revision from [MC] Parse .if conditions with symbols in consecutive MCDataFragements to [MC] Resolve the difference of symbols in consecutive MCDataFragements.Aug 17 2020, 4:19 PM

jcai19 edited the summary of this revision.

Harbormaster completed remote builds in B68697: Diff 286172.Aug 17 2020, 5:09 PM

(1) Fixed a bug while rebasing the patch last time. The failed unit tests dropped to 2 after the fix.
(2) Updated the 2 failed unit tests.

Herald added a reviewer: • espindola.Aug 18 2020, 5:48 PM

Herald added a subscriber: emaste.

Set the subsection number for fragments in subsections.

Harbormaster completed remote builds in B68828: Diff 286441.Aug 18 2020, 6:29 PM

Harbormaster completed remote builds in B68831: Diff 286445.Aug 18 2020, 6:52 PM

Worth mentioning that this is needed by (this assembly file does not assemble with the integrated assembler as of today):

make ARCH=arm CROSS_COMPILE=arm-linux-gnueabihf- LLVM=1 LLVM_IAS=1 defconfig arch/arm/mm/proc-v7.o

llvm/include/llvm/MC/MCFragment.h
74 ↗	(On Diff #286445)	Move below LayoutOrder. sizeof(MCDataFragment) will not change. Alternatively, disable the folding for Mach-O because this is too subtle (if getSubsectionViaSymbols()).
llvm/lib/MC/MCExpr.cpp
586	Consider swapping then and else branches to avoid `!`
llvm/test/MC/ARM/directive_if_offset.s
2	In this directory `file-name.s` is more common. What about directive-if-sub.s? sub is more meaningful than offset. Add a `llvm-mc -triple armv7a-linux-gnueabihf %s -o /dev/null 2>&1` test to show that -filetype=asm does not work (MCAssembler * is null so, but this is less of an issue)
llvm/test/MC/ARM/directive_if_offset_error.s
2	See ELF/reloc-directive.s You can use --defsym=ERR=1 to merge the tests into directive-if-sub.s
llvm/test/MC/ARM/thumb2_directive_if_offset_error.s
1 ↗	(On Diff #286445)	ditto

Address some of @MaskRay's comments.

This comment has been deleted.

llvm/include/llvm/MC/MCFragment.h
74 ↗	(On Diff #286445)	Thanks for all the comments! Can you please elaborate this one? Are you suggesting to move SubsectionNumber right after LayoutOrder?
llvm/test/MC/ARM/directive_if_offset_error.s
2	--defsym=ERR=1 does not seem to work if I move the code from this file into directive-if-subtraction.s, as the run commands still fail. Maybe I am missing something? Also directive-if-subtraction.s requires armv7a so this run command will fail once I make the move.

Harbormaster completed remote builds in B69393: Diff 287545.Aug 24 2020, 7:26 PM

MaskRay added inline comments.Aug 27 2020, 10:43 AM

llvm/test/MC/ARM/directive_if_offset_error.s
2	See `ELF/reloc-directive.s` for an example.
# RUN: not llvm-mc ....... | FileCheck %s --check-prefix=ERR
normal assembly
.ifdef ERR
# ERR: {{.*}}.s:[[#@LINE+1]]:10: error: expected comma
error line
.endif
I think it is clearer to place working and non-working examples in one file.

Combine test cases into one file.

MaskRay added inline comments.Aug 27 2020, 4:09 PM

llvm/include/llvm/MC/MCFragment.h
74 ↗	(On Diff #286445)	`SubsectionNumber` looks too subtle. I'd hope we just remove the variable, and avoid the computation for Mach-O. Happy to hear what @jyknight will say, and whether this is a reasonable (imperfect) approach. Personally I think it is mostly good if `SubsectionNumber` is removed.

MaskRay edited reviewers, added: psmith; removed: peter.smith.Aug 27 2020, 4:09 PM

Harbormaster completed remote builds in B69849: Diff 288471.Aug 27 2020, 4:20 PM

nickdesaulniers added inline comments.Aug 27 2020, 8:56 PM

llvm/include/llvm/MC/MCFragment.h
73 ↗	(On Diff #288471)	Second line needs three slashes and punctuation.
llvm/lib/MC/MCExpr.cpp
560	Could this be a static function accepting `(const MCAssembler *Asm, const MCSymbol &SA, const MCSymbolRefExpr *&A, const MCSymbolRefExpr *&B)`, rather than a closure?
568	drop extra parens?
569–570	`return FinalizeFolding();`
595	Does the subexpression `(SA.getOffset() - SB.getOffset())` change throughout the loop? If not, consider initializing `Offset` to that value.
596–597	`return FinalizeFolding();`
602–603	This can be 3 lines rather than four by swapping the condition: if (... != ...) return Offset ... Probably can drop the extra parens around the `cast`, too.
613	Check spelling and punctuation here. `:set spell` in vim.

Address @nickdesaulniers' comments.

Herald added a subscriber: danielkiss.Aug 28 2020, 3:57 PM

Harbormaster completed remote builds in B69983: Diff 288724.Aug 28 2020, 4:30 PM

nickdesaulniers added inline comments.Aug 31 2020, 11:02 AM

llvm/include/llvm/MC/MCFragment.h
73 ↗	(On Diff #288471)	Punctuation. (Period at end of sentence in comment).
llvm/lib/MC/MCExpr.cpp
586	I agree; prefer:
if x: foo() else: bar()
to:
if !x: bar() else: foo()
`if` with a negated conditional is ok when there is only a then-clause. If there's an `else`-clause, then it's a code smell.
613	Punctuation (Period at end of sentence in comment).

Address more comments.

Harbormaster completed remote builds in B70166: Diff 289062.Aug 31 2020, 8:03 PM

Thanks for the update and sorry to take so long to comment. I can't see anything immediately wrong and I think limiting this to fragments in the same subsection makes sense.

llvm/lib/MC/MCExpr.cpp
606	It would be good to have a comment here as we have Offset and getOffset() meaning two different things. IIUC getOffset() is really getOffsetWithinFragment(). Perhaps use Displacement instead of Offset as the accumulating variable name. For example: // Try to find a constant displacement from FA to FB, add the displacement between the offset in FA of SA and the offset in FB of SB.

Address @psmith's comments.

Harbormaster completed remote builds in B70322: Diff 289320.Sep 1 2020, 5:44 PM

With this patch applied, I no longer observe the error from https://github.com/ClangBuiltLinux/linux/issues/742 for 32b ARM, though I can't do a full kernel build with Clang's IA yet due to missing support for the adrl pseudo instruction. (https://github.com/ClangBuiltLinux/linux/issues/430, https://bugs.llvm.org/show_bug.cgi?id=24350) in order to boot test. I did boot test x86 (32b and 64b) and arm64 Linux kernels with this change just fine (using Clang's integrated assembler).

It looks like there may have been unresolved comments from @MaskRay . I'm also curious whether @jyknight or @psmith had parting thoughts? (@jcai19 maybe wait a week for their feedback?)

This revision is now accepted and ready to land.Sep 2 2020, 12:21 PM

I don't have any objections to the approval. Thanks for updating the comment.

In D69411#2252965, @nickdesaulniers wrote:

It looks like there may have been unresolved comments from @MaskRay . I'm also curious whether @jyknight or @psmith had parting thoughts? (@jcai19 maybe wait a week for their feedback?)

@nick sounds good! Thanks for the verification.

In D69411#2253038, @psmith wrote:

I don't have any objections to the approval. Thanks for updating the comment.

Thanks @psmith.

nick removed a subscriber: nick.Sep 2 2020, 1:41 PM

In D69411#2252965, @nickdesaulniers wrote:

With this patch applied, I no longer observe the error from https://github.com/ClangBuiltLinux/linux/issues/742 for 32b ARM, though I can't do a full kernel build with Clang's IA yet due to missing support for the adrl pseudo instruction. (https://github.com/ClangBuiltLinux/linux/issues/430, https://bugs.llvm.org/show_bug.cgi?id=24350) in order to boot test. I did boot test x86 (32b and 64b) and arm64 Linux kernels with this change just fine (using Clang's integrated assembler).

It looks like there may have been unresolved comments from @MaskRay . I'm also curious whether @jyknight or @psmith had parting thoughts? (@jcai19 maybe wait a week for their feedback?)

I am waiting on @jyknight's opinion about the potentially subtle unsigned SubsectionNumber = 0; (at the very least, if we want to keep it, it should be reordered as my comment says, and don't repeat the name of the variable)

In D69411#2253193, @MaskRay wrote:

at the very least, if we want to keep it, it should be reordered as my comment says, and don't repeat the name of the variable

Hi @MaskRay, thanks for the comment. By reordering, do you mean moving the definition of SubsectionNumber right after LayoutOrder? I do not understand what difference that would make; would you care to explain a little bit more? Also, by not repeating the name of the variable, I assume you were referring to the comment right above its definition?

In D69411#2253462, @jcai19 wrote:

In D69411#2253193, @MaskRay wrote:

at the very least, if we want to keep it, it should be reordered as my comment says, and don't repeat the name of the variable

Hi @MaskRay, thanks for the comment. By reordering, do you mean moving the definition of SubsectionNumber right after LayoutOrder? I do not understand what difference that would make; would you care to explain a little bit more? Also, by not repeating the name of the variable, I assume you were referring to the comment right above its definition?

NVM. I realized moving it after LayoutOrder would not introduce extra padding.

Moved the definition of SubsectionNumber to avoid padding bytes. Also updated the comments.
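The padding concern can be illustrated with a generic layout example. The two structs below are hypothetical and do not reproduce the real MCFragment members; they only show how placing a 32-bit field next to another 32-bit field avoids the alignment padding that appears when it is interleaved with byte-sized fields. Exact sizes depend on the ABI, so the check is written as an inequality.

```cpp
#include <cstdint>

// Hypothetical field sets; these are not the real MCFragment members.
struct Interleaved {
  uint32_t LayoutOrder;
  uint8_t Kind;
  uint32_t SubsectionNumber; // Typically preceded by alignment padding.
  uint8_t HasInstructions;
};

struct GroupedBySize {
  uint32_t LayoutOrder;
  uint32_t SubsectionNumber; // Sits next to the other 32-bit field.
  uint8_t Kind;
  uint8_t HasInstructions;
};

// Grouping same-sized fields should never need more padding than
// interleaving them with smaller ones.
static_assert(sizeof(GroupedBySize) <= sizeof(Interleaved),
              "grouped layout should not be larger");
```

On common 64-bit ABIs the interleaved layout is expected to be larger, but the sketch deliberately avoids asserting specific byte counts.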

Harbormaster completed remote builds in B70485: Diff 289600.Sep 2 2020, 5:52 PM

Closed by commit rG415a4fbea7c1: [MC] Resolve the difference of symbols in consecutive MCDataFragements (authored by jcai19).Sep 9 2020, 12:39 PM

This revision was automatically updated to reflect the committed changes.

jcai19 added a commit: rG415a4fbea7c1: [MC] Resolve the difference of symbols in consecutive MCDataFragements.

In D69411#2253193, @MaskRay wrote:

In D69411#2252965, @nickdesaulniers wrote:

With this patch applied, I no longer observe the error from https://github.com/ClangBuiltLinux/linux/issues/742 for 32b ARM, though I can't do a full kernel build with Clang's IA yet due to missing support for the adrl pseudo instruction. (https://github.com/ClangBuiltLinux/linux/issues/430, https://bugs.llvm.org/show_bug.cgi?id=24350) in order to boot test. I did boot test x86 (32b and 64b) and arm64 Linux kernels with this change just fine (using Clang's integrated assembler).

It looks like there may have been unresolved comments from @MaskRay . I'm also curious whether @jyknight or @psmith had parting thoughts? (@jcai19 maybe wait a week for their feedback?)

I am waiting on @jyknight's opinion about the potentially subtle unsigned SubsectionNumber = 0; (at the very least, if we want to keep it, it should be reordered as my comment says, and don't repeat the name of the variable)

@jyknight What do you think of MCFragment::SubsectionNumber ?

MaskRay mentioned this in D153096: [MC] Fold A-B when A's fragment precedes B's fragment.Jun 15 2023, 8:50 PM

MaskRay mentioned this in rGfb294c0612a1: [MC] Fold A-B when A's fragment precedes B's fragment.Jun 22 2023, 12:24 PM

Revision Contents

Path	Size
llvm/include/llvm/MC/MCExpr.h	4 lines
llvm/lib/MC/MCExpr.cpp	112 lines
llvm/lib/MC/MCParser/AsmParser.cpp	16 lines
llvm/test/MC/ARM/directive_if_offset.s	12 lines
llvm/test/MC/ARM/directive_if_offset_error.s	15 lines
Diff 227553

llvm/include/llvm/MC/MCExpr.h

Show First 20 Lines • Show All 50 Lines • ▼ Show 20 Lines	bool evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm,
const SectionAddrMap *Addrs, bool InSet) const;		const SectionAddrMap *Addrs, bool InSet) const;

protected:		protected:
explicit MCExpr(ExprKind Kind, SMLoc Loc) : Kind(Kind), Loc(Loc) {}		explicit MCExpr(ExprKind Kind, SMLoc Loc) : Kind(Kind), Loc(Loc) {}

bool evaluateAsRelocatableImpl(MCValue &Res, const MCAssembler *Asm,		bool evaluateAsRelocatableImpl(MCValue &Res, const MCAssembler *Asm,
const MCAsmLayout *Layout,		const MCAsmLayout *Layout,
const MCFixup *Fixup,		const MCFixup *Fixup,
const SectionAddrMap *Addrs, bool InSet) const;		const SectionAddrMap *Addrs, bool InSet,
		bool IsCond) const;
		peter.smith (Done): There is a comment in evaluateAsAbsolute: `// Setting InSet causes us to absolutize differences across sections and that is what the MachO writer uses Addrs for.` I think it would be useful to have something similar for IsCond.

public:		public:
MCExpr(const MCExpr &) = delete;		MCExpr(const MCExpr &) = delete;
MCExpr &operator=(const MCExpr &) = delete;		MCExpr &operator=(const MCExpr &) = delete;

/// \name Accessors		/// \name Accessors
/// @{		/// @{

Show All 20 Lines	public:
/// evaluated.		/// evaluated.
/// \return - True on success.		/// \return - True on success.
bool evaluateAsAbsolute(int64_t &Res, const MCAsmLayout &Layout,		bool evaluateAsAbsolute(int64_t &Res, const MCAsmLayout &Layout,
const SectionAddrMap &Addrs) const;		const SectionAddrMap &Addrs) const;
bool evaluateAsAbsolute(int64_t &Res) const;		bool evaluateAsAbsolute(int64_t &Res) const;
bool evaluateAsAbsolute(int64_t &Res, const MCAssembler &Asm) const;		bool evaluateAsAbsolute(int64_t &Res, const MCAssembler &Asm) const;
bool evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm) const;		bool evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm) const;
bool evaluateAsAbsolute(int64_t &Res, const MCAsmLayout &Layout) const;		bool evaluateAsAbsolute(int64_t &Res, const MCAsmLayout &Layout) const;
		bool evaluateIfCondAsAbsolute(int64_t &Res, const MCAssembler *Asm) const;

bool evaluateKnownAbsolute(int64_t &Res, const MCAsmLayout &Layout) const;		bool evaluateKnownAbsolute(int64_t &Res, const MCAsmLayout &Layout) const;

/// Try to evaluate the expression to a relocatable value, i.e. an		/// Try to evaluate the expression to a relocatable value, i.e. an
/// expression of the fixed form (a - b + constant).		/// expression of the fixed form (a - b + constant).
///		///
/// \param Res - The relocatable value, if evaluation succeeds.		/// \param Res - The relocatable value, if evaluation succeeds.
/// \param Layout - The assembler layout object to use for evaluating values.		/// \param Layout - The assembler layout object to use for evaluating values.

llvm/lib/MC/MCExpr.cpp

bool MCExpr::evaluateAsAbsolute(int64_t &Res, const MCAssembler &Asm) const {		bool MCExpr::evaluateAsAbsolute(int64_t &Res, const MCAssembler &Asm) const {
return evaluateAsAbsolute(Res, &Asm, nullptr, nullptr, false);		return evaluateAsAbsolute(Res, &Asm, nullptr, nullptr, false);
}		}

bool MCExpr::evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm) const {		bool MCExpr::evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm) const {
return evaluateAsAbsolute(Res, Asm, nullptr, nullptr, false);		return evaluateAsAbsolute(Res, Asm, nullptr, nullptr, false);
}		}

		bool MCExpr::evaluateIfCondAsAbsolute(int64_t &Res,
		const MCAssembler *Asm) const {
		MCValue Value;
		nickdesaulniers (Done): You can move this declaration down closer to where it is used. No need to construct if the below conditional returns early.

		// Fast path constants.
		if (const MCConstantExpr *CE = dyn_cast<MCConstantExpr>(this)) {
		Res = CE->getValue();
		return true;
		}

		bool IsRelocatable = evaluateAsRelocatableImpl(Value, Asm, nullptr, nullptr,
		nullptr, false, true);
		peter.smith (Done): When there is more than one boolean parameter it can get difficult to track which is which. Can you annotate the call sites that pass literal values with a comment? I've left a comment on the ones that I've noticed. /* InSet */ true, /* IsCond */ false.

		// Record the current value.
		Res = Value.getConstant();

		return IsRelocatable && Value.isAbsolute();
		}

bool MCExpr::evaluateKnownAbsolute(int64_t &Res,		bool MCExpr::evaluateKnownAbsolute(int64_t &Res,
const MCAsmLayout &Layout) const {		const MCAsmLayout &Layout) const {
return evaluateAsAbsolute(Res, &Layout.getAssembler(), &Layout, nullptr,		return evaluateAsAbsolute(Res, &Layout.getAssembler(), &Layout, nullptr,
true);		true);
}		}

bool MCExpr::evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm,		bool MCExpr::evaluateAsAbsolute(int64_t &Res, const MCAssembler *Asm,
const MCAsmLayout *Layout,		const MCAsmLayout *Layout,
const SectionAddrMap *Addrs, bool InSet) const {		const SectionAddrMap *Addrs, bool InSet) const {
MCValue Value;		MCValue Value;

// Fast path constants.		// Fast path constants.
if (const MCConstantExpr *CE = dyn_cast<MCConstantExpr>(this)) {		if (const MCConstantExpr *CE = dyn_cast<MCConstantExpr>(this)) {
Res = CE->getValue();		Res = CE->getValue();
return true;		return true;
}		}

bool IsRelocatable =		bool IsRelocatable = evaluateAsRelocatableImpl(Value, Asm, Layout, nullptr,
evaluateAsRelocatableImpl(Value, Asm, Layout, nullptr, Addrs, InSet);		Addrs, InSet, false);
		peter.smith (Done): /* IsCond */ false.

// Record the current value.		// Record the current value.
Res = Value.getConstant();		Res = Value.getConstant();

return IsRelocatable && Value.isAbsolute();		return IsRelocatable && Value.isAbsolute();
}		}

/// Helper method for \see EvaluateSymbolAdd().		/// Helper method for \see EvaluateSymbolAdd().
static void AttemptToFoldSymbolOffsetDifference(		static void AttemptToFoldSymbolOffsetDifference(
const MCAssembler *Asm, const MCAsmLayout *Layout,		const MCAssembler *Asm, const MCAsmLayout *Layout,
const SectionAddrMap *Addrs, bool InSet, const MCSymbolRefExpr *&A,		const SectionAddrMap *Addrs, bool InSet, const MCSymbolRefExpr *&A,
const MCSymbolRefExpr *&B, int64_t &Addend) {		const MCSymbolRefExpr *&B, int64_t &Addend, bool IsCond) {
if (!A || !B)		if (!A || !B)
return;		return;

const MCSymbol &SA = A->getSymbol();		const MCSymbol &SA = A->getSymbol();
const MCSymbol &SB = B->getSymbol();		const MCSymbol &SB = B->getSymbol();

if (SA.isUndefined() || SB.isUndefined())		if (SA.isUndefined() || SB.isUndefined())
		nickdesaulniers (Done): Move these closer to their use below. It is nice to have them all declared together, but it would be nicer to bail as early as possible without doing more work than necessary.
return;		return;

if (!Asm->getWriter().isSymbolRefDifferenceFullyResolved(*Asm, A, B, InSet))		if (!Asm->getWriter().isSymbolRefDifferenceFullyResolved(*Asm, A, B, InSet))
return;		return;

if (SA.getFragment() == SB.getFragment() && !SA.isVariable() &&		auto FinalizeFolding = [&]() {
!SA.isUnset() && !SB.isVariable() && !SB.isUnset()) {
Addend += (SA.getOffset() - SB.getOffset());

// Pointers to Thumb symbols need to have their low-bit set to allow		// Pointers to Thumb symbols need to have their low-bit set to allow
// for interworking.		// for interworking.
if (Asm->isThumbFunc(&SA))		if (Asm->isThumbFunc(&SA))
Addend |= 1;		Addend |= 1;

// If symbol is labeled as micromips, we set low-bit to ensure		// If symbol is labeled as micromips, we set low-bit to ensure
// correct offset in .gcc_except_table		// correct offset in .gcc_except_table
if (Asm->getBackend().isMicroMips(&SA))		if (Asm->getBackend().isMicroMips(&SA))
Addend |= 1;		Addend |= 1;

// Clear the symbol expr pointers to indicate we have folded these		// Clear the symbol expr pointers to indicate we have folded these
// operands.		// operands.
A = B = nullptr;		A = B = nullptr;
		};
		nickdesaulniers (Not Done): Could this be a static function accepting `(const MCAssembler *Asm, const MCSymbol &SA, const MCSymbolRefExpr *&A, const MCSymbolRefExpr *&B)`, rather than a closure?

		const MCFragment *FragA = SA.getFragment();
		const MCFragment *FragB = SB.getFragment();
		// If both symbols are in the same fragment, return the difference of their
		// offsets
		if (FragA == FragB && !SA.isVariable() && !SA.isUnset() && !SB.isVariable() &&
		!SB.isUnset()) {

		nickdesaulniers [Done]: Drop extra parens?
		Addend += (SA.getOffset() - SB.getOffset());

		nickdesaulniers [Not Done]: `return FinalizeFolding();`
		FinalizeFolding();
return;		return;
}		}

if (!Layout)		const MCSection &SecA = *FragA->getParent();
		const MCSection &SecB = *FragB->getParent();
		nickdesaulniers [Done]: What's going on with this comment? Looks like it was copied, not moved to L566?

		if ((&SecA != &SecB) && !Addrs)
		peter.smith [Not Done]: I think it would be useful to add a comment explaining this line and what kind of expressions it is expected to support. It isn't intuitive given how AttemptToFoldSymbolOffsetDifference is called.
		jcai19 (Author) [Done]: This is inherited from the original code, except that it originally came before the check for Layout. I am not exactly sure why this check is needed in the first place. In my opinion it is redundant with the check for Layout: if the layout is provided, we can always calculate the difference of two symbols from the order of the sections they are in (MCAsmLayout::getSectionOrder()) and each symbol's offset within its own section. I do need this check anyway for our case, though. Any thoughts on why this check is necessary?
return;		return;
		nickdesaulniers [Done]: If you're throwing away the result of a `dyn_cast`, prefer `isa`. See http://llvm.org/docs/ProgrammersManual.html#the-isa-cast-and-dyn-cast-templates. Or, if we're checking `dyn_cast` and reusing via `cast`, consider just saving the result of the `dyn_cast` to a local variable and discarding `cast`.
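The `isa` versus `dyn_cast` advice above can be illustrated with a small self-contained sketch. It does not use LLVM's `llvm/Support/Casting.h`; `ToyFragment`, `ToyDataFragment`, `toyIsa`, and `toyDynCast` are hypothetical stand-ins that only mimic the `classof`-based pattern, so treat it as an approximation of the idiom rather than the code under review.

```cpp
#include <cassert>

// Toy hierarchy with an LLVM-style classof(); not the real MCFragment classes.
struct ToyFragment {
  enum Kind { FK_Data, FK_Align };
  explicit ToyFragment(Kind K) : TheKind(K) {}
  Kind getKind() const { return TheKind; }

private:
  Kind TheKind;
};

struct ToyDataFragment : ToyFragment {
  explicit ToyDataFragment(unsigned Size) : ToyFragment(FK_Data), Size(Size) {}
  static bool classof(const ToyFragment *F) { return F->getKind() == FK_Data; }
  unsigned Size;
};

// Minimal stand-ins for llvm::isa and llvm::dyn_cast (assumed shapes).
template <typename To, typename From> bool toyIsa(const From *V) {
  return To::classof(V);
}
template <typename To, typename From> const To *toyDynCast(const From *V) {
  return toyIsa<To>(V) ? static_cast<const To *>(V) : nullptr;
}

// Only the dynamic type matters here, so ask with isa; no pointer is
// produced and then thrown away.
bool isData(const ToyFragment *F) { return toyIsa<ToyDataFragment>(F); }

// The derived object is needed here, so keep the dyn_cast result and reuse
// it instead of testing with dyn_cast and casting a second time.
unsigned dataSizeOrZero(const ToyFragment *F) {
  if (const auto *DF = toyDynCast<ToyDataFragment>(F))
    return DF->Size;
  return 0;
}
```

With a `ToyDataFragment D(8)`, `isData(&D)` is true and `dataSizeOrZero(&D)` yields 8; for a `ToyFragment` of kind `FK_Align` the two calls yield false and 0.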

const MCSection &SecA = *SA.getFragment()->getParent();		if (!Layout) {
const MCSection &SecB = *SB.getFragment()->getParent();		// Try to handle cases like "foo:instr; .if . - foo == 0;instr; .endif" when
		peter.smith [Done]: If I've understood the code correctly I think we could be a bit more specific here:
		// When there is no layout our ability to resolve differences between symbols is
		// limited. In specific cases where the symbols are both defined in consecutive
		// MCDataFragments the difference can be calculated. This is important for an
		// idiom like foo:instr; .if . - foo; instr; .endif
		// We cannot handle any kind of case where the difference may change due to
		// layout.
		// the two symbols belong to two fragments in the same section.
		// FIXME: can we resolve .if conditions while finalizing layout?
		peter.smith [Not Done]: It is difficult to see how it would be possible to resolve .if conditions at layout time in a single pass assembler. In theory the assembler could evaluate all conditional blocks and select between them at layout time, if such a layout could be converged on.
		jcai19 (Author) [Done]: Thanks for the clarification, although I am not sure I follow. The code looks iterative to me: https://llvm.org/doxygen/MCAssembler_8cpp_source.html#l00785. I was thinking that as the loop iterates and calls layoutOnce, we can relax (if needed) and calculate the sizes of the fragments before a .if statement and resolve the condition there. But I am not completely convinced myself that it is doable. Also, cases like the one below create more complexity:
		foo: jump to bar
		...
		.if . - foo = ${constant integer}
		instr1
		.else
		instr2
		.endif
		...
		bar:
		This creates a circular dependency: depending on the instruction selected in the if-else block, the size of the jump instruction may change due to the number of bits it needs to specify the offset, which in turn affects which instruction should be chosen.
		nickdesaulniers [Not Done]: See also b/132538429: "In order to evaluate the if statement truthfully, it will have to be done after (well, during, since the result could change the relaxation) relaxation, and LLVM MC is just not set up for this currently. Even then, you could probably construct an if statement that could create a paradox, and never converge on a valid relaxation."
		jcai19 (Author) [Done]: Yeah, thanks for the explanation. I wonder if GAS would be able to support all these paradoxes.
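The circular dependency described in this thread can be made concrete with a toy model. This is only a sketch under invented assumptions: the encoding sizes, the `ShortRange` threshold, and the block sizes are hypothetical, and the loop is not how LLVM MC performs relaxation. It merely shows how a `.if` whose condition depends on a jump's size, and whose body changes the jump's distance, may never settle.

```cpp
#include <cassert>
#include <cstdint>

// Toy model, not LLVM's relaxation: the jump at `foo` takes 2 bytes when its
// target is at most ShortRange bytes away and 4 bytes otherwise.
constexpr uint64_t ShortRange = 16;

uint64_t jumpSize(uint64_t Distance) { return Distance <= ShortRange ? 2 : 4; }

// Models `.if . - foo == 2`: the true branch emits a 32-byte block, the
// `.else` branch a 4-byte block (both sizes are made up for illustration).
uint64_t blockSize(uint64_t JumpSize) { return JumpSize == 2 ? 32 : 4; }

// Re-evaluates the conditional block and the jump size in turn. Returns true
// if the jump size stops changing within MaxIters rounds.
bool converges(unsigned MaxIters) {
  uint64_t Jump = 2; // start optimistically with the short encoding
  for (unsigned I = 0; I != MaxIters; ++I) {
    uint64_t Distance = Jump + blockSize(Jump); // bytes from foo to bar
    uint64_t Next = jumpSize(Distance);
    if (Next == Jump)
      return true;
    Jump = Next;
  }
  return false;
}
```

In this model the short jump selects the large block, which forces the long jump, which selects the small block, which allows the short jump again, so `converges` returns false for any iteration bound.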
		if (IsCond && SecB.getFragmentList().getNextNode(*FragB) == FragA &&
		peter.smith [Not Done]: With the new information coming from D70062 I think we need to make this comment more specific. For example: When the Subtarget is changed a new MCDataFragment is created. This handles the case of foo: instr; .arch_extension ext; instr .if . - foo
		isa<MCDataFragment>(FragA) && isa<MCDataFragment>(FragB)) {
		nickdesaulniers [Not Done]: This patch adds checks for thumb, but the test case doesn't use a thumb target triple. Consider adding tests that exercise the newly added code.
		jcai19 (Author) [Done]: It's actually checking if SA is flagged with .thumb_func, although adding .thumb_func right before label 9997 does not make the check become true?
		MaskRay [Not Done]: Consider swapping then and else branches to avoid `!`
		nickdesaulniers [Not Done]: I agree; prefer:
		  if x: foo()
		  else: bar()
		to:
		  if !x: bar()
		  else: foo()
		`if` with a negated conditional is ok when there is only a then-clause. If there's an `else`-clause, then it's a code smell.
		Addend += (SA.getOffset() +
		(cast<MCDataFragment>(FragB))->getContents().size() -
		SB.getOffset());

if ((&SecA != &SecB) && !Addrs)		FinalizeFolding();
		nickdesaulniers [Not Done]: MicroMips?
		jcai19 (Author) [Done]: It seems micromips has been used throughout the file, will keep it for consistency.
		peter.smith [Done]: This might not be needed with D70062 as the !SA.isUnset() and !SB.isUnset() on line 568 will both evaluate true after it.
		jcai19 (Author) [Done]: Thanks for the clarification. I have adjusted my code accordingly.
		}
return;		return;
		}
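As a worked illustration of the no-layout case above, the following sketch models the two foldable situations: both symbols in the same fragment, and symbol A defined in the data fragment that immediately follows B's. `ToyFragment`, `ToySymbol`, and `foldDifference` are hypothetical names, not LLVM classes; the arithmetic mirrors `SA.getOffset() + FragB contents size - SB.getOffset()` from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// Toy stand-ins for MCDataFragment and MCSymbol; not the LLVM classes.
struct ToyFragment {
  uint64_t ContentsSize; // bytes already emitted into this fragment
};

struct ToySymbol {
  std::size_t FragIndex; // index of the fragment that defines the symbol
  uint64_t Offset;       // offset of the symbol within that fragment
};

// Returns SA - SB when it is known without layout, otherwise std::nullopt.
std::optional<int64_t> foldDifference(const std::vector<ToyFragment> &Frags,
                                      const ToySymbol &SA,
                                      const ToySymbol &SB) {
  // Same fragment: plain difference of the in-fragment offsets.
  if (SA.FragIndex == SB.FragIndex)
    return static_cast<int64_t>(SA.Offset) - static_cast<int64_t>(SB.Offset);

  // SA's fragment immediately follows SB's: add the bytes that remain in
  // SB's fragment after SB to SA's offset in the next fragment.
  if (SA.FragIndex == SB.FragIndex + 1)
    return static_cast<int64_t>(SA.Offset + Frags[SB.FragIndex].ContentsSize -
                                SB.Offset);

  // Anything else may depend on layout, so give up.
  return std::nullopt;
}
```

For two 4-byte fragments, with B at offset 0 of the first and A at offset 4 of the second, the sketch yields 4 + 4 - 0 = 8; if A's fragment does not directly follow B's, it returns no value, which corresponds to the "expected absolute expression" error in the tests.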

		nickdesaulniers [Done]: Does the subexpression `(SA.getOffset() - SB.getOffset())` change throughout the loop? If not, consider initializing `Offset` to that value.
// Eagerly evaluate.		// Eagerly evaluate.
Addend += Layout->getSymbolOffset(A->getSymbol()) -		Addend += Layout->getSymbolOffset(A->getSymbol()) -
		nickdesaulniers [Done]: `return FinalizeFolding();`
Layout->getSymbolOffset(B->getSymbol());		Layout->getSymbolOffset(B->getSymbol());
		nickdesaulniers [Done]: It looks like this block was copied from L529-L548. Is it possible to adjust the condition on L529 and addend calculation on L532 to support this case, rather than duplicating so much code? Looks like this repeats again below. Is it possible to refactor some of this, maybe into its own method, for better code reuse?
if (Addrs && (&SecA != &SecB))		if (Addrs && (&SecA != &SecB))
Addend += (Addrs->lookup(&SecA) - Addrs->lookup(&SecB));		Addend += (Addrs->lookup(&SecA) - Addrs->lookup(&SecB));

// Pointers to Thumb symbols need to have their low-bit set to allow		FinalizeFolding();
// for interworking.
if (Asm->isThumbFunc(&SA))
Addend |= 1;

// If symbol is labeled as micromips, we set low-bit to ensure
// correct offset in .gcc_except_table
if (Asm->getBackend().isMicroMips(&SA))
Addend |= 1;

// Clear the symbol expr pointers to indicate we have folded these
// operands.
A = B = nullptr;
}		}
		nickdesaulniers [Done]: This can be 3 lines rather than four by swapping the condition: `if (... != ...) return Offset ...` Probably can drop the extra parens around the `cast`, too.

static bool canFold(const MCAssembler *Asm, const MCSymbolRefExpr *A,		static bool canFold(const MCAssembler *Asm, const MCSymbolRefExpr *A,
const MCSymbolRefExpr *B, bool InSet) {		const MCSymbolRefExpr *B, bool InSet) {
		psmith [Not Done]: It would be good to have a comment here as we have Offset and getOffset() meaning two different things. IIUC getOffset() is really getOffsetWithinFragment(). Perhaps use Displacement instead of Offset as the accumulating variable name. For example:
		// Try to find a constant displacement from FA to FB, add the displacement between the offset in FA of SA and the offset in FB of SB.
if (InSet)		if (InSet)
return true;		return true;

if (!Asm->getBackend().requiresDiffExpressionRelocations())		if (!Asm->getBackend().requiresDiffExpressionRelocations())
return true;		return true;

const MCSymbol &CheckSym = A ? A->getSymbol() : B->getSymbol();		const MCSymbol &CheckSym = A ? A->getSymbol() : B->getSymbol();
		nickdesaulniers [Done]: Check spelling and punctuation here. `:set spell` in vim.
		nickdesaulniers [Done]: Punctuation (period at end of sentence in comment).
if (!CheckSym.isInSection())		if (!CheckSym.isInSection())
return true;		return true;

if (!CheckSym.getSection().hasInstructions())		if (!CheckSym.getSection().hasInstructions())
return true;		return true;

return false;		return false;
}		}
[16 lines hidden]
/// NOTE: It is really important to have both the Asm and Layout arguments.		/// NOTE: It is really important to have both the Asm and Layout arguments.
/// They might look redundant, but this function can be used before layout		/// They might look redundant, but this function can be used before layout
/// is done (see the object streamer for example) and having the Asm argument		/// is done (see the object streamer for example) and having the Asm argument
/// lets us avoid relaxations early.		/// lets us avoid relaxations early.
static bool		static bool
EvaluateSymbolicAdd(const MCAssembler *Asm, const MCAsmLayout *Layout,		EvaluateSymbolicAdd(const MCAssembler *Asm, const MCAsmLayout *Layout,
const SectionAddrMap *Addrs, bool InSet, const MCValue &LHS,		const SectionAddrMap *Addrs, bool InSet, const MCValue &LHS,
const MCSymbolRefExpr *RHS_A, const MCSymbolRefExpr *RHS_B,		const MCSymbolRefExpr *RHS_A, const MCSymbolRefExpr *RHS_B,
int64_t RHS_Cst, MCValue &Res) {		int64_t RHS_Cst, MCValue &Res, bool IsCond) {
// FIXME: This routine (and other evaluation parts) are incredibly sloppy		// FIXME: This routine (and other evaluation parts) are incredibly sloppy
// about dealing with modifiers. This will ultimately bite us, one day.		// about dealing with modifiers. This will ultimately bite us, one day.
const MCSymbolRefExpr *LHS_A = LHS.getSymA();		const MCSymbolRefExpr *LHS_A = LHS.getSymA();
const MCSymbolRefExpr *LHS_B = LHS.getSymB();		const MCSymbolRefExpr *LHS_B = LHS.getSymB();
int64_t LHS_Cst = LHS.getConstant();		int64_t LHS_Cst = LHS.getConstant();

// Fold the result constant immediately.		// Fold the result constant immediately.
int64_t Result_Cst = LHS_Cst + RHS_Cst;		int64_t Result_Cst = LHS_Cst + RHS_Cst;
[12 lines hidden; context: if (Asm && canFold(Asm, LHS_A, LHS_B, InSet)) {]
// we have the four possible differences:		// we have the four possible differences:
// (LHS_A - LHS_B),		// (LHS_A - LHS_B),
// (LHS_A - RHS_B),		// (LHS_A - RHS_B),
// (RHS_A - LHS_B),		// (RHS_A - LHS_B),
// (RHS_A - RHS_B).		// (RHS_A - RHS_B).
// Since we are attempting to be as aggressive as possible about folding, we		// Since we are attempting to be as aggressive as possible about folding, we
// attempt to evaluate each possible alternative.		// attempt to evaluate each possible alternative.
AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, LHS_A, LHS_B,		AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, LHS_A, LHS_B,
Result_Cst);		Result_Cst, IsCond);
AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, LHS_A, RHS_B,		AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, LHS_A, RHS_B,
Result_Cst);		Result_Cst, IsCond);
AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, RHS_A, LHS_B,		AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, RHS_A, LHS_B,
Result_Cst);		Result_Cst, IsCond);
AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, RHS_A, RHS_B,		AttemptToFoldSymbolOffsetDifference(Asm, Layout, Addrs, InSet, RHS_A, RHS_B,
Result_Cst);		Result_Cst, IsCond);
}		}

// We can't represent the addition or subtraction of two symbols.		// We can't represent the addition or subtraction of two symbols.
if ((LHS_A && RHS_A) || (LHS_B && RHS_B))		if ((LHS_A && RHS_A) || (LHS_B && RHS_B))
return false;		return false;

// At this point, we have at most one additive symbol and one subtractive		// At this point, we have at most one additive symbol and one subtractive
// symbol -- find them.		// symbol -- find them.
const MCSymbolRefExpr *A = LHS_A ? LHS_A : RHS_A;		const MCSymbolRefExpr *A = LHS_A ? LHS_A : RHS_A;
const MCSymbolRefExpr *B = LHS_B ? LHS_B : RHS_B;		const MCSymbolRefExpr *B = LHS_B ? LHS_B : RHS_B;

Res = MCValue::get(A, B, Result_Cst);		Res = MCValue::get(A, B, Result_Cst);
return true;		return true;
}		}

bool MCExpr::evaluateAsRelocatable(MCValue &Res,		bool MCExpr::evaluateAsRelocatable(MCValue &Res,
const MCAsmLayout *Layout,		const MCAsmLayout *Layout,
const MCFixup *Fixup) const {		const MCFixup *Fixup) const {
MCAssembler *Assembler = Layout ? &Layout->getAssembler() : nullptr;		MCAssembler *Assembler = Layout ? &Layout->getAssembler() : nullptr;
return evaluateAsRelocatableImpl(Res, Assembler, Layout, Fixup, nullptr,		return evaluateAsRelocatableImpl(Res, Assembler, Layout, Fixup, nullptr,
false);		false, false);
		peter.smith [Done]: /* InSet */ false, /* IsCond */ false.
}		}

bool MCExpr::evaluateAsValue(MCValue &Res, const MCAsmLayout &Layout) const {		bool MCExpr::evaluateAsValue(MCValue &Res, const MCAsmLayout &Layout) const {
MCAssembler *Assembler = &Layout.getAssembler();		MCAssembler *Assembler = &Layout.getAssembler();
return evaluateAsRelocatableImpl(Res, Assembler, &Layout, nullptr, nullptr,		return evaluateAsRelocatableImpl(Res, Assembler, &Layout, nullptr, nullptr,
true);		true, false);
		peter.smith [Done]: /* InSet */ true, /* IsCond */ false.
}		}

static bool canExpand(const MCSymbol &Sym, bool InSet) {		static bool canExpand(const MCSymbol &Sym, bool InSet) {
const MCExpr *Expr = Sym.getVariableValue();		const MCExpr *Expr = Sym.getVariableValue();
const auto *Inner = dyn_cast<MCSymbolRefExpr>(Expr);		const auto *Inner = dyn_cast<MCSymbolRefExpr>(Expr);
if (Inner) {		if (Inner) {
if (Inner->getKind() == MCSymbolRefExpr::VK_WEAKREF)		if (Inner->getKind() == MCSymbolRefExpr::VK_WEAKREF)
return false;		return false;
}		}

if (InSet)		if (InSet)
return true;		return true;
return !Sym.isInSection();		return !Sym.isInSection();
}		}

bool MCExpr::evaluateAsRelocatableImpl(MCValue &Res, const MCAssembler *Asm,		bool MCExpr::evaluateAsRelocatableImpl(MCValue &Res, const MCAssembler *Asm,
const MCAsmLayout *Layout,		const MCAsmLayout *Layout,
const MCFixup *Fixup,		const MCFixup *Fixup,
const SectionAddrMap *Addrs,		const SectionAddrMap *Addrs, bool InSet,
bool InSet) const {		bool IsCond) const {
++stats::MCExprEvaluate;		++stats::MCExprEvaluate;

switch (getKind()) {		switch (getKind()) {
case Target:		case Target:
return cast<MCTargetExpr>(this)->evaluateAsRelocatableImpl(Res, Layout,		return cast<MCTargetExpr>(this)->evaluateAsRelocatableImpl(Res, Layout,
Fixup);		Fixup);

case Constant:		case Constant:
Res = MCValue::get(cast<MCConstantExpr>(this)->getValue());		Res = MCValue::get(cast<MCConstantExpr>(this)->getValue());
return true;		return true;

case SymbolRef: {		case SymbolRef: {
const MCSymbolRefExpr *SRE = cast<MCSymbolRefExpr>(this);		const MCSymbolRefExpr *SRE = cast<MCSymbolRefExpr>(this);
const MCSymbol &Sym = SRE->getSymbol();		const MCSymbol &Sym = SRE->getSymbol();

// Evaluate recursively if this is a variable.		// Evaluate recursively if this is a variable.
if (Sym.isVariable() && SRE->getKind() == MCSymbolRefExpr::VK_None &&		if (Sym.isVariable() && SRE->getKind() == MCSymbolRefExpr::VK_None &&
canExpand(Sym, InSet)) {		canExpand(Sym, InSet)) {
bool IsMachO = SRE->hasSubsectionsViaSymbols();		bool IsMachO = SRE->hasSubsectionsViaSymbols();
if (Sym.getVariableValue()->evaluateAsRelocatableImpl(		if (Sym.getVariableValue()->evaluateAsRelocatableImpl(
Res, Asm, Layout, Fixup, Addrs, InSet || IsMachO)) {		Res, Asm, Layout, Fixup, Addrs, InSet || IsMachO, false)) {
		peter.smith [Done]: /* IsCond */ false.
		peter.smith [Not Done]: We have IsCond passed in as a parameter to evaluateAsRelocatableImpl, so even if it is passed in true, we set it to false here? Is it important for the value to be false here? If so then it implies that IsCond might not be specific enough a name. If it just doesn't matter then can we pass in IsCond here?
		jcai19 (Author) [Done]: Yes, thanks for catching this. IsCond is never used here so I simply replaced it with false. But passing in IsCond is a better choice.
if (!IsMachO)		if (!IsMachO)
return true;		return true;

const MCSymbolRefExpr *A = Res.getSymA();		const MCSymbolRefExpr *A = Res.getSymA();
const MCSymbolRefExpr *B = Res.getSymB();		const MCSymbolRefExpr *B = Res.getSymB();
// FIXME: This is small hack. Given		// FIXME: This is small hack. Given
// a = b + 4		// a = b + 4
// .long a		// .long a
[13 lines hidden; context: case SymbolRef: {]
return true;		return true;
}		}

case Unary: {		case Unary: {
const MCUnaryExpr *AUE = cast<MCUnaryExpr>(this);		const MCUnaryExpr *AUE = cast<MCUnaryExpr>(this);
MCValue Value;		MCValue Value;

if (!AUE->getSubExpr()->evaluateAsRelocatableImpl(Value, Asm, Layout, Fixup,		if (!AUE->getSubExpr()->evaluateAsRelocatableImpl(Value, Asm, Layout, Fixup,
Addrs, InSet))		Addrs, InSet, false))
		peter.smith [Done]: /* IsCond */ false.
return false;		return false;

switch (AUE->getOpcode()) {		switch (AUE->getOpcode()) {
case MCUnaryExpr::LNot:		case MCUnaryExpr::LNot:
if (!Value.isAbsolute())		if (!Value.isAbsolute())
return false;		return false;
Res = MCValue::get(!Value.getConstant());		Res = MCValue::get(!Value.getConstant());
break;		break;
[19 lines hidden; context: case Unary: {]
return true;		return true;
}		}

case Binary: {		case Binary: {
const MCBinaryExpr *ABE = cast<MCBinaryExpr>(this);		const MCBinaryExpr *ABE = cast<MCBinaryExpr>(this);
MCValue LHSValue, RHSValue;		MCValue LHSValue, RHSValue;

if (!ABE->getLHS()->evaluateAsRelocatableImpl(LHSValue, Asm, Layout, Fixup,		if (!ABE->getLHS()->evaluateAsRelocatableImpl(LHSValue, Asm, Layout, Fixup,
Addrs, InSet) ||		Addrs, InSet, IsCond) ||
!ABE->getRHS()->evaluateAsRelocatableImpl(RHSValue, Asm, Layout, Fixup,		!ABE->getRHS()->evaluateAsRelocatableImpl(RHSValue, Asm, Layout, Fixup,
Addrs, InSet)) {		Addrs, InSet, IsCond)) {
// Check if both are Target Expressions, see if we can compare them.		// Check if both are Target Expressions, see if we can compare them.
if (const MCTargetExpr *L = dyn_cast<MCTargetExpr>(ABE->getLHS()))		if (const MCTargetExpr *L = dyn_cast<MCTargetExpr>(ABE->getLHS()))
if (const MCTargetExpr *R = cast<MCTargetExpr>(ABE->getRHS())) {		if (const MCTargetExpr *R = cast<MCTargetExpr>(ABE->getRHS())) {
switch (ABE->getOpcode()) {		switch (ABE->getOpcode()) {
case MCBinaryExpr::EQ:		case MCBinaryExpr::EQ:
Res = MCValue::get((L->isEqualTo(R)) ? -1 : 0);		Res = MCValue::get((L->isEqualTo(R)) ? -1 : 0);
return true;		return true;
case MCBinaryExpr::NE:		case MCBinaryExpr::NE:
[9 lines hidden; context: case Binary: {]
// those first.		// those first.
if (!LHSValue.isAbsolute() || !RHSValue.isAbsolute()) {		if (!LHSValue.isAbsolute() || !RHSValue.isAbsolute()) {
switch (ABE->getOpcode()) {		switch (ABE->getOpcode()) {
default:		default:
return false;		return false;
case MCBinaryExpr::Sub:		case MCBinaryExpr::Sub:
// Negate RHS and add.		// Negate RHS and add.
// The cast avoids undefined behavior if the constant is INT64_MIN.		// The cast avoids undefined behavior if the constant is INT64_MIN.
return EvaluateSymbolicAdd(Asm, Layout, Addrs, InSet, LHSValue,		return EvaluateSymbolicAdd(
RHSValue.getSymB(), RHSValue.getSymA(),		Asm, Layout, Addrs, InSet, LHSValue, RHSValue.getSymB(),
-(uint64_t)RHSValue.getConstant(), Res);		RHSValue.getSymA(), -(uint64_t)RHSValue.getConstant(), Res, IsCond);

case MCBinaryExpr::Add:		case MCBinaryExpr::Add:
return EvaluateSymbolicAdd(Asm, Layout, Addrs, InSet, LHSValue,		return EvaluateSymbolicAdd(Asm, Layout, Addrs, InSet, LHSValue,
RHSValue.getSymA(), RHSValue.getSymB(),		RHSValue.getSymA(), RHSValue.getSymB(),
RHSValue.getConstant(), Res);		RHSValue.getConstant(), Res, IsCond);
}		}
}		}

// FIXME: We need target hooks for the evaluation. It may be limited in		// FIXME: We need target hooks for the evaluation. It may be limited in
// width, and gas defines the result of comparisons differently from		// width, and gas defines the result of comparisons differently from
// Apple as.		// Apple as.
int64_t LHS = LHSValue.getConstant(), RHS = RHSValue.getConstant();		int64_t LHS = LHSValue.getConstant(), RHS = RHSValue.getConstant();
int64_t Result = 0;		int64_t Result = 0;
[98 lines hidden]

llvm/lib/MC/MCParser/AsmParser.cpp

[248 lines hidden; context: public:]

bool parseExpression(const MCExpr *&Res);		bool parseExpression(const MCExpr *&Res);
bool parseExpression(const MCExpr *&Res, SMLoc &EndLoc) override;		bool parseExpression(const MCExpr *&Res, SMLoc &EndLoc) override;
bool parsePrimaryExpr(const MCExpr *&Res, SMLoc &EndLoc) override;		bool parsePrimaryExpr(const MCExpr *&Res, SMLoc &EndLoc) override;
bool parseParenExpression(const MCExpr *&Res, SMLoc &EndLoc) override;		bool parseParenExpression(const MCExpr *&Res, SMLoc &EndLoc) override;
bool parseParenExprOfDepth(unsigned ParenDepth, const MCExpr *&Res,		bool parseParenExprOfDepth(unsigned ParenDepth, const MCExpr *&Res,
SMLoc &EndLoc) override;		SMLoc &EndLoc) override;
bool parseAbsoluteExpression(int64_t &Res) override;		bool parseAbsoluteExpression(int64_t &Res) override;
		bool parseAbsoluteIfCond(int64_t &Res);

/// Parse a floating point expression using the float \p Semantics		/// Parse a floating point expression using the float \p Semantics
/// and set \p Res to the value.		/// and set \p Res to the value.
bool parseRealValue(const fltSemantics &Semantics, APInt &Res);		bool parseRealValue(const fltSemantics &Semantics, APInt &Res);

/// Parse an identifier or string (as a quoted identifier)		/// Parse an identifier or string (as a quoted identifier)
/// and set \p Res to the identifier contents.		/// and set \p Res to the identifier contents.
bool parseIdentifier(StringRef &Res) override;		bool parseIdentifier(StringRef &Res) override;
[1,215 lines hidden; context: if (parseExpression(Expr))]
return true;		return true;

if (!Expr->evaluateAsAbsolute(Res, getStreamer().getAssemblerPtr()))		if (!Expr->evaluateAsAbsolute(Res, getStreamer().getAssemblerPtr()))
return Error(StartLoc, "expected absolute expression");		return Error(StartLoc, "expected absolute expression");

return false;		return false;
}		}

		bool AsmParser::parseAbsoluteIfCond(int64_t &Res) {
		const MCExpr *Expr;

		SMLoc StartLoc = Lexer.getLoc();
		if (parseExpression(Expr))
		return true;

		if (!Expr->evaluateIfCondAsAbsolute(Res, getStreamer().getAssemblerPtr()))
		return Error(StartLoc, "expected absolute expression");

		return false;
		}

static unsigned getDarwinBinOpPrecedence(AsmToken::TokenKind K,		static unsigned getDarwinBinOpPrecedence(AsmToken::TokenKind K,
MCBinaryExpr::Opcode &Kind,		MCBinaryExpr::Opcode &Kind,
bool ShouldUseLogicalShr) {		bool ShouldUseLogicalShr) {
switch (K) {		switch (K) {
default:		default:
return 0; // not a binop.		return 0; // not a binop.

// Lowest Precedence: &&, ||		// Lowest Precedence: &&, ||
[3,539 lines hidden]
/// ::= .if{,eq,ge,gt,le,lt,ne} expression		/// ::= .if{,eq,ge,gt,le,lt,ne} expression
bool AsmParser::parseDirectiveIf(SMLoc DirectiveLoc, DirectiveKind DirKind) {		bool AsmParser::parseDirectiveIf(SMLoc DirectiveLoc, DirectiveKind DirKind) {
TheCondStack.push_back(TheCondState);		TheCondStack.push_back(TheCondState);
TheCondState.TheCond = AsmCond::IfCond;		TheCondState.TheCond = AsmCond::IfCond;
if (TheCondState.Ignore) {		if (TheCondState.Ignore) {
eatToEndOfStatement();		eatToEndOfStatement();
} else {		} else {
int64_t ExprValue;		int64_t ExprValue;
if (parseAbsoluteExpression(ExprValue) ||		if (parseAbsoluteIfCond(ExprValue) ||
parseToken(AsmToken::EndOfStatement,		parseToken(AsmToken::EndOfStatement,
"unexpected token in '.if' directive"))		"unexpected token in '.if' directive"))
return true;		return true;

switch (DirKind) {		switch (DirKind) {
default:		default:
llvm_unreachable("unsupported directive");		llvm_unreachable("unsupported directive");
case DK_IF:		case DK_IF:
[1,030 lines hidden]

llvm/test/MC/ARM/directive_if_offset.s

This file was added.

				@ RUN: llvm-mc -triple armv7a-linux-gnueabihf %s -filetype=obj -o /dev/null 2>&1 | FileCheck --allow-empty %s
				@ RUN: llvm-mc -triple armv7a-linux-gnueabihf %s -filetype=obj -o %t | llvm-objdump -d %t | FileCheck --check-prefix=CHECK-ASM %s
				MaskRay [Not Done]: In this directory `file-name.s` is more common. What about directive-if-sub.s? sub is more meaningful than offset. Add a `llvm-mc -triple armv7a-linux-gnueabihf %s -o /dev/null 2>&1` test to show that -filetype=asm does not work (MCAssembler * is null so, but this is less of an issue)

				nop
				.arch_extension sec
				9997:
				peter.smith [Not Done]: Can we add a comment to explain the importance of .arch_extension, such as: // Create a new MCDataFragment due to Subtarget change
				.if . - 9997b == 0 ;
				// CHECK-NOT: error: expected absolute expression
				orr r1, r1, #1 ;
				.else ; orr r1, r1, #2;
				.endif;
				// CHECK-ASM: orr r1, r1, #1

llvm/test/MC/ARM/directive_if_offset_error.s

This file was added.

				@ RUN: not llvm-mc -filetype=obj -triple arm-linux-gnueabihf %s -o /dev/null 2>&1 | FileCheck %s

				MaskRay [Not Done]: See ELF/reloc-directive.s. You can use --defsym=ERR=1 to merge the tests into directive-if-sub.s
				jcai19 (Author) [Done]: --defsym=ERR=1 does not seem to work if I move the code from this file into directive-if-subtraction.s, as the run commands still fail. Maybe I am missing something? Also directive-if-subtraction.s requires armv7a so this run command will fail once I make the move.
				MaskRay [Not Done]: See `ELF/reloc-directive.s` for an example.
				  # RUN: not llvm-mc ....... | FileCheck %s --check-prefix=ERR
				  normal assembly
				  .ifdef ERR
				  # ERR: {{.*}}.s:[[#@LINE+1]]:10: error: expected comma
				  error line
				  .endif
				I think it is clearer to place working and non-working examples in one file.
				9997: nop ;
				.align 4
				nop
				.if . - 9997b == 4 ;
				// CHECK: error: expected absolute expression
				.endif

				9997: nop ;
				.space 4
				nop
				.if . - 9997b == 4 ;
				// CHECK: error: expected absolute expression
				.endif

Revision Contents

Diff 227553
