This is an archive of the discontinued LLVM Phabricator instance.

[ThinLTO] Remove functions resolved to available_externally from comdats
ClosedPublic

Authored by tejohnson on Aug 1 2016, 7:38 AM.

Details

Summary

thinLTOResolveWeakForLinkerModule needs to drop any preempted weak symbols
that were converted to available_externally from comdats, otherwise we
will get a verification failure (since available_externally is a
declaration for the linker, and no declarations can be in a comdat).

Diff Detail

Repository
rL LLVM

Event Timeline

tejohnson updated this revision to Diff 66320.Aug 1 2016, 7:38 AM
tejohnson retitled this revision from to [ThinLTO/gold] Conservatively handle unknown GVs when internalizing.
tejohnson updated this object.
tejohnson added reviewers: mehdi_amini, twoh.
tejohnson added a subscriber: llvm-commits.
mehdi_amini edited edge metadata.Aug 1 2016, 11:24 AM

Do we need gold for the test here or could it be written with an LLVM tool?

twoh edited edge metadata.Aug 1 2016, 2:59 PM

BTW, it seems that the original assertion failure (PR28760) doesn't happen even with r274523. Originally IRLinker::linkGlobalValueProto created a new GlobalValue with ".%d" appended to its name, but for the given example code the function returns nullptr because DoneLinkBodies is already true. Could you please double check?

Do we need gold for the test here or could it be written with an LLVM tool?

Let me see if I can trigger without gold's symbol resolution

In D23015#502688, @twoh wrote:

BTW, it seems that the original assertion failure (PR28760) doesn't happen even with r274523. Originally IRLinker::linkGlobalValueProto created a new GlobalValue with ".%d" appended to its name, but for the given example code the function returns nullptr because DoneLinkBodies is already true. Could you please double check?

I haven't tried at r274523, only at head as of yesterday. The test case included here definitely creates a GV with .%d appended (it wouldn't pass if it didn't), and it fails with the assertion if I restore that. Are you saying that the test case included here is not working when the patch is applied to r274523, or that it doesn't provoke the problem at that older revision? Or you cannot reproduce your original failure at r274523? If your original failure is reproducible at a recent revision, is it fixed by the current patch?

twoh added a comment.Aug 1 2016, 3:55 PM

@tejohnson The test code surely generates GV with ".%d" appended, but it doesn't trigger the assertion failure even without this patch, both for r274523 and r277147. Let me try again with more recent version.

My original failure cannot be reproduced by the recent version(r277147) because other issue hides the original failure. However, once I apply this patch to r274523 then it works (linking fails with another error but it definitely makes more progress). I'm just concerning the test code may target wrong.

In D23015#502735, @twoh wrote:

@tejohnson The test code surely generates GV with ".%d" appended, but it doesn't trigger the assertion failure even without this patch, both for r274523 and r277147. Let me try again with more recent version.

Hmm, it does for me and I am at r277267. If I put the assert back it fails with that assert.

My original failure cannot be reproduced by the recent version(r277147) because other issue hides the original failure.

Is this the -g failure? Can you try without -g?

However, once I apply this patch to r274523 then it works (linking fails with another error but it definitely makes more progress). I'm just concerning the test code may target wrong.

twoh added a comment.Aug 1 2016, 4:19 PM

@tejohnson Sorry, it was my bad. I wrote the makefile by myself and swapped the position of t.o and t2.o. Once I position them right the assertion fails.

If I try it without -g, it fails with the same error with r274523.

In D23015#502773, @twoh wrote:

@tejohnson Sorry, it was my bad. I wrote the makefile by myself and swapped the position of t.o and t2.o. Once I position them right the assertion fails.

If I try it without -g, it fails with the same error with r274523.

Ok, great. Let me know if the link succeeds with this patch and with -g removed.

Do we need gold for the test here or could it be written with an LLVM tool?

Let me see if I can trigger without gold's symbol resolution

Can't trigger with llvm-lto. The gold-plugin uses the IRLinker to move the module being compiled (in that ThinLTO backend) into an empty module, indicating the ValuesToLink based on its symbol resolution decisions. Which is why we get f1 linked in only as a declaration for %t2.o, which means we need to create a local copy of it for the use via the alias, resulting in the naming conflict with the declaration. In llvm-lto, we simply use the module being compiled with a thinlto-action as is.

OK, let me backtrack, is the original issue coming from the fact that gold is starting with an empty module?
Then why not changing this?

OK, let me backtrack, is the original issue coming from the fact that gold is starting with an empty module?
Then why not changing this?

We use gold's symbol resolution to determine which values to link in. So in the example, it knows that f1 is a preempted symbol in %t2.o and doesn't need to include its definition. I don't think we should need to change that.

OK, let me backtrack, is the original issue coming from the fact that gold is starting with an empty module?
Then why not changing this?

We use gold's symbol resolution to determine which values to link in. So in the example, it knows that f1 is a preempted symbol in %t2.o and doesn't need to include its definition. I don't think we should need to change that.

I see...
I still does not think that's a great thing: using the IRMover just to get a module where some symbols are pruned seems quite sketchy to me.

Here we accept that the module can have renamed symbols from what is in the index, but that should be an invariant or we'll run in trouble at some point. So I'm not convinced the fix is right at this point.

I dug a bit more, I'm still trying to follow how the bug happens: the IRMover is forcing @f1 (in Inputs/thinlto_alias2.ll) to be linked to satisfy the alias @a2. And since there is already a declaration for @f1 when it does so, it is supposed to create a local copy (i.e. it will be an *internal* function), IRMover.cpp ~line 930.

Now the code you are patching is a callback for the internalize utility, and IIUC it is not called on internal function (which I expect to be the case here), as I read InternalizePass::shouldPreserveGV.

OK, let me backtrack, is the original issue coming from the fact that gold is starting with an empty module?
Then why not changing this?

We use gold's symbol resolution to determine which values to link in. So in the example, it knows that f1 is a preempted symbol in %t2.o and doesn't need to include its definition. I don't think we should need to change that.

I see...
I still does not think that's a great thing: using the IRMover just to get a module where some symbols are pruned seems quite sketchy to me.

Here we accept that the module can have renamed symbols from what is in the index, but that should be an invariant or we'll run in trouble at some point. So I'm not convinced the fix is right at this point.

I decided I agree, see below.

I dug a bit more, I'm still trying to follow how the bug happens: the IRMover is forcing @f1 (in Inputs/thinlto_alias2.ll) to be linked to satisfy the alias @a2. And since there is already a declaration for @f1 when it does so, it is supposed to create a local copy (i.e. it will be an *internal* function), IRMover.cpp ~line 930.

Correct.

Now the code you are patching is a callback for the internalize utility, and IIUC it is not called on internal function (which I expect to be the case here), as I read InternalizePass::shouldPreserveGV.

It has been promoted though (by renameModuleForThinLTO), so it is not internal at this point (which is why the CHECK in the new test case looks like "CHECK: define hidden i32 @f1.1.llvm.0"

But after I sent this last night I realized that we probably should not have the gold-plugin drop any preempted symbols at this point. The reason is that we are losing optimization opportunities:

  1. Loss of inlining opportunity for the dropped preempted linkonce_odr: In this case the weak_odr can't be inlined anyway, but if it was linkonce_odr, we wouldn't be able to inline it if it was preempted (we would have to import it back in, but the import decisions will be made when we still have the linkonce_odr definition and the calls wouldn't be external, but in any case there isn't any point in dropping only to import for inlining later).
  2. Loss of internalization for the local copy - in this case we are checking to see if we can internalize the promoted symbol, which we would be able to do in this case if we could locate its summary. So we lose this opportunity.

I have made a change to the gold-plugin to keep the preempted symbols as well. However, this exposed another issue: when we keep the preempted linkonce in the comdat, thinLTOResolveWeakForLinkerModule comes along and changes it to available_externally. We then get a verification failure about a declaration in a comdat.

Interestingly, we have a fix for this in renameModuleForThinLTO, which drops any available_externally from comdats. It is meant to clean up after importing, but if I reorder renameModuleForThinLTO and thinLTOResolveWeakForLinkerModule in gold-plugin, it fixes this particular issue. I managed to reproduce this issue with an llvm-lto test case, since ProcessThinLTOModule() in ThinLTOCodeGenerator.cpp has the same ordering. It is also fixed by flipping the ordering between these two. Interestingly, and probably completely accidentally, ThinLTOCodeGenerator::promote() already has the desirable ordering. However, I don't like the subtle ordering requirement between these two calls. Perhaps it is better to explicitly strip any new available_externally symbols from comdats in thinLTOResolveWeakForLinkerModule. Hmm, I think I like that approach better. WDYT? Let me do that, will do so and clean up and add the new test cases to the patches later today.

tejohnson updated this revision to Diff 66607.Aug 2 2016, 7:25 PM
tejohnson edited edge metadata.

Change to avoid dropping preempted symbols early.

tejohnson retitled this revision from [ThinLTO/gold] Conservatively handle unknown GVs when internalizing to [ThinLTO/gold] Don't drop preempted symbols early.Aug 2 2016, 7:27 PM
tejohnson updated this object.

Sorry missed the update, will try to get to it today.

mehdi_amini accepted this revision.Aug 10 2016, 9:38 PM
mehdi_amini edited edge metadata.

LGTM, Thanks!

This revision is now accepted and ready to land.Aug 10 2016, 9:38 PM

I discovered while merging this patch with head on Friday that the Resolution-based LTO API I committed for pcc at r278338 already fixes this issue in the same way.

However, the following aspect is still an issue:


I have made a change to the gold-plugin to keep the preempted symbols as well. However, this exposed another issue: when we keep the preempted linkonce in the comdat, thinLTOResolveWeakForLinkerModule comes along and changes it to available_externally. We then get a verification failure about a declaration in a comdat.

Interestingly, we have a fix for this in renameModuleForThinLTO, which drops any available_externally from comdats. It is meant to clean up after importing, but if I reorder renameModuleForThinLTO and thinLTOResolveWeakForLinkerModule in gold-plugin, it fixes this particular issue. I managed to reproduce this issue with an llvm-lto test case, since ProcessThinLTOModule() in ThinLTOCodeGenerator.cpp has the same ordering. It is also fixed by flipping the ordering between these two. Interestingly, and probably completely accidentally, ThinLTOCodeGenerator::promote() already has the desirable ordering. However, I don't like the subtle ordering requirement between these two calls. Perhaps it is better to explicitly strip any new available_externally symbols from comdats in thinLTOResolveWeakForLinkerModule. Hmm, I think I like that approach better. WDYT? Let me do that, will do so and clean up and add the new test cases to the patches later today.


It turns out that with pcc's changes we get the fortuitous ordering of renameModuleForThinLTO after thinLTOResolveWeakForLinkerModule in the new lto::thinBackend, so the issue is not exposed via the gold-plugin. It is however, still an issue in llvm-lto (and I realized in looking into this that I had forgotten to add my new llvm-lto based test case to this patch).

I am going to change this patch to only include this particular fix, along with the llvm-lto based test, and submit it.

tejohnson updated this revision to Diff 68064.Aug 15 2016, 1:20 PM
tejohnson edited edge metadata.

Update to only include fix and test case for dropping available_externally
from comdats.

tejohnson retitled this revision from [ThinLTO/gold] Don't drop preempted symbols early to [ThinLTO] Remove functions resolved to available_externally from comdats.Aug 15 2016, 1:22 PM
tejohnson updated this object.
tejohnson updated this revision to Diff 68075.Aug 15 2016, 2:07 PM
tejohnson updated this object.

Rename test to something more appropriate before committing.

This revision was automatically updated to reflect the committed changes.