This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/Transforms/IPO/
-
Transforms/
-
IPO/
-
GlobalOpt.cpp
-
test/
-
ThinLTO/X86/
-
X86/
-
import-constant.ll
-
Transforms/GlobalOpt/
-
GlobalOpt/
-
2009-11-16-BrokenPerformHeapAllocSRoA.ll
-
cleanup-pointer-root-users.ll
-
dead-store-status.ll

Differential D69428

[GlobalOpt] Remove valgrind specific hacks (revert r160529)
Needs ReviewPublic

Authored by evgeny777 on Oct 25 2019, 3:56 AM.

Download Raw Diff

Details

Reviewers

nicholas
tejohnson
fhahn
echristo
nlewycky
nlopes
baldrick
chandlerc

Commits

rGdbc16ed199dc: [GlobalOpt] Revert valgrind hacks

Summary

The r160529 introduced alternative way of removing globals to appease leak detectors (e.g valgrind). However this commit made nearly impossible removing global if it is of structure or array type and contains element of pointer type. Consider following example:

static int bar() { return 42; }
static struct S {
   int (*F)();
   int A;
} Obj = { bar, 0 };

int main() {
   Obj.A = 42;
   return 0;
}

In the example above global variable Obj will not be eliminated by globalopt, because it contains a pointer, although that pointer is a pointer to function and is never really accessed. The more serious problem is that function bar is not being DCE'd, although it's obviously dead. This could be especially problematic for LTO builds - I'm observing significant amount of dead stuff left in the image, which would otherwise have been eliminated.

Another issues with r160529 are:

Very poor test coverage. The only test case cleanup-pointer-root-users.ll covers tiny part of added functionality
While preventing optimizations in many cases it strangely allows some of global stores to be eliminated. For instance this one will be optimized out:

static void *p;
int main() { p = malloc(100); return 0; }

And this one not:

static void *p;
__attribute__((noinline)) int set(void *_p) { p = _p; }
int main() { set(malloc(100)); return 0; }

An alternative to removal of r160529 could probably be converting global to private with zero initializer.

Diff Detail

Event Timeline

evgeny777 created this revision.Oct 25 2019, 3:56 AM

Herald added subscribers: dexonsmith, mehdi_amini. · View Herald TranscriptOct 25 2019, 3:56 AM

Added the author of r160529 as a reviewer for thoughts.

Well, I found the original discussion here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20120625/145646.html
Added @baldrick and @nlopes

Not sure you are still interested in this patch. If so, I would suggest you get in touch with some Google folks and check with them if their codebase is ready for this patch. They were the only reason for this workaround.

@nlopes

Not sure you are still interested in this patch. If so, I would suggest you get in touch with some Google folks and check with them if their codebase is ready for this patch. They were the only reason for this workaround.

Yeah, it would be nice to get rid of this valgrind legacy (btw, there is already D70006 which eliminates such kind of unneeded globals for thin LTO). However I don't whom to contact.

In D69428#2658010, @evgeny777 wrote:

@nlopes

Not sure you are still interested in this patch. If so, I would suggest you get in touch with some Google folks and check with them if their codebase is ready for this patch. They were the only reason for this workaround.

Yeah, it would be nice to get rid of this valgrind legacy (btw, there is already D70006 which eliminates such kind of unneeded globals for thin LTO). However I don't whom to contact.

@tejohnson @chandlerc @hans should be able to point out someone that can comment on this and/or test if this patch regresses google's leak tests.

test/Transforms/GlobalOpt/2009-11-16-BrokenPerformHeapAllocSRoA.ll
20 ↗	(On Diff #226398)	is this a regression? The internal `@chartypes` global is never read, so the store can go away (well, the whole function is a nop).

In D69428#2658027, @nlopes wrote:

In D69428#2658010, @evgeny777 wrote:

@nlopes

Not sure you are still interested in this patch. If so, I would suggest you get in touch with some Google folks and check with them if their codebase is ready for this patch. They were the only reason for this workaround.

Yeah, it would be nice to get rid of this valgrind legacy (btw, there is already D70006 which eliminates such kind of unneeded globals for thin LTO). However I don't whom to contact.

@tejohnson @chandlerc @hans should be able to point out someone that can comment on this and/or test if this patch regresses google's leak tests.

Added @chandlerc as reviewer here since he was involved in the original discussion at https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20120625/145646.html.

evgeny777 added inline comments.Mar 30 2021, 9:06 AM

test/Transforms/GlobalOpt/2009-11-16-BrokenPerformHeapAllocSRoA.ll
20 ↗	(On Diff #226398)	Doubtful. instcombine run after global opt converts the whole function to `ret void`

Please go ahead. The relevant stakeholders didn't reply, so let's assume they are not interested in this functionality anymore.
Anyway, these days people can use @llvm.used if needed.

This revision is now accepted and ready to land.Apr 12 2021, 11:13 AM

Closed by commit rGdbc16ed199dc: [GlobalOpt] Revert valgrind hacks (authored by evgeny777). · Explain WhyApr 13 2021, 9:11 AM

This revision was automatically updated to reflect the committed changes.

evgeny777 added a commit: rGdbc16ed199dc: [GlobalOpt] Revert valgrind hacks.

Herald added a project: Restricted Project. · View Herald TranscriptApr 13 2021, 9:11 AM

Herald added subscribers: steven_wu, hiraditya. · View Herald Transcript

I don't know why the various people you added didn't respond. This is unfortunate.

As it happens, this change does regress a bunch of Google internal leak tests with some internal tools. As it also happens, we are working to replace these tools with lsan, but that won't happen for a bit longer, and I can't promise a timeline.

Not sure the best thing to do from here. From my perspective, in a perfect world, if we could revert it, that would be great. Leaks do happen in the cases that the original change mentions:

If we delete the variable, but don't eliminate the corresponding allocation, then that memory does leak.

To me that looks like a problem of "as-if" and definition of what a "leak" is: do we have a reference to a standard (or better LangRef) that would guide this here?
(to me it always looked like these were leaks in the first place, even if the leak checker was happy about them because technically "reachable")

In the meantime, we should revert: if this is making clang entirely incompatible with Valgrind it'd be nice to have a plan. Can we bring this up on llvm-dev@ maybe?

saugustine added a reverting change: rG32e264921b7a: Revert "[GlobalOpt] Revert valgrind hacks".Apr 13 2021, 5:47 PM

I went ahead and reverted. We can discuss a plan--hopefully the internal cleanup to obviate this will go smoothly.

Let me know if there are more issues.

// a.cc
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

static void *g;

void set(char *a);
void foo(void *a) { g = a; } // add a store from a different function to suppress global internalization

int main() {
  char a[10];
  set(a);
  char *b = strdup(a);
  printf("%p %s\n", b, b);
  g = b;
}
// b.cc
#include <string.h>

void set(char *a) {
  strcpy(a, "hello");
}

clang++ -O1 a.cc b.cc -fsanitize=leak had no leak before and reported a leak with this patch. You can also change -O1 to -O2 or -O3. -fsanitize=leak can be changed to -fsanitize=address. You may remove -fsanitize= and use valgrind.

(Note I kept a store from a different function foo, because otherwise the GlobalOpt internalization can cause a false positive before this patch.... I don't know whether it is worth a fix. I think in practice for interesting cases for such a private global variable retaining a heap allocation, there are always more than one storer...)

I think there is value retaining the original test case cleanup-pointer-root-users.ll. We can teach isLeakCheckerRoot that function pointer should not need the pessimization.

In D69428#2687530, @MaskRay wrote:

clang++ -O1 a.cc b.cc -fsanitize=leak had no leak before and reported a leak with this patch. You can also change -O1 to -O2 or -O3. -fsanitize=leak can be changed to -fsanitize=address. You may remove -fsanitize= and use valgrind.

I'd question whether this is a feature or a bug though: if a user really want to intentionally "leak" this way, they could mark the static void *g; with __attribute__((__used__)) to prevent the optimization, or make the store volatile.

In D69428#2687680, @mehdi_amini wrote:

In D69428#2687530, @MaskRay wrote:

clang++ -O1 a.cc b.cc -fsanitize=leak had no leak before and reported a leak with this patch. You can also change -O1 to -O2 or -O3. -fsanitize=leak can be changed to -fsanitize=address. You may remove -fsanitize= and use valgrind.

I'd question whether this is a feature or a bug though: if a user really want to intentionally "leak" this way, they could mark the static void *g; with __attribute__((__used__)) to prevent the optimization, or make the store volatile.

The loads can be optimized out, so I don't think the user needs annotation.

In D69428#2687701, @MaskRay wrote:

In D69428#2687680, @mehdi_amini wrote:

In D69428#2687530, @MaskRay wrote:

clang++ -O1 a.cc b.cc -fsanitize=leak had no leak before and reported a leak with this patch. You can also change -O1 to -O2 or -O3. -fsanitize=leak can be changed to -fsanitize=address. You may remove -fsanitize= and use valgrind.

I'd question whether this is a feature or a bug though: if a user really want to intentionally "leak" this way, they could mark the static void *g; with __attribute__((__used__)) to prevent the optimization, or make the store volatile.

The loads can be optimized out, so I don't think the user needs annotation.

Can you expand on why the store shouldn't be optimized? I understand it breaks leak checker, but that does not make it a bug...

I think it is a valid transformation.

On the other hand, all three leak checkers that we have tried don't cope with the transformation well at all, and having those work is a useful feature from a QOI point of view. One possibility would be to only do the optimization at, say, -O3, rather than at -O1. Suboptimal, but better than the current situation. Having asan to report a failure at O3 that it doesn't at -O0 is a little weird, but better than nothing.

Another question to ask is:

Just how much do we gain by eliminating these locations? Are we talking 10%, 1%, or what?

On the other hand, all three leak checkers that we have tried don't cope with the transformation well at all

Well, again it depends on your point of view: from my point of view they work perfectly well here and the problem could very well be seen as to be fixed in the user code if this is what they intend (otherwise it will find more unintended leak potentially).

In D69428#2687372, @saugustine wrote:

I went ahead and reverted. We can discuss a plan--hopefully the internal cleanup to obviate this will go smoothly.

Let me know if there are more issues.

I think it's unfair what just happened here. Google had over a year to comment and ended up wasting people's time.

Can you please comment on why you can't use __attribute__((__used__))? If you want to keep this hack in LLVM, I think it would be helpful if we understood the reason. Thanks.

Just how much do we gain by eliminating these locations? Are we talking 10%, 1%, or what?

I don't know, but preventing optimization to appease valgrind looks .. hm.. strange. I agree with @nlopes that __attribute__((__used__)) should be used for such purpose (probably added from some instrumentation pass)

+1.
I find it quite worrying/unacceptable that $BIGCORP believes they get to have the final say as to what happens in $PROJ.

Hot take: immediately revert the revert.
If someone needs that hack, they can submit a new review, and argument *there* why llvm must have this hack,
not their llvm fork/their codebase.

I think it's unfair what just happened here. Google had over a year to comment and ended up wasting people's time.

True. Why the LLVM project should suffer because Google is unable to take action for such long time?

Googler 1: In the meantime, we should revert
Googler 2: I went ahead and reverted.

Rather than provide any reasonable action plan (maybe temporarily provide option for google/others to leave this hack on for them), they just reverted it without proper review/discussion to possibly solve this "issue" with a different way :/ Just OK from google-colleague is not enough.

I find it quite worrying/unacceptable that $BIGCORP believes they get to have the final say as to what happens in $PROJ.

+1.

If someone needs that hack, they can submit a new review, and argument *there* why llvm must have this hack, not their llvm fork/their codebase.

+1. Anybody needs some hacks? Maintain it in own internal fork - should not be a problem, right?

xbolva00 added a subscriber: lattner.Apr 14 2021, 4:06 AM

(Disclaimer: I'm also employed by Google).

In D69428#2688409, @xbolva00 wrote:

I think it's unfair what just happened here. Google had over a year to comment and ended up wasting people's time.

True. Why the LLVM project should suffer because Google is unable to take action for such long time?

It was certainly quite unfortunate that nobody responded to questions posed here. It could've avoided unnecessary strife if this conversation had started sooner. Yet, in the end, everyone's goal is to have a good compiler -- nobody wants the LLVM project to suffer.

Rather than provide any reasonable action plan (maybe temporarily provide option for google/others to leave this hack on for them), they just reverted it without proper review/discussion to possibly solve this "issue" with a different way :/ Just OK from google-colleague is not enough.

I believe the revert was consistent with the long-standing LLVM reversion policy (which was recently codified in https://llvm.org/docs/DeveloperPolicy.html#patch-reversion-policy). There were real problems identified with this change -- it breaks both clang -fsanitize=leak and external leak-checkers, and a current user of external leak-checking (Google) came forward to say that they were broken by it.

As far as I can tell, the breakage of -fsanitize=leak was not known prior to the commit, nor was it known that this change would cause active breakage for Google's current use of an external leak-checker. (It's unfortunate and frustrating that the question was raised, yet not answered in the affirmative until now, but I don't think that changes the ultimate outcome.)

So, now we have two new pieces of information, which were not known at the time of commit. That is reasonable grounds for a revert while discussion takes place as to what the next step ("action plan") should be.

I find it quite worrying/unacceptable that $BIGCORP believes they get to have the final say as to what happens in $PROJ.

+1.

I don't think it's helpful to look at it that way. _Everyone_ gets to point out problems, even after commits have gone in. If there are issues, the first answer to revert while discussion on the proper resolution to the issue takes place. A patch reversion in the LLVM process is not a "veto" or a "final say", it is a step to take during discussion. One possible outcome of the discussion could be consensus that the commit was good after-all and should be reinstated as is, despite the issues identified. If so, it will be a more-educated conclusion, properly taking into account those now known issues.

I hope/expect that further discussion on this review can result in an outcome to everyone's satisfaction without any need for escalation. Yet, if disagreement on a review is not resolving, there are escalation mechanisms in place to help resolve the issue. First, the question can be brought up on llvm-dev for wider input. That's usually sufficient. If that also fails to resolve the issue, it can be escalated to the formal decision-making process (https://github.com/llvm/llvm-www/blob/main/proposals/LP0001-LLVMDecisionMaking.md). (I mention that mostly for completeness, I seriously doubt this could get that far.)

As far as I can tell, the breakage of -fsanitize=leak was not known prior to the commit

What kind of breakage? According to @MaskRay (I've just verified his code) LSAN detects memory leak when valgrind hack is reversed, so there is at least some improvement, not breakage. Even if LSAN needs to keep pointer globals for some reason then it can do so from instrumentation pass, because LSAN is part of LLVM. I see no reason why hacks should reside in LLVM trunk to appease some third party tools.

In D69428#2689038, @evgeny777 wrote:

As far as I can tell, the breakage of -fsanitize=leak was not known prior to the commit

What kind of breakage? According to @MaskRay (I've just verified his code) LSAN detects memory leak when valgrind hack is reversed, so there is at least some improvement, not breakage.

MaskRay's example does not show an improvement in LSAN behavior, but rather a new false positive. Memory which is still accessible at process termination is not a leak, so the source -- as written -- had no leak, and should ideally not be reported as such.

Even if LSAN needs to keep pointer globals for some reason then it can do so from instrumentation pass, because LSAN is part of LLVM.

Perhaps that's the best answer in the end.

I see no reason why hacks should reside in LLVM trunk to appease some third party tools.

Well, there is a reason. It's pretty generic: if (some) users expect to be able to be able to use such tools, then breaking them -- with no possible recourse -- hurts such users. Of course, even knowing that you're breaking users, sometimes it can turn out that other factors are ultimately deemed more important. For example, the change might help many other users, or reduce complexity in the compiler significantly. Those factors could be judged to outweigh the breakage.

Yet, even when you judge other considerations to have a higher value, it is useful to recognize that this is a judgement on relative values, and not an absolute truth: the thing being broken had some non-zero value. So, it could be more accurate and less confrontational to say something like "I don't think it's worth reducing the optimization potential of LLVM to appease third-party tools like Valgrind or Google's leak checker, when LSAN can do the same job these days", instead of saying there is "no reason". (Assuming LSAN is fixed, of course.)

I have opened the discussion on llvm-dev. I do want to find a solution that everyone can live with, and I apologize that it felt like Google was stepping over its bounds.

I agree that this patch is the right direction. The original one was quite unprincipled. I appreciate the deleted code as well :-)

llvm/test/Transforms/GlobalOpt/dead-store-status.ll
0	Please don't add a dependency on instcombine here, this should be a unit test for globalopt alone.

Rebased and addressed comments from @lattner (I've removed the dead-store-status.ll because it is specific to CleanupPointerRootUsers)

evgeny777 reopened this revision.Apr 15 2021, 5:39 AM

This revision is now accepted and ready to land.Apr 15 2021, 5:39 AM

evgeny777 requested review of this revision.Apr 15 2021, 5:41 AM

Harbormaster completed remote builds in B98890: Diff 337725.Apr 15 2021, 6:45 AM

Let's see what happens in the discussion on llvm-dev before moving forward with this patch.

For anyone reviewing this patch, one really good thing to do would be to run the leak checker of their choice on their own code base both with and without the patch, and evaluate if the change in behavior is acceptable.

In D69428#2689270, @jyknight wrote:

MaskRay's example does not show an improvement in LSAN behavior, but rather a new false positive. Memory which is still accessible at process termination is not a leak, so the source -- as written -- had no leak

This seems an unusual definition of memory leak.

Is it predicated on the assumption of a hosted environment which will perform the memory free on behalf of the application at termination, where said host+application in some sense can never leak memory?

The assumption is that memory still reachable at program termination, but not deallocated, has not leaked. And it is how all three of LSAN, Valgrind, and HeapLeakChecker define it.

That may not make sense in embedded environments, but that is how most leak checkers work (and in nonhosted environments, things are quite different anyway). FWIW, HeapLeakChecker does have a mode that checks for this, but it has been unused for many years.

At a previous company I worked for that did deeply embedded bare-metal development, there were absolutely cleanup functions that could deallocate all memory, available to be called at any time the program deemed it appropriate. I don't think that is all that unusual.

In D69428#2700667, @JonChesterfield wrote:

In D69428#2689270, @jyknight wrote:

MaskRay's example does not show an improvement in LSAN behavior, but rather a new false positive. Memory which is still accessible at process termination is not a leak, so the source -- as written -- had no leak

This seems an unusual definition of memory leak.

Is it predicated on the assumption of a hosted environment which will perform the memory free on behalf of the application at termination, where said host+application in some sense can never leak memory?

I've provided some context why we need to retain global pointer variable retained objects: https://lists.llvm.org/pipermail/llvm-dev/2021-April/149857.html
Actually I just applied this pattern yesterday for lldb: D100806.

The global variable may have readers but that readers can be optimized out. Then with this patch, such global variables will be optimized away and cause a leak.

If we ignore such patterns, we would cause a huge list of false positives in many projects, with all of valgrind, -fsanitize=leak, -fsanitize=address. "Remove valgrind specific hacks" the subject is imprecise: folks might think this is valgrind specific, and likely no-longer-needed, workaround. However, this piece of code applies to various leak checkers and is actively useful.

As of how to retain the potential missing optimization (to the best of my knowledge it is unmeasured), I've suggested that we can ignore function pointers as root-set if deemed useful.

In D69428#2702157, @MaskRay wrote:

The global variable may have readers but that readers can be optimized out. Then with this patch, such global variables will be optimized away and cause a leak.

FWIW, this example from @MaskRay seems pretty reasonable to me. I could see users annotating globals that catch intentional leaks, but adding annotations for this scenario (optimized-out reader) might make users feel like they're fighting the compiler. At least, seems like it might need some discussion outside of patch review.

davide removed a reviewer: davide.Apr 20 2021, 11:55 AM

This patch will break https://lab.llvm.org/buildbot/#/builders/sanitizer-x86_64-linux-bootstrap
E.g. llvm::BuryPointer is not compatible with this change.

In D69428#2702638, @vitalybuka wrote:

This patch will break https://lab.llvm.org/buildbot/#/builders/sanitizer-x86_64-linux-bootstrap
E.g. llvm::BuryPointer is not compatible with this change.

The global is marked with an attribute, is it because the store to the GraveYard isn't volatile?

vitalybuka mentioned this in D100906: [lsan] Test to show lsan dependency on globals.Apr 20 2021, 4:12 PM

vitalybuka mentioned this in rG5e9e463e1fdd: [lsan] Test to show lsan dependency on globals.Apr 20 2021, 10:00 PM

MaskRay mentioned this in D101211: [ADT] Remove StatisticBase and make NoopStatistic empty.Apr 23 2021, 5:05 PM

MaskRay mentioned this in D101217: Add LLVM_ATTRIBUTE_USED to fix problems which could be exposed by aggressive global pointer variable removal.Apr 23 2021, 6:04 PM

MaskRay mentioned this in rGb81244fa4ff9: Add LLVM_ATTRIBUTE_USED to fix problems which could be exposed by aggressive….Apr 26 2021, 1:31 PM

MaskRay mentioned this in rGb5403117814a: [ADT] Remove StatisticBase and make NoopStatistic empty.Apr 26 2021, 1:39 PM

MaskRay mentioned this in rG18839be9c5c8: [ADT] Remove StatisticBase and make NoopStatistic empty.Apr 26 2021, 4:47 PM

jeroen.dobbelaere added a subscriber: jeroen.dobbelaere.May 11 2021, 11:14 PM

Revision Contents

Path

Size

llvm/

lib/

Transforms/

IPO/

GlobalOpt.cpp

190 lines

test/

ThinLTO/

X86/

import-constant.ll

3 lines

Transforms/

GlobalOpt/

2009-11-16-BrokenPerformHeapAllocSRoA.ll

2 lines

cleanup-pointer-root-users.ll

dead-store-status.ll

Diff 337725

llvm/lib/Transforms/IPO/GlobalOpt.cpp

Show First 20 Lines • Show All 100 Lines • ▼ Show 20 Lines

static cl::opt<int> ColdCCRelFreq(		static cl::opt<int> ColdCCRelFreq(
"coldcc-rel-freq", cl::Hidden, cl::init(2), cl::ZeroOrMore,		"coldcc-rel-freq", cl::Hidden, cl::init(2), cl::ZeroOrMore,
cl::desc(		cl::desc(
"Maximum block frequency, expressed as a percentage of caller's "		"Maximum block frequency, expressed as a percentage of caller's "
"entry frequency, for a call site to be considered cold for enabling"		"entry frequency, for a call site to be considered cold for enabling"
"coldcc"));		"coldcc"));

/// Is this global variable possibly used by a leak checker as a root? If so,
/// we might not really want to eliminate the stores to it.
static bool isLeakCheckerRoot(GlobalVariable *GV) {
// A global variable is a root if it is a pointer, or could plausibly contain
// a pointer. There are two challenges; one is that we could have a struct
// the has an inner member which is a pointer. We recurse through the type to
// detect these (up to a point). The other is that we may actually be a union
// of a pointer and another type, and so our LLVM type is an integer which
// gets converted into a pointer, or our type is an [i8 x #] with a pointer
// potentially contained here.

if (GV->hasPrivateLinkage())
return false;

SmallVector<Type *, 4> Types;
Types.push_back(GV->getValueType());

unsigned Limit = 20;
do {
Type *Ty = Types.pop_back_val();
switch (Ty->getTypeID()) {
default: break;
case Type::PointerTyID:
return true;
case Type::FixedVectorTyID:
case Type::ScalableVectorTyID:
if (cast<VectorType>(Ty)->getElementType()->isPointerTy())
return true;
break;
case Type::ArrayTyID:
Types.push_back(cast<ArrayType>(Ty)->getElementType());
break;
case Type::StructTyID: {
StructType *STy = cast<StructType>(Ty);
if (STy->isOpaque()) return true;
for (StructType::element_iterator I = STy->element_begin(),
E = STy->element_end(); I != E; ++I) {
Type InnerTy = I;
if (isa<PointerType>(InnerTy)) return true;
if (isa<StructType>(InnerTy) \|\| isa<ArrayType>(InnerTy) \|\|
isa<VectorType>(InnerTy))
Types.push_back(InnerTy);
}
break;
}
}
if (--Limit == 0) return true;
} while (!Types.empty());
return false;
}

/// Given a value that is stored to a global but never read, determine whether
/// it's safe to remove the store and the chain of computation that feeds the
/// store.
static bool IsSafeComputationToRemove(
Value *V, function_ref<TargetLibraryInfo &(Function &)> GetTLI) {
do {
if (isa<Constant>(V))
return true;
if (!V->hasOneUse())
return false;
if (isa<LoadInst>(V) \|\| isa<InvokeInst>(V) \|\| isa<Argument>(V) \|\|
isa<GlobalValue>(V))
return false;
if (isAllocationFn(V, GetTLI))
return true;

Instruction *I = cast<Instruction>(V);
if (I->mayHaveSideEffects())
return false;
if (GetElementPtrInst *GEP = dyn_cast<GetElementPtrInst>(I)) {
if (!GEP->hasAllConstantIndices())
return false;
} else if (I->getNumOperands() != 1) {
return false;
}

V = I->getOperand(0);
} while (true);
}

/// This GV is a pointer root. Loop over all users of the global and clean up
/// any that obviously don't assign the global a value that isn't dynamically
/// allocated.
static bool
CleanupPointerRootUsers(GlobalVariable *GV,
function_ref<TargetLibraryInfo &(Function &)> GetTLI) {
// A brief explanation of leak checkers. The goal is to find bugs where
// pointers are forgotten, causing an accumulating growth in memory
// usage over time. The common strategy for leak checkers is to explicitly
// allow the memory pointed to by globals at exit. This is popular because it
// also solves another problem where the main thread of a C++ program may shut
// down before other threads that are still expecting to use those globals. To
// handle that case, we expect the program may create a singleton and never
// destroy it.

bool Changed = false;

// If Dead[n].first is the only use of a malloc result, we can delete its
// chain of computation and the store to the global in Dead[n].second.
SmallVector<std::pair<Instruction , Instruction >, 32> Dead;

// Constants can't be pointers to dynamically allocated memory.
for (Value::user_iterator UI = GV->user_begin(), E = GV->user_end();
UI != E;) {
User U = UI++;
if (StoreInst *SI = dyn_cast<StoreInst>(U)) {
Value *V = SI->getValueOperand();
if (isa<Constant>(V)) {
Changed = true;
SI->eraseFromParent();
} else if (Instruction *I = dyn_cast<Instruction>(V)) {
if (I->hasOneUse())
Dead.push_back(std::make_pair(I, SI));
}
} else if (MemSetInst *MSI = dyn_cast<MemSetInst>(U)) {
if (isa<Constant>(MSI->getValue())) {
Changed = true;
MSI->eraseFromParent();
} else if (Instruction *I = dyn_cast<Instruction>(MSI->getValue())) {
if (I->hasOneUse())
Dead.push_back(std::make_pair(I, MSI));
}
} else if (MemTransferInst *MTI = dyn_cast<MemTransferInst>(U)) {
GlobalVariable *MemSrc = dyn_cast<GlobalVariable>(MTI->getSource());
if (MemSrc && MemSrc->isConstant()) {
Changed = true;
MTI->eraseFromParent();
} else if (Instruction *I = dyn_cast<Instruction>(MemSrc)) {
if (I->hasOneUse())
Dead.push_back(std::make_pair(I, MTI));
}
} else if (ConstantExpr *CE = dyn_cast<ConstantExpr>(U)) {
if (CE->use_empty()) {
CE->destroyConstant();
Changed = true;
}
} else if (Constant *C = dyn_cast<Constant>(U)) {
if (isSafeToDestroyConstant(C)) {
C->destroyConstant();
// This could have invalidated UI, start over from scratch.
Dead.clear();
CleanupPointerRootUsers(GV, GetTLI);
return true;
}
}
}

for (int i = 0, e = Dead.size(); i != e; ++i) {
if (IsSafeComputationToRemove(Dead[i].first, GetTLI)) {
Dead[i].second->eraseFromParent();
Instruction *I = Dead[i].first;
do {
if (isAllocationFn(I, GetTLI))
break;
Instruction *J = dyn_cast<Instruction>(I->getOperand(0));
if (!J)
break;
I->eraseFromParent();
I = J;
} while (true);
I->eraseFromParent();
Changed = true;
}
}

return Changed;
}

/// We just marked GV constant. Loop over all users of the global, cleaning up		/// We just marked GV constant. Loop over all users of the global, cleaning up
/// the obvious ones. This is largely just a quick scan over the use list to		/// the obvious ones. This is largely just a quick scan over the use list to
/// clean up the easy and obvious cruft. This returns true if it made a change.		/// clean up the easy and obvious cruft. This returns true if it made a change.
static bool CleanupConstantGlobalUsers(		static bool CleanupConstantGlobalUsers(
Value V, Constant Init, const DataLayout &DL,		Value V, Constant Init, const DataLayout &DL,
function_ref<TargetLibraryInfo &(Function &)> GetTLI) {		function_ref<TargetLibraryInfo &(Function &)> GetTLI) {
bool Changed = false;		bool Changed = false;
// Note that we need to use a weak value handle for the worklist items. When		// Note that we need to use a weak value handle for the worklist items. When
▲ Show 20 Lines • Show All 532 Lines • ▼ Show 20 Lines	if (Changed) {
LLVM_DEBUG(dbgs() << "OPTIMIZED LOADS FROM STORED ONCE POINTER: " << *GV		LLVM_DEBUG(dbgs() << "OPTIMIZED LOADS FROM STORED ONCE POINTER: " << *GV
<< "\n");		<< "\n");
++NumGlobUses;		++NumGlobUses;
}		}

// If we nuked all of the loads, then none of the stores are needed either,		// If we nuked all of the loads, then none of the stores are needed either,
// nor is the global.		// nor is the global.
if (AllNonStoreUsesGone) {		if (AllNonStoreUsesGone) {
if (isLeakCheckerRoot(GV)) {
Changed \|= CleanupPointerRootUsers(GV, GetTLI);
} else {
Changed = true;		Changed = true;
CleanupConstantGlobalUsers(GV, nullptr, DL, GetTLI);		CleanupConstantGlobalUsers(GV, nullptr, DL, GetTLI);
}
if (GV->use_empty()) {		if (GV->use_empty()) {
LLVM_DEBUG(dbgs() << " *** GLOBAL NOW DEAD!\n");		LLVM_DEBUG(dbgs() << " *** GLOBAL NOW DEAD!\n");
Changed = true;		Changed = true;
GV->eraseFromParent();		GV->eraseFromParent();
++NumDeleted;		++NumDeleted;
}		}
}		}
return Changed;		return Changed;
▲ Show 20 Lines • Show All 1,152 Lines • ▼ Show 20 Lines	processInternalGlobal(GlobalVariable *GV, const GlobalStatus &GS,

bool Changed = false;		bool Changed = false;

// If the global is never loaded (but may be stored to), it is dead.		// If the global is never loaded (but may be stored to), it is dead.
// Delete it now.		// Delete it now.
if (!GS.IsLoaded) {		if (!GS.IsLoaded) {
LLVM_DEBUG(dbgs() << "GLOBAL NEVER LOADED: " << *GV << "\n");		LLVM_DEBUG(dbgs() << "GLOBAL NEVER LOADED: " << *GV << "\n");

if (isLeakCheckerRoot(GV)) {
// Delete any constant stores to the global.
Changed = CleanupPointerRootUsers(GV, GetTLI);
} else {
// Delete any stores we can find to the global. We may not be able to		// Delete any stores we can find to the global. We may not be able to
// make it completely dead though.		// make it completely dead though.
Changed =		Changed =
CleanupConstantGlobalUsers(GV, GV->getInitializer(), DL, GetTLI);		CleanupConstantGlobalUsers(GV, GV->getInitializer(), DL, GetTLI);
}

// If the global is dead now, delete it.		// If the global is dead now, delete it.
if (GV->use_empty()) {		if (GV->use_empty()) {
GV->eraseFromParent();		GV->eraseFromParent();
++NumDeleted;		++NumDeleted;
Changed = true;		Changed = true;
}		}
return Changed;		return Changed;
▲ Show 20 Lines • Show All 1,192 Lines • Show Last 20 Lines

llvm/test/ThinLTO/X86/import-constant.ll

	Show All 26 Lines
	; Check that variable has been promoted in the source module			; Check that variable has been promoted in the source module
	; PROMOTE: @_ZL3Obj.llvm.{{.}} = hidden constant %struct.S { i32 4, i32 8, i32 @val }			; PROMOTE: @_ZL3Obj.llvm.{{.}} = hidden constant %struct.S { i32 4, i32 8, i32 @val }

	; @outer is a write-only variable, so it's been converted to zeroinitializer.			; @outer is a write-only variable, so it's been converted to zeroinitializer.
	; IMPORT: @outer = internal local_unnamed_addr global %struct.Q zeroinitializer			; IMPORT: @outer = internal local_unnamed_addr global %struct.Q zeroinitializer
	; IMPORT-NEXT: @_ZL3Obj.llvm.{{.}} = available_externally hidden constant %struct.S { i32 4, i32 8, i32 @val }			; IMPORT-NEXT: @_ZL3Obj.llvm.{{.}} = available_externally hidden constant %struct.S { i32 4, i32 8, i32 @val }
	; IMPORT-NEXT: @val = available_externally global i32 42			; IMPORT-NEXT: @val = available_externally global i32 42

	; OPT: @outer = internal unnamed_addr global %struct.Q zeroinitializer

	; OPT: define dso_local i32 @main()			; OPT: define dso_local i32 @main()
	; OPT-NEXT: entry:			; OPT-NEXT: entry:
	; OPT-NEXT: store %struct.S* null, %struct.S** getelementptr inbounds (%struct.Q, %struct.Q* @outer, i64 0, i32 0)
	; OPT-NEXT: ret i32 12			; OPT-NEXT: ret i32 12

	; NOREFS: @outer = internal local_unnamed_addr global %struct.Q zeroinitializer			; NOREFS: @outer = internal local_unnamed_addr global %struct.Q zeroinitializer
	; NOREFS-NEXT: @_ZL3Obj.llvm.{{.*}} = external hidden constant %struct.S			; NOREFS-NEXT: @_ZL3Obj.llvm.{{.*}} = external hidden constant %struct.S

	target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-p270:32:32-p271:32:32-p272:64:64-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	Show All 14 Lines

llvm/test/Transforms/GlobalOpt/2009-11-16-BrokenPerformHeapAllocSRoA.ll

	Show All 11 Lines
	; CHECK-NOT: @chartypes			; CHECK-NOT: @chartypes

	; based on linit in office-ispell			; based on linit in office-ispell
	define void @test() nounwind ssp {			define void @test() nounwind ssp {
	%1 = load i32, i32* getelementptr inbounds (%struct.hashheader, %struct.hashheader* @hashheader, i64 0, i32 13), align 8 ; <i32> [#uses=1]			%1 = load i32, i32* getelementptr inbounds (%struct.hashheader, %struct.hashheader* @hashheader, i64 0, i32 13), align 8 ; <i32> [#uses=1]
	%2 = sext i32 %1 to i64 ; <i64> [#uses=1]			%2 = sext i32 %1 to i64 ; <i64> [#uses=1]
	%3 = mul i64 %2, ptrtoint (%struct.strchartype* getelementptr (%struct.strchartype, %struct.strchartype* null, i64 1) to i64) ; <i64> [#uses=1]			%3 = mul i64 %2, ptrtoint (%struct.strchartype* getelementptr (%struct.strchartype, %struct.strchartype* null, i64 1) to i64) ; <i64> [#uses=1]
	%4 = tail call i8* @malloc(i64 %3) ; <i8*> [#uses=1]			%4 = tail call i8* @malloc(i64 %3) ; <i8*> [#uses=1]
	; CHECK-NOT: call i8* @malloc(i64			; CHECK: call i8* @malloc(i64
	%5 = bitcast i8* %4 to %struct.strchartype* ; <%struct.strchartype*> [#uses=1]			%5 = bitcast i8* %4 to %struct.strchartype* ; <%struct.strchartype*> [#uses=1]
	store %struct.strchartype* %5, %struct.strchartype** @chartypes, align 8			store %struct.strchartype* %5, %struct.strchartype** @chartypes, align 8
	ret void			ret void
	}			}

	declare noalias i8* @malloc(i64)			declare noalias i8* @malloc(i64)

llvm/test/Transforms/GlobalOpt/cleanup-pointer-root-users.ll

This file was deleted.

	; RUN: opt -globalopt -S -o - < %s \| FileCheck %s

	@glbl = internal global i8* null

	define void @test1a() {
	; CHECK-LABEL: @test1a(
	; CHECK-NOT: store
	; CHECK-NEXT: ret void
	store i8* null, i8** @glbl
	ret void
	}

	define void @test1b(i8* %p) {
	; CHECK-LABEL: @test1b(
	; CHECK-NEXT: store
	; CHECK-NEXT: ret void
	store i8* %p, i8** @glbl
	ret void
	}

	define void @test2() {
	; CHECK-LABEL: @test2(
	; CHECK: alloca i8
	%txt = alloca i8
	call void @foo2(i8* %txt)
	%call2 = call i8* @strdup(i8* %txt)
	store i8* %call2, i8** @glbl
	ret void
	}
	declare i8* @strdup(i8*)
	declare void @foo2(i8*)

	define void @test3() uwtable personality i32 (i32, i64, i8, i8)* @__gxx_personality_v0 {
	; CHECK-LABEL: @test3(
	; CHECK-NOT: bb1:
	; CHECK-NOT: bb2:
	; CHECK: invoke
	%ptr = invoke i8* @_Znwm(i64 1)
	to label %bb1 unwind label %bb2
	bb1:
	store i8* %ptr, i8** @glbl
	unreachable
	bb2:
	%tmp1 = landingpad { i8*, i32 }
	cleanup
	resume { i8*, i32 } %tmp1
	}
	declare i32 @__gxx_personality_v0(i32, i64, i8, i8)
	declare i8* @_Znwm(i64)

llvm/test/Transforms/GlobalOpt/dead-store-status.ll

This file was deleted.

	; RUN: opt < %s -globalopt -S \| FileCheck %s

	; When removing the store to @global in @foo, the pass would incorrectly return
	; false. This was caught by the pass return status check that is hidden under
	; EXPENSIVE_CHECKS.

	; CHECK: @global = internal unnamed_addr global i16* null, align 1

	; CHECK-LABEL: @foo
	; CHECK-NEXT: entry:
	; CHECK-NEXT: ret i16 undef

	@global = internal unnamed_addr global i16* null, align 1

	; Function Attrs: nofree noinline norecurse nounwind writeonly
	define i16 @foo(i16 %c) local_unnamed_addr #0 {
	entry:
	%local1.addr = alloca i16, align 1
	store i16* %local1.addr, i16** @global, align 1
	ret i16 undef
	}

	; Function Attrs: noinline nounwind writeonly
	define i16 @bar() local_unnamed_addr #1 {
	entry:
	%local2 = alloca [1 x i16], align 1
	%0 = bitcast [1 x i16]* %local2 to i8*
	call void @llvm.lifetime.start.p0i8(i64 1, i8* nonnull %0)
	%arraydecay = getelementptr inbounds [1 x i16], [1 x i16]* %local2, i16 0, i16 0
	store i16* %arraydecay, i16** @global, align 1
	call void @llvm.lifetime.end.p0i8(i64 1, i8* nonnull %0)
	ret i16 undef
	}

	; Function Attrs: argmemonly nounwind willreturn
	declare void @llvm.lifetime.start.p0i8(i64 immarg, i8* nocapture) #2

	; Function Attrs: argmemonly nounwind willreturn
	declare void @llvm.lifetime.end.p0i8(i64 immarg, i8* nocapture) #2

	attributes #0 = { nofree noinline norecurse nounwind writeonly }
	attributes #1 = { noinline nounwind writeonly }
	attributes #2 = { argmemonly nounwind willreturn }