This is an archive of the discontinued LLVM Phabricator instance.

Differential D32140

Global code motion of congruent computations
Needs ReviewPublic

Authored by hiraditya on Apr 17 2017, 2:34 PM.

Download Raw Diff

Details

Reviewers

sebpop

Summary

This is an early work here for tracking changes. Feedback are most welcome.

This patch implements Global Code Motion (GCM) compiler optimization which schedules congruent
instructions across the program. This is an extension of GVNHoist. Not only GCM saves code size, it exposes
redundancies in some cases, it exposes more instruction level parallelism in the
basic-block to which instructions are moved, and it enables other passes like
loop invariant motion to remove more redundancies. The cost model to drive the
code motion is based on liveness analysis on SSA representation such that the
(virtual) register pressure does not increase resulting in 2% fewer spills on
the SPEC-2006 benchmark suite when compiled for x86_64-linux.

The experimental results show reduction in the total compilation time by 1% on SPEC. GCM enables more
inlining and exposes more loop invariant code motion opportunities in majority
of the benchmarks. We have also seen execution time improvements in a few of
SPEC benchmarks viz. mcf (3%) and sjeng(2%).

Stats on llvm-testsuite:

2854 instructions hoisted
2867 instructions removed
1361 loads hoisted
1369 loads removed
74 stores hoisted
74 stores removed
10 instructions sunk

Codesize measurements:

python3 ../utils/compare.py --filter-short --metric=size..text results_gvnhoist.json results_gvnhoist_base.json 
Tests: 200
Metric: size..text

Program                                        results_gvnhoist results_gvnhoist_base diff 
 test-suite...ks/VersaBench/8b10b/8b10b.test    1122             1218                  8.6%
 test-suite...Source/Benchmarks/sim/sim.test    16130            16658                 3.3%
 test-suite...ve-susan/automotive-susan.test    26338            26994                 2.5%
 test-suite...oxyApps-C/XSBench/XSBench.test    13378            13698                 2.4%
 test-suite...oxyApps-C/RSBench/rsbench.test    20946            21282                 1.6%
 test-suite...nchmarks/McCat/18-imp/imp.test    12770            12946                 1.4%
 test-suite...rks/tramp3d-v4/tramp3d-v4.test    804082           814498                1.3%
 test-suite...langs-C/unix-tbl/unix-tbl.test    31954            32338                 1.2%
 test-suite...enchmarks/Olden/em3d/em3d.test    4370             4418                  1.1%
 test-suite...ks/Prolangs-C/cdecl/cdecl.test    16194            16354                 1.0%
 test-suite...s/ASC_Sequoia/AMGmk/AMGmk.test    21330            21506                 0.8%
 test-suite...nchmarks/McCat/09-vor/vor.test    9522             9586                  0.7%
 test-suite...marks/SciMark2-C/scimark2.test    13090            13170                 0.6%
 test-suite...s/FreeBench/neural/neural.test    7874             7826                 -0.6%
 test-suite.../Trimaran/enc-rc4/enc-rc4.test    2818             2834                  0.6%
 Geomean difference                                                                    0.2%

Performance measurements: (Ubuntu 17.10 Intel(R) Core(TM) i7-4770 CPU 8x 3.40GHz with frequency scaling disabled)

test-suite/build$ python3 ../utils/compare.py --filter-short results_gvnhoist.json results_gvnhoist_base.json 
Tests: 200
Short Running: 114 (filtered out)
Remaining: 86
Metric: exec_time

Program                                        results_gvnhoist results_gvnhoist_base diff  
 test-suite...hmarks/VersaBench/bmm/bmm.test     1.51             1.26                -17.0% // This is bogus as the final binary for both base and diff were same. will rerun.
 test-suite...mbolics-flt/Symbolics-flt.test     0.69             0.74                 6.3% 
 test-suite...ce/Benchmarks/Olden/bh/bh.test     0.93             0.99                 6.2% 
 test-suite...lications/SIBsim4/SIBsim4.test     1.87             1.76                -6.2% 
 test-suite...mbolics-dbl/Symbolics-dbl.test     1.96             1.86                -5.3% 
 test-suite...lications/sqlite3/sqlite3.test     1.44             1.49                 3.7% 
 test-suite...ing-dbl/Equivalencing-dbl.test     1.37             1.32                -3.6% 
 test-suite...nchmarks/llubenchmark/llu.test     3.93             3.81                -3.1% 
 test-suite.../Applications/spiff/spiff.test     1.00             1.04                 3.1% 
 test-suite...ow-dbl/GlobalDataFlow-dbl.test     2.12             2.18                 2.8% 
 test-suite...CI_Purple/SMG2000/smg2000.test     1.33             1.37                 2.8% 
 test-suite...s/ASC_Sequoia/AMGmk/AMGmk.test     4.45             4.57                 2.7% 
 test-suite...ow-flt/GlobalDataFlow-flt.test     0.86             0.88                 2.5% 
 test-suite...lications/obsequi/Obsequi.test     1.03             1.05                 2.3% 
 test-suite...Source/Benchmarks/sim/sim.test     2.27             2.31                 1.8%

SPEC2k

Spec2006Int	ratio
400.perlbench	0.9969512195
401.bzip2	0.9900793651
403.gcc	1.027950311
429.mcf	1
445.gobmk	1.006315789
456.hmmer	1.005449591
458.sjeng	1.005825243
462.libquantum	1.028037383
464.h264ref	1.024667932
471.omnetpp	0.9437652812
Geomean 1.002625904
	

Spec2006FP	ratio
433.milc	1.151770658
444.namd	1.018518519
450.soplex	1.030888031
453.povray	1.028571429
470.lbm	1.069414317
482.sphinx3	0.988179669
Geomean	1.046631497

TODO:
Investigate regressions: https://github.com/google/hashtable-benchmarks
Potantial bugs: https://bugs.llvm.org/buglist.cgi?quicksearch=gvn-hoist&list_id=173451
Discussion on regression: https://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181001/591519.html
Look at https://github.com/google/hashtable-benchmarks which regressed with gvnhoist

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

hiraditya created this revision.Apr 17 2017, 2:34 PM

Herald added a subscriber: mehdi_amini. · View Herald TranscriptApr 17 2017, 2:34 PM

hiraditya added reviewers: • dberlin, sebpop.Apr 17 2017, 2:38 PM

Sink to immediate successors, update memory SSA when sinking.

hiraditya edited the summary of this revision. (Show Details)Apr 26 2017, 12:07 PM

mehdi_amini removed a subscriber: mehdi_amini.Apr 26 2017, 12:08 PM

mehdi_amini added inline comments.

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
140	Is it intended to be part of this patch?
482	Can you clarify why is it inserted here? It seems like a strange place to me

mehdi_amini added a subscriber: mehdi_amini.Apr 26 2017, 12:10 PM

hiraditya added inline comments.Apr 27 2017, 11:43 AM

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp
140	No, I'll remove this.
482	I'll remove this as well, this was for my testing.

labrinea added a subscriber: labrinea.Nov 13 2018, 7:14 AM

Herald added subscribers: mgrang, george.burgess.iv. · View Herald TranscriptNov 13 2018, 7:14 AM

Plesse try to add more reviewers. This seems to be very promising.

Herald added a project: Restricted Project. · View Herald TranscriptMar 26 2019, 4:00 PM

Herald added subscribers: asbirlea, jdoerfert. · View Herald Transcript

xbolva00 added reviewers: chandlerc, efriedma.Mar 26 2019, 4:01 PM

Working on porting this to latest llvm, will push the latest changes soon.

Rebase against master and use profitability to reduce liveness

hiraditya removed reviewers: • dberlin, chandlerc, efriedma.Oct 27 2019, 12:49 PM

Sink iteratively

lkail added a subscriber: lkail.Oct 28 2019, 7:11 AM

Sink iteratively.

The algorithm for sink instruction can be made more efficient
by having a worklist in reverse-DFSIn number and update after each sink.
The data structure to handle that may be inefficient but I'm not sure.

Remove VNs already handled to reduce redundant lookups.

Make MaxSinkChainLength a flag.

Remove unused variable and use function for updating local stats.

Add testcase for sinking

hiraditya edited the summary of this revision. (Show Details)Oct 28 2019, 9:44 PM

hiraditya edited the summary of this revision. (Show Details)Oct 28 2019, 9:47 PM

hiraditya edited the summary of this revision. (Show Details)

Hi @hiraditya , thanks for your work. I want to test your patch on PowerPC, however current patch seems unable to be applied to current master branch.

Two high level comments after a quick glance

gnv-sink & gvn-hoist are still disabled by default AFAIK and there are still a few known bugs that need to be addressed (https://bugs.llvm.org/buglist.cgi?quicksearch=gvn-hoist, https://bugs.llvm.org/buglist.cgi?quicksearch=gvn-sink). It would be good to guard this change by a new flag that's off by default, otherwise it might hinder work towards weeding out the existing bugs.
Do you think you would be able to use the MergeSet implementation from D57123 (I will rebase the patches ASAP)? Separating out the MergeSet changes should reduce the diff a bit, making reviewing it easier.

GVNHoist was recently reverted maily due to big regressions:
http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181001/591519.html

In D32140#1724853, @lkail wrote:

Hi @hiraditya , thanks for your work. I want to test your patch on PowerPC, however current patch seems unable to be applied to current master branch.

I'll rebase to master in coming days. Thanks!

In D32140#1725210, @fhahn wrote:

Two high level comments after a quick glance

gnv-sink & gvn-hoist are still disabled by default AFAIK and there are still a few known bugs that need to be addressed (https://bugs.llvm.org/buglist.cgi?quicksearch=gvn-hoist, https://bugs.llvm.org/buglist.cgi?quicksearch=gvn-sink). It would be good to guard this change by a new flag that's off by default, otherwise it might hinder work towards weeding out the existing bugs.

Thanks for the link, I'll look at the bugs and try to fix relevant ones

Do you think you would be able to use the MergeSet implementation from D57123 (I will rebase the patches ASAP)? Separating out the MergeSet changes should reduce the diff a bit, making reviewing it easier.

Sure, that would help simplify the code here. I haven't looked at the implementation (D57123) in detail but I'll follow up after it is rebased.

In D32140#1725238, @xbolva00 wrote:

GVNHoist was recently reverted maily due to big regressions:
http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20181001/591519.html

Thanks for the link, i'll see if we can tune this pass based on the benchmarks posted there. This optimization does have cost model that would reduce register pressure so it shouldn't be worse compared to the current GVNHoist.

hiraditya edited the summary of this revision. (Show Details)Oct 29 2019, 1:44 PM

hiraditya edited the summary of this revision. (Show Details)Oct 30 2019, 10:01 AM

Rebase against master.

https://bugs.llvm.org/show_bug.cgi?id=42283 has been fixed.

hiraditya edited the summary of this revision. (Show Details)Apr 18 2022, 2:07 PM

Herald added a project: Restricted Project. · View Herald TranscriptApr 18 2022, 2:07 PM

Herald added subscribers: ormris, pengfei. · View Herald Transcript

hiraditya edited the summary of this revision. (Show Details)Apr 18 2022, 2:08 PM

hiraditya marked an inline comment as done.Apr 27 2022, 5:13 PM

hiraditya marked an inline comment as done.

Revision Contents

Path

Size

llvm/

include/

llvm/

Analysis/

MemorySSA.h

22 lines

lib/

Transforms/

IPO/

PassManagerBuilder.cpp

4 lines

Scalar/

GVNHoist.cpp

2286 lines

test/

Transforms/

GVNHoist/

broken-mod-sink.ll

100 lines

dj-edge-detect-das-taco.ll

62 lines

108 lines

49 lines

2 lines

2 lines

54 lines

2 lines

2 lines

hoist-recursive-geps.ll

2 lines

hoist-very-busy.ll

22 lines

hoist.ll

2 lines

infinite-loop-direct.ll

2 lines

infinite-loop-indirect.ll

2 lines

int_sideeffect.ll

2 lines

pr37445.ll

2 lines

sink-load-dom-numbering.ll

58 lines

sink-load-with-memory-phi-1.ll

57 lines

sink-load-with-memory-phi-2.ll

45 lines

sink-mul-with-liveout.ll

35 lines

Diff 227372

llvm/include/llvm/Analysis/MemorySSA.h

Show First 20 Lines • Show All 254 Lines • ▼ Show 20 Lines	public:

/// Get the access that produces the memory state used by this Use.		/// Get the access that produces the memory state used by this Use.
MemoryAccess *getDefiningAccess() const { return getOperand(0); }		MemoryAccess *getDefiningAccess() const { return getOperand(0); }

static bool classof(const Value *MA) {		static bool classof(const Value *MA) {
return MA->getValueID() == MemoryUseVal \|\| MA->getValueID() == MemoryDefVal;		return MA->getValueID() == MemoryUseVal \|\| MA->getValueID() == MemoryDefVal;
}		}

		void setDefiningAccess(MemoryAccess *DMA, bool Optimized = false,
		Optional<AliasResult> AR = MayAlias) {
		if (!Optimized) {
		setOperand(0, DMA);
		return;
		}
		setOptimized(DMA);
		setOptimizedAccessType(AR);
		}

// Sadly, these have to be public because they are needed in some of the		// Sadly, these have to be public because they are needed in some of the
// iterators.		// iterators.
inline bool isOptimized() const;		inline bool isOptimized() const;
inline MemoryAccess *getOptimized() const;		inline MemoryAccess *getOptimized() const;
inline void setOptimized(MemoryAccess *);		inline void setOptimized(MemoryAccess *);

// Retrieve AliasResult type of the optimized access. Ideally this would be		// Retrieve AliasResult type of the optimized access. Ideally this would be
// returned by the caching walker and may go away in the future.		// returned by the caching walker and may go away in the future.
Show All 20 Lines	protected:

// Use deleteValue() to delete a generic MemoryUseOrDef.		// Use deleteValue() to delete a generic MemoryUseOrDef.
~MemoryUseOrDef() = default;		~MemoryUseOrDef() = default;

void setOptimizedAccessType(Optional<AliasResult> AR) {		void setOptimizedAccessType(Optional<AliasResult> AR) {
OptimizedAccessAlias = AR;		OptimizedAccessAlias = AR;
}		}

void setDefiningAccess(MemoryAccess *DMA, bool Optimized = false,
Optional<AliasResult> AR = MayAlias) {
if (!Optimized) {
setOperand(0, DMA);
return;
}
setOptimized(DMA);
setOptimizedAccessType(AR);
}

private:		private:
Instruction *MemoryInstruction;		Instruction *MemoryInstruction;
Optional<AliasResult> OptimizedAccessAlias;		Optional<AliasResult> OptimizedAccessAlias;
};		};

/// Represents read-only accesses to memory		/// Represents read-only accesses to memory
///		///
/// In particular, the set of Instructions that will be represented by		/// In particular, the set of Instructions that will be represented by
▲ Show 20 Lines • Show All 464 Lines • ▼ Show 20 Lines	public:

/// Verify that MemorySSA is self consistent (IE definitions dominate		/// Verify that MemorySSA is self consistent (IE definitions dominate
/// all uses, uses appear in the right places). This is used by unit tests.		/// all uses, uses appear in the right places). This is used by unit tests.
void verifyMemorySSA() const;		void verifyMemorySSA() const;

/// Used in various insertion functions to specify whether we are talking		/// Used in various insertion functions to specify whether we are talking
/// about the beginning or end of a block.		/// about the beginning or end of a block.
enum InsertionPlace { Beginning, End };		enum InsertionPlace { Beginning, End };
		MemoryPhi createMemoryPhi(BasicBlock BB);

protected:		protected:
// Used by Memory SSA annotater, dumpers, and wrapper pass		// Used by Memory SSA annotater, dumpers, and wrapper pass
friend class MemorySSAAnnotatedWriter;		friend class MemorySSAAnnotatedWriter;
friend class MemorySSAPrinterLegacyPass;		friend class MemorySSAPrinterLegacyPass;
friend class MemorySSAUpdater;		friend class MemorySSAUpdater;

void verifyPrevDefInPhis(Function &F) const;		void verifyPrevDefInPhis(Function &F) const;
▲ Show 20 Lines • Show All 52 Lines • ▼ Show 20 Lines	private:

using AccessMap = DenseMap<const BasicBlock *, std::unique_ptr<AccessList>>;		using AccessMap = DenseMap<const BasicBlock *, std::unique_ptr<AccessList>>;
using DefsMap = DenseMap<const BasicBlock *, std::unique_ptr<DefsList>>;		using DefsMap = DenseMap<const BasicBlock *, std::unique_ptr<DefsList>>;

void		void
determineInsertionPoint(const SmallPtrSetImpl<BasicBlock *> &DefiningBlocks);		determineInsertionPoint(const SmallPtrSetImpl<BasicBlock *> &DefiningBlocks);
void markUnreachableAsLiveOnEntry(BasicBlock *BB);		void markUnreachableAsLiveOnEntry(BasicBlock *BB);
bool dominatesUse(const MemoryAccess , const MemoryAccess ) const;		bool dominatesUse(const MemoryAccess , const MemoryAccess ) const;
MemoryPhi createMemoryPhi(BasicBlock BB);
template <typename AliasAnalysisType>		template <typename AliasAnalysisType>
MemoryUseOrDef createNewAccess(Instruction , AliasAnalysisType *,		MemoryUseOrDef createNewAccess(Instruction , AliasAnalysisType *,
const MemoryUseOrDef *Template = nullptr);		const MemoryUseOrDef *Template = nullptr);
MemoryAccess findDominatingDef(BasicBlock , enum InsertionPlace);		MemoryAccess findDominatingDef(BasicBlock , enum InsertionPlace);
void placePHINodes(const SmallPtrSetImpl<BasicBlock *> &);		void placePHINodes(const SmallPtrSetImpl<BasicBlock *> &);
MemoryAccess renameBlock(BasicBlock , MemoryAccess *, bool);		MemoryAccess renameBlock(BasicBlock , MemoryAccess *, bool);
void renameSuccessorPhis(BasicBlock , MemoryAccess , bool);		void renameSuccessorPhis(BasicBlock , MemoryAccess , bool);
void renamePass(DomTreeNode , MemoryAccess IncomingVal,		void renamePass(DomTreeNode , MemoryAccess IncomingVal,
▲ Show 20 Lines • Show All 450 Lines • Show Last 20 Lines

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

Show First 20 Lines • Show All 131 Lines • ▼ Show 20 Lines	static cl::opt<bool> EnableGVNSink(
"enable-gvn-sink", cl::init(false), cl::Hidden,		"enable-gvn-sink", cl::init(false), cl::Hidden,
cl::desc("Enable the GVN sinking pass (default = off)"));		cl::desc("Enable the GVN sinking pass (default = off)"));

// This option is used in simplifying testing SampleFDO optimizations for		// This option is used in simplifying testing SampleFDO optimizations for
// profile loading.		// profile loading.
static cl::opt<bool>		static cl::opt<bool>
EnableCHR("enable-chr", cl::init(true), cl::Hidden,		EnableCHR("enable-chr", cl::init(true), cl::Hidden,
cl::desc("Enable control height reduction optimization (CHR)"));		cl::desc("Enable control height reduction optimization (CHR)"));

		mehdi_aminiUnsubmitted Done Reply Inline Actions Is it intended to be part of this patch? mehdi_amini: Is it intended to be part of this patch?
		hiradityaAuthorUnsubmitted Done Reply Inline Actions No, I'll remove this. hiraditya: No, I'll remove this.
cl::opt<bool> FlattenedProfileUsed(		cl::opt<bool> FlattenedProfileUsed(
"flattened-profile-used", cl::init(false), cl::Hidden,		"flattened-profile-used", cl::init(false), cl::Hidden,
cl::desc("Indicate the sample profile being used is flattened, i.e., "		cl::desc("Indicate the sample profile being used is flattened, i.e., "
"no inline hierachy exists in the profile. "));		"no inline hierachy exists in the profile. "));

cl::opt<bool> EnableOrderFileInstrumentation(		cl::opt<bool> EnableOrderFileInstrumentation(
"enable-order-file-instrumentation", cl::init(false), cl::Hidden,		"enable-order-file-instrumentation", cl::init(false), cl::Hidden,
cl::desc("Enable order file instrumentation (default = off)"));		cl::desc("Enable order file instrumentation (default = off)"));
▲ Show 20 Lines • Show All 232 Lines • ▼ Show 20 Lines	void PassManagerBuilder::addFunctionSimplificationPasses(

// Unroll small loops		// Unroll small loops
MPM.add(createSimpleLoopUnrollPass(OptLevel, DisableUnrollLoops,		MPM.add(createSimpleLoopUnrollPass(OptLevel, DisableUnrollLoops,
ForgetAllSCEVInLoopUnroll));		ForgetAllSCEVInLoopUnroll));
addExtensionsToPM(EP_LoopOptimizerEnd, MPM);		addExtensionsToPM(EP_LoopOptimizerEnd, MPM);
// This ends the loop pass pipelines.		// This ends the loop pass pipelines.

if (OptLevel > 1) {		if (OptLevel > 1) {
		if (EnableGVNHoist)
		MPM.add(createGVNHoistPass());
MPM.add(createMergedLoadStoreMotionPass()); // Merge ld/st in diamonds		MPM.add(createMergedLoadStoreMotionPass()); // Merge ld/st in diamonds
MPM.add(NewGVN ? createNewGVNPass()		MPM.add(NewGVN ? createNewGVNPass()
: createGVNPass(DisableGVNLoadPRE)); // Remove redundancies		: createGVNPass(DisableGVNLoadPRE)); // Remove redundancies
		if (EnableGVNHoist)
		MPM.add(createGVNHoistPass());
}		}
MPM.add(createMemCpyOptPass()); // Remove memcpy / form memset		MPM.add(createMemCpyOptPass()); // Remove memcpy / form memset
MPM.add(createSCCPPass()); // Constant prop with SCCP		MPM.add(createSCCPPass()); // Constant prop with SCCP

// Delete dead bit computations (instcombine runs after to fold away the dead		// Delete dead bit computations (instcombine runs after to fold away the dead
// computations, and then ADCE will run later to exploit any new DCE		// computations, and then ADCE will run later to exploit any new DCE
// opportunities that creates).		// opportunities that creates).
MPM.add(createBitTrackingDCEPass()); // Delete dead bit computations		MPM.add(createBitTrackingDCEPass()); // Delete dead bit computations
▲ Show 20 Lines • Show All 70 Lines • ▼ Show 20 Lines	if (OptLevel == 0) {

addExtensionsToPM(EP_EnabledOnOptLevel0, MPM);		addExtensionsToPM(EP_EnabledOnOptLevel0, MPM);

if (PrepareForLTO \|\| PrepareForThinLTO) {		if (PrepareForLTO \|\| PrepareForThinLTO) {
MPM.add(createCanonicalizeAliasesPass());		MPM.add(createCanonicalizeAliasesPass());
// Rename anon globals to be able to export them in the summary.		// Rename anon globals to be able to export them in the summary.
// This has to be done after we add the extensions to the pass manager		// This has to be done after we add the extensions to the pass manager
// as there could be passes (e.g. Adddress sanitizer) which introduce		// as there could be passes (e.g. Adddress sanitizer) which introduce
// new unnamed globals.		// new unnamed globals.
		mehdi_aminiUnsubmitted Done Reply Inline Actions Can you clarify why is it inserted here? It seems like a strange place to me mehdi_amini: Can you clarify why is it inserted here? It seems like a strange place to me
		hiradityaAuthorUnsubmitted Done Reply Inline Actions I'll remove this as well, this was for my testing. hiraditya: I'll remove this as well, this was for my testing.
MPM.add(createNameAnonGlobalPass());		MPM.add(createNameAnonGlobalPass());
}		}
return;		return;
}		}

// Add LibraryInfo if we have some.		// Add LibraryInfo if we have some.
if (LibraryInfo)		if (LibraryInfo)
MPM.add(new TargetLibraryInfoWrapperPass(*LibraryInfo));		MPM.add(new TargetLibraryInfoWrapperPass(*LibraryInfo));
▲ Show 20 Lines • Show All 650 Lines • Show Last 20 Lines

llvm/lib/Transforms/Scalar/GVNHoist.cpp

Show All 25 Lines
// they are already handled by GVN-PRE. It is advisable to run gvn-hoist before		// they are already handled by GVN-PRE. It is advisable to run gvn-hoist before
// and after gvn-pre because gvn-pre creates opportunities for more instructions		// and after gvn-pre because gvn-pre creates opportunities for more instructions
// to be hoisted.		// to be hoisted.
//		//
// Hoisting may affect the performance in some cases. To mitigate that, hoisting		// Hoisting may affect the performance in some cases. To mitigate that, hoisting
// is disabled in the following cases.		// is disabled in the following cases.
// 1. Scalars across calls.		// 1. Scalars across calls.
// 2. geps when corresponding load/store cannot be hoisted.		// 2. geps when corresponding load/store cannot be hoisted.
		//
		// TODO:
		// For -Oz scalars are always safe to hoist => NO.
		// For -O2/-O3 hoist only when the live range improves or remains the same.
		// If we haven't computed dominator tree levels, do so now.
		// For sink operation anything after hoist barrier is okay to sink
		// but nothing before the hoist barrier. See Danny's patch on HoistBarriers.
		//
		// Find sink opportunities: sink instructions to a common post dominator when
		// they compute the same value and hoisting isn't profitable because that will
		// increase the liveness. In this case sinking may decrease the liveness when
		// the instruction can be delayed. But do not delay so much that latency of the
		// operation takes over. Like divide takes 10 cycles maybe, and if the divide is
		// sunk into a BB where it is used then the result of divide may take long time
		// anyways. So an estimated latency of operation should be computed and the
		// instruction should be sunk only till the point when result may be readily
		// available for the user of the instruction.
		//
		// Sinking store may not always be beneficial: sinking frees up a register so
		// improves register allocation the stored result may be loaded again, in that
		// case it is good to store soon (??)
		//
		// TODO: Hoist from >2 successors. Currently GVNHoist will not hoist stores
		// in this case because it works on two instructions at a time.
		// entry:
		// switch i32 %c1, label %exit1 [
		// i32 0, label %sw0
		// i32 1, label %sw1
		// ]
		//
		// sw0:
		// store i32 1, i32* @G
		// br label %exit
		//
		// sw1:
		// store i32 1, i32* @G
		// br label %exit
		//
		// exit1:
		// store i32 1, i32* @G
		// ret void
		// exit:
		// ret void
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/ADT/DenseSet.h"		#include "llvm/ADT/DenseSet.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/ADT/iterator_range.h"		#include "llvm/ADT/iterator_range.h"
#include "llvm/Analysis/AliasAnalysis.h"		#include "llvm/Analysis/AliasAnalysis.h"
#include "llvm/Analysis/GlobalsModRef.h"		#include "llvm/Analysis/GlobalsModRef.h"
#include "llvm/Analysis/IteratedDominanceFrontier.h"		#include "llvm/Analysis/IteratedDominanceFrontier.h"
#include "llvm/Analysis/MemoryDependenceAnalysis.h"		#include "llvm/Analysis/MemoryDependenceAnalysis.h"
#include "llvm/Analysis/MemorySSA.h"		#include "llvm/Analysis/MemorySSA.h"
#include "llvm/Analysis/MemorySSAUpdater.h"		#include "llvm/Analysis/MemorySSAUpdater.h"
#include "llvm/Analysis/PostDominators.h"		#include "llvm/Analysis/PostDominators.h"
#include "llvm/Transforms/Utils/Local.h"
#include "llvm/Analysis/ValueTracking.h"		#include "llvm/Analysis/ValueTracking.h"
#include "llvm/IR/Argument.h"		#include "llvm/IR/Argument.h"
#include "llvm/IR/BasicBlock.h"		#include "llvm/IR/BasicBlock.h"
#include "llvm/IR/CFG.h"		#include "llvm/IR/CFG.h"
#include "llvm/IR/Constants.h"		#include "llvm/IR/Constants.h"
#include "llvm/IR/Dominators.h"		#include "llvm/IR/Dominators.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
#include "llvm/IR/InstrTypes.h"		#include "llvm/IR/InstrTypes.h"
#include "llvm/IR/Instruction.h"		#include "llvm/IR/Instruction.h"
#include "llvm/IR/Instructions.h"		#include "llvm/IR/Instructions.h"
#include "llvm/IR/IntrinsicInst.h"		#include "llvm/IR/IntrinsicInst.h"
#include "llvm/IR/Intrinsics.h"		#include "llvm/IR/Intrinsics.h"
#include "llvm/IR/LLVMContext.h"		#include "llvm/IR/LLVMContext.h"
#include "llvm/IR/PassManager.h"		#include "llvm/IR/PassManager.h"
#include "llvm/IR/Use.h"		#include "llvm/IR/Use.h"
#include "llvm/IR/User.h"		#include "llvm/IR/User.h"
#include "llvm/IR/Value.h"		#include "llvm/IR/Value.h"
#include "llvm/Pass.h"		#include "llvm/Pass.h"
#include "llvm/Support/Casting.h"		#include "llvm/Support/Casting.h"
#include "llvm/Support/CommandLine.h"		#include "llvm/Support/CommandLine.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/raw_ostream.h"		#include "llvm/Support/raw_ostream.h"
#include "llvm/Transforms/Scalar.h"		#include "llvm/Transforms/Scalar.h"
#include "llvm/Transforms/Scalar/GVN.h"		#include "llvm/Transforms/Scalar/GVN.h"
		#include "llvm/Transforms/Utils/Local.h"
#include <algorithm>		#include <algorithm>
#include <cassert>		#include <cassert>
#include <iterator>		#include <iterator>
#include <memory>		#include <memory>
#include <utility>		#include <utility>
#include <vector>		#include <vector>

using namespace llvm;		using namespace llvm;

#define DEBUG_TYPE "gvn-hoist"		#define DEBUG_TYPE "gvn-hoist"

STATISTIC(NumHoisted, "Number of instructions hoisted");		STATISTIC(NumHoisted, "Number of instructions hoisted");
		STATISTIC(NumSunk, "Number of instructions sunk");
STATISTIC(NumRemoved, "Number of instructions removed");		STATISTIC(NumRemoved, "Number of instructions removed");
STATISTIC(NumLoadsHoisted, "Number of loads hoisted");		STATISTIC(NumLoadsHoisted, "Number of loads hoisted");
STATISTIC(NumLoadsRemoved, "Number of loads removed");		STATISTIC(NumLoadsRemoved, "Number of loads removed");
STATISTIC(NumStoresHoisted, "Number of stores hoisted");		STATISTIC(NumStoresHoisted, "Number of stores hoisted");
STATISTIC(NumStoresRemoved, "Number of stores removed");		STATISTIC(NumStoresRemoved, "Number of stores removed");
STATISTIC(NumCallsHoisted, "Number of calls hoisted");		STATISTIC(NumCallsHoisted, "Number of calls hoisted");
STATISTIC(NumCallsRemoved, "Number of calls removed");		STATISTIC(NumCallsRemoved, "Number of calls removed");
		// STATISTIC(NumBarriers, "Number of barriers");

static cl::opt<int>		static cl::opt<int>
MaxHoistedThreshold("gvn-max-hoisted", cl::Hidden, cl::init(-1),		MaxHoistedThreshold("gvn-max-hoisted", cl::Hidden, cl::init(-1),
cl::desc("Max number of instructions to hoist "		cl::desc("Max number of instructions to hoist "
"(default unlimited = -1)"));		"(default unlimited = -1)"));

static cl::opt<int> MaxNumberOfBBSInPath(		static cl::opt<int> MaxNumberOfBBSInPath(
"gvn-hoist-max-bbs", cl::Hidden, cl::init(4),		"gvn-hoist-max-bbs", cl::Hidden, cl::init(4),
cl::desc("Max number of basic blocks on the path between "		cl::desc("Max number of basic blocks on the path between "
"hoisting locations (default = 4, unlimited = -1)"));		"hoisting locations (default = 4, unlimited = -1)"));

static cl::opt<int> MaxDepthInBB(		static cl::opt<int> MaxDepthInBB(
"gvn-hoist-max-depth", cl::Hidden, cl::init(100),		"gvn-hoist-max-depth", cl::Hidden, cl::init(100),
cl::desc("Hoist instructions from the beginning of the BB up to the "		cl::desc("Hoist instructions from the beginning of the BB up to the "
"maximum specified depth (default = 100, unlimited = -1)"));		"maximum specified depth (default = 100, unlimited = -1)"));

static cl::opt<int>		static cl::opt<int>
MaxChainLength("gvn-hoist-max-chain-length", cl::Hidden, cl::init(10),		MaxHoistChainLength("gvn-hoist-max-chain-length", cl::Hidden, cl::init(10),
cl::desc("Maximum length of dependent chains to hoist "		cl::desc("Maximum length of dependent chains to hoist "
"(default = 10, unlimited = -1)"));		"(default = 10, unlimited = -1)"));
		static cl::opt<int>
		MaxSinkChainLength("gvn-hoist-max-sink-chain", cl::Hidden, cl::init(3),
		cl::desc("Maximum length of dependent chains to sink "
		"(default = 3, unlimited = -1)"));
		static cl::opt<bool> CheckHoistProfitability(
		"gvn-hoist-check-profitability", cl::Hidden, cl::init(true),
		cl::desc("Check for proitability (reducing register pressure)"));

		static cl::opt<bool> CheckSinkProfitability(
		"gvn-sink-check-profitability", cl::Hidden, cl::init(true),
		cl::desc("Check for proitability (reducing register pressure)"));

namespace llvm {		namespace llvm {

using BBSideEffectsSet = DenseMap<const BasicBlock *, bool>;		using BBSideEffectsSet = DenseMap<const BasicBlock *, bool>;
using SmallVecInsn = SmallVector<Instruction *, 4>;		using SmallVecInsn = SmallVector<Instruction *, 4>;
using SmallVecImplInsn = SmallVectorImpl<Instruction *>;		using SmallVecImplInsn = SmallVectorImpl<Instruction *>;
		using SmallVecVal = SmallVector<Value *, 2>;
		using SmallVecImplVal = SmallVectorImpl<Value *>;

// Each element of a hoisting list contains the basic block where to hoist and		// Each element of a hoisting list contains the basic block where to hoist and
// a list of instructions to be hoisted.		// a list of instructions to be hoisted.
using HoistingPointInfo = std::pair<BasicBlock *, SmallVecInsn>;		using HoistingPointInfo = std::pair<BasicBlock *, SmallVecInsn>;

using HoistingPointList = SmallVector<HoistingPointInfo, 4>;		using HoistingPointList = SmallVector<HoistingPointInfo, 4>;

// A map from a pair of VNs to all the instructions with those VNs.		// A map from a pair of VNs to all the instructions with those VNs.
using VNType = std::pair<unsigned, unsigned>;		using VNType = std::pair<unsigned, unsigned>;

using VNtoInsns = DenseMap<VNType, SmallVector<Instruction *, 4>>;		using VNtoInsns = DenseMap<VNType, SmallVector<Instruction *, 4>>;

		typedef SmallSet<BasicBlock *, 2> SmallSetBB;
		typedef DenseMap<BasicBlock *, SmallSetBB> MergeSetT;
		typedef SmallVector<BasicBlock *, 4> SmallVecBB;
		typedef SmallVecBB BBLevelKeyT;
		typedef std::map<unsigned, BBLevelKeyT> BBLevelT;
		typedef DenseMap<BasicBlock *, unsigned> DomLevelsT;
		typedef std::pair<BasicBlock , BasicBlock > EdgeT;

// CHI keeps information about values flowing out of a basic block. It is		// CHI keeps information about values flowing out of a basic block. It is
// similar to PHI but in the inverse graph, and used for outgoing values on each		// similar to PHI but in the inverse graph, and used for outgoing values on each
// edge. For conciseness, it is computed only for instructions with multiple		// edge. For conciseness, it is computed only for instructions with multiple
// occurrences in the CFG because they are the only hoistable candidates.		// occurrences in the CFG because they are the only hoistable candidates.
// A (CHI[{V, B, I1}, {V, C, I2}]		// A (CHI[{V, B, I1}, {V, C, I2}]
// / \		// / \
// / \		// / \
// B(I1) C (I2)		// B(I1) C (I2)
Show All 12 Lines	struct CHIArg {
bool operator!=(const CHIArg &A) { return !(*this == A); }		bool operator!=(const CHIArg &A) { return !(*this == A); }
};		};

using CHIIt = SmallVectorImpl<CHIArg>::iterator;		using CHIIt = SmallVectorImpl<CHIArg>::iterator;
using CHIArgs = iterator_range<CHIIt>;		using CHIArgs = iterator_range<CHIIt>;
using OutValuesType = DenseMap<BasicBlock *, SmallVector<CHIArg, 2>>;		using OutValuesType = DenseMap<BasicBlock *, SmallVector<CHIArg, 2>>;
using InValuesType =		using InValuesType =
DenseMap<BasicBlock , SmallVector<std::pair<VNType, Instruction >, 2>>;		DenseMap<BasicBlock , SmallVector<std::pair<VNType, Instruction >, 2>>;
		typedef DenseMap<VNType, SmallVector<Instruction *, 2>> RenameStackType;

// An invalid value number Used when inserting a single value number into		// An invalid value number Used when inserting a single value number into
// VNtoInsns.		// VNtoInsns.
enum : unsigned { InvalidVN = ~2U };		enum : unsigned { InvalidVN = ~2U };

// Records all scalar instructions candidate for code hoisting.		// Records all scalar instructions candidate for code hoisting.
class InsnInfo {		class InsnInfo {
VNtoInsns VNtoScalars;		VNtoInsns VNtoScalars;

public:		public:
// Inserts I and its value number in VNtoScalars.		// Inserts I and its value number in VNtoScalars.
void insert(Instruction *I, GVN::ValueTable &VN) {		void insert(Instruction *I, GVN::ValueTable &VN) {
// Scalar instruction.		// Scalar instruction.
unsigned V = VN.lookupOrAdd(I);		unsigned V = VN.lookupOrAdd(I);
VNtoScalars[{V, InvalidVN}].push_back(I);		VNtoScalars[{V, InvalidVN}].push_back(I);
}		}

		void clear() { VNtoScalars.clear(); }

		VNtoInsns &getVNTable() { return VNtoScalars; }
const VNtoInsns &getVNTable() const { return VNtoScalars; }		const VNtoInsns &getVNTable() const { return VNtoScalars; }
		void removeVN(unsigned V) { VNtoScalars.erase({V, InvalidVN}); }
};		};

// Records all load instructions candidate for code hoisting.		// Records all load instructions candidate for code hoisting.
class LoadInfo {		class LoadInfo {
VNtoInsns VNtoLoads;		VNtoInsns VNtoLoads;

public:		public:
// Insert Load and the value number of its memory address in VNtoLoads.		// Insert Load and the value number of its memory address in VNtoLoads.
void insert(LoadInst *Load, GVN::ValueTable &VN) {		void insert(LoadInst *Load, GVN::ValueTable &VN) {
if (Load->isSimple()) {		if (Load->isSimple()) {
unsigned V = VN.lookupOrAdd(Load->getPointerOperand());		unsigned V = VN.lookupOrAdd(Load->getPointerOperand());
VNtoLoads[{V, InvalidVN}].push_back(Load);		VNtoLoads[{V, InvalidVN}].push_back(Load);
}		}
}		}

		void clear() { VNtoLoads.clear(); }
		VNtoInsns &getVNTable() { return VNtoLoads; }
const VNtoInsns &getVNTable() const { return VNtoLoads; }		const VNtoInsns &getVNTable() const { return VNtoLoads; }
		void removeVN(unsigned V) { VNtoLoads.erase({V, InvalidVN}); }
};		};

// Records all store instructions candidate for code hoisting.		// Records all store instructions candidate for code hoisting.
class StoreInfo {		class StoreInfo {
VNtoInsns VNtoStores;		VNtoInsns VNtoStores;

public:		public:
// Insert the Store and a hash number of the store address and the stored		// Insert the Store and a hash number of the store address and the stored
// value in VNtoStores.		// value in VNtoStores.
void insert(StoreInst *Store, GVN::ValueTable &VN) {		void insert(StoreInst *Store, GVN::ValueTable &VN) {
if (!Store->isSimple())		if (!Store->isSimple())
return;		return;
// Hash the store address and the stored value.		// Hash the store address and the stored value.
Value *Ptr = Store->getPointerOperand();		Value *Ptr = Store->getPointerOperand();
Value *Val = Store->getValueOperand();		Value *Val = Store->getValueOperand();
VNtoStores[{VN.lookupOrAdd(Ptr), VN.lookupOrAdd(Val)}].push_back(Store);		VNtoStores[{VN.lookupOrAdd(Ptr), VN.lookupOrAdd(Val)}].push_back(Store);
}		}

		void clear() { VNtoStores.clear(); }

		VNtoInsns &getVNTable() { return VNtoStores; }
const VNtoInsns &getVNTable() const { return VNtoStores; }		const VNtoInsns &getVNTable() const { return VNtoStores; }
		void removeVN(unsigned V) { VNtoStores.erase({V, InvalidVN}); }
};		};

// Records all call instructions candidate for code hoisting.		// Records all call instructions candidate for code hoisting.
class CallInfo {		class CallInfo {
VNtoInsns VNtoCallsScalars;		VNtoInsns VNtoCallsScalars;
VNtoInsns VNtoCallsLoads;		VNtoInsns VNtoCallsLoads;
VNtoInsns VNtoCallsStores;		VNtoInsns VNtoCallsStores;

Show All 9 Lines	void insert(CallInst *Call, GVN::ValueTable &VN) {
if (Call->doesNotAccessMemory())		if (Call->doesNotAccessMemory())
VNtoCallsScalars[Entry].push_back(Call);		VNtoCallsScalars[Entry].push_back(Call);
else if (Call->onlyReadsMemory())		else if (Call->onlyReadsMemory())
VNtoCallsLoads[Entry].push_back(Call);		VNtoCallsLoads[Entry].push_back(Call);
else		else
VNtoCallsStores[Entry].push_back(Call);		VNtoCallsStores[Entry].push_back(Call);
}		}

		void clear() {
		VNtoCallsScalars.clear();
		VNtoCallsLoads.clear();
		VNtoCallsStores.clear();
		}

		VNtoInsns &getScalarVNTable() { return VNtoCallsScalars; }
		VNtoInsns &getLoadVNTable() { return VNtoCallsLoads; }
		VNtoInsns &getStoreVNTable() { return VNtoCallsStores; }
const VNtoInsns &getScalarVNTable() const { return VNtoCallsScalars; }		const VNtoInsns &getScalarVNTable() const { return VNtoCallsScalars; }
const VNtoInsns &getLoadVNTable() const { return VNtoCallsLoads; }		const VNtoInsns &getLoadVNTable() const { return VNtoCallsLoads; }
const VNtoInsns &getStoreVNTable() const { return VNtoCallsStores; }		const VNtoInsns &getStoreVNTable() const { return VNtoCallsStores; }

		void removeVN(unsigned V) {
		if (!VNtoCallsScalars.erase({V, InvalidVN}))
		if (!VNtoCallsLoads.erase({V, InvalidVN}))
		VNtoCallsStores.erase({V, InvalidVN});
		}
};		};

static void combineKnownMetadata(Instruction ReplInst, Instruction I) {		static void combineKnownMetadata(Instruction ReplInst, Instruction I) {
static const unsigned KnownIDs[] = {		static const unsigned KnownIDs[] = {LLVMContext::MD_tbaa,
LLVMContext::MD_tbaa, LLVMContext::MD_alias_scope,		LLVMContext::MD_alias_scope,
LLVMContext::MD_noalias, LLVMContext::MD_range,		LLVMContext::MD_noalias,
LLVMContext::MD_fpmath, LLVMContext::MD_invariant_load,		LLVMContext::MD_range,
LLVMContext::MD_invariant_group, LLVMContext::MD_access_group};		LLVMContext::MD_fpmath,
		LLVMContext::MD_invariant_load,
		LLVMContext::MD_invariant_group,
		LLVMContext::MD_access_group};
combineMetadata(ReplInst, I, KnownIDs, true);		combineMetadata(ReplInst, I, KnownIDs, true);
}		}

		void printBBLevels(const BBLevelT &BBLevels) {
		for (const std::pair<unsigned, BBLevelKeyT> &P : BBLevels) {
		dbgs() << "\nLevel: " << P.first << "\n";
		for (const BasicBlock *BB : P.second)
		dbgs() << *BB << "\n";
		}
		}

		void printMergeSet(const MergeSetT &M) {
		// For printing in a deterministic order.
		typedef std::set<const BasicBlock *> SetConstBB;
		auto cmp = [](const BasicBlock A, const BasicBlock B) {
		return A->getName() < B->getName();
		};
		std::map<BasicBlock *, SetConstBB, decltype(cmp)> PrintM(cmp);
		for (const std::pair<BasicBlock *, SmallSetBB> &P : M) {
		for (const BasicBlock *BB : P.second)
		PrintM[P.first].insert(BB);
		}
		for (const std::pair<BasicBlock *, SetConstBB> &P : PrintM) {
		dbgs() << "\nMergeSet of: " << P.first->getName() << ": ";
		for (const BasicBlock *BB : P.second)
		dbgs() << BB->getName() << ", ";
		}
		}

		void printJEdges(const DenseSet<EdgeT> &Edges) {
		auto cmp = [](const EdgeT &A, const EdgeT &B) {
		return A.first->getName() < B.first->getName();
		};
		// For printing in a deterministic order.
		std::set<EdgeT, decltype(cmp)> PrintE(Edges.begin(), Edges.end(), cmp);

		for (const EdgeT &E : PrintE)
		dbgs() << "\nFound a JEdge: " << E.first->getName() << " -> "
		<< E.second->getName();
		}

		void printSmallSet(SmallSetBB &S) {
		dbgs() << "\nPrinting SmallSet: ";
		for (const auto &BB : S)
		dbgs() << BB->getName() << ",";
		}

		static inline void updateLocalStats(const Instruction *Repl, unsigned &NI,
		unsigned &NL, unsigned &NS, unsigned &NC) {
		if (isa<LoadInst>(Repl))
		++NL;
		else if (isa<StoreInst>(Repl))
		++NS;
		else if (isa<CallInst>(Repl))
		++NC;
		else // Scalar
		++NI;
		}

		static inline void updateHoistStats(unsigned NI, unsigned NL, unsigned NS,
		unsigned NC, unsigned NR) {
		NumHoisted += NL + NS + NC + NI;
		NumRemoved += NR;
		NumLoadsHoisted += NL;
		NumStoresHoisted += NS;
		NumCallsHoisted += NC;
		}

// This pass hoists common computations across branches sharing common		// This pass hoists common computations across branches sharing common
// dominator. The primary goal is to reduce the code size, and in some		// dominator. The primary goal is to reduce the code size, and in some
// cases reduce critical path (by exposing more ILP).		// cases reduce critical path (by exposing more ILP).
class GVNHoist {		class GVNHoist {
public:		public:
		// Each element of a hoisting list contains the basic block where to hoist and
		// a list of instructions to be hoisted.
		typedef std::pair<BasicBlock *, SmallVecInsn> HoistingPointInfo;
		typedef SmallVector<HoistingPointInfo, 4> HoistingPointList;

GVNHoist(DominatorTree DT, PostDominatorTree PDT, AliasAnalysis *AA,		GVNHoist(DominatorTree DT, PostDominatorTree PDT, AliasAnalysis *AA,
MemoryDependenceResults MD, MemorySSA MSSA)		MemoryDependenceResults MD, MemorySSA MSSA)
: DT(DT), PDT(PDT), AA(AA), MD(MD), MSSA(MSSA),		: DT(DT), PDT(PDT), AA(AA), MD(MD), MSSA(MSSA),
MSSAUpdater(std::make_unique<MemorySSAUpdater>(MSSA)) {}		MSSAUpdater(std::make_unique<MemorySSAUpdater>(MSSA)),
		HoistingGeps(false) {
		clearVNTables();
		}

		void clearVNTables() {
		II.clear();
		LI.clear();
		SI.clear();
		CI.clear();
		}

		// DomLevels maps from BB -> its depth from root.
		// JEdges only contain the J edges as D edges are available in Dominator Tree.
		// BBLevels maps each depth in the CFG to all the Basic Blocks at that level.
		// DJ Graph is described in "Sreedhar, Vugranam C. Efficient program analysis
		// using DJ graphs. McGill University, 1996".
		void constructDJGraph(DomLevelsT &DomLevels, DenseSet<EdgeT> &JEdges,
		BBLevelT &BBLevels);

		// Return true if S1 is a subset of S2.
		bool isSubset(const SmallSetBB &S1, const SmallSetBB &S2) {
		if (S1.size() > S2.size())
		return false;
		for (BasicBlock *BB : S1) {
		if (!S2.count(BB))
		return false;
		}
		return true;
		}

		// Returns true when A executes before B.
		bool DFSInOrder(const Instruction A, const Instruction B) const {
		const BasicBlock *BA = A->getParent();
		const BasicBlock *BB = B->getParent();
		unsigned ADFS, BDFS;
		if (BA == BB) {
		ADFS = DFSNumber.lookup(A);
		BDFS = DFSNumber.lookup(B);
		} else {
		ADFS = DFSNumber.lookup(BA);
		BDFS = DFSNumber.lookup(BB);
		}
		assert(ADFS && BDFS);
		return ADFS < BDFS;
		}

		// DomLevels maps BB to its depth from root.
		// JEdges only contain the J-edges as D-edges are available in Dominator Tree.
		// BBLevels maps each depth in the CFG to all the BBs at that level.
		// BILARDI, G. AND PINGALI, K. 2003. Algorithms for computing the static
		// single assignment form. J. ACM 50, 3 (May), 375–425.
		bool constructMergeSet(DomLevelsT &DomLevels, DenseSet<EdgeT> &JEdges,
		BBLevelT &BBLevels);

		// Returns true if the \p Val is the last use at \p I.
		// TODO: Find O(1) algorithm for this.
		const Instruction lastUser(const Instruction I, const Value *Val) const;

		// Returns true if the \p Val is live-out from \p BB.
		bool isLiveOutUsingMergeSet(BasicBlock BB, Value Val) const;

bool run(Function &F) {		bool run(Function &F) {
		if (F.hasMinSize()) {
		CheckHoistProfitability = false;
		CheckSinkProfitability = false;
		}
NumFuncArgs = F.arg_size();		NumFuncArgs = F.arg_size();
		// DT->recalculate(F);
		DT->updateDFSNumbers();
		DomLevelsT DomLevels;
		DenseSet<EdgeT> JEdges;
		BBLevelT BBLevels;
		constructDJGraph(DomLevels, JEdges, BBLevels);
		// printBBLevels(BBLevels);
		LLVM_DEBUG(printJEdges(JEdges));
		while (constructMergeSet(DomLevels, JEdges, BBLevels))
		;
		LLVM_DEBUG(printMergeSet(MergeSet));

VN.setDomTree(DT);		VN.setDomTree(DT);
VN.setAliasAnalysis(AA);		VN.setAliasAnalysis(AA);
VN.setMemDep(MD);		VN.setMemDep(MD);
bool Res = false;		bool Res = false;
// Perform DFS Numbering of instructions.		// Perform DFS Numbering of instructions.
unsigned BBI = 0;		unsigned BBI = 0;
for (const BasicBlock *BB : depth_first(&F.getEntryBlock())) {		for (const BasicBlock *BB : depth_first(&F.getEntryBlock())) {
DFSNumber[BB] = ++BBI;		DFSNumber[BB] = ++BBI;
unsigned I = 0;		unsigned I = 0;
for (auto &Inst : *BB)		for (auto &Inst : *BB)
DFSNumber[&Inst] = ++I;		DFSNumber[&Inst] = ++I;
}		}

int ChainLength = 0;		int ChainLength = 0;

// FIXME: use lazy evaluation of VN to avoid the fix-point computation.		// FIXME: use lazy evaluation of VN to avoid the fix-point computation.
while (true) {		while (true) {
if (MaxChainLength != -1 && ++ChainLength >= MaxChainLength)		if (MaxHoistChainLength != -1 && ++ChainLength >= MaxHoistChainLength)
return Res;		return Res;

		clearVNTables();
auto HoistStat = hoistExpressions(F);		auto HoistStat = hoistExpressions(F);
if (HoistStat.first + HoistStat.second == 0)		if (HoistStat.first + HoistStat.second == 0)
return Res;		break;

if (HoistStat.second > 0)		if (HoistStat.second > 0)
// To address a limitation of the current GVN, we need to rerun the		// To address a limitation of the current GVN, we need to rerun the
// hoisting after we hoisted loads or stores in order to be able to		// hoisting after we hoisted loads or stores in order to be able to
// hoist all scalars dependent on the hoisted ld/st.		// hoist all scalars dependent on the hoisted ld/st.
VN.clear();		VN.clear();

Res = true;		Res = true;
}		}

		clearVNTables();
		sinkExpressions(F);
return Res;		return Res;
}		}

// Copied from NewGVN.cpp		// Copied from NewGVN.cpp
// This function provides global ranking of operations so that we can place		// This function provides global ranking of operations so that we can place
// them in a canonical order. Note that rank alone is not necessarily enough		// them in a canonical order. Note that rank alone is not necessarily enough
// for a complete ordering, as constants all have the same rank. However,		// for a complete ordering, as constants all have the same rank. However,
// generally, we will simplify an operation with all constants so that it		// generally, we will simplify an operation with all constants so that it
Show All 25 Lines	private:
DominatorTree *DT;		DominatorTree *DT;
PostDominatorTree *PDT;		PostDominatorTree *PDT;
AliasAnalysis *AA;		AliasAnalysis *AA;
MemoryDependenceResults *MD;		MemoryDependenceResults *MD;
MemorySSA *MSSA;		MemorySSA *MSSA;
std::unique_ptr<MemorySSAUpdater> MSSAUpdater;		std::unique_ptr<MemorySSAUpdater> MSSAUpdater;
DenseMap<const Value *, unsigned> DFSNumber;		DenseMap<const Value *, unsigned> DFSNumber;
BBSideEffectsSet BBSideEffects;		BBSideEffectsSet BBSideEffects;
DenseSet<const BasicBlock *> HoistBarrier;
SmallVector<BasicBlock *, 32> IDFBlocks;		SmallVector<BasicBlock *, 32> IDFBlocks;
unsigned NumFuncArgs;		unsigned NumFuncArgs;
const bool HoistingGeps = false;		const bool HoistingGeps = false;
		MergeSetT MergeSet;
		DenseSet<const BasicBlock *> HoistBarrier;
		InsnInfo II;
		LoadInfo LI;
		StoreInfo SI;
		CallInfo CI;

enum InsKind { Unknown, Scalar, Load, Store };		enum InsKind { Unknown, Scalar, Load, Store };

		bool hoistCandidate(const User *U) const {
		if (!VN.exists(const_cast<User *>(U))) // Only for scalars
		return false;
		unsigned V = VN.lookup(const_cast<User *>(U));

		VNType L({V, InvalidVN});

		// Multiple scalars with same VN have very high chance of being hoisted.
		if (II.getVNTable().count(L) > 1)
		return true;

		// Multiple loads with same VN have very high chance of being hoisted.
		if (LI.getVNTable().count(L) > 1)
		return true;

		// Multiple stores with same VN have very high chance of being hoisted.
		if (SI.getVNTable().count(L) > 1)
		return true;

		// Multiple calls with same VN have very high chance of being hoisted.
		if (CI.getLoadVNTable().count(L) > 1)
		return true;

		// Multiple calls with same VN have very high chance of being hoisted.
		if (CI.getScalarVNTable().count(L) > 1)
		return true;

		// Multiple calls with same VN have very high chance of being hoisted.
		if (CI.getStoreVNTable().count(L) > 1)
		return true;
		return false;
		}

		bool successorDominate(const BasicBlock BB, const BasicBlock A);

// Return true when there are exception handling in BB.		// Return true when there are exception handling in BB.
bool hasEH(const BasicBlock *BB) {		bool hasEH(const BasicBlock *BB);

		bool hasEHhelper(const BasicBlock BB, const BasicBlock SrcBB,
		int &NBBsOnAllPaths);

		// Return true when there are exception handling or loads of memory Def
		// between Def and NewPt. This function is only called for stores: Def is
		// the MemoryDef of the store to be hoisted.
		//
		// Decrement by 1 NBBsOnAllPaths for each block between HoistPt and BB, and
		// return true when the counter NBBsOnAllPaths reaces 0, except when it is
		// initialized to -1 which is unlimited.
		bool hasEHOrLoadsOnPath(const Instruction NewPt, MemoryDef Def,
		int &NBBsOnAllPaths);

		// Return true when there are exception handling between HoistPt and BB.
		// Decrement by 1 NBBsOnAllPaths for each block between HoistPt and BB, and
		// return true when the counter NBBsOnAllPaths reaches 0, except when it is
		// initialized to -1 which is unlimited.
		bool hasEHOnPath(const BasicBlock HoistPt, const BasicBlock SrcBB,
		int &NBBsOnAllPaths);

		// Return true when there are memory uses of Def in BB.
		bool hasMemoryUse(const Instruction NewPt, MemoryDef Def,
		const BasicBlock *BB);

		bool firstInBB(const Instruction I1, const Instruction I2) const;

		// Return true when it is safe to hoist scalar instructions from all blocks in
		// WL to HoistBB. TODO: Hoisting scalars past a call only when the
		// interference does not increase.
		bool safeToHoistScalar(const BasicBlock HoistBB, const BasicBlock BB,
		int &NBBsOnAllPaths);
		// Return true when it is safe to hoist a memory load or store U from OldPt
		// to NewPt.
		bool safeToHoistLdSt(const Instruction NewPt, const Instruction OldPt,
		MemoryUseOrDef *MA, InsKind K, int &NBBsOnAllPaths);
		bool safeToHoistLdSt_0(const Instruction *NewHoistPt,
		const Instruction HoistPt, const Instruction Insn,
		MemoryUseOrDef *MA, GVNHoist::InsKind K,
		int &NumBBsOnAllPaths, const BasicBlock *HoistPtBB,
		const BasicBlock NewHoistBB, const BasicBlock InsnBB,
		SmallPtrSetImpl<const BasicBlock *> &WL);
		// Return true when it is profitable to hoist I1.
		bool profitableToHoist(Instruction *I) const;
		bool profitableToHoist(CHIArgs C) const;
		// Return true when it is profitable to hoist I1.
		bool profitableToSink(Instruction *I) const;

		bool safeToSinkToEnd(Instruction I, BasicBlock BB, GVNHoist::InsKind K);

		// Find sinkable instructions.
		void findSinkableInsn(VNtoInsns &Map, HoistingPointList &HPL, InsKind K);

		bool valueAnticipable(CHIArgs C, Instruction *TI) const;
		void checkSafety(CHIArgs C, BasicBlock *BB, InsKind K,
		SmallVectorImpl<CHIArg> &Safe);

		// Push all the VNs corresponding to BB into RenameStack.
		void fillRenameStack(BasicBlock *BB, InValuesType &ValueBBs,
		RenameStackType &RenameStack);

		void fillChiArgs(BasicBlock *BB, OutValuesType &CHIBBs,
		RenameStackType &RenameStack);

		void insertCHI(InValuesType &ValueBBs, OutValuesType &CHIBBs);

		void findHoistableCandidates(OutValuesType &CHIBBs, InsKind K,
		HoistingPointList &HPL);

		void computeInsertionPoints(VNtoInsns &Map, HoistingPointList &HPL,
		InsKind K);

		// Return true when all operands of Instr are available at insertion point
		// HoistPt. When limiting the number of hoisted expressions, one could hoist
		// a load without hoisting its access function. So before hoisting any
		// expression, make sure that all its operands are available at insert point.
		bool allOperandsAvailable(Instruction I, BasicBlock NewDest,
		SmallVecImplVal &Unav) const;
		// Same as allOperandsAvailable with recursive check for GEP operands.
		bool allGepOperandsAvailable(const Instruction *I,
		const BasicBlock *HoistPt) const;

		// Make all operands of the GEP available.
		void makeGepsAvailable(Instruction Repl, BasicBlock HoistPt,
		const SmallVecInsn &InstructionsToHoist,
		Instruction *Gep) const;

		// In the case Repl is a load or a store, we make all their GEPs
		// available: GEPs are not hoisted by default to avoid the address
		// computations to be hoisted without the associated load or store.
		bool makeGepOperandsAvailable(Instruction Repl, BasicBlock HoistPt,
		const SmallVecInsn &InstructionsToHoist) const;

		void updateAlignment(Instruction I, Instruction Repl);

		void removeAndReplace(const SmallVecInsn &InstructionsToHoist,
		Instruction *Repl);
		void removeAndReplace(Instruction I, Instruction Repl,
		MemoryAccess *NewMemAcc);

		// Remove and rename all instructions other than Repl.
		unsigned removeAndReplace(const SmallVecInsn &Candidates, Instruction *Repl,
		BasicBlock *DestBB, bool MoveAccess);

		void removeMPhi(MemoryAccess *NewMemAcc);
		unsigned rauw(const SmallVecInsn &Candidates, Instruction *Repl,
		MemoryUseOrDef *NewMemAcc);
		void raMPHIuw(MemoryUseOrDef *NewMemAcc);

		std::pair<unsigned, unsigned> hoist(HoistingPointList &HPL);
		std::pair<unsigned, unsigned> sink(HoistingPointList &HPL);
		bool getVN(Instruction *I);
		void barriersAndVNs(Function &F, bool Sinking);
		// Hoist all expressions. Returns Number of scalars hoisted
		// and number of non-scalars hoisted.
		std::pair<unsigned, unsigned> hoistExpressions(Function &F);
		std::pair<unsigned, unsigned> sinkExpressions(Function &F);
		};

		// Return true when there are exception handling in BB.
		bool GVNHoist::hasEH(const BasicBlock *BB) {
auto It = BBSideEffects.find(BB);		auto It = BBSideEffects.find(BB);
if (It != BBSideEffects.end())		if (It != BBSideEffects.end())
return It->second;		return It->second;

if (BB->isEHPad() \|\| BB->hasAddressTaken()) {		if (BB->isEHPad() \|\| BB->hasAddressTaken()) {
BBSideEffects[BB] = true;		BBSideEffects[BB] = true;
return true;		return true;
}		}

if (BB->getTerminator()->mayThrow()) {		if (BB->getTerminator()->mayThrow()) {
BBSideEffects[BB] = true;		BBSideEffects[BB] = true;
return true;		return true;
}		}

BBSideEffects[BB] = false;		BBSideEffects[BB] = false;
return false;		return false;
}		}

// Return true when a successor of BB dominates A.		// Return true when a successor of BB dominates A.
bool successorDominate(const BasicBlock BB, const BasicBlock A) {		bool GVNHoist::successorDominate(const BasicBlock BB, const BasicBlock A) {
for (const BasicBlock *Succ : successors(BB))		for (const BasicBlock *Succ : successors(BB))
if (DT->dominates(Succ, A))		if (DT->dominates(Succ, A))
return true;		return true;

return false;		return false;
}		}

// Return true when I1 appears before I2 in the instructions of BB.		// Return true when I1 appears before I2 in the instructions of BB.
bool firstInBB(const Instruction I1, const Instruction I2) {		bool GVNHoist::firstInBB(const Instruction I1, const Instruction I2) const {
assert(I1->getParent() == I2->getParent());		assert(I1->getParent() == I2->getParent());
unsigned I1DFS = DFSNumber.lookup(I1);		unsigned I1DFS = DFSNumber.lookup(I1);
unsigned I2DFS = DFSNumber.lookup(I2);		unsigned I2DFS = DFSNumber.lookup(I2);
assert(I1DFS && I2DFS);		assert(I1DFS && I2DFS);
return I1DFS < I2DFS;		return I1DFS < I2DFS;
}		}

// Return true when there are memory uses of Def in BB.		// Return true when there are memory uses of Def in BB.
bool hasMemoryUse(const Instruction NewPt, MemoryDef Def,		bool GVNHoist::hasMemoryUse(const Instruction NewPt, MemoryDef Def,
const BasicBlock *BB) {		const BasicBlock *BB) {
const MemorySSA::AccessList *Acc = MSSA->getBlockAccesses(BB);		const MemorySSA::AccessList *Acc = MSSA->getBlockAccesses(BB);
if (!Acc)		if (!Acc)
return false;		return false;

Instruction *OldPt = Def->getMemoryInst();		Instruction *OldPt = Def->getMemoryInst();
const BasicBlock *OldBB = OldPt->getParent();		const BasicBlock *OldBB = OldPt->getParent();
const BasicBlock *NewBB = NewPt->getParent();		const BasicBlock *NewBB = NewPt->getParent();
bool ReachedNewPt = false;		bool ReachedNewPt = false;

for (const MemoryAccess &MA : *Acc)		for (const MemoryAccess &MA : *Acc)
if (const MemoryUse *MU = dyn_cast<MemoryUse>(&MA)) {		if (const MemoryUse *MU = dyn_cast<MemoryUse>(&MA)) {
Instruction *Insn = MU->getMemoryInst();		Instruction *Insn = MU->getMemoryInst();

// Do not check whether MU aliases Def when MU occurs after OldPt.		// Do not check whether MU aliases Def when MU occurs after OldPt.
if (BB == OldBB && firstInBB(OldPt, Insn))		if (BB == OldBB && firstInBB(OldPt, Insn))
break;		break;

// Do not check whether MU aliases Def when MU occurs before NewPt.		// Do not check whether MU aliases Def when MU occurs before NewPt.
if (BB == NewBB) {		if (BB == NewBB) {
if (!ReachedNewPt) {		if (!ReachedNewPt) {
if (firstInBB(Insn, NewPt))		if (firstInBB(Insn, NewPt))
continue;		continue;
ReachedNewPt = true;		ReachedNewPt = true;
}		}
}		}
if (MemorySSAUtil::defClobbersUseOrDef(Def, MU, *AA))		if (MemorySSAUtil::defClobbersUseOrDef(Def, MU, *AA))
return true;		return true;
}		}

return false;		return false;
}		}

bool hasEHhelper(const BasicBlock BB, const BasicBlock SrcBB,		bool GVNHoist::hasEHhelper(const BasicBlock BB, const BasicBlock SrcBB,
int &NBBsOnAllPaths) {		int &NBBsOnAllPaths) {
// Stop walk once the limit is reached.		// Stop walk once the limit is reached.
if (NBBsOnAllPaths == 0)		if (NBBsOnAllPaths == 0)
return true;		return true;

// Impossible to hoist with exceptions on the path.		// Impossible to hoist with exceptions on the path.
if (hasEH(BB))		if (hasEH(BB))
return true;		return true;

// No such instruction after HoistBarrier in a basic block was		// No such instruction after HoistBarrier in a basic block was
// selected for hoisting so instructions selected within basic block with		// selected for hoisting so instructions selected within basic block with
// a hoist barrier can be hoisted.		// a hoist barrier can be hoisted.
if ((BB != SrcBB) && HoistBarrier.count(BB))		if ((BB != SrcBB) && HoistBarrier.count(BB))
return true;		return true;

return false;		return false;
}		}

// Return true when there are exception handling or loads of memory Def		// Return true when there are exception handling or loads of memory Def
// between Def and NewPt. This function is only called for stores: Def is		// between Def and NewPt. This function is only called for stores: Def is
// the MemoryDef of the store to be hoisted.		// the MemoryDef of the store to be hoisted.

// Decrement by 1 NBBsOnAllPaths for each block between HoistPt and BB, and		// Decrement by 1 NBBsOnAllPaths for each block between HoistPt and BB, and
// return true when the counter NBBsOnAllPaths reaces 0, except when it is		// return true when the counter NBBsOnAllPaths reaces 0, except when it is
// initialized to -1 which is unlimited.		// initialized to -1 which is unlimited.
bool hasEHOrLoadsOnPath(const Instruction NewPt, MemoryDef Def,		bool GVNHoist::hasEHOrLoadsOnPath(const Instruction NewPt, MemoryDef Def,
int &NBBsOnAllPaths) {		int &NBBsOnAllPaths) {
const BasicBlock *NewBB = NewPt->getParent();		const BasicBlock *NewBB = NewPt->getParent();
const BasicBlock *OldBB = Def->getBlock();		const BasicBlock *OldBB = Def->getBlock();
assert(DT->dominates(NewBB, OldBB) && "invalid path");		assert(DT->dominates(NewBB, OldBB) && "invalid path");
assert(DT->dominates(Def->getDefiningAccess()->getBlock(), NewBB) &&		assert(DT->dominates(Def->getDefiningAccess()->getBlock(), NewBB) &&
"def does not dominate new hoisting point");		"def does not dominate new hoisting point");

// Walk all basic blocks reachable in depth-first iteration on the inverse		// Walk all basic blocks reachable in depth-first iteration on the inverse
// CFG from OldBB to NewBB. These blocks are all the blocks that may be		// CFG from OldBB to NewBB. These blocks are all the blocks that may be
// executed between the execution of NewBB and OldBB. Hoisting an expression		// executed between the execution of NewBB and OldBB. Hoisting an expression
// from OldBB into NewBB has to be safe on all execution paths.		// from OldBB into NewBB has to be safe on all execution paths.
for (auto I = idf_begin(OldBB), E = idf_end(OldBB); I != E;) {		for (auto I = idf_begin(OldBB), E = idf_end(OldBB); I != E;) {
const BasicBlock BB = I;		const BasicBlock BB = I;
if (BB == NewBB) {		if (BB == NewBB) {
// Stop traversal when reaching HoistPt.		// Stop traversal when reaching HoistPt.
I.skipChildren();		I.skipChildren();
continue;		continue;
}		}

if (hasEHhelper(BB, OldBB, NBBsOnAllPaths))		if (hasEHhelper(BB, OldBB, NBBsOnAllPaths))
return true;		return true;

// Check that we do not move a store past loads.		// Check that we do not move a store past loads.
if (hasMemoryUse(NewPt, Def, BB))		if (hasMemoryUse(NewPt, Def, BB))
return true;		return true;

// -1 is unlimited number of blocks on all paths.		// -1 is unlimited number of blocks on all paths.
if (NBBsOnAllPaths != -1)		if (NBBsOnAllPaths != -1)
--NBBsOnAllPaths;		--NBBsOnAllPaths;

++I;		++I;
}		}

return false;		return false;
}		}

// Return true when there are exception handling between HoistPt and BB.		// Return true when there are exception handling between HoistPt and BB.
// Decrement by 1 NBBsOnAllPaths for each block between HoistPt and BB, and		// Decrement by 1 NBBsOnAllPaths for each block between HoistPt and BB, and
// return true when the counter NBBsOnAllPaths reaches 0, except when it is		// return true when the counter NBBsOnAllPaths reaches 0, except when it is
// initialized to -1 which is unlimited.		// initialized to -1 which is unlimited.
bool hasEHOnPath(const BasicBlock HoistPt, const BasicBlock SrcBB,		bool GVNHoist::hasEHOnPath(const BasicBlock HoistPt, const BasicBlock SrcBB,
int &NBBsOnAllPaths) {		int &NBBsOnAllPaths) {
assert(DT->dominates(HoistPt, SrcBB) && "Invalid path");		assert(DT->dominates(HoistPt, SrcBB) && "Invalid path");

// Walk all basic blocks reachable in depth-first iteration on		// Walk all basic blocks reachable in depth-first iteration on
// the inverse CFG from BBInsn to NewHoistPt. These blocks are all the		// the inverse CFG from BBInsn to NewHoistPt. These blocks are all the
// blocks that may be executed between the execution of NewHoistPt and		// blocks that may be executed between the execution of NewHoistPt and
// BBInsn. Hoisting an expression from BBInsn into NewHoistPt has to be safe		// BBInsn. Hoisting an expression from BBInsn into NewHoistPt has to be safe
// on all execution paths.		// on all execution paths.
for (auto I = idf_begin(SrcBB), E = idf_end(SrcBB); I != E;) {		for (auto I = idf_begin(SrcBB), E = idf_end(SrcBB); I != E;) {
const BasicBlock BB = I;		const BasicBlock BB = I;
if (BB == HoistPt) {		if (BB == HoistPt) {
// Stop traversal when reaching NewHoistPt.		// Stop traversal when reaching NewHoistPt.
I.skipChildren();		I.skipChildren();
continue;		continue;
}		}

if (hasEHhelper(BB, SrcBB, NBBsOnAllPaths))		if (hasEHhelper(BB, SrcBB, NBBsOnAllPaths))
return true;		return true;

// -1 is unlimited number of blocks on all paths.		// -1 is unlimited number of blocks on all paths.
if (NBBsOnAllPaths != -1)		if (NBBsOnAllPaths != -1)
--NBBsOnAllPaths;		--NBBsOnAllPaths;

++I;		++I;
}		}

return false;		return false;
}		}

// Return true when it is safe to hoist a memory load or store U from OldPt		// Return true when it is safe to hoist a memory load or store U from OldPt
// to NewPt.		// to NewPt.
bool safeToHoistLdSt(const Instruction NewPt, const Instruction OldPt,		bool GVNHoist::safeToHoistLdSt(const Instruction *NewPt,
MemoryUseOrDef *U, InsKind K, int &NBBsOnAllPaths) {		const Instruction OldPt, MemoryUseOrDef U,
		InsKind K, int &NBBsOnAllPaths) {

// In place hoisting is safe.		// In place hoisting is safe.
if (NewPt == OldPt)		if (NewPt == OldPt)
return true;		return true;

const BasicBlock *NewBB = NewPt->getParent();		const BasicBlock *NewBB = NewPt->getParent();
const BasicBlock *OldBB = OldPt->getParent();		const BasicBlock *OldBB = OldPt->getParent();
const BasicBlock *UBB = U->getBlock();		const BasicBlock *UBB = U->getBlock();

// Check for dependences on the Memory SSA.		// Check for dependences on the Memory SSA.
MemoryAccess *D = U->getDefiningAccess();		MemoryAccess *D = U->getDefiningAccess();
BasicBlock *DBB = D->getBlock();		BasicBlock *DBB = D->getBlock();
if (DT->properlyDominates(NewBB, DBB))		if (DT->properlyDominates(NewBB, DBB))
// Cannot move the load or store to NewBB above its definition in DBB.		// Cannot move the load or store to NewBB above its definition in DBB.
return false;		return false;

if (NewBB == DBB && !MSSA->isLiveOnEntryDef(D))		if (NewBB == DBB && !MSSA->isLiveOnEntryDef(D))
if (auto *UD = dyn_cast<MemoryUseOrDef>(D))		if (auto *UD = dyn_cast<MemoryUseOrDef>(D))
if (!firstInBB(UD->getMemoryInst(), NewPt))		if (!firstInBB(UD->getMemoryInst(), NewPt))
// Cannot move the load or store to NewPt above its definition in D.		// Cannot move the load or store to NewPt above its definition in D.
return false;		return false;

// Check for unsafe hoistings due to side effects.		// Check for unsafe hoistings due to side effects.
if (K == InsKind::Store) {		if (K == InsKind::Store) {
if (hasEHOrLoadsOnPath(NewPt, cast<MemoryDef>(U), NBBsOnAllPaths))		if (hasEHOrLoadsOnPath(NewPt, dyn_cast<MemoryDef>(U), NBBsOnAllPaths))
return false;		return false;
} else if (hasEHOnPath(NewBB, OldBB, NBBsOnAllPaths))		} else if (hasEHOnPath(NewBB, OldBB, NBBsOnAllPaths))
return false;		return false;

if (UBB == NewBB) {		if (UBB == NewBB) {
if (DT->properlyDominates(DBB, NewBB))		if (DT->properlyDominates(DBB, NewBB))
return true;		return true;
assert(UBB == DBB);		assert(UBB == DBB);
assert(MSSA->locallyDominates(D, U));		assert(MSSA->locallyDominates(D, U));
}		}

// No side effects: it is safe to hoist.		// No side effects: it is safe to hoist.
return true;		return true;
}		}

// Return true when it is safe to hoist scalar instructions from all blocks in		// Return true when it is safe to hoist scalar instructions from all blocks in
// WL to HoistBB.		// WL to HoistBB.
bool safeToHoistScalar(const BasicBlock HoistBB, const BasicBlock BB,		bool GVNHoist::safeToHoistScalar(const BasicBlock *HoistBB,
int &NBBsOnAllPaths) {		const BasicBlock *BB, int &NBBsOnAllPaths) {
return !hasEHOnPath(HoistBB, BB, NBBsOnAllPaths);		return !hasEHOnPath(HoistBB, BB, NBBsOnAllPaths);
}		}

		bool GVNHoist::safeToHoistLdSt_0(const Instruction *NewHoistPt,
		const Instruction *HoistPt,
		const Instruction Insn, MemoryUseOrDef MA,
		GVNHoist::InsKind K, int &NumBBsOnAllPaths,
		const BasicBlock *HoistPtBB,
		const BasicBlock *NewHoistBB,
		const BasicBlock *InsnBB,
		SmallPtrSetImpl<const BasicBlock *> &WL) {
		return (HoistPtBB == NewHoistBB \|\| InsnBB == NewHoistBB
		/hoistingFromAllPaths(NewHoistBB, WL)/) &&
		// Also check that it is safe to move the load or store from HoistPt
		// to NewHoistPt, and from Insn to NewHoistPt. Note that HoistPt may
		// not be the instruction to be hoisted, it is a transient placeholder
		// to find the farthest hoisting point when >2 hoistable candidates
		// can be hoisted to a common dominator.
		safeToHoistLdSt(NewHoistPt, HoistPt, MA, K, NumBBsOnAllPaths) &&
		safeToHoistLdSt(NewHoistPt, Insn, MSSA->getMemoryAccess(Insn), K,
		NumBBsOnAllPaths);
		}

		bool GVNHoist::profitableToHoist(Instruction *I) const {
		// For -O3/-O2 hoist only when the liveness decreases i.e., no more than
		// one operand can be a use without kill.
		// Store and Calls do not create a register def.
		if (isa<StoreInst>(I) \|\| isa<CallInst>(I))
		return true;

		// If Op is a kill then it will not be live-out from its basic block
		// but the reverse is not true.
		for (unsigned op = 0, e = I->getNumOperands(); op != e; ++op) {
		Value *Op = I->getOperand(op);
		// if (isa<Constant>(Op))
		// continue;
		if (isLiveOutUsingMergeSet(I->getParent(), Op))
		return false;
		// It is always profitable to hoist when the liveness does not increase,
		// a Kill will compensate for the def created by this instruction.
		const Instruction *LU = lastUser(I, Op);
		if (LU == I)
		return true;
		else {
		// We optimistically assume that if all the users of Op are hoistable
		// candidates then it is profitable to hoist.
		bool stillProfitable = true;
		for (const User *U : Op->users()) {
		if (!hoistCandidate(U)) {
		stillProfitable = false;
		break;
		}
		LLVM_DEBUG(dbgs() << "\nstill hoistable:" << *U);
		}
		if (stillProfitable)
		return true;
		}
		}
		return false;
		}

		bool GVNHoist::profitableToHoist(CHIArgs C) const {
		if (!CheckHoistProfitability)
		return true;
		int count = 0;
		for (const auto &A : C) {
		if (!profitableToHoist(A.I))
		++count;
		}
		return 2 * count <= size(C);
		}

		bool GVNHoist::profitableToSink(Instruction *I) const {
		if (!CheckSinkProfitability)
		return true;
		// For -O3/-O2 sink only when the liveness decreases i.e., no more than
		// one operand can be a kill.
		// Store and Calls do not create a register def.
		if (isa<StoreInst>(I) \|\| isa<CallInst>(I))
		return false;

		// If Op is a kill then it will not be live-out from its basic block
		// but the reverse is not true.
		unsigned NumKills = 0;
		for (unsigned op = 0, e = I->getNumOperands(); op != e; ++op) {
		Value *Op = I->getOperand(op);
		// It is always profitable to sink when the liveness does not increase.
		if (isLiveOutUsingMergeSet(I->getParent(), Op))
		continue;
		if (!isa<Constant>(Op))
		++NumKills;
		// A Kill will increase the liveness during a sink.
		const Instruction *LU = lastUser(I, Op);
		if (LU == I && NumKills > 1)
		return false;
		}
		return true;
		}

		// Returns true when the instruction does not result
		// in a hazard with any other instruction till the end of BB.
		bool GVNHoist::safeToSinkToEnd(Instruction I, BasicBlock BB,
		GVNHoist::InsKind K) {
		BasicBlock::iterator II(I);
		auto IMA = MSSA->getMemoryAccess(I);
		assert(IMA);
		if (MemoryUseOrDef *I0UD = cast<MemoryUseOrDef>(IMA)) {
		// Only sink loads for now because sinking load either decreases or
		// preserves live-range.
		if (!isa<MemoryUse>(I0UD))
		return false;

		// MemoryAccess *Def = I0UD->getDefiningAccess();
		/*/ Updating Memory PHI is tricky, bail out for now.
		for (User *U : Def->users())
		if (isa<MemoryPhi>(U))
		return false;*/
		}
		while (++II != BB->end()) { // Skip I
		auto IIMA = MSSA->getMemoryAccess(&*II);
		// TODO: Bails out on any memory writes for now, improve for non-aliases.
		if (IIMA) {
		if (!isa<MemoryUse>(IIMA))
		return false;
		}
		}
		return true;
		}

		void GVNHoist::findSinkableInsn(VNtoInsns &Map,
		GVNHoist::HoistingPointList &HPL,
		GVNHoist::InsKind K) {
		// Sort the VNs by rank, higher ranked should be sunk first.
		// SmallVector<unsigned, 8> SortedRank;
		// sortByRank(SortedRank, Map);
		SmallVector<VNtoInsns::key_type, 4> ToErase;
		for (const auto &Entry : Map) {
		const SmallVecInsn &V = Entry.second;
		if (V.size() < 2) {
		ToErase.push_back(Entry.first);
		continue;
		}

		// Only sink to common post-dom
		SmallVecInsn InstructionsToSink;
		for (Instruction *I : V) {
		auto BB = I->getParent();
		// All the sinkable instructions are collected after any barrier in
		// a basic block so no need to check for barrier BBs.

		// The instruction should have only one user i.e., PHI in the Succ
		// TODO: Handle store, calls etc. without users
		unsigned NumUsers = std::distance(I->user_begin(), I->user_end());
		if (NumUsers != 1)
		continue;
		// The only user should be a PHI.
		if (!isa<PHINode>(I->user_back()))
		continue;

		// Sink to single successor.
		if (auto Succ = BB->getSingleSuccessor()) {
		if (I->user_back()->getParent() != Succ)
		continue; // PHI should be in the immediate successor.
		// The successor should have exactly two predecessors.
		unsigned NumPreds = std::distance(pred_begin(Succ), pred_end(Succ));
		if (NumPreds != 2)
		continue;

		// Sinkable instruction found, check for safety and profitability.
		// As the instructions are from a basic block without barriers, we can
		// sink this instruction as long as there is no hazard. Scalars are
		// safe to sink.
		if (K != InsKind::Scalar && !safeToSinkToEnd(I, BB, K))
		continue;
		if (!profitableToSink(I))
		continue;

		// We don't need to check for barriers here because the instruction
		// will be sunk at the beginning of a basic block i.e., before any
		// barrier, and I is after any barriers in I->getParent().
		if (!hasEH(Succ))
		InstructionsToSink.push_back(I);
		}
		}

		// assert(InstructionsToSink.size() < 3 && "Test case");
		// Sort V such that adjacent Instructions share common PDom.
		if (InstructionsToSink.size() != 2)
		continue;
		// assert(InstructionsToSink.size() != 2 && "Test case");
		// There should be a unique PHI using I0 and I1 to be legally sinkable.
		if (InstructionsToSink[0]->user_back() ==
		InstructionsToSink[1]->user_back()) {
		auto Succ0 = InstructionsToSink[0]->getParent()->getSingleSuccessor();
		auto Succ1 = InstructionsToSink[1]->getParent()->getSingleSuccessor();
		assert(Succ0 == Succ1);
		HPL.push_back({Succ0, InstructionsToSink});
		}
		ToErase.push_back(Entry.first);
		}

		// Erase VNs already handled.
		for (auto V : ToErase)
		Map.erase(V);
		}

// In the inverse CFG, the dominance frontier of basic block (BB) is the		// In the inverse CFG, the dominance frontier of basic block (BB) is the
// point where ANTIC needs to be computed for instructions which are going		// point where ANTIC needs to be computed for instructions which are going
// to be hoisted. Since this point does not change during gvn-hoist,		// to be hoisted. Since this point does not change during gvn-hoist,
// we compute it only once (on demand).		// we compute it only once (on demand).
// The ides is inspired from:		// The ides is inspired from:
// "Partial Redundancy Elimination in SSA Form"		// "Partial Redundancy Elimination in SSA Form"
// ROBERT KENNEDY, SUN CHAN, SHIN-MING LIU, RAYMOND LO, PENG TU and FRED CHOW		// ROBERT KENNEDY, SUN CHAN, SHIN-MING LIU, RAYMOND LO, PENG TU and FRED CHOW
// They use similar idea in the forward graph to find fully redundant and		// They use similar idea in the forward graph to to find fully redundant and
// partially redundant expressions, here it is used in the inverse graph to		// partially redundant expressions, here it is used in the inverse graph to
// find fully anticipable instructions at merge point (post-dominator in		// find fully anticipable instructions at merge point (post-dominator in
// the inverse CFG).		// the inverse CFG).
// Returns the edge via which an instruction in BB will get the values from.		// Returns the edge via which an instruction in BB will get the values from.

// Returns true when the values are flowing out to each edge.		// Returns true when the values are flowing out to each edge.
bool valueAnticipable(CHIArgs C, Instruction *TI) const {		bool GVNHoist::valueAnticipable(CHIArgs C, Instruction *TI) const {
if (TI->getNumSuccessors() > (unsigned)size(C))		if (TI->getNumSuccessors() > (unsigned)size(C))
return false; // Not enough args in this CHI.		return false; // Not enough args in this CHI.

for (auto CHI : C) {		for (const CHIArg &CHI : C) {
BasicBlock *Dest = CHI.Dest;		const BasicBlock *Dest = CHI.Dest;
// Find if all the edges have values flowing out of BB.		// Find if all the edges have values flowing out of BB.
bool Found = llvm::any_of(		// This is clearly quadratic here. May need more efficient algorithm.
successors(TI), [Dest](const BasicBlock *BB) { return BB == Dest; });		// Because it happens for limited CHIs it is 'okay' for now.
		// TODO: To make it more efficient, collect all successors(TI) and all
		// CHI.Dest in separate containers in sorted manner, and find a mismatch
		// to get the same result as the following.
		bool Found = any_of(successors(TI),
		[Dest](const BasicBlock *BB) { return BB == Dest; });
if (!Found)		if (!Found)
return false;		return false;
}		}
return true;		return true;
}		}

// Check if it is safe to hoist values tracked by CHI in the range		// Check if it is safe to hoist values tracked by CHI in the range
// [Begin, End) and accumulate them in Safe.		// [Begin, End) and accumulate them in Safe.
void checkSafety(CHIArgs C, BasicBlock *BB, InsKind K,		void GVNHoist::checkSafety(CHIArgs C, BasicBlock *BB, InsKind K,
SmallVectorImpl<CHIArg> &Safe) {		SmallVectorImpl<CHIArg> &Safe) {
int NumBBsOnAllPaths = MaxNumberOfBBSInPath;		int NumBBsOnAllPaths = MaxNumberOfBBSInPath;
for (auto CHI : C) {		for (auto CHI : C) {
Instruction *Insn = CHI.I;		Instruction *Insn = CHI.I;
if (!Insn) // No instruction was inserted in this CHI.		if (!Insn) // No instruction was inserted in this CHI.
continue;		continue;
if (K == InsKind::Scalar) {		if (K == InsKind::Scalar) {
if (safeToHoistScalar(BB, Insn->getParent(), NumBBsOnAllPaths))		if (safeToHoistScalar(BB, Insn->getParent(), NumBBsOnAllPaths))
Safe.push_back(CHI);		Safe.push_back(CHI);
} else {		} else {
MemoryUseOrDef *UD = MSSA->getMemoryAccess(Insn);		MemoryUseOrDef *UD = MSSA->getMemoryAccess(Insn);
		// Use safeToHoistLdSt_0
if (safeToHoistLdSt(BB->getTerminator(), Insn, UD, K, NumBBsOnAllPaths))		if (safeToHoistLdSt(BB->getTerminator(), Insn, UD, K, NumBBsOnAllPaths))
Safe.push_back(CHI);		Safe.push_back(CHI);
}		}
}		}
}		}

using RenameStackType = DenseMap<VNType, SmallVector<Instruction *, 2>>;

// Push all the VNs corresponding to BB into RenameStack.		// Push all the VNs corresponding to BB into RenameStack.
void fillRenameStack(BasicBlock *BB, InValuesType &ValueBBs,		void GVNHoist::fillRenameStack(BasicBlock *BB, InValuesType &ValueBBs,
RenameStackType &RenameStack) {		RenameStackType &RenameStack) {
auto it1 = ValueBBs.find(BB);		auto it1 = ValueBBs.find(BB);
if (it1 != ValueBBs.end()) {		if (it1 != ValueBBs.end()) {
// Iterate in reverse order to keep lower ranked values on the top.		// Iterate in reverse order to keep lower ranked values on the top.
for (std::pair<VNType, Instruction *> &VI : reverse(it1->second)) {		for (std::pair<VNType, Instruction *> &VI : reverse(it1->second)) {
// Get the value of instruction I		// Get the value of instruction I
LLVM_DEBUG(dbgs() << "\nPushing on stack: " << *VI.second);		LLVM_DEBUG(dbgs() << "\nPushing on stack: " << *VI.second);
RenameStack[VI.first].push_back(VI.second);		RenameStack[VI.first].push_back(VI.second);
}		}
}		}
}		}

void fillChiArgs(BasicBlock *BB, OutValuesType &CHIBBs,		void GVNHoist::fillChiArgs(BasicBlock *BB, OutValuesType &CHIBBs,
RenameStackType &RenameStack) {		RenameStackType &RenameStack) {
// For each predecessor (because Post-DOM) of BB check if it has a CHI		// For each predecessor (because Post-DOM) of BB check if it has a CHI
for (auto Pred : predecessors(BB)) {		for (auto Pred : predecessors(BB)) {
auto P = CHIBBs.find(Pred);		auto P = CHIBBs.find(Pred);
if (P == CHIBBs.end()) {		if (P == CHIBBs.end()) {
continue;		continue;
}		}
LLVM_DEBUG(dbgs() << "\nLooking at CHIs in: " << Pred->getName(););		LLVM_DEBUG(dbgs() << "\nLooking at CHIs in: " << Pred->getName(););
// A CHI is found (BB -> Pred is an edge in the CFG)		// A CHI is found (BB -> Pred is an edge in the CFG)
// Pop the stack until Top(V) = Ve.		// Pop the stack until Top(V) = Ve.
auto &VCHI = P->second;		auto &VCHI = P->second;
for (auto It = VCHI.begin(), E = VCHI.end(); It != E;) {		for (auto It = VCHI.begin(), E = VCHI.end(); It != E;) {
CHIArg &C = *It;		CHIArg &C = *It;
if (!C.Dest) {		if (!C.Dest) {
auto si = RenameStack.find(C.VN);		auto si = RenameStack.find(C.VN);
// The Basic Block where CHI is must dominate the value we want to		// The Basic Block where CHI is must dominate the value we want to
// track in a CHI. In the PDom walk, there can be values in the		// track in a CHI. In the PDom walk, there can be values in the
// stack which are not control dependent e.g., nested loop.		// stack which are not control dependent e.g., nested loop.
if (si != RenameStack.end() && si->second.size() &&		if (si != RenameStack.end() && si->second.size() &&
DT->properlyDominates(Pred, si->second.back()->getParent())) {		DT->properlyDominates(Pred, si->second.back()->getParent())) {
C.Dest = BB; // Assign the edge		C.Dest = BB; // Assign the edge
C.I = si->second.pop_back_val(); // Assign the argument		C.I = si->second.pop_back_val(); // Assign the argument
LLVM_DEBUG(dbgs()		LLVM_DEBUG(dbgs()
<< "\nCHI Inserted in BB: " << C.Dest->getName() << *C.I		<< "\nCHI Inserted in BB: " << C.Dest->getName() << *C.I
<< ", VN: " << C.VN.first << ", " << C.VN.second);		<< ", VN: " << C.VN.first << ", " << C.VN.second);
}		}
// Move to next CHI of a different value		// Move to next CHI of a different value
It = std::find_if(It, VCHI.end(),		It = std::find_if(It, VCHI.end(), [It](CHIArg &A) { return A != *It; });
[It](CHIArg &A) { return A != *It; });
} else		} else
++It;		++It;
}		}
}		}
}		}

// Walk the post-dominator tree top-down and use a stack for each value to		// Walk the post-dominator tree top-down and use a stack for each value to
// store the last value you see. When you hit a CHI from a given edge, the		// store the last value you see. When you hit a CHI from a given edge, the
// value to use as the argument is at the top of the stack, add the value to		// value to use as the argument is at the top of the stack, add the value to
// CHI and pop.		// CHI and pop.
void insertCHI(InValuesType &ValueBBs, OutValuesType &CHIBBs) {		void GVNHoist::insertCHI(InValuesType &ValueBBs, OutValuesType &CHIBBs) {
auto Root = PDT->getNode(nullptr);		auto Root = PDT->getNode(nullptr);
if (!Root)		if (!Root)
return;		return;
// Depth first walk on PDom tree to fill the CHIargs at each PDF.		// Depth first walk on PDom tree to fill the CHIargs at each PDF.
RenameStackType RenameStack;		RenameStackType RenameStack;
for (auto Node : depth_first(Root)) {		for (auto Node : depth_first(Root)) {
BasicBlock *BB = Node->getBlock();		BasicBlock *BB = Node->getBlock();
if (!BB)		if (!BB)
continue;		continue;

// Collect all values in BB and push to stack.		// Collect all values in BB and push to stack.
fillRenameStack(BB, ValueBBs, RenameStack);		fillRenameStack(BB, ValueBBs, RenameStack);

// Fill outgoing values in each CHI corresponding to BB.		// Fill outgoing values in each CHI corresponding to BB.
fillChiArgs(BB, CHIBBs, RenameStack);		fillChiArgs(BB, CHIBBs, RenameStack);
}		}
}		}

// Walk all the CHI-nodes to find ones which have a empty-entry and remove		// Walk all the CHI-nodes to find ones which have a empty-entry and remove
// them Then collect all the instructions which are safe to hoist and see if		// them Then collect all the instructions which are safe to hoist and see if
// they form a list of anticipable values. OutValues contains CHIs		// they form a list of anticipable values. OutValues contains CHIs
// corresponding to each basic block.		// corresponding to each basic block.
void findHoistableCandidates(OutValuesType &CHIBBs, InsKind K,		void GVNHoist::findHoistableCandidates(OutValuesType &CHIBBs, InsKind K,
HoistingPointList &HPL) {		HoistingPointList &HPL) {
auto cmpVN = [](const CHIArg &A, const CHIArg &B) { return A.VN < B.VN; };		auto cmpVN = [](const CHIArg &A, const CHIArg &B) { return A.VN < B.VN; };

// CHIArgs now have the outgoing values, so check for anticipability and		// CHIArgs now have the outgoing values, so check for anticipability and
// accumulate hoistable candidates in HPL.		// accumulate hoistable candidates in HPL.
for (std::pair<BasicBlock *, SmallVector<CHIArg, 2>> &A : CHIBBs) {		for (std::pair<BasicBlock *, SmallVector<CHIArg, 2>> &A : CHIBBs) {
BasicBlock *BB = A.first;		BasicBlock *BB = A.first;
SmallVectorImpl<CHIArg> &CHIs = A.second;		SmallVectorImpl<CHIArg> &CHIs = A.second;
// Vector of PHIs contains PHIs for different instructions.		// Vector of CHIs contains CHIs for different instructions.
// Sort the args according to their VNs, such that identical		// Sort the args according to their VNs, such that identical
// instructions are together.		// instructions are together.
llvm::stable_sort(CHIs, cmpVN);		llvm::stable_sort(CHIs, cmpVN);
auto TI = BB->getTerminator();		auto TI = BB->getTerminator();
auto B = CHIs.begin();		auto B = CHIs.begin();
// [PreIt, PHIIt) form a range of CHIs which have identical VNs.		// [PreIt, PHIIt) form a range of CHIs which have identical VNs.
auto PHIIt = std::find_if(CHIs.begin(), CHIs.end(),		auto PHIIt = std::find_if(CHIs.begin(), CHIs.end(),
[B](CHIArg &A) { return A != *B; });		[B](CHIArg &A) { return A != *B; });
auto PrevIt = CHIs.begin();		auto PrevIt = CHIs.begin();
while (PrevIt != PHIIt) {		while (PrevIt != PHIIt) {
// Collect values which satisfy safety checks.		// Collect values which satisfy safety checks.
SmallVector<CHIArg, 2> Safe;		SmallVector<CHIArg, 2> Safe;
// We check for safety first because there might be multiple values in		// We check for safety first because there might be multiple values in
// the same path, some of which are not safe to be hoisted, but overall		// the same path, some of which are not safe to be hoisted, but overall
// each edge has at least one value which can be hoisted, making the		// each edge has at least one value which can be hoisted, making the
// value anticipable along that path.		// value anticipable along that path.
checkSafety(make_range(PrevIt, PHIIt), BB, K, Safe);		checkSafety(make_range(PrevIt, PHIIt), BB, K, Safe);

// List of safe values should be anticipable at TI.		// List of safe values should be anticipable at TI.
if (valueAnticipable(make_range(Safe.begin(), Safe.end()), TI)) {		auto SR = make_range(Safe.begin(), Safe.end());
		if (valueAnticipable(SR, TI) && profitableToHoist(SR)) {
HPL.push_back({BB, SmallVecInsn()});		HPL.push_back({BB, SmallVecInsn()});
SmallVecInsn &V = HPL.back().second;		SmallVecInsn &V = HPL.back().second;
for (auto B : Safe)		for (auto B : Safe)
V.push_back(B.I);		V.push_back(B.I);
}		}

// Check other VNs		// Check other VNs
PrevIt = PHIIt;		PrevIt = PHIIt;
PHIIt = std::find_if(PrevIt, CHIs.end(),		PHIIt = std::find_if(PrevIt, CHIs.end(),
[PrevIt](CHIArg &A) { return A != *PrevIt; });		[PrevIt](CHIArg &A) { return A != *PrevIt; });
}		}
}		}
}		}

// Compute insertion points for each values which can be fully anticipated at		// Compute insertion points for each values which can be fully anticipated at
// a dominator. HPL contains all such values.		// a dominator. HPL contains all such values.
void computeInsertionPoints(const VNtoInsns &Map, HoistingPointList &HPL,		void GVNHoist::computeInsertionPoints(VNtoInsns &Map, HoistingPointList &HPL,
InsKind K) {		InsKind K) {
// Sort VNs based on their rankings		// Sort VNs based on their rankings
std::vector<VNType> Ranks;		std::vector<VNType> Ranks;
for (const auto &Entry : Map) {		for (const auto &Entry : Map) {
Ranks.push_back(Entry.first);		Ranks.push_back(Entry.first);
}		}

// TODO: Remove fully-redundant expressions.		// TODO: Remove fully-redundant expressions.
// Get instruction from the Map, assume that all the Instructions		// Get instruction from the Map, assume that all the Instructions
// with same VNs have same rank (this is an approximation).		// with same VNs have same rank (this is an approximation).
llvm::sort(Ranks, [this, &Map](const VNType &r1, const VNType &r2) {		llvm::sort(Ranks, [this, &Map](const VNType &r1, const VNType &r2) {
return (rank(Map.lookup(r1).begin()) < rank(Map.lookup(r2).begin()));		return (rank(Map.lookup(r1).begin()) < rank(Map.lookup(r2).begin()));
});		});

// - Sort VNs according to their rank, and start with lowest ranked VN		// - Sort VNs according to their rank, and start with lowest ranked VN
// - Take a VN and for each instruction with same VN		// - Take a VN and for each instruction with same VN
// - Find the dominance frontier in the inverse graph (PDF)		// - Find the dominance frontier in the inverse graph (PDF)
// - Insert the chi-node at PDF		// - Insert the chi-node at PDF
// - Remove the chi-nodes with missing entries		// - Remove the chi-nodes with missing entries
// - Remove values from CHI-nodes which do not truly flow out, e.g.,		// - Remove values from CHI-nodes which do not truly flow out, e.g.,
// modified along the path.		// modified along the path.
// - Collect the remaining values that are still anticipable		// - Collect the remaining values that are still anticipable
SmallVector<BasicBlock *, 2> IDFBlocks;		SmallVector<BasicBlock *, 2> IDFBlocks;
ReverseIDFCalculator IDFs(*PDT);		ReverseIDFCalculator IDFs(*PDT);
OutValuesType OutValue;		OutValuesType OutValue;
InValuesType InValue;		InValuesType InValue;
		SmallVector<VNtoInsns::key_type, 4> ToErase;
for (const auto &R : Ranks) {		for (const auto &R : Ranks) {
const SmallVecInsn &V = Map.lookup(R);		const SmallVecInsn &V = Map.lookup(R);
		ToErase.push_back(R);
if (V.size() < 2)		if (V.size() < 2)
continue;		continue;
const VNType &VN = R;		const VNType &VN = R;
SmallPtrSet<BasicBlock *, 2> VNBlocks;		SmallPtrSet<BasicBlock *, 2> VNBlocks;
for (auto &I : V) {		for (auto &I : V) {
BasicBlock *BBI = I->getParent();		BasicBlock *BBI = I->getParent();
if (!hasEH(BBI))		if (!hasEH(BBI))
VNBlocks.insert(BBI);		VNBlocks.insert(BBI);
}		}
// Compute the Post Dominance Frontiers of each basic block		// Compute the Post Dominance Frontiers of each basic block
// The dominance frontier of a live block X in the reverse		// The dominance frontier of a live block X in the reverse
// control graph is the set of blocks upon which X is control		// control graph is the set of blocks upon which X is control
// dependent. The following sequence computes the set of blocks		// dependent. The following sequence computes the set of blocks
// which currently have dead terminators that are control		// which currently have dead terminators that are control
// dependence sources of a block which is in NewLiveBlocks.		// dependence sources of a block which is in NewLiveBlocks.
IDFs.setDefiningBlocks(VNBlocks);		IDFs.setDefiningBlocks(VNBlocks);
IDFBlocks.clear();		IDFBlocks.clear();
IDFs.calculate(IDFBlocks);		IDFs.calculate(IDFBlocks);

// Make a map of BB vs instructions to be hoisted.		// Make a map of BB vs instructions to be hoisted.
for (unsigned i = 0; i < V.size(); ++i) {		for (unsigned i = 0; i < V.size(); ++i) {
InValue[V[i]->getParent()].push_back(std::make_pair(VN, V[i]));		InValue[V[i]->getParent()].push_back(std::make_pair(VN, V[i]));
}		}
// Insert empty CHI node for this VN. This is used to factor out		// Insert empty CHI node for this VN. This is used to factor out
// basic blocks where the ANTIC can potentially change.		// basic blocks where the ANTIC can potentially change.
for (auto IDFB : IDFBlocks) {		for (auto IDFB : IDFBlocks) {
for (unsigned i = 0; i < V.size(); ++i) {		for (unsigned i = 0; i < V.size(); ++i) {
CHIArg C = {VN, nullptr, nullptr};		CHIArg C = {VN, nullptr, nullptr};
// Ignore spurious PDFs.		// Ignore spurious PDFs.
if (DT->properlyDominates(IDFB, V[i]->getParent())) {		if (DT->properlyDominates(IDFB, V[i]->getParent())) {
OutValue[IDFB].push_back(C);		OutValue[IDFB].push_back(C);
LLVM_DEBUG(dbgs() << "\nInsertion a CHI for BB: " << IDFB->getName()		LLVM_DEBUG(dbgs() << "\nInserting a CHI for BB: " << IDFB->getName()
<< ", for Insn: " << *V[i]);		<< ", for Insn: " << *V[i]);
}		}
}		}
}		}
}		}

		// Erase VNs already handled.
		for (auto V : ToErase)
		Map.erase(V);

// Insert CHI args at each PDF to iterate on factored graph of		// Insert CHI args at each PDF to iterate on factored graph of
// control dependence.		// control dependence.
insertCHI(InValue, OutValue);		insertCHI(InValue, OutValue);
// Using the CHI args inserted at each PDF, find fully anticipable values.		// Using the CHI args inserted at each PDF, find fully anticipable values.
findHoistableCandidates(OutValue, K, HPL);		findHoistableCandidates(OutValue, K, HPL);
}		}

// Return true when all operands of Instr are available at insertion point		bool GVNHoist::allOperandsAvailable(Instruction I, BasicBlock NewDest,
// HoistPt. When limiting the number of hoisted expressions, one could hoist		SmallVecImplVal &Unav) const {
// a load without hoisting its access function. So before hoisting any		for (Use &Op : I->operands())
// expression, make sure that all its operands are available at insert point.		if (auto *Inst = dyn_cast<Instruction>(&Op))
bool allOperandsAvailable(const Instruction *I,		if (!DT->dominates(Inst->getParent(), NewDest))
const BasicBlock *HoistPt) const {		Unav.push_back(Op.get());
for (const Use &Op : I->operands())
if (const auto *Inst = dyn_cast<Instruction>(&Op))
if (!DT->dominates(Inst->getParent(), HoistPt))
return false;

return true;		return Unav.empty();
}		}

// Same as allOperandsAvailable with recursive check for GEP operands.		// Same as allOperandsAvailable with recursive check for GEP operands.
bool allGepOperandsAvailable(const Instruction *I,		bool GVNHoist::allGepOperandsAvailable(const Instruction *I,
const BasicBlock *HoistPt) const {		const BasicBlock *HoistPt) const {
for (const Use &Op : I->operands())		for (const Use &Op : I->operands())
if (const auto *Inst = dyn_cast<Instruction>(&Op))		if (const auto *Inst = dyn_cast<Instruction>(&Op))
if (!DT->dominates(Inst->getParent(), HoistPt)) {		if (!DT->dominates(Inst->getParent(), HoistPt)) {
if (const GetElementPtrInst *GepOp =		if (const GetElementPtrInst *GepOp =
dyn_cast<GetElementPtrInst>(Inst)) {		dyn_cast<GetElementPtrInst>(Inst)) {
if (!allGepOperandsAvailable(GepOp, HoistPt))		if (!allGepOperandsAvailable(GepOp, HoistPt))
return false;		return false;
// Gep is available if all operands of GepOp are available.		// Gep is available if all operands of GepOp are available.
} else {		} else {
// Gep is not available if it has operands other than GEPs that are		// Gep is not available if it has operands other than GEPs that are
// defined in blocks not dominating HoistPt.		// defined in blocks not dominating HoistPt.
return false;		return false;
}		}
}		}
return true;		return true;
}		}

// Make all operands of the GEP available.		// Make all operands of the GEP available.
void makeGepsAvailable(Instruction Repl, BasicBlock HoistPt,		void GVNHoist::makeGepsAvailable(Instruction Repl, BasicBlock HoistPt,
const SmallVecInsn &InstructionsToHoist,		const SmallVecInsn &InstructionsToHoist,
Instruction *Gep) const {		Instruction *Gep) const {
assert(allGepOperandsAvailable(Gep, HoistPt) &&		assert(allGepOperandsAvailable(Gep, HoistPt) && "GEP operands not available");
"GEP operands not available");

Instruction *ClonedGep = Gep->clone();		Instruction *ClonedGep = Gep->clone();
for (unsigned i = 0, e = Gep->getNumOperands(); i != e; ++i)		for (unsigned i = 0, e = Gep->getNumOperands(); i != e; ++i)
if (Instruction *Op = dyn_cast<Instruction>(Gep->getOperand(i))) {		if (Instruction *Op = dyn_cast<Instruction>(Gep->getOperand(i))) {
// Check whether the operand is already available.		// Check whether the operand is already available.
if (DT->dominates(Op->getParent(), HoistPt))		if (DT->dominates(Op->getParent(), HoistPt))
continue;		continue;

// As a GEP can refer to other GEPs, recursively make all the operands		// As a GEP can refer to other GEPs, recursively make all the operands
// of this GEP available at HoistPt.		// of this GEP available at HoistPt.
if (GetElementPtrInst *GepOp = dyn_cast<GetElementPtrInst>(Op))		if (GetElementPtrInst *GepOp = dyn_cast<GetElementPtrInst>(Op))
makeGepsAvailable(ClonedGep, HoistPt, InstructionsToHoist, GepOp);		makeGepsAvailable(ClonedGep, HoistPt, InstructionsToHoist, GepOp);
}		}

// Copy Gep and replace its uses in Repl with ClonedGep.		// Copy Gep and replace its uses in Repl with ClonedGep.
ClonedGep->insertBefore(HoistPt->getTerminator());		ClonedGep->insertBefore(HoistPt->getTerminator());

// Conservatively discard any optimization hints, they may differ on the		// Conservatively discard any optimization hints, they may differ on the
// other paths.		// other paths.
ClonedGep->dropUnknownNonDebugMetadata();		ClonedGep->dropUnknownNonDebugMetadata();

// If we have optimization hints which agree with each other along different		// If we have optimization hints which agree with each other along different
// paths, preserve them.		// paths, preserve them.
for (const Instruction *OtherInst : InstructionsToHoist) {		for (const Instruction *OtherInst : InstructionsToHoist) {
const GetElementPtrInst *OtherGep;		const GetElementPtrInst *OtherGep;
if (auto *OtherLd = dyn_cast<LoadInst>(OtherInst))		if (auto *OtherLd = dyn_cast<LoadInst>(OtherInst))
OtherGep = cast<GetElementPtrInst>(OtherLd->getPointerOperand());		OtherGep = cast<GetElementPtrInst>(OtherLd->getPointerOperand());
else		else
OtherGep = cast<GetElementPtrInst>(		OtherGep = cast<GetElementPtrInst>(
cast<StoreInst>(OtherInst)->getPointerOperand());		cast<StoreInst>(OtherInst)->getPointerOperand());
ClonedGep->andIRFlags(OtherGep);		ClonedGep->andIRFlags(OtherGep);
}		}

// Replace uses of Gep with ClonedGep in Repl.		// Replace uses of Gep with ClonedGep in Repl.
Repl->replaceUsesOfWith(Gep, ClonedGep);		Repl->replaceUsesOfWith(Gep, ClonedGep);
}		}

void updateAlignment(Instruction I, Instruction Repl) {		void GVNHoist::updateAlignment(Instruction I, Instruction Repl) {
if (auto *ReplacementLoad = dyn_cast<LoadInst>(Repl)) {		if (auto *ReplacementLoad = dyn_cast<LoadInst>(Repl)) {
ReplacementLoad->setAlignment(MaybeAlign(std::min(		ReplacementLoad->setAlignment(MaybeAlign(std::min(
ReplacementLoad->getAlignment(), cast<LoadInst>(I)->getAlignment())));		ReplacementLoad->getAlignment(), cast<LoadInst>(I)->getAlignment())));
++NumLoadsRemoved;		++NumLoadsRemoved;
} else if (auto *ReplacementStore = dyn_cast<StoreInst>(Repl)) {		} else if (auto *ReplacementStore = dyn_cast<StoreInst>(Repl)) {
ReplacementStore->setAlignment(		ReplacementStore->setAlignment(MaybeAlign(std::min(
MaybeAlign(std::min(ReplacementStore->getAlignment(),		ReplacementStore->getAlignment(), cast<StoreInst>(I)->getAlignment())));
cast<StoreInst>(I)->getAlignment())));
++NumStoresRemoved;		++NumStoresRemoved;
} else if (auto *ReplacementAlloca = dyn_cast<AllocaInst>(Repl)) {		} else if (auto *ReplacementAlloca = dyn_cast<AllocaInst>(Repl)) {
ReplacementAlloca->setAlignment(		ReplacementAlloca->setAlignment(
MaybeAlign(std::max(ReplacementAlloca->getAlignment(),		MaybeAlign(std::max(ReplacementAlloca->getAlignment(),
cast<AllocaInst>(I)->getAlignment())));		cast<AllocaInst>(I)->getAlignment())));
} else if (isa<CallInst>(Repl)) {		} else if (isa<CallInst>(Repl)) {
++NumCallsRemoved;		++NumCallsRemoved;
}		}
}		}

// Remove all the instructions in Candidates and replace their usage with Repl.		// Remove all the instructions in Candidates and replace their usage with Repl.
// Returns the number of instructions removed.		// Returns the number of instructions removed.
unsigned rauw(const SmallVecInsn &Candidates, Instruction *Repl,		unsigned GVNHoist::rauw(const SmallVecInsn &Candidates, Instruction *Repl,
MemoryUseOrDef *NewMemAcc) {		MemoryUseOrDef *NewMemAcc) {
unsigned NR = 0;		unsigned NR = 0;
for (Instruction *I : Candidates) {		for (Instruction *I : Candidates) {
if (I != Repl) {		if (I != Repl) {
++NR;		++NR;
updateAlignment(I, Repl);		updateAlignment(I, Repl);
if (NewMemAcc) {		if (NewMemAcc) {
// Update the uses of the old MSSA access with NewMemAcc.		// Update the uses of the old MSSA access with NewMemAcc.
MemoryAccess *OldMA = MSSA->getMemoryAccess(I);		MemoryAccess *OldMA = MSSA->getMemoryAccess(I);
OldMA->replaceAllUsesWith(NewMemAcc);		OldMA->replaceAllUsesWith(NewMemAcc);
MSSAUpdater->removeMemoryAccess(OldMA);		MSSAUpdater->removeMemoryAccess(OldMA);
}		}

Repl->andIRFlags(I);		Repl->andIRFlags(I);
combineKnownMetadata(Repl, I);		combineKnownMetadata(Repl, I);
I->replaceAllUsesWith(Repl);		I->replaceAllUsesWith(Repl);
// Also invalidate the Alias Analysis cache.		// Also invalidate the Alias Analysis cache.
MD->removeInstruction(I);		MD->removeInstruction(I);
I->eraseFromParent();		I->eraseFromParent();
}		}
}		}
return NR;		return NR;
}		}

// Replace all Memory PHI usage with NewMemAcc.		// Replace all Memory PHI usage with NewMemAcc.
void raMPHIuw(MemoryUseOrDef *NewMemAcc) {		void GVNHoist::raMPHIuw(MemoryUseOrDef *NewMemAcc) {
SmallPtrSet<MemoryPhi *, 4> UsePhis;		SmallPtrSet<MemoryPhi *, 4> UsePhis;
for (User *U : NewMemAcc->users())		for (User *U : NewMemAcc->users())
if (MemoryPhi *Phi = dyn_cast<MemoryPhi>(U))		if (MemoryPhi *Phi = dyn_cast<MemoryPhi>(U))
UsePhis.insert(Phi);		UsePhis.insert(Phi);

for (MemoryPhi *Phi : UsePhis) {		for (MemoryPhi *Phi : UsePhis) {
auto In = Phi->incoming_values();		auto In = Phi->incoming_values();
if (llvm::all_of(In, [&](Use &U) { return U == NewMemAcc; })) {		if (llvm::all_of(In, [&](Use &U) { return U == NewMemAcc; })) {
Phi->replaceAllUsesWith(NewMemAcc);		Phi->replaceAllUsesWith(NewMemAcc);
MSSAUpdater->removeMemoryAccess(Phi);		MSSAUpdater->removeMemoryAccess(Phi);
}		}
}		}
}		}

		void GVNHoist::removeAndReplace(Instruction I, Instruction Repl,
		MemoryAccess *NewMemAcc) {
		updateAlignment(I, Repl);
		if (NewMemAcc) {
		// Update the uses of the old MSSA access with NewMemAcc.
		MemoryAccess *OldMA = MSSA->getMemoryAccess(I);
		OldMA->replaceAllUsesWith(NewMemAcc);
		MSSAUpdater->removeMemoryAccess(OldMA);
		}

		Repl->andIRFlags(I);
		combineKnownMetadata(Repl, I);
		I->replaceAllUsesWith(Repl);
		// Also invalidate the Alias Analysis cache.
		MD->removeInstruction(I);
		I->eraseFromParent();
		}

		void GVNHoist::removeAndReplace(const SmallVecInsn &InstructionsToHoist,
		Instruction *Repl) {
		MemoryAccess *NewMemAcc = MSSA->getMemoryAccess(Repl);
		// Remove and rename all other instructions.
		for (Instruction *I : InstructionsToHoist)
		if (I != Repl) {
		removeAndReplace(I, Repl, NewMemAcc);
		}
		}

		void GVNHoist::removeMPhi(MemoryAccess *NewMemAcc) {
		// Remove MemorySSA phi nodes with the same arguments.
		SmallPtrSet<MemoryPhi *, 4> UsePhis;
		for (User *U : NewMemAcc->users())
		if (MemoryPhi *Phi = dyn_cast<MemoryPhi>(U))
		UsePhis.insert(Phi);

		for (auto *Phi : UsePhis) {
		auto In = Phi->incoming_values();
		if (all_of(In, [&](Use &U) { return U == NewMemAcc; })) {
		Phi->replaceAllUsesWith(NewMemAcc);
		MSSAUpdater->removeMemoryAccess(Phi);
		}
		}
		}

// Remove all other instructions and replace them with Repl.		// Remove all other instructions and replace them with Repl.
unsigned removeAndReplace(const SmallVecInsn &Candidates, Instruction *Repl,		unsigned GVNHoist::removeAndReplace(const SmallVecInsn &Candidates,
BasicBlock *DestBB, bool MoveAccess) {		Instruction Repl, BasicBlock DestBB,
		bool MoveAccess) {
MemoryUseOrDef *NewMemAcc = MSSA->getMemoryAccess(Repl);		MemoryUseOrDef *NewMemAcc = MSSA->getMemoryAccess(Repl);
if (MoveAccess && NewMemAcc) {		if (MoveAccess && NewMemAcc) {
// The definition of this ld/st will not change: ld/st hoisting is		// The definition of this ld/st will not change: ld/st hoisting is
// legal when the ld/st is not moved past its current definition.		// legal when the ld/st is not moved past its current definition.
MSSAUpdater->moveToPlace(NewMemAcc, DestBB, MemorySSA::End);		MSSAUpdater->moveToPlace(NewMemAcc, DestBB, MemorySSA::End);
}		}

// Replace all other instructions with Repl with memory access NewMemAcc.		// Replace all other instructions with Repl with memory access NewMemAcc.
unsigned NR = rauw(Candidates, Repl, NewMemAcc);		unsigned NR = rauw(Candidates, Repl, NewMemAcc);

// Remove MemorySSA phi nodes with the same arguments.		// Remove MemorySSA phi nodes with the same arguments.
if (NewMemAcc)		if (NewMemAcc)
raMPHIuw(NewMemAcc);		raMPHIuw(NewMemAcc);
return NR;		return NR;
}		}

// In the case Repl is a load or a store, we make all their GEPs		// In the case Repl is a load or a store, we make all their GEPs
// available: GEPs are not hoisted by default to avoid the address		// available: GEPs are not hoisted by default to avoid the address
// computations to be hoisted without the associated load or store.		// computations to be hoisted without the associated load or store.
bool makeGepOperandsAvailable(Instruction Repl, BasicBlock HoistPt,		bool GVNHoist::makeGepOperandsAvailable(
		Instruction Repl, BasicBlock HoistPt,
const SmallVecInsn &InstructionsToHoist) const {		const SmallVecInsn &InstructionsToHoist) const {
// Check whether the GEP of a ld/st can be synthesized at HoistPt.		// Check whether the GEP of a ld/st can be synthesized at HoistPt.
GetElementPtrInst *Gep = nullptr;		GetElementPtrInst *Gep = nullptr;
Instruction *Val = nullptr;		Instruction *Val = nullptr;
if (auto *Ld = dyn_cast<LoadInst>(Repl)) {		if (auto *Ld = dyn_cast<LoadInst>(Repl)) {
Gep = dyn_cast<GetElementPtrInst>(Ld->getPointerOperand());		Gep = dyn_cast<GetElementPtrInst>(Ld->getPointerOperand());
} else if (auto *St = dyn_cast<StoreInst>(Repl)) {		} else if (auto *St = dyn_cast<StoreInst>(Repl)) {
Gep = dyn_cast<GetElementPtrInst>(St->getPointerOperand());		Gep = dyn_cast<GetElementPtrInst>(St->getPointerOperand());
Val = dyn_cast<Instruction>(St->getValueOperand());		Val = dyn_cast<Instruction>(St->getValueOperand());
// Check that the stored value is available.		// Check that the stored value is available.
if (Val) {		if (Val) {
if (isa<GetElementPtrInst>(Val)) {		if (isa<GetElementPtrInst>(Val)) {
// Check whether we can compute the GEP at HoistPt.		// Check whether we can compute the GEP at HoistPt.
if (!allGepOperandsAvailable(Val, HoistPt))		if (!allGepOperandsAvailable(Val, HoistPt))
return false;		return false;
} else if (!DT->dominates(Val->getParent(), HoistPt))		} else if (!DT->dominates(Val->getParent(), HoistPt))
return false;		return false;
}		}
}		}

// Check whether we can compute the Gep at HoistPt.		// Check whether we can compute the Gep at HoistPt.
if (!Gep \|\| !allGepOperandsAvailable(Gep, HoistPt))		if (!Gep \|\| !allGepOperandsAvailable(Gep, HoistPt))
return false;		return false;

makeGepsAvailable(Repl, HoistPt, InstructionsToHoist, Gep);		makeGepsAvailable(Repl, HoistPt, InstructionsToHoist, Gep);

if (Val && isa<GetElementPtrInst>(Val))		if (Val && isa<GetElementPtrInst>(Val))
makeGepsAvailable(Repl, HoistPt, InstructionsToHoist, Val);		makeGepsAvailable(Repl, HoistPt, InstructionsToHoist, Val);

return true;		return true;
}		}

std::pair<unsigned, unsigned> hoist(HoistingPointList &HPL) {		std::pair<unsigned, unsigned> GVNHoist::hoist(HoistingPointList &HPL) {
unsigned NI = 0, NL = 0, NS = 0, NC = 0, NR = 0;		unsigned NI = 0, NL = 0, NS = 0, NC = 0, NR = 0;
for (const HoistingPointInfo &HP : HPL) {		for (const HoistingPointInfo &HP : HPL) {
// Find out whether we already have one of the instructions in HoistPt,		// Find out whether we already have one of the instructions in HoistPt,
// in which case we do not have to move it.		// in which case we do not have to move it.
BasicBlock *DestBB = HP.first;		BasicBlock *DestBB = HP.first;
const SmallVecInsn &InstructionsToHoist = HP.second;		const SmallVecInsn &InstructionsToHoist = HP.second;
Instruction *Repl = nullptr;		Instruction *Repl = nullptr;
for (Instruction *I : InstructionsToHoist)		for (Instruction *I : InstructionsToHoist)
if (I->getParent() == DestBB)		if (I->getParent() == DestBB)
// If there are two instructions in HoistPt to be hoisted in place:		// If there are two instructions in HoistPt to be hoisted in place:
// update Repl to be the first one, such that we can rename the uses		// update Repl to be the first one, such that we can rename the uses
// of the second based on the first.		// of the second based on the first.
if (!Repl \|\| firstInBB(I, Repl))		if (!Repl \|\| firstInBB(I, Repl))
Repl = I;		Repl = I;

// Keep track of whether we moved the instruction so we know whether we		// Keep track of whether we moved the instruction so we know whether we
// should move the MemoryAccess.		// should move the MemoryAccess.
bool MoveAccess = true;		bool MoveAccess = true;
if (Repl) {		if (Repl) {
// Repl is already in HoistPt: it remains in place.		// Repl is already in HoistPt: it remains in place.
assert(allOperandsAvailable(Repl, DestBB) &&		SmallVecVal V;
		assert(allOperandsAvailable(Repl, DestBB, V) &&
"instruction depends on operands that are not available");		"instruction depends on operands that are not available");
MoveAccess = false;		MoveAccess = false;
} else {		} else {
// When we do not find Repl in HoistPt, select the first in the list		// When we do not find Repl in HoistPt, select the first in the list
// and move it to HoistPt.		// and move it to HoistPt.
Repl = InstructionsToHoist.front();		Repl = InstructionsToHoist.front();

// We can move Repl in HoistPt only when all operands are available.		// We can move Repl in HoistPt only when all operands are available.
// The order in which hoistings are done may influence the availability		// The order in which hoistings are done may influence the availability
// of operands.		// of operands.
if (!allOperandsAvailable(Repl, DestBB)) {		SmallVecVal V;
		if (!allOperandsAvailable(Repl, DestBB, V)) {
// When HoistingGeps there is nothing more we can do to make the		// When HoistingGeps there is nothing more we can do to make the
// operands available: just continue.		// operands available: just continue.
if (HoistingGeps)		if (HoistingGeps)
continue;		continue;

// When not HoistingGeps we need to copy the GEPs.		// When not HoistingGeps we need to copy the GEPs.
if (!makeGepOperandsAvailable(Repl, DestBB, InstructionsToHoist))		if (!makeGepOperandsAvailable(Repl, DestBB, InstructionsToHoist))
continue;		continue;
}		}

// Move the instruction at the end of HoistPt.		// Move the instruction at the end of HoistPt.
Instruction *Last = DestBB->getTerminator();		Instruction *Last = DestBB->getTerminator();
MD->removeInstruction(Repl);		MD->removeInstruction(Repl);
Repl->moveBefore(Last);		Repl->moveBefore(Last);

DFSNumber[Repl] = DFSNumber[Last]++;		DFSNumber[Repl] = DFSNumber[Last]++;
}		}

NR += removeAndReplace(InstructionsToHoist, Repl, DestBB, MoveAccess);		NR += removeAndReplace(InstructionsToHoist, Repl, DestBB, MoveAccess);
		updateLocalStats(Repl, NI, NL, NS, NC);
if (isa<LoadInst>(Repl))
++NL;
else if (isa<StoreInst>(Repl))
++NS;
else if (isa<CallInst>(Repl))
++NC;
else // Scalar
++NI;
}		}

NumHoisted += NL + NS + NC + NI;		updateHoistStats(NI, NL, NS, NC, NR);
NumRemoved += NR;
NumLoadsHoisted += NL;
NumStoresHoisted += NS;
NumCallsHoisted += NC;
return {NI, NL + NC + NS};		return {NI, NL + NC + NS};
}		}

// Hoist all expressions. Returns Number of scalars hoisted		// Update MemorySSA if mem-refs are sunk.
// and number of non-scalars hoisted.		std::pair<unsigned, unsigned> GVNHoist::sink(GVNHoist::HoistingPointList &HPL) {
std::pair<unsigned, unsigned> hoistExpressions(Function &F) {		unsigned NI = 0, NL = 0, NS = 0, NC = 0;
InsnInfo II;
LoadInfo LI;		std::sort(HPL.begin(), HPL.end(),
StoreInfo SI;		[this](const HoistingPointInfo &HP1, const HoistingPointInfo &HP2) {
CallInfo CI;		// Sort in descending order to have last executing insn first in
for (BasicBlock *BB : depth_first(&F.getEntryBlock())) {		// HPL.
int InstructionNb = 0;		return !DFSInOrder(HP1.second[0], HP2.second[0]);
for (Instruction &I1 : *BB) {		});
// If I1 cannot guarantee progress, subsequent instructions
// in BB cannot be hoisted anyways.		for (const HoistingPointInfo &HP : HPL) {
if (!isGuaranteedToTransferExecutionToSuccessor(&I1)) {		BasicBlock *SinkPt = HP.first;
HoistBarrier.insert(BB);		const SmallVecInsn &InstructionsToHoist = HP.second;
		assert(InstructionsToHoist.size() == 2);
		Instruction *I0 = InstructionsToHoist[0];
		Instruction *I1 = InstructionsToHoist[1];

		// We need to check availability for I0 and I1 at SinkPt.
		// TODO: Additionally, we can still sink if the operand not available is
		// also getting sunk with available operands.
		SmallVecVal I0UnavOps, I1UnavOps;
		allOperandsAvailable(I0, SinkPt, I0UnavOps);
		allOperandsAvailable(I1, SinkPt, I1UnavOps);

		// Sink only when all operands are available, or only one is unavailable.
		if (I0UnavOps.size() != I1UnavOps.size() \|\| I1UnavOps.size() > 1)
		continue;

		// Keep I0 and remove I1 and PN
		PHINode *PN = cast<PHINode>(I0->user_back());
		MemoryAccess *I0MemAccess = MSSA->getMemoryAccess(I0);
		LLVM_DEBUG(dbgs() << "\nSinking:" << I0 << I1 << *PN);

		updateAlignment(I1, I0);
		I0->andIRFlags(I1);
		combineKnownMetadata(I0, I1);
		// I1->replaceAllUsesWith(I0); Not required because I1 used once in PHI.
		// Also invalidate the Alias Analysis cache.
		MD->removeInstruction(I1);
		MD->removeInstruction(PN);
		PN->replaceAllUsesWith(I0);
		// FIXME: Update Memory SSA
		// Update the uses of the old MSSA access with I0.
		// Remove MSSA for I1
		// Remove MSSA for PN, and replace with I0
		// Combine metadata of I0 and I1
		// Memory PHI?
		// assert(0);
		bool found = false;
		auto It = SinkPt->begin();
		while (It != SinkPt->end()) { // Find the last PHI to insert I0
		if (!isa<PHINode>(It))
break;		break;
		if (PN == &*It)
		found = true;
		++It;
		}
		assert(found && "PHI replaced in wrong BB");
		BasicBlock *I0BB = I0->getParent();
		I0->moveBefore(*SinkPt, It);

		if (I0UnavOps.empty())
		PN->eraseFromParent();
		else {
		Value *V = I0UnavOps[0];
		PHINode *NewPN = PHINode::Create(V->getType(), 2, "", PN);
		PN->eraseFromParent();
		// Move the incoming operands in this PHI.
		NewPN->addIncoming(I0UnavOps[0], I0BB);
		NewPN->addIncoming(I1UnavOps[0], I1->getParent());
		bool OpReplaced = false;
		for (int i = 0, e = I0->getNumOperands(); i != e; ++i)
		if (I0->getOperand(i) == I0UnavOps[0]) {
		I0->setOperand(i, NewPN);
		OpReplaced = true;
		}
		assert(OpReplaced && "Illegal transformation");
		}

		if (I0MemAccess) { // I0, I1 are mem-refs
		assert(MSSA->getMemoryAccess(I1));
		// This MSSA update is only for mem-refs with one use in a PHI in Succ BB.
		MemoryAccess *I1MemAccess = MSSA->getMemoryAccess(I1);
		MemoryUseOrDef *I0UD = cast<MemoryUseOrDef>(I0MemAccess);
		MemoryUseOrDef *I1UD = cast<MemoryUseOrDef>(I1MemAccess);

		MemoryAccess *I0Def = I0UD->getDefiningAccess();
		MemoryAccess *I1Def = I1UD->getDefiningAccess();

		MemoryPhi *NewPHI = MSSA->getMemoryAccess(SinkPt);
		if (!NewPHI) {
		NewPHI = MSSA->createMemoryPhi(SinkPt);
		NewPHI->addIncoming(I0Def, I0->getParent());
		NewPHI->addIncoming(I1Def, I1->getParent());
		}
		I0UD->setDefiningAccess(NewPHI);
		MSSAUpdater->removeMemoryAccess(I0MemAccess);
		MSSAUpdater->removeMemoryAccess(I1MemAccess);
		}

		I1->eraseFromParent();
		NumSunk++;
		updateLocalStats(I0, NI, NL, NS, NC);
		}
		return {NI, NL + NC + NS};
}		}
// Only hoist the first instructions in BB up to MaxDepthInBB. Hoisting
// deeper may increase the register pressure and compilation time.
if (MaxDepthInBB != -1 && InstructionNb++ >= MaxDepthInBB)
break;

		bool GVNHoist::getVN(Instruction *I) {
// Do not value number terminator instructions.		// Do not value number terminator instructions.
if (I1.isTerminator())		if (I->isTerminator())
break;		return false;

if (auto *Load = dyn_cast<LoadInst>(&I1))		if (auto *Load = dyn_cast<LoadInst>(I))
LI.insert(Load, VN);		LI.insert(Load, VN);
else if (auto *Store = dyn_cast<StoreInst>(&I1))		else if (auto *Store = dyn_cast<StoreInst>(I))
SI.insert(Store, VN);		SI.insert(Store, VN);
else if (auto *Call = dyn_cast<CallInst>(&I1)) {		else if (auto *Call = dyn_cast<CallInst>(I)) {
if (auto *Intr = dyn_cast<IntrinsicInst>(Call)) {		if (auto *Intr = dyn_cast<IntrinsicInst>(Call)) {
if (isa<DbgInfoIntrinsic>(Intr) \|\|		if (isa<DbgInfoIntrinsic>(Intr) \|\|
Intr->getIntrinsicID() == Intrinsic::assume \|\|		Intr->getIntrinsicID() == Intrinsic::assume \|\|
Intr->getIntrinsicID() == Intrinsic::sideeffect)		Intr->getIntrinsicID() == Intrinsic::sideeffect)
continue;		return true;
}		}
if (Call->mayHaveSideEffects())		if (Call->mayHaveSideEffects())
break;		return false;

if (Call->isConvergent())		if (Call->isConvergent())
break;		return false;

CI.insert(Call, VN);		CI.insert(Call, VN);
} else if (HoistingGeps \|\| !isa<GetElementPtrInst>(&I1))		} else if (HoistingGeps \|\| !isa<GetElementPtrInst>(I))
// Do not hoist scalars past calls that may write to memory because		// Do not hoist scalars past calls that may write to memory because
// that could result in spills later. geps are handled separately.		// that could result in spills later. geps are handled separately.
// TODO: We can relax this for targets like AArch64 as they have more		// TODO: We can relax this for targets like AArch64 as they have more
// registers than X86.		// registers than X86.
II.insert(&I1, VN);		II.insert(I, VN);

		return true;
		}

		void GVNHoist::barriersAndVNs(Function &F, bool Sinking) {
		for (BasicBlock *BB : depth_first(&F.getEntryBlock())) {
		int InstructionNb = 0;
		bool BarrierFound = false;
		SmallVector<std::pair<unsigned, unsigned>, 4> CurrentBBVNs;
		for (Instruction &I1 : *BB) {
		// If I1 cannot guarantee progress, subsequent instructions
		// in BB cannot be hoisted anyways.
		// TODO: For sink operation anything after hoist barrier is okay to sink
		// but nothing before the hoist barrier.
		if (!isGuaranteedToTransferExecutionToSuccessor(&I1)) {
		// NumBarriers++;
		HoistBarrier.insert(BB);
		BarrierFound = true;
		break;
		}

		// Only hoist the first instructions in BB up to MaxDepthInBB. Hoisting
		// deeper may increase the register pressure and compilation time.
		if (MaxDepthInBB != -1 && InstructionNb++ >= MaxDepthInBB)
		break;

		if (!getVN(&I1))
		break;
		}
		if (Sinking && BarrierFound) {
		// TODO: clear VNs for this basicblock.
		// Instructions after the last barriers can be sunk.
		for (BasicBlock::reverse_iterator It = BB->rbegin(), Ie = BB->rend();
		It != Ie; ++It) {
		if (!isGuaranteedToTransferExecutionToSuccessor(&*It))
		break;
		if (!getVN(&*It))
		break;
		}
		}
}		}
}		}

		std::pair<unsigned, unsigned> GVNHoist::hoistExpressions(Function &F) {
		barriersAndVNs(F, false);
HoistingPointList HPL;		HoistingPointList HPL;
computeInsertionPoints(II.getVNTable(), HPL, InsKind::Scalar);		computeInsertionPoints(II.getVNTable(), HPL, InsKind::Scalar);
computeInsertionPoints(LI.getVNTable(), HPL, InsKind::Load);		computeInsertionPoints(LI.getVNTable(), HPL, InsKind::Load);
computeInsertionPoints(SI.getVNTable(), HPL, InsKind::Store);		computeInsertionPoints(SI.getVNTable(), HPL, InsKind::Store);
computeInsertionPoints(CI.getScalarVNTable(), HPL, InsKind::Scalar);		computeInsertionPoints(CI.getScalarVNTable(), HPL, InsKind::Scalar);
computeInsertionPoints(CI.getLoadVNTable(), HPL, InsKind::Load);		computeInsertionPoints(CI.getLoadVNTable(), HPL, InsKind::Load);
computeInsertionPoints(CI.getStoreVNTable(), HPL, InsKind::Store);		computeInsertionPoints(CI.getStoreVNTable(), HPL, InsKind::Store);
return hoist(HPL);		return hoist(HPL);
}		}
};
		std::pair<unsigned, unsigned> GVNHoist::sinkExpressions(Function &F) {
		barriersAndVNs(F, false); // First clear VNs before barrier in that BB. see
		// TODO then set it to true.
		std::pair<unsigned, unsigned> Count;
		bool ContinueSink = false;
		unsigned MaxSinkIter = MaxSinkChainLength;
		unsigned SinkIter = 0;
		do {
		// Iteratively sink as long as there are candidates. Bail out after
		// three iterations.
		HoistingPointList HPL;
		findSinkableInsn(II.getVNTable(), HPL, InsKind::Scalar);
		findSinkableInsn(LI.getVNTable(), HPL, InsKind::Load);
		findSinkableInsn(SI.getVNTable(), HPL, InsKind::Store);
		findSinkableInsn(CI.getScalarVNTable(), HPL, InsKind::Scalar);
		findSinkableInsn(CI.getLoadVNTable(), HPL, InsKind::Load);
		findSinkableInsn(CI.getStoreVNTable(), HPL, InsKind::Store);

		auto ThisCount = sink(HPL);
		Count.first += ThisCount.first;
		Count.second += ThisCount.second;
		unsigned SunkCount = ThisCount.first + ThisCount.second;
		++SinkIter;
		ContinueSink = SunkCount > 0 ? MaxSinkIter > SinkIter : false;
		} while (ContinueSink);
		return Count;
		}

		void GVNHoist::constructDJGraph(DomLevelsT &DomLevels, DenseSet<EdgeT> &JEdges,
		BBLevelT &BBLevels) {
		for (auto DFI = df_begin(DT->getRootNode()), DFE = df_end(DT->getRootNode());
		DFI != DFE; ++DFI) {
		// Since getPathLength is inclusive of both the terminal nodes
		// i.e., Entry and *DFI so decrease by 1.
		unsigned Depth = DFI.getPathLength() - 1;
		BasicBlock BB = (DFI)->getBlock();
		DomLevels[BB] = Depth;
		BBLevels[Depth].push_back(BB);
		for (BasicBlock *Succ : successors(BB))
		if (!DT->properlyDominates(BB, Succ)) {
		JEdges.insert(std::make_pair(BB, Succ));
		}
		}
		}

		bool GVNHoist::constructMergeSet(DomLevelsT &DomLevels, DenseSet<EdgeT> &JEdges,
		BBLevelT &BBLevels) {
		bool Repeat = false;
		DenseSet<EdgeT> VisJEdges; // Visited J Edges.
		unsigned PrevLev = 0;
		for (std::pair<const unsigned, BBLevelKeyT> &P : BBLevels) {
		assert(PrevLev <= P.first);
		PrevLev = P.first;
		for (BasicBlock *CurrBB : P.second) {
		for (auto PB = pred_begin(CurrBB), PE = pred_end(CurrBB); PB != PE;
		++PB) {
		EdgeT Edge(*PB, CurrBB); // For all incoming edges to CurrBB.
		if (JEdges.count(Edge) && !VisJEdges.count(Edge)) {
		VisJEdges.insert(Edge); // Visit
		BasicBlock *Src = Edge.first;
		BasicBlock *Dst = Edge.second;
		BasicBlock *INode = nullptr;
		MergeSet[Dst].insert(Dst); // The target of JEdge.
		while (DomLevels[Src] >= DomLevels[Dst]) { // A backedge.
		LLVM_DEBUG(dbgs() << "\nVisiting: " << Src->getName() << " -> "
		<< Dst->getName());
		// Merge (tmp) = Merge (tmp) U Merge (tnode) U { tnode }
		// MergeSet(tnode) contains tnode.
		MergeSet[Src].insert(MergeSet[Dst].begin(), MergeSet[Dst].end());
		INode = Src;
		LLVM_DEBUG(dbgs() << ", IDom of " << Src->getName() << " is ");
		Src = DT->getNode(Src)->getIDom()->getBlock();
		LLVM_DEBUG(dbgs() << Src->getName());
		}
		for (auto PINode = pred_begin(INode), PENode = pred_end(INode);
		PINode != PENode; ++PINode) { // INode is an ancestor of SNode.
		EdgeT Edge(*PINode, INode);
		if (VisJEdges.count(Edge)) {
		assert(JEdges.count(Edge));
		BasicBlock SNode = PINode;
		// Check inconsistency, MergeSet[Dest] subset of MergeSet[Src]
		if (!isSubset(MergeSet[INode], MergeSet[SNode]))
		Repeat = true;
		}
		}
		}
		}
		}
		}
		return Repeat;
		}

		const Instruction GVNHoist::lastUser(const Instruction I,
		const Value *Val) const {
		// TODO: Make isLiveOutUsingMergeSet take const parameters.
		assert(!isLiveOutUsingMergeSet(const_cast<BasicBlock *>(I->getParent()),
		const_cast<Value *>(Val)));
		const BasicBlock *BB = I->getParent();
		BasicBlock::const_iterator BI(I), BE = BB->end();
		unsigned ICount = std::distance(BI, BE);
		// If there are less #uses than ICount it is cheaper to iterate on Val->users.
		if (Val->getNumUses() <= ICount) { // Iterate on uses
		for (const User *U : Val->users()) {
		// User maybe weird gep that is not an instruction.
		if (const Instruction *UserI = dyn_cast<Instruction>(U)) {
		if (UserI != I && UserI->getParent() == BB) {
		if (firstInBB(I, UserI)) // I precedes another Use => not a kill.
		return UserI;
		}
		}
		}
		return I;
		}
		// else Iterate on Instructions
		for (++BI; BI != BE; ++BI) {
		for (unsigned i = 0; i < BI->getNumOperands(); ++i)
		if (BI->getOperandUse(i).get() == Val)
		return &*BI;
		}
		return I;
		}

		bool GVNHoist::isLiveOutUsingMergeSet(BasicBlock BB, Value Val) const {
		assert(BB);
		const BasicBlock *ValDefBB = nullptr; // BasicBlock defining Val
		if (Instruction *I = dyn_cast<Instruction>(Val))
		ValDefBB = I->getParent();
		// FIXME!
		// We are assuming when DefBB is not defined then the value is a parameter.

		// Case when Val is defined in BB, if any of the use is outside BB (DefBB)
		// then it must be live-out.
		if (ValDefBB == BB)
		for (User *U : Val->users()) {
		if (cast<Instruction>(U)->getParent() != BB)
		return true;
		}

		// Mr(n) = M(n) U {n}; Create a new set from the merge set
		// Ms(n) = Ms(n) U Mr(w); For each successor w of BB
		SmallSetBB Ms; // Ms = null-set
		for (BasicBlock *Succ : successors(BB)) {
		Ms.insert(Succ); // Mr(Succ) = Succ U M(Succ)
		for (BasicBlock *BB : MergeSet.lookup(Succ))
		Ms.insert(BB); // M(Succ)
		}

		// Iterate over all the uses of Val, if any of its users is in the mergeset
		// of \p BB then Val is LiveOut from BB.
		for (User *U : Val->users()) {
		BasicBlock *UserDefBB = nullptr;
		if (Instruction *I = dyn_cast<Instruction>(U))
		UserDefBB = I->getParent();
		else // Assuming live-out conservatively, the user can be a global
		// TODO: maybe return false is okay??
		return true; // llvm_unreachable("User is not an instruction.");
		// For globals, skip the MergeSet checks.
		if (UserDefBB->getParent() != BB->getParent())
		continue;
		while (UserDefBB && (UserDefBB != ValDefBB)) {
		if (Ms.count(UserDefBB)) // if t /\ Ms(n) then return true;
		return true;
		auto N = DT->getNode(UserDefBB);
		if (auto IDom = N->getIDom())
		UserDefBB = IDom->getBlock();
		else // Reached the beginning of the function => maybe a global.
		return false; // Assume that global doesn't flow live-out from BB
		}
		}
		return false;
		}

class GVNHoistLegacyPass : public FunctionPass {		class GVNHoistLegacyPass : public FunctionPass {
public:		public:
static char ID;		static char ID;

GVNHoistLegacyPass() : FunctionPass(ID) {		GVNHoistLegacyPass() : FunctionPass(ID) {
initializeGVNHoistLegacyPassPass(*PassRegistry::getPassRegistry());		initializeGVNHoistLegacyPassPass(*PassRegistry::getPassRegistry());
}		}
▲ Show 20 Lines • Show All 58 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/broken-mod-sink.ll

This file was added.

				; RUN: opt -S -gvn-hoist -gvn-hoist-check-profitability=false < %s \| FileCheck %s

				; Check that all and, inttoptr, and bitcast is sunk.
				; CHECK-LAEBL: @baz0
				; CHECK-LABEL: bb9:
				; CHECK: and
				; CHECK: inttoptr
				; CHECK: bitcast


				; ModuleID = 'broken-mod.ll'

				%struct.pluto = type { i32 (...)**, i8, i8, i16, i32 }

				define void @baz0(i64 %a) {
				bb:
				br i1 undef, label %bb1, label %bb4

				bb1: ; preds = %bb
				%tmp = and i64 %a, -8
				%tmp2 = inttoptr i64 %tmp to i8*
				%tmp3 = bitcast i8* %tmp2 to %struct.pluto*
				br label %bb9

				bb4: ; preds = %bb
				br i1 undef, label %bb5, label %bb11

				bb5: ; preds = %bb4
				%tmp6 = and i64 %a, -8
				%tmp7 = inttoptr i64 %tmp6 to i8*
				%tmp8 = bitcast i8* %tmp7 to %struct.pluto*
				br label %bb9

				bb9: ; preds = %bb5, %bb1
				%tmp10 = phi %struct.pluto* [ %tmp8, %bb5 ], [ %tmp3, %bb1 ]
				unreachable

				bb11: ; preds = %bb4
				ret void
				}

				; CHECK-LABEL: @baz
				; CHECK-LABEL: bb9:
				; CHECK: bitcast
				; Function Attrs: nounwind uwtable
				define void @baz() {
				bb:
				%tmp = and i64 undef, -8
				%tmp2 = inttoptr i64 %tmp to i8*
				br i1 undef, label %bb1, label %bb4

				bb1: ; preds = %bb
				%tmp3 = bitcast i8* %tmp2 to %struct.pluto*
				br label %bb9

				bb4: ; preds = %bb
				br i1 undef, label %bb5, label %bb11

				bb5: ; preds = %bb4
				%tmp8 = bitcast i8* %tmp2 to %struct.pluto*
				br label %bb9

				bb9: ; preds = %bb5, %bb1
				%tmp10 = phi %struct.pluto* [ %tmp8, %bb5 ], [ %tmp3, %bb1 ]
				unreachable

				bb11: ; preds = %bb4
				ret void
				}

				; CHECK-LABEL: @baz1
				; CHECK-LABEL: bb9:
				; CHECK: inttoptr
				; CHECK: bitcast
				define void @baz1() {
				bb:
				%tmp = and i64 undef, -8
				br i1 undef, label %bb1, label %bb4

				bb1: ; preds = %bb
				%tmp2 = inttoptr i64 %tmp to i8*
				%tmp10 = bitcast i8* %tmp2 to %struct.pluto*
				br label %bb9

				bb4: ; preds = %bb
				br i1 undef, label %bb5, label %bb11

				bb5: ; preds = %bb4
				%tmp3 = inttoptr i64 %tmp to i8*
				%tmp8 = bitcast i8* %tmp3 to %struct.pluto*
				br label %bb9

				bb9: ; preds = %bb5, %bb1
				%tmp11 = phi %struct.pluto* [ %tmp8, %bb5 ], [ %tmp10, %bb1 ]
				unreachable

				bb11: ; preds = %bb4
				ret void
				}

llvm/test/Transforms/GVNHoist/dj-edge-detect-das-taco.ll

This file was added.

				; RUN: opt -S -gvn-hoist -debug < %s 2>&1 \| FileCheck %s
				; REQUIRES: asserts

				; Testcase taken from:
				; Das, Dibyendu, B. Dupont De Dinechin, and Ramakrishna Upadrasta. "Efficient
				; liveness computation using merge sets and dj-graphs." ACM Transactions on
				; Architecture and Code Optimization (TACO) 8.4 (2012): 27.

				; check the J edges
				; CHECK: Found a JEdge: bb10 -> bb8
				; CHECK: Found a JEdge: bb4 -> bb5
				; CHECK: Found a JEdge: bb5 -> bb6
				; CHECK: Found a JEdge: bb6 -> bb5
				; CHECK: Found a JEdge: bb7 -> bb2
				; CHECK: Found a JEdge: bb9 -> bb6

				; check the mergesets
				; CHECK: MergeSet of: bb10: bb2, bb8, bb5, bb6,
				; CHECK: MergeSet of: bb2: bb2,
				; CHECK: MergeSet of: bb3: bb2,
				; CHECK: MergeSet of: bb4: bb2, bb5, bb6,
				; CHECK: MergeSet of: bb5: bb2, bb5, bb6,
				; CHECK: MergeSet of: bb6: bb2, bb5, bb6,
				; CHECK: MergeSet of: bb7: bb2,
				; CHECK: MergeSet of: bb8: bb2, bb8, bb5, bb6,
				; CHECK: MergeSet of: bb9: bb2, bb8, bb5, bb6,


				define void @irreducible_loop() {
				bb1:
				br label %bb2

				bb2:
				br i1 undef, label %bb3, label %bb11

				bb3:
				br i1 undef, label %bb4, label %bb8

				bb4:
				br label %bb5

				bb5:
				br label %bb6

				bb6:
				br i1 undef, label %bb7, label %bb5

				bb7:
				br label %bb2

				bb8:
				br label %bb9

				bb9:
				br i1 undef, label %bb6, label %bb10

				bb10:
				br label %bb8

				bb11:
				ret void
				}
				No newline at end of file

llvm/test/Transforms/GVNHoist/dj-edge-detect.ll

This file was added.

				; RUN: opt -S -gvn-hoist -debug < %s 2>&1 \| FileCheck %s
				; REQUIRES: asserts

				; check the J edges
				; CHECK: Found a JEdge: bb10 -> bb18
				; CHECK: Found a JEdge: bb13 -> bb18
				; CHECK: Found a JEdge: bb15 -> bb18
				; CHECK: Found a JEdge: bb18 -> bb6
				; CHECK: Found a JEdge: bb2 -> bb3
				; CHECK: Found a JEdge: bb8 -> bb18


				; check the mergesets
				; CHECK: MergeSet of: bb10: bb3, bb6, bb18,
				; CHECK: MergeSet of: bb13: bb3, bb6, bb18,
				; CHECK: MergeSet of: bb15: bb3, bb6, bb18,
				; CHECK: MergeSet of: bb18: bb3, bb6, bb18,
				; CHECK: MergeSet of: bb2: bb3,
				; CHECK: MergeSet of: bb3: bb3,
				; CHECK: MergeSet of: bb6: bb3, bb6,
				; CHECK: MergeSet of: bb8: bb3, bb6, bb18,


				%0 = type { %1, %8 }
				%1 = type { %2, %3 }
				%2 = type { i32, i32, i32, i32, i32, i8, i32 }
				%3 = type { %4, i32, i32, %7*, i8, i8, i8, i8, i8 }
				%4 = type { i8, i32, i8*, %5 }
				%5 = type { %6 }
				%6 = type { i16 }
				%7 = type { i8, i8, i8, i8, i8* }
				%8 = type <{ i8, i8, i32, %9, %17, %19 }>
				%9 = type { %10, i8, i8, i8 (i8), i32* (i8), i8, %15, i32, i32, i8, %16, i32, i8, i8, i8, i8* }
				%10 = type { i8, %11, i8, %12, %13, i8, i32 (i8)*, i32, i8, %14 }
				%11 = type { i8, i32 (i8, %12, i8), i32 (i8), i32 (i8, i32), %16 (i8), i32 (i8), i32 (i8), i32 (...), i32 (...), i32 (...), void (i8, i32), void (i8, i32, i8, i32), i32 (i8, i32, i8)* }
				%12 = type { i8, i32 (i8, i32, i8, i32, i8), i32 (i8, i32, i8), void (i8, i8, i32), i32 (i8), i32 (i8, i32, i32) }
				%13 = type { i32 (i8, i8, i32), i32 (i8, i32, i32)* }
				%14 = type { i32, i32, i8, i8, i8 }
				%15 = type { i32, i32, i32 }
				%16 = type { i16, i16, i32, i32 }
				%17 = type { i8, i8, i16 (i8, i8, i16), i32 (i8, i8, i32), i32 (%16, i32)*, %18, i32, i32 }
				%18 = type { i8, i8 (i8, i8, i32)* }
				%19 = type { %20, i32, i32 }
				%20 = type { i32, i32, [8 x i8], i8, i8, i16 }

				@global = external hidden unnamed_addr constant [10 x i8], align 1

				define void @ham(i8* nocapture readnone %arg) local_unnamed_addr {
				bb:
				%tmp = alloca %0*, align 8
				%tmp1 = bitcast %0** %tmp to i8*
				br label %bb3

				bb2: ; preds = %bb6
				br label %bb3

				bb3: ; preds = %bb2, %bb
				%tmp4 = load %0, %0* %tmp, align 8, !tbaa !0
				%tmp5 = call i32 @quux(i8* %tmp1)
				br label %bb6

				bb6: ; preds = %bb18, %bb3
				%tmp7 = phi i32 [ %tmp5, %bb3 ], [ %tmp19, %bb18 ]
				switch i32 %tmp7, label %bb15 [
				i32 0, label %bb2
				i32 35, label %bb8
				i32 11, label %bb10
				i32 12, label %bb13
				]

				bb8: ; preds = %bb6
				%tmp9 = load %0, %0* %tmp, align 8, !tbaa !0
				br label %bb18

				bb10: ; preds = %bb6
				%tmp11 = load %0, %0* %tmp, align 8, !tbaa !0
				%tmp12 = call i32 @quux(i8* %tmp1)
				br label %bb18

				bb13: ; preds = %bb6
				%tmp14 = call i32 @quux(i8* %tmp1)
				br label %bb18

				bb15: ; preds = %bb6
				%tmp16 = load %0, %0* %tmp, align 8, !tbaa !0
				%tmp17 = call i32 @quux(i8* null)
				br label %bb18

				bb18: ; preds = %bb15, %bb13, %bb10, %bb8
				%tmp19 = phi i32 [ %tmp17, %bb15 ], [ %tmp14, %bb13 ], [ %tmp12, %bb10 ], [ 0, %bb8 ]
				br label %bb6
				}

				; Function Attrs: argmemonly nounwind
				declare void @llvm.lifetime.start(i64, i8* nocapture) #0

				declare i32 @zot(%0**)

				declare i32 @quux(i8*)

				declare i32 @wombat(i8* nocapture readonly, ...)

				attributes #0 = { argmemonly nounwind }

				!0 = !{!1, !1, i64 0}
				!1 = !{!"any pointer", !2, i64 0}
				!2 = !{!"omnipotent char", !3, i64 0}
				!3 = !{!"Simple C/C++ TBAA"}

llvm/test/Transforms/GVNHoist/gvn-sink-test-1.ll

This file was added.

				; RUN: opt -S -gvn-hoist -debug < %s
				; ModuleID = 'gvn-sink-test.ll'
				source_filename = "bugpoint-output-9bbed91.bc"
				target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
				target triple = "x86_64-unknown-linux-gnu"

				%struct.spam = type opaque

				declare void @bar() local_unnamed_addr #0

				; Function Attrs: noinline nounwind uwtable
				define fastcc void @wombat() unnamed_addr #1 {
				bb:
				br i1 undef, label %bb1, label %bb10

				bb1: ; preds = %bb
				br i1 undef, label %bb2, label %bb3

				bb2: ; preds = %bb1
				unreachable

				bb3: ; preds = %bb1
				br i1 undef, label %bb4, label %bb5

				bb4: ; preds = %bb3
				%tmp = load %struct.spam, %struct.spam* undef, align 8
				br label %bb7

				bb5: ; preds = %bb3
				%tmp6 = load %struct.spam, %struct.spam* undef, align 8
				br label %bb7

				bb7: ; preds = %bb5, %bb4
				%tmp8 = phi %struct.spam* [ %tmp, %bb4 ], [ %tmp6, %bb5 ]
				call void @bar() #2
				%tmp9 = load %struct.spam, %struct.spam* undef, align 8
				unreachable

				bb10: ; preds = %bb
				ret void
				}

				attributes #0 = { "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="false" "no-infs-fp-math"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
				attributes #1 = { noinline nounwind uwtable "correctly-rounded-divide-sqrt-fp-math"="false" "disable-tail-calls"="false" "less-precise-fpmad"="false" "no-frame-pointer-elim"="false" "no-infs-fp-math"="false" "no-jump-tables"="false" "no-nans-fp-math"="false" "no-signed-zeros-fp-math"="false" "no-trapping-math"="false" "stack-protector-buffer-size"="8" "target-cpu"="x86-64" "target-features"="+fxsr,+mmx,+sse,+sse2,+x87" "unsafe-fp-math"="false" "use-soft-float"="false" }
				attributes #2 = { nounwind }

				!llvm.ident = !{!0}

				!0 = !{!"clang version 5.0.0 "}

llvm/test/Transforms/GVNHoist/hoist-convergent.ll

	; RUN: opt -gvn-hoist -S < %s \| FileCheck %s			; RUN: opt -gvn-hoist -gvn-hoist-check-profitability=false -S < %s \| FileCheck %s

	; Check that convergent calls are not hoisted.			; Check that convergent calls are not hoisted.
	;			;
	; CHECK-LABEL: @no_convergent_func_hoisting(			; CHECK-LABEL: @no_convergent_func_hoisting(
	; CHECK: if.then:			; CHECK: if.then:
	; CHECK: call float @convergent_func(			; CHECK: call float @convergent_func(

	; CHECK: if.else:			; CHECK: if.else:
	▲ Show 20 Lines • Show All 84 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/hoist-md.ll

	; RUN: opt -S -gvn-hoist < %s \| FileCheck %s			; RUN: opt -S -gvn-hoist -gvn-hoist-check-profitability=false < %s \| FileCheck %s
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	define void @test1(i1 %b, i32* %x) {			define void @test1(i1 %b, i32* %x) {
	entry:			entry:
	br i1 %b, label %if.then, label %if.else			br i1 %b, label %if.then, label %if.else

	if.then: ; preds = %entry			if.then: ; preds = %entry
	▲ Show 20 Lines • Show All 112 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/hoist-mssa.ll

	; RUN: opt -S -gvn-hoist -newgvn < %s \| FileCheck %s			; RUN: opt -S -gvn-hoist -gvn-hoist-check-profitability=false -newgvn < %s \| FileCheck %s

	; Check that store hoisting works: there should be only one store left.			; Check that store hoisting works: there should be only one store hoisted
	; CHECK-LABEL: @getopt			; Hoist load as well because the @optind global is killed in bb4 (at least once)
				; CHECK-LABEL: @getopt(
				; CHECK: load i32
				; CHECK: add nsw i32
	; CHECK: store i32			; CHECK: store i32
	; CHECK-NOT: store i32			; CHECK-NOT: store i32

	@optind = external global i32, align 4			@optind = external global i32, align 4

	define void @getopt() {			define void @getopt() {
	bb:			bb:
	br label %bb1			br label %bb1
	Show All 29 Lines

	bb12: ; preds = %bb9, %bb5			bb12: ; preds = %bb9, %bb5
	br label %bb13			br label %bb13

	bb13: ; preds = %bb12, %bb7, %bb2			bb13: ; preds = %bb12, %bb7, %bb2
	ret void			ret void
	}			}

				; Check that store hoisting works: there should be only one store left because
				; load should also be hoisted in this case.
				; CHECK-LABEL: @getopt_hoist_load(
				; CHECK: store i32
				; CHECK-NOT: store i32
				define void @getopt_hoist_load(i32* %p1) {
				bb:
				br label %bb1

				bb1: ; preds = %bb
				br i1 undef, label %bb2, label %bb3

				bb2: ; preds = %bb1
				br label %bb13

				bb3: ; preds = %bb1
				br i1 undef, label %bb4, label %bb9

				bb4: ; preds = %bb3
				%tmp = load i32, i32* @optind, align 4
				br i1 undef, label %bb5, label %bb7

				bb5: ; preds = %bb4
				%tmp6 = add nsw i32 %tmp, 1
				store i32 %tmp6, i32* %p1, align 4
				br label %bb12

				bb7: ; preds = %bb4
				%tmp8 = add nsw i32 %tmp, 1
				store i32 %tmp8, i32* %p1, align 4
				br label %bb13

				bb9: ; preds = %bb3
				%tmp10 = load i32, i32* @optind, align 4
				%tmp11 = add nsw i32 %tmp10, 1
				store i32 %tmp11, i32* %p1, align 4
				br label %bb12

				bb12: ; preds = %bb9, %bb5
				br label %bb13

				bb13: ; preds = %bb12, %bb7, %bb2
				ret void
				}

	@GlobalVar = internal global float 1.000000e+00			@GlobalVar = internal global float 1.000000e+00

	; Check that we hoist stores and remove the MSSA phi node.			; Check that we hoist stores and remove the MSSA phi node.
	; CHECK-LABEL: @hoistStoresUpdateMSSA			; CHECK-LABEL: @hoistStoresUpdateMSSA
	; CHECK: store float			; CHECK: store float
	; CHECK-NOT: store float			; CHECK-NOT: store float
	define float @hoistStoresUpdateMSSA(float %d) {			define float @hoistStoresUpdateMSSA(float %d) {
	entry:			entry:
	Show All 12 Lines

llvm/test/Transforms/GVNHoist/hoist-pr20242.ll

	; RUN: opt -gvn-hoist -newgvn -gvn-hoist -S < %s \| FileCheck %s			; RUN: opt -gvn-hoist -newgvn -gvn-hoist -gvn-hoist-check-profitability=false -S < %s \| FileCheck %s
	; Test to demonstrate that newgvn creates opportunities for			; Test to demonstrate that newgvn creates opportunities for
	; more gvn-hoist when sibling branches contain identical expressions.			; more gvn-hoist when sibling branches contain identical expressions.

	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; Check that all "or" expressions are hoisted.			; Check that all "or" expressions are hoisted.
	; CHECK-LABEL: @encode			; CHECK-LABEL: @encode
	▲ Show 20 Lines • Show All 68 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/hoist-pr22005.ll

	; RUN: opt -gvn-hoist -S < %s \| FileCheck %s			; RUN: opt -gvn-hoist -gvn-hoist-check-profitability=false -S < %s \| FileCheck %s
	target datalayout = "e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	; Check that all "sub" expressions are hoisted.			; Check that all "sub" expressions are hoisted.
	; CHECK-LABEL: @fun			; CHECK-LABEL: @fun
	; CHECK: sub i64			; CHECK: sub i64
	; CHECK-NOT: sub i64			; CHECK-NOT: sub i64

	Show All 21 Lines

llvm/test/Transforms/GVNHoist/hoist-recursive-geps.ll

	; RUN: opt -gvn-hoist -newgvn -gvn-hoist -S < %s \| FileCheck %s			; RUN: opt -gvn-hoist -newgvn -gvn-hoist -gvn-hoist-check-profitability=false -S < %s \| FileCheck %s

	; Check that recursive GEPs are hoisted. Since hoisting creates			; Check that recursive GEPs are hoisted. Since hoisting creates
	; fully redundant instructions, newgvn is run to remove them which then			; fully redundant instructions, newgvn is run to remove them which then
	; creates more opportunites for hoisting.			; creates more opportunites for hoisting.

	; CHECK-LABEL: @fun			; CHECK-LABEL: @fun
	; CHECK: load			; CHECK: load
	; CHECK: fdiv			; CHECK: fdiv
	▲ Show 20 Lines • Show All 97 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/hoist-very-busy.ll

	Show First 20 Lines • Show All 47 Lines • ▼ Show 20 Lines
	sw0:			sw0:
	store i32 1, i32* @G			store i32 1, i32* @G
	unreachable			unreachable

	sw1:			sw1:
	store i32 1, i32* @G			store i32 1, i32* @G
	ret void			ret void
	}			}


				; Check that the call and fcmp are hoisted.
				; CHECK-LABEL: define void @bun(
				; CHECK: store
				; CHECK-NOT: store

				define void @bun() {
				entry:
				br label %if.then

				if.then: ; preds = %entry
				br i1 undef, label %sw0, label %sw1

				sw0:
				store i32 1, i32* @G
				unreachable

				sw1:
				store i32 1, i32* @G
				ret void
				}

llvm/test/Transforms/GVNHoist/hoist.ll

	; RUN: opt -gvn-hoist -S < %s \| FileCheck %s			; RUN: opt -gvn-hoist -gvn-hoist-check-profitability=false -S < %s \| FileCheck %s
	target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"			target datalayout = "e-m:e-i64:64-f80:128-n8:16:32:64-S128"
	target triple = "x86_64-unknown-linux-gnu"			target triple = "x86_64-unknown-linux-gnu"

	@GlobalVar = internal global float 1.000000e+00			@GlobalVar = internal global float 1.000000e+00

	; Check that all scalar expressions are hoisted.			; Check that all scalar expressions are hoisted.
	;			;
	; CHECK-LABEL: @scalarsHoisting			; CHECK-LABEL: @scalarsHoisting
	▲ Show 20 Lines • Show All 637 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/infinite-loop-direct.ll

	; RUN: opt -S -gvn-hoist < %s \| FileCheck %s			; RUN: opt -S -gvn-hoist -gvn-hoist-check-profitability=false < %s \| FileCheck %s

	; Checking gvn-hoist in case of infinite loops and irreducible control flow.			; Checking gvn-hoist in case of infinite loops and irreducible control flow.

	; Check that bitcast is not hoisted beacuse down safety is not guaranteed.			; Check that bitcast is not hoisted beacuse down safety is not guaranteed.
	; CHECK-LABEL: @bazv1			; CHECK-LABEL: @bazv1
	; CHECK: if.then.i:			; CHECK: if.then.i:
	; CHECK: bitcast			; CHECK: bitcast
	; CHECK-NEXT: load			; CHECK-NEXT: load
	▲ Show 20 Lines • Show All 87 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/infinite-loop-indirect.ll

	; RUN: opt -S -gvn-hoist < %s \| FileCheck %s			; RUN: opt -S -gvn-hoist -gvn-hoist-check-profitability=false < %s \| FileCheck %s

	; Checking gvn-hoist in case of indirect branches.			; Checking gvn-hoist in case of indirect branches.

	; Check that the bitcast is not hoisted because it is after an indirect call			; Check that the bitcast is not hoisted because it is after an indirect call
	; CHECK-LABEL: @foo			; CHECK-LABEL: @foo
	; CHECK-LABEL: l1.preheader:			; CHECK-LABEL: l1.preheader:
	; CHECK-NEXT: bitcast			; CHECK-NEXT: bitcast
	; CHECK-LABEL: l1			; CHECK-LABEL: l1
	▲ Show 20 Lines • Show All 276 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/int_sideeffect.ll

	; RUN: opt -S < %s -gvn-hoist \| FileCheck %s			; RUN: opt -S < %s -gvn-hoist -gvn-hoist-check-profitability=false \| FileCheck %s

	declare void @llvm.sideeffect()			declare void @llvm.sideeffect()

	; GVN hoisting across a @llvm.sideeffect.			; GVN hoisting across a @llvm.sideeffect.

	; CHECK-LABEL: scalarsHoisting			; CHECK-LABEL: scalarsHoisting
	; CHECK: = fsub			; CHECK: = fsub
	; CHECK: br i1 %cmp,			; CHECK: br i1 %cmp,
	Show All 21 Lines

llvm/test/Transforms/GVNHoist/pr37445.ll

	; RUN: opt < %s -early-cse-memssa -gvn-hoist -S \| FileCheck %s			; RUN: opt < %s -early-cse-memssa -gvn-hoist -gvn-hoist-check-profitability=false -S \| FileCheck %s

	; Make sure opt won't crash and that this pair of			; Make sure opt won't crash and that this pair of
	; instructions (load, icmp) is hoisted successfully			; instructions (load, icmp) is hoisted successfully
	; from bb45 and bb58 to bb41.			; from bb45 and bb58 to bb41.

	@g_10 = external global i32, align 4			@g_10 = external global i32, align 4
	@g_536 = external global i8*, align 8			@g_536 = external global i8*, align 8
	@g_1629 = external global i32**, align 8			@g_1629 = external global i32**, align 8
	▲ Show 20 Lines • Show All 110 Lines • Show Last 20 Lines

llvm/test/Transforms/GVNHoist/sink-load-dom-numbering.ll

This file was added.

				; RUN: opt -S -gvn-hoist < %s \| FileCheck %s

				; CHECK: load
				; CHECK-NOT: load

				%struct.blam = type { %struct.wibble }
				%struct.wibble = type { i8*, i32 }

				declare void @hoge()

				define void @eggs() {
				bb:
				br label %bb1

				bb1: ; preds = %bb
				br label %bb2

				bb2: ; preds = %bb1
				br i1 undef, label %bb3, label %bb4

				bb3: ; preds = %bb2
				unreachable

				bb4: ; preds = %bb2
				br label %bb5

				bb5: ; preds = %bb4
				br i1 undef, label %bb6, label %bb9

				bb6: ; preds = %bb5
				br i1 undef, label %bb8, label %bb7

				bb7: ; preds = %bb6
				call void @hoge()
				br label %bb8

				bb8: ; preds = %bb7, %bb6
				%tmp = load %struct.blam, %struct.blam* undef, align 8, !tbaa !1
				br label %bb11

				bb9: ; preds = %bb5
				%tmp10 = load %struct.blam, %struct.blam* undef, align 8, !tbaa !1
				br label %bb11

				bb11: ; preds = %bb9, %bb8
				%tmp12 = phi %struct.blam* [ %tmp, %bb8 ], [ %tmp10, %bb9 ]
				unreachable
				}

				!llvm.ident = !{!0}

				!0 = !{!"clang version 5.0.0 "}
				!1 = !{!2, !3, i64 8}
				!2 = !{!"tree_common", !3, i64 0, !3, i64 8, !6, i64 16, !6, i64 17, !6, i64 17, !6, i64 17, !6, i64 17, !6, i64 17, !6, i64 17, !6, i64 17, !6, i64 17, !6, i64 18, !6, i64 18, !6, i64 18, !6, i64 18, !6, i64 18, !6, i64 18, !6, i64 18, !6, i64 18, !6, i64 19, !6, i64 19, !6, i64 19, !6, i64 19, !6, i64 19, !6, i64 19, !6, i64 19, !6, i64 19}
				!3 = !{!"any pointer", !4, i64 0}
				!4 = !{!"omnipotent char", !5, i64 0}
				!5 = !{!"Simple C/C++ TBAA"}
				!6 = !{!"int", !4, i64 0}

llvm/test/Transforms/GVNHoist/sink-load-with-memory-phi-1.ll

This file was added.

				; RUN: opt -S -gvn-hoist < %s \| FileCheck %s

				; Check that load is sunk to its successor.
				; CHECK: load
				; CHECK-NOT: load

				@reg_last_set_invalid = external global i8*, align 8

				; Function Attrs: nounwind uwtable
				define void @record_value_for_reg() {
				entry:
				br label %for.cond

				for.cond: ; preds = %entry
				br label %for.cond139

				for.cond139: ; preds = %if.end156, %for.cond
				br i1 undef, label %for.body142, label %for.end159

				for.body142: ; preds = %for.cond139
				store i32 undef, i32* undef, align 4, !tbaa !1
				br i1 undef, label %land.lhs.true146, label %if.else

				land.lhs.true146: ; preds = %for.body142
				br i1 undef, label %if.then151, label %if.else

				if.then151: ; preds = %land.lhs.true146
				%0 = load i8, i8* @reg_last_set_invalid, align 8, !tbaa !5
				br label %if.end156

				if.else: ; preds = %land.lhs.true146, %for.body142
				%1 = load i8, i8* @reg_last_set_invalid, align 8, !tbaa !5
				br label %if.end156

				if.end156: ; preds = %if.else, %if.then151
				%.sink5 = phi i8* [ %1, %if.else ], [ %0, %if.then151 ]
				br label %for.cond139

				for.end159: ; preds = %for.cond139
				br i1 undef, label %land.lhs.true161, label %if.end174

				land.lhs.true161: ; preds = %for.end159
				br label %if.end174

				if.end174: ; preds = %land.lhs.true161, %for.end159
				unreachable
				}

				!llvm.ident = !{!0}

				!0 = !{!"clang version 5.0.0 "}
				!1 = !{!2, !2, i64 0}
				!2 = !{!"int", !3, i64 0}
				!3 = !{!"omnipotent char", !4, i64 0}
				!4 = !{!"Simple C/C++ TBAA"}
				!5 = !{!6, !6, i64 0}
				!6 = !{!"any pointer", !3, i64 0}

llvm/test/Transforms/GVNHoist/sink-load-with-memory-phi-2.ll

This file was added.

				; RUN: opt -S -gvn-hoist < %s \| FileCheck %s

				; CHECK: load
				; CHECK-NOT: load
				@reg_last_set_invalid = external global i8*, align 8

				; Function Attrs: nounwind uwtable
				define void @record_value_for_reg() {
				entry:
				br label %for.cond

				for.cond: ; preds = %entry
				br label %for.cond139

				for.cond139: ; preds = %if.end156, %for.cond
				br label %for.body142

				for.body142: ; preds = %for.cond139
				br i1 undef, label %land.lhs.true146, label %if.else

				land.lhs.true146: ; preds = %for.body142
				br i1 undef, label %if.then151, label %if.else

				if.then151: ; preds = %land.lhs.true146
				%0 = load i8, i8* @reg_last_set_invalid, align 8, !tbaa !1
				br label %if.end156

				if.else: ; preds = %land.lhs.true146, %for.body142
				%1 = load i8, i8* @reg_last_set_invalid, align 8, !tbaa !1
				br label %if.end156

				if.end156: ; preds = %if.else, %if.then151
				%.sink5 = phi i8* [ %1, %if.else ], [ %0, %if.then151 ]
				store i8 undef, i8* undef, align 1, !tbaa !5
				br label %for.cond139
				}

				!llvm.ident = !{!0}

				!0 = !{!"clang version 5.0.0 "}
				!1 = !{!2, !2, i64 0}
				!2 = !{!"any pointer", !3, i64 0}
				!3 = !{!"omnipotent char", !4, i64 0}
				!4 = !{!"Simple C/C++ TBAA"}
				!5 = !{!3, !3, i64 0}

llvm/test/Transforms/GVNHoist/sink-mul-with-liveout.ll

This file was added.

				; RUN: opt -S -gvn-hoist < %s \| FileCheck %s

				; Check that a mul is sunk to last basic block
				; CHECK-LABEL: @foo
				; CHECK: if.then:
				; CHECK: mul13
				; CHECK-LABEL: if.else:
				; CHECK: mul21
				; CHECK-LABEL: if.end:
				; CHECK buf.sroa.14 = phi i32 [ undef, %if.then ], [ %x, %if.else ]
				; CHECK buf.sroa.16 = phi i32 [ 1, %if.then ], [ %mul21, %if.else ]
				; CHECK: phi i32 [ %mul13, %if.then ], [ %mul21, %if.else ]
				; CHECK: mul nsw i32 {{.*}}, %z

				define void @foo(i32 %x, i32 %y, i32 %z, i32 %w, i1 %interleaved) {
				entry:
				br i1 %interleaved, label %if.then, label %if.else

				if.then: ; preds = %entry
				%mul13 = mul nsw i32 %x, %y
				%mul14 = mul nsw i32 %mul13, %z
				br label %if.end

				if.else: ; preds = %entry
				%mul21 = mul nsw i32 %x, %y
				%mul25 = mul nsw i32 %mul21, %z
				br label %if.end

				if.end: ; preds = %if.else, %if.then
				%buf.sroa.14 = phi i32 [ undef, %if.then ], [ %x, %if.else ]
				%buf.sroa.16 = phi i32 [ 1, %if.then ], [ %mul21, %if.else ]
				%buf.sroa.18 = phi i32 [ %mul14, %if.then ], [ %mul25, %if.else ]
				%conv39 = sext i32 %z to i64
				ret void
				}

This is an archive of the discontinued LLVM Phabricator instance.

Global code motion of congruent computationsNeeds ReviewPublic

Details

Diff Detail

Event Timeline

Revision Contents

Diff 227372

llvm/include/llvm/Analysis/MemorySSA.h

llvm/lib/Transforms/IPO/PassManagerBuilder.cpp

llvm/lib/Transforms/Scalar/GVNHoist.cpp

llvm/test/Transforms/GVNHoist/broken-mod-sink.ll

llvm/test/Transforms/GVNHoist/dj-edge-detect-das-taco.ll

llvm/test/Transforms/GVNHoist/dj-edge-detect.ll

llvm/test/Transforms/GVNHoist/gvn-sink-test-1.ll

llvm/test/Transforms/GVNHoist/hoist-convergent.ll

llvm/test/Transforms/GVNHoist/hoist-md.ll

llvm/test/Transforms/GVNHoist/hoist-mssa.ll

llvm/test/Transforms/GVNHoist/hoist-pr20242.ll

llvm/test/Transforms/GVNHoist/hoist-pr22005.ll

llvm/test/Transforms/GVNHoist/hoist-recursive-geps.ll

llvm/test/Transforms/GVNHoist/hoist-very-busy.ll

llvm/test/Transforms/GVNHoist/hoist.ll

llvm/test/Transforms/GVNHoist/infinite-loop-direct.ll

llvm/test/Transforms/GVNHoist/infinite-loop-indirect.ll

llvm/test/Transforms/GVNHoist/int_sideeffect.ll

llvm/test/Transforms/GVNHoist/pr37445.ll

llvm/test/Transforms/GVNHoist/sink-load-dom-numbering.ll

llvm/test/Transforms/GVNHoist/sink-load-with-memory-phi-1.ll

llvm/test/Transforms/GVNHoist/sink-load-with-memory-phi-2.ll

llvm/test/Transforms/GVNHoist/sink-mul-with-liveout.ll

Global code motion of congruent computations
Needs ReviewPublic