This is an archive of the discontinued LLVM Phabricator instance.

Differential D94264

[GlobalISel] Add MachineInstNumbering to CSEInfo and propagate CSE throughout AArch64 pipeline.
Needs Revision · Public

Authored by aemerson on Jan 7 2021, 1:52 PM.

Details

Reviewers

aditya_nandakumar
qcolombet
paquette
volkan
arsenm
dsanders
foad

Summary

MachineInstNumbering is a wrapper around SlotIndexes, which exposes an API to allow fast dominance checks in the CSEMIRBuilder, and to speed up intra-block instruction sinking in the Localizer.

The numbering is stored within the CSEInfo analysis. This change also sets the AArch64 and generic GlobalISel passes to preserve the CSEInfo, and therefore use the CSEMIRBuilder within the RegBankSelect and other combiner passes.

Overall this seems to impact compile time negatively by about 0.5% on -O0 and -Os, but it prevents us from having unreasonably long compile times in the worst cases.
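To make the idea in the summary concrete, here is a minimal, self-contained sketch of lazy instruction numbering. It is not the patch's code: `LazyNumbering`, the use of `int` as a stand-in for `MachineInstr`, and the whole-block renumbering on flush are all assumptions made for illustration, whereas the real class delegates to SlotIndexes.

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <unordered_map>
#include <vector>

// Hypothetical model: a "block" is a std::list<int>, each int is one
// instruction. Insertions are only queued; numbering is repaired lazily
// on the next ordering query.
class LazyNumbering {
  const std::list<int> &Block;
  std::unordered_map<int, unsigned> Index; // instruction -> position number
  std::vector<int> InsertionQueue;         // instructions not yet numbered

  // Naive flush: renumber the whole block if anything is queued.
  void flushAndRenumber() {
    if (InsertionQueue.empty())
      return;
    Index.clear();
    unsigned Next = 0;
    for (int Inst : Block)
      Index[Inst] = Next++;
    InsertionQueue.clear();
  }

public:
  explicit LazyNumbering(const std::list<int> &B) : Block(B) {
    unsigned Next = 0;
    for (int Inst : Block)
      Index[Inst] = Next++;
  }

  // Called when an instruction is created; it may not be numbered yet.
  void queueInsertion(int Inst) { InsertionQueue.push_back(Inst); }

  // Ordering query: a flush (if needed) followed by an O(1) comparison,
  // instead of walking the block's linked list.
  bool isEarlier(int A, int B) {
    flushAndRenumber();
    return Index.at(A) < Index.at(B);
  }
};
```

The point of the sketch is only the shape of the API: cheap queuing on insertion, and a comparison of numbers on query. How expensive the flush is for very large blocks is exactly what the later updates in this review address.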

Diff Detail

Repository: rG LLVM Github Monorepo

Event Timeline

aemerson created this revision. Jan 7 2021, 1:52 PM

Herald added subscribers: nikic, danielkiss, hiraditya and 2 others. Jan 7 2021, 1:52 PM

aemerson requested review of this revision. Jan 7 2021, 1:52 PM

Herald added a subscriber: wdng. Jan 7 2021, 1:52 PM

Harbormaster completed remote builds in B84385: Diff 315232. Jan 7 2021, 2:41 PM

Improve the performance of the insertion queue flushing.

For huge basic blocks, this was hanging in the renumbering. This new version now tries not to flush on removal of an instruction.

The second issue was that in those huge blocks, any time there were multiple insertions, we'd end up renumbering the entire block. For such blocks, this was way too slow. I've added a more intelligent flushing algorithm that instead searches up from each instruction in the block's queue until it finds an instruction that allows us to use the SlotIndexes to do a fast comparison of ordering. This gives us the begin iterator for a numbering repair. Likewise, we walk down from each queued instruction until we find the post dominator instruction that gives us an "end" iterator.
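The range search described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code from the diff: `InstList`, `findRepairRange`, and the `int` stand-in for instructions are hypothetical, and "not queued" is used as the condition for an instruction that already has a usable number.

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <set>
#include <utility>

using InstList = std::list<int>; // hypothetical stand-in for a basic block

// Starting from one queued instruction, walk up to the nearest instruction
// that is not queued (or the block start) and walk down to the nearest later
// instruction that is not queued (or the block end). Only [Begin, End) then
// needs its numbering repaired, rather than the whole block.
std::pair<InstList::iterator, InstList::iterator>
findRepairRange(InstList &B, InstList::iterator Queued,
                const std::set<int> &InQueue) {
  InstList::iterator Begin = Queued;
  while (Begin != B.begin() && InQueue.count(*Begin))
    --Begin;

  InstList::iterator End = std::next(Queued);
  while (End != B.end() && InQueue.count(*End))
    ++End;

  return {Begin, End};
}
```

For a block `1 2 3 4 5 6` with `3` and `4` queued, the sketch returns the range from `2` up to (but not including) `5`, so the cost of a repair scales with the run of queued instructions instead of the block size.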

These two improvements don't really help on CTMark, they only help on very large pathological blocks.

I think it may be worth exploring using some heuristics early in the pipeline like IRTranslation to detect huge blocks, and switch to using this numbering analysis. Otherwise, we can continue to use the simple linked list block walk, which is slightly faster in the common cases.

Harbormaster completed remote builds in B84977: Diff 316325. Jan 12 2021, 11:25 PM

Fix clang-format warnings.

Harbormaster completed remote builds in B85037: Diff 316428. Jan 13 2021, 10:13 AM

How big is a huge block?

In D94264#2549325, @paquette wrote:

How big is a huge block?

On the order of millions of instructions.

aemerson mentioned this in D93423: [GlobalISel] Use slot indexes to speed up huge block compile time. Feb 21 2021, 10:10 PM

paquette added inline comments. Feb 22 2021, 10:01 AM

llvm/include/llvm/CodeGen/GlobalISel/CSEInfo.h
109	Maybe good to note that this also removes `MI` from the queue if it is present?
llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
372	Why is this necessary?
llvm/lib/CodeGen/GlobalISel/CSEInfo.cpp
215	Does this bug have a FIXME or something somewhere?
356	I think you can remove the check for `CurrInst->isDebugInstr()` if you use `skipDebugInstructionsBackward`.
380	This function and `findEarlierNonQueuedInst` are the same aside from the direction they iterate in. Is it possible to refactor them somehow so they can share more code? `findEarlierNonQueuedInst` looks like it's the same loop if you use a `reverse_iterator`.
380	I think you can remove the check for `CurrInst->isDebugInstr()` if you use `skipDebugInstructionsForward`.
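The refactoring suggested in the comment on line 380 could look roughly like this. It is a sketch with hypothetical names (`findNonQueued`, `int` as the instruction type, `std::set<int>` as the queue), and it leaves out the debug-instruction handling the real functions perform; it only shows how one loop can serve both directions when the caller passes either forward or reverse iterators.

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <set>

// One loop for both directions: return the first position in [Start, End)
// whose instruction is not in the queue, or End if there is none.
template <typename Iter>
Iter findNonQueued(Iter Start, Iter End, const std::set<int> &Queued) {
  for (Iter I = Start; I != End; ++I)
    if (!Queued.count(*I))
      return I;
  return End;
}

// Forward search from the instruction after It:
//   findNonQueued(std::next(It), Block.end(), Queued);
// Backward search from the instruction before It:
//   findNonQueued(std::make_reverse_iterator(It), Block.rend(), Queued);
```

Note that `std::make_reverse_iterator(It)` dereferences to the element before `It`, so the backward search starts at the predecessor without any extra adjustment.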

Could you split large basic blocks into smaller ones?

I think legalizing/combining will do way, way more insertions than would happen in regalloc. Is this losing time to renumberings? Would it make sense to increase the increment size?

In D94264#2579954, @arsenm wrote:

I think legalizing/combining will do way, way more insertions than would happen in regalloc. Is this losing time to renumberings?

On average, this does cost more in renumbering/flushing than it saves. It only provides a net benefit on the pathological cases in legalization & localization.

In D94264#2579954, @arsenm wrote:

Would it make sense to increase the increment size?

What do you mean by increment size?

In D94264#2579294, @tschuett wrote:

Could you split large basic blocks into smaller ones?

The issue does get resolved if we keep the constants entry block separate after IR translation, instead of merging them. However @qcolombet believes that's not the right approach.

In D94264#2579990, @aemerson wrote:

In D94264#2579954, @arsenm wrote:

Would it make sense to increase the increment size?

What do you mean by increment size?

The instruction index values initially increment by 16 so you only renumber for every 15 or so insertions between instructions:

enum {
  /// The default distance between instructions as returned by distance().
  /// This may vary as instructions are inserted and removed.
  InstrDist = 4 * Slot_Count
};
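To illustrate why the increment size matters, here is a toy model of gap-based numbering. Everything in it is an assumption for illustration: `GapNumbering`, the midpoint placement policy, and the full renumbering on exhaustion are hypothetical and are not claimed to match what SlotIndexes does internally.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical model: instructions get numbers Stride apart. A new
// instruction takes the midpoint between its neighbours; when no free
// number is left between them, the whole block is renumbered.
struct GapNumbering {
  unsigned Stride;
  std::vector<unsigned> Index; // one number per instruction, in block order
  unsigned Renumberings = 0;

  GapNumbering(std::size_t NumInsts, unsigned Stride) : Stride(Stride) {
    Index.resize(NumInsts);
    assignAll();
  }

  void assignAll() {
    for (std::size_t I = 0; I < Index.size(); ++I)
      Index[I] = static_cast<unsigned>(I) * Stride;
  }

  // Insert a new instruction directly after position Pos.
  void insertAfter(std::size_t Pos) {
    assert(Pos < Index.size());
    unsigned Prev = Index[Pos];
    if (Pos + 1 == Index.size()) { // appending never needs a renumbering
      Index.push_back(Prev + Stride);
      return;
    }
    unsigned Gap = Index[Pos + 1] - Prev;
    Index.insert(Index.begin() + Pos + 1, Prev + Gap / 2);
    if (Gap < 2) { // no free number between the neighbours
      assignAll();
      ++Renumberings;
    }
  }
};

// Worst case for this model: keep inserting at the same spot.
unsigned renumberingsFor(unsigned Stride, unsigned NumInsertions) {
  GapNumbering N(2, Stride);
  for (unsigned I = 0; I < NumInsertions; ++I)
    N.insertAfter(0);
  return N.Renumberings;
}
```

Under these assumptions, 35 insertions at the same position trigger 7 renumberings with a stride of 16 and 5 with a stride of 64, so a larger increment reduces how often renumbering happens but does not remove it.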

In D94264#2580058, @arsenm wrote:
The instruction index values initially increment by 16 so you only renumber for every 15 or so insertions between instructions:
enum {
  /// The default distance between instructions as returned by distance().
  /// This may vary as instructions are inserted and removed.
  InstrDist = 4 * Slot_Count
};

Ok, I'll try that and see what the effect is.

llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h
372	Must be vestigial.
llvm/lib/CodeGen/GlobalISel/CSEInfo.cpp
215	I think Matt fixed this by removing the observer from the MIRBuilder a while ago, which landed after I wrote this.
380	I'll see what I can do here. I did try to factor things out originally but ran into a few issues.

jroelofs added a subscriber: jroelofs. Jun 1 2021, 8:31 AM

jroelofs added inline comments.

llvm/include/llvm/CodeGen/GlobalISel/CSEInfo.h
70	It would eliminate the explicit dtor and simplify the logic in `init()` if this were `std::unique_ptr<SlotIndexes> SI;`
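A rough sketch of what that suggestion amounts to, using a hypothetical `FakeSlotIndexes` in place of the real analysis so the example is self-contained. The class and member names here are illustrative assumptions, not the patch's code.

```cpp
#include <cassert>
#include <memory>

// Hypothetical stand-in for SlotIndexes, just enough to show ownership.
struct FakeSlotIndexes {
  int Runs = 0;
  void releaseMemory() {}
  void run() { ++Runs; }
};

class Numbering {
  // Owning pointer: no user-written destructor or manual delete is needed.
  std::unique_ptr<FakeSlotIndexes> SI;

public:
  void init() {
    if (SI)
      SI->releaseMemory();
    else
      SI = std::make_unique<FakeSlotIndexes>();
    SI->run(); // single call site instead of one per branch
  }

  void disable() { SI.reset(); } // replaces delete + assignment to nullptr
  bool isEnabled() const { return SI != nullptr; }
  int runs() const { return SI ? SI->Runs : 0; }
};
```

With the owning pointer, the compiler-generated destructor releases the object, and the class stays non-copyable automatically because `std::unique_ptr` is move-only.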

Is this still relevant? I thought this was done already

Thought there was a previously committed solution for this

This revision now requires changes to proceed. Aug 17 2023, 4:04 PM

Herald added a project: Restricted Project. Aug 17 2023, 4:04 PM

Herald added a subscriber: StephenFan.

Revision Contents

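Path                                                                          Size
llvm/include/llvm/CodeGen/GlobalISel/CSEInfo.h                                80 lines
llvm/include/llvm/CodeGen/GlobalISel/CSEMIRBuilder.h                          2 lines
llvm/include/llvm/CodeGen/GlobalISel/Legalizer.h                              1 line
llvm/include/llvm/CodeGen/GlobalISel/Localizer.h                              3 lines
llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h                       4 lines
llvm/include/llvm/CodeGen/GlobalISel/RegBankSelect.h                          2 lines
llvm/lib/CodeGen/GlobalISel/CSEInfo.cpp                                       314 lines
llvm/lib/CodeGen/GlobalISel/CSEMIRBuilder.cpp                                 17 lines
llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp                                  3 lines
llvm/lib/CodeGen/GlobalISel/Legalizer.cpp                                     2 lines
llvm/lib/CodeGen/GlobalISel/Localizer.cpp                                     51 lines
llvm/lib/CodeGen/GlobalISel/RegBankSelect.cpp                                 36 lines
llvm/lib/Target/AArch64/GISel/AArch64PostLegalizerCombiner.cpp                15 lines
llvm/lib/Target/AArch64/GISel/AArch64PostLegalizerLowering.cpp                15 lines
llvm/lib/Target/AArch64/GISel/AArch64PreLegalizerCombiner.cpp                 16 lines
llvm/test/CodeGen/AArch64/GlobalISel/combine-shift-immed-mismatch-crash.mir   8 lines
llvm/test/CodeGen/AArch64/GlobalISel/gisel-commandline-option.ll              2 lines
llvm/test/CodeGen/AArch64/O0-pipeline.ll                                      2 lines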

Diff 316428

llvm/include/llvm/CodeGen/GlobalISel/CSEInfo.h

 //===- llvm/CodeGen/GlobalISel/CSEInfo.h ------------------*- C++ -*-===//
 //
 // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
 // See https://llvm.org/LICENSE.txt for license information.
 // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
 //
 //===----------------------------------------------------------------------===//
 //
 /// Provides analysis for continuously CSEing during GISel passes.
 //
 //===----------------------------------------------------------------------===//
 #ifndef LLVM_CODEGEN_GLOBALISEL_CSEINFO_H
 #define LLVM_CODEGEN_GLOBALISEL_CSEINFO_H

 #include "llvm/ADT/FoldingSet.h"
+#include "llvm/ADT/SmallPtrSet.h"
+#include "llvm/ADT/SmallVector.h"
+#include "llvm/ADT/iterator_range.h"
 #include "llvm/CodeGen/CSEConfigBase.h"
 #include "llvm/CodeGen/GlobalISel/GISelChangeObserver.h"
 #include "llvm/CodeGen/GlobalISel/GISelWorkList.h"
 #include "llvm/CodeGen/MachineFunctionPass.h"
+#include "llvm/CodeGen/MachineMemOperand.h"
+#include "llvm/CodeGen/SlotIndexes.h"
 #include "llvm/Support/Allocator.h"
 #include "llvm/Support/CodeGen.h"

 namespace llvm {
 class MachineBasicBlock;

 /// A class that wraps MachineInstrs and derives from FoldingSetNode in order to
 /// be uniqued in a CSEMap. The tradeoff here is extra memory allocations for
@@ 23 lines not shown @@

 // Returns the standard expected CSEConfig for the given optimization level.
 // We have this logic here so targets can make use of it from their derived
 // TargetPassConfig, but can't put this logic into TargetPassConfig directly
 // because the CodeGen library can't depend on GlobalISel.
 std::unique_ptr<CSEConfigBase>
 getStandardCSEConfigForOpt(CodeGenOpt::Level Level);

+/// A wrapper for SlotIndexes that holds yet-to-be inserted instructions.
+/// This is used so we can lazily renumber the indices when the next
+/// query is made. The created() observer callback actually executes
+/// before the MachineInstr is inserted, but SlotIndexes needs to have
+/// the instruction fully inserted before it starts renumbering.
+class MachineInstNumbering {
+  SlotIndexes *SI = nullptr;
jroelofs (inline comment, Not Done): It would eliminate the explicit dtor and simplify the logic in `init()` if this were `std::unique_ptr<SlotIndexes> SI;`
+  MachineFunction *MF = nullptr;
+  bool IsEnabled = true;
+  /// Contains all instructions that need renumbering.
+  SmallVector<MachineInstr *> InsertionQueue;
+
+  /// Extract instructions from the queue that have the same parent block,
+  /// and null them out in the queue.
+  void
+  extractInstsFromSameBlock(SmallPtrSetImpl<MachineInstr *> &ExtractedInsts);
+  /// Find the nearest predecessor instruction that's not queued.
+  MachineBasicBlock::iterator
+  findEarlierNonQueuedInst(MachineBasicBlock::iterator Start,
+                           SmallPtrSetImpl<MachineInstr *> &InstQueue);
+  /// Find the nearest successor instruction that's not queued.
+  MachineBasicBlock::iterator
+  findLaterNonQueuedInst(MachineBasicBlock::iterator Start,
+                         SmallPtrSetImpl<MachineInstr *> &InstQueue);
+
+  /// Returns true if the instruction \p A is earlier than the SlotIndex \p BIdx
+  /// in the block using the instruction numbering. If \p NoFlush is true, then
+  /// do not flush any queued insertions before performing the query. This can
+  /// be used for efficiency reasons if both \A and \B are known to be already
+  /// in the SlotIndexes.
+  bool isEarlier(const MachineInstr &A, const SlotIndex &BIdx,
+                 bool NoFlush = false);
+
+public:
+  MachineInstNumbering() = default;
+  void init(MachineFunction &MF);
+
+  void setMF(MachineFunction &MF) { this->MF = &MF; }
+  bool isEnabled() const { return SI != nullptr; }
+  /// Enable numbering of the given MF, unless the MF has already been analyzed.
+  void enable(MachineFunction *MF = nullptr);
+  void disable();
+
+  /// Flush the queue and start a renumbering for the regions that need it.
+  void flushAndRenumber();
+  /// Remove an instruction from SlotIndexes.
paquette (inline comment, Not Done): Maybe good to note that this also removes `MI` from the queue if it is present?
+  void removeInstruction(MachineInstr &MI);
+  /// Queue an instruction to be inserted and trigger renumbering
+  /// on the next query event.
+  void queueInsertion(MachineInstr &MI);
+  /// Repair numbering in the given range.
+  void repairNumberingInRange(MachineBasicBlock::iterator Begin,
+                              MachineBasicBlock::iterator End);
+  /// Returns true if \p A is earlier than \p B in the block using the
+  /// instruction numbering.
+  bool isEarlier(const MachineInstr &A, const MachineInstr &B);
+
+  /// Returns the earliest MachineInstr in the given set \p Instrs.
+  /// If \p NoFlush is true, then do not flush any queued insertions before
+  /// performing the query. This can be used for efficiency reasons if both
+  /// \A and \B are known to be already in the SlotIndexes.
+  MachineInstr *getEarliestInstr(const SmallPtrSetImpl<MachineInstr *> &Instrs,
+                                 bool NoFlush = false);
+
+  /// Deleted copy ctor and assignment operator to prevent copying these.
+  MachineInstNumbering(const MachineInstNumbering &_) = delete;
+  MachineInstNumbering &operator=(const MachineInstNumbering &_) = delete;
+
+  ~MachineInstNumbering();
+};
+
 /// The CSE Analysis object.
 /// This installs itself as a delegate to the MachineFunction to track
 /// new instructions as well as deletions. It however will not be able to
 /// track instruction mutations. In such cases, recordNewInstruction should be
 /// called (for eg inside MachineIRBuilder::recordInsertion).
 /// Also because of how just the instruction can be inserted without adding any
 /// operands to the instruction, instructions are uniqued and inserted lazily.
 /// CSEInfo should assert when trying to enter an incomplete instruction into
 /// the CSEMap. There is Opcode level granularity on which instructions can be
 /// CSE'd and for now, only Generic instructions are CSEable.
 class GISelCSEInfo : public GISelChangeObserver {
   // Make it accessible only to CSEMIRBuilder.
   friend class CSEMIRBuilder;

   BumpPtrAllocator UniqueInstrAllocator;
   FoldingSet<UniqueMachineInstr> CSEMap;
   MachineRegisterInfo *MRI = nullptr;
   MachineFunction *MF = nullptr;
+  MachineInstNumbering MINumbering;
   std::unique_ptr<CSEConfigBase> CSEOpt;
   /// Keep a cache of UniqueInstrs for each MachineInstr. In GISel,
   /// often instructions are mutated (while their ID has completely changed).
   /// Whenever mutation happens, invalidate the UniqueMachineInstr for the
   /// MachineInstr
   DenseMap<const MachineInstr *, UniqueMachineInstr *> InstrMapping;

   /// Store instructions that are not fully formed in TemporaryInsts.
@@ 30 lines not shown @@ public:
   GISelCSEInfo() = default;

   virtual ~GISelCSEInfo();

   void setMF(MachineFunction &MF);

   Error verify();

+  MachineInstNumbering &getInstNumbering() { return MINumbering; }
+  const MachineInstNumbering &getInstNumbering() const { return MINumbering; }
+
   /// Records a newly created inst in a list and lazily insert it to the CSEMap.
   /// Sometimes, this method might be called with a partially constructed
   /// MachineInstr,
   // (right after BuildMI without adding any operands) - and in such cases,
   // defer the hashing of the instruction to a later stage.
   void recordNewInstruction(MachineInstr *MI);

   /// Use this callback to inform CSE about a newly fully created instruction.
@@ 112 lines not shown @@

llvm/include/llvm/CodeGen/GlobalISel/CSEMIRBuilder.h

@@ 35 lines not shown @@ class CSEMIRBuilder : public MachineIRBuilder {
   //
   // TODO: Another approach for checking dominance is having two iterators and
   // making them go towards each other until they meet or reach begin/end. Which
   // approach is better? Should this even change dynamically? For G_CONSTANTS
   // most of which will be at the top of the BB, the top down approach would be
   // a better choice. Does IRTranslator placing constants at the beginning still
   // make sense? Should this change based on Opcode?
   bool dominates(MachineBasicBlock::const_iterator A,
-                 MachineBasicBlock::const_iterator B) const;
+                 MachineBasicBlock::const_iterator B);

   /// For given ID, find a machineinstr in the CSE Map. If found, check if it
   /// dominates the current insertion point and if not, move it just before the
   /// current insertion point and return it. If not found, return Null
   /// MachineInstrBuilder.
   MachineInstrBuilder getDominatingInstrForID(FoldingSetNodeID &ID,
                                               void *&NodeInsertPos);
   /// Simple check if we can CSE (we have the CSEInfo) or if this Opcode is
@@ 57 lines not shown @@

llvm/include/llvm/CodeGen/GlobalISel/Legalizer.h

@@ 15 lines not shown @@
 /// instruction.
 //
 //===----------------------------------------------------------------------===//

 #ifndef LLVM_CODEGEN_GLOBALISEL_LEGALIZEMACHINEIRPASS_H
 #define LLVM_CODEGEN_GLOBALISEL_LEGALIZEMACHINEIRPASS_H

 #include "llvm/CodeGen/GlobalISel/MachineIRBuilder.h"
+#include "llvm/CodeGen/MachineBasicBlock.h"
 #include "llvm/CodeGen/MachineFunctionPass.h"

 namespace llvm {

 class MachineRegisterInfo;
 class LostDebugLocObserver;

 class Legalizer : public MachineFunctionPass {
@@ 46 lines not shown @@

llvm/include/llvm/CodeGen/GlobalISel/Localizer.h

@@ 23 lines not shown @@
 #include "llvm/ADT/SetVector.h"
 #include "llvm/CodeGen/GlobalISel/MachineIRBuilder.h"
 #include "llvm/CodeGen/MachineFunctionPass.h"

 namespace llvm {
 // Forward declarations.
 class MachineRegisterInfo;
 class TargetTransformInfo;
+class CSEInfo;

 /// This pass implements the localization mechanism described at the
 /// top of this file. One specificity of the implementation is that
 /// it will materialize one and only one instance of a constant per
 /// basic block, thus enabling reuse of that constant within that block.
 /// Moreover, it only materializes constants in blocks where they
 /// are used. PHI uses are considered happening at the end of the
 /// related predecessor.
 class Localizer : public MachineFunctionPass {
 public:
   static char ID;

 private:
   /// An input function to decide if the pass should run or not
   /// on the given MachineFunction.
   std::function<bool(const MachineFunction &)> DoNotRunPass;

   /// MRI contains all the register class/bank information that this
   /// pass uses and updates.
   MachineRegisterInfo *MRI;
   /// TTI used for getting remat costs for instructions.
   TargetTransformInfo *TTI;
+  /// Used for accessing the instruction numbering to speed up localizing.
+  GISelCSEInfo *CSEInfo = nullptr;

   /// Check if \p MOUse is used in the same basic block as \p Def.
   /// If the use is in the same block, we say it is local.
   /// When the use is not local, \p InsertMBB will contain the basic
   /// block when to insert \p Def to have a local use.
   static bool isLocalUse(MachineOperand &MOUse, const MachineInstr &Def,
                          MachineBasicBlock *&InsertMBB);

@@ 31 lines not shown @@

llvm/include/llvm/CodeGen/GlobalISel/MachineIRBuilder.h

@@ 12 lines not shown @@
 #ifndef LLVM_CODEGEN_GLOBALISEL_MACHINEIRBUILDER_H
 #define LLVM_CODEGEN_GLOBALISEL_MACHINEIRBUILDER_H

 #include "llvm/CodeGen/GlobalISel/CSEInfo.h"
 #include "llvm/CodeGen/LowLevelType.h"
 #include "llvm/CodeGen/MachineBasicBlock.h"
 #include "llvm/CodeGen/MachineInstrBuilder.h"
 #include "llvm/CodeGen/MachineRegisterInfo.h"
+#include "llvm/CodeGen/SlotIndexes.h"
 #include "llvm/CodeGen/TargetOpcodes.h"
 #include "llvm/IR/Constants.h"
 #include "llvm/IR/DebugLoc.h"
 #include "llvm/IR/Module.h"

 namespace llvm {

 // Forward declarations.
@@ 334 lines not shown @@ public:
   /// Build and insert <empty> = \p Opcode <empty>.
   /// The insertion point is the one set by the last call of either
   /// setBasicBlock or setMI.
   ///
   /// \pre setBasicBlock or setMI must have been called.
   ///
   /// \return a MachineInstrBuilder for the newly created instruction.
   MachineInstrBuilder buildInstr(unsigned Opcode) {
-    return insertInstr(buildInstrNoInsert(Opcode));
+    auto MIB = insertInstr(buildInstrNoInsert(Opcode));
paquette (inline comment, Not Done): Why is this necessary?
aemerson (author, Done): Must be vestigial.
+    return MIB;
   }

   /// Build but don't insert <empty> = \p Opcode <empty>.
   ///
   /// \pre setMF, setBasicBlock or setMI must have been called.
   ///
   /// \return a MachineInstrBuilder for the newly created instruction.
   MachineInstrBuilder buildInstrNoInsert(unsigned Opcode);
@@ 1,422 lines not shown @@

llvm/include/llvm/CodeGen/GlobalISel/RegBankSelect.h

@@ 498 lines not shown @@ private:
   /// Get the frequency of the edges.
   /// This is required for non-fast mode.
   MachineBranchProbabilityInfo *MBPI = nullptr;

   /// Current optimization remark emitter. Used to report failures.
   std::unique_ptr<MachineOptimizationRemarkEmitter> MORE;

   /// Helper class used for every code morphing.
-  MachineIRBuilder MIRBuilder;
+  std::unique_ptr<MachineIRBuilder> MIRBuilder;

   /// Optimization mode of the pass.
   Mode OptMode;

   /// Current target configuration. Controls how the pass handles errors.
   const TargetPassConfig *TPC;

   /// Assign the register bank of each operand of \p MI.
@@ 154 lines not shown @@

llvm/lib/CodeGen/GlobalISel/CSEInfo.cpp

//===- CSEInfo.cpp ------------------------------===// //===- CSEInfo.cpp ------------------------------===//

// //

// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions. // Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.

// See https://llvm.org/LICENSE.txt for license information. // See https://llvm.org/LICENSE.txt for license information.

// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception // SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

// //

//===----------------------------------------------------------------------===// //===----------------------------------------------------------------------===//

#include "llvm/CodeGen/GlobalISel/CSEInfo.h" #include "llvm/CodeGen/GlobalISel/CSEInfo.h"

#include "llvm/ADT/STLExtras.h"

#include "llvm/ADT/SmallVector.h"

#include "llvm/CodeGen/MachineBasicBlock.h"

#include "llvm/CodeGen/MachineInstr.h"

#include "llvm/CodeGen/MachineRegisterInfo.h" #include "llvm/CodeGen/MachineRegisterInfo.h"

#include "llvm/CodeGen/SlotIndexes.h"

#include "llvm/InitializePasses.h" #include "llvm/InitializePasses.h"

#define DEBUG_TYPE "cseinfo" #define DEBUG_TYPE "cseinfo"

using namespace llvm; using namespace llvm;

char llvm::GISelCSEAnalysisWrapperPass::ID = 0; char llvm::GISelCSEAnalysisWrapperPass::ID = 0;

GISelCSEAnalysisWrapperPass::GISelCSEAnalysisWrapperPass() GISelCSEAnalysisWrapperPass::GISelCSEAnalysisWrapperPass()

: MachineFunctionPass(ID) { : MachineFunctionPass(ID) {

▲ Show 20 Lines • Show All 147 Lines • ▼ Show 20 Lines #ifndef NDEBUG

if (OpcodeHitTable.count(Opc)) if (OpcodeHitTable.count(Opc))

OpcodeHitTable[Opc] += 1; OpcodeHitTable[Opc] += 1;

else else

OpcodeHitTable[Opc] = 1; OpcodeHitTable[Opc] = 1;

#endif #endif

// Else do nothing. // Else do nothing.

} }

void MachineInstNumbering::init(MachineFunction &MF) {

if (!IsEnabled)

return;

this->MF = &MF;

if (SI) {

SI->releaseMemory();

SI->runOnMachineFunction(MF);

return;

}

SI = new class SlotIndexes();

SI->runOnMachineFunction(MF);

}

void MachineInstNumbering::enable(MachineFunction *MF) {

if (IsEnabled && this->MF == MF)

return;

IsEnabled = true;

if (MF)

init(*MF);

}

void MachineInstNumbering::disable() {

if (SI) {

delete SI;

SI = nullptr;

}

InsertionQueue.clear();

IsEnabled = false;

return;

}

void MachineInstNumbering::removeInstruction(MachineInstr &MI) {

assert(IsEnabled && "Expected Inst Numbering to be enabled");

// Remove the instruction from the queue if it exists. Due to a bug that

// inserts instructions twice due to duplicate observers, we check the entire

paquette (Unsubmitted)

Not Done

Does this bug have a FIXME or something somewhere?

aemerson (Author, Unsubmitted)

Done

I think Matt fixed this by removing the observer from the MIRBuilder a while ago, which landed after I wrote this.

// queue instead of quitting on the first hit.

bool Found = false;

for (unsigned I = 0, E = InsertionQueue.size(); I < E; ++I) {

if (InsertionQueue[I] == &MI) {

InsertionQueue[I] = nullptr;

Found = true;

}

}

if (!Found)

SI->removeSingleMachineInstrFromMaps(MI);

}

void MachineInstNumbering::queueInsertion(MachineInstr &MI) {

assert(IsEnabled && "Expected Inst Numbering to be enabled");

InsertionQueue.emplace_back(&MI);

}

void MachineInstNumbering::repairNumberingInRange(

MachineBasicBlock::iterator Begin, MachineBasicBlock::iterator End) {

assert(IsEnabled && "Expected Inst Numbering to be enabled");

flushAndRenumber();

SI->repairIndexesInRange(Begin->getParent(), Begin, End);

}

void MachineInstNumbering::flushAndRenumber() {

assert(IsEnabled && "Expected Inst Numbering to be enabled");

if (InsertionQueue.empty())

return;

// There may be multiple queued instructions for a given block. Because of

// this we can't trivially find the iterator ranges to do a local index

// repair since the neighbouring instructions may not have valid indexes yet.

// So, we have to find the nearest non-queued predecessor instruction in the

// block that dominates all of the queued instructions.

// If the instruction is not queued, then we should have a valid index, and

// therefore can quickly check if any other non-queued instruction is earlier

// or later.

SmallPtrSet<MachineInstr *, 8> SameBlockSubset;

while (true) {

// First we have to extract all the instructions that share a common basic

// block.

extractInstsFromSameBlock(SameBlockSubset);

if (SameBlockSubset.empty()) {

// We have no more instructions to process.

InsertionQueue.clear();

return;

}

// Now walk up from each instruction position until we hit an unqueued

// instruction, and compare the SlotIndex for that to the current "earliest"

// index.

SlotIndex DominatingInstSlotIdx;

MachineBasicBlock::iterator StartIt;

for (auto *MI : SameBlockSubset) {

auto It = MI->getIterator();

auto TerminalIt = MI->getParent()->begin();

if (It == TerminalIt) {

// This is the first instruction, end search.

StartIt = It;

break;

}

auto NonQueuedIt = findEarlierNonQueuedInst(It, SameBlockSubset);

// There's an edge case where the iterator returned by

// findEarlierNonQueuedInst() is begin(), but is also in the queue.

if (NonQueuedIt == TerminalIt) {

StartIt = NonQueuedIt;

break;

}

SlotIndex NonQueuedIndex = SI->getInstructionIndex(*NonQueuedIt);

if (!DominatingInstSlotIdx.isValid()) {

// This is the first instruction checked.

DominatingInstSlotIdx = NonQueuedIndex;

StartIt = NonQueuedIt;

} else if (SlotIndex::isEarlierInstr(NonQueuedIndex,

DominatingInstSlotIdx)) {

DominatingInstSlotIdx = NonQueuedIndex;

StartIt = NonQueuedIt;

}

// If the current iterator is the first instruction, then end the search

// early.

if (StartIt == TerminalIt)

break;

}

// Now we have the nearest predecessor which we can use as a repair range

// start point. Now do the same thing for the end iterator.

SlotIndex PostDomSlotIdx;

MachineBasicBlock::iterator EndIt;

for (auto *MI : SameBlockSubset) {

auto It = MI->getIterator();

auto TerminalIt = MI->getParent()->end();

if (std::next(It) == TerminalIt) {

EndIt = TerminalIt;

break;

}

auto NonQueuedIt = findLaterNonQueuedInst(It, SameBlockSubset);

// There's an edge case where the iterator returned by

// findLaterNonQueuedInst() is end(), but is also in the queue.

if (NonQueuedIt == TerminalIt) {

EndIt = NonQueuedIt;

break;

}

SlotIndex NonQueuedIndex = SI->getInstructionIndex(*NonQueuedIt);

if (!PostDomSlotIdx.isValid()) {

// This is the first instruction checked.

PostDomSlotIdx = NonQueuedIndex;

EndIt = NonQueuedIt;

} else if (SlotIndex::isEarlierInstr(PostDomSlotIdx, NonQueuedIndex)) {

PostDomSlotIdx = NonQueuedIndex;

EndIt = NonQueuedIt;

}

// If the end iterator is already the end of the block, then end the search

// early.

if (EndIt == TerminalIt)

break;

}

SI->repairIndexesInRange(StartIt->getParent(), StartIt, EndIt);

SameBlockSubset.clear();

}

}

MachineBasicBlock::iterator MachineInstNumbering::findEarlierNonQueuedInst(

MachineBasicBlock::iterator Start,

SmallPtrSetImpl<MachineInstr *> &InstQueue) {

MachineBasicBlock *ParentBB = Start->getParent();

assert(Start != ParentBB->begin() && "Iterator should not be the start");

MachineBasicBlock::iterator II = Start;

do {

--II;

paquette (Unsubmitted)

Not Done

do {

- --II;

+ skipDebugInstructionsBackward(II, ParentBB->begin());

MachineInstr *CurrInst = &*II;

I think you can remove the check for CurrInst->isDebugInstr() if you use skipDebugInstructionsBackward.
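
The suggestion above could look roughly like the following. This is a standalone sketch, not LLVM code: Inst, Block, skipDebugBackward and findEarlierNonQueued are hypothetical stand-ins built on std::list, and skipDebugBackward only mimics what skipDebugInstructionsBackward is understood to do (step back over debug instructions without going past the given begin iterator).

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <set>

// Hypothetical stand-in for MachineInstr: an id plus a debug flag.
struct Inst {
  int Id;
  bool IsDebug;
};

using Block = std::list<Inst>;
using BlockIt = Block::iterator;

// Step back over debug instructions, but never past Begin.
inline BlockIt skipDebugBackward(BlockIt It, BlockIt Begin) {
  while (It != Begin && It->IsDebug)
    --It;
  return It;
}

// Walk up from Start (exclusive) to the nearest instruction that is neither
// a debug instruction nor in Queued. Falls back to begin() if none is found.
inline BlockIt findEarlierNonQueued(Block &BB, BlockIt Start,
                                    const std::set<int> &Queued) {
  assert(Start != BB.begin() && "Iterator should not be the start");
  BlockIt II = Start;
  do {
    // Decrement first, then let the helper absorb any run of debug insts.
    II = skipDebugBackward(std::prev(II), BB.begin());
    if (!II->IsDebug && !Queued.count(II->Id))
      return II;
  } while (II != BB.begin());
  return II;
}
```

Note that the decrement is kept in this sketch: the skip helper by itself does not move off a non-debug instruction, so replacing --II outright would stall the loop. A debug instruction sitting at begin() is still possible, which is why the IsDebug test remains in the return condition.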

MachineInstr *CurrInst = &*II;

// DBG_VALUE instructions aren't inserted into SlotIndexes, so skip them.

if (CurrInst->isDebugInstr()) {

if (II == ParentBB->begin())

return II;

continue;

}

if (!InstQueue.contains(CurrInst)) {

// We have a non-queued instruction. Return the iterator.

return CurrInst->getIterator();

}

} while (II != ParentBB->begin());

return II;

}

MachineBasicBlock::iterator MachineInstNumbering::findLaterNonQueuedInst(

MachineBasicBlock::iterator Start,

SmallPtrSetImpl<MachineInstr *> &InstQueue) {

MachineBasicBlock *ParentBB = Start->getParent();

assert(Start != ParentBB->end() && "Iterator should not be the end");

MachineBasicBlock::iterator II = Start;

do {

++II;

paquette (Unsubmitted)

Not Done

This function and findEarlierNonQueuedInst are the same aside from the direction they iterate in.

Is it possible to refactor them somehow so they can share more code?

findEarlierNonQueuedInst looks like it's the same loop if you use a reverse_iterator.
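
One possible shape for that refactor, as a standalone sketch rather than a drop-in patch: a single template does the search over any iterator pair, and the two directions differ only in which iterators they pass. Inst, Block, findNonQueued, findLater and findEarlier are hypothetical names on top of std::list; the real code would use MachineBasicBlock iterators and SmallPtrSetImpl.

```cpp
#include <cassert>
#include <iterator>
#include <list>
#include <set>

// Hypothetical stand-in for MachineInstr: an id plus a debug flag.
struct Inst {
  int Id;
  bool IsDebug;
};

using Block = std::list<Inst>;

// Shared search: advance from First towards Last and return the first
// position that is neither a debug instruction nor queued, else Last.
template <typename IterT>
IterT findNonQueued(IterT First, IterT Last, const std::set<int> &Queued) {
  for (; First != Last; ++First)
    if (!First->IsDebug && !Queued.count(First->Id))
      return First;
  return Last;
}

// Forward direction: search strictly after Start.
inline Block::iterator findLater(Block &BB, Block::iterator Start,
                                 const std::set<int> &Queued) {
  assert(Start != BB.end() && "Iterator should not be the end");
  return findNonQueued(std::next(Start), BB.end(), Queued);
}

// Backward direction: the same loop, driven by reverse iterators.
inline Block::iterator findEarlier(Block &BB, Block::iterator Start,
                                   const std::set<int> &Queued) {
  assert(Start != BB.begin() && "Iterator should not be the start");
  auto RIt =
      findNonQueued(std::make_reverse_iterator(Start), BB.rend(), Queued);
  // A reverse iterator refers to the element just before its base().
  // rend() is mapped to begin() to mirror the fallback in the patch.
  return RIt == BB.rend() ? BB.begin() : std::prev(RIt.base());
}
```

The only asymmetric part left is the conversion from the reverse iterator back to a forward one, which may be the kind of issue that got in the way of factoring this out originally.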

paquette (Unsubmitted)

Not Done

do {

- ++II;

+ skipDebugInstructionsForward(II, ParentBB->end());

if (II == ParentBB->end())

I think you can remove the check for CurrInst->isDebugInstr() if you use skipDebugInstructionsForward.

aemerson (Author, Unsubmitted)

Done

I'll see what I can do here. I did try to factor things out originally but ran into a few issues.

if (II == ParentBB->end())

return II;

MachineInstr *CurrInst = &*II;

// DBG_VALUE instructions aren't inserted into SlotIndexes, so skip them.

if (CurrInst->isDebugInstr()) {

if (II == ParentBB->end())

return II;

continue;

}

if (!InstQueue.contains(CurrInst)) {

// We have a non-queued instruction. Return the iterator.

return CurrInst->getIterator();

}

} while (true);

return II;

}

void MachineInstNumbering::extractInstsFromSameBlock(

SmallPtrSetImpl<MachineInstr *> &ExtractedInsts) {

MachineBasicBlock *ParentBB = nullptr;

for (unsigned I = 0, E = InsertionQueue.size(); I < E; ++I) {

MachineInstr *MI = InsertionQueue[I];

if (!MI)

continue;

if (!ParentBB) {

ParentBB = MI->getParent();

ExtractedInsts.insert(MI);

InsertionQueue[I] = nullptr;

continue;

}

if (MI->getParent() == ParentBB) {

ExtractedInsts.insert(MI);

InsertionQueue[I] = nullptr;

}

}

}

bool MachineInstNumbering::isEarlier(const MachineInstr &A,

const MachineInstr &B) {

assert(IsEnabled && "Expected Inst Numbering to be enabled");

// If we have outstanding queued instructions, process them first.

flushAndRenumber();

SlotIndex SlotA = SI->getInstructionIndex(A);

SlotIndex SlotB = SI->getInstructionIndex(B);

return SlotIndex::isEarlierInstr(SlotA, SlotB);

}

bool MachineInstNumbering::isEarlier(const MachineInstr &A,

const SlotIndex &BIdx, bool NoFlush) {

assert(BIdx.isValid() && "Expected valid index");

// If we have outstanding queued instructions, process them first.

if (!NoFlush)

flushAndRenumber();

SlotIndex SlotA = SI->getInstructionIndex(A);

return SlotIndex::isEarlierInstr(SlotA, BIdx);

}

MachineInstr *MachineInstNumbering::getEarliestInstr(

const SmallPtrSetImpl<MachineInstr *> &Instrs, bool NoFlush) {

MachineInstr *Earliest = nullptr;

SlotIndex EarliestIdx;

for (auto *Other : Instrs) {

if (!Earliest) {

Earliest = Other;

EarliestIdx = SI->getInstructionIndex(*Earliest);

continue;

}

assert(Earliest != Other);

if (isEarlier(*Other, EarliestIdx, NoFlush)) {

Earliest = Other;

EarliestIdx = SI->getInstructionIndex(*Other);

}

}

return Earliest;

}

MachineInstNumbering::~MachineInstNumbering() {

if (SI)

delete SI;

}

void GISelCSEInfo::recordNewInstruction(MachineInstr *MI) { void GISelCSEInfo::recordNewInstruction(MachineInstr *MI) {

if (shouldCSE(MI->getOpcode())) { if (shouldCSE(MI->getOpcode())) {

TemporaryInsts.insert(MI); TemporaryInsts.insert(MI);

LLVM_DEBUG(dbgs() << "CSEInfo::Recording new MI " << *MI); LLVM_DEBUG(dbgs() << "CSEInfo::Recording new MI " << *MI);

} }

} }

void GISelCSEInfo::handleRecordedInst(MachineInstr *MI) { void GISelCSEInfo::handleRecordedInst(MachineInstr *MI) {

[33 lines hidden; context: void GISelCSEInfo::handleRecordedInsts() {]

} }

bool GISelCSEInfo::shouldCSE(unsigned Opc) const { bool GISelCSEInfo::shouldCSE(unsigned Opc) const {

assert(CSEOpt.get() && "CSEConfig not set"); assert(CSEOpt.get() && "CSEConfig not set");

return CSEOpt->shouldCSEOpc(Opc); return CSEOpt->shouldCSEOpc(Opc);

} }

void GISelCSEInfo::erasingInstr(MachineInstr &MI) { handleRemoveInst(&MI); } void GISelCSEInfo::erasingInstr(MachineInstr &MI) {

void GISelCSEInfo::createdInstr(MachineInstr &MI) { recordNewInstruction(&MI); } handleRemoveInst(&MI);

if (MINumbering.isEnabled())

MINumbering.removeInstruction(MI);

}

void GISelCSEInfo::createdInstr(MachineInstr &MI) {

recordNewInstruction(&MI);

if (MINumbering.isEnabled())

MINumbering.queueInsertion(MI);

}

void GISelCSEInfo::changingInstr(MachineInstr &MI) { void GISelCSEInfo::changingInstr(MachineInstr &MI) {

// For now, perform erase, followed by insert. // We effectively do an erase + insert, but we don't call erasingInstr() or

erasingInstr(MI); // createdInstr() because we don't need to do anything for the

createdInstr(MI); // instruction numbering for instruction mutations.

handleRemoveInst(&MI);

recordNewInstruction(&MI);

} }

void GISelCSEInfo::changedInstr(MachineInstr &MI) { changingInstr(MI); } void GISelCSEInfo::changedInstr(MachineInstr &MI) { changingInstr(MI); }

void GISelCSEInfo::analyze(MachineFunction &MF) { void GISelCSEInfo::analyze(MachineFunction &MF) {

setMF(MF); setMF(MF);

for (auto &MBB : MF) { for (auto &MBB : MF) {

if (MBB.empty()) if (MBB.empty())

continue; continue;

for (MachineInstr &MI : MBB) { for (MachineInstr &MI : MBB) {

[9 lines hidden; context: void GISelCSEInfo::releaseMemory() {]

print(); print();

CSEMap.clear(); CSEMap.clear();

InstrMapping.clear(); InstrMapping.clear();

UniqueInstrAllocator.Reset(); UniqueInstrAllocator.Reset();

TemporaryInsts.clear(); TemporaryInsts.clear();

CSEOpt.reset(); CSEOpt.reset();

MRI = nullptr; MRI = nullptr;

MF = nullptr; MF = nullptr;

MINumbering.disable();

#ifndef NDEBUG #ifndef NDEBUG

OpcodeHitTable.clear(); OpcodeHitTable.clear();

#endif #endif

} }

Error GISelCSEInfo::verify() { Error GISelCSEInfo::verify() {

#ifndef NDEBUG #ifndef NDEBUG

handleRecordedInsts(); handleRecordedInsts();

[142 lines hidden]

} }

GISelCSEInfo & GISelCSEInfo &

GISelCSEAnalysisWrapper::get(std::unique_ptr<CSEConfigBase> CSEOpt, GISelCSEAnalysisWrapper::get(std::unique_ptr<CSEConfigBase> CSEOpt,

bool Recompute) { bool Recompute) {

if (!AlreadyComputed || Recompute) { if (!AlreadyComputed || Recompute) {

Info.releaseMemory(); Info.releaseMemory();

Info.setCSEConfig(std::move(CSEOpt)); Info.setCSEConfig(std::move(CSEOpt));

Info.getInstNumbering().init(*MF);

Info.analyze(*MF); Info.analyze(*MF);

AlreadyComputed = true; AlreadyComputed = true;

} }

return Info; return Info;

} }

void GISelCSEAnalysisWrapperPass::getAnalysisUsage(AnalysisUsage &AU) const { void GISelCSEAnalysisWrapperPass::getAnalysisUsage(AnalysisUsage &AU) const {

AU.setPreservesAll(); AU.setPreservesAll();

MachineFunctionPass::getAnalysisUsage(AU); MachineFunctionPass::getAnalysisUsage(AU);

} }

bool GISelCSEAnalysisWrapperPass::runOnMachineFunction(MachineFunction &MF) { bool GISelCSEAnalysisWrapperPass::runOnMachineFunction(MachineFunction &MF) {

releaseMemory(); releaseMemory();

Wrapper.setMF(MF); Wrapper.setMF(MF);

return false; return false;

} }
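
As a rough illustration of the queue-then-flush scheme that MachineInstNumbering implements in CSEInfo.cpp above (createdInstr() queues the instruction, isEarlier() flushes before comparing), here is a minimal standalone model. LazyNumbering and all of its members are hypothetical, instructions are plain ints, and the flush renumbers the whole block for simplicity, whereas the patch repairs only the range between the nearest already-numbered neighbours.

```cpp
#include <cassert>
#include <list>
#include <unordered_map>
#include <vector>

// Hypothetical model of lazily maintained instruction numbers for one block.
class LazyNumbering {
  std::list<int> Order;                    // instructions in program order
  std::unordered_map<int, unsigned> Index; // instruction -> slot number
  std::vector<int> Pending;                // inserted but not yet numbered

public:
  // Insertions only record the instruction; numbering is deferred.
  void pushBack(int Inst) {
    Order.push_back(Inst);
    Pending.push_back(Inst);
  }
  void pushFront(int Inst) {
    Order.push_front(Inst);
    Pending.push_back(Inst);
  }

  bool hasPending() const { return !Pending.empty(); }

  // Assign numbers only when something is queued. Simplification: this
  // renumbers everything instead of repairing a local range.
  void flushAndRenumber() {
    if (Pending.empty())
      return;
    unsigned Next = 0;
    for (int Inst : Order)
      Index[Inst] = Next++;
    Pending.clear();
  }

  // Ordering query: flush first, then compare two slot numbers.
  bool isEarlier(int A, int B) {
    flushAndRenumber();
    assert(Index.count(A) && Index.count(B) && "Unknown instruction");
    return Index[A] < Index[B];
  }
};
```

The point of the design is that a burst of insertions costs one renumbering at the next query instead of one per insertion, and each query afterwards is a pair of map lookups rather than a walk over the block.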

llvm/lib/CodeGen/GlobalISel/CSEMIRBuilder.cpp

	//===-- llvm/CodeGen/GlobalISel/CSEMIRBuilder.cpp - MIBuilder--*- C++ -*-==//			//===-- llvm/CodeGen/GlobalISel/CSEMIRBuilder.cpp - MIBuilder--*- C++ -*-==//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	/// \file			/// \file
	/// This file implements the CSEMIRBuilder class which CSEs as it builds			/// This file implements the CSEMIRBuilder class which CSEs as it builds
	/// instructions.			/// instructions.
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//
	//			//

	#include "llvm/CodeGen/GlobalISel/CSEMIRBuilder.h"			#include "llvm/CodeGen/GlobalISel/CSEMIRBuilder.h"
				#include "llvm/CodeGen/GlobalISel/CSEInfo.h"
	#include "llvm/CodeGen/GlobalISel/GISelChangeObserver.h"			#include "llvm/CodeGen/GlobalISel/GISelChangeObserver.h"
				#include "llvm/CodeGen/MachineBasicBlock.h"
	#include "llvm/IR/DebugInfoMetadata.h"			#include "llvm/IR/DebugInfoMetadata.h"

	using namespace llvm;			using namespace llvm;

	bool CSEMIRBuilder::dominates(MachineBasicBlock::const_iterator A,			bool CSEMIRBuilder::dominates(MachineBasicBlock::const_iterator A,
	MachineBasicBlock::const_iterator B) const {			MachineBasicBlock::const_iterator B) {
	auto MBBEnd = getMBB().end();			auto MBBEnd = getMBB().end();
	if (B == MBBEnd)			if (B == MBBEnd)
	return true;			return true;
	assert(A->getParent() == B->getParent() &&			assert(A->getParent() == B->getParent() &&
	"Iterators should be in same block");			"Iterators should be in same block");

				auto &InstNumbering = getCSEInfo()->getInstNumbering();
				if (InstNumbering.isEnabled())
				return InstNumbering.isEarlier(*A, *B);

	const MachineBasicBlock *BBA = A->getParent();			const MachineBasicBlock *BBA = A->getParent();
	MachineBasicBlock::const_iterator I = BBA->begin();			MachineBasicBlock::const_iterator I = BBA->begin();
	for (; &*I != A && &*I != B; ++I)			for (; &*I != A && &*I != B; ++I)
	;			;
	return &*I == A;			return &*I == A;
	}			}

	MachineInstrBuilder			MachineInstrBuilder
	CSEMIRBuilder::getDominatingInstrForID(FoldingSetNodeID &ID,			CSEMIRBuilder::getDominatingInstrForID(FoldingSetNodeID &ID,
	void *&NodeInsertPos) {			void *&NodeInsertPos) {
	GISelCSEInfo *CSEInfo = getCSEInfo();			GISelCSEInfo *CSEInfo = getCSEInfo();
	assert(CSEInfo && "Can't get here without setting CSEInfo");			assert(CSEInfo && "Can't get here without setting CSEInfo");
	MachineBasicBlock *CurMBB = &getMBB();			MachineBasicBlock *CurMBB = &getMBB();
	MachineInstr *MI =			MachineInstr *MI =
	CSEInfo->getMachineInstrIfExists(ID, CurMBB, NodeInsertPos);			CSEInfo->getMachineInstrIfExists(ID, CurMBB, NodeInsertPos);
	if (MI) {			if (MI) {
	CSEInfo->countOpcodeHit(MI->getOpcode());			CSEInfo->countOpcodeHit(MI->getOpcode());
	auto CurrPos = getInsertPt();			auto CurrPos = getInsertPt();
	auto MII = MachineBasicBlock::iterator(MI);			auto MII = MachineBasicBlock::iterator(MI);
	if (MII == CurrPos) {			if (MII == CurrPos) {
	// Move the insert point ahead of the instruction so any future uses of			// Move the insert point ahead of the instruction so any future uses of
	// this builder will have the def ready.			// this builder will have the def ready.
	setInsertPt(*CurMBB, std::next(MII));			setInsertPt(*CurMBB, std::next(MII));
	} else if (!dominates(MI, CurrPos)) {			} else if (!dominates(MI, CurrPos)) {
				++MII;
	CurMBB->splice(CurrPos, CurMBB, MI);			CurMBB->splice(CurrPos, CurMBB, MI);
				auto &InstNumbering = CSEInfo->getInstNumbering();
				if (InstNumbering.isEnabled()) {
				InstNumbering.removeInstruction(*MI);
				// Update numbering for the modified range.
				InstNumbering.repairNumberingInRange(MachineBasicBlock::iterator(MI),
				MII);
				}
	}			}
	return MachineInstrBuilder(getMF(), MI);			return MachineInstrBuilder(getMF(), MI);
	}			}
	return MachineInstrBuilder();			return MachineInstrBuilder();
	}			}

	bool CSEMIRBuilder::canPerformCSEForOpc(unsigned Opc) const {			bool CSEMIRBuilder::canPerformCSEForOpc(unsigned Opc) const {
	const GISelCSEInfo *CSEInfo = getCSEInfo();			const GISelCSEInfo *CSEInfo = getCSEInfo();
[226 lines hidden]

llvm/lib/CodeGen/GlobalISel/IRTranslator.cpp

[3,041 lines hidden; context: bool EnableCSE = EnableCSEInIRTranslator.getNumOccurrences()]
? EnableCSEInIRTranslator		? EnableCSEInIRTranslator
: TPC->isGISelCSEEnabled();		: TPC->isGISelCSEEnabled();

if (EnableCSE) {		if (EnableCSE) {
EntryBuilder = std::make_unique<CSEMIRBuilder>(CurMF);		EntryBuilder = std::make_unique<CSEMIRBuilder>(CurMF);
CSEInfo = &Wrapper.get(TPC->getCSEConfig());		CSEInfo = &Wrapper.get(TPC->getCSEConfig());
EntryBuilder->setCSEInfo(CSEInfo);		EntryBuilder->setCSEInfo(CSEInfo);
CurBuilder = std::make_unique<CSEMIRBuilder>(CurMF);		CurBuilder = std::make_unique<CSEMIRBuilder>(CurMF);
		// Disable instruction numbering while building the MF.
		CSEInfo->getInstNumbering().disable();
CurBuilder->setCSEInfo(CSEInfo);		CurBuilder->setCSEInfo(CSEInfo);
} else {		} else {
EntryBuilder = std::make_unique<MachineIRBuilder>();		EntryBuilder = std::make_unique<MachineIRBuilder>();
CurBuilder = std::make_unique<MachineIRBuilder>();		CurBuilder = std::make_unique<MachineIRBuilder>();
}		}
CLI = MF->getSubtarget().getCallLowering();		CLI = MF->getSubtarget().getCallLowering();
CurBuilder->setMF(*MF);		CurBuilder->setMF(*MF);
EntryBuilder->setMF(*MF);		EntryBuilder->setMF(*MF);
[176 lines hidden; context: #endif]
MF->DeleteMachineBasicBlock(EntryBB);		MF->DeleteMachineBasicBlock(EntryBB);

assert(&MF->front() == &NewEntryBB &&		assert(&MF->front() == &NewEntryBB &&
"New entry wasn't next in the list of basic block!");		"New entry wasn't next in the list of basic block!");

// Initialize stack protector information.		// Initialize stack protector information.
StackProtector &SP = getAnalysis<StackProtector>();		StackProtector &SP = getAnalysis<StackProtector>();
SP.copyToMachineFrameInfo(MF->getFrameInfo());		SP.copyToMachineFrameInfo(MF->getFrameInfo());

return false;		return false;
}		}

llvm/lib/CodeGen/GlobalISel/Legalizer.cpp

[17 lines hidden]
#include "llvm/CodeGen/GlobalISel/CSEInfo.h"		#include "llvm/CodeGen/GlobalISel/CSEInfo.h"
#include "llvm/CodeGen/GlobalISel/CSEMIRBuilder.h"		#include "llvm/CodeGen/GlobalISel/CSEMIRBuilder.h"
#include "llvm/CodeGen/GlobalISel/GISelChangeObserver.h"		#include "llvm/CodeGen/GlobalISel/GISelChangeObserver.h"
#include "llvm/CodeGen/GlobalISel/GISelWorkList.h"		#include "llvm/CodeGen/GlobalISel/GISelWorkList.h"
#include "llvm/CodeGen/GlobalISel/LegalizationArtifactCombiner.h"		#include "llvm/CodeGen/GlobalISel/LegalizationArtifactCombiner.h"
#include "llvm/CodeGen/GlobalISel/LegalizerHelper.h"		#include "llvm/CodeGen/GlobalISel/LegalizerHelper.h"
#include "llvm/CodeGen/GlobalISel/LostDebugLocObserver.h"		#include "llvm/CodeGen/GlobalISel/LostDebugLocObserver.h"
#include "llvm/CodeGen/GlobalISel/Utils.h"		#include "llvm/CodeGen/GlobalISel/Utils.h"
		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/CodeGen/MachineOptimizationRemarkEmitter.h"		#include "llvm/CodeGen/MachineOptimizationRemarkEmitter.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/CodeGen/TargetPassConfig.h"		#include "llvm/CodeGen/TargetPassConfig.h"
#include "llvm/CodeGen/TargetSubtargetInfo.h"		#include "llvm/CodeGen/TargetSubtargetInfo.h"
#include "llvm/InitializePasses.h"		#include "llvm/InitializePasses.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"
#include "llvm/Support/Error.h"		#include "llvm/Support/Error.h"
#include "llvm/Target/TargetMachine.h"		#include "llvm/Target/TargetMachine.h"
[291 lines hidden; context: bool Legalizer::runOnMachineFunction(MachineFunction &MF) {]
GISelCSEInfo *CSEInfo = nullptr;		GISelCSEInfo *CSEInfo = nullptr;
bool EnableCSE = EnableCSEInLegalizer.getNumOccurrences()		bool EnableCSE = EnableCSEInLegalizer.getNumOccurrences()
? EnableCSEInLegalizer		? EnableCSEInLegalizer
: TPC.isGISelCSEEnabled();		: TPC.isGISelCSEEnabled();
if (EnableCSE) {		if (EnableCSE) {
MIRBuilder = std::make_unique<CSEMIRBuilder>();		MIRBuilder = std::make_unique<CSEMIRBuilder>();
CSEInfo = &Wrapper.get(TPC.getCSEConfig());		CSEInfo = &Wrapper.get(TPC.getCSEConfig());
MIRBuilder->setCSEInfo(CSEInfo);		MIRBuilder->setCSEInfo(CSEInfo);
		CSEInfo->getInstNumbering().enable(&MF);
} else		} else
MIRBuilder = std::make_unique<MachineIRBuilder>();		MIRBuilder = std::make_unique<MachineIRBuilder>();

SmallVector<GISelChangeObserver *, 1> AuxObservers;		SmallVector<GISelChangeObserver *, 1> AuxObservers;
if (EnableCSE && CSEInfo) {		if (EnableCSE && CSEInfo) {
// We want CSEInfo in addition to WorkListObserver to observe all changes.		// We want CSEInfo in addition to WorkListObserver to observe all changes.
AuxObservers.push_back(CSEInfo);		AuxObservers.push_back(CSEInfo);
}		}
[54 lines hidden]

llvm/lib/CodeGen/GlobalISel/Localizer.cpp

//===- Localizer.cpp ---------------------- Localize some instrs -*- C++ -*-==//		//===- Localizer.cpp ---------------------- Localize some instrs -*- C++ -*-==//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
/// \file		/// \file
/// This file implements the Localizer class.		/// This file implements the Localizer class.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/CodeGen/GlobalISel/Localizer.h"		#include "llvm/CodeGen/GlobalISel/Localizer.h"
#include "llvm/ADT/DenseMap.h"		#include "llvm/ADT/DenseMap.h"
#include "llvm/Analysis/TargetTransformInfo.h"		#include "llvm/Analysis/TargetTransformInfo.h"
		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/CodeGen/TargetLowering.h"		#include "llvm/CodeGen/TargetLowering.h"
		#include "llvm/CodeGen/TargetPassConfig.h"
#include "llvm/InitializePasses.h"		#include "llvm/InitializePasses.h"
#include "llvm/Support/Debug.h"		#include "llvm/Support/Debug.h"

#define DEBUG_TYPE "localizer"		#define DEBUG_TYPE "localizer"

using namespace llvm;		using namespace llvm;

		static cl::opt<bool> UseInstNumbering(
		"localizer-use-numbering", cl::Hidden,
		cl::desc("Use the instruction numbering in the localizer."),
		cl::init(true));

char Localizer::ID = 0;		char Localizer::ID = 0;
INITIALIZE_PASS_BEGIN(Localizer, DEBUG_TYPE,		INITIALIZE_PASS_BEGIN(Localizer, DEBUG_TYPE,
"Move/duplicate certain instructions close to their use",		"Move/duplicate certain instructions close to their use",
false, false)		false, false)
INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)		INITIALIZE_PASS_DEPENDENCY(TargetTransformInfoWrapperPass)
INITIALIZE_PASS_END(Localizer, DEBUG_TYPE,		INITIALIZE_PASS_END(Localizer, DEBUG_TYPE,
"Move/duplicate certain instructions close to their use",		"Move/duplicate certain instructions close to their use",
false, false)		false, false)

Localizer::Localizer(std::function<bool(const MachineFunction &)> F)		Localizer::Localizer(std::function<bool(const MachineFunction &)> F)
: MachineFunctionPass(ID), DoNotRunPass(F) {}		: MachineFunctionPass(ID), DoNotRunPass(F) {}

Localizer::Localizer()		Localizer::Localizer()
: Localizer([](const MachineFunction &) { return false; }) {}		: Localizer([](const MachineFunction &) { return false; }) {}

void Localizer::init(MachineFunction &MF) {		void Localizer::init(MachineFunction &MF) {
MRI = &MF.getRegInfo();		MRI = &MF.getRegInfo();
TTI = &getAnalysis<TargetTransformInfoWrapperPass>().getTTI(MF.getFunction());		TTI = &getAnalysis<TargetTransformInfoWrapperPass>().getTTI(MF.getFunction());

		if (!UseInstNumbering) {
		CSEInfo = nullptr;
		return;
		}
		// Enable inst numbering, we need to require the CSEInfo analysis since that
		// holds the numbering.
		auto &TPC = getAnalysis<TargetPassConfig>();
		GISelCSEAnalysisWrapper &Wrapper =
		getAnalysis<GISelCSEAnalysisWrapperPass>().getCSEWrapper();
		auto *FnCSEInfo = &Wrapper.get(TPC.getCSEConfig());
		FnCSEInfo->getInstNumbering().enable(&MF);
		CSEInfo = FnCSEInfo;
}		}

void Localizer::getAnalysisUsage(AnalysisUsage &AU) const {		void Localizer::getAnalysisUsage(AnalysisUsage &AU) const {
AU.addRequired<TargetTransformInfoWrapperPass>();		AU.addRequired<TargetTransformInfoWrapperPass>();
		AU.addRequired<TargetPassConfig>();
getSelectionDAGFallbackAnalysisUsage(AU);		getSelectionDAGFallbackAnalysisUsage(AU);
		AU.addRequired<GISelCSEAnalysisWrapperPass>();
		AU.addPreserved<GISelCSEAnalysisWrapperPass>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

bool Localizer::isLocalUse(MachineOperand &MOUse, const MachineInstr &Def,		bool Localizer::isLocalUse(MachineOperand &MOUse, const MachineInstr &Def,
MachineBasicBlock *&InsertMBB) {		MachineBasicBlock *&InsertMBB) {
MachineInstr &MIUse = *MOUse.getParent();		MachineInstr &MIUse = *MOUse.getParent();
InsertMBB = MIUse.getParent();		InsertMBB = MIUse.getParent();
if (MIUse.isPHI())		if (MIUse.isPHI())
[59 lines hidden; context: for (auto MOIt = MRI->use_begin(Reg), MOItEnd = MRI->use_end();]
LocalizedMI->getOperand(0).setReg(NewReg);		LocalizedMI->getOperand(0).setReg(NewReg);
NewVRegIt =		NewVRegIt =
MBBWithLocalDef.insert(std::make_pair(MBBAndReg, NewReg)).first;		MBBWithLocalDef.insert(std::make_pair(MBBAndReg, NewReg)).first;
LLVM_DEBUG(dbgs() << "Inserted: " << *LocalizedMI);		LLVM_DEBUG(dbgs() << "Inserted: " << *LocalizedMI);
}		}
LLVM_DEBUG(dbgs() << "Update use with: " << printReg(NewVRegIt->second)		LLVM_DEBUG(dbgs() << "Update use with: " << printReg(NewVRegIt->second)
<< '\n');		<< '\n');
// Update the user reg.		// Update the user reg.
		if (CSEInfo)
		CSEInfo->changedInstr(*MOUse.getParent());
MOUse.setReg(NewVRegIt->second);		MOUse.setReg(NewVRegIt->second);
		if (CSEInfo)
		CSEInfo->changedInstr(*MOUse.getParent());
}		}
}		}
return Changed;		return Changed;
}		}

bool Localizer::localizeIntraBlock(LocalizedSetVecT &LocalizedInstrs) {		bool Localizer::localizeIntraBlock(LocalizedSetVecT &LocalizedInstrs) {
bool Changed = false;		bool Changed = false;

[13 lines hidden; context: for (MachineInstr &UseMI : MRI->use_nodbg_instructions(Reg)) {]
Users.insert(&UseMI);		Users.insert(&UseMI);
}		}
// If all the users were PHIs then they're not going to be in our block,		// If all the users were PHIs then they're not going to be in our block,
// don't try to move this instruction.		// don't try to move this instruction.
if (Users.empty())		if (Users.empty())
continue;		continue;

MachineBasicBlock::iterator II(MI);		MachineBasicBlock::iterator II(MI);
		if (UseInstNumbering) {
		auto &InstNumbering = CSEInfo->getInstNumbering();
		assert(InstNumbering.isEnabled() && "Inst number is not enabled!");
		// We have a set of instructions which we know are users within the same
		// block. We want to find the first user of MI.
		MachineInstr *Nearest =
		InstNumbering.getEarliestInstr(Users, /* NoFlush */ true);
		II = Nearest->getIterator();
		} else {
++II;		++II;
while (II != MBB.end() && !Users.count(&*II))		while (II != MBB.end() && !Users.count(&*II))
++II;		++II;
		}
LLVM_DEBUG(dbgs() << "Intra-block: moving " << *MI << " before " << *&*II		LLVM_DEBUG(dbgs() << "Intra-block: moving " << *MI << " before " << *&*II
<< "\n");		<< "\n");
assert(II != MBB.end() && "Didn't find the user in the MBB");		assert(II != MBB.end() && "Didn't find the user in the MBB");
MI->removeFromParent();		MI->removeFromParent();
MBB.insert(II, MI);		MBB.insert(II, MI);

		if (UseInstNumbering)
		CSEInfo->getInstNumbering().flushAndRenumber();

Changed = true;		Changed = true;
}		}
return Changed;		return Changed;
}		}

bool Localizer::runOnMachineFunction(MachineFunction &MF) {		bool Localizer::runOnMachineFunction(MachineFunction &MF) {
// If the ISel pipeline failed, do not bother running that pass.		// If the ISel pipeline failed, do not bother running that pass.
if (MF.getProperties().hasProperty(		if (MF.getProperties().hasProperty(
MachineFunctionProperties::Property::FailedISel))		MachineFunctionProperties::Property::FailedISel))
return false;		return false;

// Don't run the pass if the target asked so.		// Don't run the pass if the target asked so.
if (DoNotRunPass(MF))		if (DoNotRunPass(MF))
return false;		return false;

LLVM_DEBUG(dbgs() << "Localize instructions for: " << MF.getName() << '\n');		LLVM_DEBUG(dbgs() << "Localize instructions for: " << MF.getName() << '\n');

init(MF);		init(MF);

		GISelObserverWrapper WrapperObserver(CSEInfo);
		RAIIMFObsDelInstaller Installer(MF, WrapperObserver);

// Keep track of the instructions we localized. We'll do a second pass of		// Keep track of the instructions we localized. We'll do a second pass of
// intra-block localization to further reduce live ranges.		// intra-block localization to further reduce live ranges.
LocalizedSetVecT LocalizedInstrs;		LocalizedSetVecT LocalizedInstrs;

		CSEInfo->getInstNumbering().flushAndRenumber();

bool Changed = localizeInterBlock(MF, LocalizedInstrs);		bool Changed = localizeInterBlock(MF, LocalizedInstrs);
Changed |= localizeIntraBlock(LocalizedInstrs);		Changed |= localizeIntraBlock(LocalizedInstrs);
return Changed;		return Changed;
}		}
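
The intra-block hunk above chooses between two ways of finding the first user of MI: asking the instruction numbering for the earliest user, or scanning forward through the block. The following is a minimal standalone sketch of that trade-off, not the patch's implementation. `Instr`, `ToyNumbering`, `firstUserByScan` and `getEarliest` are hypothetical names invented for illustration and are not LLVM APIs; the real MachineInstNumbering is described in the summary as a wrapper around SlotIndexes.

```cpp
#include <cassert>
#include <cstddef>
#include <unordered_map>
#include <unordered_set>
#include <vector>

// Hypothetical stand-in for a machine instruction; not an LLVM type.
struct Instr {
  int Id;
};

// Linear scan: walk forward from Start until a user is found.
// Cost grows with the distance to the first user.
// Returns Block.size() if no user follows Start.
std::size_t firstUserByScan(const std::vector<Instr *> &Block,
                            std::size_t Start,
                            const std::unordered_set<Instr *> &Users) {
  std::size_t I = Start + 1;
  while (I != Block.size() && !Users.count(Block[I]))
    ++I;
  return I;
}

// Hypothetical numbering: records each instruction's position in its block.
struct ToyNumbering {
  std::unordered_map<const Instr *, std::size_t> Index;

  explicit ToyNumbering(const std::vector<Instr *> &Block) {
    for (std::size_t I = 0; I < Block.size(); ++I)
      Index[Block[I]] = I;
  }

  // Return the user with the smallest number, or nullptr if Users is empty.
  // Cost grows with the number of users, not with the block length.
  Instr *getEarliest(const std::unordered_set<Instr *> &Users) const {
    Instr *Best = nullptr;
    for (Instr *U : Users) {
      assert(Index.count(U) && "user is not numbered");
      if (!Best || Index.at(U) < Index.at(Best))
        Best = U;
    }
    return Best;
  }
};
```

In this toy version the numbering has to be rebuilt whenever an instruction moves, which is loosely analogous to the flushAndRenumber() call that follows the move in the hunk above.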

llvm/lib/CodeGen/GlobalISel/RegBankSelect.cpp

//==- llvm/CodeGen/GlobalISel/RegBankSelect.cpp - RegBankSelect --- C++ --==//		//==- llvm/CodeGen/GlobalISel/RegBankSelect.cpp - RegBankSelect --- C++ --==//
//		//
// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.		// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
// See https://llvm.org/LICENSE.txt for license information.		// See https://llvm.org/LICENSE.txt for license information.
// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception		// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//
/// \file		/// \file
/// This file implements the RegBankSelect class.		/// This file implements the RegBankSelect class.
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#include "llvm/CodeGen/GlobalISel/RegBankSelect.h"		#include "llvm/CodeGen/GlobalISel/RegBankSelect.h"
#include "llvm/ADT/PostOrderIterator.h"		#include "llvm/ADT/PostOrderIterator.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallVector.h"		#include "llvm/ADT/SmallVector.h"
		#include "llvm/CodeGen/GlobalISel/CSEMIRBuilder.h"
#include "llvm/CodeGen/GlobalISel/LegalizerInfo.h"		#include "llvm/CodeGen/GlobalISel/LegalizerInfo.h"
#include "llvm/CodeGen/GlobalISel/RegisterBank.h"		#include "llvm/CodeGen/GlobalISel/RegisterBank.h"
#include "llvm/CodeGen/GlobalISel/RegisterBankInfo.h"		#include "llvm/CodeGen/GlobalISel/RegisterBankInfo.h"
#include "llvm/CodeGen/GlobalISel/Utils.h"		#include "llvm/CodeGen/GlobalISel/Utils.h"
#include "llvm/CodeGen/MachineBasicBlock.h"		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"		#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"
#include "llvm/CodeGen/MachineBranchProbabilityInfo.h"		#include "llvm/CodeGen/MachineBranchProbabilityInfo.h"
#include "llvm/CodeGen/MachineFunction.h"		#include "llvm/CodeGen/MachineFunction.h"
[63 lines not shown]	void RegBankSelect::init(MachineFunction &MF) {
TPC = &getAnalysis<TargetPassConfig>();		TPC = &getAnalysis<TargetPassConfig>();
if (OptMode != Mode::Fast) {		if (OptMode != Mode::Fast) {
MBFI = &getAnalysis<MachineBlockFrequencyInfo>();		MBFI = &getAnalysis<MachineBlockFrequencyInfo>();
MBPI = &getAnalysis<MachineBranchProbabilityInfo>();		MBPI = &getAnalysis<MachineBranchProbabilityInfo>();
} else {		} else {
MBFI = nullptr;		MBFI = nullptr;
MBPI = nullptr;		MBPI = nullptr;
}		}
MIRBuilder.setMF(MF);		// Enable CSE and inst numbering.
		GISelCSEAnalysisWrapper &Wrapper =
		getAnalysis<GISelCSEAnalysisWrapperPass>().getCSEWrapper();
		auto *CSEInfo = &Wrapper.get(TPC->getCSEConfig());
		CSEInfo->getInstNumbering().enable(&MF);
		MIRBuilder = std::make_unique<CSEMIRBuilder>();
		MIRBuilder->setMF(MF);
		MIRBuilder->setCSEInfo(CSEInfo);

MORE = std::make_unique<MachineOptimizationRemarkEmitter>(MF, MBFI);		MORE = std::make_unique<MachineOptimizationRemarkEmitter>(MF, MBFI);
}		}

void RegBankSelect::getAnalysisUsage(AnalysisUsage &AU) const {		void RegBankSelect::getAnalysisUsage(AnalysisUsage &AU) const {
if (OptMode != Mode::Fast) {		if (OptMode != Mode::Fast) {
// We could preserve the information from these two analysis but		// We could preserve the information from these two analysis but
// the APIs do not allow to do so yet.		// the APIs do not allow to do so yet.
AU.addRequired<MachineBlockFrequencyInfo>();		AU.addRequired<MachineBlockFrequencyInfo>();
AU.addRequired<MachineBranchProbabilityInfo>();		AU.addRequired<MachineBranchProbabilityInfo>();
}		}
AU.addRequired<TargetPassConfig>();		AU.addRequired<TargetPassConfig>();
getSelectionDAGFallbackAnalysisUsage(AU);		getSelectionDAGFallbackAnalysisUsage(AU);
		AU.addRequired<GISelCSEAnalysisWrapperPass>();
		AU.addPreserved<GISelCSEAnalysisWrapperPass>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

bool RegBankSelect::assignmentMatch(		bool RegBankSelect::assignmentMatch(
Register Reg, const RegisterBankInfo::ValueMapping &ValMapping,		Register Reg, const RegisterBankInfo::ValueMapping &ValMapping,
bool &OnlyAssign) const {		bool &OnlyAssign) const {
// By default we assume we will have to repair something.		// By default we assume we will have to repair something.
OnlyAssign = false;		OnlyAssign = false;
[39 lines not shown]	if (MO.isDef())
std::swap(Src, Dst);		std::swap(Src, Dst);

assert((RepairPt.getNumInsertPoints() == 1 ||		assert((RepairPt.getNumInsertPoints() == 1 ||
Register::isPhysicalRegister(Dst)) &&		Register::isPhysicalRegister(Dst)) &&
"We are about to create several defs for Dst");		"We are about to create several defs for Dst");

// Build the instruction used to repair, then clone it at the right		// Build the instruction used to repair, then clone it at the right
// places. Avoiding buildCopy bypasses the check that Src and Dst have the		// places. Avoiding buildCopy bypasses the check that Src and Dst have the
// same types because the type is a placeholder when this function is called.		// same types because the type is a placeholder when this function is
MI = MIRBuilder.buildInstrNoInsert(TargetOpcode::COPY)		// called.
		MI = MIRBuilder->buildInstrNoInsert(TargetOpcode::COPY)
.addDef(Dst)		.addDef(Dst)
.addUse(Src);		.addUse(Src);
LLVM_DEBUG(dbgs() << "Copy: " << printReg(Src) << " to: " << printReg(Dst)		LLVM_DEBUG(dbgs() << "Copy: " << printReg(Src) << " to: " << printReg(Dst)
<< '\n');		<< '\n');
} else {		} else {
// TODO: Support with G_IMPLICIT_DEF + G_INSERT sequence or G_EXTRACT		// TODO: Support with G_IMPLICIT_DEF + G_INSERT sequence or G_EXTRACT
// sequence.		// sequence.
assert(ValMapping.partsAllUniform() && "irregular breakdowns not supported");		assert(ValMapping.partsAllUniform() && "irregular breakdowns not supported");

LLT RegTy = MRI->getType(MO.getReg());		LLT RegTy = MRI->getType(MO.getReg());
[11 lines not shown]	if (MO.isDef()) {
"don't understand this value breakdown");		"don't understand this value breakdown");

MergeOp = TargetOpcode::G_CONCAT_VECTORS;		MergeOp = TargetOpcode::G_CONCAT_VECTORS;
}		}
} else		} else
MergeOp = TargetOpcode::G_MERGE_VALUES;		MergeOp = TargetOpcode::G_MERGE_VALUES;

auto MergeBuilder =		auto MergeBuilder =
MIRBuilder.buildInstrNoInsert(MergeOp)		MIRBuilder->buildInstrNoInsert(MergeOp).addDef(MO.getReg());
.addDef(MO.getReg());

for (Register SrcReg : NewVRegs)		for (Register SrcReg : NewVRegs)
MergeBuilder.addUse(SrcReg);		MergeBuilder.addUse(SrcReg);

MI = MergeBuilder;		MI = MergeBuilder;
} else {		} else {
MachineInstrBuilder UnMergeBuilder =		MachineInstrBuilder UnMergeBuilder =
MIRBuilder.buildInstrNoInsert(TargetOpcode::G_UNMERGE_VALUES);		MIRBuilder->buildInstrNoInsert(TargetOpcode::G_UNMERGE_VALUES);
for (Register DefReg : NewVRegs)		for (Register DefReg : NewVRegs)
UnMergeBuilder.addDef(DefReg);		UnMergeBuilder.addDef(DefReg);

UnMergeBuilder.addUse(MO.getReg());		UnMergeBuilder.addUse(MO.getReg());
MI = UnMergeBuilder;		MI = UnMergeBuilder;
}		}
}		}

if (RepairPt.getNumInsertPoints() != 1)		if (RepairPt.getNumInsertPoints() != 1)
report_fatal_error("need testcase to support multiple insertion points");		report_fatal_error("need testcase to support multiple insertion points");

// TODO:		// TODO:
// Check if MI is legal. if not, we need to legalize all the		// Check if MI is legal. if not, we need to legalize all the
// instructions we are going to insert.		// instructions we are going to insert.
std::unique_ptr<MachineInstr *[]> NewInstrs(		std::unique_ptr<MachineInstr *[]> NewInstrs(
new MachineInstr *[RepairPt.getNumInsertPoints()]);		new MachineInstr *[RepairPt.getNumInsertPoints()]);
bool IsFirst = true;		bool IsFirst = true;
unsigned Idx = 0;		unsigned Idx = 0;
for (const std::unique_ptr<InsertPoint> &InsertPt : RepairPt) {		for (const std::unique_ptr<InsertPoint> &InsertPt : RepairPt) {
MachineInstr *CurMI;		MachineInstr *CurMI;
if (IsFirst)		if (IsFirst)
CurMI = MI;		CurMI = MI;
else		else
CurMI = MIRBuilder.getMF().CloneMachineInstr(MI);		CurMI = MIRBuilder->getMF().CloneMachineInstr(MI);
InsertPt->insert(*CurMI);		InsertPt->insert(*CurMI);
NewInstrs[Idx++] = CurMI;		NewInstrs[Idx++] = CurMI;
IsFirst = false;		IsFirst = false;
}		}
// TODO:		// TODO:
// Legalize NewInstrs if need be.		// Legalize NewInstrs if need be.
return true;		return true;
}		}
[421 lines not shown]	bool RegBankSelect::runOnMachineFunction(MachineFunction &MF) {

LLVM_DEBUG(dbgs() << "Assign register banks for: " << MF.getName() << '\n');		LLVM_DEBUG(dbgs() << "Assign register banks for: " << MF.getName() << '\n');
const Function &F = MF.getFunction();		const Function &F = MF.getFunction();
Mode SaveOptMode = OptMode;		Mode SaveOptMode = OptMode;
if (F.hasOptNone())		if (F.hasOptNone())
OptMode = Mode::Fast;		OptMode = Mode::Fast;
init(MF);		init(MF);

		GISelObserverWrapper WrapperObserver(MIRBuilder->getCSEInfo());
		RAIIMFObsDelInstaller Installer(MF, WrapperObserver);

#ifndef NDEBUG		#ifndef NDEBUG
// Check that our input is fully legal: we require the function to have the		// Check that our input is fully legal: we require the function to have the
// Legalized property, so it should be.		// Legalized property, so it should be.
// FIXME: This should be in the MachineVerifier.		// FIXME: This should be in the MachineVerifier.
if (!DisableGISelLegalityCheck)		if (!DisableGISelLegalityCheck)
if (const MachineInstr *MI = machineFunctionIsIllegal(MF)) {		if (const MachineInstr *MI = machineFunctionIsIllegal(MF)) {
reportGISelFailure(MF, TPC, MORE, "gisel-regbankselect",		reportGISelFailure(MF, TPC, MORE, "gisel-regbankselect",
"instruction is not legal", *MI);		"instruction is not legal", *MI);
return false;		return false;
}		}
#endif		#endif

// Walk the function and assign register banks to all operands.		// Walk the function and assign register banks to all operands.
// Use a RPOT to make sure all registers are assigned before we choose		// Use a RPOT to make sure all registers are assigned before we choose
// the best mapping of the current instruction.		// the best mapping of the current instruction.
ReversePostOrderTraversal<MachineFunction*> RPOT(&MF);		ReversePostOrderTraversal<MachineFunction*> RPOT(&MF);
for (MachineBasicBlock *MBB : RPOT) {		for (MachineBasicBlock *MBB : RPOT) {
// Set a sensible insertion point so that subsequent calls to		// Set a sensible insertion point so that subsequent calls to
// MIRBuilder.		// MIRBuilder.
MIRBuilder.setMBB(*MBB);		MIRBuilder->setMBB(*MBB);
for (MachineBasicBlock::iterator MII = MBB->begin(), End = MBB->end();		for (MachineBasicBlock::iterator MII = MBB->begin(), End = MBB->end();
MII != End;) {		MII != End;) {
// MI might be invalidated by the assignment, so move the		// MI might be invalidated by the assignment, so move the
// iterator before hand.		// iterator before hand.
MachineInstr &MI = *MII++;		MachineInstr &MI = *MII++;

// Ignore target-specific post-isel instructions: they should use proper		// Ignore target-specific post-isel instructions: they should use proper
// regclasses.		// regclasses.
[17 lines not shown]	for (MachineBasicBlock::iterator MII = MBB->begin(), End = MBB->end();

// It's possible the mapping changed control flow, and moved the following		// It's possible the mapping changed control flow, and moved the following
// instruction to a new block, so figure out the new parent.		// instruction to a new block, so figure out the new parent.
if (MII != End) {		if (MII != End) {
MachineBasicBlock *NextInstBB = MII->getParent();		MachineBasicBlock *NextInstBB = MII->getParent();
if (NextInstBB != MBB) {		if (NextInstBB != MBB) {
LLVM_DEBUG(dbgs() << "Instruction mapping changed control flow\n");		LLVM_DEBUG(dbgs() << "Instruction mapping changed control flow\n");
MBB = NextInstBB;		MBB = NextInstBB;
MIRBuilder.setMBB(*MBB);		MIRBuilder->setMBB(*MBB);
End = MBB->end();		End = MBB->end();
}		}
}		}
}		}
}		}

OptMode = SaveOptMode;		OptMode = SaveOptMode;
return false;		return false;
[357 lines not shown]

llvm/lib/Target/AArch64/GISel/AArch64PostLegalizerCombiner.cpp

[303 lines not shown]	void AArch64PostLegalizerCombiner::getAnalysisUsage(AnalysisUsage &AU) const {
AU.setPreservesCFG();		AU.setPreservesCFG();
getSelectionDAGFallbackAnalysisUsage(AU);		getSelectionDAGFallbackAnalysisUsage(AU);
AU.addRequired<GISelKnownBitsAnalysis>();		AU.addRequired<GISelKnownBitsAnalysis>();
AU.addPreserved<GISelKnownBitsAnalysis>();		AU.addPreserved<GISelKnownBitsAnalysis>();
if (!IsOptNone) {		if (!IsOptNone) {
AU.addRequired<MachineDominatorTree>();		AU.addRequired<MachineDominatorTree>();
AU.addPreserved<MachineDominatorTree>();		AU.addPreserved<MachineDominatorTree>();
}		}
		AU.addRequired<GISelCSEAnalysisWrapperPass>();
		AU.addPreserved<GISelCSEAnalysisWrapperPass>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

AArch64PostLegalizerCombiner::AArch64PostLegalizerCombiner(bool IsOptNone)		AArch64PostLegalizerCombiner::AArch64PostLegalizerCombiner(bool IsOptNone)
: MachineFunctionPass(ID), IsOptNone(IsOptNone) {		: MachineFunctionPass(ID), IsOptNone(IsOptNone) {
initializeAArch64PostLegalizerCombinerPass(*PassRegistry::getPassRegistry());		initializeAArch64PostLegalizerCombinerPass(*PassRegistry::getPassRegistry());
}		}

bool AArch64PostLegalizerCombiner::runOnMachineFunction(MachineFunction &MF) {		bool AArch64PostLegalizerCombiner::runOnMachineFunction(MachineFunction &MF) {
if (MF.getProperties().hasProperty(		if (MF.getProperties().hasProperty(
MachineFunctionProperties::Property::FailedISel))		MachineFunctionProperties::Property::FailedISel))
return false;		return false;
assert(MF.getProperties().hasProperty(		assert(MF.getProperties().hasProperty(
MachineFunctionProperties::Property::Legalized) &&		MachineFunctionProperties::Property::Legalized) &&
"Expected a legalized function?");		"Expected a legalized function?");
auto *TPC = &getAnalysis<TargetPassConfig>();		auto &TPC = getAnalysis<TargetPassConfig>();

		// Enable CSE and inst numbering.
		GISelCSEAnalysisWrapper &Wrapper =
		getAnalysis<GISelCSEAnalysisWrapperPass>().getCSEWrapper();
		auto *CSEInfo = &Wrapper.get(TPC.getCSEConfig());
		CSEInfo->getInstNumbering().enable(&MF);

const Function &F = MF.getFunction();		const Function &F = MF.getFunction();
bool EnableOpt =		bool EnableOpt =
MF.getTarget().getOptLevel() != CodeGenOpt::None && !skipFunction(F);		MF.getTarget().getOptLevel() != CodeGenOpt::None && !skipFunction(F);
GISelKnownBits *KB = &getAnalysis<GISelKnownBitsAnalysis>().get(MF);		GISelKnownBits *KB = &getAnalysis<GISelKnownBitsAnalysis>().get(MF);
MachineDominatorTree *MDT =		MachineDominatorTree *MDT =
IsOptNone ? nullptr : &getAnalysis<MachineDominatorTree>();		IsOptNone ? nullptr : &getAnalysis<MachineDominatorTree>();
AArch64PostLegalizerCombinerInfo PCInfo(EnableOpt, F.hasOptSize(),		AArch64PostLegalizerCombinerInfo PCInfo(EnableOpt, F.hasOptSize(),
F.hasMinSize(), KB, MDT);		F.hasMinSize(), KB, MDT);
Combiner C(PCInfo, TPC);		Combiner C(PCInfo, &TPC);
return C.combineMachineInstrs(MF, /*CSEInfo*/ nullptr);		return C.combineMachineInstrs(MF, CSEInfo);
}		}

char AArch64PostLegalizerCombiner::ID = 0;		char AArch64PostLegalizerCombiner::ID = 0;
INITIALIZE_PASS_BEGIN(AArch64PostLegalizerCombiner, DEBUG_TYPE,		INITIALIZE_PASS_BEGIN(AArch64PostLegalizerCombiner, DEBUG_TYPE,
"Combine AArch64 MachineInstrs after legalization", false,		"Combine AArch64 MachineInstrs after legalization", false,
false)		false)
INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)		INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)
INITIALIZE_PASS_DEPENDENCY(GISelKnownBitsAnalysis)		INITIALIZE_PASS_DEPENDENCY(GISelKnownBitsAnalysis)
[9 lines not shown]

llvm/lib/Target/AArch64/GISel/AArch64PostLegalizerLowering.cpp

[660 lines not shown]	public:
void getAnalysisUsage(AnalysisUsage &AU) const override;		void getAnalysisUsage(AnalysisUsage &AU) const override;
};		};
} // end anonymous namespace		} // end anonymous namespace

void AArch64PostLegalizerLowering::getAnalysisUsage(AnalysisUsage &AU) const {		void AArch64PostLegalizerLowering::getAnalysisUsage(AnalysisUsage &AU) const {
AU.addRequired<TargetPassConfig>();		AU.addRequired<TargetPassConfig>();
AU.setPreservesCFG();		AU.setPreservesCFG();
getSelectionDAGFallbackAnalysisUsage(AU);		getSelectionDAGFallbackAnalysisUsage(AU);
		AU.addRequired<GISelCSEAnalysisWrapperPass>();
		AU.addPreserved<GISelCSEAnalysisWrapperPass>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

AArch64PostLegalizerLowering::AArch64PostLegalizerLowering()		AArch64PostLegalizerLowering::AArch64PostLegalizerLowering()
: MachineFunctionPass(ID) {		: MachineFunctionPass(ID) {
initializeAArch64PostLegalizerLoweringPass(*PassRegistry::getPassRegistry());		initializeAArch64PostLegalizerLoweringPass(*PassRegistry::getPassRegistry());
}		}

bool AArch64PostLegalizerLowering::runOnMachineFunction(MachineFunction &MF) {		bool AArch64PostLegalizerLowering::runOnMachineFunction(MachineFunction &MF) {
if (MF.getProperties().hasProperty(		if (MF.getProperties().hasProperty(
MachineFunctionProperties::Property::FailedISel))		MachineFunctionProperties::Property::FailedISel))
return false;		return false;
assert(MF.getProperties().hasProperty(		assert(MF.getProperties().hasProperty(
MachineFunctionProperties::Property::Legalized) &&		MachineFunctionProperties::Property::Legalized) &&
"Expected a legalized function?");		"Expected a legalized function?");
auto *TPC = &getAnalysis<TargetPassConfig>();		auto &TPC = getAnalysis<TargetPassConfig>();

		// Enable CSE and inst numbering.
		GISelCSEAnalysisWrapper &Wrapper =
		getAnalysis<GISelCSEAnalysisWrapperPass>().getCSEWrapper();
		auto *CSEInfo = &Wrapper.get(TPC.getCSEConfig());
		CSEInfo->getInstNumbering().enable(&MF);

const Function &F = MF.getFunction();		const Function &F = MF.getFunction();
AArch64PostLegalizerLoweringInfo PCInfo(F.hasOptSize(), F.hasMinSize());		AArch64PostLegalizerLoweringInfo PCInfo(F.hasOptSize(), F.hasMinSize());
Combiner C(PCInfo, TPC);		Combiner C(PCInfo, &TPC);
return C.combineMachineInstrs(MF, /*CSEInfo*/ nullptr);		return C.combineMachineInstrs(MF, CSEInfo);
}		}

char AArch64PostLegalizerLowering::ID = 0;		char AArch64PostLegalizerLowering::ID = 0;
INITIALIZE_PASS_BEGIN(AArch64PostLegalizerLowering, DEBUG_TYPE,		INITIALIZE_PASS_BEGIN(AArch64PostLegalizerLowering, DEBUG_TYPE,
"Lower AArch64 MachineInstrs after legalization", false,		"Lower AArch64 MachineInstrs after legalization", false,
false)		false)
INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)		INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)
INITIALIZE_PASS_END(AArch64PostLegalizerLowering, DEBUG_TYPE,		INITIALIZE_PASS_END(AArch64PostLegalizerLowering, DEBUG_TYPE,
"Lower AArch64 MachineInstrs after legalization", false,		"Lower AArch64 MachineInstrs after legalization", false,
false)		false)

namespace llvm {		namespace llvm {
FunctionPass *createAArch64PostLegalizerLowering() {		FunctionPass *createAArch64PostLegalizerLowering() {
return new AArch64PostLegalizerLowering();		return new AArch64PostLegalizerLowering();
}		}
} // end namespace llvm		} // end namespace llvm

llvm/lib/Target/AArch64/GISel/AArch64PreLegalizerCombiner.cpp

[145 lines not shown]	void AArch64PreLegalizerCombiner::getAnalysisUsage(AnalysisUsage &AU) const {
AU.setPreservesCFG();		AU.setPreservesCFG();
getSelectionDAGFallbackAnalysisUsage(AU);		getSelectionDAGFallbackAnalysisUsage(AU);
AU.addRequired<GISelKnownBitsAnalysis>();		AU.addRequired<GISelKnownBitsAnalysis>();
AU.addPreserved<GISelKnownBitsAnalysis>();		AU.addPreserved<GISelKnownBitsAnalysis>();
if (!IsOptNone) {		if (!IsOptNone) {
AU.addRequired<MachineDominatorTree>();		AU.addRequired<MachineDominatorTree>();
AU.addPreserved<MachineDominatorTree>();		AU.addPreserved<MachineDominatorTree>();
}		}
		AU.addRequired<GISelCSEAnalysisWrapperPass>();
		AU.addPreserved<GISelCSEAnalysisWrapperPass>();
MachineFunctionPass::getAnalysisUsage(AU);		MachineFunctionPass::getAnalysisUsage(AU);
}		}

AArch64PreLegalizerCombiner::AArch64PreLegalizerCombiner(bool IsOptNone)		AArch64PreLegalizerCombiner::AArch64PreLegalizerCombiner(bool IsOptNone)
: MachineFunctionPass(ID), IsOptNone(IsOptNone) {		: MachineFunctionPass(ID), IsOptNone(IsOptNone) {
initializeAArch64PreLegalizerCombinerPass(*PassRegistry::getPassRegistry());		initializeAArch64PreLegalizerCombinerPass(*PassRegistry::getPassRegistry());
}		}

bool AArch64PreLegalizerCombiner::runOnMachineFunction(MachineFunction &MF) {		bool AArch64PreLegalizerCombiner::runOnMachineFunction(MachineFunction &MF) {
if (MF.getProperties().hasProperty(		if (MF.getProperties().hasProperty(
MachineFunctionProperties::Property::FailedISel))		MachineFunctionProperties::Property::FailedISel))
return false;		return false;
auto *TPC = &getAnalysis<TargetPassConfig>();		auto &TPC = getAnalysis<TargetPassConfig>();

		// Enable CSE and inst numbering.
		GISelCSEAnalysisWrapper &Wrapper =
		getAnalysis<GISelCSEAnalysisWrapperPass>().getCSEWrapper();
		auto *CSEInfo = &Wrapper.get(TPC.getCSEConfig());
		CSEInfo->getInstNumbering().enable(&MF);

const Function &F = MF.getFunction();		const Function &F = MF.getFunction();
bool EnableOpt =		bool EnableOpt =
MF.getTarget().getOptLevel() != CodeGenOpt::None && !skipFunction(F);		MF.getTarget().getOptLevel() != CodeGenOpt::None && !skipFunction(F);
GISelKnownBits *KB = &getAnalysis<GISelKnownBitsAnalysis>().get(MF);		GISelKnownBits *KB = &getAnalysis<GISelKnownBitsAnalysis>().get(MF);
MachineDominatorTree *MDT =		MachineDominatorTree *MDT =
IsOptNone ? nullptr : &getAnalysis<MachineDominatorTree>();		IsOptNone ? nullptr : &getAnalysis<MachineDominatorTree>();
AArch64PreLegalizerCombinerInfo PCInfo(EnableOpt, F.hasOptSize(),		AArch64PreLegalizerCombinerInfo PCInfo(EnableOpt, F.hasOptSize(),
F.hasMinSize(), KB, MDT);		F.hasMinSize(), KB, MDT);
Combiner C(PCInfo, TPC);		Combiner C(PCInfo, &TPC);
return C.combineMachineInstrs(MF, /*CSEInfo*/ nullptr);		return C.combineMachineInstrs(MF, CSEInfo);
}		}

char AArch64PreLegalizerCombiner::ID = 0;		char AArch64PreLegalizerCombiner::ID = 0;
INITIALIZE_PASS_BEGIN(AArch64PreLegalizerCombiner, DEBUG_TYPE,		INITIALIZE_PASS_BEGIN(AArch64PreLegalizerCombiner, DEBUG_TYPE,
"Combine AArch64 machine instrs before legalization",		"Combine AArch64 machine instrs before legalization",
false, false)		false, false)
INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)		INITIALIZE_PASS_DEPENDENCY(TargetPassConfig)
INITIALIZE_PASS_DEPENDENCY(GISelKnownBitsAnalysis)		INITIALIZE_PASS_DEPENDENCY(GISelKnownBitsAnalysis)
		INITIALIZE_PASS_DEPENDENCY(GISelCSEAnalysisWrapperPass)
INITIALIZE_PASS_END(AArch64PreLegalizerCombiner, DEBUG_TYPE,		INITIALIZE_PASS_END(AArch64PreLegalizerCombiner, DEBUG_TYPE,
"Combine AArch64 machine instrs before legalization", false,		"Combine AArch64 machine instrs before legalization", false,
false)		false)


namespace llvm {		namespace llvm {
FunctionPass *createAArch64PreLegalizerCombiner(bool IsOptNone) {		FunctionPass *createAArch64PreLegalizerCombiner(bool IsOptNone) {
return new AArch64PreLegalizerCombiner(IsOptNone);		return new AArch64PreLegalizerCombiner(IsOptNone);
}		}
} // end namespace llvm		} // end namespace llvm

llvm/test/CodeGen/AArch64/GlobalISel/combine-shift-immed-mismatch-crash.mir

	# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
	# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s			# RUN: llc -run-pass=aarch64-prelegalizer-combiner -verify-machineinstrs -mtriple aarch64-unknown-unknown %s -o - | FileCheck %s
	---			---
	name: shift_immed_chain_mismatch_size_crash			name: shift_immed_chain_mismatch_size_crash
	alignment: 4			alignment: 4
	tracksRegLiveness: true			tracksRegLiveness: true
	liveins:			liveins:
	- { reg: '$x0' }			- { reg: '$x0' }
	body: |			body: |
	; CHECK-LABEL: name: shift_immed_chain_mismatch_size_crash			; CHECK-LABEL: name: shift_immed_chain_mismatch_size_crash
	; CHECK: bb.0:			; CHECK: bb.0:
	; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)			; CHECK: successors: %bb.1(0x40000000), %bb.2(0x40000000)
	; CHECK: liveins: $x0			; CHECK: liveins: $x0
	; CHECK: [[DEF:%[0-9]+]]:_(p0) = G_IMPLICIT_DEF			; CHECK: [[DEF:%[0-9]+]]:_(s1) = G_IMPLICIT_DEF
				; CHECK: [[DEF1:%[0-9]+]]:_(p0) = G_IMPLICIT_DEF
	; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16			; CHECK: [[C:%[0-9]+]]:_(s32) = G_CONSTANT i32 16
	; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 9			; CHECK: [[C1:%[0-9]+]]:_(s32) = G_CONSTANT i32 9
	; CHECK: [[DEF1:%[0-9]+]]:_(s1) = G_IMPLICIT_DEF			; CHECK: G_BRCOND [[DEF]](s1), %bb.2
	; CHECK: G_BRCOND [[DEF1]](s1), %bb.2
	; CHECK: G_BR %bb.1			; CHECK: G_BR %bb.1
	; CHECK: bb.1:			; CHECK: bb.1:
	; CHECK: successors:			; CHECK: successors:
	; CHECK: bb.2:			; CHECK: bb.2:
	; CHECK: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[DEF]](p0) :: (load 4 from `i32* undef`, align 8)			; CHECK: [[LOAD:%[0-9]+]]:_(s32) = G_LOAD [[DEF1]](p0) :: (load 4 from `i32* undef`, align 8)
	; CHECK: [[MUL:%[0-9]+]]:_(s32) = nsw G_MUL [[C]], [[LOAD]]			; CHECK: [[MUL:%[0-9]+]]:_(s32) = nsw G_MUL [[C]], [[LOAD]]
	; CHECK: [[MUL1:%[0-9]+]]:_(s32) = nsw G_MUL [[MUL]], [[C1]]			; CHECK: [[MUL1:%[0-9]+]]:_(s32) = nsw G_MUL [[MUL]], [[C1]]
	; CHECK: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 2			; CHECK: [[C2:%[0-9]+]]:_(s64) = G_CONSTANT i64 2
	; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[MUL1]], [[C2]](s64)			; CHECK: [[SHL:%[0-9]+]]:_(s32) = G_SHL [[MUL1]], [[C2]](s64)
	; CHECK: $w0 = COPY [[SHL]](s32)			; CHECK: $w0 = COPY [[SHL]](s32)
	; CHECK: RET_ReallyLR implicit $w0			; CHECK: RET_ReallyLR implicit $w0
	bb.1:			bb.1:
	liveins: $x0			liveins: $x0
	[27 lines not shown]

llvm/test/CodeGen/AArch64/GlobalISel/gisel-commandline-option.ll

	[50 lines not shown]
	; RUN: --debugify-and-strip-all-safe=0 \			; RUN: --debugify-and-strip-all-safe=0 \
	; RUN: -debug-pass=Structure %s -o /dev/null 2>&1 -verify-machineinstrs=0 \			; RUN: -debug-pass=Structure %s -o /dev/null 2>&1 -verify-machineinstrs=0 \
	; RUN: | FileCheck %s --check-prefix DISABLED			; RUN: | FileCheck %s --check-prefix DISABLED

	; ENABLED: IRTranslator			; ENABLED: IRTranslator
	; VERIFY-NEXT: Verify generated machine code			; VERIFY-NEXT: Verify generated machine code
	; ENABLED-NEXT: Analysis for ComputingKnownBits			; ENABLED-NEXT: Analysis for ComputingKnownBits
	; ENABLED-O1-NEXT: MachineDominator Tree Construction			; ENABLED-O1-NEXT: MachineDominator Tree Construction
				; ENABLED-NEXT: Analysis containing CSE Info
	; ENABLED-NEXT: PreLegalizerCombiner			; ENABLED-NEXT: PreLegalizerCombiner
	; VERIFY-NEXT: Verify generated machine code			; VERIFY-NEXT: Verify generated machine code
	; ENABLED-NEXT: Analysis containing CSE Info
	; ENABLED-NEXT: Legalizer			; ENABLED-NEXT: Legalizer
	; VERIFY-NEXT: Verify generated machine code			; VERIFY-NEXT: Verify generated machine code
	; ENABLED: RegBankSelect			; ENABLED: RegBankSelect
	; VERIFY-NEXT: Verify generated machine code			; VERIFY-NEXT: Verify generated machine code
	; ENABLED-NEXT: Localizer			; ENABLED-NEXT: Localizer
	; VERIFY-O0-NEXT: Verify generated machine code			; VERIFY-O0-NEXT: Verify generated machine code
	; ENABLED-NEXT: Analysis for ComputingKnownBits			; ENABLED-NEXT: Analysis for ComputingKnownBits
	; ENABLED-NEXT: InstructionSelect			; ENABLED-NEXT: InstructionSelect
	[15 lines not shown]

llvm/test/CodeGen/AArch64/O0-pipeline.ll

	[28 lines not shown]
	; CHECK-NEXT: FunctionPass Manager			; CHECK-NEXT: FunctionPass Manager
	; CHECK-NEXT: Exception handling preparation			; CHECK-NEXT: Exception handling preparation
	; CHECK-NEXT: Safe Stack instrumentation pass			; CHECK-NEXT: Safe Stack instrumentation pass
	; CHECK-NEXT: Insert stack protectors			; CHECK-NEXT: Insert stack protectors
	; CHECK-NEXT: Module Verifier			; CHECK-NEXT: Module Verifier
	; CHECK-NEXT: Analysis containing CSE Info			; CHECK-NEXT: Analysis containing CSE Info
	; CHECK-NEXT: IRTranslator			; CHECK-NEXT: IRTranslator
	; CHECK-NEXT: Analysis for ComputingKnownBits			; CHECK-NEXT: Analysis for ComputingKnownBits
	; CHECK-NEXT: AArch64PreLegalizerCombiner
	; CHECK-NEXT: Analysis containing CSE Info			; CHECK-NEXT: Analysis containing CSE Info
				; CHECK-NEXT: AArch64PreLegalizerCombiner
	; CHECK-NEXT: Legalizer			; CHECK-NEXT: Legalizer
	; CHECK-NEXT: AArch64PostLegalizerLowering			; CHECK-NEXT: AArch64PostLegalizerLowering
	; CHECK-NEXT: RegBankSelect			; CHECK-NEXT: RegBankSelect
	; CHECK-NEXT: Localizer			; CHECK-NEXT: Localizer
	; CHECK-NEXT: Analysis for ComputingKnownBits			; CHECK-NEXT: Analysis for ComputingKnownBits
	; CHECK-NEXT: InstructionSelect			; CHECK-NEXT: InstructionSelect
	; CHECK-NEXT: ResetMachineFunction			; CHECK-NEXT: ResetMachineFunction
	; CHECK-NEXT: AArch64 Instruction Selection			; CHECK-NEXT: AArch64 Instruction Selection
	[32 lines not shown]

Revision Contents

Diff 316428
