This is an archive of the discontinued LLVM Phabricator instance.

Paths

Table of Contentst

-
llvm/
-
lib/CodeGen/
-
CodeGen/
1/2
MIRCanonicalizerPass.cpp
1/2
MIRNamerPass.cpp
3/6
MIRVRegNamerUtils.h
6/14
MIRVRegNamerUtils.cpp
-
test/CodeGen/MIR/
-
CodeGen/
-
MIR/
-
AArch64/
-
mirCanonCopyCopyProp.mir
-
mirCanonIdempotent.mir
-
mirnamer.mir
-
AMDGPU/
-
mir-canon-multi.mir

Differential D70210

[MirNamer][Canonicalizer]: Perform instruction semantic based renaming
ClosedPublic

Authored by aditya_nandakumar on Nov 13 2019, 2:35 PM.

Download Raw Diff

Details

Reviewers

bogner
plotfi
volkan

Summary

Previously:

Due to sensitivity of the algorithm with gaps, and extra instructions, when diffing, often we see naming being off by a few. Makes the diff unreadable even for tests with 7 and 8 instructions respectively.
Naming can change depending on candidates (and order of picking candidates). Suddenly if there's one extra instruction somewhere, the entire subtree would be named completely differently.
No consistent naming of similar instructions which occur in different functions. If we try to do something like count the frequency distribution of various differences across suite, then the above sensitivity issues are going to result in poor results.

Instead:

Name instruction based on semantics of the instruction (hash of the opcode and operands). Essentially for a given instruction that occurs in any module/function it'll be named similarly (ie semantic). This has some nice properties
- Can easily look at many instructions and just check the hash and if they're named similarly, then it's the same instruction. Makes it very easy to spot the same instruction both multiple times, as well as across many functions (useful for frequency distribution).
- Independent of traversal/candidates/depth of graph. No need to keep track of last index/gaps/skip count etc.
- No off by few issues with diffs. I've tried the old vs new implementation in files ranging from 30 to 700 instructions. In both cases with the old algorithm, diffs are a sea of red, where as for the semantic version, in both cases, the diffs line up beautifully.
- Simplified implementation of the main loop (simple iteration) , no keep track of what's visited and not.
Handle collision just by incrementing a counter. Roughly bb[N]_hash_[CollisionCount].

Additionally with the new implementation, we can probably avoid doing the hoisting of instructions to various places, as they'll likely be named the same resulting in differences only based on collision (ie regardless of whether the instruction is hoisted or not/close to use or not, it'll be named the same hash which should result in use of the instruction be identical with the only change being the collision count) which is very easy to spot visually.

Diff Detail

Event Timeline

aditya_nandakumar created this revision.Nov 13 2019, 2:35 PM

Herald added a project: Restricted Project. · View Herald TranscriptNov 13 2019, 2:35 PM

Herald added subscribers: hiraditya, nhaehnle, jvesely. · View Herald Transcript

aditya_nandakumar added a reviewer: volkan.Nov 13 2019, 2:36 PM

aditya_nandakumar added a subscriber: volkan.

aditya_nandakumar removed a subscriber: volkan.

Initial comments posted. Nice work, especially on the operand/vreg-def-opcode hashing idea (which localizes lot of the difference better that before). More code comments and perhaps some ports of downstream test cases would be greatly appreciated. I think we should work together to cleanup or remove some of this stuff through a couple NFC commits so that the diff is a little easier to follow and mostly consists of what was added (with the stuff removed being removed in a separate commit).

llvm/lib/CodeGen/MIRCanonicalizerPass.cpp
441	Is there anything specifically that is needed from MIRCanonicalizerPass Given this deletion here? Can you use MIRNamerPass as your base instead and do this deletion in a later NFC commit if necessary?
llvm/lib/CodeGen/MIRVRegNamerUtils.cpp
45	Might want to consider a StringMap
71–82	Comments please.
81	I think the candidate walk code can be left alone here, and removed in an NFC if really necessary.
88	llvm_unreachable please.
106	I like this linear approach but I might like to keep the tree based approach as well as a toggle until we can add more tests. In the tree based approach I was trying to do the canonicalization based on the chain of operations that flow into a side effect, where here the side effects are renaming barriers? On second thought, I really like the hashing approach on the VReg-Def opcode and if you are confident it wont result in too may cases where a difference that should have remained is lost, I'd be fine with replacing all of this walking business. Comments (and perhaps some brief MIR snippets) on how this renaming mechanism works would be really nice to have as well. From what I understand, you have many test cases downstream. Can these be ported to aarch64 to bolster the testing upstream? Even the tests with 7 and 8 instructions can be useful, and I'd assume shouldn't be too difficult to port to a supported downstream target? Does this sound reasonable to you @aditya_nandakumar @bogner ??
llvm/lib/CodeGen/MIRVRegNamerUtils.h
37	Did this need to me moved from MIRVRegNamerUtils.cpp to MIRVRegNamerUtils.h? Can he be done in a NFC commit? On top of that, this appears to be a copy paste from https://reviews.llvm.org/D70029. Would you like to work with me on massaging these changes along D70029 into place here?
41	Another copy paste from D70029. CurrentBBNumber is never incremented or really used for anything of import here. Why is it included in this diff?
77–89	Comment please. It wait for D70029 to solidify and land.

plotfi requested changes to this revision.Nov 13 2019, 10:09 PM

This revision now requires changes to proceed.Nov 13 2019, 10:09 PM

plotfi added inline comments.Nov 13 2019, 10:23 PM

llvm/lib/CodeGen/MIRVRegNamerUtils.cpp
81	How do you guarantee that two different vregs don't resolve to the same "HashOperand" value? Because can't the opcode of two different def instructions happen to be the same? Is it just highly unlikely because you are also hashing the operands of the entire instruction together?

aditya_nandakumar marked 10 inline comments as done.Nov 14 2019, 6:08 PM

aditya_nandakumar added inline comments.

llvm/lib/CodeGen/MIRCanonicalizerPass.cpp
441	The change affects both. This also creates a bunch of new vreg names and changes the names with no benefit - the names have been already created with many useful properties and this undoes that.
llvm/lib/CodeGen/MIRVRegNamerUtils.cpp
45	Mostly followed the style which is using std containers in the rest of this file.
71–82	Comments in the header.
81	Consider this sequence. On one file you have %0 = COPY $x0 %1 = LDIMM 25 %2 = ADD %0, %1 %3 = SUB %2, 42 On the other side you have %0 = COPY $x1 %1 = LDIMM 26 %2 = ADD %0, %1 %3 = SUB %2, 42 Clearly the add is identical on both sides and is not that interesting - essentially it's an add from a LDIMM and a COPY. It's possible that those values might be different, but as far as the ADD is concerned, it's sufficient to say that for both sides ADD is doing something similar and is not the source of a diff. Add's VReg name will show identical on both cases, but sources will appear different (due to different names). However each of %0 and %1 would differ in how they are named and will show up as the diff. Sub will finally look identical. In general we want to capture the diff in the very first places they happen and not beyond that. After renaming LHS would be, %hash1 = COPY $x0 %hash2 = LDIMM 25 %hash3 = ADD %hash1, %hash2 %hash7 = SUB %hash3, 0 RHS would be %hash4 = COPY $x1 %hash5 = LDIMM 26 %hash3 = ADD %hash4, %hash5 %hash7 = SUB %hash3, 0 Now when you diff those, the ADD is not the one that's different, but the first two instructions are. So they will show up nicely in the diff, but the ADD will appear identical with respect to destination and opcode, but will differ from sources. The SUB in both will be identical and will disappear from the context of the diff as it's not interesting. Now even if you have additional instruction somewhere in the instruction stream in either the LHS or the RHS side, the naming of this won't change and will point out the differences nicely.
81	I removed this as there will be no more users of this method.
88	That unfortunately won't work as we are not explicitly handling all variants of MachineOperand Kinds here. There are some challenges that can be solved (bb naming scheme) when dealing with PHIS ie how to hash them without using the name . There maybe other cases where we want to differentiate special operands. Right now we uniquely differentiate immediate and regs, but we can't really assert on any of the others.
106	In general hash collisions are resolved similarly on both sides of a diff and the hash collision renaming will also happen similarly on both sides of the diff (just incrementing numbers). This will line up really well for diffs. Additionally with the hashing method, just when you're staring at two equivalent pieces of code, just by looking at the reg name that is a hash, you can just assume that they're likely equivalent instructions and move your focus elsewhere. In general, due to the disadvantages of the previous algorithm and the advantages of this approach, there should be no need to keep both approaches. Regarding tests, there's sufficient coverage. The core algorithm is simple - hash instruction oeprands and rename based on the hash(which capture the semantics of the instruction). It's evident that this works correctly (look at MIR/AArch64/mirCanonIdempotent.mir included in this patch where two of the same MOV instructions are renamed correctly) # CHECK-NEXT: %bb0_42274__1:gpr32 = MOVi32imm 408 # CHECK-NEXT: %bb0_42274__2:gpr32 = MOVi32imm 408 The only thing adding more tests would help with is we tie that up some diffing tool and make sure that the core strategy still works. Otherwise, it will be quite identical to the ones we already have here. I'll still try to come up with some screenshots of the two approaches and attach it to the review.
llvm/lib/CodeGen/MIRVRegNamerUtils.h
37	I initially used that as the base but forgot to remove them as it's not needed due to the new algorithm. As this abstraction is not really needed any more, I'll go ahead and remove this. For the new algorithm the only thing we need is RSE_Reg and if we have just one variant, there's no need for an enum.
41	It's used for the renaming of the regs - Each instruction is named as "bb<BlockNum>_hash_<collision_counter>". The BlockNum is obtained by just saying getCurrentBlockNumber(). It's incremented in the top level renameRegs for each block that you visit.
77–89	There's not much that's taken from D70029 besides this method name and the CurrentBBNumber. Because of the differences in how instructions are named and the algorithm changes, there's no need to wait for that patch. In fact, I just realized that I can simplify this a little more by removing some of the abstractions that we don't need any more. I missed the comment on this one. Thanks for catching it.

LGTM once Puyan's feedback is addressed

llvm/lib/CodeGen/MIRVRegNamerUtils.cpp
84–88	As far as I understand this can't really be replaced with an assert as is, since things like basic block ids are being deliberately dropped here. Can you explicitly handle the cases that we really want to drop from the hash or at least update this comment to explain? I realize it might make sense to always return something, since the only effect is more collisions, but the comment reads as if you think it's an error to get here.
106	This is still doing the traversal in a sense, just a bit more implicitly while we calculate the instruction hash. The nice thing here is that the hash is stable enough that it doesn't matter where we start from, so we can just walk the instructions linearly and it ends up looking a bit simpler. We don't really have many interestingly distinct test cases downstream - the test cases below cover the functional testing pretty well. What we do have is some experience using this on large functions that were compiled with and without GlobalISel, which have shown that this hashing approach helps quite a bit.

Addressed some feedback, removed some abstractions that are unused, added more comments.

s/Token/Name

Diff looks cleaner. Since you guys are the primary clients, LGTM.

This revision is now accepted and ready to land.Nov 14 2019, 11:51 PM

727686855

plotfi added inline comments.Nov 19 2019, 11:59 PM

llvm/lib/CodeGen/MIRNamerPass.cpp
61	Where is the BB# being set here? All the BB#s will be 0 as far as I can tell.

plotfi mentioned this in D70478: [MIRVRegNamerUtils] Fix for BasicBlock numbering parameter..Nov 20 2019, 12:17 AM

aditya_nandakumar marked an inline comment as done.Nov 20 2019, 12:17 AM

aditya_nandakumar added inline comments.

llvm/lib/CodeGen/MIRNamerPass.cpp
61	Yup - you're right. While trying to refactor I removed a `CurrentBBNo = BBNo++` or something equivalent that I had in my tree. Good catch. Thanks. I'll have a patch ready for this part soon.

plotfi mentioned this in rGfdc6f4b97b0e: [llvm] Fixing MIRVRegNamerUtils to properly handle 2+ MachineBasicBlocks..Dec 4 2019, 3:37 PM

Revision Contents

Path

Size

llvm/

lib/

CodeGen/

MIRCanonicalizerPass.cpp

34 lines

MIRNamerPass.cpp

4 lines

MIRVRegNamerUtils.h

93 lines

MIRVRegNamerUtils.cpp

381 lines

test/

CodeGen/

MIR/

AArch64/

mirCanonCopyCopyProp.mir

4 lines

mirCanonIdempotent.mir

12 lines

mirnamer.mir

48 lines

AMDGPU/

mir-canon-multi.mir

24 lines

Diff 229438

llvm/lib/CodeGen/MIRCanonicalizerPass.cpp

Show First 20 Lines • Show All 368 Lines • ▼ Show 20 Lines	for (auto &MI : *MBB) {
}		}
}		}

return Changed;		return Changed;
}		}

static bool runOnBasicBlock(MachineBasicBlock *MBB,		static bool runOnBasicBlock(MachineBasicBlock *MBB,
std::vector<StringRef> &bbNames,		std::vector<StringRef> &bbNames,
unsigned &basicBlockNum, NamedVRegCursor &NVC) {		unsigned &basicBlockNum, VRegRenamer &Renamer) {

if (CanonicalizeBasicBlockNumber != ~0U) {		if (CanonicalizeBasicBlockNumber != ~0U) {
if (CanonicalizeBasicBlockNumber != basicBlockNum++)		if (CanonicalizeBasicBlockNumber != basicBlockNum++)
return false;		return false;
LLVM_DEBUG(dbgs() << "\n Canonicalizing BasicBlock " << MBB->getName()		LLVM_DEBUG(dbgs() << "\n Canonicalizing BasicBlock " << MBB->getName()
<< "\n";);		<< "\n";);
}		}

if (llvm::find(bbNames, MBB->getName()) != bbNames.end()) {		if (llvm::find(bbNames, MBB->getName()) != bbNames.end()) {
LLVM_DEBUG({		LLVM_DEBUG({
dbgs() << "Found potentially duplicate BasicBlocks: " << MBB->getName()		dbgs() << "Found potentially duplicate BasicBlocks: " << MBB->getName()
<< "\n";		<< "\n";
});		});
return false;		return false;
}		}

LLVM_DEBUG({		LLVM_DEBUG({
dbgs() << "\n\n NEW BASIC BLOCK: " << MBB->getName() << " \n\n";		dbgs() << "\n\n NEW BASIC BLOCK: " << MBB->getName() << " \n\n";
dbgs() << "\n\n================================================\n\n";		dbgs() << "\n\n================================================\n\n";
});		});

bool Changed = false;		bool Changed = false;
MachineFunction &MF = *MBB->getParent();
MachineRegisterInfo &MRI = MF.getRegInfo();

bbNames.push_back(MBB->getName());		bbNames.push_back(MBB->getName());
LLVM_DEBUG(dbgs() << "\n\n NEW BASIC BLOCK: " << MBB->getName() << "\n\n";);		LLVM_DEBUG(dbgs() << "\n\n NEW BASIC BLOCK: " << MBB->getName() << "\n\n";);

LLVM_DEBUG(dbgs() << "MBB Before Canonical Copy Propagation:\n";		LLVM_DEBUG(dbgs() << "MBB Before Canonical Copy Propagation:\n";
MBB->dump(););		MBB->dump(););
Changed \|= propagateLocalCopies(MBB);		Changed \|= propagateLocalCopies(MBB);
LLVM_DEBUG(dbgs() << "MBB After Canonical Copy Propagation:\n"; MBB->dump(););		LLVM_DEBUG(dbgs() << "MBB After Canonical Copy Propagation:\n"; MBB->dump(););

LLVM_DEBUG(dbgs() << "MBB Before Scheduling:\n"; MBB->dump(););		LLVM_DEBUG(dbgs() << "MBB Before Scheduling:\n"; MBB->dump(););
unsigned IdempotentInstCount = 0;		unsigned IdempotentInstCount = 0;
Changed \|= rescheduleCanonically(IdempotentInstCount, MBB);		Changed \|= rescheduleCanonically(IdempotentInstCount, MBB);
LLVM_DEBUG(dbgs() << "MBB After Scheduling:\n"; MBB->dump(););		LLVM_DEBUG(dbgs() << "MBB After Scheduling:\n"; MBB->dump(););

Changed \|= NVC.renameVRegs(MBB);		Changed \|= Renamer.renameVRegs(MBB);

// Here we renumber the def vregs for the idempotent instructions from the top
// of the MachineBasicBlock so that they are named in the order that we sorted
// them alphabetically. Eventually we wont need SkipVRegs because we will use
// named vregs instead.
if (IdempotentInstCount)
NVC.skipVRegs();

auto MII = MBB->begin();
for (unsigned i = 0; i < IdempotentInstCount && MII != MBB->end(); ++i) {
MachineInstr &MI = *MII++;
Changed = true;
Register vRegToRename = MI.getOperand(0).getReg();
auto Rename = NVC.createVirtualRegister(vRegToRename);

std::vector<MachineOperand *> RenameMOs;
for (auto &MO : MRI.reg_operands(vRegToRename)) {
RenameMOs.push_back(&MO);
}

for (auto *MO : RenameMOs) {
MO->setReg(Rename);
}
}

plotfiUnsubmitted Not Done Reply Inline Actions Is there anything specifically that is needed from MIRCanonicalizerPass Given this deletion here? Can you use MIRNamerPass as your base instead and do this deletion in a later NFC commit if necessary? plotfi: Is there anything specifically that is needed from MIRCanonicalizerPass Given this deletion…
aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions The change affects both. This also creates a bunch of new vreg names and changes the names with no benefit - the names have been already created with many useful properties and this undoes that. aditya_nandakumar: The change affects both. This also creates a bunch of new vreg names and changes the names with…
Changed \|= doDefKillClear(MBB);		Changed \|= doDefKillClear(MBB);

LLVM_DEBUG(dbgs() << "Updated MachineBasicBlock:\n"; MBB->dump();		LLVM_DEBUG(dbgs() << "Updated MachineBasicBlock:\n"; MBB->dump();
dbgs() << "\n";);		dbgs() << "\n";);
LLVM_DEBUG(		LLVM_DEBUG(
dbgs() << "\n\n================================================\n\n");		dbgs() << "\n\n================================================\n\n");
return Changed;		return Changed;
}		}
Show All 22 Lines	bool MIRCanonicalizer::runOnMachineFunction(MachineFunction &MF) {

std::vector<StringRef> BBNames;		std::vector<StringRef> BBNames;

unsigned BBNum = 0;		unsigned BBNum = 0;

bool Changed = false;		bool Changed = false;

MachineRegisterInfo &MRI = MF.getRegInfo();		MachineRegisterInfo &MRI = MF.getRegInfo();
NamedVRegCursor NVC(MRI);		VRegRenamer Renamer(MRI);
for (auto MBB : RPOList)		for (auto MBB : RPOList)
Changed \|= runOnBasicBlock(MBB, BBNames, BBNum, NVC);		Changed \|= runOnBasicBlock(MBB, BBNames, BBNum, Renamer);

return Changed;		return Changed;
}		}

llvm/lib/CodeGen/MIRNamerPass.cpp

Show First 20 Lines • Show All 48 Lines • ▼ Show 20 Lines	public:
}		}

bool runOnMachineFunction(MachineFunction &MF) override {		bool runOnMachineFunction(MachineFunction &MF) override {
bool Changed = false;		bool Changed = false;

if (MF.empty())		if (MF.empty())
return Changed;		return Changed;

NamedVRegCursor NVC(MF.getRegInfo());		VRegRenamer Renamer(MF.getRegInfo());

ReversePostOrderTraversal<MachineBasicBlock > RPOT(&MF.begin());		ReversePostOrderTraversal<MachineBasicBlock > RPOT(&MF.begin());
for (auto &MBB : RPOT)		for (auto &MBB : RPOT)
Changed \|= NVC.renameVRegs(MBB);		Changed \|= Renamer.renameVRegs(MBB);
		plotfiUnsubmitted Not Done Reply Inline Actions Where is the BB# being set here? All the BB#s will be 0 as far as I can tell. plotfi: Where is the BB# being set here? All the BB#s will be 0 as far as I can tell.
		aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions Yup - you're right. While trying to refactor I removed a `CurrentBBNo = BBNo++` or something equivalent that I had in my tree. Good catch. Thanks. I'll have a patch ready for this part soon. aditya_nandakumar: Yup - you're right. While trying to refactor I removed a `CurrentBBNo = BBNo++` or something…

return Changed;		return Changed;
}		}
};		};

} // end anonymous namespace		} // end anonymous namespace

char MIRNamer::ID;		char MIRNamer::ID;

char &llvm::MIRNamerID = MIRNamer::ID;		char &llvm::MIRNamerID = MIRNamer::ID;

INITIALIZE_PASS_BEGIN(MIRNamer, "mir-namer", "Rename Register Operands", false,		INITIALIZE_PASS_BEGIN(MIRNamer, "mir-namer", "Rename Register Operands", false,
false)		false)

INITIALIZE_PASS_END(MIRNamer, "mir-namer", "Rename Register Operands", false,		INITIALIZE_PASS_END(MIRNamer, "mir-namer", "Rename Register Operands", false,
false)		false)

llvm/lib/CodeGen/MIRVRegNamerUtils.h

	Show All 19 Lines
	#include "llvm/ADT/PostOrderIterator.h"			#include "llvm/ADT/PostOrderIterator.h"
	#include "llvm/ADT/STLExtras.h"			#include "llvm/ADT/STLExtras.h"
	#include "llvm/CodeGen/MachineFunctionPass.h"			#include "llvm/CodeGen/MachineFunctionPass.h"
	#include "llvm/CodeGen/MachineInstrBuilder.h"			#include "llvm/CodeGen/MachineInstrBuilder.h"
	#include "llvm/CodeGen/MachineRegisterInfo.h"			#include "llvm/CodeGen/MachineRegisterInfo.h"
	#include "llvm/CodeGen/Passes.h"			#include "llvm/CodeGen/Passes.h"
	#include "llvm/Support/raw_ostream.h"			#include "llvm/Support/raw_ostream.h"

	#include <queue>

	namespace llvm {			namespace llvm {
				/// VRegRenamer - This class is used for renaming vregs in a machine basic
				/// block according to semantics of the instruction.
				class VRegRenamer {
				class NamedVReg {
				Register Reg;
				std::string Name;

				public:
				plotfiUnsubmitted Not Done Reply Inline Actions Did this need to me moved from MIRVRegNamerUtils.cpp to MIRVRegNamerUtils.h? Can he be done in a NFC commit? On top of that, this appears to be a copy paste from https://reviews.llvm.org/D70029. Would you like to work with me on massaging these changes along D70029 into place here? plotfi: Did this need to me moved from MIRVRegNamerUtils.cpp to MIRVRegNamerUtils.h? Can he be done in…
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions I initially used that as the base but forgot to remove them as it's not needed due to the new algorithm. As this abstraction is not really needed any more, I'll go ahead and remove this. For the new algorithm the only thing we need is RSE_Reg and if we have just one variant, there's no need for an enum. aditya_nandakumar: I initially used that as the base but forgot to remove them as it's not needed due to the new…
				NamedVReg(Register Reg, std::string Name = "") : Reg(Reg), Name(Name) {}
				NamedVReg(std::string Name = "") : Reg(~0U), Name(Name) {}

				const std::string &getName() const { return Name; }
				plotfiUnsubmitted Not Done Reply Inline Actions Another copy paste from D70029. CurrentBBNumber is never incremented or really used for anything of import here. Why is it included in this diff? plotfi: Another copy paste from D70029. CurrentBBNumber is never incremented or really used for…
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions It's used for the renaming of the regs - Each instruction is named as "bb<BlockNum>_hash_<collision_counter>". The BlockNum is obtained by just saying getCurrentBlockNumber(). It's incremented in the top level renameRegs for each block that you visit. aditya_nandakumar: It's used for the renaming of the regs - Each instruction is named as…

				Register getReg() const { return Reg; }
				};

	/// NamedVRegCursor - The cursor is an object that keeps track of what the next
	/// vreg name should be. It does book keeping to determine when to skip the
	/// index value and by how much, or if the next vreg name should be an increment
	/// from the previous.
	class NamedVRegCursor {
	MachineRegisterInfo &MRI;			MachineRegisterInfo &MRI;

	/// virtualVRegNumber - Book keeping of the last vreg position.			unsigned CurrentBBNumber = 0;
	unsigned virtualVRegNumber;

	/// SkipGapSize - Used to calculate a modulo amount to skip by after every			/// Given an Instruction, construct a hash of the operands
	/// sequence of instructions starting from a given side-effecting			/// of the instructions along with the opcode.
	/// MachineInstruction for a given MachineBasicBlock. The general idea is that			/// When dealing with virtual registers, just hash the opcode of
	/// for a given program compiled with two different opt pipelines, there			/// the instruction defining that vreg.
	/// shouldn't be greater than SkipGapSize difference in how many vregs are in			/// Handle immediates, registers (physical and virtual) explicitly,
	/// play between the two and for every def-use graph of vregs we rename we			/// and return a common value for the other cases.
	/// will round up to the next SkipGapSize'th number so that we have a high			/// Instruction will be named in the following scheme
	/// change of landing on the same name for two given matching side-effects			/// bb<block_no>_hash_<collission_count>.
	/// for the two compilation outcomes.			std::string getInstructionOpcodeHash(MachineInstr &MI);
	const unsigned SkipGapSize;
				/// For all the VRegs that are candidates for renaming,
				/// return a mapping from old vregs to new vregs with names.
				std::map<unsigned, unsigned>
				getVRegRenameMap(const std::vector<NamedVReg> &VRegs);

	/// RenamedInOtherBB - VRegs that we already renamed: ie breadcrumbs.			/// Perform replacing of registers based on the <old,new> vreg map.
	std::vector<Register> RenamedInOtherBB;			bool doVRegRenaming(const std::map<unsigned, unsigned> &VRegRenameMap);

	public:			public:
	NamedVRegCursor() = delete;			VRegRenamer() = delete;
	/// 1000 for the SkipGapSize was a good heuristic at the time of the writing			VRegRenamer(MachineRegisterInfo &MRI) : MRI(MRI) {}
	/// of the MIRCanonicalizerPass. Adjust as needed.
	NamedVRegCursor(MachineRegisterInfo &MRI, unsigned SkipGapSize = 1000)
	: MRI(MRI), virtualVRegNumber(0), SkipGapSize(SkipGapSize) {}

	/// SkipGapSize - Skips modulo a gap value of indices. Indices are used to
	/// produce the next vreg name.
	void skipVRegs();

	unsigned getVirtualVReg() const { return virtualVRegNumber; }

	/// incrementVirtualVReg - This increments an index value that us used to
	/// create a new vreg name. This is not a Register.
	unsigned incrementVirtualVReg(unsigned incr = 1) {
	virtualVRegNumber += incr;
	return virtualVRegNumber;
	}

	/// createVirtualRegister - Given an existing vreg, create a named vreg to			/// createVirtualRegister - Given an existing vreg, create a named vreg to
	/// take its place.			/// take its place. The name is determined by calling
				/// getInstructionOpcodeHash.
	unsigned createVirtualRegister(unsigned VReg);			unsigned createVirtualRegister(unsigned VReg);

	/// renameVRegs - For a given MachineBasicBlock, scan for side-effecting			/// Create a vreg with name and return it.
	/// instructions, walk the def-use from each side-effecting root (in sorted			unsigned createVirtualRegisterWithName(unsigned VReg,
	/// root order) and rename the encountered vregs in the def-use graph in a			const std::string &Name);
	/// canonical ordering. This method maintains book keeping for which vregs			/// Linearly traverse the MachineBasicBlock and rename each instruction's
	/// were already renamed in RenamedInOtherBB.			/// vreg definition based on the semantics of the instruction.
	// @return changed			/// Names are as follows bb<BBNum>_hash_[0-9]+
	bool renameVRegs(MachineBasicBlock *MBB);			bool renameInstsInMBB(MachineBasicBlock *MBB);

				/// Same as the above, but sets a BBNum depending on BB traversal that
				/// will be used as prefix for the vreg names.
				bool renameVRegs(MachineBasicBlock *MBB, unsigned BBNum = 0);

				unsigned getCurrentBBNumber() const { return CurrentBBNumber; }
				plotfiUnsubmitted Not Done Reply Inline Actions Comment please. It wait for D70029 to solidify and land. plotfi: Comment please. It wait for D70029 to solidify and land.
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions There's not much that's taken from D70029 besides this method name and the CurrentBBNumber. Because of the differences in how instructions are named and the algorithm changes, there's no need to wait for that patch. In fact, I just realized that I can simplify this a little more by removing some of the abstractions that we don't need any more. I missed the comment on this one. Thanks for catching it. aditya_nandakumar: There's not much that's taken from D70029 besides this method name and the CurrentBBNumber.
	};			};

	} // namespace llvm			} // namespace llvm

	#endif			#endif

llvm/lib/CodeGen/MIRVRegNamerUtils.cpp

	//===---------- MIRVRegNamerUtils.cpp - MIR VReg Renaming Utilities -------===//			//===---------- MIRVRegNamerUtils.cpp - MIR VReg Renaming Utilities -------===//
	//			//
	// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.			// Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
	// See https://llvm.org/LICENSE.txt for license information.			// See https://llvm.org/LICENSE.txt for license information.
	// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception			// SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
	//			//
	//===----------------------------------------------------------------------===//			//===----------------------------------------------------------------------===//

	#include "MIRVRegNamerUtils.h"			#include "MIRVRegNamerUtils.h"
	#include "llvm/Support/Debug.h"			#include "llvm/Support/Debug.h"

	using namespace llvm;			using namespace llvm;

	#define DEBUG_TYPE "mir-vregnamer-utils"			#define DEBUG_TYPE "mir-vregnamer-utils"

	namespace {			bool VRegRenamer::doVRegRenaming(
				const std::map<unsigned, unsigned> &VRegRenameMap) {
	// TypedVReg and VRType are used to tell the renamer what to do at points in a
	// sequence of values to be renamed. A TypedVReg can either contain
	// an actual VReg, a FrameIndex, or it could just be a barrier for the next
	// candidate (side-effecting instruction). This tells the renamer to increment
	// to the next vreg name, or to skip modulo some skip-gap value.
	enum VRType { RSE_Reg = 0, RSE_FrameIndex, RSE_NewCandidate };
	class TypedVReg {
	VRType Type;
	Register Reg;

	public:
	TypedVReg(Register Reg) : Type(RSE_Reg), Reg(Reg) {}
	TypedVReg(VRType Type) : Type(Type), Reg(~0U) {
	assert(Type != RSE_Reg && "Expected a non-Register Type.");
	}

	bool isReg() const { return Type == RSE_Reg; }
	bool isFrameIndex() const { return Type == RSE_FrameIndex; }
	bool isCandidate() const { return Type == RSE_NewCandidate; }

	VRType getType() const { return Type; }
	Register getReg() const {
	assert(this->isReg() && "Expected a virtual or physical Register.");
	return Reg;
	}
	};

	/// Here we find our candidates. What makes an interesting candidate?
	/// A candidate for a canonicalization tree root is normally any kind of
	/// instruction that causes side effects such as a store to memory or a copy to
	/// a physical register or a return instruction. We use these as an expression
	/// tree root that we walk in order to build a canonical walk which should
	/// result in canonical vreg renaming.
	std::vector<MachineInstr > populateCandidates(MachineBasicBlock MBB) {
	std::vector<MachineInstr *> Candidates;
	MachineRegisterInfo &MRI = MBB->getParent()->getRegInfo();

	for (auto II = MBB->begin(), IE = MBB->end(); II != IE; ++II) {
	MachineInstr MI = &II;

	bool DoesMISideEffect = false;

	if (MI->getNumOperands() > 0 && MI->getOperand(0).isReg()) {
	const Register Dst = MI->getOperand(0).getReg();
	DoesMISideEffect \|= !Register::isVirtualRegister(Dst);

	for (auto UI = MRI.use_begin(Dst); UI != MRI.use_end(); ++UI) {
	if (DoesMISideEffect)
	break;
	DoesMISideEffect \|= (UI->getParent()->getParent() != MI->getParent());
	}
	}

	if (!MI->mayStore() && !MI->isBranch() && !DoesMISideEffect)
	continue;

	LLVM_DEBUG(dbgs() << "Found Candidate: "; MI->dump(););
	Candidates.push_back(MI);
	}

	return Candidates;
	}

	void doCandidateWalk(std::vector<TypedVReg> &VRegs,
	plotfiUnsubmitted Not Done Reply Inline Actions I think the candidate walk code can be left alone here, and removed in an NFC if really necessary. plotfi: I think the candidate walk code can be left alone here, and removed in an NFC if really…
	aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions I removed this as there will be no more users of this method. aditya_nandakumar: I removed this as there will be no more users of this method.
	std::queue<TypedVReg> &RegQueue,
	std::vector<MachineInstr *> &VisitedMIs,
	const MachineBasicBlock *MBB) {

	const MachineFunction &MF = *MBB->getParent();
	const MachineRegisterInfo &MRI = MF.getRegInfo();

	while (!RegQueue.empty()) {

	auto TReg = RegQueue.front();
	RegQueue.pop();

	if (TReg.isFrameIndex()) {
	LLVM_DEBUG(dbgs() << "Popping frame index.\n";);
	VRegs.push_back(TypedVReg(RSE_FrameIndex));
	continue;
	}

	assert(TReg.isReg() && "Expected vreg or physreg.");
	Register Reg = TReg.getReg();

	if (Register::isVirtualRegister(Reg)) {
	LLVM_DEBUG({
	dbgs() << "Popping vreg ";
	MRI.def_begin(Reg)->dump();
	dbgs() << "\n";
	});

	if (!llvm::any_of(VRegs, [&](const TypedVReg &TR) {
	return TR.isReg() && TR.getReg() == Reg;
	})) {
	VRegs.push_back(TypedVReg(Reg));
	}
	} else {
	LLVM_DEBUG(dbgs() << "Popping physreg.\n";);
	VRegs.push_back(TypedVReg(Reg));
	continue;
	}

	for (auto RI = MRI.def_begin(Reg), RE = MRI.def_end(); RI != RE; ++RI) {
	MachineInstr *Def = RI->getParent();

	if (Def->getParent() != MBB)
	continue;

	if (llvm::any_of(VisitedMIs,
	[&](const MachineInstr *VMI) { return Def == VMI; })) {
	break;
	}

	LLVM_DEBUG({
	dbgs() << "\n========================\n";
	dbgs() << "Visited MI: ";
	Def->dump();
	dbgs() << "BB Name: " << Def->getParent()->getName() << "\n";
	dbgs() << "\n========================\n";
	});
	VisitedMIs.push_back(Def);
	for (unsigned I = 1, E = Def->getNumOperands(); I != E; ++I) {

	MachineOperand &MO = Def->getOperand(I);
	if (MO.isFI()) {
	LLVM_DEBUG(dbgs() << "Pushing frame index.\n";);
	RegQueue.push(TypedVReg(RSE_FrameIndex));
	}

	if (!MO.isReg())
	continue;
	RegQueue.push(TypedVReg(MO.getReg()));
	}
	}
	}
	}

	std::map<unsigned, unsigned>
	getVRegRenameMap(const std::vector<TypedVReg> &VRegs,
	const std::vector<Register> &renamedInOtherBB,
	MachineRegisterInfo &MRI, NamedVRegCursor &NVC) {
	std::map<unsigned, unsigned> VRegRenameMap;
	bool FirstCandidate = true;

	for (auto &vreg : VRegs) {
	if (vreg.isFrameIndex()) {
	// We skip one vreg for any frame index because there is a good chance
	// (especially when comparing SelectionDAG to GlobalISel generated MIR)
	// that in the other file we are just getting an incoming vreg that comes
	// from a copy from a frame index. So it's safe to skip by one.
	unsigned LastRenameReg = NVC.incrementVirtualVReg();
	(void)LastRenameReg;
	LLVM_DEBUG(dbgs() << "Skipping rename for FI " << LastRenameReg << "\n";);
	continue;
	} else if (vreg.isCandidate()) {

	// After the first candidate, for every subsequent candidate, we skip mod
	// 10 registers so that the candidates are more likely to start at the
	// same vreg number making it more likely that the canonical walk from the
	// candidate insruction. We don't need to skip from the first candidate of
	// the BasicBlock because we already skip ahead several vregs for each BB.
	unsigned LastRenameReg = NVC.getVirtualVReg();
	if (FirstCandidate)
	NVC.incrementVirtualVReg(LastRenameReg % 10);
	FirstCandidate = false;
	continue;
	} else if (!Register::isVirtualRegister(vreg.getReg())) {
	unsigned LastRenameReg = NVC.incrementVirtualVReg();
	(void)LastRenameReg;
	LLVM_DEBUG({
	dbgs() << "Skipping rename for Phys Reg " << LastRenameReg << "\n";
	});
	continue;
	}

	auto Reg = vreg.getReg();
	if (llvm::find(renamedInOtherBB, Reg) != renamedInOtherBB.end()) {
	LLVM_DEBUG(dbgs() << "Vreg " << Reg
	<< " already renamed in other BB.\n";);
	continue;
	}

	auto Rename = NVC.createVirtualRegister(Reg);

	if (VRegRenameMap.find(Reg) == VRegRenameMap.end()) {
	LLVM_DEBUG(dbgs() << "Mapping vreg ";);
	if (MRI.reg_begin(Reg) != MRI.reg_end()) {
	LLVM_DEBUG(auto foo = &*MRI.reg_begin(Reg); foo->dump(););
	} else {
	LLVM_DEBUG(dbgs() << Reg;);
	}
	LLVM_DEBUG(dbgs() << " to ";);
	if (MRI.reg_begin(Rename) != MRI.reg_end()) {
	LLVM_DEBUG(auto foo = &*MRI.reg_begin(Rename); foo->dump(););
	} else {
	LLVM_DEBUG(dbgs() << Rename;);
	}
	LLVM_DEBUG(dbgs() << "\n";);

	VRegRenameMap.insert(std::pair<unsigned, unsigned>(Reg, Rename));
	}
	}

	return VRegRenameMap;
	}

	bool doVRegRenaming(std::vector<Register> &renamedInOtherBB,
	const std::map<unsigned, unsigned> &VRegRenameMap,
	MachineRegisterInfo &MRI) {
	bool Changed = false;			bool Changed = false;
	for (auto I = VRegRenameMap.begin(), E = VRegRenameMap.end(); I != E; ++I) {			for (auto I = VRegRenameMap.begin(), E = VRegRenameMap.end(); I != E; ++I) {

	auto VReg = I->first;			auto VReg = I->first;
	auto Rename = I->second;			auto Rename = I->second;

	renamedInOtherBB.push_back(Rename);

	std::vector<MachineOperand *> RenameMOs;			std::vector<MachineOperand *> RenameMOs;
	for (auto &MO : MRI.reg_operands(VReg)) {			for (auto &MO : MRI.reg_operands(VReg)) {
	RenameMOs.push_back(&MO);			RenameMOs.push_back(&MO);
	}			}

	for (auto *MO : RenameMOs) {			for (auto *MO : RenameMOs) {
	Changed = true;			Changed = true;
	MO->setReg(Rename);			MO->setReg(Rename);

	if (!MO->isDef())			if (!MO->isDef())
	MO->setIsKill(false);			MO->setIsKill(false);
	}			}
	}			}

	return Changed;			return Changed;
	}			}

	bool renameVRegs(MachineBasicBlock *MBB,			std::map<unsigned, unsigned>
	std::vector<Register> &renamedInOtherBB,			VRegRenamer::getVRegRenameMap(const std::vector<NamedVReg> &VRegs) {
	NamedVRegCursor &NVC) {			std::map<unsigned, unsigned> VRegRenameMap;
	bool Changed = false;
	MachineFunction &MF = *MBB->getParent();
	MachineRegisterInfo &MRI = MF.getRegInfo();

	std::vector<MachineInstr *> Candidates = populateCandidates(MBB);
	std::vector<MachineInstr *> VisitedMIs;
	llvm::copy(Candidates, std::back_inserter(VisitedMIs));

	std::vector<TypedVReg> VRegs;
	for (auto candidate : Candidates) {
	VRegs.push_back(TypedVReg(RSE_NewCandidate));

	std::queue<TypedVReg> RegQueue;

	// Here we walk the vreg operands of a non-root node along our walk.
	// The root nodes are the original candidates (stores normally).
	// These are normally not the root nodes (except for the case of copies to
	// physical registers).
	for (unsigned i = 1; i < candidate->getNumOperands(); i++) {
	if (candidate->mayStore() \|\| candidate->isBranch())
	break;

	MachineOperand &MO = candidate->getOperand(i);
	if (!(MO.isReg() && Register::isVirtualRegister(MO.getReg())))
	continue;

	LLVM_DEBUG(dbgs() << "Enqueue register"; MO.dump(); dbgs() << "\n";);
	RegQueue.push(TypedVReg(MO.getReg()));
	}

	// Here we walk the root candidates. We start from the 0th operand because			std::map<std::string, unsigned> VRegNameCollisionMap;
				plotfiUnsubmitted Not Done Reply Inline Actions Might want to consider a StringMap plotfi: Might want to consider a StringMap
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions Mostly followed the style which is using std containers in the rest of this file. aditya_nandakumar: Mostly followed the style which is using std containers in the rest of this file.
	// the root is normally a store to a vreg.
	for (unsigned i = 0; i < candidate->getNumOperands(); i++) {

	if (!candidate->mayStore() && !candidate->isBranch())			auto GetUniqueVRegName =
	break;			[&VRegNameCollisionMap](const NamedVReg &Reg) -> std::string {
				auto It = VRegNameCollisionMap.find(Reg.getName());
				unsigned Counter = 0;
				if (It != VRegNameCollisionMap.end()) {
				Counter = It->second;
				}
				++Counter;
				VRegNameCollisionMap[Reg.getName()] = Counter;
				return Reg.getName() + "__" + std::to_string(Counter);
				};

	MachineOperand &MO = candidate->getOperand(i);			for (auto &Vreg : VRegs) {
				auto Reg = Vreg.getReg();
				assert(Register::isVirtualRegister(Reg) &&
				"Expecting Virtual Registers Only");
				auto NewNameForReg = GetUniqueVRegName(Vreg);
				auto Rename = createVirtualRegisterWithName(Reg, NewNameForReg);

	// TODO: Do we want to only add vregs here?			VRegRenameMap.insert(std::pair<unsigned, unsigned>(Reg, Rename));
	if (!MO.isReg() && !MO.isFI())			}
	continue;			return VRegRenameMap;
				}

	LLVM_DEBUG(dbgs() << "Enqueue Reg/FI"; MO.dump(); dbgs() << "\n";);			std::string VRegRenamer::getInstructionOpcodeHash(MachineInstr &MI) {
				std::string S;
				raw_string_ostream OS(S);
				auto HashOperand = [this](const MachineOperand &MO) -> unsigned {
				if (MO.isImm())
				return MO.getImm();
				if (MO.isTargetIndex())
				return MO.getOffset() \| (MO.getTargetFlags() << 16);
				if (MO.isReg()) {
				return Register::isVirtualRegister(MO.getReg())
				? MRI.getVRegDef(MO.getReg())->getOpcode()
				plotfiUnsubmitted Not Done Reply Inline Actions How do you guarantee that two different vregs don't resolve to the same "HashOperand" value? Because can't the opcode of two different def instructions happen to be the same? Is it just highly unlikely because you are also hashing the operands of the entire instruction together? plotfi: How do you guarantee that two different vregs don't resolve to the same "HashOperand" value?
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions Consider this sequence. On one file you have %0 = COPY $x0 %1 = LDIMM 25 %2 = ADD %0, %1 %3 = SUB %2, 42 On the other side you have %0 = COPY $x1 %1 = LDIMM 26 %2 = ADD %0, %1 %3 = SUB %2, 42 Clearly the add is identical on both sides and is not that interesting - essentially it's an add from a LDIMM and a COPY. It's possible that those values might be different, but as far as the ADD is concerned, it's sufficient to say that for both sides ADD is doing something similar and is not the source of a diff. Add's VReg name will show identical on both cases, but sources will appear different (due to different names). However each of %0 and %1 would differ in how they are named and will show up as the diff. Sub will finally look identical. In general we want to capture the diff in the very first places they happen and not beyond that. After renaming LHS would be, %hash1 = COPY $x0 %hash2 = LDIMM 25 %hash3 = ADD %hash1, %hash2 %hash7 = SUB %hash3, 0 RHS would be %hash4 = COPY $x1 %hash5 = LDIMM 26 %hash3 = ADD %hash4, %hash5 %hash7 = SUB %hash3, 0 Now when you diff those, the ADD is not the one that's different, but the first two instructions are. So they will show up nicely in the diff, but the ADD will appear identical with respect to destination and opcode, but will differ from sources. The SUB in both will be identical and will disappear from the context of the diff as it's not interesting. Now even if you have additional instruction somewhere in the instruction stream in either the LHS or the RHS side, the naming of this won't change and will point out the differences nicely. aditya_nandakumar: Consider this sequence. On one file you have ``` %0 = COPY $x0 %1 = LDIMM 25 %2 = ADD %0, %1…
				: (unsigned)MO.getReg();
				plotfiUnsubmitted Not Done Reply Inline Actions Comments please. plotfi: Comments please.
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions Comments in the header. aditya_nandakumar: Comments in the header.
				}
				// We could explicitly handle all the types of the MachineOperand,
				// here but we can just return a common number until we find a
				// compelling test case where this is bad. The only side effect here
				// is contributing to a hash collission but there's enough information
				// (Opcodes,other registers etc) that this will likely not be a problem.
				plotfiUnsubmitted Not Done Reply Inline Actions llvm_unreachable please. plotfi: llvm_unreachable please.
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions That unfortunately won't work as we are not explicitly handling all variants of MachineOperand Kinds here. There are some challenges that can be solved (bb naming scheme) when dealing with PHIS ie how to hash them without using the name . There maybe other cases where we want to differentiate special operands. Right now we uniquely differentiate immediate and regs, but we can't really assert on any of the others. aditya_nandakumar: That unfortunately won't work as we are not explicitly handling all variants of MachineOperand…
				bognerUnsubmitted Not Done Reply Inline Actions As far as I understand this can't really be replaced with an assert as is, since things like basic block ids are being deliberately dropped here. Can you explicitly handle the cases that we really want to drop from the hash or at least update this comment to explain? I realize it might make sense to always return something, since the only effect is more collisions, but the comment reads as if you think it's an error to get here. bogner: As far as I understand this can't really be replaced with an assert as is, since things like…
				return 0;
				};
				SmallVector<unsigned, 16> MIOperands;
				MIOperands.push_back(MI.getOpcode());
				for (auto &Op : MI.uses()) {
				MIOperands.push_back(HashOperand(Op));
				}
				auto HashMI = hash_combine_range(MIOperands.begin(), MIOperands.end());
				return std::to_string(HashMI).substr(0, 5);
				}

	RegQueue.push(MO.isReg() ? TypedVReg(MO.getReg())			unsigned VRegRenamer::createVirtualRegister(unsigned VReg) {
	: TypedVReg(RSE_FrameIndex));			return createVirtualRegisterWithName(
				VReg, getInstructionOpcodeHash(*MRI.getVRegDef(VReg)));
	}			}

	doCandidateWalk(VRegs, RegQueue, VisitedMIs, MBB);			bool VRegRenamer::renameInstsInMBB(MachineBasicBlock *MBB) {
				std::vector<NamedVReg> VRegs;
				plotfiUnsubmitted Not Done Reply Inline Actions I like this linear approach but I might like to keep the tree based approach as well as a toggle until we can add more tests. In the tree based approach I was trying to do the canonicalization based on the chain of operations that flow into a side effect, where here the side effects are renaming barriers? On second thought, I really like the hashing approach on the VReg-Def opcode and if you are confident it wont result in too may cases where a difference that should have remained is lost, I'd be fine with replacing all of this walking business. Comments (and perhaps some brief MIR snippets) on how this renaming mechanism works would be really nice to have as well. From what I understand, you have many test cases downstream. Can these be ported to aarch64 to bolster the testing upstream? Even the tests with 7 and 8 instructions can be useful, and I'd assume shouldn't be too difficult to port to a supported downstream target? Does this sound reasonable to you @aditya_nandakumar @bogner ?? plotfi: I like this linear approach but I might like to keep the tree based approach as well as a…
				aditya_nandakumarAuthorUnsubmitted Done Reply Inline Actions In general hash collisions are resolved similarly on both sides of a diff and the hash collision renaming will also happen similarly on both sides of the diff (just incrementing numbers). This will line up really well for diffs. Additionally with the hashing method, just when you're staring at two equivalent pieces of code, just by looking at the reg name that is a hash, you can just assume that they're likely equivalent instructions and move your focus elsewhere. In general, due to the disadvantages of the previous algorithm and the advantages of this approach, there should be no need to keep both approaches. Regarding tests, there's sufficient coverage. The core algorithm is simple - hash instruction oeprands and rename based on the hash(which capture the semantics of the instruction). It's evident that this works correctly (look at MIR/AArch64/mirCanonIdempotent.mir included in this patch where two of the same MOV instructions are renamed correctly) # CHECK-NEXT: %bb0_42274__1:gpr32 = MOVi32imm 408 # CHECK-NEXT: %bb0_42274__2:gpr32 = MOVi32imm 408 The only thing adding more tests would help with is we tie that up some diffing tool and make sure that the core strategy still works. Otherwise, it will be quite identical to the ones we already have here. I'll still try to come up with some screenshots of the two approaches and attach it to the review. aditya_nandakumar: In general hash collisions are resolved similarly on both sides of a diff and the hash…
				bognerUnsubmitted Not Done Reply Inline Actions This is still doing the traversal in a sense, just a bit more implicitly while we calculate the instruction hash. The nice thing here is that the hash is stable enough that it doesn't matter where we start from, so we can just walk the instructions linearly and it ends up looking a bit simpler. We don't really have many interestingly distinct test cases downstream - the test cases below cover the functional testing pretty well. What we do have is some experience using this on large functions that were compiled with and without GlobalISel, which have shown that this hashing approach helps quite a bit. bogner: This is still doing the traversal in a sense, just a bit more implicitly while we calculate the…
				std::string Prefix = "bb" + std::to_string(getCurrentBBNumber()) + "_";
				for (auto &MII : *MBB) {
				MachineInstr &Candidate = MII;
				// Don't rename stores/branches.
				if (Candidate.mayStore() \|\| Candidate.isBranch())
				continue;
				if (!Candidate.getNumOperands())
				continue;
				// Look for instructions that define VRegs in operand 0.
				MachineOperand &MO = Candidate.getOperand(0);
				// Avoid non regs, instructions defining physical regs.
				if (!MO.isReg() \|\| !Register::isVirtualRegister(MO.getReg()))
				continue;
				VRegs.push_back(
				NamedVReg(MO.getReg(), Prefix + getInstructionOpcodeHash(Candidate)));
	}			}

	// If we have populated no vregs to rename then bail.			// If we have populated no vregs to rename then bail.
	// The rest of this function does the vreg remaping.			// The rest of this function does the vreg remaping.
	if (VRegs.size() == 0)			if (VRegs.size() == 0)
	return Changed;			return false;

	auto VRegRenameMap = getVRegRenameMap(VRegs, renamedInOtherBB, MRI, NVC);			auto VRegRenameMap = getVRegRenameMap(VRegs);
	Changed \|= doVRegRenaming(renamedInOtherBB, VRegRenameMap, MRI);			return doVRegRenaming(VRegRenameMap);
	return Changed;
	}			}
	} // anonymous namespace

	void NamedVRegCursor::skipVRegs() {			bool VRegRenamer::renameVRegs(MachineBasicBlock *MBB, unsigned BBNum) {
	unsigned VRegGapIndex = 1;			CurrentBBNumber = BBNum;
	if (!virtualVRegNumber) {			return renameInstsInMBB(MBB);
	VRegGapIndex = 0;
	virtualVRegNumber = MRI.createIncompleteVirtualRegister();
	}			}
	const unsigned VR_GAP = (++VRegGapIndex * SkipGapSize);

	unsigned I = virtualVRegNumber;
	const unsigned E = (((I + VR_GAP) / VR_GAP) + 1) * VR_GAP;

	virtualVRegNumber = E;			unsigned VRegRenamer::createVirtualRegisterWithName(unsigned VReg,
	}			const std::string &Name) {
				std::string Temp(Name);
	unsigned NamedVRegCursor::createVirtualRegister(unsigned VReg) {			std::transform(Temp.begin(), Temp.end(), Temp.begin(), ::tolower);
	if (!virtualVRegNumber)
	skipVRegs();
	std::string S;
	raw_string_ostream OS(S);
	OS << "namedVReg" << (virtualVRegNumber & ~0x80000000);
	OS.flush();
	virtualVRegNumber++;
	if (auto RC = MRI.getRegClassOrNull(VReg))			if (auto RC = MRI.getRegClassOrNull(VReg))
	return MRI.createVirtualRegister(RC, OS.str());			return MRI.createVirtualRegister(RC, Temp);
	return MRI.createGenericVirtualRegister(MRI.getType(VReg), OS.str());			return MRI.createGenericVirtualRegister(MRI.getType(VReg), Name);
	}

	bool NamedVRegCursor::renameVRegs(MachineBasicBlock *MBB) {
	return ::renameVRegs(MBB, RenamedInOtherBB, *this);
	}			}

llvm/test/CodeGen/MIR/AArch64/mirCanonCopyCopyProp.mir

Show All 33 Lines	- { id: 6, type: default, offset: 0, size: 8, alignment: 8,
debug-info-location: '' }		debug-info-location: '' }
constants:		constants:
body: \|		body: \|
bb.0:		bb.0:
liveins: $x0, $x1, $d0, $d1		liveins: $x0, $x1, $d0, $d1

%42:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)		%42:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)

;CHECK: %namedVReg1352:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)		;CHECK: %bb0_11909__1:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)
;CHECK-NEXT: $w0 = COPY %namedVReg1352		;CHECK-NEXT: $w0 = COPY %bb0_11909__1
;CHECK-NEXT: RET_ReallyLR implicit $w0		;CHECK-NEXT: RET_ReallyLR implicit $w0

%vreg1234:gpr32 = COPY %42		%vreg1234:gpr32 = COPY %42
%vreg1235:gpr32 = COPY %vreg1234		%vreg1235:gpr32 = COPY %vreg1234
%vreg1236:gpr32 = COPY %vreg1235		%vreg1236:gpr32 = COPY %vreg1235
$w0 = COPY %vreg1236		$w0 = COPY %vreg1236
RET_ReallyLR implicit $w0		RET_ReallyLR implicit $w0

...		...

llvm/test/CodeGen/MIR/AArch64/mirCanonIdempotent.mir

	# RUN: llc -mtriple=arm64-apple-ios11.0.0 -o - -verify-machineinstrs -run-pass mir-canonicalizer %s \| FileCheck %s			# RUN: llc -mtriple=arm64-apple-ios11.0.0 -o - -verify-machineinstrs -run-pass mir-canonicalizer %s \| FileCheck %s
	# These Idempotent instructions are sorted alphabetically (based on after the '=')			# These Idempotent instructions are sorted alphabetically (based on after the '=')
	# CHECK: %namedVReg4352:gpr64 = MOVi64imm 4617315517961601024			# CHECK: %bb0_17169__1:gpr64 = MOVi64imm 4617315517961601024
	# CHECK-NEXT: %namedVReg4353:gpr32 = MOVi32imm 408			# CHECK-NEXT: %bb0_42274__1:gpr32 = MOVi32imm 408
	# CHECK-NEXT: %namedVReg4354:gpr32 = MOVi32imm 408			# CHECK-NEXT: %bb0_42274__2:gpr32 = MOVi32imm 408
	# CHECK-NEXT: %namedVReg4355:gpr64all = IMPLICIT_DEF			# CHECK-NEXT: %bb0_18275__1:gpr64all = IMPLICIT_DEF
	# CHECK-NEXT: %namedVReg4356:fpr64 = FMOVDi 20			# CHECK-NEXT: %bb0_13880__1:fpr64 = FMOVDi 20
	# CHECK-NEXT: %namedVReg4357:fpr64 = FMOVDi 112			# CHECK-NEXT: %bb0_21467__1:fpr64 = FMOVDi 112

	...			...
	---			---
	name: Proc8			name: Proc8
	stack:			stack:
	- { id: 0, type: default, offset: 0, size: 4, alignment: 4,			- { id: 0, type: default, offset: 0, size: 4, alignment: 4,
	stack-id: default, callee-saved-register: '', callee-saved-restored: true,			stack-id: default, callee-saved-register: '', callee-saved-restored: true,
	local-offset: -4, debug-info-variable: '', debug-info-expression: '',			local-offset: -4, debug-info-variable: '', debug-info-expression: '',
	▲ Show 20 Lines • Show All 109 Lines • Show Last 20 Lines

llvm/test/CodeGen/MIR/AArch64/mirnamer.mir

# RUN: llc -x mir -mtriple aarch64-apple-ios -run-pass mir-namer -verify-machineinstrs -o - < %s \| FileCheck %s		# RUN: llc -x mir -mtriple aarch64-apple-ios -run-pass mir-namer -verify-machineinstrs -o - < %s \| FileCheck %s

---		---
name: foo		name: foo
body: \|		body: \|
bb.0:		bb.0:

;CHECK: bb		;CHECK-LABEL: bb.0
;CHECK-NEXT: %namedVReg1353:_(p0) = COPY $d0		;CHECK-NEXT: %bb0_12265__1:_(p0) = COPY $d0
;CHECK-NEXT: %namedVReg1352:_(<4 x s32>) = COPY $q0		;CHECK-NEXT: %bb0_18308__1:_(<4 x s32>) = COPY $q0
;CHECK-NEXT: G_STORE %namedVReg1352(<4 x s32>), %namedVReg1353		;CHECK-NEXT: G_STORE %bb0_18308__1(<4 x s32>), %bb0_12265__1(p0) :: (store 16)

liveins: $q0, $d0		liveins: $q0, $d0
%1:fpr(p0) = COPY $d0		%1:fpr(p0) = COPY $d0
%0:fpr(<4 x s32>) = COPY $q0		%0:fpr(<4 x s32>) = COPY $q0
G_STORE %0(<4 x s32>), %1(p0) :: (store 16)		G_STORE %0(<4 x s32>), %1(p0) :: (store 16)
...		...
---		---
name: bar		name: bar
stack:		stack:
- { id: 0, type: default, offset: 0, size: 4, alignment: 4,		- { id: 0, type: default, offset: 0, size: 4, alignment: 4,
stack-id: default, callee-saved-register: '', callee-saved-restored: true,		stack-id: default, callee-saved-register: '', callee-saved-restored: true,
local-offset: -4, debug-info-variable: '', debug-info-expression: '',		local-offset: -4, debug-info-variable: '', debug-info-expression: '',
debug-info-location: '' }		debug-info-location: '' }
body: \|		body: \|
bb.0:		bb.0:

;CHECK: bb		;CHECK-LABEL: bb.0
;CHECK-NEXT: %namedVReg1370:gpr32 = LDRWui		;CHECK-NEXT: %bb0_11909__1:gpr32 = LDRWui
;CHECK-NEXT: %namedVReg1371:gpr32 = MOVi32imm 1		;CHECK-NEXT: %bb0_17251__1:gpr32 = MOVi32imm 1
;CHECK-NEXT: %namedVReg1372:gpr32 = LDRWui		;CHECK-NEXT: %bb0_11909__2:gpr32 = LDRWui
;CHECK-NEXT: %namedVReg1373:gpr32 = MOVi32imm 2		;CHECK-NEXT: %bb0_44296__1:gpr32 = MOVi32imm 2
;CHECK-NEXT: %namedVReg1359:gpr32 = LDRWui		;CHECK-NEXT: %bb0_11909__3:gpr32 = LDRWui
;CHECK-NEXT: %namedVReg1360:gpr32 = MOVi32imm 3		;CHECK-NEXT: %bb0_10150__1:gpr32 = MOVi32imm 3
;CHECK-NEXT: %namedVReg1365:gpr32 = nsw ADDWrr		;CHECK-NEXT: %bb0_18184__1:gpr32 = nsw ADDWrr
;CHECK-NEXT: %namedVReg1361:gpr32 = LDRWui		;CHECK-NEXT: %bb0_11909__4:gpr32 = LDRWui
;CHECK-NEXT: %namedVReg1366:gpr32 = nsw ADDWrr		;CHECK-NEXT: %bb0_18184__2:gpr32 = nsw ADDWrr
;CHECK-NEXT: %namedVReg1362:gpr32 = MOVi32imm 4		;CHECK-NEXT: %bb0_56622__1:gpr32 = MOVi32imm 4
;CHECK-NEXT: %namedVReg1355:gpr32 = nsw ADDWrr		;CHECK-NEXT: %bb0_18184__3:gpr32 = nsw ADDWrr
;CHECK-NEXT: %namedVReg1363:gpr32 = LDRWui		;CHECK-NEXT: %bb0_11909__5:gpr32 = LDRWui
;CHECK-NEXT: %namedVReg1364:gpr32 = MOVi32imm 5		;CHECK-NEXT: %bb0_74788__1:gpr32 = MOVi32imm 5

%0:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)		%0:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)
%1:gpr32 = MOVi32imm 1		%1:gpr32 = MOVi32imm 1
%2:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)		%2:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)
%3:gpr32 = MOVi32imm 2		%3:gpr32 = MOVi32imm 2
%4:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)		%4:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)
%5:gpr32 = MOVi32imm 3		%5:gpr32 = MOVi32imm 3
%10:gpr32 = nsw ADDWrr %0:gpr32, %1:gpr32		%10:gpr32 = nsw ADDWrr %0:gpr32, %1:gpr32
Show All 18 Lines	stack:
- { id: 0, type: default, offset: 0, size: 4, alignment: 4,		- { id: 0, type: default, offset: 0, size: 4, alignment: 4,
stack-id: default, callee-saved-register: '', callee-saved-restored: true,		stack-id: default, callee-saved-register: '', callee-saved-restored: true,
local-offset: -4, debug-info-variable: '', debug-info-expression: '',		local-offset: -4, debug-info-variable: '', debug-info-expression: '',
debug-info-location: '' }		debug-info-location: '' }
body: \|		body: \|
bb.0:		bb.0:
liveins: $x0, $x1, $d0, $d1		liveins: $x0, $x1, $d0, $d1

;CHECK: bb		;CHECK-LABEL: bb.0:
;CHECK-NEXT: %namedVReg1355:gpr32 = LDRWui		;CHECK-NEXT: %bb0_11909__1:gpr32 = LDRWui %stack.0, 0
;CHECK-NEXT: %namedVReg1354:gpr32 = COPY %namedVReg1355		;CHECK-NEXT: %bb0_31408__1:gpr32 = COPY %bb0_11909__1
;CHECK-NEXT: %namedVReg1353:gpr32 = COPY %namedVReg1354		;CHECK-NEXT: %bb0_14282__1:gpr32 = COPY %bb0_31408__1
;CHECK-NEXT: %namedVReg1352:gpr32 = COPY %namedVReg1353		;CHECK-NEXT: %bb0_14282__2:gpr32 = COPY %bb0_14282__1
;CHECK-NEXT: $w0 = COPY %namedVReg1352		;CHECK-NEXT: $w0 = COPY %bb0_14282__2

%0:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)		%0:gpr32 = LDRWui %stack.0, 0 :: (dereferenceable load 8)
%1:gpr32 = COPY %0		%1:gpr32 = COPY %0
%2:gpr32 = COPY %1		%2:gpr32 = COPY %1
%3:gpr32 = COPY %2		%3:gpr32 = COPY %2
$w0 = COPY %3		$w0 = COPY %3
RET_ReallyLR implicit $w0		RET_ReallyLR implicit $w0
...		...

llvm/test/CodeGen/MIR/AMDGPU/mir-canon-multi.mir

	# RUN: llc -o - -march=amdgcn -run-pass mir-canonicalizer -x mir %s \| FileCheck %s			# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py
				# RUN: llc -o - -march=amdgcn -run-pass mir-canonicalizer %s \| FileCheck %s

	# CHECK: %namedVReg4354:vgpr_32 = COPY $vgpr0
	# CHECK: %namedVReg1352:vgpr_32 = COPY %namedVReg4353
	# CHECK-NEXT: %namedVReg1358:vgpr_32 = COPY %namedVReg1361
	# CHECK-NEXT: %namedVReg1359:vgpr_32 = COPY %namedVReg1362
	# CHECK-NEXT: %namedVReg1353:vreg_64 = REG_SEQUENCE %namedVReg4354, %subreg.sub0, %namedVReg1352, %subreg.sub1
	# CHECK-NEXT: %namedVReg1354:sgpr_128 = REG_SEQUENCE %namedVReg4354, %subreg.sub0, %namedVReg1352, %subreg.sub1, %namedVReg1358, %subreg.sub2, %namedVReg1359, %subreg.sub3
	# This tests for the itereator invalidation fix (reviews.llvm.org/D62713)			# This tests for the itereator invalidation fix (reviews.llvm.org/D62713)
	# CHECK-NEXT: BUFFER_STORE_DWORD_ADDR64 %namedVReg1352, %namedVReg1353, %namedVReg1354, 0, 0, 0, 0, 0, 0, 0, implicit $exec
	...			...
	---			---
	name: foo			name: foo
	body: \|			body: \|
	bb.0:			bb.0:
				; CHECK-LABEL: name: foo
				; CHECK: %bb0_43693__1:sreg_32_xm0 = S_MOV_B32 61440
				; CHECK: %bb0_13829__1:sreg_32_xm0 = S_MOV_B32 0
				; CHECK: %bb0_14481__1:vgpr_32 = COPY $vgpr0
				; CHECK: %bb0_18142__1:sgpr_64 = COPY $sgpr0_sgpr1
				; CHECK: %bb0_16462__1:sreg_64_xexec = S_LOAD_DWORDX2_IMM %bb0_18142__1, 9, 0, 0
				; CHECK: %bb0_89962__1:sreg_64_xexec = S_LOAD_DWORDX2_IMM %bb0_18142__1, 11, 0, 0
				; CHECK: %bb0_10035__1:vgpr_32 = COPY %bb0_13829__1
				; CHECK: %bb0_18361__1:vgpr_32 = COPY %bb0_16462__1
				; CHECK: %bb0_18361__2:vgpr_32 = COPY %bb0_89962__1
				; CHECK: %bb0_16181__1:vreg_64 = REG_SEQUENCE %bb0_14481__1, %subreg.sub0, %bb0_10035__1, %subreg.sub1
				; CHECK: %bb0_71315__1:sgpr_128 = REG_SEQUENCE %bb0_14481__1, %subreg.sub0, %bb0_10035__1, %subreg.sub1, %bb0_18361__1, %subreg.sub2, %bb0_18361__2, %subreg.sub3
				; CHECK: BUFFER_STORE_DWORD_ADDR64 %bb0_10035__1, %bb0_16181__1, %bb0_71315__1, 0, 0, 0, 0, 0, 0, 0, implicit $exec
				; CHECK: S_ENDPGM 0
	%10:sreg_32_xm0 = S_MOV_B32 61440			%10:sreg_32_xm0 = S_MOV_B32 61440
	%11:sreg_32_xm0 = S_MOV_B32 0			%11:sreg_32_xm0 = S_MOV_B32 0
	%3:vgpr_32 = COPY $vgpr0			%3:vgpr_32 = COPY $vgpr0

	%vreg123_0:vgpr_32 = COPY %3			%vreg123_0:vgpr_32 = COPY %3
	%0:sgpr_64 = COPY $sgpr0_sgpr1			%0:sgpr_64 = COPY $sgpr0_sgpr1
	%vreg123_1:vgpr_32 = COPY %11			%vreg123_1:vgpr_32 = COPY %11
	%27:vreg_64 = REG_SEQUENCE %vreg123_0, %subreg.sub0, %vreg123_1, %subreg.sub1			%27:vreg_64 = REG_SEQUENCE %vreg123_0, %subreg.sub0, %vreg123_1, %subreg.sub1
	Show All 10 Lines