This is an archive of the discontinued LLVM Phabricator instance.

Differential D25370

Regenerate removed implicit defs in BranchFolder where necessary
Abandoned, Public

Authored by kparzysz on Oct 7 2016, 8:15 AM.

Details

Reviewers

qcolombet
MatzeB

Summary

Branch folder removes implicit defs if they are the only non-branching instructions in a block, and the branches do not use the defined registers. The problem is that in some cases these implicit defs are required for the liveness information to be correct.
After branch optimizations, regenerate those implicit defs that are still necessary.
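
To make the behaviour described in the summary concrete, here is a minimal standalone model of the purge step. It is only a sketch and does not use LLVM's classes: `Instr`, `Block`, and `purgeLeadingImpDefs` are hypothetical names invented for illustration, and registers are plain integers with no subregister structure.

```cpp
#include <cstddef>
#include <set>
#include <vector>

// Hypothetical toy model; none of these types exist in LLVM.
enum class Kind { ImplicitDef, Branch, Other };

struct Instr {
  Kind kind;
  int def;                // register defined, or -1 if none
  std::vector<int> uses;  // registers read by the instruction
};

using Block = std::vector<Instr>;

// Remove the leading implicit defs of a block if everything after them is a
// branch and no branch reads one of the implicitly defined registers.
// Returns the registers whose implicit defs were removed (empty if nothing
// was removed).
std::set<int> purgeLeadingImpDefs(Block &block) {
  std::set<int> defined;
  std::size_t i = 0;
  while (i < block.size() && block[i].kind == Kind::ImplicitDef)
    defined.insert(block[i++].def);
  if (defined.empty())
    return {};

  const std::size_t firstTerm = i;
  for (; i < block.size(); ++i) {
    if (block[i].kind != Kind::Branch)
      return {};  // a non-branch instruction follows: leave the block alone
    for (int reg : block[i].uses)
      if (defined.count(reg))
        return {};  // a branch reads an implicitly defined register
  }

  block.erase(block.begin(),
              block.begin() + static_cast<std::ptrdiff_t>(firstTerm));
  return defined;
}
```

In this model the returned set plays the role of the registers that a later regeneration step would have to consider, because any successor that still expects them to be live no longer sees a definition.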

Diff Detail

Repository: rL LLVM

Event Timeline

kparzysz updated this revision to Diff 73934. Oct 7 2016, 8:15 AM

kparzysz retitled this revision to Regenerate removed implicit defs in BranchFolder where necessary.

kparzysz updated this object.

kparzysz added reviewers: MatzeB, qcolombet.

kparzysz set the repository for this revision to rL LLVM.

kparzysz added a subscriber: llvm-commits.

Hmm BranchFolder::OptimizeImpDefsBlock() has some odd logic indeed, I wonder why nobody found problems with it earlier.

The patch is slightly odd as well. BranchFolder::RegenerateImpDefsInBlock() should be able to restore any missing IMPLICIT_DEFS, so I am not sure why that global set of registers is kept. This also makes me feel uneasy, the approach does not target blocks where IMPLICIT_DEFS were removed (or rather their new predecessors in case they got merged), but instead attempts to fix up any block. (The entry in the ImpDefRegs set may come from a completely different basic block). Nonetheless given that this appears to be broken right now I would not be opposed to the patch when some of the points below are addressed:

lib/CodeGen/BranchFolding.cpp
160	What was wrong with SmallSet? SmallSet is usually the better choice over std::set because it provides inline storage and I would consider a small number of elements likely here. Also std::set is ordered and therefore usually not implemented with more efficient hashing.
167–169	Maybe only put "Reg" into the set, and instead do a TRI.regsOverlap() test in the 2nd loop. This may be slightly more efficient anyway, as I would expect the typical unconditional branch has no inputs, so there is nothing to test. But the real benefit would be to not have all those subregs in the ImpDefRegs set later.
200–202	This should probably not be restricted to physregs. The pass description claims that the pass should also work with virtual regs. (NVPTX and WebAsm are still using virtual registers late in the codegen pipeline).
215	Using stepForward is discouraged. It will result in overly conservative liveness in case of missing kill flags. If possible go for stepBackward(). Proposal for a different strategy:
	- Collect live-out regs, make a copy of that set.
	- Check each instruction backwards:
	  - Check operands:
	    - On reg define: remove reg from the copy set.
	    - On reg kill: check if reg is still in the copy set; if so, append IMPLICIT_DEF at the end.
	  - Simulate liveness on the normal set.
	- When reaching the beginning, compare liveregs with liveins; for missing regs add an IMPLICIT_DEF instruction.
275	SmallSet?
302	All of this can be skipped if ImpDefRegs.empty() or MachineRegisterInfo::tracksLiveness() returns false.
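
The backward-walk strategy proposed for line 215 can be sketched as follows. This is one possible reading of the proposal, not code from the patch or from LLVM: `Instr`, `MissingImpDefs`, and `findMissingImpDefs` are hypothetical names, registers are plain integers without subregisters, and kill flags are assumed to be present and correct.

```cpp
#include <set>
#include <vector>

// Hypothetical toy model; none of these types exist in LLVM.
struct Instr {
  std::vector<int> defs;   // registers written
  std::vector<int> uses;   // registers read
  std::vector<int> kills;  // subset of uses that carry a kill flag
};

struct MissingImpDefs {
  std::set<int> atEnd;    // registers needing an IMPLICIT_DEF at the block end
  std::set<int> atStart;  // registers needing an IMPLICIT_DEF at the block start
};

MissingImpDefs findMissingImpDefs(const std::vector<Instr> &block,
                                  const std::set<int> &liveIns,
                                  const std::set<int> &liveOuts) {
  std::set<int> live = liveOuts;     // the "normal" set: ordinary liveness
  std::set<int> pending = liveOuts;  // the copy: live-outs not yet accounted for
  MissingImpDefs result;

  for (auto it = block.rbegin(); it != block.rend(); ++it) {
    // Defs execute after the uses of the same instruction, so a backward
    // walk handles them first.
    for (int reg : it->defs) {
      pending.erase(reg);
      live.erase(reg);
    }
    // A register killed here but still pending has no later definition even
    // though it is expected to be live out of the block.
    for (int reg : it->kills)
      if (pending.erase(reg) > 0)
        result.atEnd.insert(reg);
    for (int reg : it->uses)
      live.insert(reg);
  }

  // Anything live at the top of the block that is not a live-in is undefined.
  for (int reg : live)
    if (liveIns.count(reg) == 0)
      result.atStart.insert(reg);
  return result;
}
```

For an empty block with live-out register 5 and no live-ins, the sketch reports 5 in `atStart`, which corresponds to the case where the purge removed the only definition of a register that a successor still expects.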

In D25370#565130, @MatzeB wrote:

Hmm BranchFolder::OptimizeImpDefsBlock() has some odd logic indeed, I wonder why nobody found problems with it earlier.

The patch is slightly odd as well. BranchFolder::RegenerateImpDefsInBlock() should be able to restore any missing IMPLICIT_DEFS, so I am not sure why that global set of registers is kept.

It didn't start that way, but in some Thumb testcase it produced an implicit def of LR, which caused some CHECK-NEXT line to fail (implicit defs show up in comments in the final assembly). I thought LR would be reserved on ARM, but it's not (apparently deliberately) and I didn't want to touch anything there. That is really the only motivation for this global set.

This also makes me feel uneasy, the approach does not target blocks where IMPLICIT_DEFS were removed (or rather their new predecessors in case they got merged), but instead attempts to fix up any block.

The reason is that the anticipated optimizations could do just about anything with the rest of the code: merge block, split blocks, duplicate, etc. I thought that keeping track of what happened to what block would be too convoluted and too error-prone.

lib/CodeGen/BranchFolding.cpp
160	SmallSet does not provide iterators: that was the reason I used set.
200–202	I use LivePhysRegs to establish available registers. It does not work with virtual registers.

In D25370#565145, @kparzysz wrote:

In D25370#565130, @MatzeB wrote:

Hmm BranchFolder::OptimizeImpDefsBlock() has some odd logic indeed, I wonder why nobody found problems with it earlier.

The patch is slightly odd as well. BranchFolder::RegenerateImpDefsInBlock() should be able to restore any missing IMPLICIT_DEFS, so I am not sure why that global set of registers is kept.

It didn't start that way, but in some Thumb testcase it produced an implicit def of LR, which caused some CHECK-NEXT line to fail (implicit defs show up in comments in the final assembly). I thought LR would be reserved on ARM, but it's not (apparently deliberately) and I didn't want to touch anything there. That is really the only motivation for this global set.

A register may be non-allocatable but still not reserved, we do track liveness for those registers.
This seems like an example for the reservations I had: Was the pass cleaning up something that was broken before anyway which I maybe should have left alone? Was the IMPLICIT_DEF in that case legit, so we should rather adjust the CHECKs? Having a global set seems like a bad way to restrict this: maybe a legit IMPLICIT_DEF of LR was removed in another block...

This also makes me feel uneasy, the approach does not target blocks where IMPLICIT_DEFS were removed (or rather their new predecessors in case they got merged), but instead attempts to fix up any block.

The reason is that the anticipated optimizations could do just about anything with the rest of the code: merge block, split blocks, duplicate, etc. I thought that keeping track of what happened to what block would be too convoluted and too error-prone.

I understand but it doesn't convince me that the course taken is a good one :)

lib/CodeGen/BranchFolding.cpp
160	Bummer. LLVM also features DenseSet, which I would still consider a better choice than std::set here, where we don't need a stable order.
200–202	I just realized that block live-in lists only contain PhysRegs right now. NVPTX and WebAsm both seem to have liveness tracking disabled past regalloc. So the physreg check can be an `assert()` instead.

There is a better plan: D25478.

I want to abandon this patch. I don't think there is a good way of making this work without some form of liveness analysis.

MatzeB added a comment. Oct 11 2016, 2:30 PM

This comment was removed by MatzeB.

kparzysz abandoned this revision. Oct 12 2016, 1:04 PM

Revision Contents

Path	Size
lib/CodeGen/BranchFolding.h	6 lines
lib/CodeGen/BranchFolding.cpp	70 lines
test/CodeGen/Hexagon/branchfolder-keep-impdef.ll	29 lines

Diff 73934

lib/CodeGen/BranchFolding.h

//===-- BranchFolding.h - Fold machine code branch instructions -*- C++ -*-===//		//===-- BranchFolding.h - Fold machine code branch instructions -*- C++ -*-===//
//		//
// The LLVM Compiler Infrastructure		// The LLVM Compiler Infrastructure
//		//
// This file is distributed under the University of Illinois Open Source		// This file is distributed under the University of Illinois Open Source
// License. See LICENSE.TXT for details.		// License. See LICENSE.TXT for details.
//		//
//===----------------------------------------------------------------------===//		//===----------------------------------------------------------------------===//

#ifndef LLVM_LIB_CODEGEN_BRANCHFOLDING_H		#ifndef LLVM_LIB_CODEGEN_BRANCHFOLDING_H
#define LLVM_LIB_CODEGEN_BRANCHFOLDING_H		#define LLVM_LIB_CODEGEN_BRANCHFOLDING_H

#include "llvm/ADT/SmallPtrSet.h"		#include "llvm/ADT/SmallPtrSet.h"
#include "llvm/CodeGen/LivePhysRegs.h"		#include "llvm/CodeGen/LivePhysRegs.h"
#include "llvm/CodeGen/MachineBasicBlock.h"		#include "llvm/CodeGen/MachineBasicBlock.h"
#include "llvm/Support/BlockFrequency.h"		#include "llvm/Support/BlockFrequency.h"
		#include <set>
#include <vector>		#include <vector>

namespace llvm {		namespace llvm {
class MachineBlockFrequencyInfo;		class MachineBlockFrequencyInfo;
class MachineBranchProbabilityInfo;		class MachineBranchProbabilityInfo;
class MachineFunction;		class MachineFunction;
class MachineModuleInfo;		class MachineModuleInfo;
class MachineLoopInfo;		class MachineLoopInfo;
▲ Show 20 Lines • Show All 126 Lines • ▼ Show 20 Lines	private:
bool CreateCommonTailOnlyBlock(MachineBasicBlock *&PredBB,		bool CreateCommonTailOnlyBlock(MachineBasicBlock *&PredBB,
MachineBasicBlock *SuccBB,		MachineBasicBlock *SuccBB,
unsigned maxCommonTailLength,		unsigned maxCommonTailLength,
unsigned &commonTailIndex);		unsigned &commonTailIndex);

bool OptimizeBranches(MachineFunction &MF);		bool OptimizeBranches(MachineFunction &MF);
bool OptimizeBlock(MachineBasicBlock *MBB);		bool OptimizeBlock(MachineBasicBlock *MBB);
void RemoveDeadBlock(MachineBasicBlock *MBB);		void RemoveDeadBlock(MachineBasicBlock *MBB);
bool OptimizeImpDefsBlock(MachineBasicBlock *MBB);		bool PurgeImpDefsInBlock(MachineBasicBlock *MBB,
		std::set<MCPhysReg> &ImpDefRegs);
		bool RegenerateImpDefsInBlock(MachineBasicBlock *MBB,
		std::set<MCPhysReg> &ImpDefRegs);

bool HoistCommonCode(MachineFunction &MF);		bool HoistCommonCode(MachineFunction &MF);
bool HoistCommonCodeInSuccs(MachineBasicBlock *MBB);		bool HoistCommonCodeInSuccs(MachineBasicBlock *MBB);
};		};
}		}

#endif /* LLVM_CODEGEN_BRANCHFOLDING_HPP */		#endif /* LLVM_CODEGEN_BRANCHFOLDING_HPP */

lib/CodeGen/BranchFolding.cpp

Show All 19 Lines
#include "BranchFolding.h"		#include "BranchFolding.h"
#include "llvm/ADT/STLExtras.h"		#include "llvm/ADT/STLExtras.h"
#include "llvm/ADT/SmallSet.h"		#include "llvm/ADT/SmallSet.h"
#include "llvm/ADT/Statistic.h"		#include "llvm/ADT/Statistic.h"
#include "llvm/CodeGen/Analysis.h"		#include "llvm/CodeGen/Analysis.h"
#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"		#include "llvm/CodeGen/MachineBlockFrequencyInfo.h"
#include "llvm/CodeGen/MachineBranchProbabilityInfo.h"		#include "llvm/CodeGen/MachineBranchProbabilityInfo.h"
#include "llvm/CodeGen/MachineFunctionPass.h"		#include "llvm/CodeGen/MachineFunctionPass.h"
		#include "llvm/CodeGen/MachineInstrBuilder.h"
#include "llvm/CodeGen/MachineJumpTableInfo.h"		#include "llvm/CodeGen/MachineJumpTableInfo.h"
#include "llvm/CodeGen/MachineMemOperand.h"		#include "llvm/CodeGen/MachineMemOperand.h"
#include "llvm/CodeGen/MachineLoopInfo.h"		#include "llvm/CodeGen/MachineLoopInfo.h"
#include "llvm/CodeGen/MachineModuleInfo.h"		#include "llvm/CodeGen/MachineModuleInfo.h"
#include "llvm/CodeGen/MachineRegisterInfo.h"		#include "llvm/CodeGen/MachineRegisterInfo.h"
#include "llvm/CodeGen/Passes.h"		#include "llvm/CodeGen/Passes.h"
#include "llvm/CodeGen/TargetPassConfig.h"		#include "llvm/CodeGen/TargetPassConfig.h"
#include "llvm/IR/Function.h"		#include "llvm/IR/Function.h"
▲ Show 20 Lines • Show All 113 Lines • ▼ Show 20 Lines
/// followed by terminators, and if the implicitly defined registers are not		/// followed by terminators, and if the implicitly defined registers are not
/// used by the terminators, remove those implicit_def's. e.g.		/// used by the terminators, remove those implicit_def's. e.g.
/// BB1:		/// BB1:
/// r0 = implicit_def		/// r0 = implicit_def
/// r1 = implicit_def		/// r1 = implicit_def
/// br		/// br
/// This block can be optimized away later if the implicit instructions are		/// This block can be optimized away later if the implicit instructions are
/// removed.		/// removed.
bool BranchFolder::OptimizeImpDefsBlock(MachineBasicBlock *MBB) {		bool BranchFolder::PurgeImpDefsInBlock(MachineBasicBlock *MBB,
SmallSet<unsigned, 4> ImpDefRegs;		std::set<MCPhysReg> &ImpDefRegs) {
		std::set<unsigned> LocalImpDefRegs;
		MatzeB: What was wrong with SmallSet? SmallSet is usually the better choice over std::set because it provides inline storage and I would consider a small number of elements likely here. Also std::set is ordered and therefore usually not implemented with more efficient hashing.
		kparzysz: SmallSet does not provide iterators: that was the reason I used set.
		MatzeB: Bummer. LLVM also features DenseSet, which I would still consider a better choice than std::set here, where we don't need a stable order.
MachineBasicBlock::iterator I = MBB->begin();		MachineBasicBlock::iterator I = MBB->begin();
while (I != MBB->end()) {		while (I != MBB->end()) {
if (!I->isImplicitDef())		if (!I->isImplicitDef())
break;		break;
unsigned Reg = I->getOperand(0).getReg();		unsigned Reg = I->getOperand(0).getReg();
if (TargetRegisterInfo::isPhysicalRegister(Reg)) {		if (TargetRegisterInfo::isPhysicalRegister(Reg)) {
for (MCSubRegIterator SubRegs(Reg, TRI, /*IncludeSelf=*/true);		for (MCSubRegIterator SubRegs(Reg, TRI, /*IncludeSelf=*/true);
SubRegs.isValid(); ++SubRegs)		SubRegs.isValid(); ++SubRegs)
ImpDefRegs.insert(*SubRegs);		LocalImpDefRegs.insert(*SubRegs);
		MatzeB: Maybe only put "Reg" into the set, and instead do a TRI.regsOverlap() test in the 2nd loop. This may be slightly more efficient anyway, as I would expect the typical unconditional branch has no inputs, so there is nothing to test. But the real benefit would be to not have all those subregs in the ImpDefRegs set later.
} else {		} else {
ImpDefRegs.insert(Reg);		LocalImpDefRegs.insert(Reg);
}		}
++I;		++I;
}		}
if (ImpDefRegs.empty())		if (LocalImpDefRegs.empty())
return false;		return false;

MachineBasicBlock::iterator FirstTerm = I;		MachineBasicBlock::iterator FirstTerm = I;
while (I != MBB->end()) {		while (I != MBB->end()) {
if (!TII->isUnpredicatedTerminator(*I))		if (!TII->isUnpredicatedTerminator(*I))
return false;		return false;
// See if it uses any of the implicitly defined registers.		// See if it uses any of the implicitly defined registers.
for (const MachineOperand &MO : I->operands()) {		for (const MachineOperand &MO : I->operands()) {
if (!MO.isReg() || !MO.isUse())		if (!MO.isReg() || !MO.isUse())
continue;		continue;
unsigned Reg = MO.getReg();		unsigned Reg = MO.getReg();
if (ImpDefRegs.count(Reg))		if (LocalImpDefRegs.count(Reg))
return false;		return false;
}		}
++I;		++I;
}		}

I = MBB->begin();		I = MBB->begin();
while (I != FirstTerm) {		while (I != FirstTerm) {
MachineInstr *ImpDefMI = &*I;		MachineInstr *ImpDefMI = &*I;
++I;		++I;
MBB->erase(ImpDefMI);		MBB->erase(ImpDefMI);
}		}

		for (unsigned Reg : LocalImpDefRegs)
		if (TargetRegisterInfo::isPhysicalRegister(Reg))
		ImpDefRegs.insert(Reg);
		MatzeB: This should probably not be restricted to physregs. The pass description claims that the pass should also work with virtual regs. (NVPTX and WebAsm are still using virtual registers late in the codegen pipeline).
		kparzysz: I use LivePhysRegs to establish available registers. It does not work with virtual registers.
		MatzeB: I just realized that block live-in lists only contain PhysRegs right now. NVPTX and WebAsm both seem to have liveness tracking disabled past regalloc. So the physreg check can be an `assert()` instead.

		return true;
		}

		bool BranchFolder::RegenerateImpDefsInBlock(MachineBasicBlock *MBB,
		std::set<MCPhysReg> &ImpDefRegs) {
		SmallVector<std::pair<unsigned, const MachineOperand*>,8> Tmp;
		MachineRegisterInfo &MRI = MBB->getParent()->getRegInfo();

		LiveRegs.init(TRI);
		LiveRegs.addLiveIns(*MBB);
		for (MachineInstr &MI : *MBB)
		LiveRegs.stepForward(MI, Tmp);
		MatzeB: Using stepForward is discouraged. It will result in overly conservative liveness in case of missing kill flags. If possible go for stepBackward(). Proposal for a different strategy: collect live-out regs and make a copy of that set; check each instruction backwards, and for its operands: on reg define, remove reg from the copy set; on reg kill, check if reg is still in the copy set, and if so append IMPLICIT_DEF at the end; simulate liveness on the normal set; when reaching the beginning, compare liveregs with liveins, and for missing regs add an IMPLICIT_DEF instruction.

		LivePhysRegs LiveOut(TRI);
		LiveOut.addLiveOutsNoPristines(*MBB);

		auto HasSuperInSet = [this] (unsigned Reg, bool Self, LivePhysRegs &LPR) {
		for (MCSuperRegIterator S(Reg, TRI, Self); S.isValid(); ++S)
		if (LPR.contains(*S))
return true;		return true;
		return false;
		};

		// LivePhysRegs always adds/removes all subregisters of a given register.
		// Create a set of "largest" registers, i.e. registers that have no super
		// register also in the set.
		std::set<MCPhysReg> NeedLiveOut;
		for (MCPhysReg Reg : LiveOut) {
		if (HasSuperInSet(Reg, false, LiveOut))
		continue;
		if (MRI.isReserved(Reg) || !ImpDefRegs.count(Reg))
		continue;
		NeedLiveOut.insert(Reg);
		}

		bool Changed = false;
		MachineBasicBlock::iterator Term = MBB->getFirstTerminator();
		DebugLoc DL = MBB->findDebugLoc(Term);
		for (MCPhysReg Reg : NeedLiveOut) {
		if (HasSuperInSet(Reg, true, LiveRegs))
		continue;
		BuildMI(*MBB, Term, DL, TII->get(TargetOpcode::IMPLICIT_DEF), Reg);
		Changed = true;
		}

		return Changed;
}		}

/// OptimizeFunction - Perhaps branch folding, tail merging and other		/// OptimizeFunction - Perhaps branch folding, tail merging and other
/// CFG optimizations on the given function. Block placement changes the layout		/// CFG optimizations on the given function. Block placement changes the layout
/// and may create new tail merging opportunities.		/// and may create new tail merging opportunities.
bool BranchFolder::OptimizeFunction(MachineFunction &MF,		bool BranchFolder::OptimizeFunction(MachineFunction &MF,
const TargetInstrInfo *tii,		const TargetInstrInfo *tii,
const TargetRegisterInfo *tri,		const TargetRegisterInfo *tri,
Show All 9 Lines	bool BranchFolder::OptimizeFunction(MachineFunction &MF,
MMI = mmi;		MMI = mmi;
MLI = mli;		MLI = mli;

MachineRegisterInfo &MRI = MF.getRegInfo();		MachineRegisterInfo &MRI = MF.getRegInfo();
UpdateLiveIns = MRI.tracksLiveness() && TRI->trackLivenessAfterRegAlloc(MF);		UpdateLiveIns = MRI.tracksLiveness() && TRI->trackLivenessAfterRegAlloc(MF);
if (!UpdateLiveIns)		if (!UpdateLiveIns)
MRI.invalidateLiveness();		MRI.invalidateLiveness();

		std::set<MCPhysReg> ImpDefRegs;
		MatzeB: SmallSet?

// Fix CFG. The later algorithms expect it to be right.		// Fix CFG. The later algorithms expect it to be right.
bool MadeChange = false;		bool MadeChange = false;
for (MachineBasicBlock &MBB : MF) {		for (MachineBasicBlock &MBB : MF) {
MachineBasicBlock *TBB = nullptr, *FBB = nullptr;		MachineBasicBlock *TBB = nullptr, *FBB = nullptr;
SmallVector<MachineOperand, 4> Cond;		SmallVector<MachineOperand, 4> Cond;
if (!TII->analyzeBranch(MBB, TBB, FBB, Cond, true))		if (!TII->analyzeBranch(MBB, TBB, FBB, Cond, true))
MadeChange |= MBB.CorrectExtraCFGEdges(TBB, FBB, !Cond.empty());		MadeChange |= MBB.CorrectExtraCFGEdges(TBB, FBB, !Cond.empty());
MadeChange |= OptimizeImpDefsBlock(&MBB);		MadeChange |= PurgeImpDefsInBlock(&MBB, ImpDefRegs);
}		}

// Recalculate funclet membership.		// Recalculate funclet membership.
FuncletMembership = getFuncletMembership(MF);		FuncletMembership = getFuncletMembership(MF);

bool MadeChangeThisIteration = true;		bool MadeChangeThisIteration = true;
while (MadeChangeThisIteration) {		while (MadeChangeThisIteration) {
MadeChangeThisIteration = TailMergeBlocks(MF);		MadeChangeThisIteration = TailMergeBlocks(MF);
// No need to clean up if tail merging does not change anything after the		// No need to clean up if tail merging does not change anything after the
// block placement.		// block placement.
if (!AfterBlockPlacement || MadeChangeThisIteration)		if (!AfterBlockPlacement || MadeChangeThisIteration)
MadeChangeThisIteration |= OptimizeBranches(MF);		MadeChangeThisIteration |= OptimizeBranches(MF);
if (EnableHoistCommonCode)		if (EnableHoistCommonCode)
MadeChangeThisIteration |= HoistCommonCode(MF);		MadeChangeThisIteration |= HoistCommonCode(MF);
MadeChange |= MadeChangeThisIteration;		MadeChange |= MadeChangeThisIteration;
}		}

		for (MachineBasicBlock &MBB : MF)
		MatzeB: All of this can be skipped if ImpDefRegs.empty() or MachineRegisterInfo::tracksLiveness() returns false.
		MadeChange |= RegenerateImpDefsInBlock(&MBB, ImpDefRegs);

// See if any jump tables have become dead as the code generator		// See if any jump tables have become dead as the code generator
// did its thing.		// did its thing.
MachineJumpTableInfo *JTI = MF.getJumpTableInfo();		MachineJumpTableInfo *JTI = MF.getJumpTableInfo();
if (!JTI)		if (!JTI)
return MadeChange;		return MadeChange;

// Walk the function to find jump tables that are live.		// Walk the function to find jump tables that are live.
BitVector JTIsLive(JTI->getJumpTables().size());		BitVector JTIsLive(JTI->getJumpTables().size());
▲ Show 20 Lines • Show All 1,745 Lines • Show Last 20 Lines

test/CodeGen/Hexagon/branchfolder-keep-impdef.ll

This file was added.

				; RUN: llc -march=hexagon -verify-machineinstrs < %s | FileCheck %s
				;
				; Check that the testcase compiles successfully. Expect that if-conversion
				; took place.
				; CHECK-LABEL: fred:
				; CHECK: if (!p0) r1 = memw(r0 + #0)

				target triple = "hexagon"

				define void @fred(i32 %p0) local_unnamed_addr align 2 {
				b0:
				br i1 undef, label %b1, label %b2

				b1: ; preds = %b0
				%t0 = load i8*, i8** undef, align 4
				br label %b2

				b2: ; preds = %b1, %b0
				%t1 = phi i8* [ %t0, %b1 ], [ undef, %b0 ]
				%t2 = getelementptr inbounds i8, i8* %t1, i32 %p0
				tail call void @llvm.memmove.p0i8.p0i8.i32(i8* undef, i8* %t2, i32 undef, i32 1, i1 false) #1
				unreachable
				}

				declare void @llvm.memmove.p0i8.p0i8.i32(i8* nocapture, i8* nocapture readonly, i32, i32, i1) #0

				attributes #0 = { argmemonly nounwind }
				attributes #1 = { nounwind }